BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 048503
         (374 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  311 bits (798), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 187/413 (45%), Positives = 246/413 (59%), Gaps = 53/413 (12%)

Query: 2   QNSQKLPFYNDNETPKSPI-------------------SIIY----QAEIISVDDIYLMH 38
           ++S K PFYN  ETP   I                   S I+    Q+E+IS    YLM 
Sbjct: 36  RDSPKSPFYNPRETPTQRIVSAVRRSMSRVHHFSPTKNSDIFTDTAQSEMISNQGEYLMK 95

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
            S+GTP  DI    DTGSD  WTQC+PC +  C++Q+ PLFDPK SSTY  ISCS+ QC 
Sbjct: 96  FSLGTPAFDILAIADTGSDLIWTQCKPCDQ--CYEQDAPLFDPKSSSTYRDISCSTKQCD 153

Query: 99  VVTSNCS---EGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           ++    S   EG+  C YS+ YG     SF+SGN+A +T+T  STSG PV +P  I GCG
Sbjct: 154 LLKEGASCSGEGNKTCHYSYSYGD---RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCG 210

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG-- 206
           H N  S T        +G GP   SLISQ+G++I GKFSYCL         SSK+NFG  
Sbjct: 211 HNNGGSFTEKGSGIVGLGGGP--ISLISQLGSTIDGKFSYCLVPLSSNATNSSKLNFGSN 268

Query: 207 GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTL 259
           GIV+G GV STPLI +D    Y+L+LEA+SVG++R++F  SS     GNI +D+G   TL
Sbjct: 269 GIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTSEGNIIIDSGTTLTL 328

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLS 319
            P ++ S L S + + +   PV+    +P     LCY+I +  KFP +T HF GADVKL+
Sbjct: 329 FPEDFFSELSSAVQDAVAGTPVE----DPSGILSLCYSIDADLKFPSITAHFDGADVKLN 384

Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           P N F  +SD ++C AF   N+  ++G + Q+NFL+GYD+E   VSFKP+ CT
Sbjct: 385 PLNTFVQVSDTVLCFAFNPINSGAIFGNLAQMNFLVGYDLEGKTVSFKPTDCT 437


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  298 bits (762), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 173/407 (42%), Positives = 241/407 (59%), Gaps = 53/407 (13%)

Query: 8   PFYNDNETP--------KSPISIIYQAEIISVDDI---------------YLMHLSIGTP 44
           PFYN  ET         +  IS ++  + I+   +               YLM LS+GTP
Sbjct: 45  PFYNSEETDLQRINNALRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTP 104

Query: 45  PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSN 103
           P  I G  DTGSD  WTQC+PC    C+KQ  PLFDPK S TY   SC + QC+++  S 
Sbjct: 105 PFKIMGIADTGSDLIWTQCKPCER--CYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQST 162

Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
           CS   C Y + YG  +Y   + GN+A++T+T +ST+G PV  P  + GCGH+N    T  
Sbjct: 163 CSGNICQYQYSYGDRSY---TMGNVASDTITLDSTTGSPVSFPKTVIGCGHEN--DGTFS 217

Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAGVVS 216
            K +GI+GLG G  SLISQMG+S+ GKFSYCL         SSK+NFG   +V+G GV S
Sbjct: 218 DKGSGIVGLGAGPLSLISQMGSSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQS 277

Query: 217 TPLI----IRDHYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHSNL 268
           TPL+    +   Y+L+LEA+SVGN+R++F  SS     GNI +D+G   T++P ++ SNL
Sbjct: 278 TPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNL 337

Query: 269 KSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI 327
            + + N ++ +  +    +P GF  V CY+ +S  K P +T HF GADVKL P N F  +
Sbjct: 338 STAVGNQVEGRRAE----DPSGFLSV-CYSATSDLKVPAITAHFTGADVKLKPINTFVQV 392

Query: 328 SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           SD+++C AF    + I +YG + Q+NFL+ Y+I+   +SFKP+ CT 
Sbjct: 393 SDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCTK 439


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  294 bits (752), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 182/411 (44%), Positives = 246/411 (59%), Gaps = 52/411 (12%)

Query: 2   QNSQKLPFYNDNETPKS--------PISIIYQAEIISVDDI---------------YLMH 38
           ++S K PFYN  ET            +S ++    IS  D                YLM+
Sbjct: 38  RDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTSNSGEYLMN 97

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           +S+GTPP  I    DTGSD  WTQC+PC   DC+ Q  PLFDPK SSTY  +SCSSSQC 
Sbjct: 98  ISLGTPPFPIMAIADTGSDLLWTQCKPCD--DCYTQVDPLFDPKASSTYKDVSCSSSQCT 155

Query: 99  VV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            +   ++CS  D  CSYS  YG  +Y   + GN+A +TLT  ST   PV++ N+I GCGH
Sbjct: 156 ALENQASCSTEDNTCSYSTSYGDRSY---TKGNIAVDTLTLGSTDTRPVQLKNIIIGCGH 212

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--G 207
            N    T + K +GI+GLG G  SLI+Q+G SI GKFSYCL      +  +SKINFG   
Sbjct: 213 NNAG--TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFGTNA 270

Query: 208 IVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLL 260
           +V+G GVVSTPLI +     YYL+L++ISVG++ +++  S +G    NI +D+G   TLL
Sbjct: 271 VVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTLTLL 330

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSP 320
           P E++S L+  +++ I A+  K    + G S  LCY+ +   K P +T+HF GADV L P
Sbjct: 331 PTEFYSELEDAVASSIDAE--KKQDPQTGLS--LCYSATGDLKVPAITMHFDGADVNLKP 386

Query: 321 SNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           SN F  IS++++C AFRG  +  +YG + Q+NFL+GYD     VSFKP+ C
Sbjct: 387 SNCFVQISEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  291 bits (745), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 177/383 (46%), Positives = 246/383 (64%), Gaps = 33/383 (8%)

Query: 7   LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
           L F ND+ +P SP     Q+ I S    YLM++SIGTPPV I    DTGSD  WTQC PC
Sbjct: 63  LQFSNDDASPNSP-----QSFITSNRGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC 117

Query: 67  PELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCS--EGDCSYSFLYGRGAYASF 123
              DC++Q  PLFDPK+SSTY  +SCSSSQC A+  ++CS  E  CSY+  YG  +Y   
Sbjct: 118 E--DCYQQTSPLFDPKESSTYRKVSCSSSQCRALEDASCSTDENTCSYTITYGDNSY--- 172

Query: 124 SSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
           + G++A +T+T  S+   PV + N+I GCGH+N    T D   +GIIGLG G++SL+SQ+
Sbjct: 173 TKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENTG--TFDPAGSGIIGLGGGSTSLVSQL 230

Query: 184 GTSIAGKFSYCL----PDQG-SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAIS 233
             SI GKFSYCL     + G +SKINFG  GIV+G GVVST ++ +D   +Y+L+LEAIS
Sbjct: 231 RKSINGKFSYCLVPFTSETGLTSKINFGTNGIVSGDGVVSTSMVKKDPATYYFLNLEAIS 290

Query: 234 VGNQRLEFVSS----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
           VG+++++F S+      GNI +D+G   TLLP  ++  L+SV+++ IKA+ V+    +P 
Sbjct: 291 VGSKKIQFTSTIFGTGEGNIVIDSGTTLTLLPSNFYYELESVVASTIKAERVQ----DPD 346

Query: 290 FSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIM 349
               LCY  SS  K P++T+HF+G DVKL   N F  +S+++ C AF       ++G + 
Sbjct: 347 GILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNTFVAVSEDVSCFAFAANEQLTIFGNLA 406

Query: 350 QINFLIGYDIEQAMVSFKPSRCT 372
           Q+NFL+GYD     VSFK + C+
Sbjct: 407 QMNFLVGYDTVSGTVSFKKTDCS 429


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  285 bits (728), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 172/367 (46%), Positives = 233/367 (63%), Gaps = 32/367 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q ++ S    YLM++SIGTPP  I    DTGSD  WTQC PC   DC+ Q  PLFDPK S
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD--DCYTQVDPLFDPKTS 137

Query: 85  STYNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           STY  +SCSSSQC  +   ++CS  D  CSYS  YG  +Y   + GN+A +TLT  S+  
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY---TKGNIAVDTLTLGSSDT 194

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
            P+++ N+I GCGH N    T + K +GI+GLG G  SLI Q+G SI GKFSYCL     
Sbjct: 195 RPMQLKNIIIGCGHNNAG--TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252

Query: 196 -PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG- 247
             DQ +SKINFG   IV+G+GVVSTPLI +      YYL+L++ISVG++++++  S +  
Sbjct: 253 KKDQ-TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSES 311

Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
              NI +D+G   TLLP E++S L+  +++ I A+  K    + G S  LCY+ +   K 
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE--KKQDPQSGLS--LCYSATGDLKV 367

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           P +T+HF GADVKL  SN F  +S++++C AFRG  +  +YG + Q+NFL+GYD     V
Sbjct: 368 PVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTV 427

Query: 365 SFKPSRC 371
           SFKP+ C
Sbjct: 428 SFKPTDC 434


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  285 bits (728), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 156/353 (44%), Positives = 222/353 (62%), Gaps = 27/353 (7%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM  S+GTPP +++G VDTGSD  W QC+PC +  C+KQ  P+F+P KSS+Y +I CSS
Sbjct: 87  YLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPCEQ--CYKQTTPIFNPSKSSSYKNIPCSS 144

Query: 95  SQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C  V  TS   +  C Y+  +   +Y   S G L+ ETLT +ST+G  V  P  + GC
Sbjct: 145 NLCQSVRYTSCNKQNSCEYTINFSDQSY---SQGELSVETLTLDSTTGHSVSFPKTVIGC 201

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG- 206
           GH N      ++  +GI+GLG G  SL +Q+ +SI GKFSYCL         +SK+NFG 
Sbjct: 202 GHNNRGMFQGET--SGIVGLGIGPVSLTTQLKSSIGGKFSYCLLPLLVDSNKTSKLNFGD 259

Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV---SSSTGNIFVDTGVLRTL 259
             +V+G GVVSTP + +D    YYL+LEA SVGN+R+EF     S  GNI +D+G   TL
Sbjct: 260 AAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDDSEEGNIILDSGTTLTL 319

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEVTIHFRGADVKL 318
           LP   ++NL+S ++ ++K   V     +P     LCY+I+S Q  FP +T HF+GAD+KL
Sbjct: 320 LPSHVYTNLESAVAQLVKLDRVD----DPNQLLNLCYSITSDQYDFPIITAHFKGADIKL 375

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +P + F +++D ++C AF       ++G + Q+N L+GYD++Q +VSFKPS C
Sbjct: 376 NPISTFAHVADGVVCLAFTSSQTGPIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 172/367 (46%), Positives = 233/367 (63%), Gaps = 32/367 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q ++ S    YLM++SIGTPP  I    DTGSD  WTQC PC   DC+ Q  PLFDPK S
Sbjct: 80  QIDLTSNSGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCD--DCYTQVDPLFDPKTS 137

Query: 85  STYNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           STY  +SCSSSQC  +   ++CS  D  CSYS  YG  +Y   + GN+A +TLT  S+  
Sbjct: 138 STYKDVSCSSSQCTALENQASCSTNDNTCSYSLSYGDNSY---TKGNIAVDTLTLGSSDT 194

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
            P+++ N+I GCGH N    T + K +GI+GLG G  SLI Q+G SI GKFSYCL     
Sbjct: 195 RPMQLKNIIIGCGHNNAG--TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTS 252

Query: 196 -PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG- 247
             DQ +SKINFG   IV+G+GVVSTPLI +      YYL+L++ISVG++++++  S +  
Sbjct: 253 KKDQ-TSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSES 311

Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
              NI +D+G   TLLP E++S L+  +++ I A+  K    + G S  LCY+ +   K 
Sbjct: 312 SEGNIIIDSGTTLTLLPTEFYSELEDAVASSIDAE--KKQDPQSGLS--LCYSATGDLKV 367

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           P +T+HF GADVKL  SN F  +S++++C AFRG  +  +YG + Q+NFL+GYD     V
Sbjct: 368 PVITMHFDGADVKLDSSNAFVQVSEDLVCFAFRGSPSFSIYGNVAQMNFLVGYDTVSKTV 427

Query: 365 SFKPSRC 371
           SFKP+ C
Sbjct: 428 SFKPTDC 434


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  279 bits (714), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 167/370 (45%), Positives = 227/370 (61%), Gaps = 36/370 (9%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           ++EII+    YLM LS+GTPP +I    DTGSD  WTQC PC +  C+KQ  PLFDPK S
Sbjct: 83  ESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLIWTQCTPCDK--CYKQIAPLFDPKSS 140

Query: 85  STYNSISCSSSQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
            TY  +SC + QC  +  +S+CS E  C YS+ YG     SF++GNLA +T+T  ST+G 
Sbjct: 141 KTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYGD---RSFTNGNLAVDTVTLPSTNGG 197

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------ 195
           PV  P  + GCG +N  + T D K +GIIGLG G  SLISQMG+S+ GKFSYCL      
Sbjct: 198 PVYFPKTVIGCGRRN--NGTFDKKDSGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSE 255

Query: 196 PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG--- 247
               SSK++FG   +V+G+GV STPLI ++    YYL+LEA+SVG++++EF  SS G   
Sbjct: 256 SAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIEFGGSSFGGSE 315

Query: 248 -NIFVDTGVLRTLLPLEYHSNLKSVMSNMI----KAQPVKGVGAEPGFSDVLCYNISSQP 302
            NI +D+G   TL P+ + +   + + N +    + Q   G+ +        CY  +   
Sbjct: 316 GNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSH-------CYRPTPDL 368

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
           K P +T HF GADV L   N F  ISD+++C AF    +  ++G + Q+NFLIGYDI+  
Sbjct: 369 KVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNSTQSGAIFGNVAQMNFLIGYDIQGK 428

Query: 363 MVSFKPSRCT 372
            VSFKP+ CT
Sbjct: 429 SVSFKPTDCT 438


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  274 bits (701), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 161/369 (43%), Positives = 219/369 (59%), Gaps = 30/369 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ I+     YLM+L IGTPPV +   VDTGSD TWTQC PC    C+KQ  PLFDPK S
Sbjct: 82  QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPLFDPKNS 139

Query: 85  STYNSISCSSSQCAVVTSN--CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           STY   SC +S C  +  +  CS E  C++ + Y  G   SF+ GNLA+ETLT +ST+G 
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADG---SFTGGNLASETLTVDSTAGK 196

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
           PV  P   FGCGH   +    D   +GI+GLG G  SLISQ+ ++I G FSYCL      
Sbjct: 197 PVSFPGFAFGCGHS--SGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254

Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSST----- 246
              SS+INFG  G V+G G VSTPL+ +     YYL+LE ISVG +RL +   S      
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVE 314

Query: 247 -GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
            GNI VD+G   T LP E++S L+  ++N IK + V+    +P     LCYN +++   P
Sbjct: 315 EGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVR----DPNGIFSLCYNTTAEINAP 370

Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
            +T HF+ A+V+L P N F  + ++++C      +   V G + Q+NFL+G+D+ +  VS
Sbjct: 371 IITAHFKDANVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDLRKKRVS 430

Query: 366 FKPSRCTNY 374
           FK + CT +
Sbjct: 431 FKAADCTQH 439


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  273 bits (697), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 158/358 (44%), Positives = 224/358 (62%), Gaps = 30/358 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM  S+GTPP  I+G  DTGSD  W QCEPC +  C+ Q  P+F+P KSS+Y +I CSS
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQ--CYNQTTPIFNPSKSSSYKNIPCSS 144

Query: 95  SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C +V  ++CS+ + C Y   YG    +S S G+L+ +TL+  STSG PV  P ++ GC
Sbjct: 145 KLCHSVRDTSCSDQNSCQYKISYGD---SSHSQGDLSVDTLSLESTSGSPVSFPKIVIGC 201

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSSKINFG 206
           G  N    T     +GI+GLG G  SLI+Q+G+SI GKFSYCL          SS ++FG
Sbjct: 202 GTDNAG--TFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259

Query: 207 --GIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLR 257
              +V+G GVVSTPLI +D   Y+L+L+A SVGN+R+EF  SS      GNI +D+G   
Sbjct: 260 DAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTL 319

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVTIHFRGADV 316
           TL+P + ++NL+S + +++K   V     +P     LCY++ S++  FP +T+HF+GADV
Sbjct: 320 TLIPSDVYTNLESAVVDLVKLDRVD----DPNQQFSLCYSLKSNEYDFPIITVHFKGADV 375

Query: 317 KLSPSNLFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +L   + F  I+D I+C AF+       ++G + Q N L+GYD++Q  VSFKP+ CT 
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCTK 433


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  271 bits (692), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 156/382 (40%), Positives = 228/382 (59%), Gaps = 33/382 (8%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           N + K+ ++   ++ +IS +  Y+M  S+GTPP+  +G VDTGSD  W QCEPC +  C+
Sbjct: 65  NHSNKNSLASTPESTVISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEPCEQ--CY 122

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
            Q  P F+P KSS+Y +ISCSS  C  V  TS   + +C YS  YG  ++   S G+L+ 
Sbjct: 123 NQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSH---SQGDLSL 179

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
           ETLT  ST+G PV  P  + GCG  N+ S    S     +G GP  +SLI+Q+G SI GK
Sbjct: 180 ETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGP--ASLITQLGPSIGGK 237

Query: 191 FSYCLP---------DQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGN 236
           FSYCL            GSSK+NFG   IV+G  V+STP++ +DH   YYL++EA SVG+
Sbjct: 238 FSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSVGD 297

Query: 237 QRLEFVSSSTG----NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
           +R+EF  SS G    NI +D+  + T +P + ++ L S + +++  + V     +P    
Sbjct: 298 KRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVD----DPNQQF 353

Query: 293 VLCYNISSQPK--FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQ 350
            LCYN+SS  +  FP +T HF+GAD+ L  +N F  ++ +++C AF   N   ++G   Q
Sbjct: 354 SLCYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAFAPSNGGAIFGSFSQ 413

Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
            +F++GYD++Q  VSFK   CT
Sbjct: 414 QDFMVGYDLQQKTVSFKSVDCT 435


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  270 bits (691), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 153/381 (40%), Positives = 232/381 (60%), Gaps = 32/381 (8%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           N   K  ++ I Q+ +I     YLM  S+GTPP  ++G VDTGSD  W QCEPC E  C+
Sbjct: 65  NHFYKYSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPCQE--CY 122

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLAT 130
            Q  P+F+P KSS+Y +I C S  C ++  ++C++ + C YS  YG  ++   S G+L+ 
Sbjct: 123 NQTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSH---SGGDLSV 179

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
           +TLT  ST+GL V  PN++ GCG  N+ S   +   +GI+G G G +S I+Q+G+S  GK
Sbjct: 180 DTLTLESTNGLTVSFPNIVIGCGTNNILS--YEGASSGIVGFGSGPASFITQLGSSTGGK 237

Query: 191 FSYCLP---------DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGN 236
           FSYCL             +SK+NFG    V+G GVV+TP++ +D    YYL+LEA SVGN
Sbjct: 238 FSYCLTPLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGN 297

Query: 237 QRLEF----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
           +R+E        + GNI +D+G   T L  + +S L+S + +++K + V     +P  + 
Sbjct: 298 RRVEIGGVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVD----DPTQTL 353

Query: 293 VLCYNISSQP-KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI 351
            LCY++ ++   FP +T+HF+GADV L P + F +++D + C AF     + ++G + Q 
Sbjct: 354 NLCYSVKAEGYDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHAIFGNLAQQ 413

Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
           N ++GYD++Q +VSFKPS CT
Sbjct: 414 NLMVGYDLQQKIVSFKPSDCT 434


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 168/406 (41%), Positives = 237/406 (58%), Gaps = 47/406 (11%)

Query: 2   QNSQKLPFYNDNETPKSPI-SIIY----------------------QAEIISVDDIYLMH 38
           ++S K PFYN  ETP   I + I+                      Q +I      YLM+
Sbjct: 38  RDSPKSPFYNPAETPSQRIRNAIHRSFNRVSHFTDLSEMDASLNSPQTDITPCGGEYLMN 97

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           LS+GTPP  I    DTGS+  WTQC+PC   DC+ Q  PLFDPK SSTY  +SCSSSQC 
Sbjct: 98  LSLGTPPSPIMAVADTGSNLIWTQCKPCD--DCYTQVDPLFDPKASSTYKDVSCSSSQCT 155

Query: 99  VV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            +   ++CS  D  CSY   Y  G+Y   + G  A +TLT  ST   PV++ N+I GCG 
Sbjct: 156 ALENQASCSTEDKTCSYLVSYADGSY---TMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQ 212

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFG--GIVA 210
            N    T  +K +G++GLG G  SLI Q+G SI GKFSYCL   +  +SKINFG   +V+
Sbjct: 213 NNAV--TFRNKSSGVVGLGGGAVSLIKQLGDSIDGKFSYCLVPENDQTSKINFGTNAVVS 270

Query: 211 GAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHS 266
           G G VSTPL+++     YYL+L++ISVG++ ++   S+  GN+ +D+G   TLLP++Y+ 
Sbjct: 271 GPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNIKGNMVIDSGTTLTLLPVKYYI 330

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
            +++ ++++I A   K    +      LCYN ++    P +T+HF GADVKL P N F  
Sbjct: 331 EIENAVASLINADKSK----DERIGSSLCYNATADLNIPVITMHFEGADVKLYPYNSFFK 386

Query: 327 ISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           ++++++C AF      N +YG + Q NFL+GYD     +SFKP+ C
Sbjct: 387 VTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDC 432


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  267 bits (682), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 156/358 (43%), Positives = 221/358 (61%), Gaps = 30/358 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM  S+GTPP  I+G  DTGSD  W QCEPC +  C+ Q  P+F+P KSS+Y +I C S
Sbjct: 87  YLMTYSVGTPPTKIYGIADTGSDIVWLQCEPCEQ--CYNQTTPIFNPSKSSSYKNIPCLS 144

Query: 95  SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C +V  ++CS+ + C Y   YG    +S S G+L+ +TL+  STSG PV  P  + GC
Sbjct: 145 KLCHSVRDTSCSDQNSCQYKISYGD---SSHSQGDLSVDTLSLESTSGSPVSFPKTVIGC 201

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSSKINFG 206
           G  N    T     +GI+GLG G  SLI+Q+G+SI GKFSYCL          SS ++FG
Sbjct: 202 GTDNAG--TFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVPLLNKESNASSILSFG 259

Query: 207 --GIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLR 257
              +V+G GVVSTPLI +D   Y+L+L+A SVGN+R+EF  SS      GNI +D+G   
Sbjct: 260 DAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEGGDDEGNIIIDSGTTL 319

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVTIHFRGADV 316
           TL+P + ++NL+S + +++K   V     +P     LCY++ S++  FP +T HF+GAD+
Sbjct: 320 TLIPSDVYTNLESAVVDLVKLDRVD----DPNQQFSLCYSLKSNEYDFPIITAHFKGADI 375

Query: 317 KLSPSNLFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +L   + F  I+D I+C AF+       ++G + Q N L+GYD++Q  VSFKP+ CT 
Sbjct: 376 ELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCTK 433


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 156/391 (39%), Positives = 224/391 (57%), Gaps = 52/391 (13%)

Query: 3   NSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQ 62
           N+ +      N   K+ ++   Q+ +I     YLM  S+GTPP  ++G  DTGSD  W Q
Sbjct: 55  NAARRSINRANHFYKTALTNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQ 114

Query: 63  CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYAS 122
           CEPC E  C+ Q  P F P KSSTY +I CSS  C                        S
Sbjct: 115 CEPCKE--CYNQTTPKFKPSKSSTYKNIPCSSDLC-----------------------KS 149

Query: 123 FSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
              GNL+ +TLT  S++G P+  P  + GCG  N  S   +   +GI+GLG G +SLI+Q
Sbjct: 150 GQQGNLSVDTLTLESSTGHPISFPKTVIGCGTDNTVS--FEGASSGIVGLGGGPASLITQ 207

Query: 183 MGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAI 232
           +G+SI  KFSYCL         +SK+NFG   +V+G GVVSTP++ +D    YYL+LEA 
Sbjct: 208 LGSSIDAKFSYCLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAF 267

Query: 233 SVGNQRLEFVSSST----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
           SVGN+R+EF  SS     GNI +D+G   T++P + ++NL+S +  ++K + V     +P
Sbjct: 268 SVGNKRIEFEGSSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVN----DP 323

Query: 289 GFSDVLCYNISSQP-KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI---- 343
                LCY+++S    FP +T HF+GADVKL P + F +++D I+C AF   +A I    
Sbjct: 324 TRLFNLCYSVTSDGYDFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDV 383

Query: 344 --VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             ++G + Q N L+GYD++Q +VSFKP+ C+
Sbjct: 384 VSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 155/369 (42%), Positives = 218/369 (59%), Gaps = 32/369 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ ++     Y+M+LSIGTPPV +   VDTGSD TWTQC PC    C+KQ  P FDPK S
Sbjct: 82  QSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPFFDPKNS 139

Query: 85  STYNSISCSSSQCAVVTSN--CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           STY   SC +S C  + ++  C  G  C++ + Y  G   SF+ GNLA ETLT  ST+G 
Sbjct: 140 STYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADG---SFTGGNLAVETLTVASTAGK 196

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
           PV  P   FGC H+  +    D   +GI+GLG    S+ISQ+ ++I G+FSYCL      
Sbjct: 197 PVSFPGFAFGCVHR--SGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSYCLLPVFTD 254

Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIR--DHYY--LSLEAISVGNQRLEFVSSST---- 246
              SS+INFG  GIV+GAG VSTPL+++  D YY  ++LE  SVG +RL +   S     
Sbjct: 255 SSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEV 314

Query: 247 --GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPK 303
             GNI VD+G   T LPLE++  L+  +++ IK + V+    +P     LCYN +  Q  
Sbjct: 315 EEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVR----DPNGISSLCYNTTVDQID 370

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
            P +T HF+ A+V+L P N F  + ++++C      +   + G + Q+NFL+G+D+ +  
Sbjct: 371 APIITAHFKDANVELQPWNTFLRMQEDLVCFTVLPTSDIGILGNLAQVNFLVGFDLRKKR 430

Query: 364 VSFKPSRCT 372
           VSFK + CT
Sbjct: 431 VSFKAADCT 439


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  262 bits (669), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 151/359 (42%), Positives = 211/359 (58%), Gaps = 33/359 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM LS+GTPP  I    DTGSD  WTQCEPC   +C++Q+ P+F+P KS+TY  +SCSS
Sbjct: 85  YLMKLSVGTPPFPIIAVADTGSDIIWTQCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSS 142

Query: 95  SQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C+     ++CS + DC+YS  YG  ++   S G+ A +TLT  STSG  V  P    G
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSH---SQGDFAVDTLTMGSTSGRVVAFPRTAIG 199

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG 206
           CGH N  S   D+  +GI+GLG G +SLI QMG+++ GKFSYCL      D GS+K+NFG
Sbjct: 200 CGHDNAGS--FDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFG 257

Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
               V+G+G VSTP+ I D     Y L L+A+SVG     + ++++      NI +D+G 
Sbjct: 258 SNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGT 317

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRGA 314
             TLLP++ + N    +SN I  Q       +P      C+  ++   K P + +HF GA
Sbjct: 318 TLTLLPVDLYHNFAKAISNSINLQRTD----DPNQFLEYCFETTTDDYKVPFIAMHFEGA 373

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +++L   N+   +SD ++C AF G   N   +YG I QINFL+GYD+    +SFKP  C
Sbjct: 374 NLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 150/359 (41%), Positives = 210/359 (58%), Gaps = 33/359 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM LS+GTPP  I    DTGSD  WTQC PC   +C++Q+ P+F+P KS+TY  +SCSS
Sbjct: 85  YLMKLSVGTPPFPIIAVADTGSDIIWTQCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSS 142

Query: 95  SQCAVV--TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C+     ++CS + DC+YS  YG  ++   S G+ A +TLT  STSG  V  P    G
Sbjct: 143 PVCSFTGEDNSCSFKPDCTYSISYGDNSH---SQGDFAVDTLTMGSTSGRVVAFPRTAIG 199

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG 206
           CGH N  S   D+  +GI+GLG G +SLI QMG+++ GKFSYCL      D GS+K+NFG
Sbjct: 200 CGHDNAGS--FDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLTPIGNDDGGSNKLNFG 257

Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
               V+G+G VSTP+ I D     Y L L+A+SVG     + ++++      NI +D+G 
Sbjct: 258 SNANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVGRNNTFYSTANSILGGKANIIIDSGT 317

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRGA 314
             TLLP++ + N    +SN I  Q       +P      C+  ++   K P + +HF GA
Sbjct: 318 TLTLLPVDLYHNFAKAISNSINLQRTD----DPNQFLEYCFETTTDDYKVPFIAMHFEGA 373

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +++L   N+   +SD ++C AF G   N   +YG I QINFL+GYD+    +SFKP  C
Sbjct: 374 NLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 155/372 (41%), Positives = 227/372 (61%), Gaps = 34/372 (9%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +IS +  YLM++S+GTPPV + G  DTGSD  W QC+PC    C++Q  P+FDP KS
Sbjct: 85  QSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPCDS--CYEQIEPIFDPAKS 142

Query: 85  STYNSISCSSSQCAVV--TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
            TY  +SC    C+ +     CS+ + C YS+ YG G++   +SG+LA +TLT  ST+G 
Sbjct: 143 KTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSH---TSGDLAVDTLTIGSTTGR 199

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
           PV +P V+FGCGH N    T +   +G++GLG G  S+ISQ+   I G+FSYCL   G  
Sbjct: 200 PVSVPKVVFGCGHNN--GGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYCLVPLGND 257

Query: 200 ---SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEF---------- 241
              SSK++FG  GIV+GAG VSTPL  R     YYL+LE++SVG+++L +          
Sbjct: 258 PSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPL 317

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
             +  GNI +D+G   TLLP +++  L+S + + I  +PV+    +P     LCY+  S 
Sbjct: 318 ADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVR----DPNNVFSLCYSNLSG 373

Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
            + P +T HF GAD++L P N F  + +++ C A    +   ++G + Q+NFL+GYD++ 
Sbjct: 374 LRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSDLAIFGNLAQMNFLVGYDLKS 433

Query: 362 AMVSFKPSRCTN 373
             VSFKP+ CT 
Sbjct: 434 RTVSFKPTDCTK 445


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 152/368 (41%), Positives = 217/368 (58%), Gaps = 31/368 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A I   D  YL+  S+G PP  ++G +DTGSD  W QC+PC +  C+ Q   +FDP KS
Sbjct: 76  KATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEK--CYNQTTRIFDPSKS 133

Query: 85  STYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           +TY  +  SS+ C +V  ++CS  +   C Y+  YG G+Y   S G+L+ ETLT  ST+G
Sbjct: 134 NTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSY---SQGDLSVETLTLGSTNG 190

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM---GTSIAGKFSYCLPD 197
             V+    + GCG  N  S   + K +GI+GLG G  SLI+Q+    +SI  KFSYCL  
Sbjct: 191 SSVKFRRTVIGCGRNNTVS--FEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLAS 248

Query: 198 QG--SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----- 245
               SSK+NFG   +V+G G VSTP++  D    YYL+LEA SVGN R+EF SSS     
Sbjct: 249 MSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRFGE 308

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKF 304
            GNI +D+G   TLLP + +S L+S ++++++   VK    +P     LCY  +  +   
Sbjct: 309 KGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVK----DPLKQLSLCYRSTFDELNA 364

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           P +  HF GADVKL+  N F  +   + C AF       ++G + Q NFL+GYD+++ +V
Sbjct: 365 PVIMAHFSGADVKLNAVNTFIEVEQGVTCLAFISSKIGPIFGNMAQQNFLVGYDLQKKIV 424

Query: 365 SFKPSRCT 372
           SFKP+ C+
Sbjct: 425 SFKPTDCS 432


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  256 bits (654), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 167/414 (40%), Positives = 230/414 (55%), Gaps = 55/414 (13%)

Query: 3   NSQKLPFYNDNETPKSPISIIYQAEI-----------ISVDDI------------YLMHL 39
           +S + PFYN  ET    IS +    I           +S +D+            Y+M  
Sbjct: 35  DSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHNDLPKPTIIPYAGSYYVMSY 94

Query: 40  SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV 99
           SIGTPP  ++G VDTGSD  W QC+PC    C  Q  P+F+P KSSTY +I CSS  C  
Sbjct: 95  SIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFNPSKSSTYKNIRCSSPICKR 152

Query: 100 -VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
              + CS   +  C Y   Y      S S G+++ +TLT NS  G P+  P ++ GCGHK
Sbjct: 153 GEKTRCSSNRKRKCEYEITY---LDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGHK 209

Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----QGSSKINFG--GI 208
           N  S T++   +GIIG G GN S++SQ+G+SI GKFSYCL         SSK+ FG   +
Sbjct: 210 N--SLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSKANISSKLYFGDMAV 267

Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSS-----TGNIFVDTGVLRTLL 260
           V+G GVVSTPLI      +Y+ +LEA SVG+  ++   SS      GN  +D+G   T L
Sbjct: 268 VSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNEGNAVIDSGSTITQL 327

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGADVKLS 319
           P + +S L++ + +M+K + VK    +P     LCY  +  + + P +T HFRGADVKL+
Sbjct: 328 PNDVYSQLETAVISMVKLKRVK----DPTQQLSLCYKTTLKKYEVPIITAHFRGADVKLN 383

Query: 320 PSNLFRNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             N F  ++ E+MC AF       +VYG I Q NFL+GYD  + ++SFKP+ CT
Sbjct: 384 AFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFKPTNCT 437


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 160/392 (40%), Positives = 226/392 (57%), Gaps = 28/392 (7%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           + ++  LP     E      ++  Q+ I +    YLM LSIGTPP  I+G  DTGSD TW
Sbjct: 38  SSHAHVLPLRRLMELSAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTW 97

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRG 118
           T C PC   +C+KQ  P+FDP+KS+TY +ISC S  C  + T  CS +  C+Y++ Y   
Sbjct: 98  TSCVPCN--NCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAY--- 152

Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
           A A+ + G LA ET+T +ST G  V +  ++FGCGH N      +  + GIIGLG G  S
Sbjct: 153 ASAAITRGVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGG--FNDHEMGIIGLGGGPVS 210

Query: 179 LISQMGTSIAGK-FSYCL-PDQG----SSKINF--GGIVAGAGVVSTPLIIRDH---YYL 227
           LISQMG+S  GK FS CL P       SSK++F  G  V+G GVVSTPL+ +     Y++
Sbjct: 211 LISQMGSSFGGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFV 270

Query: 228 SLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           +L  ISV N  L F  SS     GN+F+D+G   T+LP + +  + + + + +  +PV  
Sbjct: 271 TLLGISVENTYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTD 330

Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN- 342
              +P     LCY   +  + P +T HF GADVKLSP+  F +  D + C  F   +++ 
Sbjct: 331 ---DPDLGPQLCYRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDG 387

Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
            VYG   Q N+LIG+D+++ +VSFKP  CT +
Sbjct: 388 GVYGNFAQSNYLIGFDLDRQVVSFKPKDCTKH 419


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 158/408 (38%), Positives = 234/408 (57%), Gaps = 47/408 (11%)

Query: 2   QNSQKLPFYNDNETPKSPISIIY----------------QAEIISVDDIYLMHLSIGTPP 45
           ++S K P YN +ETP   +   +                +  + S +  YLM +SIGTPP
Sbjct: 42  RDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPP 101

Query: 46  VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNC 104
            D++G  DTGSD  WTQC PC  L C+KQ+ P+FDP KS+++  +SC S QC ++ T +C
Sbjct: 102 FDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSC 159

Query: 105 SEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
           S+    C +S+ YG G   S + G +ATETLT NS SG P  + N++FGCGH N  S T 
Sbjct: 160 SQPQKLCDFSYGYGDG---SLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNN--SGTF 214

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAG--KFSYCL-PDQG----SSKINFG--GIVAGAG 213
           +  + G+ G G    SL SQ+ +++    KFS CL P +     +SKI FG    V+G+ 
Sbjct: 215 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSD 274

Query: 214 VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
           VVSTPL+ +D   +Y+++L+ ISVG++   F SSS     GN+F+D G   TLLP ++++
Sbjct: 275 VVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYN 334

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
            L   +   I  +PV+    +P     LCY  ++    P +T HF GADV+L P N F +
Sbjct: 335 RLVQGVKEAIPMEPVQ----DPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFIS 390

Query: 327 ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             + + C A +  + +  ++G  +Q+NFLIG+D++   VSFK   CT 
Sbjct: 391 PKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 158/408 (38%), Positives = 234/408 (57%), Gaps = 47/408 (11%)

Query: 2   QNSQKLPFYNDNETPKSPISIIY----------------QAEIISVDDIYLMHLSIGTPP 45
           ++S K P YN +ETP   +   +                +  + S +  YLM +SIGTPP
Sbjct: 42  RDSPKSPLYNPSETPAERLDRFFRRFMSFSEASISPNTPEPPVSSNNGEYLMKISIGTPP 101

Query: 46  VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNC 104
            D++G  DTGSD  WTQC PC  L C+KQ+ P+FDP KS+++  +SC S QC ++ T +C
Sbjct: 102 FDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSC 159

Query: 105 SEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
           S+    C +S+ YG G   S + G +ATETLT NS SG P  + N++FGCGH N  S T 
Sbjct: 160 SQPQKLCDFSYGYGDG---SLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNN--SGTF 214

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAG--KFSYCL-PDQG----SSKINFG--GIVAGAG 213
           +  + G+ G G    SL SQ+ +++    KFS CL P +     +SKI FG    V+G+ 
Sbjct: 215 NENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSX 274

Query: 214 VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
           VVSTPL+ +D   +Y+++L+ ISVG++   F SSS     GN+F+D G   TLLP ++++
Sbjct: 275 VVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYN 334

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRN 326
            L   +   I  +PV+    +P     LCY  ++    P +T HF GADV+L P N F +
Sbjct: 335 RLVQGVKEAIPMEPVQ----DPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLNTFIS 390

Query: 327 ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             + + C A +  + +  ++G  +Q+NFLIG+D++   VSFK   CT 
Sbjct: 391 PKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTK 438


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 151/374 (40%), Positives = 212/374 (56%), Gaps = 32/374 (8%)

Query: 15  TPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ 74
           +P SP     +  +IS    YL+  S+GTP + +FG +DTGSD  W QC+PC +  C++Q
Sbjct: 74  SPNSP-----ETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPCKK--CYEQ 126

Query: 75  EPPLFDPKKSSTYNSISCSSSQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
             P+FD  KS TY ++ C S+ C  V  T   S   C YS  Y  G   S S G+L+ ET
Sbjct: 127 TTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDG---SQSLGDLSVET 183

Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           LT  ST+G PV+ P  + GCG  N      + K +GI+GLG G  SLI+Q+  S  GKFS
Sbjct: 184 LTLGSTNGSPVQFPGTVIGCGRYNAIG--IEEKNSGIVGLGRGPMSLITQLSPSTGGKFS 241

Query: 193 YCLP---DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS 244
           YCL       SSK+NFG   +V+G G VSTPL  ++    Y+L+LEA SVG  R+EF S 
Sbjct: 242 YCLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSP 301

Query: 245 STG---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
            +G   NI +D+G   T LP   +S L++ ++  +  Q V+    +P     LCY ++  
Sbjct: 302 GSGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVR----DPNQVLGLCYKVTPD 357

Query: 302 ---PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
                 P +T HF GADV L+  N F  ++D+++C AF+      V+G + Q N L+GYD
Sbjct: 358 KLDASVPVITAHFSGADVTLNAINTFVQVADDVVCFAFQPTETGAVFGNLAQQNLLVGYD 417

Query: 359 IEQAMVSFKPSRCT 372
           ++   VSFK + CT
Sbjct: 418 LQMNTVSFKHTDCT 431


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 159/363 (43%), Positives = 222/363 (61%), Gaps = 36/363 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM  S+GTPP +I G VDTGS  TW QC+ C   DC++Q  P+FDP KS TY ++ CSS
Sbjct: 97  YLMSYSVGTPPFEILGVVDTGSGITWMQCQRCE--DCYEQTTPIFDPSKSKTYKTLPCSS 154

Query: 95  SQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + C  V  T +CS     C Y+  YG G++   S G+L+ ETLT  ST+G  V+ PN + 
Sbjct: 155 NMCQSVISTPSCSSDKIGCKYTIKYGDGSH---SQGDLSVETLTLGSTNGSSVQFPNTVI 211

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINF 205
           GCGH N    T   + +G++GLG G  SLISQ+ +SI GKFSYCL         SSK+NF
Sbjct: 212 GCGHNNKG--TFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNF 269

Query: 206 G--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSSTGN--------IFV 251
           G   +V+G G VSTPL+ +      YYL+LEA SVG++R+EFV  S+ +        I +
Sbjct: 270 GDAAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIII 329

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTI 309
           D+G   TLLP E +SNL+S +++ I+A  V    ++P     LCY    S Q   P +T 
Sbjct: 330 DSGTTLTLLPQEDYSNLESAVADAIQANRV----SDPSNFLSLCYQTTPSGQLDVPVITA 385

Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
           HF+GADV+L+P + F  +++ ++C AF       ++G + Q+N L+GYD+ +  VSFKP+
Sbjct: 386 HFKGADVELNPISTFVQVAEGVVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPT 445

Query: 370 RCT 372
            CT
Sbjct: 446 DCT 448


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 159/409 (38%), Positives = 225/409 (55%), Gaps = 50/409 (12%)

Query: 2   QNSQKLPFYNDNETPKSPI----------------SIIYQAEIISVDDIYLMHLSIGTPP 45
           ++S K P YN +ET    I                S   +A I +    YL+ +S+GTPP
Sbjct: 34  RDSPKSPMYNSSETHFDRIVNALRRSSHRNTVVLESDTAEAPIFNNGGEYLVEISVGTPP 93

Query: 46  VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--TSN 103
             I    DTGSD  WTQC+PC   +C++Q  P+FDP KS+TY +++CSS  C+     S+
Sbjct: 94  FSIVAVADTGSDVIWTQCKPCS--NCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSS 151

Query: 104 CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
           CS + +C YS  YG  ++   S GNLA +T+T  STSG PV  P  + GCGH N    T 
Sbjct: 152 CSDDSECLYSIAYGDDSH---SQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAG--TF 206

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------SSKINFG--GIVAGAGV 214
           ++  +GI+GLG G +SL++Q+G +  GKFSYCL   G      S+K+NFG    V+G+G 
Sbjct: 207 NANVSGIVGLGRGPASLVTQLGPATGGKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGT 266

Query: 215 VSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST-----GNIFVDTGVLRTLLPLEYH 265
           VSTP+      +  Y L LEA+SVG+ +  F   ++      NI +D+G   T LP    
Sbjct: 267 VSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKLGGESNIIIDSGTTLTYLPSALL 326

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPG-FSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF 324
           ++  S +S  +     +    +P  F D      +   + P VT+HF GADV L   NLF
Sbjct: 327 NSFGSAISQSMSLPHAQ----DPSEFLDYCFATTTDDYEMPPVTMHFEGADVPLQRENLF 382

Query: 325 RNISDEIMCSAFRG-GNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             +SD+ +C AF    + NI +YG I Q NFL+GYDI+   VSF+P+ C
Sbjct: 383 VRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 156/415 (37%), Positives = 233/415 (56%), Gaps = 54/415 (13%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIY--------------------QAEIISVDDIYLMHLS 40
           +++S   PFYN +ET    +   +                    Q+++IS    YLM++S
Sbjct: 40  SRDSPHSPFYNPSETKYQRLQKAFRRSILRGNHFRAMRASPNDIQSDVISGGGAYLMNIS 99

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
           +GTPPV + G  DTGSD  W QC PCP  +C++Q  PLFDPK+S TY ++ C +  C  +
Sbjct: 100 LGTPPVPMLGIADTGSDLIWRQCLPCP--NCYEQVEPLFDPKESETYKTLDCDNEFCQDL 157

Query: 101 TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
               S  D   C+YS+ YG  +Y   + G+L+++TLT  ST G P   P + FGCGH N 
Sbjct: 158 GQQGSCDDDNTCTYSYSYGDRSY---TRGDLSSDTLTIGSTEGDPASFPGIAFGCGHDN- 213

Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVA 210
              T + K  G+IGLG G  SL+ Q+ + + G+FSYCL         SSKINFG  G+V+
Sbjct: 214 -GGTFNEKDGGLIGLGGGPLSLVMQLSSEVGGQFSYCLVPLSSDSTVSSKINFGKSGVVS 272

Query: 211 GAGVVSTPLII---RDHYYLSLEAISVGNQRLEF----------VSSSTGNIFVDTGVLR 257
           G+G VSTPLI       YYL+LE +SVG++ + F           +   GNI +D+G   
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIIDSGTTL 332

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           TLLP +++++++S ++N I  Q       +P     LCY+  +  + P +T HF GADV+
Sbjct: 333 TLLPQDFYTDVESALTNAIGGQTT----TDPNGIFSLCYSSVNNLEIPTITAHFTGADVQ 388

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L P N F  + ++++C +    +   ++G + QINFL+GYD++   VSFK + CT
Sbjct: 389 LPPLNTFVQVQEDLVCFSMIPSSNLAIFGNLAQINFLVGYDLKNNKVSFKQTDCT 443


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 148/356 (41%), Positives = 211/356 (59%), Gaps = 32/356 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+  S+GTPP  ++G +DTGS+  W QC+PC    CF Q  P+F+P KSS+Y +I C+S
Sbjct: 89  YLISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNT--CFNQTSPIFNPSKSSSYKNIPCTS 146

Query: 95  SQCAVVTS---NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           S C        +CS G   C YS  YG  A    S G+L+ ++LT +STSG  V  PN++
Sbjct: 147 STCKDTNDTHISCSNGGDVCEYSITYGGDAK---SQGDLSNDSLTLDSTSGSSVLFPNIV 203

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCL-----PDQGSSKI 203
            GCGH N+     +S+ +G++G+G G  SLI Q+G+S  G KFSYCL         SSK+
Sbjct: 204 IGCGHINVLQ--DNSQSSGVVGMGRGPMSLIKQVGSSSVGSKFSYCLIPYNSDSNSSSKL 261

Query: 204 NFGG--IVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTG 254
            FG   +V+G  VVSTP++      ++Y+L+LEA SVGN R+E+    ++ST NI +D+G
Sbjct: 262 IFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAFSVGNNRIEYGERSNASTQNILIDSG 321

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG 313
              T+LP  + S L S ++  +K   ++     P     LCYN +  Q   P++T HF G
Sbjct: 322 TPLTMLPNLFLSKLVSYVAQEVKLPRIE----PPDHHLSLCYNTTGKQLNVPDITAHFNG 377

Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
           ADVKL+ +  F    D IMC  F   N   ++G I Q N LI YD+E+ ++SFKP+
Sbjct: 378 ADVKLNSNGTFFPFEDGIMCFGFISSNGLEIFGNIAQNNLLIDYDLEKEIISFKPT 433


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 150/369 (40%), Positives = 215/369 (58%), Gaps = 29/369 (7%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ I +    YLM +SIGTPP  I+G  DTGSD TWT C PC +  C+KQ  P+FDP+KS
Sbjct: 15  QSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNK--CYKQRNPIFDPQKS 72

Query: 85  STYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           ++Y +ISC S  C  + T  CS +  C+Y++ Y   A A+ + G LA ET+T +ST G  
Sbjct: 73  TSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAY---ASAAITQGVLAQETITLSSTKGES 129

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPD---- 197
           V +  ++FGCGH N      + ++ GIIGLG G  S ISQ+G+S  GK FS CL      
Sbjct: 130 VPLKGIVFGCGHNNTGG--FNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVPFHTD 187

Query: 198 -QGSSKINF--GGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSST----- 246
              SSK++   G  V+G GVVSTPL+ +     Y+++L  ISVGN  L F  SS+     
Sbjct: 188 VSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVEK 247

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
           GN+F+D+G   T+LP + +  L + + + +  +PV     +      LCY   +  + P 
Sbjct: 248 GNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTN---DLDLGPQLCYRTKNNLRGPV 304

Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVS 365
           +T HF G DVKL P+  F +  D + C  F   +++  VYG   Q N+LIG+D+++ +VS
Sbjct: 305 LTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVS 364

Query: 366 FKPSRCTNY 374
           FKP  CT +
Sbjct: 365 FKPMDCTKH 373


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 154/416 (37%), Positives = 235/416 (56%), Gaps = 54/416 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEII----------SVDDI----------YLMHLS 40
           +++S + PFYN +ET    +   ++  I+          S +DI          YLM++S
Sbjct: 40  SRDSPRSPFYNPSETKYQRLQKAFRRSILRGNHFRAIRASPNDIQSNVISGGGSYLMNIS 99

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
           +GTPPV + G  DTGSD  W QC PC   DC+KQ  PLFDPKKS TY ++ C++  C  +
Sbjct: 100 LGTPPVSMLGIADTGSDLIWRQCLPCD--DCYKQVEPLFDPKKSKTYKTLGCNNDFCQDL 157

Query: 101 TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
               S GD   C+ S+ YG  +Y   +  +L++ET T  ST G P   P + FGCGH N 
Sbjct: 158 GQQGSCGDDNTCTSSYSYGDQSY---TRRDLSSETFTIGSTEGDPASFPGLAFGCGHSN- 213

Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVA 210
              T + K +G+IGLG G  SL+ Q+ + + G+FSYCL         SSKINFG   +V+
Sbjct: 214 -GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTASSKINFGKSAVVS 272

Query: 211 GAGVVSTPLII---RDHYYLSLEAISVGNQRLEF----------VSSSTGNIFVDTGVLR 257
           G+G VSTPLI       YYL+LE +S+G++++ F           ++   NI +D+G   
Sbjct: 273 GSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIIDSGTTL 332

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           TLLP +++++++S ++ +I  Q       +P  +  LCY+   + + P +T HF GADV+
Sbjct: 333 TLLPRDFYTDMESALTKVIGGQTT----TDPRGTFSLCYSGVKKLEIPTITAHFIGADVQ 388

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           L P N F    ++++C +    +   ++G + Q+NFL+GYD++   VSFKP+ CT 
Sbjct: 389 LPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTK 444


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  243 bits (619), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 161/412 (39%), Positives = 225/412 (54%), Gaps = 51/412 (12%)

Query: 2   QNSQKLPFYNDNETPKSPISII----------YQAEIISVDDI----------YLMHLSI 41
           ++S + P Y   ETP   ++            ++   +S D            YLM  S+
Sbjct: 38  RDSSRSPLYRPTETPFQRVANAVRRSINRGNHFKKAFVSTDSAESTVVASQGEYLMRYSV 97

Query: 42  GTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT 101
           G+PP  + G VDTGSD  W QCEPC   DC+KQ  P+FDP KS TY ++ CSS+ C  + 
Sbjct: 98  GSPPFQVLGIVDTGSDILWLQCEPCE--DCYKQTTPIFDPSKSKTYKTLPCSSNTCESLR 155

Query: 102 SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLAS 159
           +     D  C YS  YG G++   S G+L+ ETLT  ST G  V  P  + GCGH N  +
Sbjct: 156 NTACSSDNVCEYSIDYGDGSH---SDGDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGT 212

Query: 160 PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-----DQGSSKINFG--GIVAGA 212
              +      +G GP   SLISQ+ +SI GKFSYCL         SSK+NFG   +V+G 
Sbjct: 213 FQEEGSGIVGLGGGP--VSLISQLSSSIGGKFSYCLAPIFSESNSSSKLNFGDAAVVSGR 270

Query: 213 GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPL 262
           G VSTPL   +    Y+L+LEA SVG+ R+EF  SS+       GNI +D+G   TLLP 
Sbjct: 271 GTVSTPLDPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQ 330

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEVTIHFRGADVKLSPS 321
           E + NL+S +S++IK +  +    +P     LCY  +S +   P +T HF+GADV+L+P 
Sbjct: 331 EDYLNLESAVSDVIKLERAR----DPSKLLSLCYKTTSDELDLPVITAHFKGADVELNPI 386

Query: 322 NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           + F  +   ++C AF       ++G + Q N L+GYD+ +  VSFKP+ CT 
Sbjct: 387 STFVPVEKGVVCFAFISSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDCTK 438


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 151/379 (39%), Positives = 217/379 (57%), Gaps = 40/379 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +I  D  + M ++IGTPP+ +F   DTGSD TW QC+PC +  C+K+  P+FD KKS
Sbjct: 75  QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQ--CYKENGPIFDKKKS 132

Query: 85  STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           STY S  C S  C  ++S    C E +  C Y + YG     SFS G++ATET++ +S S
Sbjct: 133 STYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGD---QSFSKGDVATETVSIDSAS 189

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ- 198
           G PV  P  +FGCG+ N    T D   +GIIGLG G+ SLISQ+G+SI+ KFSYCL  + 
Sbjct: 190 GSPVSFPGTVFGCGYNN--GGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247

Query: 199 ----GSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS 245
               G+S IN G       +   +GVVSTPL+ ++   +YYL+LEAISVG +++ +  SS
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSS 307

Query: 246 ------------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
                       +GNI +D+G   TLL   +     S +   +     K V    G    
Sbjct: 308 YNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTG--AKRVSDPQGLLSH 365

Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
              + S++   PE+T+HF GADV+LSP N F  +S++++C +        +YG   Q++F
Sbjct: 366 CFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMVPTTEVAIYGNFAQMDF 425

Query: 354 LIGYDIEQAMVSFKPSRCT 372
           L+GYD+E   VSF+   C+
Sbjct: 426 LVGYDLETRTVSFQHMDCS 444


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 150/379 (39%), Positives = 217/379 (57%), Gaps = 40/379 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +I  D  + M ++IGTPP+ +F   DTGSD TW QC+PC +  C+K+  P+FD KKS
Sbjct: 75  QSGLIGADGEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPCQQ--CYKENGPIFDKKKS 132

Query: 85  STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           STY S  C S  C  ++S+   C E    C Y + YG     SFS G++ATET++ +S S
Sbjct: 133 STYKSEPCDSRNCHALSSSERGCDESKNVCKYRYSYGD---QSFSKGDVATETISIDSAS 189

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ- 198
           G PV  P  +FGCG+ N    T D   +GIIGLG G+ SLISQ+G+SI+ KFSYCL  + 
Sbjct: 190 GSPVSFPGTVFGCGYNN--GGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKS 247

Query: 199 ----GSSKINFG------GIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSS 245
               G+S IN G       +   +GV+STPL+    R +YYL+LEAISVG +++ +  SS
Sbjct: 248 ATTNGTSVINLGTNSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSS 307

Query: 246 ------------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
                       +GNI +D+G   TLL   +     + +  ++     K V    G    
Sbjct: 308 YNPNDGGIFSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTG--AKRVSDPQGLLSH 365

Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
              + S++   PE+T+HF GADV+LSP N F  +S++++C +        +YG   Q++F
Sbjct: 366 CFKSGSAEIGLPEITVHFTGADVRLSPINAFVKVSEDMVCLSMVPTTEVAIYGNFAQMDF 425

Query: 354 LIGYDIEQAMVSFKPSRCT 372
           L+GYD+E   VSF+   C+
Sbjct: 426 LVGYDLETRTVSFQRMDCS 444


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  241 bits (614), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 148/364 (40%), Positives = 206/364 (56%), Gaps = 43/364 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ + S    YLM  SIGTPP  +FG VDTGSD  W QCEPC +  C+ Q  P+FDP  S
Sbjct: 78  QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPCKQ--CYPQITPIFDPSLS 135

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+Y +I C S  C ++ T++C                     G L+ ETLT +ST+G  V
Sbjct: 136 SSYQNIPCLSDTCHSMRTTSCD------------------VRGYLSVETLTLDSTTGYSV 177

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC----LPDQG 199
             P  + GCG++N    T     +GI+GLG G  SL SQ+GTSI GKFSYC    LP+  
Sbjct: 178 SFPKTMIGCGYRNTG--TFHGPSSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPNS- 234

Query: 200 SSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEF----VSSSTGNIF 250
           +SK+NFG   IV G G ++TP++ +D    YYL+LEA SVGN+ +EF       + GNI 
Sbjct: 235 TSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYGGNEGNIL 294

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTI 309
           +D+G   T LP + +   +S ++  I  + V+    +P  +  LCYN++    + P +T 
Sbjct: 295 IDSGTTFTFLPYDVYYRFESAVAEYINLEHVE----DPNGTFKLCYNVAYHGFEAPLITA 350

Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
           HF+GAD+KL   + F  +SD I C AF      I +G + Q N L+GY++ Q  V+FKP 
Sbjct: 351 HFKGADIKLYYISTFIKVSDGIACLAFIPSQTAI-FGNVAQQNLLVGYNLVQNTVTFKPV 409

Query: 370 RCTN 373
            CT 
Sbjct: 410 DCTK 413


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 154/393 (39%), Positives = 217/393 (55%), Gaps = 38/393 (9%)

Query: 4   SQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQC 63
           S KL   N +     P +I  Q+ + + D  YLM LSIGTPP+ I+   DTGSD  W QC
Sbjct: 31  SVKLIRRNSSHDSYKPSTI--QSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQC 88

Query: 64  EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAY 120
            PC +  C+KQ+ P+FDP+ SS+Y +I+C +  C  + S+    D   C+Y++ Y   A 
Sbjct: 89  IPCTK--CYKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSY---AD 143

Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
            S + G LA ETLT  ST+G PV    +IFGCGH N      + ++ G+IGLG G  SLI
Sbjct: 144 NSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNSG---FNDREMGLIGLGRGPLSLI 200

Query: 181 SQMGTSIAG---KFSYCL-----PDQGSSKINF--GGIVAGAGVVSTPLIIRD--HYYLS 228
           SQ+G+S+      FS CL         +S++NF  G  V G G VSTPLI +D   Y+ +
Sbjct: 201 SQIGSSLGAGGNMFSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFAT 260

Query: 229 LEAISVGNQRLEFVSSST------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
           L  ISV +  L F + S+      GNI +D+G   T LP E++  L   + N +  +P +
Sbjct: 261 LLGISVEDINLPFSNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR 320

Query: 283 GVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSA-FRGGNA 341
             G E      LCY   +    P +TIHF G DV L+P+ +F  + D+  C A F     
Sbjct: 321 IDGYE------LCYQTPTNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEE 374

Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
            + YG   Q N+LIG+D+E+ +VSFK + CT +
Sbjct: 375 YVTYGNYAQSNYLIGFDLERQVVSFKATDCTKF 407


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  240 bits (612), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 146/360 (40%), Positives = 210/360 (58%), Gaps = 31/360 (8%)

Query: 33  DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           D Y++   IGTPP  ++G +DT +D  W QC PC    CF    P+FDP KSSTY +I C
Sbjct: 87  DGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPCKP--CFNTTSPMFDPSKSSTYKTIPC 144

Query: 93  SSSQCA-VVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           SS +C  V  ++CS  D   C YSF YG  AY   S G+L+ +TLT NS +  P+   N+
Sbjct: 145 SSPKCKNVENTHCSSDDKKVCEYSFTYGGEAY---SQGDLSIDTLTLNSNNDTPISFKNI 201

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----PDQG-SSKI 203
           + GCGH+N      +   +G IGLG G  S ISQ+ +SI GKFSYCL     ++G S K+
Sbjct: 202 VIGCGHRNKGP--LEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLVPLFSNEGISGKL 259

Query: 204 NFG--GIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
           +FG   +V+G G VSTP+   +  Y  +L A+SVG+  ++F +S++     GN  +D+G 
Sbjct: 260 HFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHIIKFENSTSKNDNLGNTIIDSGT 319

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGA 314
             T+LP   +S L+S++++M+K +  K     P     LCY  +      P +T HF GA
Sbjct: 320 TLTILPENVYSRLESIVTSMVKLERAK----SPNQQFKLCYKATLKNLDVPIITAHFNGA 375

Query: 315 DVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           DV L+  N F  I  E++C AF   G     + G I Q NFL+G+D+++ ++SFKP+ CT
Sbjct: 376 DVHLNSLNTFYPIDHEVVCFAFVSVGNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 145/347 (41%), Positives = 200/347 (57%), Gaps = 33/347 (9%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ I+     YLM+L IGTPPV +   VDTGSD TWTQC PC    C+KQ  PLFDPK S
Sbjct: 82  QSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTH--CYKQVVPLFDPKNS 139

Query: 85  STYNSISCSSSQCAVVTSN--CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           STY   SC +S C  +  +  CS E  C++ + Y  G   SF+ GNLA+ETLT +ST+G 
Sbjct: 140 STYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADG---SFTGGNLASETLTVDSTAGK 196

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
           PV  P   FGCGH   +    D   +GI+GLG G  SLISQ+ ++I G FSYCL      
Sbjct: 197 PVSFPGFAFGCGHS--SGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSYCLLPVSTD 254

Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
              SS+INFG  G V+G G VSTPL +    Y         +++ E      GNI VD+G
Sbjct: 255 SSISSRINFGASGRVSGYGTVSTPLRLPYKGY---------SKKTEV---EEGNIIVDSG 302

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
              T LP E++S L+  ++N IK + V+    +P     LCYN +++   P +T HF+ A
Sbjct: 303 TTYTFLPQEFYSKLEKSVANSIKGKRVR----DPNGIFSLCYNTTAEINAPIITAHFKDA 358

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
           +V+L P N F  + ++++C      +   V G + Q+NFL+G+D+ +
Sbjct: 359 NVELQPLNTFMRMQEDLVCFTVAPTSDIGVLGNLAQVNFLVGFDLRK 405



 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 45/127 (35%), Positives = 74/127 (58%), Gaps = 5/127 (3%)

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFP 305
           GNI VD+G   T LPLE++  L+  +++ IK + V+    +P     LCYN +  Q   P
Sbjct: 418 GNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVR----DPNGISSLCYNTTVDQIDAP 473

Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
            +T HF+ A+V+L P N F  + ++++C      +   + G + Q+NFL+G+D+ +  VS
Sbjct: 474 IITAHFKDANVELQPWNTFLRMQEDLVCFTVLPTSDIGILGNLAQVNFLVGFDLRKKRVS 533

Query: 366 FKPSRCT 372
           FK + CT
Sbjct: 534 FKAADCT 540


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  238 bits (608), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 157/395 (39%), Positives = 228/395 (57%), Gaps = 43/395 (10%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           +NS ++ F  +  T ++P+S+ +          YLM LSIGTPPV  +  VDTGSD  W 
Sbjct: 36  RNSSQVLF--NRITAQTPVSVHHYD--------YLMELSIGTPPVKTYAQVDTGSDLIWL 85

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCS--EGDCSYSFLYGRG 118
           QC PC   +C+KQ  P+FDP+ SSTY++I+  S  C+ + +++CS  + +C+Y++ Y   
Sbjct: 86  QCIPC--TNCYKQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDD 143

Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
              S + G LA ETLT  ST+G PV +  VIFGCGH N  +   + K+ GIIGLG G  S
Sbjct: 144 ---SITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNN--NGVFNDKEMGIIGLGRGPLS 198

Query: 179 LISQMGTSIAGK-FSYCL-PDQG----SSKINF--GGIVAGAGVVSTPLIIRD----HYY 226
           L+SQ+G+S  GK FS CL P       +S ++F  G  V G GVVSTPL+ ++     Y+
Sbjct: 199 LVSQIGSSFGGKMFSQCLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYF 258

Query: 227 LSLEAISVGNQRLEFVSSST------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           ++L  ISV +  L F   S+      GN+ +D+G   TLLP +++  L   + N +   P
Sbjct: 259 VTLLGISVEDINLPFNDGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDP 318

Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN 340
           +     +P     LCY   +  K   +T HF GADV L+P+ +F  + D I C AF    
Sbjct: 319 IP---IDPTLGYQLCYRTPTNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTF 375

Query: 341 ANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +N   +YG   Q N+LIG+D+E+ +VSFK + CTN
Sbjct: 376 SNEYGIYGNHAQSNYLIGFDLEKQLVSFKATDCTN 410


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 146/357 (40%), Positives = 208/357 (58%), Gaps = 39/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM +SIGTPPVD  G  DTGSD TW QC PC  L C++Q  P+F+P KS++++ + C++
Sbjct: 92  YLMSVSIGTPPVDYLGIADTGSDLTWAQCLPC--LKCYQQLRPIFNPLKSTSFSHVPCNT 149

Query: 95  SQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C AV   +C  +G C YS+ YG   Y   S G+L  E +T  S+S         + GC
Sbjct: 150 QTCHAVDDGHCGVQGVCDYSYTYGDRTY---SKGDLGFEKITIGSSS------VKSVIGC 200

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCLP---DQGSSKINFG- 206
           GH   AS       +G+IGLG G  SL+SQM   + I+ +FSYCLP      + KINFG 
Sbjct: 201 GH---ASSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGE 257

Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
             +V+G GVVSTPLI ++   +YY++LEAIS+GN+R     +  GN+ +D+G   T+LP 
Sbjct: 258 NAVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER-HMAFAKQGNVIIDSGTTLTILPK 316

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVK 317
           E +  + S +  ++KA+ VK    +P  S  LC+    N ++    P +T HF  GA+V 
Sbjct: 317 ELYDGVVSSLLKVVKAKRVK----DPHGSLDLCFDDGINAAASLGIPVITAHFSGGANVN 372

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L P N FR ++D + C   +  +      + G + Q NFLIGYD+E   +SFKP+ C
Sbjct: 373 LLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQANFLIGYDLEAKRLSFKPTVC 429


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 151/370 (40%), Positives = 219/370 (59%), Gaps = 33/370 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           ++ +I+    YLM  S+GTPP  I G VDTGSD  W QC+PC   DC+ Q  P+FDP +S
Sbjct: 84  ESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPCE--DCYNQTTPIFDPSQS 141

Query: 85  STYNSISCSSSQCAVVTS--NCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
            TY ++ CSS+ C  V S  +CS    +C Y+  YG  ++   S G+L+ ETLT  ST G
Sbjct: 142 KTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSH---SQGDLSVETLTLGSTDG 198

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---- 196
             V+ P  + GCGH N  +   +      +G GP   SLISQ+ +SI GKFSYCL     
Sbjct: 199 SSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGP--VSLISQLSSSIGGKFSYCLAPLFS 256

Query: 197 -DQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG--- 247
               SSK+NFG   +V+G G VSTP++ ++    Y+L+LEA SVG+ R+EF SSS     
Sbjct: 257 QSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGSSSFESSG 316

Query: 248 ---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
              NI +D+G   T+LP + + NL+S +++ I+ + V+    +P     LCY  +S  + 
Sbjct: 317 GEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVE----DPSKFLRLCYRTTSSDEL 372

Query: 305 --PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
             P +T HF+GADV+L+P + F  + + ++C AFR      ++G + Q N L+GYD+ + 
Sbjct: 373 NVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRSSKIGPIFGNLAQQNLLVGYDLVKQ 432

Query: 363 MVSFKPSRCT 372
            VSFKP+ CT
Sbjct: 433 TVSFKPTDCT 442


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  233 bits (594), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 143/372 (38%), Positives = 211/372 (56%), Gaps = 32/372 (8%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           +Q +++     Y M +SIGTP V++    DTGSD TW QC PC    C++Q+ PLFDP +
Sbjct: 83  FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPCDP--CYRQKSPLFDPSR 140

Query: 84  SSTYNSISCSSSQCAV--VTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNST 138
           SS+Y  + C S  C    V+      D   C Y + YG  +Y   ++GNLATE  T  ST
Sbjct: 141 SSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSY---TNGNLATEKFTIGST 197

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
           S  PV +  ++FGCG  N    T D   +GI+GLG G  SL+SQ+ + I GKFSYCL   
Sbjct: 198 SSRPVHLSPIVFGCGTGN--GGTFDELGSGIVGLGGGALSLVSQLSSIIKGKFSYCLVPL 255

Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS---- 244
                 +SKI FG   +++G  VVSTPL+ +    +YY++LEAISVGN+RL + +     
Sbjct: 256 SEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNG 315

Query: 245 --STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
               GN+ +D+G   T L  E+ + L+ V+   +KA+ V    ++P     +C+  +   
Sbjct: 316 NVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERV----SDPRGLFSVCFRSAGDI 371

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
             P + +HF  ADVKL P N F    ++++C      N   ++G + Q++FL+GYD+E+ 
Sbjct: 372 DLPVIAVHFNDADVKLQPLNTFVKADEDLLCFTMISSNQIGIFGNLAQMDFLVGYDLEKR 431

Query: 363 MVSFKPSRCTNY 374
            VSFKP+ CT +
Sbjct: 432 TVSFKPTDCTKH 443


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 151/400 (37%), Positives = 216/400 (54%), Gaps = 38/400 (9%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQ------------AEIISVDDIYLMHLSIGTPPVDIF 49
           +NS   PFY  N   K+ +   YQ              + S +  YLM L++G+PPVDI+
Sbjct: 37  KNSPNSPFYKSNNFHKNKLRSFYQVPKKSFVQKSPYTRVTSNNGDYLMKLTLGSPPVDIY 96

Query: 50  GSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS-EGD 108
           G VDTGSD  W QC PC    C++Q+ P+F+P +S TY+ I C S QC+    +CS +  
Sbjct: 97  GLVDTGSDLVWAQCTPCG--GCYRQKSPMFEPLRSKTYSPIPCESEQCSFFGYSCSPQKM 154

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
           C+YS+ Y   A +S + G LA E +TF+ST G PV + ++IFGCGH N  +   +     
Sbjct: 155 CAYSYSY---ADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGTFNENDMGII 211

Query: 169 IIGLGPGNSSLISQMGTSIAGK-FSYCL-----PDQGSSKINFG--GIVAGAGVVSTPLI 220
            +G GP   SL+SQ+GT    K FS CL         S  INFG    V+G GVV+TPL 
Sbjct: 212 GMGGGP--LSLVSQIGTLYGSKRFSQCLVPFHTDAHTSGTINFGEESDVSGEGVVTTPLA 269

Query: 221 IRD---HYYLSLEAISVGNQRLEFVSSST---GNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
             +    Y ++LE ISVG+  + F SS T   GNI +D+G   T +P E++  L   +  
Sbjct: 270 SEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMIDSGTPATYIPQEFYERLVEELKV 329

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCS 334
                P++    +P     LCY   +  + P +T HF GADV+L P   F    D + C 
Sbjct: 330 QSSLLPIED---DPDLGTQLCYRSETNLEGPILTAHFEGADVQLLPIQTFIPPKDGVFCF 386

Query: 335 AFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           A  G  + + ++G   Q N L+G+D+++  +SFKP+ CTN
Sbjct: 387 AMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCTN 426


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 142/359 (39%), Positives = 206/359 (57%), Gaps = 30/359 (8%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YLM  S+GTPP  ++G VDT SD  W QC+ C    C+    P+FDP  S TY ++ 
Sbjct: 85  DGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLCET--CYNDTSPMFDPSYSKTYKNLP 142

Query: 92  CSSSQCAVVT-SNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           CSS+ C  V  ++CS  +   C ++  Y  G++   S G+L  ET+T  S +   V  P 
Sbjct: 143 CSSTTCKSVQGTSCSSDERKICEHTVNYKDGSH---SQGDLIVETVTLGSYNDPFVHFPR 199

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINF 205
            + GC      S  S     GI+GLG G  SL+ Q+ +SI+ KFSYCL      SSK+ F
Sbjct: 200 TVIGCIRNTNVSFDS----IGIVGLGGGPVSLVPQLSSSISKKFSYCLAPISDRSSKLKF 255

Query: 206 G--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSST-----GNIFVDTGV 255
           G   +V+G G VST ++ +D    YYL+LEA SVGN R+EF SSS+     GNI +D+G 
Sbjct: 256 GDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRSSGKGNIIIDSGT 315

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRGA 314
             T+LP + +S L+S +++++K +  +    +P     LCY  +  +   P +T HF GA
Sbjct: 316 TFTVLPDDVYSKLESAVADVVKLERAE----DPLKQFSLCYKSTYDKVDVPVITAHFSGA 371

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           DVKL+  N F   S  ++C AF    +  ++G + Q NFL+GYD+++ +VSFKP+ CT 
Sbjct: 372 DVKLNALNTFIVASHRVVCLAFLSSQSGAIFGNLAQQNFLVGYDLQRKIVSFKPTDCTK 430


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  229 bits (584), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 143/358 (39%), Positives = 208/358 (58%), Gaps = 39/358 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM +SIGTPPVD  G  DTGSD  W QC PC  L C+KQ  P+FDP KS++++ + C+S
Sbjct: 92  YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCYKQSRPIFDPLKSTSFSHVPCNS 149

Query: 95  SQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C A+  S+C ++G C YS+ YG   Y   + G+L  E +T  S+S         + GC
Sbjct: 150 QNCKAIDDSHCGAQGVCDYSYTYGDQTY---TKGDLGFEKITIGSSS------VKSVIGC 200

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP---DQGSSKINFG- 206
           GH++       S    +IGLG G  SL+SQM  +  I+ +FSYCLP      + KINFG 
Sbjct: 201 GHESGGGFGFASG---VIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQ 257

Query: 207 -GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
             +V+G GVVSTPLI ++   +YY++LEAIS+GN+R    S+  GN+ +D+G   + LP 
Sbjct: 258 NAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNER-HMASAKQGNVIIDSGTTLSFLPK 316

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVK 317
           E +  + S +  ++KA+ VK    +PG    LC+    N+++    P +T  F  GA+V 
Sbjct: 317 ELYDGVVSSLLKVVKAKRVK----DPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVN 372

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI---NFLIGYDIEQAMVSFKPSRCT 372
           L P N F+ +++ + C      +    +G I  +   NFLIGYD+E   +SFKP+ CT
Sbjct: 373 LLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 430


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 141/361 (39%), Positives = 197/361 (54%), Gaps = 38/361 (10%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A  +  + +YLM L +GTPP +I   +DTGS+ TWTQC PC  + C++Q  P+FDP KSS
Sbjct: 56  ANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCLPC--VHCYEQNAPIFDPSKSS 113

Query: 86  TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           T+    C    C             Y   Y    Y   + G LATET+T +STSG P  M
Sbjct: 114 TFKEKRCDGHSCP------------YEVDYFDHTY---TMGTLATETITLHSTSGEPFVM 158

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF 205
           P  I GCGH N     S S   G++GL  G SSLI+QMG    G  SYC   QG+SKINF
Sbjct: 159 PETIIGCGHNNSWFKPSFS---GMVGLNWGPSSLITQMGGEYPGLMSYCFSGQGTSKINF 215

Query: 206 G--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGV 255
           G   IVAG GVVST + +       YYL+L+A+SVGN R+E + ++     GNI +D+G 
Sbjct: 216 GANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETMGTTFHALEGNIVIDSGT 275

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GA 314
             T  P+ Y + ++  + +++ A       A+P  +D+LCYN  +   FP +T+HF  G 
Sbjct: 276 TLTYFPVSYCNLVRQAVEHVVTAVR----AADPTGNDMLCYNSDTIDIFPVITMHFSGGV 331

Query: 315 DVKLSPSNLFRNISD-EIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           D+ L   N++   ++  + C A          ++G   Q NFL+GYD    +VSF P+ C
Sbjct: 332 DLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYDSSSLLVSFSPTNC 391

Query: 372 T 372
           +
Sbjct: 392 S 392


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  228 bits (580), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 150/408 (36%), Positives = 217/408 (53%), Gaps = 68/408 (16%)

Query: 4   SQKLPFYNDNETPKSPISII---------YQAEIIS-----VDDI---------YLMHLS 40
           S + PFYN  ET    IS I         Y   + S     + D+         Y+M  S
Sbjct: 36  SSRSPFYNPKETQIQRISSILNYSINRVRYLNHVFSFSPNKIQDVPLSSFMGAGYVMSYS 95

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
           IGTPP  ++  +DTG+D  W QC+PC    C  Q  P+F P KSSTY +I C+S  C   
Sbjct: 96  IGTPPFQLYSLIDTGNDNIWFQCKPCKP--CLNQTSPMFHPSKSSTYKTIPCTSPICK-- 151

Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
                  D  Y                L  +TLT NS +G P+   N++ GCGH+N    
Sbjct: 152 -----NADGHY----------------LGVDTLTLNSNNGTPISFKNIVIGCGHRNQGP- 189

Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG--GIVAGAG 213
             +   +G IGL  G  S ISQ+ +SI GKFSYCL      +  SSK++FG    V+G G
Sbjct: 190 -LEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCLVPLFSKENVSSKLHFGDKSTVSGLG 248

Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-STGNIFVDTGVLRTLLPLEYHSNLKSVM 272
            VSTP+   + Y++SLEA SVG+  ++  +S + GN  +D+G   T+LP + +S L+SV+
Sbjct: 249 TVSTPIKEENGYFVSLEAFSVGDHIIKLENSDNRGNSIIDSGTTMTILPKDVYSRLESVV 308

Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI---HFRGADVKLSPSNLFRNISD 329
            +M+K + VK    +P     LCY  +S     +V I   HF G++V L+  N F  I+D
Sbjct: 309 LDMVKLKRVK----DPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYPITD 364

Query: 330 EIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
           E++C AF  GGN +   ++G ++Q NFL+G+D+ +  +SFKP+ CT +
Sbjct: 365 EVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDCTKH 412


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 139/355 (39%), Positives = 195/355 (54%), Gaps = 40/355 (11%)

Query: 33  DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           +IYLM L +GTPP +I   +DTGSD  WTQC PC   +C+ Q  P+FDP  SST+     
Sbjct: 59  NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCT--NCYSQYAPIFDPSNSSTFKE--- 113

Query: 93  SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                      C+   C Y  +Y   A  ++S G LATET+T +STSG P  MP    GC
Sbjct: 114 ---------KRCNGNSCHYKIIY---ADTTYSKGTLATETVTIHSTSGEPFVMPETTIGC 161

Query: 153 GHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
           GH  +   PT     +G++GL  G SSLI+QMG    G  SYC   QG+SKINFG   IV
Sbjct: 162 GHNSSWFKPTF----SGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIV 217

Query: 210 AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLP 261
           AG GVVST + +       YYL+L+A+SVG+  +E + ++     GNI +D+G   T  P
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
           + Y + ++  + + + A       A+P  +D+LCY   +   FP +T+HF  GAD+ L  
Sbjct: 278 VSYCNLVREAVDHYVTAVRT----ADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDK 333

Query: 321 SNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            N++   I+    C A    N   + ++G   Q NFL+GYD    +VSF P+ C+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 150/373 (40%), Positives = 208/373 (55%), Gaps = 29/373 (7%)

Query: 20  ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
           I  I QA I +    +LM + IGTPP+ I G VDTGSD  W QC PC  L C+KQ  P+F
Sbjct: 53  IQNIVQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC--LGCYKQIKPMF 110

Query: 80  DPKKSSTYNSISCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
           DP KSSTYN+ISC S  C  + T  CS E  C+Y++ YG     S + G LA +T TF S
Sbjct: 111 DPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDN---SLTKGVLAQDTATFTS 167

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLP 196
            +G PV +   +FGCGH N      +  + G+IGLG G +SLISQ+G    G KFS CL 
Sbjct: 168 NTGKPVSLSRFLFGCGHNNTGG--FNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLV 225

Query: 197 D-----QGSSKINF--GGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSS-S 245
                 + SS+++F  G  V G GVV+TPL+ R+    Y+++L  ISV +      S+  
Sbjct: 226 PFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYFPMNSTIG 285

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
             N+ VD+G    LLP + +  + + + N +  +P+     +P     LCY   +  K P
Sbjct: 286 KANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPIT---DDPSLGTQLCYRTQTNLKGP 342

Query: 306 EVTIHFRGADVKLSPSNLFRNISDE---IMCSAF--RGGNANIVYGRIMQINFLIGYDIE 360
            +T HF GA+V L+P   F   + +   I C A   R  +   VYG   Q N+LIG+D++
Sbjct: 343 TLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLD 402

Query: 361 QAMVSFKPSRCTN 373
           + +VSFKP+ CT 
Sbjct: 403 RQVVSFKPTDCTK 415


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 145/380 (38%), Positives = 206/380 (54%), Gaps = 44/380 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +IS    Y M +SIGTPP       DTGSD TW QC+PC +  C+KQ  PLFD KKS
Sbjct: 75  QSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPCQQ--CYKQNTPLFDKKKS 132

Query: 85  STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           STY + SC S  C  ++ +   C E    C Y + YG     SF+ G +ATET++ +S+S
Sbjct: 133 STYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGD---ESFTKGEVATETISIDSSS 189

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
           G PV  P   FGCG+ N  +          +G GP   SL+SQ+G+SI  KFSYCL    
Sbjct: 190 GSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGP--LSLVSQLGSSIGKKFSYCLSHTS 247

Query: 198 ---QGSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV--- 242
               G+S IN G           + +++TPLI +D   +Y+L+LEAI+VG  +L +    
Sbjct: 248 ATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGG 307

Query: 243 -------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
                  S  TGNI +D+G   TLL   ++ +  +V+   +     K V    G   +L 
Sbjct: 308 GYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTG--AKRVSDPQG---ILT 362

Query: 296 YNISSQPK---FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQIN 352
           +   S  K    P +T+HF GADVKLSP N F  +S++I+C +        +YG ++Q++
Sbjct: 363 HCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVCLSMIPTTEVAIYGNMVQMD 422

Query: 353 FLIGYDIEQAMVSFKPSRCT 372
           FL+GYD+E   VSF+   C+
Sbjct: 423 FLVGYDLETKTVSFQRMDCS 442


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 147/373 (39%), Positives = 208/373 (55%), Gaps = 30/373 (8%)

Query: 20  ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
           I  I QA I +    YLM L IGTPP+ I G+VDTGSD  W QC PC  L C+ Q  P+F
Sbjct: 49  IQDIVQAPINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMF 106

Query: 80  DPKKSSTYNSISCSSSQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
           DP KSSTY +ISC S  C       CS E  C Y++ Y   A +S + G LA ET+T  S
Sbjct: 107 DPLKSSTYTNISCDSPLCYKPYIGECSPEKRCDYTYGY---ADSSLTKGVLAQETVTLTS 163

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLP 196
            +G P+ +  ++FGCGH N  +   +  + G+IGLG G +SL+SQ+G    G KFS CL 
Sbjct: 164 NTGKPISLQGILFGCGHNNTGN--FNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLV 221

Query: 197 D-----QGSSKINF--GGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS- 244
                   SS+++F  G  V G GVV+TPL+ R+     YY++L  ISV +  L   S+ 
Sbjct: 222 PFLTDITISSQMSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTI 281

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
             GN+ VD+G    +LP + +  +   + N +  +P+     +P     LCY   +  K 
Sbjct: 282 EKGNMLVDSGTPPNILPQQLYDRVYVEVKNKVPLEPIT---DDPSLGPQLCYRTQTNLKG 338

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE---IMCSAFRG-GNANI-VYGRIMQINFLIGYDI 359
           P +T HF GA++ L+P   F   + E   + C A     N++  +YG   Q N+LIG+D+
Sbjct: 339 PTLTYHFEGANLLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDL 398

Query: 360 EQAMVSFKPSRCT 372
           ++ +VSFKP+ CT
Sbjct: 399 DRQIVSFKPTDCT 411


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 145/380 (38%), Positives = 205/380 (53%), Gaps = 34/380 (8%)

Query: 11  NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
            +N+ P+S + I++  E       YLM   IGTPPV+   + DTGSD  W QC PC    
Sbjct: 74  QNNKLPQS-VLILHNGE-------YLMRFYIGTPPVERLATADTGSDLIWVQCSPCA--S 123

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSG 126
           CF Q  PLF P KSST+   +C S  C ++      C + G+C Y++ YG     SFS G
Sbjct: 124 CFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSGECIYTYKYGDQY--SFSEG 181

Query: 127 NLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT 185
            L+TETL F+S  G+  V  PN  FGCG  N  +     K TGI+GLG G  SL+SQ+G 
Sbjct: 182 LLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQIGD 241

Query: 186 SIAGKFSYCLPDQGS---SKINFGG--IVAGAGVVSTPLIIR----DHYYLSLEAISVGN 236
            I  KFSYCL   GS   SK+ FG   I+ G GVVSTP+II+     +Y+L+LEA++V  
Sbjct: 242 QIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQ 301

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           + +    S+ GN+ +D+G L T L   ++ N  + +   +  + V+ V +   F    C+
Sbjct: 302 KTVP-TGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPF----CF 356

Query: 297 NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRIMQINF 353
                  FPE+   F GA V L P+NLF    D   +C      + +   ++G   QI+F
Sbjct: 357 PYRDNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDF 416

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
            + YD+E   VSF+P+ C+ 
Sbjct: 417 QVEYDLEGKKVSFQPTDCSK 436


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 145/378 (38%), Positives = 209/378 (55%), Gaps = 39/378 (10%)

Query: 23  IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           + Q++I+     YLM +SIG P V+I    DTGSD  W QC+PC    C+KQ  P+FDP+
Sbjct: 81  LVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPCEM--CYKQNSPIFDPR 138

Query: 83  KSSTYNSISCSSSQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETL-- 133
           +SS+Y ++ C +  C  +       D       C Y++ YG     SFS G+LA E    
Sbjct: 139 RSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGD---QSFSDGHLAIERFGI 195

Query: 134 --TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
             T ++TS        V FGCG KN    T D   +GIIGLG G+ SL+SQ+G  ++GKF
Sbjct: 196 GSTNSNTSAAIAYFQEVAFGCGTKN--GGTFDELGSGIIGLGGGSMSLVSQLGPKLSGKF 253

Query: 192 SYCL-----PDQGSSKINFGGIVAGAG----VVSTPLIIRD---HYYLSLEAISVGNQRL 239
           SYCL         +SKINFG  +  +G    VVSTPL+ +    +YYL+LEAISV N+RL
Sbjct: 254 SYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRL 313

Query: 240 EFVS-----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
            + +        GNI +D+G   T L  E+ +NL S +   +K + V    ++P     +
Sbjct: 314 PYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERV----SDPHGLFNI 369

Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFL 354
           C+      + P +T HF GADV+L P N F  + ++++C      N   ++G + Q+NFL
Sbjct: 370 CFKDEKAIELPIITAHFTGADVELQPVNTFAKVEEDLLCFTMIPSNDIAIFGNLAQMNFL 429

Query: 355 IGYDIEQAMVSFKPSRCT 372
           +GYD+E+  VSF P+ CT
Sbjct: 430 VGYDLEKKAVSFLPTDCT 447


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  224 bits (570), Expect = 6e-56,   Method: Compositional matrix adjust.
 Identities = 138/355 (38%), Positives = 194/355 (54%), Gaps = 40/355 (11%)

Query: 33  DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           +IYLM L +GTPP +I   +DTGSD  WTQC PC   +C+ Q  P+FDP  SST+     
Sbjct: 59  NIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCT--NCYSQYAPIFDPSNSSTFKE--- 113

Query: 93  SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                      C+   C Y  +Y   A  ++S G LATET+T +STSG P  MP    GC
Sbjct: 114 ---------KRCNGNSCHYKIIY---ADTTYSKGTLATETVTIHSTSGEPFVMPETTIGC 161

Query: 153 GHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
           GH  +   PT     +G++GL  G SSLI+QMG    G  SYC   QG+SKINFG   IV
Sbjct: 162 GHNSSWFKPTF----SGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNAIV 217

Query: 210 AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLP 261
           AG GVVST + +       YYL+L+A+SVG+  +E + ++     GNI +D+G   T  P
Sbjct: 218 AGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTYFP 277

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
           + Y + ++  + + + A       A+P  +D+LCY   +   FP +T+HF  GAD+ L  
Sbjct: 278 VSYCNLVREAVDHYVTAVRT----ADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDK 333

Query: 321 SNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            N++   I+    C A    N   + ++G   Q NFL+GYD    +V F P+ C+
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  223 bits (567), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 139/362 (38%), Positives = 196/362 (54%), Gaps = 40/362 (11%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A+ +  + +YLM L +GTPP +I   +DTGS+ TWTQC PC  + C+KQ  P+FDP KSS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428

Query: 86  TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           T+                C +  C Y   Y    Y   + G LAT+T+T +STSG P  M
Sbjct: 429 TFKE------------KRCHDHSCPYEVDYFDKTY---TKGTLATDTVTIHSTSGEPFVM 473

Query: 146 PNVIFGCGHKN-LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
              I GCG  N    P+ +    G +GL  G  SLI+QMG    G  SYC    G+SKIN
Sbjct: 474 AETIIGCGRNNSWFRPSFE----GFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKIN 529

Query: 205 FG--GIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTG 254
           FG   IV G GVVST + +       YYL+L+A+SVG+ R+E + +      GNI +D+G
Sbjct: 530 FGTNAIVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSG 589

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-G 313
              T  P  Y + ++  + +++ A P     A+P  +D+LCY  ++   FP +T+HF  G
Sbjct: 590 TTLTYFPESYCNLVRQAVEHVVPAVP----AADPTGNDLLCYYSNTTEIFPVITMHFSGG 645

Query: 314 ADVKLSPSNLF-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           AD+ L   N+F  + S  + C A    N     ++G   Q NFL+GYD    +VSFKP+ 
Sbjct: 646 ADLVLDKYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTN 705

Query: 371 CT 372
           C+
Sbjct: 706 CS 707



 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 123/336 (36%), Positives = 173/336 (51%), Gaps = 54/336 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L IGTPP ++   +DTGS+  WTQC PC  L C+ Q+ P+FDP KSST+    C  
Sbjct: 65  YLMKLQIGTPPFEVEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC-- 120

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                   N  +  C Y  +Y   +Y   + G LATET+T +STSG+P  MP  I GC  
Sbjct: 121 --------NTPDHSCPYKLVYDDKSY---TQGTLATETVTIHSTSGVPFVMPETIIGCSR 169

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
            N  S    S  +GI+GL  G+ SLISQM                      GG   G GV
Sbjct: 170 NNSGSGFRPSS-SGIVGLSRGSLSLISQM----------------------GGAYPGDGV 206

Query: 215 VSTPLII----RDHYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEYHS 266
           VST +      R  YYL+L+A+SVG+ R+E V +      GNI +D+G   T  P+ Y +
Sbjct: 207 VSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTYFPVSYCN 266

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFR 325
            ++  +  ++ A  V     +P  +D+LCY  ++   FP +T+HF  GAD+ L   N++ 
Sbjct: 267 LVRKAVERVVTADRV----VDPSRNDMLCYYSNTIEIFPVITVHFSGGADLVLDKYNMYM 322

Query: 326 NISD-EIMCSAFRGGNAN--IVYGRIMQINFLIGYD 358
            ++   + C A    N     ++G   Q NFL+GYD
Sbjct: 323 ELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  222 bits (566), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 139/354 (39%), Positives = 190/354 (53%), Gaps = 35/354 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           IYLM L +GTPP +I   +DTGSD  WTQC PCP  +C+ Q  P+FDP KSST+      
Sbjct: 60  IYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSSTFKE---- 113

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
                     C    C Y  +Y   A  S+S+G LATET+T  STSG P  M     GCG
Sbjct: 114 --------KRCHGNSCPYEIIY---ADESYSTGILATETVTIQSTSGEPFVMAETSIGCG 162

Query: 154 --HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
             + NL +P   +  +GI+GL  G SSLISQM   I G  SYC   QG+SKINFG   +V
Sbjct: 163 LNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINFGTNAVV 222

Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS----STGNIFVDTGVLRTLLPL 262
           AG G V+  + I+     YYL+L+A+SVG++R+E + +      GNIF+D+G   T LP 
Sbjct: 223 AGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTTYTYLPT 282

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPS 321
            Y  NL                  +P   ++LCYN  +   FP +T+HF  GAD+ L   
Sbjct: 283 SY-CNLVREAVAASVVA--ANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVLDKY 339

Query: 322 NLF-RNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N++   I+    C A    + ++  ++G     N L+GYD    ++SF P+ C+
Sbjct: 340 NMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 137/352 (38%), Positives = 190/352 (53%), Gaps = 39/352 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L IGTPP +I   +DTGS+  WTQC PC  + C+ Q  P+FDP KSST+  I C +
Sbjct: 65  YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQTAPIFDPSKSSTFKEIRCDT 122

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                      +  C Y  +YG  +Y   + G L TET+T +STSG P  MP  I GCG 
Sbjct: 123 H----------DHSCPYELVYGGKSY---TKGTLVTETVTIHSTSGQPFVMPETIIGCGR 169

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGA 212
            N       +   G++GL  G  SLI+QMG    G  SYC   +G+SKINFG   IVAG 
Sbjct: 170 NNSGFKPGFA---GVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGD 226

Query: 213 GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEY 264
           GVVST + ++      YYL+L+A+SVGN R+E V +      GNI +D+G   T  P  Y
Sbjct: 227 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPESY 286

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNL 323
            + ++  +  ++ A            SD+LCY   +   FP +T+HF  GAD+ L   N+
Sbjct: 287 CNLVRKAVEQVVTAVRFPR-------SDILCYYSKTIDIFPVITMHFSGGADLVLDKYNM 339

Query: 324 F-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +  + +  + C A    +     ++G   Q NFL+GYD    +VSFKP+ C+
Sbjct: 340 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 137/352 (38%), Positives = 190/352 (53%), Gaps = 39/352 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L IGTPP +I   +DTGS+  WTQC PC  + C+ Q  P+FDP KSST+  I C +
Sbjct: 59  YLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQTAPIFDPSKSSTFKEIRCDT 116

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                      +  C Y  +YG  +Y   + G L TET+T +STSG P  MP  I GCG 
Sbjct: 117 H----------DHSCPYELVYGGKSY---TKGTLVTETVTIHSTSGQPFVMPETIIGCGR 163

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGA 212
            N       +   G++GL  G  SLI+QMG    G  SYC   +G+SKINFG   IVAG 
Sbjct: 164 NNSGFKPGFA---GVVGLDRGPKSLITQMGGEYPGLMSYCFAGKGTSKINFGANAIVAGD 220

Query: 213 GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRTLLPLEY 264
           GVVST + ++      YYL+L+A+SVGN R+E V +      GNI +D+G   T  P  Y
Sbjct: 221 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGNIVIDSGSTLTYFPESY 280

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNL 323
            + ++  +  ++ A            SD+LCY   +   FP +T+HF  GAD+ L   N+
Sbjct: 281 CNLVRKAVEQVVTAVRFPR-------SDILCYYSKTIDIFPVITMHFSGGADLVLDKYNM 333

Query: 324 F-RNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +  + +  + C A    +     ++G   Q NFL+GYD    +VSFKP+ C+
Sbjct: 334 YVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 139/352 (39%), Positives = 202/352 (57%), Gaps = 39/352 (11%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AV 99
           IGTPPVD  G  DTGSD TW QC PC  L C++Q  P+F+P KS++++ + C++  C AV
Sbjct: 86  IGTPPVDYLGIADTGSDLTWAQCLPC--LKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAV 143

Query: 100 VTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
              +C  +G C YS+ YG   Y   S G+L  E +T  S+S         + GCGH   A
Sbjct: 144 DDGHCGVQGVCDYSYTYGDRTY---SKGDLGFEKITIGSSS------VKSVIGCGH---A 191

Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCLP---DQGSSKINFG--GIVAG 211
           S       +G+IGLG G  SL+SQM   + I+ +FSYCLP      + KINFG   +V+G
Sbjct: 192 SSGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLLSHANGKINFGQNAVVSG 251

Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
            GVVSTPLI ++   +YY++LEAIS+GN+R     +  GN+ +D+G   + LP E +  +
Sbjct: 252 PGVVSTPLISKNTVTYYYITLEAISIGNER-HMAFAKQGNVIIDSGTTLSFLPKELYDGV 310

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPEVTIHFR-GADVKLSPSNL 323
            S +  ++KA+ VK    +PG    LC+    N+++    P +T  F  GA+V L P N 
Sbjct: 311 VSSLLKVVKAKRVK----DPGNFWDLCFDDGINVATSSGIPIITAQFSGGANVNLLPVNT 366

Query: 324 FRNISDEIMCSAFRGGNANIVYGRIMQI---NFLIGYDIEQAMVSFKPSRCT 372
           F+ +++ + C      +    +G I  +   NFLIGYD+E   +SFKP+ CT
Sbjct: 367 FQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAKRLSFKPTVCT 418


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 190/354 (53%), Gaps = 36/354 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           IYLM L +GTPP +I   +DTGSD  WTQC PCP  +C+ Q  P+FDP KSST+      
Sbjct: 420 IYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCP--NCYSQFAPIFDPSKSSTFRE---- 473

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
                     C+   C Y  +Y    Y   S G LATET+T  STSG P  M     GCG
Sbjct: 474 --------QRCNGNSCHYEIIYADKTY---SKGILATETVTIPSTSGEPFVMAETKIGCG 522

Query: 154 --HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GIV 209
             + NL      S  +GI+GL  G  SLISQM     G  SYC   QG+SKINFG   IV
Sbjct: 523 LDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCFSGQGTSKINFGTNAIV 582

Query: 210 AGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFV----SSSTGNIFVDTGVLRTLLPL 262
           AG G V+  + I+     YYL+L+A+SV +  +  +     +  GNIF+D+G   T  P+
Sbjct: 583 AGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFHAEDGNIFIDSGTTLTYFPM 642

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPS 321
            Y + ++  +  ++ A  V  +G++    ++LCY   +   FP +T+HF  GAD+ L   
Sbjct: 643 SYCNLVREAVEQVVTAVKVPDMGSD----NLLCYYSDTIDIFPVITMHFSGGADLVLDKY 698

Query: 322 NLF-RNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N++   I+  I C A    + ++  V+G   Q NFL+GYD    ++SF P+ C+
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752



 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 134/341 (39%), Positives = 181/341 (53%), Gaps = 36/341 (10%)

Query: 33  DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           +IYLM L +GTPP +I   +DTGSD  WTQC PCP  DC+ Q  P+FDP KSST+N    
Sbjct: 80  NIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCP--DCYSQFDPIFDPSKSSTFNE--- 134

Query: 93  SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                      C    C Y  +Y    Y   S G LATET+T +STSG P  M     GC
Sbjct: 135 ---------QRCHGKSCHYEIIYEDNTY---SKGILATETVTIHSTSGEPFVMAETTIGC 182

Query: 153 GHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG--GI 208
           G  N  L +    S  +GI+GL  G  SLISQM     G  SYC   QG+SKINFG   I
Sbjct: 183 GLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCFSGQGTSKINFGTNAI 242

Query: 209 VAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFV----SSSTGNIFVDTGVLRTLLP 261
           VAG G V+  + I+     YYL+L+A+SV + R+E +     +  GNI +D+G   T  P
Sbjct: 243 VAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFHAEDGNIVIDSGSTVTYFP 302

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
           + Y + ++  +  ++ A  V     +P  +D+LCY   +   FP +T+HF  GAD+ L  
Sbjct: 303 VSYCNLVRKAVEQVVTAVRVP----DPSGNDMLCYFSETIDIFPVITMHFSGGADLVLDK 358

Query: 321 SNLF-RNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYD 358
            N++  + S  + C A          ++G   Q NFL+GYD
Sbjct: 359 YNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLVGYD 399


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 139/365 (38%), Positives = 203/365 (55%), Gaps = 39/365 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M +SIGTPP+++    DTGSD  W QC+PC E  C+KQ+ P+F+PK+SSTY  + C +
Sbjct: 94  YFMRISIGTPPIEVLVIADTGSDLIWVQCQPCQE--CYKQKSPIFNPKQSSTYRRVLCET 151

Query: 95  SQCAVVTSN---CSE----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
             C  + S+   CS       C YS+ YG     SF+ G LATE     ST+    E+  
Sbjct: 152 RYCNALNSDMRACSAHGFFKACGYSYSYGD---HSFTMGYLATERFIIGSTNNSIQELA- 207

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFG 206
             FGCG+ N      D   +GI+GLG G+ SLISQ+GT I  KFSYCL P    S  + G
Sbjct: 208 --FGCGNSN--GGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLG 263

Query: 207 GIVAGAG--------VVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS------STGNI 249
            IV G           VSTPL+ ++    YYL+LEAISVGN+RL + +S        GNI
Sbjct: 264 KIVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNI 323

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            +D+G   T L  + ++ L+ V+   ++ + V    ++P     +C+      + P +T+
Sbjct: 324 IIDSGTTLTFLDSKLYNKLELVLEKAVEGERV----SDPNGIFSICFRDKIGIELPIITV 379

Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
           HF  ADV+L P N F    ++++C      N   ++G + Q+NFL+GYD+++  VSF P+
Sbjct: 380 HFTDADVELKPINTFAKAEEDLLCFTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPT 439

Query: 370 RCTNY 374
            C+ +
Sbjct: 440 DCSGH 444


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 141/382 (36%), Positives = 202/382 (52%), Gaps = 35/382 (9%)

Query: 9   FYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
           F ++N+ P+S         +I     YLM   IG+PPV+    VDTGS   W QC PC  
Sbjct: 71  FLDENKLPES--------LLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPCH- 121

Query: 69  LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---SNCSE-GDCSYSFLYGRGAYASFS 124
            +CF QE PLF+P KSSTY   +C S  C ++     +C + G C Y  +YG     SFS
Sbjct: 122 -NCFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGD---KSFS 177

Query: 125 SGNLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
            G L TETL+F ST G   V  PN IFGCG  N  +  + +K  GI GLG G  SL+SQ+
Sbjct: 178 VGILGTETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQL 237

Query: 184 GTSIAGKFSYCL---PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAISV 234
           G  I  KFSYCL       +SK+ FG   I+   GVVSTPLII+     +Y+L+LEA+++
Sbjct: 238 GAQIGHKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTI 297

Query: 235 GNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
           G Q++     + GNI +D+G   T L   +++N  + +   +  + ++ + +        
Sbjct: 298 G-QKVVSTGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLK----T 352

Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAF--RGGNANIVYGRIMQI 351
           C+   +    P++   F GA V L P N+   ++D  I+C A     G    ++G I Q 
Sbjct: 353 CFPNRANLAIPDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQY 412

Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
           +F + YD+E   VSF P+ C  
Sbjct: 413 DFQVEYDLEGKKVSFAPTDCAK 434


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 150/387 (38%), Positives = 212/387 (54%), Gaps = 34/387 (8%)

Query: 8   PFYNDNETPKSPI-SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
           PFY  +E     + S      + S +  YLM L++GTPPVD++G VDTGSD  W QC PC
Sbjct: 22  PFYKSDELHMHRLGSNGVFTRVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC 81

Query: 67  PELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCS-EGDCSYSFLYGRGAYASFS 124
               C++Q+ P+F+P +S+TY  I C S +C ++   +CS +  C+YS+ Y   A +S +
Sbjct: 82  Q--GCYRQKSPMFEPLRSNTYTPIPCDSEECNSLFGHSCSPQKLCAYSYAY---ADSSVT 136

Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
            G LA ET+TF+ST G PV + +++FGCGH N  S T +    GIIGLG G  SL+SQ G
Sbjct: 137 KGVLARETVTFSSTDGEPVVVGDIVFGCGHSN--SGTFNENDMGIIGLGGGPLSLVSQFG 194

Query: 185 TSIAGK-FSYCLPDQGSSKINFGGI-------VAGAGVVSTPLIIRDH---YYLSLEAIS 233
                K FS CL    +     G I       V+G GV +TPL+  +    Y ++LE IS
Sbjct: 195 NLYGSKRFSQCLVPFHADPHTLGTISFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGIS 254

Query: 234 VGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHSNL---KSVMSNMIKAQPVKGVGAE 287
           VG+  + F SS   S GNI +D+G   T LP E++  L     V SNM+       +  +
Sbjct: 255 VGDTFVSFNSSEMLSKGNIMIDSGTPATYLPQEFYDRLVKELKVQSNMLP------IDDD 308

Query: 288 PGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG-GNANIVYG 346
           P     LCY   +  + P +  HF GADV+L P   F    D + C A  G  +   ++G
Sbjct: 309 PDLGTQLCYRSETNLEGPILIAHFEGADVQLMPIQTFIPPKDGVFCFAMAGTTDGEYIFG 368

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
              Q N LIG+D+++  VSFK + C+N
Sbjct: 369 NFAQSNVLIGFDLDRKTVSFKATDCSN 395


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 134/392 (34%), Positives = 206/392 (52%), Gaps = 34/392 (8%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++ ++L    +++  +SP +I    E I+    YLM   IGTPPV+ F   DTGSD  W
Sbjct: 63  ARSKRRLRLSQNDD--RSPGTITIPDEPITE---YLMRFYIGTPPVERFAIADTGSDLIW 117

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC--SEGDCSYSFLY 115
            QC PC +  C  Q  PLFDP+KSST+ ++ C S  C ++      C    G C Y ++Y
Sbjct: 118 VQCAPCEK--CVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIY 175

Query: 116 GRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPG 175
           G     +  SG L  E++ F S +   ++ P + FGC   N  +     +  G++GLG G
Sbjct: 176 GD---HTLVSGILGFESINFGSKNN-AIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVG 231

Query: 176 NSSLISQMGTSIAGKFSYCLP---DQGSSKINFGG---IVAGAGVVSTPLIIR----DHY 225
             SLISQ+G  I  KFSYC P      +SK+ FG    +    GVVSTPLII+     +Y
Sbjct: 232 PLSLISQLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYY 291

Query: 226 YLSLEAISVGNQRLEFVSSST-GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
           YL+LE +S+GN++++   S T GNI +D+G   T+L   +++   +++  +   + VK  
Sbjct: 292 YLNLEGVSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVK-- 349

Query: 285 GAEPGFSDVLCY-NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAF--RGGNA 341
              P      C+ N   + +FP+V   F GA V++  SNLF    + ++C          
Sbjct: 350 --IPPLVYNFCFENKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDED 407

Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           + ++G   QI + + YD++  MVSF P+ C  
Sbjct: 408 DSIFGNHAQIGYQVEYDLQGGMVSFAPADCAK 439


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 137/377 (36%), Positives = 202/377 (53%), Gaps = 38/377 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +IS    Y M +SIGTPP  +F   DTGSD TW QC+PC +  C+KQ  PLFD KKS
Sbjct: 75  QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPCQQ--CYKQNSPLFDKKKS 132

Query: 85  STYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           STY + SC S  C  ++ +   C E    C Y + YG     SF+ G++ATET++ +S+S
Sbjct: 133 STYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDN---SFTKGDVATETISIDSSS 189

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
           G  V  P  +FGCG+ N  +          +G GP   SL+SQ+G+SI  KFSYCL    
Sbjct: 190 GSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGP--LSLVSQLGSSIGKKFSYCLSHTA 247

Query: 198 ---QGSSKINFG------GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV--- 242
               G+S IN G           +  ++TPLI +D   +Y+L+LEA++VG  +L +    
Sbjct: 248 ATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGG 307

Query: 243 -------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
                  S  TGNI +D+G   TLL   ++ +  + +   +     K V    G      
Sbjct: 308 YGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTG--AKRVSDPQGLLTHCF 365

Query: 296 YNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
            +   +   P +T+HF  ADVKLSP N F  ++++ +C +        +YG ++Q++FL+
Sbjct: 366 KSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTEVAIYGNMVQMDFLV 425

Query: 356 GYDIEQAMVSFKPSRCT 372
           GYD+E   VSF+   C+
Sbjct: 426 GYDLETKTVSFQRMDCS 442


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 135/363 (37%), Positives = 190/363 (52%), Gaps = 31/363 (8%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           II  +  YLM + IGTP V+     DTGSD TW QC PC    CF Q  PL+DP  SST+
Sbjct: 89  IIPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTF 148

Query: 88  NSISCSSSQCAVVTSN---CSE-GDCSYSFLYGRGAYA----SFSSGNLATETLTFNSTS 139
             + C S  C  +  +   CS+ GDC Y++ YG  +Y+    S  S  L    L +NS  
Sbjct: 149 TLLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLLQLHYNS-- 206

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---P 196
                   + FGCG +N  +     K TGI+GLG G  SL+SQ+G  I  KFSYCL    
Sbjct: 207 -------KICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFS 259

Query: 197 DQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSSTGNIFV 251
              +SK+ FG   IV G GVVSTPLII+     YYL+LE I+VG + ++    + GNI +
Sbjct: 260 SNSNSKLKFGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVK-TGQTDGNIII 318

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
           D+G   T L   +++   S++   +  +  + +     F       +S+    P+V  HF
Sbjct: 319 DSGSTLTYLEESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTP---PDVVFHF 375

Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPS 369
            G DV L P N    I D ++CS     + +   ++G + QI+F +GYDI+   VSF P+
Sbjct: 376 TGGDVVLKPMNTLVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPT 435

Query: 370 RCT 372
            C+
Sbjct: 436 DCS 438


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 148/371 (39%), Positives = 209/371 (56%), Gaps = 41/371 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           ++ II     +LM + IGTPPV++    DTGSD TWTQC PC E  CF Q  P+F+P++S
Sbjct: 80  RSPIIPDSGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRE--CFNQSQPIFNPRRS 137

Query: 85  STYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           S+Y  +SC+S  C  + S     D   CSY + YG     SF+ G+LA++ +T  S    
Sbjct: 138 SSYRKVSCASDTCRSLESYHCGPDLQSCSYGYSYGD---RSFTYGDLASDQITIGS---- 190

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG---KFSYCLPDQ 198
             ++P  + GCGH+N    T     +GIIGLG G+ SL+SQM T IAG   +FSYCLP  
Sbjct: 191 -FKLPKTVIGCGHQN--GGTFGGVTSGIIGLGGGSLSLVSQMRT-IAGVKPRFSYCLPTF 246

Query: 199 GSSK-----INFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVS----- 243
            S+      I+FG   +V+G  VVSTPL+ R     Y+L+LEAISVG +R +  +     
Sbjct: 247 FSNANITGTISFGRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAM 306

Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--Q 301
           ++ GNI +D+G   TLLP   +  + S ++ +IKA+ V     +P     LCY+      
Sbjct: 307 TNHGNIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVD----DPSGILELCYSAGQVDD 362

Query: 302 PKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              P +T HF G ADVKL P N F  ++D + C  F       ++G + QINF +GYD+ 
Sbjct: 363 LNIPIITAHFAGGADVKLLPVNTFAPVADNVTCLTFAPATQVAIFGNLAQINFEVGYDLG 422

Query: 361 QAMVSFKPSRC 371
              +SF+P  C
Sbjct: 423 NKRLSFEPKLC 433


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 134/376 (35%), Positives = 200/376 (53%), Gaps = 72/376 (19%)

Query: 15  TPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ 74
           TP+ P+S        S +  YLM +SIGTPP D++G  DTGSD  WTQC PC  L C+KQ
Sbjct: 12  TPEPPVS--------SNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQ 61

Query: 75  EPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLT 134
           + P+FDP KS+++  +SC S QC ++ +                                
Sbjct: 62  KNPMFDPSKSTSFKEVSCESQQCRLLDT-------------------------------- 89

Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--KFS 192
                  P  + N++FGCGH N  S T +  + G+ G G    SL SQ+ +++    KFS
Sbjct: 90  -------PTSILNIVFGCGHNN--SGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFS 140

Query: 193 YCL-PDQG----SSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFV 242
            CL P +     +SKI FG    V+G+ VVSTPL+ +D   +Y+++L+ ISVG++   F 
Sbjct: 141 QCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFS 200

Query: 243 SSS----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
           SSS     GN+F+D G   TLLP ++++ L   +   I  +PV+    +P     LCY  
Sbjct: 201 SSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQ----DPDLQPQLCYRS 256

Query: 299 SSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
           ++    P +T HF GADV+L P N F +  + + C A +  + +  ++G  +Q+NFLIG+
Sbjct: 257 ATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGF 316

Query: 358 DIEQAMVSFKPSRCTN 373
           D++   VSFK   CT 
Sbjct: 317 DLDGKKVSFKAVDCTK 332


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 138/377 (36%), Positives = 194/377 (51%), Gaps = 40/377 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
            A I S D  YLM L IGTPP +I  ++DTGS+  W  C  C   DCF Q   +F+P  S
Sbjct: 88  HASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINCK--DCFNQSSSIFNPLAS 145

Query: 85  STYNSISCSSSQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           STY    C S QC   +S+C S+  C YS         +  +G +A +T+T  S+ G P 
Sbjct: 146 STYQDAPCDSYQCETTSSSCQSDNVCLYSC--DEKHQLNCPNGRIAVDTMTLTSSDGRPF 203

Query: 144 EMPNVIFGCG---HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
            +P   F CG   +K  A         G+IGLG G  SL S++     GKFSYCL D  S
Sbjct: 204 PLPYSDFVCGNSIYKTFAG-------VGVIGLGRGALSLTSKLYHLSDGKFSYCLADYYS 256

Query: 201 ---SKINFG--GIVAGAG--VVSTPLIIRDH---YYLSLEAISVGNQRLEF------VSS 244
              SKINFG    ++     VVST L    H   YY++LE ISVG +R +        + 
Sbjct: 257 KQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQDLYYVDDPFAP 316

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG-------VGAEPGFSDVLCYN 297
             GN+ +D+G + TLLP +++  L S +S  I   P             +       C+ 
Sbjct: 317 PVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLSPCFW 376

Query: 298 ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLI 355
              + KFP++TIHF  ADV+LS  N F  ++++++C AF       + VYG   Q+NF++
Sbjct: 377 YYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTVYGSWQQMNFIL 436

Query: 356 GYDIEQAMVSFKPSRCT 372
           GYD+++  VSFK + C+
Sbjct: 437 GYDLKRGTVSFKRTDCS 453


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 143/405 (35%), Positives = 197/405 (48%), Gaps = 50/405 (12%)

Query: 8   PFYNDNETP-----KSPISIIYQAEIISVDDI----------------YLMHLSIGTPPV 46
           PFY  + TP      + +  IYQ    S  D+                YLM   IGTPPV
Sbjct: 42  PFYKPSLTPSDRIINTALRSIYQLNRASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPV 101

Query: 47  DIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA---VVTSN 103
           +     DT SD  W QC PC    CF Q+ PLF+P KSST+ ++SC S  C    +    
Sbjct: 102 ERLAIADTASDLIWVQCSPCET--CFPQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCP 159

Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
                C Y+  YG G   S + G L TE++ F S +   V  P  IFGCG  N       
Sbjct: 160 LVGNLCLYTNTYGDG---SSTKGVLCTESIHFGSQT---VTFPKTIFGCGSNNDFMHQIS 213

Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-LPDQGSS--KINFGG--IVAGAGVVSTP 218
           +K TGI+GLG G  SL+SQ+G  I  KFSYC LP   +S  K+ FG    + G GVVSTP
Sbjct: 214 NKVTGIVGLGAGPLSLVSQLGDQIGHKFSYCLLPFTSTSTIKLKFGNDTTITGNGVVSTP 273

Query: 219 LIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVM 272
           LII  H    Y+L L  I++G + L+  ++  + GNI +D G + T L + ++ N  +++
Sbjct: 274 LIIDPHYPSYYFLHLVGITIGQKMLQVRTTDHTNGNIIIDLGTVLTYLEVNFYHNFVTLL 333

Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD-EI 331
              +     K     P F    C+   +   FP++   F GA V LSP NLF    D  +
Sbjct: 334 REALGISETKDDIPYP-FD--FCFPNQANITFPKIVFQFTGAKVFLSPKNLFFRFDDLNM 390

Query: 332 MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +C A           V+G + Q++F + YD +   VSF P+ C+ 
Sbjct: 391 ICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPADCSK 435


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 139/373 (37%), Positives = 193/373 (51%), Gaps = 34/373 (9%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+E+      YL+ +S+GTPP +I    D   D TW  C+ C   DC K     F P +S
Sbjct: 87  QSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTCQ--DCTKDGFTFF-PSES 143

Query: 85  STYNSISCSSSQCAVVT-SNCSEGDCSYSFLYG---RGAYASFSSGNLATETLTFNSTSG 140
           STY S +C S QC +   + C    C Y  L G   +   +  + G +A +T++F+S+SG
Sbjct: 144 STYTSAACESYQCQITNGAVCQTKMCIY--LCGPLPQQRSSCTNKGLVAMDTISFHSSSG 201

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PD 197
             +  PN  F CG              GI+GLG G  S+ SQM   I G FS CL     
Sbjct: 202 QALSYPNTNFICG---TFIDNWHYIGAGIVGLGRGLFSMTSQMKHLINGTFSQCLVPYSS 258

Query: 198 QGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRL--EFVSSSTGNIF 250
           + SSKINFG  G+V+G GVVSTP+        Y+L LEA+SVG  R+   F S+   NI+
Sbjct: 259 KQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSNIY 318

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVT 308
           +D     T LP +++ N+++ +   I   P+     E   S  LCY   S   F  P +T
Sbjct: 319 IDWRTTFTSLPHDFYENVEAEVRKAINLTPIN-YNNERKLS--LCYKSESDHDFDAPPIT 375

Query: 309 IHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQ 361
           +HF  ADV+LSP N F  +   ++C AF  G  N        VYG   Q+NF++GYD++ 
Sbjct: 376 MHFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAVYGSWQQMNFIVGYDLKS 435

Query: 362 AMVSFKPSRCTNY 374
           + VSFK + CT Y
Sbjct: 436 STVSFKQADCTLY 448


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 132/395 (33%), Positives = 213/395 (53%), Gaps = 56/395 (14%)

Query: 22  IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
           + +Q +++     Y+M+LSIGTPP  I    DTGSD TW Q +PC +  C+ Q+ P+FDP
Sbjct: 67  VDFQTDLLPSGGEYMMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQ--CYPQKGPIFDP 124

Query: 82  KKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNS 137
             S+T++ + C+++ C  +     +C++   C Y++ YG  +Y   ++G LA++T+T  +
Sbjct: 125 SNSTTFHKLPCTTAPCNALDESARSCTDPTTCGYTYSYGDHSY---TTGYLASDTVTVGN 181

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-- 195
            S   V++ NV FGCG +N  +   D + +GI+GLG GN S +SQ+G +I  KFSYCL  
Sbjct: 182 AS---VQIRNVAFGCGTRNGGN--FDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLP 236

Query: 196 ----------PDQGSSKINFG-------GIVAGAGVVSTPLIIRD---HYYLSLEAISVG 235
                         +S+I FG           G    +TPL+ ++   +YYL++EAI+VG
Sbjct: 237 LENEISSQPSDSPATSRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVG 296

Query: 236 NQRLEFV---------------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
            ++L +                S   GNI +D+G   T L  E++  L++ +   IK + 
Sbjct: 297 RKKLLYSSSSSKTASYDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMER 356

Query: 281 VKGVGAEPGFSDVLCYNISSQP-KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRG 338
           V  V     FS  LC+    +  + P + +HFRG ADV+L P N F    + ++C     
Sbjct: 357 VNDV-KNSMFS--LCFKSGKEEVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP 413

Query: 339 GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            N   +YG + Q+NF++GYD+ +  VSF P+ C+ 
Sbjct: 414 TNDVGIYGNLAQMNFVVGYDLGKRTVSFLPADCSK 448


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 143/383 (37%), Positives = 204/383 (53%), Gaps = 38/383 (9%)

Query: 9   FYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
           F ++N  P+S         +I  +  YLM L IGTPPV+     DTGSD  W QC PC  
Sbjct: 74  FLDENNLPES--------LLIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQCSPCQ- 124

Query: 69  LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-GDCSYSFLYGRGAYASFS 124
            +CF Q+ PLF+P KSST+ + +C S  C  V      C + G C YS+ YG     SF+
Sbjct: 125 -NCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGD---KSFT 180

Query: 125 SGNLATETLTFNST-SGLPVEMPNVIFGCG-HKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
            G + TETL+F ST     V  P+ IFGCG + N    TSD K TG++GLG G  SL+SQ
Sbjct: 181 VGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSD-KVTGLVGLGGGPLSLVSQ 239

Query: 183 MGTSIAGKFSYCL---PDQGSSKINFG--GIVAGAGVVSTPLIIR----DHYYLSLEAIS 233
           +G  I  KFSYCL       +SK+ FG   IV   GVVSTPLII+      Y+L+LEA++
Sbjct: 240 LGPQIGYKFSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVT 299

Query: 234 VGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
           +G Q++     + GNI +D+G + T L   +++N  + +  ++  +  + +     F   
Sbjct: 300 IG-QKVVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDL----PFPFK 354

Query: 294 LCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRIMQ 350
            C+        P +   F GA V L P NL   + D  ++C A    + +   ++G + Q
Sbjct: 355 FCFPYRDM-TIPVIAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQ 413

Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
            +F + YD+E   VSF P+ CT 
Sbjct: 414 FDFQVVYDLEGKKVSFAPTDCTK 436


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 130/386 (33%), Positives = 187/386 (48%), Gaps = 34/386 (8%)

Query: 3   NSQKLPFYNDNETPKSPISIIYQAEIISVDDI--YLMHLSIGTPPVDIFGSVDTGSDCTW 60
            S+++ F      P SPI       I  + D   YLM  S+GTP V+     DTGSD +W
Sbjct: 61  RSKRVNFIGQISPPLSPI-------ITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSW 113

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE----GDCSYSFLYG 116
            QC PC    C+ QE PLFDP +SSTY  + C S  C +   N  E      C Y   YG
Sbjct: 114 LQCTPCKT--CYPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYG 171

Query: 117 RGAYASFSSGNLATETLTFNSTSGLP---VEMPNVIFGCGHKNLASPTSDSKQTGIIGLG 173
                SF+ G L  +T++F+ST G+       P  +FGC   +  +    +K  G +GLG
Sbjct: 172 TD---SFTIGRLGYDTISFSST-GMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLG 227

Query: 174 PGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVAGAGVVSTPLIIR----DHYY 226
           PG  SL SQ+G  I  KFSYC+    S+   K+ FG +     VVSTP +I      +Y 
Sbjct: 228 PGPLSLASQLGDQIGHKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYV 287

Query: 227 LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA 286
           L+LE I+VG +++       GNI +D+  + T L    +++  S +   I  +    V  
Sbjct: 288 LNLEGITVGQKKV-LTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVE----VAE 342

Query: 287 EPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYG 346
           +       C    +   FPE   HF GADV L P N+F  + + ++C          ++G
Sbjct: 343 DAPTPFEYCVRNPTNLNFPEFVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGISIFG 402

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCT 372
              Q+NF + YD+ +  VSF P+ C+
Sbjct: 403 NWAQVNFQVEYDLGEKKVSFAPTNCS 428


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 125/355 (35%), Positives = 184/355 (51%), Gaps = 74/355 (20%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           Q+ +IS    YLM++S+GTPPV + G  DTGSD  W QC PC   DC+KQ  PLFDPKKS
Sbjct: 19  QSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPCD--DCYKQVEPLFDPKKS 76

Query: 85  STYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
            TY ++                                   G L++ET T  ST G P  
Sbjct: 77  KTYKTL-----------------------------------GYLSSETFTIGSTEGDPAS 101

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQG 199
            P + FGCGH N    T + K +G+IGLG G  SL+ Q+ + + G+FSYCL         
Sbjct: 102 FPGLAFGCGHSN--GGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLVPLSSDSTA 159

Query: 200 SSKINFG--GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
           SSKINFG   +V+G+G  S+P                        ++   NI +D+G   
Sbjct: 160 SSKINFGKSAVVSGSGT-SSP-----------------------AAAEESNIIIDSGTTL 195

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           TLLP +++++++S ++ +I  Q       +P  +  LCY+   + + P +T HF GADV+
Sbjct: 196 TLLPRDFYTDMESALTKVIGGQTT----TDPRGTFSLCYSGVKKLEIPTITAHFIGADVQ 251

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L P N F    ++++C +    +   ++G + Q+NFL+GYD++   VSFKP+ CT
Sbjct: 252 LPPLNTFVQAQEDLVCFSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCT 306


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 138/355 (38%), Positives = 188/355 (52%), Gaps = 38/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP-KKSSTYNSISCS 93
           YLM L++GTPPVD++G VDT SD  W QC PC    C+KQ+ P+FDP K+ +++   SCS
Sbjct: 31  YLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPLKECNSFFDHSCS 88

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
                       E  C Y + Y   A  S + G LA E  TF+ST G P+ + ++IFGCG
Sbjct: 89  -----------PEKACDYVYAY---ADDSATKGMLAKEIATFSSTDGKPI-VESIIFGCG 133

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCL-----PDQGSSKINFGG 207
           H N      +    G+IGLG G  SL+SQMG     K FS CL         S  I+ G 
Sbjct: 134 HNNTG--VFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191

Query: 208 I--VAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTL 259
              V+G GVV+TPL+  +    Y ++LE ISVG+  + F SS   S GNI +D+G   T 
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLS 319
           LP E++  L   +   I   P+     +P     LCY   +  + P +T HF GADVKL 
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIH---VDPDLGTQLCYKSETNLEGPILTAHFEGADVKLL 308

Query: 320 PSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           P   F    D + C A  G    + ++G   Q N LIG+D+++ +V FKP+  T 
Sbjct: 309 PLQTFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 193/371 (52%), Gaps = 53/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM LSIG P V     VDTGSD  WTQC+PC E  CF Q  P+FDP+KSS+Y+ + CSS
Sbjct: 107 FLMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSS 164

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIF 150
             C A+  SNC+E   +  +LY  G Y+S + G LATET TF   NS SG+        F
Sbjct: 165 GLCNALPRSNCNEDKDACEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------F 216

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
           GCG +N       S+ +G++GLG G  SLISQ+  +   KFSYCL      + SS +  G
Sbjct: 217 GCGVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIG 271

Query: 207 GIVAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSS 244
            + +G            V  T  ++R+      YYL L+ I+VG +RL       E    
Sbjct: 272 SLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAED 331

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
            TG + +D+G   T L       LK   ++ + + PV   G+  G    LC+ +    K 
Sbjct: 332 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPDAAKN 387

Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              P++  HF+GAD++L   N +  + S  ++C A    N   ++G + Q NF + +D+E
Sbjct: 388 IAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLE 447

Query: 361 QAMVSFKPSRC 371
           +  VSF P+ C
Sbjct: 448 KETVSFVPTEC 458


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 124/392 (31%), Positives = 196/392 (50%), Gaps = 35/392 (8%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           ++ +++ FY    +P +  S  +Q+ + + +  YLM L++G+PP      VDTGSD  W 
Sbjct: 6   RSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSDLNWV 65

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC---AVVTSNCSEGDCSYSFLYGRG 118
           QC PC    C++Q  P FDP KS ++   +C+ + C   A+    C+   C Y + YG  
Sbjct: 66  QCLPCRV--CYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYTYGD- 122

Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
              S ++G+LA ET++ N+ +G    +PN  FGCG +NL    + +   G++GLG G  S
Sbjct: 123 --QSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLG---TFAGAAGLVGLGQGPLS 176

Query: 179 LISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLIIR----DHYYLSLEA 231
           L SQ+  + A KFSYCL    S   S + FG I A A +  T +++      +YY+ L +
Sbjct: 177 LNSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNS 236

Query: 232 ISVGNQRLEFV--------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           I VG Q L           S+  G   +D+G   T+L L  +S +     + +    + G
Sbjct: 237 IEVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDG 296

Query: 284 VGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGG 339
                 +   LC+NI+  S P  P++   F+GAD ++   NLF   + S   +C A  G 
Sbjct: 297 ----SAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGGS 352

Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               + G I Q N L+ YD+E   + F  + C
Sbjct: 353 QGFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 133/392 (33%), Positives = 197/392 (50%), Gaps = 47/392 (11%)

Query: 13  NETPKSP-ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCPELD 70
           +E P  P +S  Y++ + S    Y+  +S+GTP   +F  + DTGSD  W QC+PC    
Sbjct: 17  SEVPYPPSVSTDYESPVASGGGDYVTTISLGTP-AKVFSVIADTGSDLIWIQCKPCQA-- 73

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
           CF Q+ P+FDP+ SS+Y ++SC  + C  +       DC YS+ YG G   S + G L++
Sbjct: 74  CFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPDCDYSYGYGDG---SGTRGTLSS 130

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
           ET+T  ST G  +   N+ FGCGH N  S    S   G++GLG GN S +SQ+G     K
Sbjct: 131 ETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGDLFGHK 187

Query: 191 FSYCL------PDQGSSKINFGGIVAGAG------VVSTPLI----IRDHYYLSLEAISV 234
           FSYCL      P + +S + FG   +            TP+I    +   YY+ L+ IS+
Sbjct: 188 FSYCLVPWRDAPSK-TSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI 246

Query: 235 GNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
             + L   + S       +G +  D+G   TLLP   +  +   + + I    + G  A 
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSA- 305

Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGGN 340
            G    LCY++S      + K P +  HF GAD +L   N F   N +  I+C A    N
Sbjct: 306 -GLD--LCYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSN 362

Query: 341 ANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +I +YG +MQ NF + YDI  + + + PS+C
Sbjct: 363 MDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  185 bits (470), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 135/371 (36%), Positives = 194/371 (52%), Gaps = 53/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM LSIG P V     VDTGSD  WTQC+PC E  CF Q  P+FDP+KSS+Y+ + CSS
Sbjct: 108 FLMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSS 165

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIF 150
             C A+  SNC+E   S  +LY  G Y+S + G LATET TF   NS SG+        F
Sbjct: 166 GLCNALPRSNCNEDKDSCEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------F 217

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
           GCG +N       S+ +G++GLG G  SLISQ+  +   KFSYCL      + SS +  G
Sbjct: 218 GCGVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIG 272

Query: 207 GIVAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSS 244
            + +G            V  T  ++R+      YYL L+ I+VG +RL       E    
Sbjct: 273 SLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSED 332

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
            TG + +D+G   T L       LK   ++ + + PV   G+  G    LC+ + +  K 
Sbjct: 333 GTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPNAAKN 388

Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              P++  HF+GAD++L   N +  + S  ++C A    N   ++G + Q NF + +D+E
Sbjct: 389 IAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLE 448

Query: 361 QAMVSFKPSRC 371
           +  V+F P+ C
Sbjct: 449 KETVTFVPTEC 459


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  184 bits (468), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 130/375 (34%), Positives = 200/375 (53%), Gaps = 45/375 (12%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A +++ +  +LM L+IG+PP      +DTGSD  WTQC+PC +  CF Q  P+FDPK+S
Sbjct: 101 KAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQ--CFDQSTPIFDPKQS 158

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S++  ISCSS  C A+ TS CS   C Y + YG    +S + G LA ET TF  ++   +
Sbjct: 159 SSFYKISCSSELCGALPTSTCSSDGCEYLYTYGD---SSSTQGVLAFETFTFGDSTEDQI 215

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
            +P + FGCG+ N  +    S+  G++GLG G  SL+SQ+      KF+YCL    D   
Sbjct: 216 SIPGLGFGCGNDN--NGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 270

Query: 201 SKINFGGIV------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVS 243
           S +  G +       +   + +TPLI        YYLSL+ ISVG  +L       E   
Sbjct: 271 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 330

Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI-- 298
             +G + +D+G   T +     ++LK    N   AQ   PV   G   G  D LC+N+  
Sbjct: 331 DGSGGVIIDSGTTITYVENSAFTSLK----NEFIAQMNLPVDDSGT--GGLD-LCFNLPA 383

Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
            ++Q + P++T HF+GAD++L   N     S   ++C A        ++G + Q NF++ 
Sbjct: 384 GTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVV 443

Query: 357 YDIEQAMVSFKPSRC 371
           +D+++  +SF P++C
Sbjct: 444 HDLQEETLSFLPTQC 458


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  184 bits (468), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 130/375 (34%), Positives = 200/375 (53%), Gaps = 45/375 (12%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A +++ +  +LM L+IG+PP      +DTGSD  WTQC+PC +  CF Q  P+FDPK+S
Sbjct: 356 KAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQ--CFDQSTPIFDPKQS 413

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S++  ISCSS  C A+ TS CS   C Y + YG    +S + G LA ET TF  ++   +
Sbjct: 414 SSFYKISCSSELCGALPTSTCSSDGCEYLYTYGD---SSSTQGVLAFETFTFGDSTEDQI 470

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
            +P + FGCG+ N  +    S+  G++GLG G  SL+SQ+      KF+YCL    D   
Sbjct: 471 SIPGLGFGCGNDN--NGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 525

Query: 201 SKINFGGIV------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVS 243
           S +  G +       +   + +TPLI        YYLSL+ ISVG  +L       E   
Sbjct: 526 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 585

Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI-- 298
             +G + +D+G   T +     ++LK    N   AQ   PV   G   G  D LC+N+  
Sbjct: 586 DGSGGVIIDSGTTITYVENSAFTSLK----NEFIAQMNLPVDDSGT--GGLD-LCFNLPA 638

Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
            ++Q + P++T HF+GAD++L   N     S   ++C A        ++G + Q NF++ 
Sbjct: 639 GTNQVEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVV 698

Query: 357 YDIEQAMVSFKPSRC 371
           +D+++  +SF P++C
Sbjct: 699 HDLQEETLSFLPTQC 713


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 134/369 (36%), Positives = 191/369 (51%), Gaps = 53/369 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M LSIG P V     VDTGSD  WTQC+PC E  CF Q  P+FDP+KSS+Y+ + CSS  
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTE--CFDQPTPIFDPEKSSSYSKVGCSSGL 58

Query: 97  C-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGC 152
           C A+  SNC+E   +  +LY  G Y+S + G LATET TF   NS SG+        FGC
Sbjct: 59  CNALPRSNCNEDKDACEYLYTYGDYSS-TRGLLATETFTFEDENSISGIG-------FGC 110

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
           G +N       S+ +G++GLG G  SLISQ+  +   KFSYCL      + SS +  G +
Sbjct: 111 GVENEGDGF--SQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSL 165

Query: 209 VAG----------AGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSSST 246
            +G            V  T  ++R+      YYL L+ I+VG +RL       E     T
Sbjct: 166 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 225

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
           G + +D+G   T L       LK   ++ + + PV   G+  G    LC+ +    K   
Sbjct: 226 GGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPVDDSGS-TGLD--LCFKLPDAAKNIA 281

Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
            P++  HF+GAD++L   N +  + S  ++C A    N   ++G + Q NF + +D+E+ 
Sbjct: 282 VPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKE 341

Query: 363 MVSFKPSRC 371
            VSF P+ C
Sbjct: 342 TVSFVPTEC 350


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  183 bits (464), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 131/392 (33%), Positives = 197/392 (50%), Gaps = 47/392 (11%)

Query: 13  NETPKSP-ISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCPELD 70
           +E P  P +S  Y++ + S    Y+  +S+GTP   +F  + DTGSD  W QC+PC    
Sbjct: 17  SEVPYPPSVSTDYESPVASGGGDYVTTISLGTP-AKVFSVIADTGSDLIWIQCKPCQA-- 73

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLAT 130
           CF Q+ P+FDP+ SS+Y ++SC  + C  +       +C YS+ YG G   S + G L++
Sbjct: 74  CFNQKDPIFDPEGSSSYTTMSCGDTLCDSLPRKSCSPNCDYSYGYGDG---SGTRGTLSS 130

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK 190
           ET+T  ST G  +   N+ FGCGH N  S    S   G++GLG GN S +SQ+G     K
Sbjct: 131 ETVTLTSTQGEKLAAKNIAFGCGHLNRGSFNDAS---GLVGLGRGNLSFVSQLGDLFGHK 187

Query: 191 FSYCL------PDQGSSKINFGGIVAGAG------VVSTPLI----IRDHYYLSLEAISV 234
           FSYCL      P + +S + FG   +            TP+I    +   YY+ L+ IS+
Sbjct: 188 FSYCLVPWRDAPSK-TSPMFFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISI 246

Query: 235 GNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
             + L   + S       +G +  D+G   TLLP   +  +   + + +    + G  A 
Sbjct: 247 AGRALRIPAGSFDIKPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSA- 305

Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAFRGGN 340
            G    LCY++S      + K P +  HF GAD +L   N F   N +  I+C A    N
Sbjct: 306 -GLD--LCYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSN 362

Query: 341 ANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +I +YG +MQ NF + YDI  + + + PS+C
Sbjct: 363 MDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 134/394 (34%), Positives = 196/394 (49%), Gaps = 50/394 (12%)

Query: 5   QKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE 64
           + LP   DNE+ + P S  Y          +L+ + +GTPP      +DTGSD TW Q E
Sbjct: 3   ETLPGQTDNESYEFPESAGYGE--------FLVPIYLGTPPQKAVVIIDTGSDLTWIQSE 54

Query: 65  PCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--TSNCS-EGDCSYSFLYGRGAYA 121
           PC    CF+Q  P+FDP KSSTYN I+CSSS CA +  T  CS   +C Y++ YG G   
Sbjct: 55  PCRA--CFEQADPIFDPSKSSTYNKIACSSSACADLLGTQTCSAAANCIYAYGYGDG--- 109

Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
           S + G  + ET+T   T+G  V+    ++  G         D+   GI+GLG G  S+ S
Sbjct: 110 SVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG------DTGGEGILGLGQGPVSMPS 163

Query: 182 QMGTSIAGKFSYCLPDQGS-----SKINFGGIVAGAGVVS-TPLIIR-DH---YYLSLEA 231
           Q+G+ +  KFSYCL D  S     S + FG     +G V  TP++   DH   YY++++ 
Sbjct: 164 QLGSVLGNKFSYCLVDWLSAGSETSTMYFGDAAVPSGEVQYTPIVPNADHPTYYYIAVQG 223

Query: 232 ISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK-AQPVKG 283
           ISVG   L       E  S  +G   +D+G   T L  E  + L +  ++ ++       
Sbjct: 224 ISVGGSLLDIDQSVYEIDSGGSGGTIIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSA 283

Query: 284 VGAEPGFSDVLCYNI--SSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGG-- 339
            G +      LC+N   +  P FP +TIH  G  ++L  +N F ++   I+C AF     
Sbjct: 284 TGLD------LCFNTRGTGSPVFPAMTIHLDGVHLELPTANTFISLETNIICLAFASALD 337

Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
               ++G I Q NF I YD++   + F P+ C +
Sbjct: 338 FPIAIFGNIQQQNFDIVYDLDNMRIGFAPADCAS 371


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  181 bits (459), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 130/369 (35%), Positives = 188/369 (50%), Gaps = 49/369 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM +SIGTP +     VDTGSD  WTQC+PC E  CF Q  P+FDP  SSTY+++ CSS
Sbjct: 118 FLMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVE--CFNQSTPVFDPSSSSTYSTLPCSS 175

Query: 95  SQCA-VVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           S C+ + TS C+    DC Y++ YG    AS + G LA ET T   T     ++P V FG
Sbjct: 176 SLCSDLPTSTCTSAAKDCGYTYTYGD---ASSTQGVLAAETFTLAKT-----KLPGVAFG 227

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
           CG  N       ++  G++GLG G  SL+SQ+G    GKFSYCL    D   S +  G +
Sbjct: 228 CGDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDTSKSPLLLGSL 282

Query: 209 VA-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIF 250
            A        A + +TPLI        YY++L+A++VG+ R+    S+       TG + 
Sbjct: 283 AAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTGGVI 342

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPE 306
           VD+G   T L L+ +  LK   +  +K  PV   G+  G    LC+   +      + P+
Sbjct: 343 VDSGTSITYLELQGYRPLKKAFAAQMKL-PVAD-GSAVGLD--LCFKAPASGVDDVEVPK 398

Query: 307 VTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           + +HF  GAD+ L   N +  + +   +C    G     + G   Q N    YD+++  +
Sbjct: 399 LVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGLSIIGNFQQQNIQFVYDVDKDTL 458

Query: 365 SFKPSRCTN 373
           SF P +C  
Sbjct: 459 SFAPVQCAK 467


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 196/369 (53%), Gaps = 38/369 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A I + +  YLM L+IGTPPV     +DTGSD  WTQC+PC +  C+KQ  P+FDPKKS
Sbjct: 98  EAPIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQ--CYKQPTPIFDPKKS 155

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+++ +SC SS C AV +S CS+G C Y + YG     S + G LATET TF  +    V
Sbjct: 156 SSFSKVSCGSSLCSAVPSSTCSDG-CEYVYSYGD---YSMTQGVLATETFTFGKSKN-KV 210

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            + N+ FGCG  N        + +G++GLG G  SL+SQ+      +FSYCL     +K 
Sbjct: 211 SVHNIGFGCGEDNEGD--GFEQASGLVGLGRGPLSLVSQLKEP---RFSYCLTPMDDTKE 265

Query: 204 ------NFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------T 246
                 + G +     VV+TPL+        YYLSLE ISVG+ RL    S+        
Sbjct: 266 SILLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGN 325

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
           G + +D+G   T +  +    LK    +  K  P+    +  G    LC+++   S+Q +
Sbjct: 326 GGVIIDSGTTITYIEQKAFEALKKEFISQTKL-PLDKTSST-GLD--LCFSLPSGSTQVE 381

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
            P++  HF+G D++L   N     S+  + C A    +   ++G + Q N L+ +D+E+ 
Sbjct: 382 IPKIVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKE 441

Query: 363 MVSFKPSRC 371
            +SF P+ C
Sbjct: 442 TISFVPTSC 450


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 133/371 (35%), Positives = 185/371 (49%), Gaps = 42/371 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A + + +  +LM ++IGTP +     +DTGSD TWTQC+PC   DC+ Q  P++DP +S
Sbjct: 105 EAPVYAGNGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQS 162

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           STY+ + CSSS C A+   +CS  +C Y + YG     S + G L+ E+ T  S S    
Sbjct: 163 STYSKVPCSSSMCQALPMYSCSGANCEYLYSYGD---QSSTQGILSYESFTLTSQS---- 215

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PD 197
            +P++ FGCG +N     S        G GP   SLISQ+G S+  KFSYCL      P 
Sbjct: 216 -LPHIAFGCGQENEGGGFSQGGGLVGFGRGP--LSLISQLGQSLGNKFSYCLVSITDSPS 272

Query: 198 QGSSK-INFGGIVAGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS------- 245
           + S   I     +    V STPL+        YYLSLE ISVG Q L+    +       
Sbjct: 273 KTSPLFIGKTASLNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDG 332

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNI---SS 300
           TG + +D+G   T L    +  +K  + + I    V G  +G +      LC+     SS
Sbjct: 333 TGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIGLD------LCFEPQSGSS 386

Query: 301 QPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              FP +T HF GAD  L   N     S  I C A    N   ++G I Q N+ I YD E
Sbjct: 387 TSHFPTITFHFEGADFNLPKENYIYTDSSGIACLAMLPSNGMSIFGNIQQQNYQILYDNE 446

Query: 361 QAMVSFKPSRC 371
           + ++SF P+ C
Sbjct: 447 RNVLSFAPTVC 457


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 186/360 (51%), Gaps = 46/360 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM L+IGTP       +DTGSD  WTQC+PC   DCF Q  P+FDPKKSS+++ + CSS
Sbjct: 97  FLMKLAIGTPAETYSAIMDTGSDLIWTQCKPCK--DCFDQPTPIFDPKKSSSFSKLPCSS 154

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             CA +  S+CS+G C Y  LY  G Y+S + G LATET  F   S     +  + FGCG
Sbjct: 155 DLCAALPISSCSDG-CEY--LYSYGDYSS-TQGVLATETFAFGDAS-----VSKIGFGCG 205

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
             N  S    S+  G++GLG G  SLISQ+G     KFSYCL      +G S +  G   
Sbjct: 206 EDNDGSGF--SQGAGLVGLGRGPLSLISQLGEP---KFSYCLTSMDDSKGISSLLVGSEA 260

Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
                ++TPLI        YYLSLE ISVG+  L    S+       +G + +D+G   T
Sbjct: 261 TMKNAITTPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTIT 320

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI---SSQPKFPEVTIHFRGA 314
            L     + LK    + +K         E G + + LC+ +   +S    P++  HF GA
Sbjct: 321 YLEDSAFAALKKEFISQLKLDV-----DESGSTGLDLCFTLPPDASTVDVPQLVFHFEGA 375

Query: 315 DVKLSPSNLFRNISDE---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           D+KL   N    I+D    ++C      +   ++G   Q N ++ +D+E+  +SF P++C
Sbjct: 376 DLKLPAENYI--IADSGLGVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/369 (34%), Positives = 191/369 (51%), Gaps = 41/369 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +  + + D  YLM+LSIGTP       +DTGSD  WTQC+PC +  CF Q  P+F+P+ S
Sbjct: 85  ETSVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGS 142

Query: 85  STYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+++++ CSS  C  ++S  CS   C Y++ YG G   S + G++ TETLTF S     V
Sbjct: 143 SSFSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDG---SETQGSMGTETLTFGS-----V 194

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--- 200
            +PN+ FGCG  N           G++G+G G  SL SQ+  +   KFSYC+   GS   
Sbjct: 195 SIPNITFGCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTP 249

Query: 201 SKINFGGIV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------T 246
           S +  G +     AG  +T LI    I   YY++L  +SVG+ RL    S+        T
Sbjct: 250 SNLLLGSLANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGT 309

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP---K 303
           G I +D+G   T      + +++    + I    V   G+  GF   LC+   S P   +
Sbjct: 310 GGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVN--GSSSGFD--LCFQTPSDPSNLQ 365

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
            P   +HF G D++L   N F + S+ ++C A    +  + ++G I Q N L+ YD   +
Sbjct: 366 IPTFVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNS 425

Query: 363 MVSFKPSRC 371
           +VSF  ++C
Sbjct: 426 VVSFASAQC 434


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 131/369 (35%), Positives = 196/369 (53%), Gaps = 38/369 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A I + +  YL+ L+IGTPPV     +DTGSD  WTQC+PC    C+KQ  P+FDPKKS
Sbjct: 98  EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTR--CYKQPTPIFDPKKS 155

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+++ +SC SS C A+ +S CS+G C Y + YG     S + G LATET TF  +    V
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSDG-CEYVYSYGD---YSMTQGVLATETFTFGKSKN-KV 210

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            + N+ FGCG  N        + +G++GLG G  SL+SQ+      +FSYCL     +K 
Sbjct: 211 SVHNIGFGCGEDNEGD--GFEQASGLVGLGRGPLSLVSQLKEQ---RFSYCLTPIDDTKE 265

Query: 204 ------NFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------T 246
                 + G +     VV+TPL+        YYLSLEAISVG+ RL    S+        
Sbjct: 266 SVLLLGSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGN 325

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
           G + +D+G   T +  + +  LK    +  K    K   +  G    LC+++   S+Q +
Sbjct: 326 GGVIIDSGTTITYVQQKAYEALKKEFISQTKLALDK--TSSTGLD--LCFSLPSGSTQVE 381

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
            P++  HF+G D++L   N     S+  + C A    +   ++G + Q N L+ +D+E+ 
Sbjct: 382 IPKLVFHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKE 441

Query: 363 MVSFKPSRC 371
            +SF P+ C
Sbjct: 442 TISFVPTSC 450


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  178 bits (451), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM +SIGTP +     VDTGSD  WTQC+PC  +DCFKQ  P+FDP  SSTY ++ CSS
Sbjct: 105 FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 162

Query: 95  SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C+ + TS C S   C Y++ YG    +S + G LATET T   +     ++P V+FGC
Sbjct: 163 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 214

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
           G  N       S+  G++GLG G  SL+SQ+G     KFSYCL    D  +S +  G + 
Sbjct: 215 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 269

Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
                    + V +TPLI        YY+SL+AI+VG+ R+   SS+       TG + V
Sbjct: 270 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 329

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
           D+G   T L ++ +  LK   +  +   A    GVG +      LC+   +    Q + P
Sbjct: 330 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 383

Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
            +  HF  GAD+ L   N +  +     +C    G     + G   Q NF   YD+    
Sbjct: 384 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 443

Query: 364 VSFKPSRC 371
           +SF P +C
Sbjct: 444 LSFAPVQC 451


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM +SIGTP +     VDTGSD  WTQC+PC  +DCFKQ  P+FDP  SSTY ++ CSS
Sbjct: 95  FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 152

Query: 95  SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C+ + TS C S   C Y++ YG    +S + G LATET T   +     ++P V+FGC
Sbjct: 153 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 204

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
           G  N       S+  G++GLG G  SL+SQ+G     KFSYCL    D  +S +  G + 
Sbjct: 205 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 259

Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
                    + V +TPLI        YY+SL+AI+VG+ R+   SS+       TG + V
Sbjct: 260 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 319

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
           D+G   T L ++ +  LK   +  +   A    GVG +      LC+   +    Q + P
Sbjct: 320 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 373

Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
            +  HF  GAD+ L   N +  +     +C    G     + G   Q NF   YD+    
Sbjct: 374 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 433

Query: 364 VSFKPSRC 371
           +SF P +C
Sbjct: 434 LSFAPVQC 441


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  177 bits (449), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 131/368 (35%), Positives = 186/368 (50%), Gaps = 52/368 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM +SIGTP +     VDTGSD  WTQC+PC  +DCFKQ  P+FDP  SSTY ++ CSS
Sbjct: 74  FLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 131

Query: 95  SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C+ + TS C S   C Y++ YG    +S + G LATET T   +     ++P V+FGC
Sbjct: 132 ASCSDLPTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGC 183

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
           G  N       S+  G++GLG G  SL+SQ+G     KFSYCL    D  +S +  G + 
Sbjct: 184 GDTNEGD--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLA 238

Query: 210 A-------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
                    + V +TPLI        YY+SL+AI+VG+ R+   SS+       TG + V
Sbjct: 239 GISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIV 298

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
           D+G   T L ++ +  LK   +  +   A    GVG +      LC+   +    Q + P
Sbjct: 299 DSGTSITYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVP 352

Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
            +  HF  GAD+ L   N +  +     +C    G     + G   Q NF   YD+    
Sbjct: 353 RLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDT 412

Query: 364 VSFKPSRC 371
           +SF P +C
Sbjct: 413 LSFAPVQC 420


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 131/363 (36%), Positives = 189/363 (52%), Gaps = 50/363 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ ++IGTP + +   +DTGSD  WT+C PC   DC      ++DP  SSTY+ + C S
Sbjct: 42  YLIQMAIGTPALSLSAIMDTGSDLVWTKCNPC--TDC--STSSIYDPSSSSTYSKVLCQS 97

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           S C   ++ + N ++GDC Y + YG     S +SG L+ ET + +S S     +PN+ FG
Sbjct: 98  SLCQPPSIFSCN-NDGDCEYVYPYGD---RSSTSGILSDETFSISSQS-----LPNITFG 148

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------INF 205
           CGH N        K  G++G G G+ SL+SQ+G S+  KFSYCL  +  S       I  
Sbjct: 149 CGHDNQGF----DKVGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGN 204

Query: 206 GGIVAGAGVVSTPLIIR---DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGV 255
              +    V STPL+     +HYYLSLE ISVG Q L       +  S  +G + +D+G 
Sbjct: 205 TASLEATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGT 264

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
             T L    +  +K  M + I      G          LC+N   SS P FP +T HF+G
Sbjct: 265 TLTFLQQTAYDAVKEAMVSSINLPQADG-------QLDLCFNQQGSSNPGFPSMTFHFKG 317

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKP 368
           AD  +   N LF + + +I+C A    N+N+    ++G + Q N+ I YD E  ++SF P
Sbjct: 318 ADYDVPKENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAP 377

Query: 369 SRC 371
           + C
Sbjct: 378 TAC 380


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 116/367 (31%), Positives = 179/367 (48%), Gaps = 44/367 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ +S+G+PP + +  VD+GSD  W QC+PC  L+C+ Q  PLFDP  S+T++ +SC S
Sbjct: 171 YLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDPATSATFSGVSCGS 228

Query: 95  SQCAVV-TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + C ++ TS C +G+   C Y   Y  G+Y   + G LA ETLT   T+     +  V+ 
Sbjct: 229 AICRILPTSACGDGELGGCEYEVSYADGSY---TKGALALETLTLGGTA-----VEGVVI 280

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKINF 205
           GCGH+N           G++GLG G  SL+ Q+G  + G FSYCL  +     G++  + 
Sbjct: 281 GCGHRNRGLFVG---AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYGSGAADDDA 337

Query: 206 GGIVAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGN 248
           G +V G       G V  PL+        YY+ L  I VG++RL       +      G+
Sbjct: 338 GWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPE 306
           + +DTG   T LP E ++ L+      +     +  G      D  CY++S  +  + P 
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDT-CYDLSGYASVRVPT 456

Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
           V+  F G A + L+  N+   +   I C AF   ++ + + G   Q    I  D     +
Sbjct: 457 VSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYI 516

Query: 365 SFKPSRC 371
            F P+ C
Sbjct: 517 GFGPANC 523


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 126/373 (33%), Positives = 189/373 (50%), Gaps = 51/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM LSIGTPP+      DTGSD  WTQC PC    CF Q  PL++P  S+T+  + C+S
Sbjct: 92  YLMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNS 151

Query: 95  --SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
             S CA V +       C+   C Y+  YG G    +++G   +ET TF S +     +P
Sbjct: 152 SLSMCAGVLAGKAPPPGCA---CMYNQTYGTG----WTAGVQGSETFTFGSAAADQARVP 204

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSK 202
            + FGC +   AS +  +   G++GLG G+ SL+SQ+G   AG+FSYCL P Q    +S 
Sbjct: 205 GIAFGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTST 258

Query: 203 INFGGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
           +  G   A  G GV STP +       +  +YYL+L  IS+G + L          +  T
Sbjct: 259 LLLGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSLKADGT 318

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
           G + +D+G   T L    +  +++ + +++    + G  +  G    LCY +    S+ P
Sbjct: 319 GGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAIDGSDST-GLD--LCYALPTPTSAPP 375

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIE 360
             P +T+HF GAD+ L P++ +      + C A R     A   +G   Q N  I YD+ 
Sbjct: 376 AMPSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVR 434

Query: 361 QAMVSFKPSRCTN 373
             M+SF P++C+ 
Sbjct: 435 NEMLSFAPAKCST 447


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 128/375 (34%), Positives = 197/375 (52%), Gaps = 39/375 (10%)

Query: 23  IYQAEIISVDD--IYLMHLSIGTPPVDIFGSVDTGSDCTWTQC---EPCPELDCFKQEPP 77
           I  AE  S+ D   +LM +SIG PP ++  +V TGSD  W  C   +PC   +C   +  
Sbjct: 84  ITAAEFPSILDNGDFLMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTH-NC---DLR 139

Query: 78  LFDPKKSSTYNSISCSSSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN 136
            FDP +SSTY ++ C S +C +   + C   DC YS         S   G+LA +TLT N
Sbjct: 140 FFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSC--DPRHQDSCPDGDLAMDTLTLN 197

Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL- 195
           ST+G    +PN  F CG++       D    GI+GLG G+ SL++++   I GKFS+C+ 
Sbjct: 198 STTGKSFMLPNTGFICGNR----IGGDYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIV 253

Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL-------EF 241
                 +SK++FG   +V+G+ + ST L +      Y LS   ISVGN+ +       ++
Sbjct: 254 PYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDY 313

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
             +  G   +D+G + T  P  ++S L+  +   I+ +P+     +P     LCY  S  
Sbjct: 314 YMNGLG---MDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLY---PDPTRRLRLCYRYSPD 367

Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDI 359
              P +T+HF G  V+LS SN F  ++++I+C AF   ++  + V+G   Q N LIGYD+
Sbjct: 368 FSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDL 427

Query: 360 EQAMVSFKPSRCTNY 374
           +   +SF  + CT Y
Sbjct: 428 DAGFLSFLKTDCTKY 442


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 136/367 (37%), Positives = 201/367 (54%), Gaps = 43/367 (11%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           ++S +  +LM+L+IGTPP      +DTGSD  WTQC+PC +  CF Q  P+FDPKKSS++
Sbjct: 93  VLSGNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQ--CFDQPSPIFDPKKSSSF 150

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           + +SCSS  C A+  S+CS+  C Y  LY  G Y+S + G +ATET TF       V +P
Sbjct: 151 SKLSCSSQLCKALPQSSCSD-SCEY--LYTYGDYSS-TQGTMATETFTFGK-----VSIP 201

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKI 203
           NV FGCG  N       ++ +G++GLG G  SL+SQ+  +   KFSYCL    D  +S +
Sbjct: 202 NVGFGCGEDNEGD--GFTQGSGLVGLGRGPLSLVSQLKEA---KFSYCLTSIDDTKTSTL 256

Query: 204 NFGGIVA----GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGN 248
             G + +     A + +TPLI        YYLSLE ISVG  RL    S+       TG 
Sbjct: 257 LMGSLASVNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGG 316

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFP 305
           + +D+G   T L       +K   ++ +   PV   GA  G    LCYN+   +S+ + P
Sbjct: 317 LIIDSGTTITYLEESAFDLVKKEFTSQM-GLPVDNSGAT-GLE--LCYNLPSDTSELEVP 372

Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           ++ +HF GAD++L   N +  + S  ++C A        ++G + Q N  + +D+E+  +
Sbjct: 373 KLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETL 432

Query: 365 SFKPSRC 371
           SF P+ C
Sbjct: 433 SFLPTNC 439


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  174 bits (440), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 126/382 (32%), Positives = 182/382 (47%), Gaps = 56/382 (14%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           V + YL+HL++GTPP  +  ++DTGSD  WTQC PC   DCF Q  PL DP  SSTY ++
Sbjct: 88  VTNEYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAAL 145

Query: 91  SCSSSQC-AVVTSNCSEG----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            C + +C A+  ++C  G           C+Y + YG     S + G +AT+  TF   +
Sbjct: 146 PCGAPRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGD---KSVTVGEIATDRFTFGGDN 202

Query: 140 G-----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
           G     LP     + FGCGH N       S +TGI G G G  SL SQ+  +    FSYC
Sbjct: 203 GDGDSRLPTR--RLTFGCGHFNKG--VFQSNETGIAGFGRGRWSLPSQLNVT---TFSYC 255

Query: 195 LP---DQGSSKINFGGIVAGA-----------GVVSTPLIIRDH----YYLSLEAISVGN 236
                +  SS +  GG  A A            V +TPL+        Y+LSL+ ISVG 
Sbjct: 256 FTSMFESKSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGK 315

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
            RL    +   +  +D+G   T LP   +  +K+  +  +   P    G   G +  LC+
Sbjct: 316 TRLAVPEAKLRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPP---TGVVEGSALDLCF 372

Query: 297 NIS-----SQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN-IVYGRIM 349
            +       +P  P +T+H  GAD +L   N +F +++  +MC        +  V G   
Sbjct: 373 ALPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQ 432

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q N  + YD+E   +SF P+RC
Sbjct: 433 QQNTHVVYDLENDWLSFAPARC 454


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 124/357 (34%), Positives = 181/357 (50%), Gaps = 40/357 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM+L+IGTP       +DTGSD  WTQC+PC    CF Q  P+FDP+KSS+++ + CSS
Sbjct: 97  FLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKV--CFDQPTPIFDPEKSSSFSKLPCSS 154

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C A+  S+CS+G C Y + YG     S + G LATET TF   S     +  + FGCG
Sbjct: 155 DLCVALPISSCSDG-CEYRYSYGD---HSSTQGVLATETFTFGDAS-----VSKIGFGCG 205

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
             N     + S+  G++GLG G  SLISQ+G     KFSYCL      +G S +  G   
Sbjct: 206 EDNRGR--AYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSEA 260

Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
                + TPLI        YYLSLE ISVG+  L    S+       +G + +D+G   T
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTIT 320

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
            L     + LK    + +K      V A       LC+ +    S  + P++  HF G D
Sbjct: 321 YLKDNAFAALKKEFISQMKLD----VDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVD 376

Query: 316 VKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +KL   N +  + +  ++C      +   ++G   Q N ++ +D+E+  +SF P++C
Sbjct: 377 LKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 124/357 (34%), Positives = 180/357 (50%), Gaps = 40/357 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM+L+IGTP       +DTGSD  WTQC+PC    CF Q  P+FDP+KSS+++ + CSS
Sbjct: 97  FLMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKV--CFDQPTPIFDPEKSSSFSKLPCSS 154

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C A+  S+CS+G C Y + YG     S + G LATET TF   S     +  + FGCG
Sbjct: 155 DLCVALPISSCSDG-CEYRYSYGD---HSSTQGVLATETFTFGDAS-----VSKIGFGCG 205

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIV 209
             N     + S+  G++GLG G  SLISQ+G     KFSYCL      +G S +  G   
Sbjct: 206 EDNRGR--AYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSEA 260

Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
                + TPLI        YYLSLE ISVG+  L    S+       +G + +D+G   T
Sbjct: 261 TVKSAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTIT 320

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
            L     + LK    + +K      V A       LC+ +    S    P++  HF G D
Sbjct: 321 YLKDSAFAALKKEFISQMKLD----VDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVD 376

Query: 316 VKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +KL   N +  + +  ++C      +   ++G   Q N ++ +D+E+  +SF P++C
Sbjct: 377 LKLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 134/381 (35%), Positives = 190/381 (49%), Gaps = 55/381 (14%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A + S    YLM L+IGTPPV      DTGSD TWTQC+PC    CF Q+ P++D   SS
Sbjct: 84  ARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPIYDTAVSS 141

Query: 86  TYNSISCSSSQCAVVTS--NC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           +++ + C+S+ C  + S  NC  S   C Y + YG GAY   S+G L TETLTF    G 
Sbjct: 142 SFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAY---SAGVLGTETLTFPGAPG- 197

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---- 197
            V +  + FGCG  N          TG +GLG G+ SL++Q+G    GKFSYCL D    
Sbjct: 198 -VSVGGIAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCLTDFFNT 250

Query: 198 QGSSKINFGGIV------AGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------E 240
              S + FG +        GA V STPL+    +   YY+SLE IS+G+ RL       +
Sbjct: 251 SLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFD 310

Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNI 298
                +G + VD+G   T L     S  + V+ ++  +  QPV    +     D  C+  
Sbjct: 311 LRDDGSGGMIVDSGTTFTFL---VESAFRVVVDHVAGVLRQPVVNASSL----DSPCFPA 363

Query: 299 SSQ----PKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGG-NANI-VYGRIMQ 350
           ++     P  P++ +HF  GAD++L   N    N  +   C    G  +A++ + G   Q
Sbjct: 364 ATGEQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQ 423

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            N  + +DI    +SF P+ C
Sbjct: 424 QNIQMLFDITVGQLSFMPTDC 444


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 173/364 (47%), Gaps = 47/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  VD+GSD  W QC PC +  C+ Q  PLFDP  SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187

Query: 95  SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C  ++          G C YS  YG G+Y   + G LA ETLT   T+     +  V 
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
            GCGH+N       +   G++GLG G  SLI Q+G +  G FSYCL  +G+     G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGA--GSLV 294

Query: 210 AG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVD 252
            G       G V  PL+  +     YY+ L  I VG +RL       +      G + +D
Sbjct: 295 LGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMD 354

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
           TG   T LP E ++ L+      + A P       P  S +  CY++S  +  + P V+ 
Sbjct: 355 TGTAVTRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSF 409

Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           +F +GA + L   NL   +   + C AF   ++ I + G I Q    I  D     V F 
Sbjct: 410 YFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469

Query: 368 PSRC 371
           P+ C
Sbjct: 470 PNTC 473


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 181/352 (51%), Gaps = 30/352 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC PC  + C+KQ+ PLFDP KSSTY ++SC+ 
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPC-VVKCYKQKEPLFDPAKSSTYANVSCTD 221

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S CA + +N C+ G C Y+  YG G+Y   + G  A +TLT    +     +    FGCG
Sbjct: 222 SACADLDTNGCTGGHCLYAVQYGDGSY---TVGFFAQDTLTIAHDA-----IKGFRFGCG 273

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
            KN        K  G++GLG G +SL  Q      G F+YCLP    G+  ++FG   AG
Sbjct: 274 EKNNG---LFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAG 330

Query: 212 AGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
                TP++    +  YY+ +  I VG Q++    S  ST    VD+G + T LP   ++
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
            L S    ++ A   +G    PG+S +  CY+ +  S  + P V++ F+ GA + +  S 
Sbjct: 391 ALSSAFDKVMLA---RGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSG 447

Query: 323 LFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +   IS+  +C AF   G + ++ + G   Q  + + YD+ +  V F P  C
Sbjct: 448 IVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/371 (33%), Positives = 185/371 (49%), Gaps = 38/371 (10%)

Query: 25  QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           QA +++ D    +L++ S+G PPV     +DTGSD  W QC PC   DCF+Q  P+FDP 
Sbjct: 47  QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 104

Query: 83  KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           KSSTY  +S  S  C  +          C Y+  Y  G   S SSGNLATE + F ++  
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 161

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
             V + +V+FGCGH N      D +Q+GI+GL  G+ S++S++G+    +FSYC+ D   
Sbjct: 162 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 215

Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
                  +V G GV     STP       YY++LE ISVG  RL+          S  G 
Sbjct: 216 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 275

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCYNISSQPK---F 304
           + +D+G   T L  +    L + +  +++    + +    PG+   LCY          F
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 332

Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
           PE+  HF  GAD+ L  ++LF   + ++ C A    N  NI  V G + Q ++ + YD+ 
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392

Query: 361 QAMVSFKPSRC 371
              V F+ + C
Sbjct: 393 GKRVYFQRTDC 403


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 178/366 (48%), Gaps = 46/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM +SIGTP V     +DTGSD  WTQC+PC E  CF Q  P+FDP  SSTY ++ CSS
Sbjct: 102 FLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVE--CFNQSTPVFDPSSSSTYAALPCSS 159

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + +S C+   C Y++ YG    +S + G LA ET T   T     ++P+V FGCG
Sbjct: 160 TLCSDLPSSKCTSAKCGYTYTYGD---SSSTQGVLAAETFTLAKT-----KLPDVAFGCG 211

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVA 210
             N       ++  G++GLG G  SL+SQ+G +   KFSYCL    D   S +  G +  
Sbjct: 212 DTNEGD--GFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLLGSLAT 266

Query: 211 -------GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVD 252
                   + V +TPLI        YY++L+ ++VG+  +   SS+       TG + VD
Sbjct: 267 ISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIVD 326

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPEVT 308
           +G   T L L+ +  LK   +  +K     G     G     C+   +    Q + P++ 
Sbjct: 327 SGTSITYLELQGYRALKKAFAAQMKLPAADG----SGIGLDTCFEAPASGVDQVEVPKLV 382

Query: 309 IHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
            H  GAD+ L   N +  +     +C    G     + G   Q N    YD+ +  +SF 
Sbjct: 383 FHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGLSIIGNFQQQNIQFVYDVGENTLSFA 442

Query: 368 PSRCTN 373
           P +C  
Sbjct: 443 PVQCAK 448


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/371 (33%), Positives = 185/371 (49%), Gaps = 38/371 (10%)

Query: 25  QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           QA +++ D    +L++ S+G PPV     +DTGSD  W QC PC   DCF+Q  P+FDP 
Sbjct: 47  QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 104

Query: 83  KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           KSSTY  +S  S  C  +          C Y+  Y  G   S SSGNLATE + F ++  
Sbjct: 105 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 161

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
             V + +V+FGCGH N      D +Q+GI+GL  G+ S++S++G+    +FSYC+ D   
Sbjct: 162 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 215

Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
                  +V G GV     STP       YY++LE ISVG  RL+          S  G 
Sbjct: 216 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 275

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCYNISSQPK---F 304
           + +D+G   T L  +    L + +  +++    + +    PG+   LCY          F
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 332

Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
           PE+  HF  GAD+ L  ++LF   + ++ C A    N  NI  V G + Q ++ + YD+ 
Sbjct: 333 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 392

Query: 361 QAMVSFKPSRC 371
              V F+ + C
Sbjct: 393 GKRVYFQRTDC 403


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 173/364 (47%), Gaps = 47/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  VD+GSD  W QC PC +  C+ Q  PLFDP  SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187

Query: 95  SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C  ++          G C YS  YG G+Y   + G LA ETLT   T+     +  V 
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
            GCGH+N       +   G++GLG G  SL+ Q+G +  G FSYCL  +G+     G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG--GAGSLV 294

Query: 210 AG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVD 252
            G       G V  PL+  +     YY+ L  I VG +RL       +      G + +D
Sbjct: 295 LGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMD 354

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
           TG   T LP E ++ L+      + A P       P  S +  CY++S  +  + P V+ 
Sbjct: 355 TGTAVTRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSF 409

Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           +F +GA + L   NL   +   + C AF   ++ I + G I Q    I  D     V F 
Sbjct: 410 YFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFG 469

Query: 368 PSRC 371
           P+ C
Sbjct: 470 PNTC 473


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/371 (33%), Positives = 188/371 (50%), Gaps = 38/371 (10%)

Query: 25  QAEIISVD--DIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           QA +++ D    +L++ S+G PPV     +DTGSD  W QC PC   DCF+Q  P+FDP 
Sbjct: 79  QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCA--DCFRQSTPIFDPS 136

Query: 83  KSSTYNSISCSSSQC--AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           KSSTY  +S  S  C  +          C Y+  Y  G   S SSGNLATE + F ++  
Sbjct: 137 KSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADG---STSSGNLATEDIVFETSDQ 193

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
             V + +V+FGCGH N      D +Q+GI+GL  G+ S++S++G+    +FSYC+ D   
Sbjct: 194 GTVTVSSVVFGCGHSNRGR--FDGQQSGILGLSAGDQSIVSRLGS----RFSYCIGDLFD 247

Query: 201 SKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVGNQRLEF-------VSSSTGN 248
                  +V G GV     STP       YY++LE ISVG  RL+          S  G 
Sbjct: 248 PHYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGG 307

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE-PGFSDVLCY--NISSQPK-F 304
           + +D+G   T L  +    L + +  +++    + +    PG+   LCY   ++   + F
Sbjct: 308 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGW---LCYKGRVNEDLRGF 364

Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN-ANI--VYGRIMQINFLIGYDIE 360
           PE+  HF  GAD+ L  ++LF   + ++ C A    N  NI  V G + Q ++ + YD+ 
Sbjct: 365 PELAFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLI 424

Query: 361 QAMVSFKPSRC 371
              V F+ + C
Sbjct: 425 GKRVYFQRTDC 435


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 175/356 (49%), Gaps = 37/356 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+PP + +  VD+GSD  W QC+PC  L+C+ Q  PLFDP  S+T++++ C S
Sbjct: 127 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATSATFSAVPCGS 184

Query: 95  SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C  + TS C + G C Y   YG G+Y   + G LA ETLT   T+     +  V  GC
Sbjct: 185 AVCRTLRTSGCGDSGGCDYEVSYGDGSY---TKGALALETLTLGGTA-----VEGVAIGC 236

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA-G 211
           GH+N       +   G++GLG G  SL+ Q+G +  G FSYCL  +G+  +  G   A  
Sbjct: 237 GHRNRGLFVGAA---GLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVLGRSEAVP 293

Query: 212 AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL 260
            G V  PL+        YY+ L  I VG++RL       +      G + +DTG   T L
Sbjct: 294 EGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVTRL 353

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADV 316
           P E ++ L+      + A P       PG S +  CY++S  +  + P V+ +F G A +
Sbjct: 354 PQEAYAALRDAFVAAVGALP-----RAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAATL 408

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   NL   +   I C AF   ++   + G I Q    I  D     + F P+ C
Sbjct: 409 TLPARNLLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 181/352 (51%), Gaps = 30/352 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC PC  + C+KQ+ PLFDP KSSTY ++SC+ 
Sbjct: 163 YVVTVGLGTPASKYTVVFDTGSDTTWVQCRPC-VVKCYKQKGPLFDPAKSSTYANVSCTD 221

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S CA + +N C+ G C Y+  YG G+Y   + G  A +TLT    +     +    FGCG
Sbjct: 222 SACADLDTNGCTGGHCLYAVQYGDGSY---TVGFFAQDTLTIAHDA-----IKGFRFGCG 273

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
            KN        K  G++GLG G +SL  Q      G F+YCLP    G+  ++FG   AG
Sbjct: 274 EKNNG---LFGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAG 330

Query: 212 AGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
                TP++    +  YY+ +  I VG Q++    S  ST    VD+G + T LP   ++
Sbjct: 331 NNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYT 390

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
            L S    ++ A   +G    PG+S +  CY+ +  S  + P V++ F+ GA + +  S 
Sbjct: 391 ALSSAFDKVMLA---RGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSG 447

Query: 323 LFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +   IS+  +C AF   G + ++ + G   Q  + + YD+ +  V F P  C
Sbjct: 448 IVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 183/362 (50%), Gaps = 41/362 (11%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YLM+LSIGTP       +DTGSD  WTQC+PC +  CF Q  P+F+P+ SS+++++ 
Sbjct: 92  DGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGSSSFSTLP 149

Query: 92  CSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           CSS  C  + S  CS   C Y++ YG G   S + G++ TETLTF S     V +PN+ F
Sbjct: 150 CSSQLCQALQSPTCSNNSCQYTYGYGDG---SETQGSMGTETLTFGS-----VSIPNITF 201

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGG 207
           GCG  N           G++G+G G  SL SQ+  +   KFSYC+   GSS    +  G 
Sbjct: 202 GCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGS 256

Query: 208 IV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--------STGNIFVDT 253
           +     AG  +T LI    I   YY++L  +SVG+  L    S         TG I +D+
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIH 310
           G   T      +  ++    + +    V   G+  GF   LC+ +    S  + P   +H
Sbjct: 317 GTTLTYFVDNAYQAVRQAFISQMNLSVVN--GSSSGFD--LCFQMPSDQSNLQIPTFVMH 372

Query: 311 FRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
           F G D+ L   N F + S+ ++C A    +  + ++G I Q N L+ YD   ++VSF  +
Sbjct: 373 FDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSA 432

Query: 370 RC 371
           +C
Sbjct: 433 QC 434


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 121/365 (33%), Positives = 177/365 (48%), Gaps = 46/365 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+PP + +  VD+GSD  W QC+PC  L+C+ Q  PLFDP  S+T++++SC S
Sbjct: 125 YFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSATFSAVSCGS 182

Query: 95  SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C  + TS C + G C Y   YG G+Y   + G LA ETLT   T+     +  V  GC
Sbjct: 183 AICRTLRTSGCGDSGGCEYEVSYGDGSY---TKGTLALETLTLGGTA-----VEGVAIGC 234

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN----FGGI 208
           GH+N           G++GLG G  SL+ Q+G +  G FSYCL  +G S        G +
Sbjct: 235 GHRNRGLFVG---AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSL 291

Query: 209 VAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
           V G       G V  PL+        YY+ +  I VG++RL       +      G + +
Sbjct: 292 VLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVM 351

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
           DTG   T LP E ++ L+      + A P       PG S +  CY++S  +  + P V+
Sbjct: 352 DTGTAVTRLPQEAYAALRDAFVGAVGALP-----RAPGVSLLDTCYDLSGYTSVRVPTVS 406

Query: 309 IHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
            +F G A + L   NL   +   I C AF   ++ + + G I Q    I  D     + F
Sbjct: 407 FYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGF 466

Query: 367 KPSRC 371
            P+ C
Sbjct: 467 GPATC 471


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  171 bits (433), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 184/358 (51%), Gaps = 40/358 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM+++IGTP   +   +DTGSD  WTQCEPC +  CF Q  P+F+P+ SS+++++ C S
Sbjct: 96  YLMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQ--CFSQPTPIFNPQDSSSFSTLPCES 153

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             C  + S     DC Y++ YG G   S + G +ATET TF ++S     +PN+ FGCG 
Sbjct: 154 QYCQDLPSESCYNDCQYTYGYGDG---SSTQGYMATETFTFETSS-----VPNIAFGCGE 205

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVAG 211
            N           G+IG+G G  SL SQ+G    G+FSYC+        S +  G   +G
Sbjct: 206 DNQG--FGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASG 260

Query: 212 A--GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
              G  ST LI       +YY++L+ I+VG   L   SS+       TG + +D+G   T
Sbjct: 261 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 320

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGAD 315
            LP + ++ +    ++ I   PV    +  G S   C+ +    S  + PE+++ F G  
Sbjct: 321 YLPQDAYNAVAQAFTDQINLSPVD--ESSSGLST--CFQLPSDGSTVQVPEISMQFDGGV 376

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L   N+  + ++ ++C A    +     ++G I Q    + YD++   VSF P++C
Sbjct: 377 LNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 181/367 (49%), Gaps = 44/367 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ L++GTP   +  ++DTGSD  WTQC PC   DCF Q+ P+ DP  SSTY ++ C +
Sbjct: 84  YLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCGA 141

Query: 95  SQC-AVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEM 145
           ++C A+  ++C      +   C Y++ YG     S + G +AT+  TF  +  SG  +  
Sbjct: 142 ARCRALPFTSCGVRTLGNHRSCIYAYHYGD---KSLTVGEIATDRFTFGDSGGSGESLHT 198

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSK 202
             + FGCGH N       S +TGI G G G  SL SQ+  +    FSYC     +  SS 
Sbjct: 199 RRLTFGCGHLNKG--VFQSNETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFESKSSL 253

Query: 203 INFGGIVAG------AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFV 251
           +  GG  A       +G V T  I+++      Y+LSL+ ISVG  RL    +   +  +
Sbjct: 254 VTLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTII 313

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-----SQPKFPE 306
           D+G   T LP E +  +K+  +  +   P    G E    D LC+ +       +P  P 
Sbjct: 314 DSGASITTLPEEVYEAVKAEFAAQVGLPP---SGVEGSALD-LCFALPVTALWRRPAVPS 369

Query: 307 VTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
           +T+H  GAD +L  SN +F ++   +MC           V G   Q N  + YD+E   +
Sbjct: 370 LTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRL 429

Query: 365 SFKPSRC 371
           SF P+RC
Sbjct: 430 SFAPARC 436


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 131/375 (34%), Positives = 183/375 (48%), Gaps = 46/375 (12%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
            + SV   YLM L+IG PPV      DTGSD TWTQC+PC    CF Q+ P++DP  SST
Sbjct: 63  RLHSVQVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 120

Query: 87  YNSISCSSSQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           ++ + CSS+ C  + S NC+    C Y + YG GAY   S+G L TETLT   +S  PV 
Sbjct: 121 FSPLPCSSATCLPIWSRNCTPSSLCRYRYAYGDGAY---SAGILGTETLTLGPSSA-PVS 176

Query: 145 MPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
           +  V FGCG  N      DS   TG +GLG G  SL++Q+G    GKFSYCL D  +S +
Sbjct: 177 VGGVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSAL 229

Query: 204 N-------FGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EFVSS 244
           +          +  G   V STPL+        Y++SL+ IS+G+ RL       +    
Sbjct: 230 DSPFLLGTLAELAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGD 289

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QP 302
            TG + VD+G   T+L       +   ++ ++   PV          D  C+   +   P
Sbjct: 290 GTGGMIVDSGTTFTILAESGFREVVGRVARVLGQPPVNASSL-----DAPCFPAPAGEPP 344

Query: 303 KFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYD 358
             P++ +HF  GAD++L   N    N  D   C    G    +  V G   Q N  + +D
Sbjct: 345 YMPDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFD 404

Query: 359 IEQAMVSFKPSRCTN 373
                +SF P+ C+ 
Sbjct: 405 TTVGQLSFLPTDCSK 419


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  171 bits (432), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 189/363 (52%), Gaps = 43/363 (11%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YLM+++IGTP       +DTGSD  WTQCEPC +  CF Q  P+F+P+ SS+++++ 
Sbjct: 93  DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQ--CFSQPTPIFNPQDSSSFSTLP 150

Query: 92  CSSSQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           C S  C  + S  C+  +C Y++ YG G   S + G +ATET TF ++S     +PN+ F
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDG---STTQGYMATETFTFETSS-----VPNIAF 202

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGG 207
           GCG  N           G+IG+G G  SL SQ+G    G+FSYC+   GSS    +  G 
Sbjct: 203 GCGEDNQG--FGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGS 257

Query: 208 IVAGA--GVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTG 254
             +G   G  ST LI       +YY++L+ I+VG   L   SS+       TG + +D+G
Sbjct: 258 AASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSG 317

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHF 311
              T LP + ++ +    ++ I    V    +  G S   C+      S  + PE+++ F
Sbjct: 318 TTLTYLPQDAYNAVAQAFTDQINLPTVD--ESSSGLS--TCFQQPSDGSTVQVPEISMQF 373

Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
            G  + L   N+  + ++ ++C A  G ++ +   ++G I Q    + YD++   VSF P
Sbjct: 374 DGGVLNLGEQNILISPAEGVICLAM-GSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVP 432

Query: 369 SRC 371
           ++C
Sbjct: 433 TQC 435


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 186/369 (50%), Gaps = 41/369 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +  + + D  YLM+LSIGTP       +DTGSD  WTQC+PC +  CF Q  P+F+P+ S
Sbjct: 85  ETPVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQ--CFNQSTPIFNPQGS 142

Query: 85  STYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+++++ CSS  C  + S  CS   C Y++ YG G   S + G++ TETLTF S     V
Sbjct: 143 SSFSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDG---SETQGSMGTETLTFGS-----V 194

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---S 200
            +PN+ FGCG  N           G++G+G G  SL SQ+  +   KFSYC+   G   S
Sbjct: 195 SIPNITFGCGENNQG--FGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTS 249

Query: 201 SKINFGGIV--AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--------ST 246
           S +  G +     AG  +T LI    I   YY++L  +SVG+  L    S         T
Sbjct: 250 STLLLGSLANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGT 309

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK 303
           G I +D+G   T      +  ++    + +    V   G+  GF   LC+ +    S  +
Sbjct: 310 GGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVN--GSSSGFD--LCFQMPSDQSNLQ 365

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
            P   +HF G D+ L   N F + S+ ++C A    +  + ++G I Q N L+ YD   +
Sbjct: 366 IPTFVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNS 425

Query: 363 MVSFKPSRC 371
           +VSF  ++C
Sbjct: 426 VVSFLFAQC 434


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  170 bits (431), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 128/362 (35%), Positives = 181/362 (50%), Gaps = 52/362 (14%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-V 99
           IGTP +     VDTGSD  WTQC+PC  +DCFKQ  P+FDP  SSTY ++ CSS+ C+ +
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230

Query: 100 VTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
            TS C S   C Y++ YG    +S + G LATET T   +     ++P V+FGCG  N  
Sbjct: 231 PTSKCTSASKCGYTYTYGD---SSSTQGVLATETFTLAKS-----KLPGVVFGCGDTNEG 282

Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIVA----- 210
                S+  G++GLG G  SL+SQ+G     KFSYCL    D  +S +  G +       
Sbjct: 283 D--GFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEAS 337

Query: 211 --GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
              + V +TPLI        YY+SL+AI+VG+ R+   SS+       TG + VD+G   
Sbjct: 338 AAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSI 397

Query: 258 TLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS----QPKFPEVTIHF 311
           T L ++ +  LK   +  +   A    GVG +      LC+   +    Q + P +  HF
Sbjct: 398 TYLEVQGYRALKKAFAAQMALPAADGSGVGLD------LCFRAPAKGVDQVEVPRLVFHF 451

Query: 312 R-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
             GAD+ L   N +  +     +C    G     + G   Q NF   YD+    +SF P 
Sbjct: 452 DGGADLDLPAENYMVLDGGSGALCLTVMGSRGLSIIGNFQQQNFQFVYDVGHDTLSFAPV 511

Query: 370 RC 371
           +C
Sbjct: 512 QC 513


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 132/379 (34%), Positives = 189/379 (49%), Gaps = 49/379 (12%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
            + SV   YLM L+IGTPPV      DTGSD TWTQC+PC    CF Q+ P++DP  SST
Sbjct: 69  RLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 126

Query: 87  YNSISCSSSQCAVV--TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNST-SGL 141
           ++ + CSS+ C  V  + NCS     C Y + Y  GAY   S+G L TETLT  S+  G 
Sbjct: 127 FSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY---SAGILGTETLTLGSSVPGQ 183

Query: 142 PVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
            V + +V FGCG  N      DS   TG +GLG G  SL++Q+G    GKFSYCL D  +
Sbjct: 184 AVSVSDVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFN 236

Query: 201 SKIN-------FGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EF 241
           S ++          +  G G V STPL+        Y +SL+ I++G+ RL       + 
Sbjct: 237 STLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDL 296

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS- 300
            ++STG + VD+G   ++LP      +   ++ ++   PV          D  C+   + 
Sbjct: 297 HANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASSL-----DSPCFPAPAG 351

Query: 301 ---QPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGNANI-VYGRIMQINFL 354
               P  P++ +HF  GAD++L   N    N  D   C    G  +   + G   Q N  
Sbjct: 352 ERQLPFMPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQ 411

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + +D+    +SF P+ C+ 
Sbjct: 412 MLFDMTVGQLSFLPTDCSK 430


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  168 bits (426), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 169/354 (47%), Gaps = 49/354 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  VD+GSD  W QC PC +  C+ Q  PLFDP  SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187

Query: 95  SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C  ++          G C YS  YG G+Y   + G LA ETLT   T+     +  V 
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
            GCGH+N       +   G++GLG G  SL+ Q+G +  G FSYCL  +G+         
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGA--------- 287

Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
            GAG +++       YY+ L  I VG +RL       +      G + +DTG   T LP 
Sbjct: 288 GGAGSLASSF-----YYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPR 342

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF-RGADVKL 318
           E ++ L+      + A P       P  S +  CY++S  +  + P V+ +F +GA + L
Sbjct: 343 EAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTL 397

Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              NL   +   + C AF   ++ I + G I Q    I  D     V F P+ C
Sbjct: 398 PARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 122/370 (32%), Positives = 179/370 (48%), Gaps = 53/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ L+IGTPP+     +DTGSD  WTQC PC  L C  Q  P FD K+S+TY ++ C S
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSATYRALPCRS 146

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S+CA ++S +C +  C Y + YG  A    ++G LA ET TF + S   V   N+ FGCG
Sbjct: 147 SRCAALSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAASSTKVRAANISFGCG 203

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV- 209
             N     + S   G++G G G  SL+SQ+G S   +FSYCL    S   S++ FG    
Sbjct: 204 SLNAGELANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSPTPSRLYFGVFAN 257

Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
                  +G+ V STP +I     + Y+LS++ IS+G +RL             TG + +
Sbjct: 258 LNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVII 317

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-----LCYNISSQPK--- 303
           D+G   T L  + +  ++  +++ I           P  +D       C+     P    
Sbjct: 318 DSGTSITWLQQDAYEAVRRGLASTIPL---------PAMNDTDIGLDTCFQWPPPPNVTV 368

Query: 304 -FPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
             P+   HF GA++ L P N     S    +C A    +   + G   Q N  + YDI  
Sbjct: 369 TVPDFVFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIAN 428

Query: 362 AMVSFKPSRC 371
           + +SF P+ C
Sbjct: 429 SFLSFVPAPC 438


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 170/359 (47%), Gaps = 46/359 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  VD+GSD  W QC PC +  C+ Q  PLFDP  SS+++ +SC S
Sbjct: 130 YFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQ--CYAQTDPLFDPAASSSFSGVSCGS 187

Query: 95  SQCAVVT-----SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C  ++          G C YS  YG G+Y   + G LA ETLT   T+     +  V 
Sbjct: 188 AICRTLSGTGCGGGGDAGKCDYSVTYGDGSY---TKGELALETLTLGGTA-----VQGVA 239

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
            GCGH+N       +   G++GLG G  SL+ Q+G +  G FSYCL  +G+     G +V
Sbjct: 240 IGCGHRNSGLFVGAA---GLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAG--GAGSLV 294

Query: 210 AGAGVVSTPLIIRDH-----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
            G     T  + R       YY+ L  I VG +RL       +      G + +DTG   
Sbjct: 295 LG----RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF-RG 313
           T LP E ++ L+      + A P       P  S +  CY++S  +  + P V+ +F +G
Sbjct: 351 TRLPREAYAALRGAFDGAMGALP-----RSPAVSLLDTCYDLSGYASVRVPTVSFYFDQG 405

Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A + L   NL   +   + C AF   ++ I + G I Q    I  D     V F P+ C
Sbjct: 406 AVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  167 bits (423), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 126/367 (34%), Positives = 182/367 (49%), Gaps = 47/367 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM ++IGTP +     VDTGSD  WTQC+PC  +DCFKQ  P+FDP  SSTY ++ CSS
Sbjct: 100 FLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSS 157

Query: 95  SQCA-VVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C+ + TS C S   C Y++ YG    AS + G LA+ET T         ++P V FGC
Sbjct: 158 ALCSDLPTSTCTSASKCGYTYTYGD---ASSTQGVLASETFTLGKEK---KKLPGVAFGC 211

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFGGI 208
           G  N       ++  G++GLG G  SL+SQ+G     KFSYCL       G S +  GG 
Sbjct: 212 GDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDGDGKSPLLLGGS 266

Query: 209 VAGAG-------VVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIF 250
            A          V +TPL+        YY+SL  ++VG+ R+   +S+       TG + 
Sbjct: 267 AAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVI 326

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPE 306
           VD+G   T L L+ +  LK      +    V   G+E G    LC+   +    + + P+
Sbjct: 327 VDSGTSITYLELQGYRALKKAFVAQMALPTVD--GSEIGLD--LCFQGPAKGVDEVQVPK 382

Query: 307 VTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           + +HF  GAD+ L   N +  + +   +C          + G   Q NF   YD+    +
Sbjct: 383 LVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAGDTL 442

Query: 365 SFKPSRC 371
           SF P +C
Sbjct: 443 SFAPVQC 449


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 134/360 (37%), Positives = 195/360 (54%), Gaps = 43/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM L+IGTPP      +DTGSD  WTQC+PC +  CF Q  P+FDPKKSS+++ +SCSS
Sbjct: 97  FLMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQ--CFDQPTPIFDPKKSSSFSKLSCSS 154

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C A+  S CS+G C Y  LYG G Y+S + G LA+ETLTF       V +P V FGCG
Sbjct: 155 KLCEALPQSTCSDG-CEY--LYGYGDYSS-TQGMLASETLTFGK-----VSVPEVAFGCG 205

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI-- 208
             N  S    S+ +G++GLG G  SL+SQ+      KFSYCL    D  +S +  G +  
Sbjct: 206 EDNEGS--GFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLAS 260

Query: 209 --VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGV 255
              + + + +TPLI        YYLSLE ISVG+  L    S+       +G + +D+G 
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFR 312
             T L       +    ++ I   PV   G+  G    +C+ +   S+  + P++  HF 
Sbjct: 321 TITYLEQSAFDLVAKEFTSQINL-PVDNSGST-GLE--VCFTLPSGSTDIEVPKLVFHFD 376

Query: 313 GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           GAD++L   N +  + S  + C A    +   ++G I Q N L+ +D+E+  +SF P++C
Sbjct: 377 GADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQC 436


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 136/370 (36%), Positives = 200/370 (54%), Gaps = 43/370 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +A ++  +  +LM L+IGTPP      +DTGSD  WTQC+PC +  CF Q  P+FDPKKS
Sbjct: 87  EAPVLPGNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQ--CFHQSTPIFDPKKS 144

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           S+++ +SCSS  C A+  S+C+ G C Y  LY  G Y+S + G LA+ETLTF   S    
Sbjct: 145 SSFSKLSCSSQLCEALPQSSCNNG-CEY--LYSYGDYSS-TQGILASETLTFGKAS---- 196

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
            +PNV FGCG  N  S    S+  G++GLG G  SL+SQ+      KFSYCL    D  +
Sbjct: 197 -VPNVAFGCGADNEGS--GFSQGAGLVGLGRGPLSLVSQLKEP---KFSYCLTTVDDTKT 250

Query: 201 SKINFGGI----VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS------- 245
           S +  G +     + + + +TPLI        YYLSLE ISVG+ RL    S+       
Sbjct: 251 STLLMGSLASVNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDG 310

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQP 302
           +G + +D+G   T L  E   NL +         PV   G+  G    +C+ +   S+  
Sbjct: 311 SGGLIIDSGTTITYLE-ESAFNLVAKEFTAKINLPVDSSGST-GLD--VCFTLPSGSTNI 366

Query: 303 KFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
           + P++  HF GAD++L   N +  + S  + C A    +   ++G + Q N L+ +D+E+
Sbjct: 367 EVPKLVFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEK 426

Query: 362 AMVSFKPSRC 371
             +SF P++C
Sbjct: 427 ETLSFLPTQC 436


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 132/376 (35%), Positives = 187/376 (49%), Gaps = 46/376 (12%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
            + SV   YLM L+IGTPPV      DTGSD TWTQC+PC    CF Q+ P++DP  SST
Sbjct: 58  RLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQPCKL--CFPQDTPVYDPSASST 115

Query: 87  YNSISCSSSQC--AVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST-SGL 141
           ++ + CSS+ C     + NCS     C Y + Y  GAY   S G L TETLT  S+  G 
Sbjct: 116 FSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDGAY---SVGILGTETLTIGSSVPGQ 172

Query: 142 PVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
            V + +V FGCG  N      DS   TG +GLG G  SL++Q+G    GKFSYCL D  +
Sbjct: 173 TVSVGSVAFGCGTDN----GGDSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFN 225

Query: 201 SKIN---FGGIVA----GAGVV-STPLIIR----DHYYLSLEAISVGNQRL-------EF 241
           S ++   F G +A    G G V STPL+        Y+++L+ IS+G+ RL       + 
Sbjct: 226 STMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDL 285

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISS 300
            +   G + VD+G   T+L       +   ++ ++   PV          D  C+ +   
Sbjct: 286 RADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLLGQPPVNASSL-----DSPCFPSPDG 340

Query: 301 QPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
           +P  P++ +HF  GAD++L   N    N  D   C    G  +     G   Q N  + +
Sbjct: 341 EPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLF 400

Query: 358 DIEQAMVSFKPSRCTN 373
           D+    +SF P+ C+ 
Sbjct: 401 DMTVGQLSFLPTDCSK 416


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 133/392 (33%), Positives = 196/392 (50%), Gaps = 63/392 (16%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A + S    YLM L+IGTPPV      DTGSD TWTQC+PC    CF Q+ P++D   S+
Sbjct: 86  ARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPCKL--CFPQDTPIYDTAASA 143

Query: 86  TYNSISCSSSQCAVV---TSNCSE---GDCSYSFLYGRGAYASFSSGNLATETLTF-NST 138
           +++ + C+S+ C  +   + NC+      C Y + Y  GAY   S+G L TETLTF  S+
Sbjct: 144 SFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAY---SAGVLGTETLTFAGSS 200

Query: 139 SGLP---VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
            G P   V +  V FGCG  N          TG +GLG G+ SL++Q+G    GKFSYCL
Sbjct: 201 PGAPGPGVSVGGVAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLGV---GKFSYCL 254

Query: 196 PD----QGSSKINFGGI--------VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRL 239
            D       S + FG +        + GA V STPL+   +    YY+SLE IS+G+ RL
Sbjct: 255 TDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARL 314

Query: 240 -------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGF 290
                  +     +G + VD+G + T+L     S  + V++++  +  QPV    +    
Sbjct: 315 PIPNGTFDLRDDGSGGMIVDSGTIFTVL---VESAFRVVVNHVAGVLNQPVVNASSL--- 368

Query: 291 SDVLCYNISSQ----PKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVY 345
            D  C+  ++     P  P++ +HF  GAD++L   N + + + E          A   Y
Sbjct: 369 -DSPCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDN-YMSFNQESSSFCLNIAGAPSAY 426

Query: 346 GRIM----QINFLIGYDIEQAMVSFKPSRCTN 373
           G I+    Q N  + +DI    +SF P+ C+ 
Sbjct: 427 GSILGNFQQQNIQMLFDITVGQLSFVPTDCSK 458


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 122/373 (32%), Positives = 186/373 (49%), Gaps = 48/373 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPPV      DTGSD  WTQC PC    CF+Q  PL++P  S+T+  + C+S
Sbjct: 86  YLMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSS-QCFQQPTPLYNPSSSTTFAVLPCNS 144

Query: 95  --SQCAVVTSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNV 148
             S CA   +  +      C Y+  YG G    ++S    +ET TF +ST      +P +
Sbjct: 145 SLSMCAAALAGTTPPPGCTCMYNMTYGSG----WTSVYQGSETFTFGSSTPANQTGVPGI 200

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINF 205
            FGC   N +   + S  +G++GLG G+ SL+SQ+G     KFSYCL    D  S+    
Sbjct: 201 AFGC--SNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLL 255

Query: 206 GGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSS-------TG 247
            G  A      GV STP +       +  +YYL+L  IS+G   L   +++       TG
Sbjct: 256 LGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTG 315

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQPK 303
              +D+G   TLL    +  +++ + +++      G  A  G    LC+ +    S+ P 
Sbjct: 316 GFIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGGSAATGLD--LCFELPSSTSAPPT 373

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDIE 360
            P +T+HF GAD+ L P++ +  +   + C A +    G  +I+ G   Q N  I YD+ 
Sbjct: 374 MPSMTLHFDGADMVL-PADSYMMLDSNLWCLAMQNQTDGGVSIL-GNYQQQNMHILYDVG 431

Query: 361 QAMVSFKPSRCTN 373
           Q  ++F P++C+ 
Sbjct: 432 QETLTFAPAKCST 444


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  164 bits (416), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 119/353 (33%), Positives = 180/353 (50%), Gaps = 31/353 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP +SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 237

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ + T  CS G C Y   YG G+Y   S G  A +TLT +S   +        FGCG
Sbjct: 238 PACSDLDTRGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 290

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG     
Sbjct: 291 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPA 347

Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
           A + +TP+++ +    YY+ L  I VG + L    S  +T    VD+G + T LP   +S
Sbjct: 348 ARLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYS 407

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSN 322
           +L+S  +  + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  S 
Sbjct: 408 SLRSAFAAAMSA---RGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASG 464

Query: 323 LFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +    S   +C AF     GG+  IV G      F + YDI + +VSF P  C
Sbjct: 465 IMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 187/371 (50%), Gaps = 46/371 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+      DTGSD  WTQC PC    CF+Q  PL++P  S+T++ + C+S
Sbjct: 112 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNS 170

Query: 95  --SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             S CA   +  +      C Y+  YG G    +++G   +ET TF S++     +P V 
Sbjct: 171 SLSMCAGALAGAAPPPGCACMYNQTYGTG----WTAGVQGSETFTFGSSAADQARVPGVA 226

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINF 205
           FGC +   AS +  +   G++GLG G+ SL+SQ+G   AG+FSYCL P Q    +S +  
Sbjct: 227 FGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLL 280

Query: 206 GGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSSTGNI 249
           G   A  G GV STP +       +  +YYL+L  IS+G + L             TG +
Sbjct: 281 GPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGL 340

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK--F 304
            +D+G   T L    +  +++ + +++   P        G    LC+ +   +S P    
Sbjct: 341 IIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLD--LCFALPAPTSAPPAVL 398

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
           P +T+HF GAD+ L P++ +      + C A R     A   +G   Q N  I YD+ + 
Sbjct: 399 PSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREE 457

Query: 363 MVSFKPSRCTN 373
            +SF P++C+ 
Sbjct: 458 TLSFAPAKCST 468


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 125/369 (33%), Positives = 188/369 (50%), Gaps = 47/369 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+     VDTGSD  WTQC PC  + C  Q  P F P +S+TY  + C S
Sbjct: 92  YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPARSATYRLVPCRS 149

Query: 95  SQCAVVT-SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             CA +    C +   C Y + YG  A    ++G LA+ET TF + +   V + +V FGC
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEAS---TAGVLASETFTFGAANSSKVMVSDVAFGC 206

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
           G+ N     + S   G++GLG G  SL+SQ+G S   +FSYCL    S   S++NFG   
Sbjct: 207 GNINSGQLANSS---GMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPSRLNFGVFA 260

Query: 210 ---------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNI 249
                    +G+ V STPL++       Y++SL+ IS+G +RL             TG +
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FP 305
           F+D+G   T L  + +  ++  + ++++  P      E G     C+     P      P
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTN-DTEIGLET--CFPWPPPPSVAVTVP 377

Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQA 362
           ++ +HF  GA++ + P N +  + +   +C A  R G+A I+ G   Q N  I YDI  +
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATII-GNYQQQNMHILYDIANS 436

Query: 363 MVSFKPSRC 371
           ++SF P+ C
Sbjct: 437 LLSFVPAPC 445


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  164 bits (414), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 187/369 (50%), Gaps = 47/369 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+     VDTGSD  WTQC PC  + C  Q  P F P +S+TY  + C S
Sbjct: 92  YLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPARSATYRLVPCRS 149

Query: 95  SQCAVV--TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             CA +   +      C Y + YG  A    ++G LA+ET TF + +   V + +V FGC
Sbjct: 150 PLCAALPYPACFQRSVCVYQYYYGDEAS---TAGVLASETFTFGAANSSKVMVSDVAFGC 206

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
           G+ N     + S   G++GLG G  SL+SQ+G S   +FSYCL    S   S++NFG   
Sbjct: 207 GNINSGQLANSS---GMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPSRLNFGVFA 260

Query: 210 ---------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNI 249
                    +G+ V STPL++       Y++SL+ IS+G +RL             TG +
Sbjct: 261 TLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGV 320

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FP 305
           F+D+G   T L  + +  ++  + ++++  P      E G     C+     P      P
Sbjct: 321 FIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTN-DTEIGLET--CFPWPPPPSVAVTVP 377

Query: 306 EVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQA 362
           ++ +HF  GA++ + P N +  + +   +C A  R G+A I+ G   Q N  I YDI  +
Sbjct: 378 DMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDATII-GNYQQQNMHILYDIANS 436

Query: 363 MVSFKPSRC 371
           ++SF P+ C
Sbjct: 437 LLSFVPAPC 445


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 172/355 (48%), Gaps = 34/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP       VDTGS  TW QC PC  + C +Q  PLFDP+ SSTY S+ CS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLFDPRASSTYASVRCSA 192

Query: 95  SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           SQC       +  S CS  + C Y   YG    +SFS G+L+T+T++F ST       P+
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGD---SSFSVGSLSTDTVSFGST-----RYPS 244

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFG 206
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   S+  ++ G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIG 301

Query: 207 GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
               G     TP+    +    Y+++L  +SVG   L    S   ++   +D+G + T L
Sbjct: 302 PYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
           P   H+ L   +     AQ + G    P FS  D      +SQ + P V + F  GA +K
Sbjct: 362 PTAVHTALSKAV-----AQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMK 416

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L+  N+  ++ D   C AF   ++  + G   Q  F + YD+ Q+ + F    C+
Sbjct: 417 LTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 178/365 (48%), Gaps = 43/365 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ L+IGTPP+     +DTGSD  WTQC PC  L C  Q  P FD KKS+TY ++ C S
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRS 146

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S+CA ++S +C +  C Y + YG  A    ++G LA ET TF + +   V   N+ FGCG
Sbjct: 147 SRCASLSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCG 203

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG---- 206
             N     + S   G++G G G  SL+SQ+G S   +FSYCL    S   S++ FG    
Sbjct: 204 SLNAGDLANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYAN 257

Query: 207 ----GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
                  +G+ V STP +I     + Y+LSL+AIS+G + L             TG + +
Sbjct: 258 LSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVII 317

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEV 307
           D+G   T L  + +   ++V   ++ A P+  +       D  C+     P      P++
Sbjct: 318 DSGTSITWLQQDAY---EAVRRGLVSAIPLPAMNDTDIGLDT-CFQWPPPPNVTVTVPDL 373

Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
             HF  A++ L P N     S    +C          + G   Q N  + YDI  + +SF
Sbjct: 374 VFHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSF 433

Query: 367 KPSRC 371
            P+ C
Sbjct: 434 VPAPC 438


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/366 (34%), Positives = 178/366 (48%), Gaps = 34/366 (9%)

Query: 23  IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           ++   + S +  YL+ +S G+PP      VDTGSD  WTQC PC    C      +FDP 
Sbjct: 68  LFSTPVASGNGEYLIDISFGSPPQKASVIVDTGSDLIWTQCLPCET--CNAAASVIFDPV 125

Query: 83  KSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           KSSTY+++SC+S+ C+ +        C Y ++YG G   S +SG L+TET+T  + +   
Sbjct: 126 KSSTYDTVSCASNFCSSLPFQSCTTSCKYDYMYGDG---SSTSGALSTETVTVGTGT--- 179

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
             +PNV FGCGH NL    S +   GI+GLG G  SLISQ  +  + KFSYCL   GS+K
Sbjct: 180 --IPNVAFGCGHTNLG---SFAGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTK 234

Query: 203 IN---FGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFV-------SSSTGN 248
            +    G   A  GV  T L+        YY  L  ISV  + + +        +S  G 
Sbjct: 235 TSPMLIGDSAAAGGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGG 294

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISSQPKFPE 306
             +D+G   T L     + L + +   +      G   G +  FS     N    P +P 
Sbjct: 295 FILDSGTTLTYLETGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVAN----PTYPT 350

Query: 307 VTIHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           +T HF+GAD +L P N+F  + +   +C A        + G I Q N LI +D+    V 
Sbjct: 351 MTFHFKGADYELPPENVFVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVG 410

Query: 366 FKPSRC 371
           FK + C
Sbjct: 411 FKEANC 416


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/355 (33%), Positives = 172/355 (48%), Gaps = 34/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP       VDTGS  TW QC PC  + C +Q  PLFDP+ SSTY S+ CS+
Sbjct: 134 YVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLFDPRASSTYTSVRCSA 192

Query: 95  SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           SQC       +  S CS  + C Y   YG    +SFS G L+T+T++F STS      P+
Sbjct: 193 SQCDELQAATLNPSACSASNVCIYQASYGD---SSFSVGYLSTDTVSFGSTS-----YPS 244

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFG 206
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   S+  ++ G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIG 301

Query: 207 GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
               G     TP+    +    Y+++L  +SVG   L    S   ++   +D+G + T L
Sbjct: 302 PYNTGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRL 361

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
           P   H+ L   +     AQ + G    P FS  D      +SQ + P V + F  GA +K
Sbjct: 362 PTAVHTALSKAV-----AQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMK 416

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L+  N+  ++ D   C AF   ++  + G   Q  F + YD+ Q+ + F    C+
Sbjct: 417 LTTRNVLIDVDDSTTCLAFAPTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 123/373 (32%), Positives = 179/373 (47%), Gaps = 49/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+      DTGSD  WTQC PC    CF+Q  PL++P  S+T+  + C+S
Sbjct: 92  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 150

Query: 95  SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S      +    G        C+Y+  YG G  + F      +ET TF ST      +P 
Sbjct: 151 SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGHARVPG 206

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
           + FGC     +S  + S  +G++GLG G  SL+SQ+G     KFSYCL    D  S+   
Sbjct: 207 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 261

Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
             G  A     AGV STP +       +   YYL+L  IS+G   L          +  T
Sbjct: 262 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 321

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
           G + +D+G   TLL    +  +++ + +++      G  A+ G    LC+ +    S+ P
Sbjct: 322 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG-SADTGLD--LCFMLPSSTSAPP 378

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
             P +T+HF GAD+ L   +   +    + C A +    G  NI+ G   Q N  I YDI
Sbjct: 379 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 437

Query: 360 EQAMVSFKPSRCT 372
            Q  +SF P++C+
Sbjct: 438 GQETLSFAPAKCS 450


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 48/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+ ++IGTPP+ +   +DTGSD  WTQC+ PC    CF Q  PL+ P +S+TY ++SC 
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149

Query: 94  SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S  C  +    S CS  D  C+Y F YG G   + + G LATET T  S +     +  V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--FG 206
            FGCG +NL S  + S   G++G+G G  SL+SQ+G +   +FSYC     ++  +  F 
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFL 256

Query: 207 GIVA--GAGVVSTPLI---------IRDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
           G  A   +   +TP +            +YYLSLE I+VG+  L              G 
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPE 306
           + +D+G   T L       L   +++ ++  P+   GA  G S  LC+  +S    + P 
Sbjct: 317 VIIDSGTTFTALEERAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPR 372

Query: 307 VTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           + +HF GAD++L   S +  + S  + C          V G + Q N  I YD+E+ ++S
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 432

Query: 366 FKPSRC 371
           F+P++C
Sbjct: 433 FEPAKC 438


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 128/406 (31%), Positives = 199/406 (49%), Gaps = 59/406 (14%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
            N++KL   + + T  +P+S        +V   +LM L+IGTPP+      DTGSD  WT
Sbjct: 58  HNARKLAASSSDGTVSAPVSPT------TVPGEFLMTLAIGTPPLPFLAIADTGSDLIWT 111

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYA 121
           QC PC    CF+Q  PL++P  S+T++++ C+SS   +    C+   C Y+  YG G   
Sbjct: 112 QCAPCSR-QCFQQPTPLYNPSSSTTFSALPCNSS-LGLCAPACA---CMYNMTYGSGWTY 166

Query: 122 SFSSGNLATETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
            F      TET TF +ST    V +P + FGC   N +S  + S  +G++GLG G+ SL+
Sbjct: 167 VFQ----GTETFTFGSSTPADQVRVPGIAFGC--SNASSGFNASSASGLVGLGRGSLSLV 220

Query: 181 SQMGTSIAGKFSYCL-PDQG-----------SSKINFGGIVAGAGVVSTPLIIRDHYYLS 228
           SQ+G   A KFSYCL P Q            S+ +N  G+V+    V++P  I  +YYL+
Sbjct: 221 SQLG---APKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVASPSSI--YYYLN 275

Query: 229 LEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV 281
           L  IS+G   L          +  TG + +D+G   T+L    +  +++ + +++     
Sbjct: 276 LTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAVLSLVTLPTT 335

Query: 282 KGVGAEPGFSDVLCYNI----SSQPKFPEVTIHFRGADVKLSPSNLFR-----NISDEIM 332
            G  A  G    LC+ +    S+ P  P +T+HF GAD+ L   N        +    + 
Sbjct: 336 DGSAAT-GLD--LCFELPSSTSAPPSMPSMTLHFDGADMVLPADNYMMSLSDPDSDSSLW 392

Query: 333 CSAFRG-----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           C A +      G    + G   Q N  I YD+ +  +SF P++C+ 
Sbjct: 393 CLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCST 438


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 185/366 (50%), Gaps = 48/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+ ++IGTPP+ +   +DTGSD  WTQC+ PC    CF Q  PL+ P +S+TY ++SC 
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149

Query: 94  SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S  C  +    S CS  D  C+Y F YG G   + + G LATET T  S +     +  V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--FG 206
            FGCG +NL S  + S   G++G+G G  SL+SQ+G +   +FSYC     ++  +  F 
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFL 256

Query: 207 GIVA--GAGVVSTPLI---------IRDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
           G  A   +   +TP +            +YYLSLE I+VG+  L              G 
Sbjct: 257 GSSARLSSAAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPE 306
           + +D+G   T L       L   +++ ++  P+   GA  G S  LC+  +S    + P 
Sbjct: 317 VIIDSGTTFTALEESAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPR 372

Query: 307 VTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           + +HF GAD++L   S +  + S  + C          V G + Q N  I YD+E+ ++S
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 432

Query: 366 FKPSRC 371
           F+P++C
Sbjct: 433 FEPAKC 438


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 123/373 (32%), Positives = 179/373 (47%), Gaps = 49/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+      DTGSD  WTQC PC    CF+Q  PL++P  S+T+  + C+S
Sbjct: 32  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 90

Query: 95  SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S      +    G        C+Y+  YG G  + F      +ET TF ST      +P 
Sbjct: 91  SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGHARVPG 146

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
           + FGC     +S  + S  +G++GLG G  SL+SQ+G     KFSYCL    D  S+   
Sbjct: 147 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201

Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSST 246
             G  A     AGV STP +       +   YYL+L  IS+G   L          +  T
Sbjct: 202 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFSLNADGT 261

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
           G + +D+G   TLL    +  +++ + +++      G  A+ G    LC+ +    S+ P
Sbjct: 262 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDG-SADTGLD--LCFMLPSSTSAPP 318

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
             P +T+HF GAD+ L   +   +    + C A +    G  NI+ G   Q N  I YDI
Sbjct: 319 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 377

Query: 360 EQAMVSFKPSRCT 372
            Q  +SF P++C+
Sbjct: 378 GQETLSFAPAKCS 390


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 121/369 (32%), Positives = 183/369 (49%), Gaps = 45/369 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M L+IGTPP+      DTGSD  WTQC PC    CFKQ    ++P  S+T+  + C+S
Sbjct: 88  YIMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGS-QCFKQAGQPYNPSSSTTFGVLPCNS 146

Query: 95  --SQCAVVTSNCSEGDCS--YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             S CA +        CS  Y+  YG G    +++G  + ET TF ST      +P + F
Sbjct: 147 SVSMCAALAGPSPPPGCSCMYNQTYGTG----WTAGIQSVETFTFGSTPADQTRVPGIAF 202

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINFG 206
           GC +   AS    +   G++GLG G+ SL+SQ+G   AG FSYCL P Q    +S +  G
Sbjct: 203 GCSN---ASSDDWNGSAGLVGLGRGSMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLG 256

Query: 207 GIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLE-------FVSSSTGNIF 250
              A  G GV++TP +       +  +YYL+L  IS+G   L          +  TG + 
Sbjct: 257 PSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLI 316

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ----PKFPE 306
           +D+G   T L    +  +++ + +++   PV       G    LC+ ++S+    P  P 
Sbjct: 317 IDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLD--LCFALTSETSTPPSMPS 373

Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQAMV 364
           +T HF GAD+ L P + +  +   + C A R     A   +G   Q N  + YDI +  +
Sbjct: 374 MTFHFDGADMVL-PVDNYMILGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETL 432

Query: 365 SFKPSRCTN 373
           SF P++C+ 
Sbjct: 433 SFAPAKCST 441


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/374 (33%), Positives = 186/374 (49%), Gaps = 39/374 (10%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE--LDCFKQEPPLFDPK 82
           +++II+    YLM++++GTPP  +    DTGSD  W  C        D       +F P 
Sbjct: 93  ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPT 152

Query: 83  KSSTYNSISCSSSQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           +SSTY+ +SC S+ C A+  ++C ++ +C Y + YG G   S + G L+TET +F    G
Sbjct: 153 RSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDG---SRTIGVLSTETFSFVDGGG 209

Query: 141 L-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL-- 195
              V +P V FGC   +  +  SD    G++GLG G  SL+SQ+G  T I  K SYCL  
Sbjct: 210 KGQVRVPRVNFGCSTASAGTFRSD----GLVGLGAGAFSLVSQLGATTHIDRKLSYCLIP 265

Query: 196 --PDQGSSKINFG--GIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTGN 248
                 SS +NFG   +V+  G  STPL+  D   +Y ++LE+++VG Q    V++    
Sbjct: 266 SYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQE---VATHDSR 322

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----- 303
           I VD+G   T L       L + +   IK Q V+     P     LCY++  + +     
Sbjct: 323 IIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQ----PPEQLLQLCYDVQGKSETDNFG 378

Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDI 359
            P+VT+ F  GA V L P N F  + +  +C             + G I Q NF +GYD+
Sbjct: 379 IPDVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDL 438

Query: 360 EQAMVSFKPSRCTN 373
           +   V+F  + C  
Sbjct: 439 DARTVTFAAADCAR 452


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/372 (33%), Positives = 185/372 (49%), Gaps = 47/372 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+      DTGSD  WTQC PC    CF+Q  PL++P  S+T++ + C+S
Sbjct: 114 YLMTLAIGTPPLPYAAVADTGSDLIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNS 172

Query: 95  --SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             S CA   +  +      C Y   YG G    +++G   +ET TF S++     +P V 
Sbjct: 173 SLSMCAGALAGAAPPPGCACMYYQTYGTG----WTAGVQGSETFTFGSSAADQARVPGVA 228

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQ---GSSKINF 205
           FGC +   AS +  +   G++GLG G+ SL+SQ+G   AG+FSYCL P Q    +S +  
Sbjct: 229 FGCSN---ASSSDWNGSAGLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLL 282

Query: 206 GGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSSSTGNI 249
           G   A  G GV STP +       +  +YYL+L  IS+G + L             TG +
Sbjct: 283 GPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTGGL 342

Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPK-- 303
            +D+G  + +L    Y     +V S ++   P        G    LC+ +   +S P   
Sbjct: 343 IIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLD--LCFALPAPTSAPPAV 400

Query: 304 FPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
            P +T+HF GAD+ L P++ +      + C A R     A   +G   Q N  I YD+ +
Sbjct: 401 LPSMTLHFDGADMVL-PADSYMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRE 459

Query: 362 AMVSFKPSRCTN 373
             +SF P++C+ 
Sbjct: 460 ETLSFAPAKCST 471


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 123/373 (32%), Positives = 178/373 (47%), Gaps = 49/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L+IGTPP+      DTGSD  WTQC PC    CF+Q  PL++P  S+T+  + C+S
Sbjct: 90  YLMALAIGTPPLPYQAIADTGSDLIWTQCAPCTS-QCFRQPTPLYNPSSSTTFAVLPCNS 148

Query: 95  SQCAVVTSNCSEGD-------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S      +    G        C+Y+  YG G  + F      +ET TF ST      +P 
Sbjct: 149 SLSVCAAALAGTGTAPPPGCACTYNVTYGSGWTSVFQ----GSETFTFGSTPAGQSRVPG 204

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
           + FGC     +S  + S  +G++GLG G  SL+SQ+G     KFSYCL    D  S+   
Sbjct: 205 IAFGC--STASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 259

Query: 205 FGGIVAG----AGVVSTPLI-------IRDHYYLSLEAISVGNQRLE-------FVSSST 246
             G  A     AGV STP +       +   YYL+L  IS+G   L          +  T
Sbjct: 260 LLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFLLNADGT 319

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI----SSQP 302
           G + +D+G   TLL    +  +++ + +++      G  A  G    LC+ +    S+ P
Sbjct: 320 GGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAAT-GLD--LCFMLPSSTSAPP 376

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDI 359
             P +T+HF GAD+ L   +   +    + C A +    G  NI+ G   Q N  I YDI
Sbjct: 377 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNIL-GNYQQQNMHILYDI 435

Query: 360 EQAMVSFKPSRCT 372
            Q  +SF P++C+
Sbjct: 436 GQETLSFAPAKCS 448


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 124/371 (33%), Positives = 181/371 (48%), Gaps = 41/371 (11%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +++ D  YLM + IGTP       +DTGSD  WTQC PC  L C  Q  P FDP  SSTY
Sbjct: 85  VLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPANSSTY 142

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            S+ CS+  C A+    C +  C Y + YG  A    ++G LA ET TF  T+   V +P
Sbjct: 143 RSLGCSAPACNALYYPLCYQKTCVYQYFYGDSAS---TAGVLANETFTFG-TNDTRVTLP 198

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
            + FGCG+ N  S  + S   G++G G G+ SL+SQ+G+    +FSYCL    S   S++
Sbjct: 199 RISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRSRL 252

Query: 204 NFGGIVA-----GAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSSST 246
            FG          + V STP II       Y+L++  ISVG  RL           +  T
Sbjct: 253 YFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGT 312

Query: 247 GNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-- 303
           G   +D+G   T L    Y++  ++ +  +    P+  V  E    D  C+     P+  
Sbjct: 313 GGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDV-TETSVLDT-CFQWPPPPRQS 370

Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              P++ +HF GAD +L   N +  + S   +C A    +   + G     NF + YD+E
Sbjct: 371 VTLPQLVLHFDGADWELPLQNYMLVDPSTGGLCLAMATSSDGSIIGSYQHQNFNVLYDLE 430

Query: 361 QAMVSFKPSRC 371
            +++SF P+ C
Sbjct: 431 NSLLSFVPAPC 441


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 187/378 (49%), Gaps = 45/378 (11%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           +++II+    YLM++++GTPP  +    DTGSD  W  C              +F P +S
Sbjct: 90  ESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRS 149

Query: 85  STYNSISCSSSQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTF---NSTS 139
           +TY+ +SC S+ C A+  ++C ++ +C Y + YG G   S + G L+TET +F       
Sbjct: 150 TTYSLLSCQSAACQALSQASCDADSECQYQYAYGDG---SRTIGVLSTETFSFAAAGGGG 206

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL-- 195
              V +P V FGC   +  S  SD    G++GLG G  SL+SQ+G +  IA +FSYCL  
Sbjct: 207 EGQVRVPRVSFGCSTGSAGSFRSD----GLVGLGAGALSLVSQLGAAARIARRFSYCLVP 262

Query: 196 ---PDQGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSSTG 247
                  SS ++FG   +V+  G  STPL+   +  +Y ++LE+++V  Q  +  S+++ 
Sbjct: 263 PYAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ--DVASANSS 320

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQPK- 303
            I VD+G   T L       L + +   I   +AQP       P     LCY++  + + 
Sbjct: 321 RIIVDSGTTLTFLDPALLRPLVAELERRIRLPRAQP-------PEQLLQLCYDVQGKSQA 373

Query: 304 ----FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLI 355
                P+VT+ F  GA V L P N F  + +  +C             + G I Q NF +
Sbjct: 374 EDFGIPDVTLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHV 433

Query: 356 GYDIEQAMVSFKPSRCTN 373
           GYD++   V+F    CT 
Sbjct: 434 GYDLDARTVTFAAVDCTR 451


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 173/358 (48%), Gaps = 36/358 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQCEPC    C+ Q+ P+F+P KS++Y +ISCSS
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARY-CYHQQEPIFNPSKSTSYTNISCSS 196

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  + S      +CS   C Y   YG  +Y   S G  A + L   ST        N 
Sbjct: 197 PTCDELKSGTGNSPSCSASTCVYGIQYGDQSY---SVGFFAQDKLALTSTD----VFNNF 249

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
           +FGCG  N       +   G+IGLG    SL+SQ        FSYCLP   SS   + FG
Sbjct: 250 LFGCGQNNRGLFVGVA---GLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSSTGYLTFG 306

Query: 207 -GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTL 259
            G      V  TP ++       Y+L+L AISVG ++L   +S  ST    +D+G + + 
Sbjct: 307 SGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIIDSGTVISR 366

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
           LP   +S+L++     +   P     A P      CY+ S       P++ ++F  GA++
Sbjct: 367 LPPTAYSDLRASFQQQMSKYP----KAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEM 422

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L PS +F  ++   +C AF G +      + G + Q  F + YD+    + F P  C
Sbjct: 423 DLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 480


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 119/364 (32%), Positives = 180/364 (49%), Gaps = 46/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTPP      VDTGSD  W QC PC    CF+Q  PLF P  SS+Y++ SC+ 
Sbjct: 8   YVLQISLGTPPQQFSAIVDTGSDLCWVQCAPCAR--CFEQPDPLFIPLASSSYSNASCTD 65

Query: 95  SQC-AVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S C A+    CS    C+YS+ YG G   S + G+ A ET+T N ++     +  + FGC
Sbjct: 66  SLCDALPRPTCSMRNTCTYSYSYGDG---SNTRGDFAFETVTLNGST-----LARIGFGC 117

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
           GH       + +   G+IGLG G  SL SQ+ +S    FSYCL DQ +    S I FG  
Sbjct: 118 GHNQEG---TFAGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNA 174

Query: 209 VAGAGVVSTPLIIRD----HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
              +    TPL+  +    +YY+ +E+ISVGN+R+          ++  G + +D+G   
Sbjct: 175 AENSRASFTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTI 234

Query: 258 TLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVTIH 310
           T   L     + + +   I   +A P         +   LCY+IS    S    P +T+H
Sbjct: 235 TYWRLAAFIPILAELRRQISYPEADPTP-------YGLNLCYDISSVSASSLTLPSMTVH 287

Query: 311 FRGADVKLSPSNLFRNISD--EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
               D ++  SNL+  + +  E +C+A    +   + G + Q N LI  D+  + V F  
Sbjct: 288 LTNVDFEIPVSNLWVLVDNFGETVCTAMSTSDQFSIIGNVQQQNNLIVTDVANSRVGFLA 347

Query: 369 SRCT 372
           + C+
Sbjct: 348 TDCS 351


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 172/360 (47%), Gaps = 41/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTPP  ++  +DTGSD  W QC PC +  C+ Q  P+FDPKKS +++SISC S
Sbjct: 147 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRK--CYSQTDPVFDPKKSGSFSSISCRS 204

Query: 95  SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  + S  C S   C Y   YG G   SF+ G  +TETLTF  T      +P V  GC
Sbjct: 205 PLCLRLDSPGCNSRQSCLYQVAYGDG---SFTFGEFSTETLTFRGT-----RVPKVALGC 256

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
           GH N       +   G+        S  +Q G     KFSYCL D+ +    S + FG  
Sbjct: 257 GHDNEGLFVGAAGLLGLGRG---RLSFPTQTGLRFGRKFSYCLVDRSASSKPSSVVFGQS 313

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVL 256
                 V TPLI    +   YYL L  ISVG  R+  +++S         G + +D+G  
Sbjct: 314 AVSRTAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTS 373

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG 313
            T L    + +L+        A  +K     P +S    C+++S  ++ K P V +HFRG
Sbjct: 374 VTRLTRRAYVSLRDAFR--AGAADLK---RAPDYSLFDTCFDLSGKTEVKVPTVVMHFRG 428

Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           ADV L  +N    + ++ + C AF G  + + + G I Q  F + +D+  + + F    C
Sbjct: 429 ADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  160 bits (406), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 121/355 (34%), Positives = 182/355 (51%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP +SSTY +ISC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANISCAA 238

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ + T  CS G+C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 239 PACSDLDTRGCSGGNCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG G  A
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 348

Query: 211 GAGV-VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
            AG  ++TP++  +    YY+ +  I VG Q L    S  +T    VD+G + T LP   
Sbjct: 349 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAA 408

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+S  ++ + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 409 YSSLRSAFASAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C  F     GG+  IV G      F + YDI + +V F P  C
Sbjct: 466 SGIMYAASVSQVCLGFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 119/354 (33%), Positives = 171/354 (48%), Gaps = 32/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC    C++Q+ PLFDP KS+TY +ISCSS
Sbjct: 96  YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAY-CYRQKEPLFDPTKSATYANISCSS 154

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C+ +  S CS G C Y   YG G+Y   + G  A +TLT    +     + N  FGCG
Sbjct: 155 SYCSDLYVSGCSGGHCLYGIQYGDGSY---TIGFYAQDTLTLAYDT-----IKNFRFGCG 206

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
            KN        +  G++GLG G +SL  Q      G F+YCLP    G+  ++ G     
Sbjct: 207 EKNRG---LFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPA 263

Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
           A    TP+++      YY+ +  I VG   L    S  ST    VD+G + T LP   ++
Sbjct: 264 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 323

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP----KFPEVTIHFRGA---DVKL 318
            L+S  S   KA    G  A P FS +  CY+++         P V++ F+G    DV  
Sbjct: 324 PLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 380

Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S      ++S   +  A    + ++ + G   Q    + YDI + +V F P  C
Sbjct: 381 SGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/354 (33%), Positives = 171/354 (48%), Gaps = 32/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC    C++Q+ PLFDP KS+TY +ISCSS
Sbjct: 161 YVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAY-CYRQKEPLFDPTKSATYANISCSS 219

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C+ +  S CS G C Y   YG G   S++ G  A +TLT    +     + N  FGCG
Sbjct: 220 SYCSDLYVSGCSGGHCLYGIQYGDG---SYTIGFYAQDTLTLAYDT-----IKNFRFGCG 271

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
            KN        +  G++GLG G +SL  Q      G F+YCLP    G+  ++ G     
Sbjct: 272 EKNRG---LFGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPA 328

Query: 212 AGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
           A    TP+++      YY+ +  I VG   L    S  ST    VD+G + T LP   ++
Sbjct: 329 ANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYA 388

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP----KFPEVTIHFRGA---DVKL 318
            L+S  S   KA    G  A P FS +  CY+++         P V++ F+G    DV  
Sbjct: 389 PLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDA 445

Query: 319 SPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S      ++S   +  A    + ++ + G   Q    + YDI + +V F P  C
Sbjct: 446 SGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 184/363 (50%), Gaps = 42/363 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           ++L++ S+G P       +DTGS+  W +C PC    C +Q  PL DP KSSTY S+ C+
Sbjct: 98  LFLVNFSMGQPATPQLAIMDTGSNILWVRCAPCKR--CTQQNGPLLDPSKSSTYASLPCT 155

Query: 94  SSQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           ++ C    ++ C+    C Y+  Y  G     S+G LATE L F+S+      +P+V+FG
Sbjct: 156 NTMCHYAPSAYCNRLNQCGYNLSYATGLS---SAGVLATEQLIFHSSDEGVNAVPSVVFG 212

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG 206
           C H+N      D + TG+ GLG G +S +++MG+    KFSYCL     P  G +++ FG
Sbjct: 213 CSHEN--GDYKDRRFTGVFGLGKGITSFVTRMGS----KFSYCLGNIADPHYGYNQLVFG 266

Query: 207 GIVAGAGVVSTPL-IIRDHYYLSLEAISVGNQRLEFVSSS---TGN---IFVDTGVLRTL 259
                 G  STPL ++  HYY++LE ISVG +RL+  S++    GN     +D+G   T 
Sbjct: 267 EKANFEG-YSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSGTALTW 325

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHFR-GAD 315
           L     S  ++ + N ++ Q + GV          CY  +       FP VT HF  GAD
Sbjct: 326 LA---ESAFRA-LDNEVR-QLLDGVLMPFWRGSFACYKGTVSQDLIGFPVVTFHFSGGAD 380

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFKP 368
           + L   ++F   + +I+C A R  +A         V G + Q  + + YD+    + F+ 
Sbjct: 381 LDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFFQR 440

Query: 369 SRC 371
             C
Sbjct: 441 IDC 443


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 175/356 (49%), Gaps = 47/356 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+ GTPP ++  ++DTGSD TWTQC+ CP   CF Q  PLFDP  SS++ S+ CSS
Sbjct: 88  YLVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSS 147

Query: 95  SQCAVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PVEMP 146
             C   T  C  G+      C+YS  YG G   S S G +  E  TF S +G      +P
Sbjct: 148 PACE-TTPPCGGGNDATSRPCNYSISYGDG---SVSRGEIGREVFTFASGTGEGSSAAVP 203

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
            ++FGCGH N    TS+  +TGI G G G+ SL SQ+     G FS+C      SK +  
Sbjct: 204 GLVFGCGHANRGVFTSN--ETGIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSKTS-- 256

Query: 207 GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
            ++ G   V+ P            A  +G +R  +   ST     ++G   T LP   + 
Sbjct: 257 AVLLGLPGVAPP-----------SASPLGRRRGSYRCRSTPRS-SNSGTSITSLPPRTYR 304

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFPEVTIHFRGADVKLSPSN- 322
            ++   +  +K   V G   +P      C++      +P  P + +HF GA ++L   N 
Sbjct: 305 AVREEFAAQVKLPVVPGNATDP----FTCFSAPLRGPKPDVPTMALHFEGATMRLPQENY 360

Query: 323 LFRNISDE-------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +F  + D+       I+C A   G   I+ G I Q N  + YD++ + +SF P++C
Sbjct: 361 VFEVVDDDDAGNSSRIICLAVIEG-GEIILGNIQQQNMHVLYDLQNSKLSFVPAQC 415


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/362 (34%), Positives = 179/362 (49%), Gaps = 47/362 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  +     DTGSD TWTQCEPC +  C+KQ+ P  DP KS++Y +ISCSS
Sbjct: 133 YAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAK-TCYKQKEPRLDPTKSTSYKNISCSS 191

Query: 95  SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + C ++ +    +CS   C Y   YG G+Y   S G  ATETLT +S++       N +F
Sbjct: 192 AFCKLLDTEGGESCSSPTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNFLF 244

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGI 208
           GCG +N       +   G++GLG    SL SQ        FSYCLP   SSK  ++FGG 
Sbjct: 245 GCGQQNSGLFRGAA---GLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYLSFGGQ 301

Query: 209 VAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
           V+   V  TPL         Y L +  +SVG  +L   +S  ST    +D+G + T LP 
Sbjct: 302 VSKT-VKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVITRLPS 360

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---DV 316
             +S L S    ++   P     +  G+S    CY+ S     K P+V + F+G    D+
Sbjct: 361 TAYSALSSAFQKLMTDYP-----STDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDI 415

Query: 317 KLS----PSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
            +S    P N  + +     C AF G   ++   ++G   Q  + + YD  +  V F PS
Sbjct: 416 DVSGILYPVNGLKKV-----CLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPS 470

Query: 370 RC 371
            C
Sbjct: 471 GC 472


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/359 (33%), Positives = 173/359 (48%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++    GTP  +    +DTGSD TW QC+PC   DC+ Q  P+F+P++SS+Y  +SC S
Sbjct: 138 YIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCS--DCYSQVDPIFEPQQSSSYKHLSCLS 195

Query: 95  SQCAVVTS--NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S C  +T+  +C  G C Y   YG G   S S G+ + ETLT  S S      P+  FGC
Sbjct: 196 SACTELTTMNHCRLGGCVYEINYGDG---SRSQGDFSQETLTLGSDS-----FPSFAFGC 247

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGSSKINFG 206
           GH N       +   G++GLG    S  SQ  +   G+FSYCLPD       GS  +  G
Sbjct: 248 GHTNTGLFKGSA---GLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTSTGSFSVGQG 304

Query: 207 GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
            I A A  V  PL+   +    Y++ L  ISVG +RL    +    G   VD+G + T L
Sbjct: 305 SIPATATFV--PLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSGTVITRL 362

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVK 317
             + +  LK+   +  +  P     A+P      CY++S  SQ + P +T HF+  ADV 
Sbjct: 363 VPQAYDALKTSFRSKTRNLP----SAKPFSILDTCYDLSSYSQVRIPTITFHFQNNADVA 418

Query: 318 LSPSNLFRNISDE--IMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +S   +   I  +   +C AF   + +I   + G   Q    + +D     + F P  C
Sbjct: 419 VSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPGSC 477


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 54/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM + IG+PP      +DTGSD  WTQC PC  L C +Q  P F+P KS++Y S+ CSS
Sbjct: 88  YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASLPCSS 145

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C A+ +  C +  C Y   YG  A    S+G LA ET TF + S   V +P V FGCG
Sbjct: 146 AMCNALYSPLCFQNACVYQAFYGDSAS---SAGVLANETFTFGTNS-TRVAVPRVSFGCG 201

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV- 209
           + N  +  + S   G++G G G  SL+SQ+G+    +FSYCL       +S++ FG    
Sbjct: 202 NMNAGTLFNGS---GMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 255

Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--------STGNIF 250
                  +   V STP I+       Y+L++  ISV    L    S         TG + 
Sbjct: 256 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 315

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-----AEPGFSDVLCYNISSQPK-- 303
           +D+G   T L    ++        M++   V  VG     A P  +   C+     P+  
Sbjct: 316 IDSGTTVTFLAQPAYA--------MVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRM 367

Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              PE+ +HF GAD++L   N +  +     +C A    +   + G     NF + YD+E
Sbjct: 368 VTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLE 427

Query: 361 QAMVSFKPSRC 371
            +++SF P+ C
Sbjct: 428 NSLLSFVPAPC 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 178/371 (47%), Gaps = 54/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM + IG+PP      +DTGSD  WTQC PC  L C +Q  P F+P KS++Y S+ CSS
Sbjct: 85  YLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASLPCSS 142

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C A+ +  C +  C Y   YG  A    S+G LA ET TF + S   V +P V FGCG
Sbjct: 143 AMCNALYSPLCFQNACVYQAFYGDSAS---SAGVLANETFTFGTNS-TRVAVPRVSFGCG 198

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV- 209
           + N  +  + S   G++G G G  SL+SQ+G+    +FSYCL       +S++ FG    
Sbjct: 199 NMNAGTLFNGS---GMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 252

Query: 210 -------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--------STGNIF 250
                  +   V STP I+       Y+L++  ISV    L    S         TG + 
Sbjct: 253 LNSTNTSSSGPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVI 312

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-----AEPGFSDVLCYNISSQPK-- 303
           +D+G   T L    ++        M++   V  VG     A P  +   C+     P+  
Sbjct: 313 IDSGTTVTFLAQPAYA--------MVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRM 364

Query: 304 --FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
              PE+ +HF GAD++L   N +  +     +C A    +   + G     NF + YD+E
Sbjct: 365 VTLPEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLE 424

Query: 361 QAMVSFKPSRC 371
            +++SF P+ C
Sbjct: 425 NSLLSFVPAPC 435


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  157 bits (398), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 176/365 (48%), Gaps = 43/365 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PCP   CF Q  P FDP  SST +  SC S
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139

Query: 95  SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + C  +  ++C          C Y++ YG     S ++G L  +  TF    G    +P 
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
           V FGCG  N  +    S +TGI G G G  SL SQ+     G FS+C        P    
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248

Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
             +      +G G V STPLI        YYLSL+ I+VG+ RL     EF + + TG  
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGT 308

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            +D+G   T LP   +  ++   +  +K   V G   +P F   L   + ++P  P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366

Query: 310 HFRGADVKLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           HF GA + L   N    + D    I+C A   G      G   Q N  + YD++ + +SF
Sbjct: 367 HFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSF 426

Query: 367 KPSRC 371
            P++C
Sbjct: 427 VPAQC 431


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 129/377 (34%), Positives = 188/377 (49%), Gaps = 53/377 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G+PP      VDTGSD  W QC+PC +  C+ Q  P++DP  SST+   SCS+
Sbjct: 4   YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQ--CYSQSDPIYDPSASSTFAKTSCST 61

Query: 95  SQCAVV-TSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           S C  +  S CS     C Y + YG    +S + G+ A ETLT  S+ G     PN  FG
Sbjct: 62  SSCQSLPASGCSSSAKTCIYGYQYGD---SSSTQGDFALETLTLRSSGGSSKAFPNFQFG 118

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----QGSSKINFG 206
           CG  N  S        GI+GLG G  SL +Q+G++I  KFSYCL D       +S + FG
Sbjct: 119 CGRLNSGSF---GGAAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFG 175

Query: 207 GIVA-GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVS----------- 243
              + G+G +STP+I       +Y++ LE ISVG ++L       +F+S           
Sbjct: 176 SSASTGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRA 235

Query: 244 ---SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
              +S G IF D+G   TLL    +S +KS  ++ +    V    +  GF   LCY++S 
Sbjct: 236 LEVNSGGTIF-DSGTTLTLLDDAVYSKVKSAFASSVSLPTVD--ASSSGFD--LCYDVSK 290

Query: 301 QP--KFPEVTIHFRGADVKLSPSNLF--RNISDEIMCSAF--RGGNANIVYGRIMQINFL 354
               KFP +T+ F+G        N F   + ++ + C A    G     + G +MQ N+ 
Sbjct: 291 SKNFKFPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYH 350

Query: 355 IGYDIEQAMVSFKPSRC 371
           + YD   + +S  P++C
Sbjct: 351 VVYDRGTSTISMSPAQC 367


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 176/371 (47%), Gaps = 52/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PC  + CF Q  P FD  +SST   + C S
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCES 92

Query: 95  SQCAV-------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           +QC +       V  N +   C+Y   YG     S + G LA +  TF + + L    P 
Sbjct: 93  TQCKLDPTVTVCVKLNQTVQTCAYYTSYGDN---SVTIGLLAADKFTFVAGTSL----PG 145

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGS 200
           V FGCG  N      +S +TGI G G G  SL SQ+     G FS+C       +P    
Sbjct: 146 VTFGCGLNNTG--VFNSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVL 200

Query: 201 SKINFGGIVAGAGVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSST 246
             +       G G V +TPLI           YYLSL+ I+VG+ RL        +++ T
Sbjct: 201 LDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGT 260

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
           G   +D+G   T LP + +  ++   +  IK   V G           C++  SQ  P  
Sbjct: 261 GGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDV 316

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
           P++ +HF GA + L   N    + D+    I+C A   G+   + G   Q N  + YD++
Sbjct: 317 PKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQ 376

Query: 361 QAMVSFKPSRC 371
             M+SF  ++C
Sbjct: 377 NNMLSFVAAQC 387


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 179/355 (50%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q+  LFDP +SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQQEKLFDPARSSTYANVSCAA 237

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  + T  CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 238 PACFDLDTRGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 290

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG G  A
Sbjct: 291 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 347

Query: 211 GAGV-VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
            AG  ++TP++  +    YY+ +  I VG Q L    S  +T    VD+G + T LP   
Sbjct: 348 AAGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 407

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+S     + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 408 YSSLRSA---FVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDA 464

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C  F     GG+  IV G      F + YDI + +V F P  C
Sbjct: 465 SGIMYAASVSQVCLGFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 125/375 (33%), Positives = 182/375 (48%), Gaps = 50/375 (13%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YL+HL+IGTPP  +   +DTGSD  WTQC PCP   CF +     DP  SST++ + 
Sbjct: 412 DTEYLVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPV--CFSRALGPLDPSNSSTFDVLP 469

Query: 92  CSSSQCAVVT-SNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVE 144
           CSS  C  +T S+C + +     C Y + Y  G   S ++G+L  ET TF +  G     
Sbjct: 470 CSSPVCDNLTWSSCGKHNWGNQTCVYVYAYADG---SITTGHLDAETFTFAAADGTGQAT 526

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
           +P++ FGCG  N    TS+  +TGI G G G  SL SQ+       FS+C      S+  
Sbjct: 527 VPDLAFGCGLFNNGIFTSN--ETGIAGFGRGALSLPSQLKVD---NFSHCFTAITGSEPS 581

Query: 203 -------INFGGIVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSS----- 245
                   N      GA V STPL+     +R  YYLSL+ I+VG+ RL    S+     
Sbjct: 582 SVLLGLPANLYSDADGA-VQSTPLVQNFSSLR-AYYLSLKGITVGSTRLPIPESTFALKQ 639

Query: 246 --TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---- 299
             TG   +D+G   T LP + +  +    +  ++  PV    A       LC++ S    
Sbjct: 640 DGTGGTIIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVD--NATSSSLSRLCFSFSVPRR 696

Query: 300 SQPKFPEVTIHFRGADVKLSPSNL---FRNISDEIMCSAFRGGNANIVYGRIMQINFLIG 356
           ++P  P++ +HF GA + L   N    F +    + C A   G+   + G   Q N  + 
Sbjct: 697 AKPDVPKLVLHFEGATLDLPRENYMFEFEDAGGSVTCLAINAGDDLTIIGNYQQQNLHVL 756

Query: 357 YDIEQAMVSFKPSRC 371
           YD+ + M+SF P++C
Sbjct: 757 YDLVRNMLSFVPAQC 771


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 176/365 (48%), Gaps = 43/365 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PCP   CF Q  P FDP  SST +  SC S
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139

Query: 95  SQC-AVVTSNCS------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + C  +  ++C          C Y++ YG     S ++G L  +  TF    G    +P 
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
           V FGCG  N  +    S +TGI G G G  SL SQ+     G FS+C        P    
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248

Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
             +      +G G V STPLI        YYLSL+ I+VG+ RL     EF + + TG  
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGT 308

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            +D+G   T LP   +  ++   +  +K   V G   +P F   L   + ++P  P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366

Query: 310 HFRGADVKLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           HF GA + L   N    + D    I+C A   G      G   Q N  + YD++ + +SF
Sbjct: 367 HFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSF 426

Query: 367 KPSRC 371
            P++C
Sbjct: 427 VPAQC 431


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 114/276 (41%), Positives = 159/276 (57%), Gaps = 32/276 (11%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           +NS K  F+N N T +SP+S  +          YLM LSIGTPPV I+   DTGSD  W 
Sbjct: 36  RNSSK-DFFNRN-TIQSPVSANHYD--------YLMELSIGTPPVKIYAQADTGSDLIWL 85

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCS--EGDCSYSFLYGRG 118
           QC PC   +C+KQ  P+FD + SST+++I+C S  C+ + +++CS  + +C Y++ Y  G
Sbjct: 86  QCIPC--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDG 143

Query: 119 AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
              S + G LA ETLT  ST+G PV    VIFGCGH N  +   + K+ GIIGLG G  S
Sbjct: 144 ---SETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGA--FNDKEMGIIGLGRGPLS 198

Query: 179 LISQMGTSIAGK-FSYCLPDQG-----SSKINF--GGIVAGAGVVSTPLI----IRDHYY 226
           L+SQ+G+S+ G  FS CL         SS ++F  G  V G GVVSTPL+     +  Y+
Sbjct: 199 LVSQIGSSLGGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYF 258

Query: 227 LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPL 262
           ++L  ISV +  L F + S+        V+  + P+
Sbjct: 259 VTLLGISVEDINLPFNAGSSLEPAAKGNVIPQIWPV 294


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 117/354 (33%), Positives = 178/354 (50%), Gaps = 31/354 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C+KQ+  LFDP +SSTY ++SC++
Sbjct: 182 YVVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYKQQEKLFDPARSSTYANVSCAA 240

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ + T  CS G C YS  YG G+Y   S G  A +TLT +S   +        FGCG
Sbjct: 241 PACSDLYTRGCSGGHCLYSVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 293

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG-GIVA 210
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG G  A
Sbjct: 294 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPA 350

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
             G   T  ++ D+    YY+ +  I VG Q L    S  ST    VD+G + T LP   
Sbjct: 351 AVGARQTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAA 410

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+S  ++ + A   +G    P  S +  CY+ +  S+   P+V++ F+ GA + ++ 
Sbjct: 411 YSSLRSAFASAMAA---RGYKKAPALSLLDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNA 467

Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C  F     +    + G      F + YDI +  V F P  C
Sbjct: 468 SGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/379 (32%), Positives = 181/379 (47%), Gaps = 51/379 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL------DCFKQEPPLFDPKKSSTYN 88
           Y+M LSIGTPP+      DTGSD  WTQC PC +        CFKQ   L++P  S+T+ 
Sbjct: 87  YIMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFG 146

Query: 89  SISCSS--SQCAVVTSNCSEGDCS--YSFLYGRGAYASFSSGNLATETLTFNSTSGLP-V 143
            + C+S  S CA +        C+  Y+  YG G    +++G  + ET TF S+S  P V
Sbjct: 147 VLPCNSPLSMCAAMAGPSPPPGCACMYNQTYGTG----WTAGVQSVETFTFGSSSTPPAV 202

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGS 200
            +PN+ FGC +   AS    +   G++GLG G+ SL+SQ+G   AG FSYCL    D  S
Sbjct: 203 RVPNIAFGCSN---ASSNDWNGSAGLVGLGRGSMSLVSQLG---AGAFSYCLTPFQDANS 256

Query: 201 SKINFGGIVAGAG------VVSTPLI-------IRDHYYLSLEAISVGNQRL-------E 240
           +     G  A A       V STP +       +  +YYL+L  ISVG   L        
Sbjct: 257 TSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFS 316

Query: 241 FVSSSTGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
             +  TG + +D+G  + TL+   Y     +V S ++   P+   G +      LC+ + 
Sbjct: 317 LRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAH-GPDHSTGLDLCFALK 375

Query: 300 SQ---PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFL 354
           +    P  P +T+HF G    + P   +  +   + C A R     A  + G   Q N  
Sbjct: 376 ASTPPPAMPSMTLHFEGGADMVLPVENYMILGSGVWCLAMRNQTVGAMSMVGNYQQQNIH 435

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD+ +  +SF P+ C++
Sbjct: 436 VLYDVRKETLSFAPAVCSS 454


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 175/359 (48%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  D     DTGSD TWTQCEPC    CF Q    FDP KS++Y ++SCSS
Sbjct: 132 YAVTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSG-GCFPQNDEKFDPTKSTSYKNLSCSS 190

Query: 95  SQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +    ++G      C Y   YG G    ++ G LATETLT   +        N +
Sbjct: 191 EPCKSIGKESAQGCSSSNSCLYGVKYGTG----YTVGFLATETLTITPSD----VFENFV 242

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGG 207
            GCG +N       S   G++GLG    +L SQ  ++    FSYCLP   SS   ++FGG
Sbjct: 243 IGCGERNGG---RFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGG 299

Query: 208 IVAGAGVVSTPLI--IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
            V+ A    TP+   I + Y L +  ISVG ++L    S   T    +D+G   T LP  
Sbjct: 300 GVSQAAKF-TPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPST 358

Query: 264 YHSNLKSVMSNMIKAQPV-KGV-GAEPGFSDVLCYNISSQPK----FPEVTIHFRGA-DV 316
            HS L S    M+    + KG  G +P      CY+ S         P+++I F G  +V
Sbjct: 359 AHSALSSAFQEMMTNYTLTKGTSGLQP------CYDFSKHANDNITIPQISIFFEGGVEV 412

Query: 317 KLSPSNLFRNISD-EIMCSAFR--GGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +  S +F   +  E +C AF+  G + ++ ++G + Q  + + YD+ + MV F P  C
Sbjct: 413 DIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 110/366 (30%), Positives = 173/366 (47%), Gaps = 34/366 (9%)

Query: 23  IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           +++  + S +  YL+ +S G PP      VDTGSD  W QC PC    C++     FDP 
Sbjct: 78  LFETPVASGNGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKS--CYETLSAKFDPS 135

Query: 83  KSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           KS++Y ++ C S+ C  +        C Y ++YG G   S +SG L+T+ +T  +     
Sbjct: 136 KSASYKTLGCGSNFCQDLPFQSCAASCQYDYMYGDG---SSTSGALSTDDVTIGTG---- 188

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
            ++PNV FGCG+ NL +        G+        SL+SQ+G +   KFSYCL   GS+K
Sbjct: 189 -KIPNVAFGCGNSNLGTFAGAGGLVGLGKG---PLSLVSQLGGTATKKFSYCLVPLGSTK 244

Query: 203 ---INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEF-------VSSSTGN 248
              +  G      GV  TP++  ++    YY  L+ ISV  + + +        ++  G 
Sbjct: 245 TSPLYIGDSTLAGGVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGG 304

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISSQPKFPE 306
           + +D+G   T L ++  + + + +   +      G   G E  FS     N    P +P 
Sbjct: 305 LILDSGTTLTYLDVDAFNPMVAALKAALPYPEADGSFYGLEYCFSTAGVAN----PTYPT 360

Query: 307 VTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           V  HF GADV L+P N F  +  E   C A        ++G I Q+N +I +D+    + 
Sbjct: 361 VVFHFNGADVALAPDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIG 420

Query: 366 FKPSRC 371
           FK + C
Sbjct: 421 FKSANC 426


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/354 (33%), Positives = 183/354 (51%), Gaps = 37/354 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC+PCP   C+ Q  PLFDP +SS+Y+++ C++
Sbjct: 142 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 201

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + C   A+ ++ CS G C Y   YG G   S ++G  +++TLT   ++ L       +FG
Sbjct: 202 ASCSQLALYSNGCSGGQCGYVVSYGDG---STTTGVYSSDTLTLTGSNAL----KGFLFG 254

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGGIV 209
           CGH   A     +   G++GLG    SL+SQ  ++  G FSYCLP   +S   I+ GG  
Sbjct: 255 CGH---AQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPS 311

Query: 210 AGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
           + AG  +TPL+       +Y + L  ISVG Q L   +S  ++G + VDTG + T LP  
Sbjct: 312 STAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPT 370

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSP 320
            +S L+S     +         A  G  D  CY+ +       P ++I F  GA + L  
Sbjct: 371 AYSALRSAFRAAMAPYGYPSAPAT-GILDT-CYDFTRYGTVTLPTISIAFGGGAAMDLGT 428

Query: 321 SNLFRNISDEIMCSAFR--GGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +  +      C AF   GG++   + G + Q +F + +D   + V F P+ C
Sbjct: 429 SGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 475


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/361 (33%), Positives = 175/361 (48%), Gaps = 37/361 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L+++SIG+PPV     +DT SD  W QC PC  ++C+ Q  P+FDP +S T+ + SC +
Sbjct: 85  FLVNISIGSPPVTQLLHMDTASDLLWLQCRPC--INCYAQSLPIFDPSRSYTHRNESCRT 142

Query: 95  SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEMPNVIF 150
           SQ ++  +  N     C YS  Y  G   + S G LA E L FN+         + +V+F
Sbjct: 143 SQYSMPSLRFNAKTRSCEYSMRYMDG---TGSKGILAKEMLMFNTIYDESSSAALHDVVF 199

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GCGH N   P      TGI+GLG G  SL+ + GT    KFSYC             +V 
Sbjct: 200 GCGHDNYGEPLVG---TGILGLGYGEFSLVHRFGT----KFSYCFGSLDDPSYPHNVLVL 252

Query: 211 ---GAGVV--STPL-IIRDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTGVL 256
              GA ++  +TPL I    YY+++EAISV    L            +  G   +DTG  
Sbjct: 253 GDDGANILGDTTPLEIYNGFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTIHF 311
            T L  E +  LK+ + +  + +       +     V CYN + +       FP VT HF
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDMFKVECYNGNLERDLVESGFPIVTFHF 372

Query: 312 R-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
             GA++ L   ++F  +S  + C A   GN N + G   Q ++ IGYD+E   +SF+   
Sbjct: 373 SDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNSI-GATAQQSYNIGYDLEAKKISFERID 431

Query: 371 C 371
           C
Sbjct: 432 C 432


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/365 (33%), Positives = 172/365 (47%), Gaps = 50/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTPP  ++  +DTGSD  W QC PC    C+ Q  P+FDP+KS ++ SI+C S
Sbjct: 126 YFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKR--CYAQSDPVFDPRKSRSFASIACRS 183

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   N  +  C Y   YG G   SF+ G+ +TETLTF  T      +  V  G
Sbjct: 184 PLCHRLDSPGCNTQKQTCMYQVSYGDG---SFTFGDFSTETLTFRRT-----RVARVALG 235

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S  SQ G     KFSYCL D+ +    S + FG 
Sbjct: 236 CGHDNEGLFVGAAGLLGLGRG---RLSFPSQTGRRFNHKFSYCLVDRSASSKPSSMVFGD 292

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTG- 254
                    TPL+    +   YY+ L  ISVG  R+  +++S      TGN  + +D+G 
Sbjct: 293 SAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGT 352

Query: 255 -VLRTLLP--LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
            V R   P  + +    ++  SN+ +A         P FS    C+++S  ++ K P V 
Sbjct: 353 SVTRLTRPAYIAFRDAFRAGASNLKRA---------PQFSLFDTCFDLSGKTEVKVPTVV 403

Query: 309 IHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
           +HFRGADV L  SN    + +    C AF G    + + G I Q  F + YD+  + V F
Sbjct: 404 LHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGF 463

Query: 367 KPSRC 371
            P  C
Sbjct: 464 APHGC 468


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 183/352 (51%), Gaps = 33/352 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC+PCP   C+ Q  PLFDP +SS+Y+++ C++
Sbjct: 131 YVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAA 190

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + C   A+ ++ CS G C Y   YG G   S ++G  +++TLT   ++ L       +FG
Sbjct: 191 ASCSQLALYSNGCSGGQCGYVVSYGDG---STTTGVYSSDTLTLTGSNAL----KGFLFG 243

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINFGGIV 209
           CGH   A     +   G++GLG    SL+SQ  ++  G FSYCLP   +S   I+ GG  
Sbjct: 244 CGH---AQQGLFAGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQNSVGYISLGGPS 300

Query: 210 AGAGVVSTPLIIRD----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
           + AG  +TPL+       +Y + L  ISVG Q L   +S  ++G + VDTG + T LP  
Sbjct: 301 STAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAV-VDTGTVVTRLPPT 359

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSP 320
            +S L+S     +         A  G  D  CY+ +       P ++I F  GA + L  
Sbjct: 360 AYSALRSAFRAAMAPYGYPSAPAT-GILDT-CYDFTRYGTVTLPTISIAFGGGAAMDLGT 417

Query: 321 SNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +   ++   +  A  GG++   + G + Q +F + +D   + V F P+ C
Sbjct: 418 SGI---LTSGCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPASC 464


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 178/362 (49%), Gaps = 44/362 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP  D+   +DTGSD +W QC+PCP  DC++Q   LFDP KSSTY+ I+CSS
Sbjct: 134 YFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--DCYEQHEALFDPSKSSTYSDITCSS 191

Query: 95  SQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
            +C  + S    NC S+  C Y   Y   A  S++ GNLA +TLT + T      +P  +
Sbjct: 192 RECQELGSSHKHNCSSDKKCPYEITY---ADDSYTVGNLARDTLTLSPTDA----VPGFV 244

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGG 207
           FGCGH N     S  +  G++GLG G +SL SQ+       FSYCLP   S+   ++F G
Sbjct: 245 FGCGHNNAG---SFGEIDGLLGLGRGKASLSSQVAARYGAGFSYCLPSSPSATGYLSFSG 301

Query: 208 IVAGAGVVS--TPLIIRDH---YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
             A A   +  T ++   H   YYL+L  I+V  + ++    V ++     +D+G   + 
Sbjct: 302 AAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPPSVFATAAGTIIDSGTAFSC 361

Query: 260 LPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
           LP   ++ L+S + + +   K  P   +     F    CY+++     + P V + F  G
Sbjct: 362 LPPSAYAALRSSVRSAMGRYKRAPSSTI-----FD--TCYDLTGHETVRIPSVALVFADG 414

Query: 314 ADVKLSPSNLFRNISD-EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
           A V L PS +    S+    C AF     +    V G   Q    + YD++   V F  +
Sbjct: 415 ATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGAN 474

Query: 370 RC 371
            C
Sbjct: 475 GC 476


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 170/366 (46%), Gaps = 42/366 (11%)

Query: 35  YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+H  IGTP P  +   VDTGSD  WTQC PC   DCF Q  P FD   S T + + C+
Sbjct: 92  YLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVHGVLCT 149

Query: 94  SSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
              C A+    C  G C+Y   YG     S + G LA ++ TF+   G  V +P+++FGC
Sbjct: 150 DPICRALRPHACFLGGCTYQVNYGDN---SVTIGQLAKDSFTFDGKGGGKVTVPDLVFGC 206

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGIV 209
           G  N  +    S +TGI G G G  SL  Q+G S    FSYC     +  S+ +  GG  
Sbjct: 207 GQYNTGN--FHSNETGIAGFGRGPLSLPRQLGVS---SFSYCFTTIFESKSTPVFLGGAP 261

Query: 210 AG-------AGVVSTPLIIR--DHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDT 253
           A          ++STP +    ++YYLSL+ I+VG  RL    S+       +G   +D+
Sbjct: 262 ADGLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDS 321

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-AEPGFSDVLCYNISSQPK-----FPEV 307
           G   T  P    +  +S+    +   P+      + G   + C++  S P       P++
Sbjct: 322 GTAITAFP---RAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKM 378

Query: 308 TIHFRGADVKLSPSNLFRNI--SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           T+H  GAD +L   N       SD++      G +   + G   Q N  I +D+    + 
Sbjct: 379 TLHLEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLV 438

Query: 366 FKPSRC 371
            +P++C
Sbjct: 439 IEPAQC 444


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 170/361 (47%), Gaps = 40/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTPP  ++  +DTGSD  W QC PC +  C+ Q  P+F+P KS ++  I CSS
Sbjct: 110 YFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRK--CYSQSDPIFNPYKSKSFAGIPCSS 167

Query: 95  SQCAVV-TSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + +S CS     C Y   YG G   SF++G+ ATETLTF        ++  V  G
Sbjct: 168 PLCRRLDSSGCSTRRHTCLYQVSYGDG---SFTTGDFATETLTFRGN-----KIAKVALG 219

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S  SQ G     KFSYCL D+ +    S + FG 
Sbjct: 220 CGHHNEGLFVGAAGLLGLGRG---RLSFPSQTGIRFNHKFSYCLVDRSASSKPSSMVFGD 276

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPLI    +   YY+ L  ISVG  R+  VS S         G + +D+G 
Sbjct: 277 AAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGT 336

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRG 313
             T L    ++ L+      + A+ +K  G E    D  CY++S Q   K P V +HFRG
Sbjct: 337 SVTRLTRPAYTALRDAFR--VGARHLK-RGPEFSLFDT-CYDLSGQSSVKVPTVVLHFRG 392

Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           AD+ L  +N    + +    C AF G  + + + G I Q  F + YD+  + + F P  C
Sbjct: 393 ADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452

Query: 372 T 372
           T
Sbjct: 453 T 453


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YL+H++IGTPP  +   +DTGSD TWTQC PC  + CF+Q  P F+P +S T++ + 
Sbjct: 108 DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 165

Query: 92  CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
           C    C  +T S+C E     G C Y++ Y   A  S ++G+L ++T +F S        
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 222

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            +P++ FGCG  N  +    S +TGI G   G  S+ +Q+       FSYC      S+ 
Sbjct: 223 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 277

Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
           +  F G+           G GVV +  +IR H      YY+SL+ ++VG  RL    S  
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 337

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
                 TG   VD+G   T+LP      + +++ +   AQ    V         LC+++ 
Sbjct: 338 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 393

Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
             ++P  P + +HF GA + L   N    I +     + C A   G    V G   Q N 
Sbjct: 394 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 453

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + YD+   M+SF P+RC
Sbjct: 454 HVLYDLANDMLSFVPARC 471


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 119/375 (31%), Positives = 177/375 (47%), Gaps = 47/375 (12%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +++ +  YLM + IGTPP      +DTGSD  WTQC PC  + C  Q  P FDP +S +Y
Sbjct: 82  VLASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSY 139

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
             + C+S  C A+    C    C Y + YG  A    ++G L+ ET TF  T+   V +P
Sbjct: 140 AKLPCNSPMCNALYYPLCYRNVCVYQYFYGDSAN---TAGVLSNETFTFG-TNDTRVTVP 195

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
            + FGCG+ N  S  + S   G++G G G  SL+SQ+G+    +FSYCL    S   S++
Sbjct: 196 RIAFGCGNLNAGSLFNGS---GMVGFGRGPLSLVSQLGSP---RFSYCLTSFMSPVPSRL 249

Query: 204 NFGGIV--------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS------- 244
            FG            G  V STP I+       YYL++  ISVG + L    S       
Sbjct: 250 YFGAYATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDA 309

Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQ 301
             TG + +D+G   T L    +  +    ++ +   P+    +    +DVL  C+     
Sbjct: 310 DGTGGVIIDSGSTITYLARAAYDMVHQAFADQV-GLPLTNATS---LADVLDTCFVWPPP 365

Query: 302 PK----FPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
           P+     PE+  HF GA+++L   N      D   +C A    +   + G     NF + 
Sbjct: 366 PRKIVTMPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVL 425

Query: 357 YDIEQAMVSFKPSRC 371
           YD E +++SF P+ C
Sbjct: 426 YDNENSLLSFTPATC 440


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YL+H++IGTPP  +   +DTGSD TWTQC PC  + CF+Q  P F+P +S T++ + 
Sbjct: 82  DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 139

Query: 92  CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
           C    C  +T S+C E     G C Y++ Y   A  S ++G+L ++T +F S        
Sbjct: 140 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 196

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            +P++ FGCG  N  +    S +TGI G   G  S+ +Q+       FSYC      S+ 
Sbjct: 197 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 251

Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
           +  F G+           G GVV +  +IR H      YY+SL+ ++VG  RL    S  
Sbjct: 252 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 311

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
                 TG   VD+G   T+LP      + +++ +   AQ    V         LC+++ 
Sbjct: 312 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 367

Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
             ++P  P + +HF GA + L   N    I +     + C A   G    V G   Q N 
Sbjct: 368 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 427

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + YD+   M+SF P+RC
Sbjct: 428 HVLYDLANDMLSFVPARC 445


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 185/373 (49%), Gaps = 42/373 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IGTPP +     DTGSD TW QC PCP+  C+ Q+ PLFDP KSSTY  + CS+
Sbjct: 122 YVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSA 181

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            +C    V  + C    C YS  YG     S + G+LA ET T +  S L      V+FG
Sbjct: 182 PECHIGGVQQTRCGATSCEYSVKYGD---ESETHGSLAEETFTLSPPSPLAPAATGVVFG 238

Query: 152 CGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSI---AGKFSYCLPDQGSSK--IN 204
           C H+ + S  +D+     G++GLG G+SS++SQ   SI    G FSYCLP +GSS   + 
Sbjct: 239 CSHEYI-SVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSSTGYLT 297

Query: 205 FGGIVAG-----AGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVD 252
            GG  A      + +  TPLI     +R  Y ++L  +SV    ++  +S  S G + +D
Sbjct: 298 IGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLGAV-ID 356

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIH 310
           +G + T +P   +  L+      + +  +   G+        CY+++ Q     P V + 
Sbjct: 357 SGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDT--CYDVTGQDVVTAPRVALE 414

Query: 311 F-RGADVKLSPSNLFRNISDE--------IMCSAFRGGNAN--IVYGRIMQINFLIGYDI 359
           F  GA + +  S +   +  E        + C AF   N+   ++ G + Q  + + +D+
Sbjct: 415 FGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQQRAYNVVFDV 474

Query: 360 EQAMVSFKPSRCT 372
           +   + F P+ C+
Sbjct: 475 DGGRIGFGPNGCS 487


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 52/378 (13%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  YL+H++IGTPP  +   +DTGSD TWTQC PC  + CF+Q  P F+P +S T++ + 
Sbjct: 108 DTEYLVHMAIGTPPQPVQLILDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLP 165

Query: 92  CSSSQCAVVT-SNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL--PV 143
           C    C  +T S+C E     G C Y++ Y   A  S ++G+L ++T +F S        
Sbjct: 166 CDLRICRDLTWSSCGEQSWGNGICVYAYAY---ADHSITTGHLDSDTFSFASADHAIGGA 222

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            +P++ FGCG  N  +    S +TGI G   G  S+ +Q+       FSYC      S+ 
Sbjct: 223 SVPDLTFGCGLFN--NGIFVSNETGIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEP 277

Query: 204 N--FGGIV---------AGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSS-- 244
           +  F G+           G GVV +  +IR H      YY+SL+ ++VG  RL    S  
Sbjct: 278 SPVFLGVPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVF 337

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
                 TG   VD+G   T+LP      + +++ +   AQ    V         LC+++ 
Sbjct: 338 ALKEDGTGGTIVDSGTGMTMLP----EAVYNLVCDAFVAQTKLTVHNSTSSLSQLCFSVP 393

Query: 299 -SSQPKFPEVTIHFRGADVKLSPSNLFRNISD----EIMCSAFRGGNANIVYGRIMQINF 353
             ++P  P + +HF GA + L   N    I +     + C A   G    V G   Q N 
Sbjct: 394 PGAKPDVPALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDLSVIGNFQQQNM 453

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + YD+   M+SF P+RC
Sbjct: 454 HVLYDLANDMLSFVPARC 471


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 176/364 (48%), Gaps = 44/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP + +  VD+GSD  W QC PC E  C++Q  PLFDP  S+++ ++ C S
Sbjct: 133 YFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAE--CYQQADPLFDPAASASFTAVPCDS 190

Query: 95  SQCAVV---TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C  +   +S C++ G C Y   YG G+Y   + G LA ETLTF  ++  PV+   V  
Sbjct: 191 GVCRTLPGGSSGCADSGACRYQVSYGDGSY---TQGVLAMETLTFGDST--PVQ--GVAI 243

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GCGH+N           G++GLG G  SL+ Q+G +  G FSYCL  +G+     G +V 
Sbjct: 244 GCGHRNRGLFVG---AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGADA-GAGSLVF 299

Query: 211 G------AGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDT 253
           G       G V  PL+        YY+ L  + VG +RL       +      G + +DT
Sbjct: 300 GRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDT 359

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
           G   T LP + ++ L+   ++ I     +     PG S +  CY++S  +  + P V ++
Sbjct: 360 GTAVTRLPPDAYAALRDAFASTIGGDLPR----APGVSLLDTCYDLSGYASVRVPTVALY 415

Query: 311 F--RGADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           F   GA + L   NL   +   + C AF    + + + G I Q    I  D     V F 
Sbjct: 416 FGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFG 475

Query: 368 PSRC 371
           PS C
Sbjct: 476 PSTC 479


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 171/369 (46%), Gaps = 43/369 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M LS+GTPP+     +DTGSD TWTQC PC    CF Q  PL+DP +SST++ + C+S
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---EMPNV 148
             C  + S    C+   C Y + Y  G    F++G LA +TL      G          V
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG----FTAGYLAADTLAIGDGDGDGDASSSFAGV 210

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINF 205
            FGC   N       S   GI+GLG    SL+SQ+G    G+FSYCL    D G+S I F
Sbjct: 211 AFGCSTANGGDMDGAS---GIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILF 264

Query: 206 GGI--VAGAGVVST-----PLIIRD---HYYLSLEAISVGNQRLE-------FVSSSTGN 248
           G +  V G  V ST     P+  R    +YY++L  I+VG+  L        F ++  G 
Sbjct: 265 GALANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGG 324

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEV 307
           + VD+G   T L    ++ L+    +       +  GA+  F   LC+   ++    P +
Sbjct: 325 VIVDSGTTFTYLAEAGYTMLRQAFLSQTAGLLTRVSGAQFDFD--LCFEAGAADTPVPRL 382

Query: 308 TIHFRGADVKLSPSNLFRNISDE---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
              F G      P   + +  DE   + C          V G +MQ++  + YD++ A  
Sbjct: 383 VFRFAGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATF 442

Query: 365 SFKPSRCTN 373
           SF P+ C +
Sbjct: 443 SFAPADCAS 451


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 114/355 (32%), Positives = 171/355 (48%), Gaps = 35/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP       VDTGS  TW QC PC  + C +Q  PL+DP+ SSTY ++ CS+
Sbjct: 134 YVTELGLGTPATSYAMVVDTGSSLTWLQCSPC-VVSCHRQVGPLYDPRASSTYATVPCSA 192

Query: 95  SQC------AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           SQC       +  S CS  + C Y   YG    +SFS G L+ +T++F S S      PN
Sbjct: 193 SQCDELQAATLNPSACSVRNVCIYQASYGD---SSFSVGYLSRDTVSFGSGS-----YPN 244

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   S+     G
Sbjct: 245 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIG 301

Query: 208 IVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
                    TP+    +    Y+++L  +SVG   L    +   ++   +D+G + T LP
Sbjct: 302 PYTSGHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLP 361

Query: 262 LEYHSNL-KSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
              ++ L K+V + M+      GV + P FS  D      +SQ + P V + F  GA +K
Sbjct: 362 TAVYTALSKAVAAAMV------GVQSAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLK 415

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L+  N+  ++ D   C AF   ++  + G   Q  F + YD+ Q+ + F    C+
Sbjct: 416 LATQNVLIDVDDSTTCLAFAPTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP +SSTY ++SC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 238

Query: 95  SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +    CS G C Y   YG G+Y   S G  A +TLT +S   +        FGCG
Sbjct: 239 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAV----KGFRFGCG 291

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGG--IV 209
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG   + 
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLA 348

Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           A    ++TP++  +    YY+ +  I VG Q L    S  +T    VD+G + T LP   
Sbjct: 349 AARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+   +  + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 409 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 466 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 113/361 (31%), Positives = 176/361 (48%), Gaps = 46/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  D++  +DTGSD  W QCEPC   DC++Q  P+F+P  SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPCA--DCYQQSDPVFNPTSSSTYKSLTCSA 219

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC+++ TS C    C Y   YG G   SF+ G LAT+T+TF ++     ++ NV  GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINNVALGCG 272

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N      +   TG  GL      ++S      A  FSYCL D+ S K   ++F  +  
Sbjct: 273 HDN------EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
           G G  + PL+    I   YY+ L   SVG +++       +  +S +G + +D G   T 
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHFRG 313
           L  + +++L+     +        V  + G S +     CY+ S  S  K P V  HF G
Sbjct: 387 LQTQAYNSLRDAFLKLT-------VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTG 439

Query: 314 AD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
              + L   N    + D    C AF   ++++ + G + Q    I YD+ + ++    ++
Sbjct: 440 GKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNK 499

Query: 371 C 371
           C
Sbjct: 500 C 500


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 118/359 (32%), Positives = 166/359 (46%), Gaps = 32/359 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL  + +GTP       VDTGSD TW QC PC    C+ Q   LF P  S+++  ++C +
Sbjct: 3   YLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGT--CYSQNDSLFIPNTSTSFTKLACGT 60

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  +    C++  C Y + YG G   S S+G+   +T+T +  +G   ++PN  FGCG
Sbjct: 61  ELCNGLPYPMCNQTTCVYWYSYGDG---SLSTGDFVYDTITMDGINGQKQQVPNFAFGCG 117

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINFG-- 206
           H N     S +   GI+GLG G  S  SQ+ T   GKFSYCL     P   +S + FG  
Sbjct: 118 HDNEG---SFAGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDA 174

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTG 254
            +    GV    L+    +  +YY+ L  ISVG + L   S++         G IF D+G
Sbjct: 175 AVPTFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIF-DSG 233

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFRG 313
              T L  E H  + + M+      P K   +  G    L      Q P  P +T HF G
Sbjct: 234 TTVTQLAGEVHQEVLAAMNASTMDYPRKSDDSS-GLDLCLGGFAEGQLPTVPSMTFHFEG 292

Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            D++L PSN F  + S +  C +        + G I Q NF + YD     + F P  C
Sbjct: 293 GDMELPPSNYFIFLESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  152 bits (384), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 177/357 (49%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  +++  +DTGSD  W QCEPC   DC++Q  P+F+P  SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCS--DCYQQSDPVFNPTSSSTYKSLTCSA 219

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC+++ TS C    C Y   YG G   SF+ G LAT+T+TF ++     ++ +V  GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINDVALGCG 272

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N    T  +   G+ G      S+ +QM    A  FSYCL D+ S K   ++F  +  
Sbjct: 273 HDNEGLFTGAAGLLGLGGG---ALSITNQMK---ATSFSYCLVDRDSGKSSSLDFNSVQL 326

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
           G+G  + PL+    I   YY+ L   SVG Q++       +  +S +G + +D G   T 
Sbjct: 327 GSGDATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTR 386

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-V 316
           L  + +++L+     +      KG  +   F    CY+ S  S  K P V  HF G   +
Sbjct: 387 LQTQAYNSLRDAFLKLT-TNLKKGTSSISLFD--TCYDFSSLSSVKVPTVAFHFTGGKSL 443

Query: 317 KLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N    + D    C AF   ++++ + G + Q    I YD+   ++    ++C
Sbjct: 444 DLPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 117/361 (32%), Positives = 163/361 (45%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP   ++  +DTGSD  W QC PC    C+ Q  P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   N     C Y   YG G   SF+ G+ +TETLTF         +  V  G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S   Q G     KFSYCL D+ +    S + FG 
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPL+    +   YY+ L  ISVG  R+  V++S         G + +D+G 
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGT 368

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    +  ++       KA     +   P FS    C+++S  ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAFRVGAKA-----LKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423

Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           GADV L  +N    + ++   C AF G    + + G I Q  F + YD+  + V F P  
Sbjct: 424 GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483

Query: 371 C 371
           C
Sbjct: 484 C 484


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 180/361 (49%), Gaps = 39/361 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQCEPC    C+KQ+  +FDP KSS+Y +I+C+S
Sbjct: 46  YVVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAG-SCYKQQDAIFDPSKSSSYTNITCTS 104

Query: 95  SQCAVVTSNCSEGDCS--------YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           S C  +TS+  + +CS        Y   YG     S S G L+ E LT  +T      + 
Sbjct: 105 SLCTQLTSDGIKSECSSSTDASCIYDAKYGDN---STSVGFLSQERLTITATD----IVD 157

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
           + +FGCG  N       +   G++GLG    S++ Q  ++    FSYCLP   SS   + 
Sbjct: 158 DFLFGCGQDNEGLFNGSA---GLMGLGRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLT 214

Query: 205 FGGIVA-GAGVVSTPL--IIRDH--YYLSLEAISVGNQRLEFVSSST---GNIFVDTGVL 256
           FG   A  A ++ TPL  I  D+  Y L + +ISVG  +L  VSSST   G   +D+G +
Sbjct: 215 FGASAATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTV 274

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA 314
            T L    ++ L+S     ++  PV     E G  D  CY++S   +   P +   F G 
Sbjct: 275 ITRLAPTVYAALRSAFRRXMEKYPVAN---EAGLLDT-CYDLSGYKEISVPRIDFEFSGG 330

Query: 315 -DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
             V+L    +    S++ +C AF    ++    V+G + Q    + YD++   + F  + 
Sbjct: 331 VTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKGGRIGFGAAG 390

Query: 371 C 371
           C
Sbjct: 391 C 391


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 169/350 (48%), Gaps = 29/350 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +  GTP  +     DTGS+  W QC+PC  + C+ Q+ PLFDP  SSTY +ISC+S
Sbjct: 16  YVITVGFGTPKKNQTVIFDTGSNVNWIQCKPC-VVSCYPQQEPLFDPTLSSTYRNISCTS 74

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C  ++S  CS   C Y   YG G   S + G LATET T  + +       N IFGCG
Sbjct: 75  AACTGLSSRGCSGSTCVYGVTYGDG---SSTVGFLATETFTLAAGN----VFNNFIFGCG 127

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
             N    T  +   G+IGLG    SL SQ+ TS+   FSYCLP   S+   +N G  +  
Sbjct: 128 QNNQGLFTGAA---GLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRT 184

Query: 212 AGVVSTPLIIR--DHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHS 266
            G  +     R    Y++ L  ISVG  RL   S+   S G I +D+G + T LP   + 
Sbjct: 185 PGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTI-IDSGTVITRLPPTAYG 243

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSPSNLF 324
            L++     +  Q  +   A     D  CY+ S  +   FP + +H+ G DV +  + +F
Sbjct: 244 ALRTAFRAAMT-QYTRAAAAS--ILDT-CYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVF 299

Query: 325 RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             IS   +C AF G + +    + G + Q    + YD     + F    C
Sbjct: 300 YVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 117/361 (32%), Positives = 165/361 (45%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP   ++  +DTGSD  W QC PC    C+ Q  P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   N     C Y   YG G   SF+ G+ +TETLTF         +  V  G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S   Q G     KFSYCL D+ +    S + FG 
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPL+    +   YY+ L  ISVG  R+  V++S         G + +D+G 
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    +  ++      + A+ +K     P FS    C+++S  ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAF--RVGAKTLK---RAPDFSLFDTCFDLSNMNEVKVPTVVLHFR 423

Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           GADV L  +N    + ++   C AF G    + + G I Q  F + YD+  + V F P  
Sbjct: 424 GADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483

Query: 371 C 371
           C
Sbjct: 484 C 484


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  151 bits (382), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 176/354 (49%), Gaps = 31/354 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QCEPC  + C+KQ+  LFDP +SSTY +ISC++
Sbjct: 161 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVV-CYKQQEKLFDPARSSTYANISCAA 219

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +    CS G C Y   YG G+Y   S G  A +TLT +S   +        FGCG
Sbjct: 220 PACSDLYIKGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAI----KGFRFGCG 272

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINF--GGIV 209
            +N        +  G++GLG G +SL  Q      G F++C P +  G+  ++F  G + 
Sbjct: 273 ERNEG---LYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSLP 329

Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           A +  ++TP+++ +    YY+ L  I VG + L    S  +T    VD+G + T LP   
Sbjct: 330 AVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITRLPPAA 389

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+S  ++   A   +G    P  S +  CY+ +  S+   P V++ F+ GA + +  
Sbjct: 390 YSSLRSAFAS---AMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDVHA 446

Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S    C  F G   +    + G      F + YDI + +V F P  C
Sbjct: 447 SGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 176/371 (47%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PCP   CF Q  P FDP  SST +  SC S
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 92

Query: 95  SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + C  +  ++C          C Y++ YG     S ++G L  +  TF    G    +P 
Sbjct: 93  TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 146

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGS 200
           V FGCG  N  +    S +TGI G G G  SL SQ+     G FS+C       +P    
Sbjct: 147 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVL 201

Query: 201 SKINFGGIVAGAGVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSST 246
             +       G G V +TPLI           YYLSL+ I+VG+ RL        +++ T
Sbjct: 202 LDLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGT 261

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
           G   +D+G   T LP + +  ++   +  IK   V G           C++  SQ  P  
Sbjct: 262 GGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDV 317

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
           P++ +HF GA + L   N    + D+    I+C A   G+   + G   Q N  + YD++
Sbjct: 318 PKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQ 377

Query: 361 QAMVSFKPSRC 371
             M+SF  ++C
Sbjct: 378 NNMLSFVAAQC 388


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 124/382 (32%), Positives = 182/382 (47%), Gaps = 61/382 (15%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ-EPPLFDPKKSSTYNS 89
           V + YLMH+S+GTPP  +  ++DTGSD  WTQC PC  LDCF+Q   P+ DP  SST+ +
Sbjct: 86  VTNEYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAA 143

Query: 90  ISCSSSQC-AVVTSNC---SEGD--CSYSFLYGRGAYASFSSGNLATETLTF---NSTSG 140
           + C +  C A+  ++C   S GD  C Y + YG     S + G LAT++ TF   ++  G
Sbjct: 144 LPCDAPLCRALPFTSCGGRSWGDRSCVYVYHYGD---RSLTVGQLATDSFTFGGDDNAGG 200

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD--- 197
           L      V FGCGH N       + +TGI G G G  SL SQ+  +    FSYC      
Sbjct: 201 LAAR--RVTFGCGHIN--KGIFQANETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFD 253

Query: 198 -QGSSKINFGGI---------VAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFV 242
            + SS +  G            A  G V T  +I++      Y++ L  ISVG  R+   
Sbjct: 254 TKSSSVVTLGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVP 313

Query: 243 SSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYN 297
            S    +  +D+G   T LP + +          +KA+ V  VG     +      LC+ 
Sbjct: 314 ESRLRSSTIIDSGASITTLPEDVY--------EAVKAEFVSQVGLPAAAAGSAALDLCFA 365

Query: 298 IS-----SQPKFPEVTIHFR-GADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIM 349
           +       +P  P +T+H   GAD +L   N +F + +  ++C          +V G   
Sbjct: 366 LPVAALWRRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQ 425

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q N  + YD+E  ++SF P+RC
Sbjct: 426 QQNTHVVYDLENDVLSFAPARC 447


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 181/368 (49%), Gaps = 48/368 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           ++LM+ SIG PP+     +DTGS  TW  C PC    C +Q  P+FDP KSSTY+++SCS
Sbjct: 92  VFLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSS--CSQQSVPIFDPSKSSTYSNLSCS 149

Query: 94  S-SQCAVVTSNCSEGDCSYSFLY-GRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             ++C VV      G+C YS  Y G G+    S G  A E LT  +     +++P++IFG
Sbjct: 150 ECNKCDVVN-----GECPYSVEYVGSGS----SQGIYAREQLTLETIDESIIKVPSLIFG 200

Query: 152 CGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
           CG K   S      Q   G+ GLG G  SL+   G     KFSYC+ +  ++   F  +V
Sbjct: 201 CGRKFSISSNGYPYQGINGVFGLGSGRFSLLPSFGK----KFSYCIGNLRNTNYKFNRLV 256

Query: 210 AG-----AGVVSTPLIIRDHYYLSLEAISVGNQRLE-----FVSSSTGN---IFVDTGVL 256
            G      G  +T  +I   YY++LEAIS+G ++L+     F  S T N   + +D+G  
Sbjct: 257 LGDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGAD 316

Query: 257 RTLLPLEYHSNLKSVMSNMIK-----AQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
            T L       L   + N+++     AQ  K       +S V+  ++S    FP VT HF
Sbjct: 317 HTWLTKYGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSG---FPLVTFHF 373

Query: 312 -RGADVKLSPSNLFRNISDEIMCSAFRGGN-------ANIVYGRIMQINFLIGYDIEQAM 363
             GA + L  +++F   ++   C A   GN       +    G + Q N+ +GYD+ +  
Sbjct: 374 AEGAVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMR 433

Query: 364 VSFKPSRC 371
           V F+   C
Sbjct: 434 VYFQRIDC 441


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 129/396 (32%), Positives = 187/396 (47%), Gaps = 72/396 (18%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNS 89
           V + YL+HLS+GTPP  +  ++DTGSD  WTQC PC  L+CF Q   P+ DP  SST+ +
Sbjct: 90  VTNEYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAA 147

Query: 90  ISCSSSQC-AVVTSNCSEG-------DCSYSFLYGRGAYASFSSGNLATETLTF---NST 138
           + C +  C A+  ++C  G        C Y + YG     S + G LA++  TF   ++ 
Sbjct: 148 VRCDAPVCRALPFTSCGRGGSSWGERSCVYVYHYGD---KSITVGKLASDRFTFGPGDNA 204

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-- 196
            G  V    + FGCGH N       + +TGI G G G  SL SQ+G +    FSYC    
Sbjct: 205 DGGGVSERRLTFGCGHFNKG--IFQANETGIAGFGRGRWSLPSQLGVT---SFSYCFTSM 259

Query: 197 -DQGSSKINFGGIVAGA------GVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS 244
            +  SS +  G  VA A       V STPL +RD      Y+LSL+AI+VG  R+     
Sbjct: 260 FESTSSLVTLG--VAPAELHLTGQVQSTPL-LRDPSQPSLYFLSLKAITVGATRIPIPER 316

Query: 245 ST----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
                  +  +D+G   T LP + +  +K+     +   PV  V    G +  LC+ + S
Sbjct: 317 RQRLREASAIIDSGASITTLPEDVYEAVKAEFVAQV-GLPVSAV---EGSALDLCFALPS 372

Query: 301 QP-------------------KFPEVTIHF-RGADVKLSPSN-LFRNISDEIMC----SA 335
                                + P +  H   GAD +L   N +F +    +MC    +A
Sbjct: 373 AAAPKSAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAA 432

Query: 336 FRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             GG+  +V G   Q N  + YD+E  ++SF P+RC
Sbjct: 433 TGGGDQTVVIGNYQQQNTHVVYDLENDVLSFAPARC 468


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 171/356 (48%), Gaps = 39/356 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M +S+GTP        DTGSD  W Q EPC    C      +FDP++SST+  + CSS
Sbjct: 55  YVMDISVGTPGKRFRAIADTGSDLVWVQSEPC--TGCSGGT--IFDPRQSSTFREMDCSS 110

Query: 95  SQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  +  +C  G   CSYS+ YG G     + G  A +T++  +TSG   + P+   GC
Sbjct: 111 QLCTELPGSCEPGSSACSYSYEYGSGE----TEGEFARDTISLGTTSGGSQKFPSFAVGC 166

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFG-- 206
           G  N      D    G++GLG G  SL SQ+  +I  KFSYCL D      SS + FG  
Sbjct: 167 GMVNSGFDGVD----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPS 222

Query: 207 GIVAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
             + G G+ ST +         +Y L++  I+V  Q +     S G   +D+G   T +P
Sbjct: 223 AALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTM----GSPGTTIIDSGTTLTYVP 278

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLS 319
              +  + S M +M+    V   G+  G    LCY+ SS    KFP +TI   GA +   
Sbjct: 279 SGVYGRVLSRMESMVTLPRVD--GSSMGLD--LCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 320 PSNLFRNISD--EIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            SN F  + D  + +C A    GG    + G +MQ  + I YD   + +SF  ++C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 128/394 (32%), Positives = 186/394 (47%), Gaps = 62/394 (15%)

Query: 22  IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
           + +Q  + +    Y M+LSIGTPPV      DTGS   WTQC PC E  C  +  P F P
Sbjct: 77  VSFQTLLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTE--CAARPAPPFQP 134

Query: 82  KKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
             SST++ + C+SS C  +TS    C+   C Y + YG G    F++G LATETL     
Sbjct: 135 ASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG----FTAGYLATETLHVGGA 190

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
           S      P V FGC  +N    +S    +GI+GLG    SL+SQ+G    G+FSYCL   
Sbjct: 191 S-----FPGVAFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGV---GRFSYCLRSD 238

Query: 196 PDQGSSKINFGGI--VAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSS-- 245
            D G S I FG +  V G  V STPL+         +YY++L  I+VG   L   S++  
Sbjct: 239 ADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFG 298

Query: 246 ---------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV-GAEPGFSDVL 294
                     G   VD+G   T L  E ++ +K + +S M  A     V G   GF   L
Sbjct: 299 FTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFD--L 356

Query: 295 CYNIS-----SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE------IMCSAFRGGNAN 342
           C++ +     S    P + + F G A+  +   +    ++ +      + C      +  
Sbjct: 357 CFDATAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEK 416

Query: 343 I---VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +   + G +MQ++  + YD++  M SF P+ C N
Sbjct: 417 LSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 450


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 176/361 (48%), Gaps = 46/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  +++  +DTGSD  W QCEPC   DC++Q  P+F+P  SSTY S++CS+
Sbjct: 162 YFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCA--DCYQQSDPVFNPTSSSTYKSLTCSA 219

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC+++ TS C    C Y   YG G   SF+ G LAT+T+TF ++     ++ NV  GCG
Sbjct: 220 PQCSLLETSACRSNKCLYQVSYGDG---SFTVGELATDTVTFGNSG----KINNVALGCG 272

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N      +   TG  GL      ++S      A  FSYCL D+ S K   ++F  +  
Sbjct: 273 HDN------EGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQL 326

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
           G G  + PL+    I   YY+ L   SVG +++       +  +S +G + +D G   T 
Sbjct: 327 GGGDATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTR 386

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHFRG 313
           L  + +++L+     +        V  + G S +     CY+ S  S  K P V  HF G
Sbjct: 387 LQTQAYNSLRDAFLKLT-------VNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTG 439

Query: 314 AD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
              + L   N    + D    C AF   ++++ + G + Q    I YD+ + ++    ++
Sbjct: 440 GKSLDLPAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNK 499

Query: 371 C 371
           C
Sbjct: 500 C 500


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 119/350 (34%), Positives = 177/350 (50%), Gaps = 31/350 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M  SIGTPP ++    DTGSD  W +C  C    C  Q  P + P KSS+++ + CS 
Sbjct: 82  YDMTFSIGTPPQELSALADTGSDLIWAKCGACTR--CVPQGSPSYYPNKSSSFSKLPCSG 139

Query: 95  SQCA-VVTSNCSEG--DCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           S C+ + +S CS G  +C Y + YG  +    ++ G L +ET T  S +     +P + F
Sbjct: 140 SLCSDLPSSQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTLGSDA-----VPGIGF 194

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG-G 207
           GC      S       +G++GLG G  SL+SQ+     G FSYCL    +  S + FG G
Sbjct: 195 GC---TTMSEGGYGSGSGLVGLGRGPLSLVSQLNV---GAFSYCLTSDAAKTSPLLFGSG 248

Query: 208 IVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYH 265
            + GAGV STPL+    YY  ++LE+IS+G        SS   I  D+G     L    +
Sbjct: 249 ALTGAGVQSTPLLRTSTYYYTVNLESISIGAATTAGTGSS--GIIFDSGTTVAFLAEPAY 306

Query: 266 SNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF 324
           +  K +V+S         G     G+   +C+  S    FP + +HF G D+ L   N F
Sbjct: 307 TLAKEAVLSQTTNLTMASG---RDGYE--VCFQTSGA-VFPSMVLHFDGGDMDLPTENYF 360

Query: 325 RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
             + D + C   +   +  + G IMQ+N+ I YD+E++M+SF+P+ C N+
Sbjct: 361 GAVDDSVSCWIVQKSPSLSIVGNIMQMNYHIRYDVEKSMLSFQPANCDNF 410


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 169/361 (46%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP   ++  +DTGSD  W QC PC  + C+ Q  P+FDP KS ++ +I C S
Sbjct: 145 YFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIPCGS 202

Query: 95  SQCAVVT-SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  +    CS     C Y   YG G   SF+ G  +TETLTF  T      +  V+ G
Sbjct: 203 PLCRRLDYPGCSTKKQICLYQVSYGDG---SFTVGEFSTETLTFRGT-----RVGRVVLG 254

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S  SQ+G     KFSYCL D+ +    S I FG 
Sbjct: 255 CGHDNEGLFVGAAGLLGLGRG---RLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGD 311

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
                    TPL+    +   YY+ L  ISVG  R+  +S+      STGN  + +D+G 
Sbjct: 312 SAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGT 371

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    +  L+     ++ A  +K     P FS    C+++S  ++ K P V +HFR
Sbjct: 372 SVTRLTRAAYVALRDAF--LVGASNLK---RAPEFSLFDTCFDLSGKTEVKVPTVVLHFR 426

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           GADV L  SN    + +    C AF G  + + + G I Q  F + YD+  + V F P  
Sbjct: 427 GADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRG 486

Query: 371 C 371
           C
Sbjct: 487 C 487


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 123/385 (31%), Positives = 178/385 (46%), Gaps = 59/385 (15%)

Query: 24  YQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
           +QA +IS   +    Y + +S+GTPP  ++  +DTGSD  W QC PC  + C+ Q   +F
Sbjct: 22  FQAPVISGLSLGSGEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVF 79

Query: 80  DPKKSSTYNSISCSSSQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
           DP KSSTY+++ C+S QC  +    C    C Y   YG G   SFS+G  AT+ ++ NST
Sbjct: 80  DPYKSSTYSTLGCNSRQCLNLDVGGCVGNKCLYQVDYGDG---SFSTGEFATDAVSLNST 136

Query: 139 S-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
           S G  V +  +  GCGH N       +   G+        S  +Q+ +   G+FSYCL  
Sbjct: 137 SGGGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQINSENGGRFSYCLTG 193

Query: 198 QGSSKINFGGIVAG------AGVVSTP----LIIRDHYYLSLEAISVGNQRLEFVSSS-- 245
           + +       ++ G      AGV  TP    L +   YYL +  ISVG   L   +S+  
Sbjct: 194 RDTDSTERSSLIFGDAAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQ 253

Query: 246 -----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL------ 294
                 G + +D+G   T L    +++L+                   G SD++      
Sbjct: 254 LDSLGNGGVIIDSGTSVTRLQNAAYASLREAF--------------RAGTSDLVLTTEFS 299

Query: 295 ----CYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRGGNANIVYG 346
               CYN+S  S    P VT+HF+ GAD+KL  SN    + +    C AF G     + G
Sbjct: 300 LFDTCYNLSDLSSVDVPTVTLHFQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGPSIIG 359

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
            I Q  F + YD     V F PS+C
Sbjct: 360 NIQQQGFRVIYDNLHNQVGFVPSQC 384


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 175/368 (47%), Gaps = 43/368 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L++GTPP  I   +DTGSD  WTQC+ C    C +Q  PLF P+ SS+Y  + C+ 
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTA--CLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 95  SQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  ++  +C   D C+Y + YG G   + + G  ATE  TF S+SG    +P + FGC
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDG---TTTLGYYATERFTFASSSGETQSVP-LGFGC 211

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV 209
           G  N+ S  + S   GI+G G    SL+SQ+      +FSYCL    SS+   + FG + 
Sbjct: 212 GTMNVGSLNNAS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLA 265

Query: 210 -------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
                  A   V +TP++        YY++   ++VG +RL   +S+       +G + +
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVGAEPGFSDVLCYNISSQPKFP 305
           D+G   TL P+   + +     + ++      + P  GV             ++ Q   P
Sbjct: 326 DSGTALTLFPVAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385

Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAM 363
            +  HF+GAD+ L   N +  +     +C      G+     G  +Q +  + YD+E+  
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445

Query: 364 VSFKPSRC 371
           +SF P  C
Sbjct: 446 LSFAPVEC 453


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 119/376 (31%), Positives = 177/376 (47%), Gaps = 41/376 (10%)

Query: 24  YQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLF 79
           +QA ++S   +    Y + +S+GTPP  ++  +DTGSD  W QC PC  ++C+ Q   +F
Sbjct: 43  FQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTGSDILWLQCAPC--VNCYHQSDAIF 100

Query: 80  DPKKSSTYNSISCSSSQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
           DP KSSTY+++ CS+ QC  +    C    C Y   YG G   SF++G   T+ ++ NST
Sbjct: 101 DPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVDYGDG---SFTTGEFGTDDVSLNST 157

Query: 139 SGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
           SG+  V +  +  GCGH N       +   G+        S  +Q+     G+FSYCL D
Sbjct: 158 SGVGQVVLNKIPLGCGHDNEGYFVGAAGLLGLGKG---PLSFPNQVDPQNGGRFSYCLTD 214

Query: 198 ------QGSSKINFGGIVAGAGVVSTP----LIIRDHYYLSLEAISVGNQRLEFVSSS-- 245
                 +GSS +     V  AG   TP    + +   YYL +  ISVG   L   +S+  
Sbjct: 215 RETDSTEGSSLVFGEAAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQ 274

Query: 246 -----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS 299
                 G + +D+G   T L    +++L+              +    GFS    CY++S
Sbjct: 275 LDSLGNGGVIIDSGTSVTRLQNAAYASLRDAFR-----AGTSDLAPTAGFSLFDTCYDLS 329

Query: 300 --SQPKFPEVTIHFRGA-DVKLSPSNLFRNISD-EIMCSAFRGGNANIVYGRIMQINFLI 355
             +    P VT+HF+G  D+KL  SN    + +    C AF G     + G I Q  F +
Sbjct: 330 GLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGPSIIGNIQQQGFRV 389

Query: 356 GYDIEQAMVSFKPSRC 371
            YD     V F PS+C
Sbjct: 390 IYDNLHNQVGFVPSQC 405


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 118/394 (29%), Positives = 183/394 (46%), Gaps = 51/394 (12%)

Query: 11  NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
           ND++    P  +  +    S D  Y++ L+IGTPP  +   +DTGSD  WTQC PC    
Sbjct: 81  NDDQRTTPPTGVSVRP---SGDLEYVVDLAIGTPPQPVSALLDTGSDLIWTQCAPCAS-- 135

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNL 128
           C  Q  PLF P +S++Y  + C+   C+ ++   C   D C+Y + YG G   + + G  
Sbjct: 136 CLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTYRYNYGDG---TMTMGVY 192

Query: 129 ATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIA 188
           ATE  TF S+ G  +    + FGCG  N+ S  + S   GI+G G    SL+SQ+     
Sbjct: 193 ATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGS---GIVGFGRNPLSLVSQLSIR-- 247

Query: 189 GKFSYCLPDQGS---SKINFGGIVAG------AGVVSTPLIIR----DHYYLSLEAISVG 235
            +FSYCL   GS   S + FG +  G        V +TPL+        YY+ L  ++VG
Sbjct: 248 -RFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVG 306

Query: 236 NQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
            +RL    S+       +G + VD+G   TLLP    + +       ++     G   E 
Sbjct: 307 ARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRLPFANGGNPED 366

Query: 289 GFSDVLCYNI---------SSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR- 337
           G    +C+ +         +SQ   P +  HF+ AD+ L   N +  +     +C     
Sbjct: 367 G----VCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLAD 422

Query: 338 GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            G+     G ++Q +  + YD+E   +SF P++C
Sbjct: 423 SGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP  SSTY ++SC++
Sbjct: 179 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 237

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +  S CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 238 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 290

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG   AG
Sbjct: 291 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFG---AG 344

Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           +     +TP++  +    YY+ +  I VG + L    S  +     VD+G + T LP   
Sbjct: 345 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 404

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
           +S+L+S  +  + A+  +   A        CY+ +  SQ   P V++ F+ GA + +  S
Sbjct: 405 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 462

Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +   +S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 463 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 177/358 (49%), Gaps = 41/358 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L+++SIG+PP+     +DT SD  W QC PC  ++C+ Q  P+FDP +S T+ + +C +
Sbjct: 85  FLVNISIGSPPITQLLHMDTASDLLWIQCLPC--INCYAQSLPIFDPSRSYTHRNETCRT 142

Query: 95  SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVEMPNVIF 150
           SQ ++  +  N +   C YS  Y      + S G LA E L FN+         + +V+F
Sbjct: 143 SQYSMPSLKFNANTRSCEYSMRYVDD---TGSKGILAREMLLFNTIYDESSSAALHDVVF 199

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSKINF 205
           GCGH N   P      TGI+GLG G  SL+ + G     KFSYC      P    + +  
Sbjct: 200 GCGHDNYGEPLVG---TGILGLGYGEFSLVHRFGK----KFSYCFGSLDDPSYPHNVLVL 252

Query: 206 GGIVAGAGVV--STPLIIRD-HYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTG 254
           G    GA ++  +TPL I +  YY+++EAISV    L            +  G   +DTG
Sbjct: 253 GD--DGANILGDTTPLEIHNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTG 310

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTI 309
              T L  E +  LK+ + ++ + +      ++     + CYN + +       FP VT 
Sbjct: 311 NSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECYNGNFERDLVESGFPIVTF 370

Query: 310 HF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           HF  GA++ L   +LF  +S  + C A   GN N + G   Q ++ IGYD+E   VSF
Sbjct: 371 HFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNSI-GATAQQSYNIGYDLEAMEVSF 427


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 174/368 (47%), Gaps = 43/368 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L++GTPP  I   +DTGSD  WTQC+ C    C +Q  PLF P+ SS+Y  + C+ 
Sbjct: 98  YVLDLAVGTPPQPITALLDTGSDLIWTQCDTCTA--CLRQPDPLFSPRMSSSYEPMRCAG 155

Query: 95  SQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  ++  +C   D C+Y + YG G   + + G  ATE  TF S+SG    +P + FGC
Sbjct: 156 QLCGDILHHSCVRPDTCTYRYSYGDG---TTTLGYYATERFTFASSSGETQSVP-LGFGC 211

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV 209
           G  N+ S  + S   GI+G G    SL+SQ+      +FSYCL    SS+   + FG + 
Sbjct: 212 GTMNVGSLNNAS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYASSRKSTLQFGSLA 265

Query: 210 -------AGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-------TGNIFV 251
                  A   V +TP++        YY++   ++VG +RL   +S+       +G + +
Sbjct: 266 DVGLYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVII 325

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVGAEPGFSDVLCYNISSQPKFP 305
           D+G   TL P    + +     + ++      + P  GV             ++ Q   P
Sbjct: 326 DSGTALTLFPAAVLAEVVRAFRSQLRLPFANGSSPDDGVCFAAPAVAAGGGRMARQVAVP 385

Query: 306 EVTIHFRGADVKLSPSN-LFRNISDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAM 363
            +  HF+GAD+ L   N +  +     +C      G+     G  +Q +  + YD+E+  
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445

Query: 364 VSFKPSRC 371
           +SF P  C
Sbjct: 446 LSFAPVEC 453


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 171/361 (47%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTPP  ++  +DTGSD  W QC+PC +  C+ Q   +FDP KS ++  I C S
Sbjct: 130 YFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTK--CYSQTDQIFDPSKSKSFAGIPCYS 187

Query: 95  SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S  CS  +  C Y   YG G   SF+ G+ +TETLTF   +     +P V  G
Sbjct: 188 PLCRRLDSPGCSLKNNLCQYQVSYGDG---SFTFGDFSTETLTFRRAA-----VPRVAIG 239

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
           CGH N       +   G+        S  +Q GT    KFSYCL D+ +S     I FG 
Sbjct: 240 CGHDNEGLFVGAAGLLGLGRG---GLSFPTQTGTRFNNKFSYCLTDRTASAKPSSIVFGD 296

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
                    TPL+    +   YY+ L  ISVG   +  +S+      STGN  + +D+G 
Sbjct: 297 SAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGT 356

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    + +L+      + A  +K     P FS    CY++S  S+ K P V +HFR
Sbjct: 357 SVTRLTRPAYVSLRDAF--RVGASHLK---RAPEFSLFDTCYDLSGLSEVKVPTVVLHFR 411

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           GADV L  +N    + +    C AF G  + + + G I Q  F + +D+  + V F P  
Sbjct: 412 GADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRG 471

Query: 371 C 371
           C
Sbjct: 472 C 472


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP  SSTY ++SC++
Sbjct: 183 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 241

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +  S CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 242 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 294

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG   AG
Sbjct: 295 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFG---AG 348

Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           +     +TP++  +    YY+ +  I VG + L    S  +     VD+G + T LP   
Sbjct: 349 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 408

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
           +S+L+S  +  + A+  +   A        CY+ +  SQ   P V++ F+ GA + +  S
Sbjct: 409 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 466

Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +   +S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 467 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 34/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP  SSTY ++SC++
Sbjct: 180 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPC-VVACYEQREKLFDPASSSTYANVSCAA 238

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +  S CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 239 PACSDLDVSGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG   AG
Sbjct: 292 ERNDG---LFGEAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFG---AG 345

Query: 212 A--GVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           +     +TP++  +    YY+ +  I VG + L    S  +     VD+G + T LP   
Sbjct: 346 SPPATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAA 405

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPS 321
           +S+L+S  +  + A+  +   A        CY+ +  SQ   P V++ F+ GA + +  S
Sbjct: 406 YSSLRSAFAAAMAARGYRKAAAVSLLD--TCYDFTGMSQVAIPTVSLLFQGGAALDVDAS 463

Query: 322 NLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +   +S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 464 GIMYTVSASQVCLAFAGNEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 169/364 (46%), Gaps = 41/364 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTPP      +DTGSD  W QC+PC  + C++Q  PL+DP+ SSTY    CS 
Sbjct: 99  YFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTPCSP 156

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
            QC    T + + G C Y  +YG    AS +SGNLAT+ L F N TS     + NV  GC
Sbjct: 157 PQCRNPQTCDGTTGGCGYRIVYGD---ASSTSGNLATDRLVFSNDTS-----VGNVTLGC 208

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           GH N     S     G++G+  GN+S  +Q+  S    F+YCL D+  S  +   +V G 
Sbjct: 209 GHDNEGLFGS---AAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGR 265

Query: 213 GV------VSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIFVDT 253
                   V TPL         YY+ +   SVG + +   S+++         G + VD+
Sbjct: 266 TAPEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDS 325

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF 311
           G   T    + +  L+           ++ VG      D  CY++   +    P V +HF
Sbjct: 326 GTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDA-CYDLRGVAVADAPGVVLHF 384

Query: 312 R-GADVKLSPSN-LFRNISDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
             GADV L P N L    S    C A    G +   V G ++Q  F + +D+E   V F+
Sbjct: 385 AGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFE 444

Query: 368 PSRC 371
           P+ C
Sbjct: 445 PNGC 448


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 164/361 (45%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP   ++  +DTGSD  W QC PC    C+ Q  P+FDP+KS TY +I CSS
Sbjct: 142 YFTRLGVGTPARYVYMVLDTGSDIVWLQCAPCRR--CYSQSDPIFDPRKSKTYATIPCSS 199

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   N     C Y   YG G   SF+ G+ +TETLTF         +  V  G
Sbjct: 200 PHCRRLDSAGCNTRRKTCLYQVSYGDG---SFTVGDFSTETLTFRRN-----RVKGVALG 251

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGG 207
           CGH N       +   G+        S   Q G     KFSYCL D+ +    S + FG 
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKG---KLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVVFGN 308

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPL+    +   YY+ L  ISVG  R+  V++S         G + +D+G 
Sbjct: 309 AAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGT 368

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    +  ++      + A+ +K     P FS    C+++S  ++ K P V +HFR
Sbjct: 369 SVTRLIRPAYIAMRDAFR--VGAKTLK---RAPNFSLFDTCFDLSNMNEVKVPTVVLHFR 423

Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
            ADV L  +N    + ++   C AF G    + + G I Q  F + YD+  + V F P  
Sbjct: 424 RADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGG 483

Query: 371 C 371
           C
Sbjct: 484 C 484


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 119/373 (31%), Positives = 180/373 (48%), Gaps = 44/373 (11%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +++ D  YLM + IGTP       +DTGSD  WTQC PC  L C  Q  P FDP +S+TY
Sbjct: 83  VLASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATY 140

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            S+ C+S  C A+    C +  C Y + YG  A    ++G LA ET TF  T+   V +P
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFYGDSAS---TAGVLANETFTFG-TNETRVSLP 196

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
            + FGCG+ N  S  + S   G++G G G+ SL+SQ+G+    +FSYCL    S   S++
Sbjct: 197 GISFGCGNLNAGSLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPSRL 250

Query: 204 NFGGI-------VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSS 244
            FG          +   V STP ++       Y+L++  ISVG   L           + 
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
            TG   +D+G   T L    +  +++  ++ I   P+  V  +    D  C+     P+ 
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNV-TDASVLDT-CFQWPPPPRQ 367

Query: 304 ---FPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
               P++ +HF GAD +L   N  L    +   +C A    +   + G     NF + YD
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYD 427

Query: 359 IEQAMVSFKPSRC 371
           +E +++SF P+ C
Sbjct: 428 LENSLMSFVPAPC 440


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 167/348 (47%), Gaps = 43/348 (12%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCS 110
           +DTGSD  WTQC PC  L C  Q  P FD KKS+TY ++ C SS+CA ++S +C +  C 
Sbjct: 1   MDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCV 58

Query: 111 YSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGII 170
           Y + YG  A    ++G LA ET TF + +   V   N+ FGCG  N     + S   G++
Sbjct: 59  YQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANSS---GMV 112

Query: 171 GLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG--------GIVAGAGVVSTPL 219
           G G G  SL+SQ+G S   +FSYCL    S   S++ FG           +G+ V STP 
Sbjct: 113 GFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPF 169

Query: 220 IIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
           +I     + Y+LSL+AIS+G + L             TG + +D+G   T L  + +   
Sbjct: 170 VINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAY--- 226

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKLSPSNLF 324
           ++V   ++ A P+  +       D  C+     P      P++  HF  A++ L P N  
Sbjct: 227 EAVRRGLVSAIPLPAMNDTDIGLDT-CFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYM 285

Query: 325 RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              S    +C          + G   Q N  + YDI  + +SF P+ C
Sbjct: 286 LIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  148 bits (374), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 113/360 (31%), Positives = 172/360 (47%), Gaps = 35/360 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L++LSIG+PPV     VDTGS   W QC PC  ++CF+Q    FDP KS ++ ++ C  
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPC--INCFQQSTSWFDPLKSVSFKTLGCGF 161

Query: 95  SQCAVVTS-NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +    C+      Y   Y  G     S G LA E+L F +     ++  N+ FGC
Sbjct: 162 PGYNYINGYKCNRFNQAEYKLRYLGG---DSSQGILAKESLLFETLDEGKIKKSNITFGC 218

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           GH N+ +  +D    G+ GLG         M T +  KFSYC+ D  +       +V G 
Sbjct: 219 GHMNIKT-NNDDAYNGVFGLGAYPH---ITMATQLGNKFSYCIGDINNPLYTHNHLVLGQ 274

Query: 213 GVV----STPLIIR-DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL 260
           G      STPL I   HYY++L++ISVG++ L       +  S  +G + +D+G+  T L
Sbjct: 275 GSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKL 334

Query: 261 P---LE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GAD 315
                E  +  +  +M  +++  P +       F  V+  ++     FP VT HF  GAD
Sbjct: 335 ANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVG---FPAVTFHFAGGAD 391

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L   +LFR    +  C A    N+ +    V G + Q N+ +G+D+EQ  V F+   C
Sbjct: 392 LVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 170/356 (47%), Gaps = 34/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+P       VDTGS  +W QC+PC  + C  Q  PLFDP  S TY S+SC+S
Sbjct: 13  YYVKVGLGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 71

Query: 95  SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SQC+ +            S   C Y+  YG    +S+S G L+ + LT   +      +P
Sbjct: 72  SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 124

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INF 205
             ++GCG     S     +  GI+GLG    S++ Q+ +     FSYCLP +G    ++ 
Sbjct: 125 GFVYGCGQD---SEGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSI 181

Query: 206 GGI-VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTL 259
           G   +AG+    TP+         Y+L L AI+VG + L   ++       +D+G + T 
Sbjct: 182 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 241

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFRG-AD 315
           LP+  ++  +     ++ ++  +     PGFS +  C+  N+      PEV + F+G AD
Sbjct: 242 LPMSVYTPFQQAFVKIMSSKYAR----APGFSILDTCFKGNLKDMQSVPEVRLIFQGGAD 297

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L P N+   + + + C AF G N   + G   Q  F + +DI  A + F    C
Sbjct: 298 LNLRPVNVLLQVDEGLTCLAFAGNNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 117/347 (33%), Positives = 168/347 (48%), Gaps = 37/347 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  D+    DTGSD TWTQCEPC    C+KQ+  +FDP KS++Y++I+C+S
Sbjct: 146 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCAR-SCYKQQDVIFDPSKSTSYSNITCTS 204

Query: 95  SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           + C  +++        + S   C Y   YG    +SFS G  + E LT  +T      + 
Sbjct: 205 ALCTQLSTATGNDPGCSASTKACIYGIQYGD---SSFSVGYFSRERLTVTATD----VVD 257

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
           N +FGCG  N       +   G+IGLG    S + Q        FSYCLP   SS   ++
Sbjct: 258 NFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAKYRKIFSYCLPSTSSSTGHLS 314

Query: 205 FGGIVAGAGVVSTPL--IIR--DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
           FG    G  +  TP   I R    Y L + AI+VG  +L   SS  STG   +D+G + T
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFSTGGAIIDSGTVIT 374

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR---GAD 315
            LP   +  L+S     +   P  G   E    D  CY++S    F   TI F    G  
Sbjct: 375 RLPPTAYGALRSAFRQGMSKYPSAG---ELSILDT-CYDLSGYKVFSIPTIEFSFAGGVT 430

Query: 316 VKLSPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDI 359
           VKL P  +    S + +C AF   G ++++ +YG + Q    + YD+
Sbjct: 431 VKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 175/354 (49%), Gaps = 31/354 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QCEPC  + C++Q+  LFDP +SST  +ISC++
Sbjct: 186 YVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVV-CYEQQEKLFDPARSSTDANISCAA 244

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ + T  CS G C Y   YG G+Y   S G  A +TLT +S   +        FGCG
Sbjct: 245 PACSDLYTKGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYDAI----KGFRFGCG 297

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINF--GGIV 209
            +N        +  G++GLG G +SL  Q      G F++C P +  G+  ++F  G   
Sbjct: 298 ERNEG---LFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSP 354

Query: 210 AGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           A +  ++TP+++ +    YY+ L  I VG + L    S  +T    VD+G + T LP   
Sbjct: 355 AVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAA 414

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+S  ++ I A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 415 YSSLRSAFASAIAA---RGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDA 471

Query: 321 SNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S    C  F     +    + G      F + YDI + +V F P  C
Sbjct: 472 SGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 172/357 (48%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  +++  +DTGSD  W QC PC E  C++Q  P+FDP  SST+ S++CS 
Sbjct: 164 YFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPCSE--CYQQSDPIFDPTSSSTFKSLTCSD 221

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            +CA +  S C    C Y   YG G   SF+ GN AT+T+TF    G   ++ +V  GCG
Sbjct: 222 PKCASLDVSACRSNKCLYQVSYGDG---SFTVGNYATDTVTF----GESGKVNDVALGCG 274

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N    T  +   G+ G      S+ +Q+    A  FSYCL D+ S+K   ++F  +  
Sbjct: 275 HDNEGLFTGAAGLLGLGGG---ALSMTNQIK---AKSFSYCLVDRDSAKSSSLDFNSVQI 328

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
           GAG  + PL+    +   YY+ L   SVG Q++       E  +S  G + +D G   T 
Sbjct: 329 GAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTR 388

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-V 316
           L  + +++L+      +K       G  P      CY+ S  S  K P VT HF G   +
Sbjct: 389 LQTQAYNSLRDA---FVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSL 445

Query: 317 KLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N    I D    C AF   ++++ + G + Q    I YD+   ++    ++C
Sbjct: 446 NLPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 173/358 (48%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M +S+GTP        DTGSD  W Q EPC    C      +FDP++SST+  + CSS
Sbjct: 55  YVMDISVGTPGKRFRAIADTGSDLVWVQSEPC--TGCSGGT--IFDPRQSSTFREMDCSS 110

Query: 95  SQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             CA +  +C  G   CSYS+ YG G     + G  A +T++  +TS    + P+   GC
Sbjct: 111 QLCAELPGSCEPGSSTCSYSYEYGSGE----TEGEFARDTISLGTTSDGSQKFPSFAVGC 166

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKINFG-- 206
           G  N      D    G++GLG G  SL SQ+  +I  KFSYCL D      SS + FG  
Sbjct: 167 GMVNSGFDGVD----GLVGLGQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPS 222

Query: 207 GIVAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
             + G G+ ST +         +Y L++  I+V  Q +     S G   +D+G   T +P
Sbjct: 223 AALHGTGIQSTKITPPSDTYPTYYLLTVNGIAVAGQTM----GSPGTTIIDSGTTLTYVP 278

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLS 319
              +  + S M +M+    V   G+  G    LCY+ SS    KFP +TI   GA +   
Sbjct: 279 SGVYGRVLSRMESMVTLPRVD--GSSMGLD--LCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 320 PSNLFRNISD--EIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            SN F  + D  + +C A   G+A+     + G +MQ  + I YD   + +SF  ++C
Sbjct: 335 SSNYFLVVDDSGDTVCLAM--GSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 175/366 (47%), Gaps = 54/366 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+P   ++  +DTGSD TW QC+PC   DC++Q  P+FDP  S++Y ++SC S
Sbjct: 166 YFSRVGIGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDS 223

Query: 95  SQCA-VVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            +C  + T+ C  + G C Y   YG G+Y   + G+ ATETLT   ++  PV   NV  G
Sbjct: 224 QRCRDLDTAACRNATGACLYEVAYGDGSY---TVGDFATETLTLGDST--PVG--NVAIG 276

Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG 206
           CGH N  L    +     G   L     S  SQ+  S    FSYCL D+ S   S + FG
Sbjct: 277 CGHDNEGLFVGAAGLLALGGGPL-----SFPSQISAS---TFSYCLVDRDSPAASTLQFG 328

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTG 254
              A AG V+ PL+        YY++L  ISVG Q L           +S +G + VD+G
Sbjct: 329 DGAAEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSG 388

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVT 308
              T L    ++ L+           V+G  + P  S V     CY++S +   + P V+
Sbjct: 389 TAVTRLQSAAYAALRDAF--------VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVS 440

Query: 309 IHFRGAD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
           + F G   ++L   N    +      C AF   NA + + G + Q    + +D  +  V 
Sbjct: 441 LRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVG 500

Query: 366 FKPSRC 371
           F P++C
Sbjct: 501 FTPNKC 506


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 122/376 (32%), Positives = 175/376 (46%), Gaps = 58/376 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +LM LS+GTP +     VDTGSD  WTQC+PC E  CF Q  P+FDP  SSTY ++ CSS
Sbjct: 116 FLMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVE--CFNQTTPVFDPAASSTYAALPCSS 173

Query: 95  SQCA---------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           + CA           +S+ +   C Y++ YG    AS + G LATET T         ++
Sbjct: 174 ALCADLPTSTCASSSSSSSASSPCGYTYTYGD---ASSTQGVLATETFTLARQ-----KV 225

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSS 201
           P V FGCG  N       ++  G++GLG G  SL+SQ+G     +FSYCL       G S
Sbjct: 226 PGVAFGCGDTNEGD--GFTQGAGLVGLGRGPLSLVSQLGID---RFSYCLTSLDDAAGRS 280

Query: 202 KINFGGIVAGAGVV------STPLIIR----DHYYLSLEAISVGNQRLEFVSSS------ 245
            +  G     +         +TPL+        YY+SL  ++VG+ RL   SS+      
Sbjct: 281 PLLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDD 340

Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------N 297
            TG + VD+G   T L L  +  L+      +    V    +E G    LC+       +
Sbjct: 341 GTGGVIVDSGTSITYLELRAYRALRKAFVAHMSLPTVD--ASEIGLD--LCFQGPAGAVD 396

Query: 298 ISSQPKFPEVTIHFR-GADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
              Q + P++ +HF  GAD+ L   N +  + +   +C          + G   Q NF  
Sbjct: 397 QDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQF 456

Query: 356 GYDIEQAMVSFKPSRC 371
            YD+    +SF P+ C
Sbjct: 457 VYDVAGDTLSFAPAEC 472


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 173/366 (47%), Gaps = 48/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PC    CF Q  P +D  +SST+   SC S
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 148

Query: 95  SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           +QC +  S     N +   C++S+ YG     S + G L  ET++F + +     +P V+
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAFSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 201

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
           FGCG  N       S +TGI G G G  SL SQ+     G FS+C       K       
Sbjct: 202 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFD 256

Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
           +       G G V T  +I++      YYLSL+ I+VG+ RL    S+      TG   +
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
           D+G   T LP   +  +    +  +K   V      P    +LC++   +   P  P++ 
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 372

Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           +HF GA + L   N      D   CS   A   G   I+ G   Q N  + YD++ + +S
Sbjct: 373 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 431

Query: 366 FKPSRC 371
           F  ++C
Sbjct: 432 FVRAKC 437


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 120/371 (32%), Positives = 177/371 (47%), Gaps = 52/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  +++GTP V+   ++DT SD TW QC+PC    C+ Q  P+FDP+ S++Y  +S ++
Sbjct: 138 YIAKIAVGTPGVEALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYREMSFNA 195

Query: 95  SQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + C  +      +   G C Y+  YG G   S + G+   ETLTF       V +P +  
Sbjct: 196 ADCQALGRSGGGDAKRGTCVYTVGYGDG---STTVGDFIEETLTFAGG----VRLPRISI 248

Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL------PDQGSS 201
           GCGH N     +P +     GI+GLG G  S  +Q+  +  G FSYCL      P   SS
Sbjct: 249 GCGHDNKGLFGAPAA-----GILGLGRGLMSFPNQIDHN--GTFSYCLVDFLSGPGSLSS 301

Query: 202 KINFG-GIVAGAGVVS-TPLIIR----DHYYLSLEAISVGNQRLEFVS---------SST 246
            + FG G V  +  VS TP ++       YY+ L  ISVG  R+  V+         +  
Sbjct: 302 TLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTERDLQLDPYTGR 361

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
           G + VD+G   T L    ++  +     +        +G   GF D  CY +  +   K 
Sbjct: 362 GGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDT-CYTVGGRGMKKV 420

Query: 305 PEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIE 360
           P V++HF G+ +VKL P N    + S   +C AF   G ++  + G I Q  F I YDI 
Sbjct: 421 PTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDI- 479

Query: 361 QAMVSFKPSRC 371
              V F P+ C
Sbjct: 480 GGRVGFAPNSC 490


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 125/385 (32%), Positives = 188/385 (48%), Gaps = 48/385 (12%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQ---EPP 77
           + +E+ S    YLM ++IGTPP  +    DTGSD  W  C      P L   +    +PP
Sbjct: 89  FVSELTSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPP 148

Query: 78  --LFDPKKSSTYNSISCSSSQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETL 133
              FDP KS+T+  + C S  C+ +  ++C ++  C YS+ YG G++   +SG L+TET 
Sbjct: 149 GVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSH---TSGVLSTETF 205

Query: 134 TFNST-----SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TS 186
           TF         G    + NV FGC    + S    S   G++GLG G+ SL+SQ+G  TS
Sbjct: 206 TFADAPGARGDGTTTRVANVNFGCSTTFVGS----SVGDGLVGLGGGDLSLVSQLGADTS 261

Query: 187 IAGKFSYCLPD---QGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQR 238
           +  +FSYCL     + SS +NFG    V   G V+TPLI   ++ +Y + L ++ VGN+ 
Sbjct: 262 LGRRFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKT 321

Query: 239 LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
            E    S   + VD+G   T LP      L   ++  IK  P +     P     LC+++
Sbjct: 322 FEAPDRSP--LIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQ----SPERLLPLCFDV 375

Query: 299 SS------QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRI 348
           S           P+VT+    GA V L   N F  + +  +C A    +      + G I
Sbjct: 376 SGVREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNI 435

Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
            Q N  +GYD+++  V+F P+ C +
Sbjct: 436 AQQNMHVGYDLDKGTVTFAPAACAS 460


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  147 bits (371), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 175/364 (48%), Gaps = 50/364 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+P  +++  +DTGSD TW QC+PC   DC++Q  P+FDP  S++Y ++SC S
Sbjct: 169 YFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDS 226

Query: 95  SQCA-VVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            +C  + T+ C  + G C Y   YG G+Y   + G+ ATETLT   ++  PV   NV  G
Sbjct: 227 PRCRDLDTAACRNATGACLYEVAYGDGSY---TVGDFATETLTLGDST--PVT--NVAIG 279

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
           CGH N       +    + G      S IS      A  FSYCL D+ S   S + FG  
Sbjct: 280 CGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ASTFSYCLVDRDSPAASTLQFGAD 333

Query: 209 VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVL 256
            A A  V+ PL+        YY++L  ISVG Q L   SS+        +G + VD+G  
Sbjct: 334 GAEADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTA 393

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
            T L    ++ L+           V+G  + P  S V     CY++S +   + P V++ 
Sbjct: 394 VTRLQSSAYAALRDAF--------VRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 445

Query: 311 FRGAD-VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           F G   ++L   N    +      C AF   NA + + G + Q    + +D  + +V F 
Sbjct: 446 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFT 505

Query: 368 PSRC 371
           P++C
Sbjct: 506 PNKC 509


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 118/375 (31%), Positives = 172/375 (45%), Gaps = 55/375 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+  +IGTPP+ +   +DTGSD  WTQC+ PC    CF Q  PL+ P +S TY ++SC 
Sbjct: 100 YLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSVTYANVSCG 157

Query: 94  SSQCAVVTS--------------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           S  C  + S                  G C+Y + YG G   S + G LATET TF    
Sbjct: 158 SRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDG---SSTDGVLATETFTF---- 210

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--- 196
           G    + ++ FGCG  NL    + S   G++G+G G  SL+SQ+G +   KFSYC     
Sbjct: 211 GAGTTVHDLAFGCGTDNLGGTDNSS---GLVGMGRGPLSLVSQLGVT---KFSYCFTPFN 264

Query: 197 DQGSSKINFGGIVA--GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------E 240
           D  +S   F G  A       STP +          +YYLSLE I+VG+  L        
Sbjct: 265 DTTTSSPLFLGSSASLSPAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFR 324

Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
             +S  G + +D+G   T   LE  + +    +   +       GA  G S         
Sbjct: 325 LTASGRGGLIIDSGT--TFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGR 382

Query: 301 QPK---FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIG 356
            P+    P + +HF GAD++L  S+ +  +    + C          V G + Q N  + 
Sbjct: 383 GPEAVDVPRLVLHFDGADMELPRSSAVVEDRVAGVACLGIVSARGMSVLGSMQQQNMHVR 442

Query: 357 YDIEQAMVSFKPSRC 371
           YD+ + ++SF+P+ C
Sbjct: 443 YDVGRDVLSFEPANC 457


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 170/365 (46%), Gaps = 50/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTP   +F  +DTGSD  W QC PC +  C+ Q  P+F+P KS ++ +I C S
Sbjct: 147 YFTRLGVGTPARYVFMVLDTGSDVVWIQCAPCKK--CYSQTDPVFNPTKSRSFANIPCGS 204

Query: 95  SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S  CS     C Y   YG G   SF+ G  +TETLTF  T      +  V  G
Sbjct: 205 PLCRRLDSPGCSTKKHICLYQVSYGDG---SFTYGEFSTETLTFRGT-----RVGRVALG 256

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
           CGH N       +   G+        S  SQ+G   + KFSYCL D+ +S     + FG 
Sbjct: 257 CGHDNEGLFIGAAGLLGLGRG---RLSFPSQIGRRFSRKFSYCLVDRSASSKPSYMVFGD 313

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS------STGN--IFVDTGV 255
                    TPL+    +   YY+ L  +SVG  R+  +++      STGN  + +D+G 
Sbjct: 314 SAISRTARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGT 373

Query: 256 LRTLLPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
             T L    +  L+       SN+ +A         P FS    C+++S  ++ K P V 
Sbjct: 374 SVTRLTRPAYVALRDAFRVGASNLKRA---------PEFSLFDTCFDLSGKTEVKVPTVV 424

Query: 309 IHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
           +HFRGADV L  SN    + +    C AF G  + + + G I Q  F + YD+  + V F
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGF 484

Query: 367 KPSRC 371
            P  C
Sbjct: 485 APRGC 489


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 187/405 (46%), Gaps = 55/405 (13%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           +N  +    N+ +TP   + +    ++      Y++ L+IGTPP  +   +DTGSD  WT
Sbjct: 68  RNRARFSGKNEQQTPAGVLPVRPSGDLE-----YVVDLAIGTPPQPVSALLDTGSDLIWT 122

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGA 119
           QC PC    C  Q  PLF P +S++Y  + C+ + C+ ++  +C   D C+Y + YG G 
Sbjct: 123 QCAPCAS--CLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERPDTCTYRYNYGDG- 179

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVI--FGCGHKNLASPTSDSKQTGIIGLGPGNS 177
             + + G  ATE  TF S+ G  +    V   FGCG  N+ S  + S   GI+G G    
Sbjct: 180 --TMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGS---GIVGFGRNPL 234

Query: 178 SLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVAG------AGVVSTPLIIRDH---- 224
           SL+SQ+      +FSYCL    S +   + FG +  G        V +TPL+        
Sbjct: 235 SLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQNPTF 291

Query: 225 YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK 277
           YY+    ++VG +RL    S+       +G + VD+G   TLLP    + +       ++
Sbjct: 292 YYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFRQQLR 351

Query: 278 AQPVKGVGAEPGFSDVLCYNI---------SSQPKFPEVTIHFRGADVKLSPSN-LFRNI 327
                G   E G    +C+ +         +SQ   P + +HF+GAD+ L   N +  + 
Sbjct: 352 LPFANGGNPEDG----VCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLDDH 407

Query: 328 SDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               +C      G+     G ++Q +  + YD+E   +S  P+RC
Sbjct: 408 RRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 165/361 (45%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP + +  +D+GSD  W QCEPC +  C+ Q  P+F+P  SS+Y  +SC+S
Sbjct: 134 YFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQ--CYHQSDPVFNPADSSSYAGVSCAS 191

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ V  + C EG C Y   YG G+Y   + G LA ETLTF  T      + NV  GCG
Sbjct: 192 TVCSHVDNAGCHEGRCRYEVSYGDGSY---TKGTLALETLTFGRT-----LIRNVAIGCG 243

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
           H N       +   G++GLG G  S + Q+G    G FSYCL  +G   S  + FG    
Sbjct: 244 HHNQGMFVGAA---GLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQFGREAV 300

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PLI     +  YY+ L  + VG  R+       +      G + +DTG   T 
Sbjct: 301 PVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVTR 360

Query: 260 LPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRG 313
           LP   +   +       +N+ +A  V             CY++      + P V+ +F G
Sbjct: 361 LPTAAYEAFRDAFIAQTTNLPRASGVSIFDT--------CYDLFGFVSVRVPTVSFYFSG 412

Query: 314 ADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
             +   P+  F    D++   C AF   ++ + + G I Q    I  D     V F P+ 
Sbjct: 413 GPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNV 472

Query: 371 C 371
           C
Sbjct: 473 C 473


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 169/354 (47%), Gaps = 33/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SSTY S+ CS+
Sbjct: 122 YVTRMGLGTPATQYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPKSSSTYASVGCSA 180

Query: 95  SQCAVV------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC+ +       S CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 181 QQCSDLPSATLNPSACSSSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----LPN 232

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   F+YCLP   SS     G
Sbjct: 233 FYYGCGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLG 289

Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
                    TP++   + D  Y++ L  ++V    L   SS+  ++   +D+G + T LP
Sbjct: 290 SYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLP 349

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKL 318
              +S L   +     A  +KG      +S  D      +S+   P VT+ F  GA +KL
Sbjct: 350 TSVYSALSKAV-----AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKL 404

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           S  NL  ++ D   C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 405 SAQNLLVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 458


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 118/383 (30%), Positives = 182/383 (47%), Gaps = 59/383 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTPP  ++  +DTGSD +W QC+PC   DCF+Q    + PK SSTY +ISC  
Sbjct: 171 YFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISCYD 228

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE--- 144
            +C +V+S     +C   +  C Y + Y  G   S ++G+ A+ET T N T     E   
Sbjct: 229 PRCQLVSSSDPLQHCKAENQTCPYFYDYADG---SNTTGDFASETFTVNLTWPNGKEKFK 285

Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----Q 198
            + +V+FGCGH N       S   G++GLG G  S  SQ+ +     FSYCL D      
Sbjct: 286 QVVDVMFGCGHWNKGFFYGAS---GLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTS 342

Query: 199 GSSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRLEFVSSSTGN- 248
            SSK+ FG    ++    +  T L+  +       YYL +++I VG + L+ +S  T + 
Sbjct: 343 VSSKLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLD-ISEQTWHW 401

Query: 249 ------------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
                         +D+G   T  P   +  +K      IK Q +    A   F    CY
Sbjct: 402 SSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQI----AADDFVMSPCY 457

Query: 297 NISS---QPKFPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFR---GGNANIVYGRI 348
           N+S    Q + P+  IHF    V   P+    ++   DE++C A       +   + G +
Sbjct: 458 NVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNL 517

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
           +Q NF I YD++++ + + P RC
Sbjct: 518 LQQNFHILYDVKRSRLGYSPRRC 540


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 176/357 (49%), Gaps = 35/357 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L ++SIG PPV     +DTGSD TW QC PC    C+ Q  P F P +SSTY + SC S
Sbjct: 88  FLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KCYPQTIPFFHPSRSSTYRNASCES 144

Query: 95  SQCAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           +  A+  +  +   G+C Y   Y      S + G LA E LTF ++    +  PN++FGC
Sbjct: 145 APHAMPQIFRDEKTGNCRYHLRYRD---FSNTRGILAKEKLTFQTSDEGLISKPNIVFGC 201

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           G  N    +  ++ +G++GLGPG  S++++   +   KFSYC             ++ G 
Sbjct: 202 GQDN----SGFTQYSGVLGLGPGTFSIVTR---NFGSKFSYCFGSLIDPTYPHNFLILGN 254

Query: 213 GVV----STPL-IIRDHYYLSLEAISVGNQRLEFVS------SSTGNIFVDTGVLRTLLP 261
           G       TPL I +D YYL L+AIS+G + L+          S G   +DTG   T+L 
Sbjct: 255 GARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKGGTVIDTGCSPTILA 314

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHFR-GADVK 317
            E +  L   + + +  + ++ V     +++  CY  + +     FP VT HF  GA++ 
Sbjct: 315 REAYETLSEEI-DFLLGEVLRRVKDWEQYTNH-CYEGNLKLDLYGFPVVTFHFAGGAELA 372

Query: 318 LSPSNLF-RNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L   +LF  + S +  C A      +   V G + Q N+ +GY++    V F+ + C
Sbjct: 373 LDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 179/373 (47%), Gaps = 44/373 (11%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +++ D  YLM + IGTP       +DTGSD  WTQC PC  L C  Q  P FDP +S+TY
Sbjct: 83  VLASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATY 140

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            S+ C+S  C A+    C +  C Y + YG  A    ++G LA ET TF  T+   V +P
Sbjct: 141 RSLGCASPACNALYYPLCYQKVCVYQYFYGDSAS---TAGVLANETFTFG-TNETRVSLP 196

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
            + FGCG+ N     + S   G++G G G+ SL+SQ+G+    +FSYCL    S   S++
Sbjct: 197 GISFGCGNLNAGLLANGS---GMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPSRL 250

Query: 204 NFGGI-------VAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF--------VSS 244
            FG          +   V STP ++       Y+L++  ISVG   L           + 
Sbjct: 251 YFGVYATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTD 310

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
            TG   +D+G   T L    +  +++  ++ I   P+  V  +    D  C+     P+ 
Sbjct: 311 GTGGTIIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNV-TDASVLDT-CFQWPPPPRQ 367

Query: 304 ---FPEVTIHFRGADVKLSPSN--LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
               P++ +HF GAD +L   N  L    +   +C A    +   + G     NF + YD
Sbjct: 368 SVTLPQLVLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYD 427

Query: 359 IEQAMVSFKPSRC 371
           +E +++SF P+ C
Sbjct: 428 LENSLMSFVPAPC 440


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 122/387 (31%), Positives = 181/387 (46%), Gaps = 70/387 (18%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A + S    YLM L+IGTPPV      DTGSD TWTQC+PC    CF Q+ P++D   SS
Sbjct: 74  ARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPCKL--CFGQDTPIYDTTTSS 131

Query: 86  TYNSISCSSSQC-AVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           +++ + CSS+ C  + +S CS     C Y + Y  GAY+   +G                
Sbjct: 132 SFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAYSPECAG---------------- 175

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----Q 198
           + +  + FGCG  N          TG +GLG G+ SL++Q+G    GKFSYCL D     
Sbjct: 176 ISVGGIAFGCGVDNGG---LSYNSTGTVGLGRGSLSLVAQLG---VGKFSYCLTDFFNTS 229

Query: 199 GSSKINFGGIVAGAG---------VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
            SS + FG +   A          V STPL+   +    YY+SLE IS+G+ RL   + +
Sbjct: 230 LSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGT 289

Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLC 295
                   +G + VD+G + T+L     +  + V+ ++  +  QPV    +     D  C
Sbjct: 290 FDLNDDDGSGGMIVDSGTIFTIL---VETGFRVVVDHVAGVLGQPVVNASSL----DRPC 342

Query: 296 Y-----NISSQPKFPEVTIHFR-GADVKLSPSNLFR-NISDEIMCSAFRGGN--ANIVYG 346
           +      +   P  P++ +HF  GAD++L   N    N  +   C    G    +  V G
Sbjct: 343 FPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLG 402

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
              Q N  + +DI    +SF P+ C+ 
Sbjct: 403 NFQQQNIQMLFDITVGQLSFMPTDCSK 429


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 170/353 (48%), Gaps = 32/353 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V    S+DTGSD +W QC PC    C  Q+  LFDP KS+TY++ SCSS
Sbjct: 130 YVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSS 189

Query: 95  SQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           +QCA +    + C    C Y   Y      S ++G   ++TL   ++  +     N  FG
Sbjct: 190 AQCAQLGGEGNGCLNSHCQYIVKY---VDHSNTTGTYGSDTLGLTTSDAV----KNFQFG 242

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           C H+   +     +  G++GLG    SL+SQ   +    FSYCLP   SS   F  + A 
Sbjct: 243 CSHR---ANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLTLGAA 299

Query: 212 AGVVS------TPLI---IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
           AG  S      TPL+   +   Y + L+AI+V   +L   +S  +G   VD+G + T LP
Sbjct: 300 AGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDSGTVITQLP 359

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
              +  L++     +KA P     A  G  D  C++ S     + P VT+ F RGA + L
Sbjct: 360 PTAYQALRTAFKKEMKAYP---SAAPVGILDT-CFDFSGIKTVRVPVVTLTFSRGAVMDL 415

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             S +F         +A  G     + G + Q  F + +D+  + + F+P  C
Sbjct: 416 DVSGIFYAGCLAFTATAQDGDTG--ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/374 (29%), Positives = 169/374 (45%), Gaps = 40/374 (10%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           + A + +    YL  + +GTP       VDTGSD TW QC PC +  C+ Q   LF P  
Sbjct: 2   FTAPVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPCGK--CYSQNDALFLPNT 59

Query: 84  SSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S+++  ++C S+ C  +    C++  C Y + YG G   S ++G+   +T+T +  +G  
Sbjct: 60  STSFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDG---SLTTGDFVYDTITMDGINGQK 116

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
            ++PN  FGCGH N     S +   GI+GLG G  S  SQ+ +   GKFSYCL D  +  
Sbjct: 117 QQVPNFAFGCGHDNEG---SFAGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPP 173

Query: 203 INFGGIVAGAGVVST-------PLI----IRDHYYLSLEAISVGNQRLEFVSS------- 244
                ++ G   V         P++    +  +YY+ L  ISVG+  L   S+       
Sbjct: 174 TQTSPLLFGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSV 233

Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVM--SNMIKAQPVKGVGAEPGFSDVLC---YNI 298
              G IF D+G   T L    +  + + M  S M  ++ +  +         LC   +  
Sbjct: 234 GGAGTIF-DSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDDISRLD-----LCLSGFPK 287

Query: 299 SSQPKFPEVTIHFRGADVKLSPSNLFRNI-SDEIMCSAFRGGNANIVYGRIMQINFLIGY 357
              P  P +T HF G D+ L PSN F  + S +  C A        + G + Q NF + Y
Sbjct: 288 DQLPTVPAMTFHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYY 347

Query: 358 DIEQAMVSFKPSRC 371
           D     + F P  C
Sbjct: 348 DTAGRKLGFVPKDC 361


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 109/352 (30%), Positives = 166/352 (47%), Gaps = 27/352 (7%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+FDPK SS+Y ++SCSS
Sbjct: 117 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPC-RVSCHRQSGPVFDPKTSSSYAAVSCSS 175

Query: 95  SQC---AVVTSN---CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC   +  T N   CS  + C Y   YG    +SFS G L+ +T++F + S     +PN
Sbjct: 176 PQCDGLSTATLNPAVCSPSNVCIYQASYGD---SSFSVGYLSKDTVSFGANS-----VPN 227

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G++GL     SL+ Q+  ++   FSYCLP   SS     G
Sbjct: 228 FYYGCGQDNEG---LFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGYLSIG 284

Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
                G   TP++   + D  Y++SL  ++V  + L   SS   ++   +D+G + T LP
Sbjct: 285 SYNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLP 344

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSP 320
              ++ L   ++  +K    K   A            S     P V++ F  GA +KLS 
Sbjct: 345 TSVYTALSKAVAAAMKGS-TKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSA 403

Query: 321 SNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            NL  ++     C AF    +  + G   Q  F + YD++   + F  + C+
Sbjct: 404 GNLLVDVDGATTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 125/360 (34%), Positives = 173/360 (48%), Gaps = 38/360 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  D+    DTGSD TWTQCEPC    C+KQ+  +FDP KSS+Y +I+C+S
Sbjct: 136 YFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAG-SCYKQQDAIFDPSKSSSYINITCTS 194

Query: 95  SQCAVVT-----SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S C  +T     S CS     C Y   YG     S S G L+ E LT  +T      + +
Sbjct: 195 SLCTQLTSAGIKSRCSSSTTACIYGIQYGD---KSTSVGFLSQERLTITATD----IVDD 247

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KINF 205
            +FGCG  N       S   G+IGLG    S + Q  +     FSYCLP   SS   + F
Sbjct: 248 FLFGCGQDNEGLF---SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLPSTSSSLGHLTF 304

Query: 206 GGIVA-GAGVVSTPL--IIRDH--YYLSLEAISVGNQRLEFVSSST---GNIFVDTGVLR 257
           G   A  A +  TPL  I  D+  Y L +  ISVG  +L  VSSST   G   +D+G + 
Sbjct: 305 GASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSAGGSIIDSGTVI 364

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA- 314
           T L    ++ L+S     ++  PV     E G  D  CY+ S   +   P++   F G  
Sbjct: 365 TRLAPTAYAALRSAFRQGMEKYPVAN---EDGLFDT-CYDFSGYKEISVPKIDFEFAGGV 420

Query: 315 DVKLSPSNLFRNISDEIMCSAFRG-GNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V+L    +    S + +C AF   GN N   ++G + Q    + YD+E   + F  + C
Sbjct: 421 TVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRIGFGAAGC 480


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQC+PC    C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 133 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-TCYDQKEPIFNPSKSTSYYNVSCSS 191

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  ++S      +CS  +C Y   YG     SFS G LA +  T  S+         V
Sbjct: 192 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKDKFTLTSSD----VFDGV 244

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
            FGCG  N    T  +   G++GLG    S  SQ  T+    FSYCLP   S    + FG
Sbjct: 245 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 301

Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
                  V  TP+  I D    Y L++ AI+VG Q+L   S+  ST    +D+G + T L
Sbjct: 302 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 361

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
           P + ++ L+S     +   P        G S +  C+++S       P+V   F  GA V
Sbjct: 362 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 416

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +L    +F       +C AF G + +    ++G + Q    + YD     V F P+ C+
Sbjct: 417 ELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 475


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQC+PC    C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 104 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVR-TCYDQKEPIFNPSKSTSYYNVSCSS 162

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  ++S      +CS  +C Y   YG     SFS G LA E  T  ++         V
Sbjct: 163 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKEKFTLTNSD----VFDGV 215

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
            FGCG  N    T  +   G++GLG    S  SQ  T+    FSYCLP   S    + FG
Sbjct: 216 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 272

Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
                  V  TP+  I D    Y L++ AI+VG Q+L   S+  ST    +D+G + T L
Sbjct: 273 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 332

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
           P + ++ L+S     +   P        G S +  C+++S       P+V   F  GA V
Sbjct: 333 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 387

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +L    +F       +C AF G + +    ++G + Q    + YD     V F P+ C+
Sbjct: 388 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 121/399 (30%), Positives = 189/399 (47%), Gaps = 71/399 (17%)

Query: 21  SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ--EPPL 78
           S+  QA++ +    Y M++S+GTPP+D    VDTGS+  W QC PC    CF +    P+
Sbjct: 77  SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTR--CFPRPTPAPV 134

Query: 79  FDPKKSSTYNSISCSSSQCAVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATET 132
             P +SST++ + C+ S C  + ++       +   C+Y++ YG G    +++G LATET
Sbjct: 135 LQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG----YTAGYLATET 190

Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           LT    +      P V FGC  +N    +S     GI+GLG G  SL+SQ+     G+FS
Sbjct: 191 LTVGDGT-----FPKVAFGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAV---GRFS 237

Query: 193 YCL----PDQGSSKINFGGIVA---GAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
           YCL     D G+S I FG +     G+ V STPL+         HYY++L  I+V +  L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297

Query: 240 EFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGVGAE 287
               S+         G   VD+G   T L  + ++ +K    S M+N+ +  P  G    
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA--- 354

Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDE------IMCSA 335
             +   LCY  S+       + P + + F  GA   +   N F  +  +      + C  
Sbjct: 355 -PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLL 413

Query: 336 FRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                 ++   + G +MQ++  + YDI+  M SF P+ C
Sbjct: 414 VLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 172/366 (46%), Gaps = 48/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGS   WTQC+PC    CF Q  P +D  +SST+   SC S
Sbjct: 35  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 92

Query: 95  SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           +QC +  S     N +   C+YS+ YG     S + G L  ET++F + +     +P V+
Sbjct: 93  TQCKLDPSVTMCVNQTVQTCAYSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 145

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
           FGCG  N       S +TGI G G G  SL SQ+     G FS+C       K       
Sbjct: 146 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTAVSGRKPSTVLFD 200

Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
           +       G G V T  +I++      YYLSL+ I+VG+ RL    S+      TG   +
Sbjct: 201 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 260

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
           D+G   T LP   +  +    +  +K   V      P    +LC++   +   P  P++ 
Sbjct: 261 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 316

Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           +HF GA + L   N      D   CS   A   G   I+ G   Q N  + YD++ + +S
Sbjct: 317 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 375

Query: 366 FKPSRC 371
           F  ++C
Sbjct: 376 FVRAKC 381


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 117/359 (32%), Positives = 173/359 (48%), Gaps = 37/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQC+PC    C+ Q+ P+F+P KS++Y ++SCSS
Sbjct: 132 YIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSKSTSYYNVSCSS 190

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  ++S      +CS  +C Y   YG     SFS G LA E  T  ++         V
Sbjct: 191 AACGSLSSATGNAGSCSASNCIYGIQYGD---QSFSVGFLAKEKFTLTNSD----VFDGV 243

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
            FGCG  N    T  +   G++GLG    S  SQ  T+    FSYCLP   S    + FG
Sbjct: 244 YFGCGENNQGLFTGVA---GLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYTGHLTFG 300

Query: 207 GIVAGAGVVSTPL-IIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
                  V  TP+  I D    Y L++ AI+VG Q+L   S+  ST    +D+G + T L
Sbjct: 301 SAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSGTVITRL 360

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADV 316
           P + ++ L+S     +   P        G S +  C+++S       P+V   F  GA V
Sbjct: 361 PPKAYAALRSSFKAKMSKYPT-----TSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAVV 415

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +L    +F       +C AF G + +    ++G + Q    + YD     V F P+ C+
Sbjct: 416 ELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 474


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 172/356 (48%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M  SIGTPP  +    DTGSD  WT+C+              + P  SST+  + CS 
Sbjct: 100 YDMEFSIGTPPQKLTALADTGSDLIWTKCDAGGGAA--WGGSSSYHPNASSTFTRLPCSD 157

Query: 95  SQCAVVTS----NCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             CA + S     C+ G  +C Y + YG G    F+ G L +ET T    +     +P V
Sbjct: 158 RLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDA-----VPGV 212

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFG 206
            FGC     A      +  G++GLG G  SL+SQ+    AG F YCL    S  S + FG
Sbjct: 213 GFGC---TTALEGDYGEGAGLVGLGRGPLSLVSQLD---AGTFMYCLTADASKASPLLFG 266

Query: 207 GIV----AGAGVVSTPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
            +     AGAGV ST L+    +Y ++L +I++G+     V    G +F D+G   T L 
Sbjct: 267 ALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPGGVVF-DSGTTLTYLA 325

Query: 262 LEYHSNLKSV-MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-PEVTIHFRG-ADVKL 318
              ++  K+  +S      PV+G     GF    CY      +  P + +HF G AD+ L
Sbjct: 326 EPAYTEAKAAFLSQTTSLTPVEG---RYGFE--ACYEKPDSARLIPAMVLHFDGGADMAL 380

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
             +N    + D ++C   +   +  + G IMQ+N+L+ +D+ ++++SF+P+ C +Y
Sbjct: 381 PVANYVVEVDDGVVCWVVQRSPSLSIIGNIMQMNYLVLHDVRKSVLSFQPANCDSY 436


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 172/366 (46%), Gaps = 48/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGS   WTQC+PC    CF Q  P +D  +SST+   SC S
Sbjct: 91  YLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAV--CFNQSLPYYDASRSSTFALPSCDS 148

Query: 95  SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           +QC +  S     N +   C+YS+ YG     S + G L  ET++F + +     +P V+
Sbjct: 149 TQCKLDPSVTMCVNQTVQTCAYSYSYGD---KSATIGFLDVETVSFVAGA----SVPGVV 201

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------- 202
           FGCG  N       S +TGI G G G  SL SQ+     G FS+C       K       
Sbjct: 202 FGCGLNNTG--IFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFD 256

Query: 203 INFGGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS------TGNIFV 251
           +       G G V T  +I++      YYLSL+ I+VG+ RL    S+      TG   +
Sbjct: 257 LPADLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTII 316

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN---ISSQPKFPEVT 308
           D+G   T LP   +  +    +  +K   V      P    +LC++   +   P  P++ 
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP----LLCFSAPPLGKAPHVPKLV 372

Query: 309 IHFRGADVKLSPSNLFRNISDEIMCS---AFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           +HF GA + L   N      D   CS   A   G   I+ G   Q N  + YD++ + +S
Sbjct: 373 LHFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTII-GNFQQQNMHVLYDLKNSKLS 431

Query: 366 FKPSRC 371
           F  ++C
Sbjct: 432 FVRAKC 437


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 175/376 (46%), Gaps = 58/376 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y +   +GTPP      VD+GSD  W QC PC  L C+ Q+ PL+ P  SST+N + C S
Sbjct: 65  YFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVPCLS 122

Query: 95  SQCAVVTSN----CS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            +C ++ +     C     G C+Y + Y   A  S S G  A E+ T +      V +  
Sbjct: 123 PECLLIPATEGFPCDFHYPGACAYEYRY---ADTSLSKGVFAYESATVDD-----VRIDK 174

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSK 202
           V FGCG  N     S +   G++GLG G  S  SQ+G +   KF+YCL     P   SS 
Sbjct: 175 VAFGCGRDNQG---SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSW 231

Query: 203 INFGG--IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST--------GN 248
           + FG   I     +  TP++        YY+ +E + VG + L    S+         G+
Sbjct: 232 LIFGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGS 291

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNIS--SQPKF 304
           IF     +   LP  Y + L +   N+   +A  V+G+         LC +++   QP F
Sbjct: 292 IFDSGTTVTYWLPPAYRNILAAFDKNVRYPRAASVQGLD--------LCVDVTGVDQPSF 343

Query: 305 PEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG-----GNANIVYGRIMQINFLIGYD 358
           P  TI   G  V +    N F +++  + C A  G     G  N + G ++Q NFL+ YD
Sbjct: 344 PSFTIVLGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTI-GNLLQQNFLVQYD 402

Query: 359 IEQAMVSFKPSRCTNY 374
            E+  + F P++C+++
Sbjct: 403 REENRIGFAPAKCSSH 418


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 175/371 (47%), Gaps = 46/371 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ L+IGTPP  +   +DTGSD  WTQC PC    C  Q  PLF P  SS+Y  + CS 
Sbjct: 103 YLIDLAIGTPPQPVSALLDTGSDLIWTQCAPCAS--CLAQPDPLFAPAASSSYVPMRCSG 160

Query: 95  SQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  ++  +C   D C+Y + YG G   + + G  ATE  TF S+SG  + +P + FGC
Sbjct: 161 QLCNDILHHSCQRPDTCTYRYNYGDG---TTTLGVYATERFTFASSSGEKLSVP-LGFGC 216

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN-------- 204
           G  N+ S  + S   GI+G G    SL+SQ+      +FSYCL    S++ +        
Sbjct: 217 GTMNVGSLNNGS---GIVGFGRDPLSLVSQLSIR---RFSYCLTPYTSTRKSTLMFGSLS 270

Query: 205 ---FGGIVAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS-------TGNI 249
              F G  A  G V T  +++       YY+    ++VG +RL    S+       +G +
Sbjct: 271 DGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSGGV 330

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIK------AQPVKGVG-AEPGFSDVLCYNISSQP 302
            VD+G   TL P    + +       ++      + P  GV  A P  +     + ++  
Sbjct: 331 IVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRASAATVV 390

Query: 303 KFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFLIGYDIE 360
             P +  HF+GAD++L   N +  +     +C      G++    G  +Q +  + YD+E
Sbjct: 391 SVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRVLYDLE 450

Query: 361 QAMVSFKPSRC 371
              +SF P++C
Sbjct: 451 AETLSFAPAQC 461


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 163/340 (47%), Gaps = 32/340 (9%)

Query: 48  IFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NC 104
           +F  +DTGSD TW QC+PCP+  C+KQ+  LF P  S+TY  + C+S+ C  + S   +C
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQ--CYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSC 58

Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
               C+Y   YG     S + G+ A ETLT  S   + V +PN  FGCGH N       +
Sbjct: 59  LNSSCNYMVSYGD---KSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKG---LFN 112

Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS----KINFG-GIVAGAGVVSTPL 219
              G++GLG  +    +Q   +    FSYCLP   S+     ++FG   +    V  TPL
Sbjct: 113 GAAGLMGLGKSSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPL 172

Query: 220 IIR----DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM 275
           +        Y++S+  I+VG++ L      +  + VD+G + +      +  L+   + +
Sbjct: 173 VDSSSGPSQYFVSMTGINVGDELLPI----SATVMVDSGTVISRFEQSAYERLRDAFTQI 228

Query: 276 IKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR-GADVKLSPSNLFRNISDEIM 332
           +       V   P  +   C+ +S+      P +T+HFR  A+++LSP ++   + D +M
Sbjct: 229 LPGLQT-AVSVAPFDT---CFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVM 284

Query: 333 CSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           C AF    +   V G   Q N    YDI ++ +      C
Sbjct: 285 CFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFEC 324


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  144 bits (364), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 162/360 (45%), Gaps = 40/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP   ++  +DTGSD  W QC PC +  C+ Q  P+FDP KS TY  I C +
Sbjct: 129 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRK--CYTQADPVFDPTKSRTYAGIPCGA 186

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   N     C Y   YG G   SF+ G+ +TETLTF  T      +  V  G
Sbjct: 187 PLCRRLDSPGCNNKNKVCQYQVSYGDG---SFTFGDFSTETLTFRRT-----RVTRVALG 238

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
           CGH N       +   G+        S   Q G     KFSYCL D+ +S     + FG 
Sbjct: 239 CGHDNEGLFIGAAGLLGLGRG---RLSFPVQTGRRFNQKFSYCLVDRSASAKPSSVVFGD 295

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPLI    +   YYL L  ISVG   +  +S+S         G + +D+G 
Sbjct: 296 SAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGT 355

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRG 313
             T L    +  L+      + A  +K   AE    D  C+++S  ++ K P V +HFRG
Sbjct: 356 SVTRLTRPAYIALRDAFR--VGASHLK-RAAEFSLFDT-CFDLSGLTEVKVPTVVLHFRG 411

Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           ADV L  +N    + +    C AF G  + + + G I Q  F + +D+  + V F P  C
Sbjct: 412 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRGC 471


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 117/352 (33%), Positives = 165/352 (46%), Gaps = 32/352 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC PC    C+ Q+ PLFDP +SS+Y ++ C  
Sbjct: 140 YVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGG 199

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C    +  S+CS   C Y   YG G   S ++G  +++TLT +        +    FG
Sbjct: 200 PVCGGLGIYASSCSAAQCGYVVSYGDG---SKTTGVYSSDTLTLSPND----AVRGFFFG 252

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CGH       +D    G++GLG   +SL+ Q   +  G FSYCLP + S+   +  GG  
Sbjct: 253 CGHAQSGFTGND----GLLGLGREEASLVEQTAGTYGGVFSYCLPTRPSTTGYLTLGGPS 308

Query: 210 AGA--GVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPL 262
             A  G  +T L+       +Y + L  ISVG Q+L   SS   G   VDTG + T LP 
Sbjct: 309 GAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVDTGTVITRLPP 368

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLS 319
             ++ L+S   + + +       A  G  D  CYN S       P V + F  GA V L 
Sbjct: 369 TAYAALRSAFRSGMASYGYPSAPAT-GILDT-CYNFSGYGTVTLPNVALTFSGGATVTLG 426

Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 427 ADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 474


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 167/373 (44%), Gaps = 52/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C++Q  P+FDP++SS+Y ++ C +
Sbjct: 129 YFTKIGVGTPATQALMVLDTGSDVVWVQCAPCRR--CYEQSGPVFDPRRSSSYGAVGCGA 186

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + C  + S   +   G C Y   YG G   S ++G+  TETLTF   +     +  V  G
Sbjct: 187 ALCRRLDSGGCDLRRGACMYQVAYGDG---SVTAGDFVTETLTFAGGA----RVARVALG 239

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------G 199
           CGH N     + +   G+        S  +Q+       FSYCL D+             
Sbjct: 240 CGHDNEGLFVAAAGLLGLGRG---GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHR 296

Query: 200 SSKINFGGIVAGAGVVS-TPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------- 245
           SS ++FG    GA   S TP++    +   YY+ L  ISVG  R+  V+ S         
Sbjct: 297 SSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 356

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP- 302
            G + VD+G   T L    +S L+    +  +A    G+   PG   +   CY++  +  
Sbjct: 357 RGGVIVDSGTSVTRLARASYSALR----DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRV 412

Query: 303 -KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
            K P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F + +D
Sbjct: 413 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 472

Query: 359 IEQAMVSFKPSRC 371
            +   V F P  C
Sbjct: 473 GDGQRVGFAPKGC 485


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 110/352 (31%), Positives = 164/352 (46%), Gaps = 32/352 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +  GTP        DTGSD  W QC+PC  + C+ Q+ PLFDP  SSTY ++SC+ 
Sbjct: 16  YVITVGFGTPTRTQTVVFDTGSDVNWLQCKPC-AVRCYAQQEPLFDPSLSSTYRNVSCTE 74

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  + T  CS   C Y   YG G   S + G LA +T           +  N IFGCG
Sbjct: 75  PACVGLSTRGCSSSTCLYGVFYGDG---SSTIGFLAMDTFMLTPAQ----KFKNFIFGCG 127

Query: 154 HKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSK--INFGGI-- 208
             N       +   G++GLG  ++ SL SQ+  S+   FSYCLP   S+   +N G    
Sbjct: 128 QNNTGLFQGTA---GLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQN 184

Query: 209 VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYH 265
             G   + T   +   Y++ L  ISVG  RL   S+   S G I +D+G + T LP   +
Sbjct: 185 TPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTI-IDSGTVITRLPPTAY 243

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFRGADVKLSPSN 322
           S LK+ +   +    +      P  + +  CY+ S      +P + +HF G DV++  + 
Sbjct: 244 SALKTAVRAAMTQYTL-----APAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATG 298

Query: 323 LFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +F   +   +C AF G   +    + G + Q+   + YD E   + F    C
Sbjct: 299 VFFVFNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  144 bits (363), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 175/366 (47%), Gaps = 45/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++++ +GTP  D+    DTGSD TWTQC+PC +  C+ Q+ P+FDP  S TY++ISC+S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK-SCYAQQQPIFDPSTSKTYSNISCTS 212

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C+ + S       CS  +C Y   YG    +SF+ G  A + LT              
Sbjct: 213 AACSSLKSATGNSPGCSSSNCVYGIQYGD---SSFTIGFFAKDKLTLTQND----VFDGF 265

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSS-KINFG 206
           +FGCG  N        K  G+IGLG    S++ Q        FSYCLP  +GS+  + FG
Sbjct: 266 MFGCGQNNKGLF---GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFG 322

Query: 207 ---GIVAGA----GVVSTPLIIRD---HYYLSLEAISVGNQRLE---FVSSSTGNIFVDT 253
              G+ A      G+  TP        +Y++ +  ISVG + L     +  + G I +D+
Sbjct: 323 NGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNAGTI-IDS 381

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
           G + T LP   + +LKS     +   P       P  S +  CY++S  +    P+++ +
Sbjct: 382 GTVITRLPSTAYGSLKSAFKQFMSKYPT-----APALSLLDTCYDLSNYTSISIPKISFN 436

Query: 311 FRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSF 366
           F G A+V+L P+ +        +C AF G   +    ++G I Q    + YD+    + F
Sbjct: 437 FNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGF 496

Query: 367 KPSRCT 372
               C+
Sbjct: 497 GYKGCS 502


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 119/390 (30%), Positives = 179/390 (45%), Gaps = 55/390 (14%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKK 83
           AE++S    YLM + +GTPPV +    DTGSD  W +C+   + D     PP   F P  
Sbjct: 101 AEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKG-KDNDNNSTAPPSVYFVPSA 159

Query: 84  SSTYNSISCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           SSTY  + C +  C  ++S  S   +G C Y + YG G+ A   SG L+TET TF++ + 
Sbjct: 160 SSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSRA---SGQLSTETFTFSTIAD 216

Query: 141 -----------------LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
                              VE+  + FGC      +  +D           G  SL SQ+
Sbjct: 217 SSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLVGLGG----GPVSLASQL 272

Query: 184 G--TSIAGKFSYCLP----DQGSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAI 232
           G  TS+  KFSYCL        SS +NFG   +V+  G  STPLI   +  +Y ++L++I
Sbjct: 273 GATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSI 332

Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
           +V   +    +++  +I VD+G   T L     + L   ++  IK    +     P    
Sbjct: 333 NVAGTKRP-TTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAE----SPEKIL 387

Query: 293 VLCYNIS-----SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI--- 343
            LCY+IS          P+VT+    G +V L P N F  + + ++C A    +      
Sbjct: 388 DLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVS 447

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           + G I Q N  +GYD+E+  V+F  + C  
Sbjct: 448 ILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 176/355 (49%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q   LFDP +SSTY ++SC++
Sbjct: 180 YVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVV-CYEQREKLFDPARSSTYANVSCAA 238

Query: 95  SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +    CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 239 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 291

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG     
Sbjct: 292 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLA 348

Query: 212 AGV--VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           A    ++TP++  +    YY+ +  I VG Q L    S  +T    VD+G + T LP   
Sbjct: 349 AASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+   +  + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 409 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 465

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 466 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 178/369 (48%), Gaps = 34/369 (9%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           Y+A ++S    +L++ SIG PPV  +  +DTGS  TW QCEPC  ++C +Q+ PL++P  
Sbjct: 99  YEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYNPSS 156

Query: 84  SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           SSTY S S         T+     DC+YS  Y   A  + + G  A E L F +      
Sbjct: 157 SSTYVSCSDFDRTDTTFTATHGS-DCNYSQTY---ADKTTTRGTYAREQLLFETPDDGIT 212

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI 203
            M +VIFGCGH N   P      +G+ GLG   SS+IS++G      FSYC+ + G    
Sbjct: 213 IMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFG----FSYCIGNIGDPLY 268

Query: 204 NFGGIVAGAGV----VSTPLIIRDHYYLSLEAISVGNQRLEF---------VSSSTGNIF 250
            F  +  G  +     STPL+ R  YY++L  IS+G +RL+          ++  +  I 
Sbjct: 269 GFHRLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIV 328

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY----NISSQPKFPE 306
           +D+G   + +P + ++ ++  +S+++     +        S  LCY    N   Q  FP+
Sbjct: 329 IDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYIARHLS--LCYIGKLNQDLQ-GFPD 385

Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQA 362
            T H   GAD+      LF   +D ++C A           + G + Q  + + YD++Q 
Sbjct: 386 ATFHLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQ 445

Query: 363 MVSFKPSRC 371
            + F+   C
Sbjct: 446 KLYFQRIEC 454


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 121/401 (30%), Positives = 186/401 (46%), Gaps = 62/401 (15%)

Query: 7   LPFYNDNETPKSPISIIYQAEIISVDD-----------IYLMHLSIGTPPVDIFGSVDTG 55
           L   N ++    PIS +Y  E   ++             Y   + IG P  +++  +DTG
Sbjct: 109 LAINNISKADLKPISTMYTTEEQDIEAPLISGTTQGSGEYFTRVGIGKPAREVYMVLDTG 168

Query: 56  SDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFL 114
           SD  W QC PC   DC+ Q  P+F+P  SS+Y  +SC + QC A+  S C    C Y   
Sbjct: 169 SDVNWLQCTPCA--DCYHQTEPIFEPSSSSSYEPLSCDTPQCNALEVSECRNATCLYEVS 226

Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN---LASPTSDSKQTGIIG 171
           YG G+Y   + G+ ATETLT  ST      + NV  GCGH N              G + 
Sbjct: 227 YGDGSY---TVGDFATETLTIGST-----LVQNVAVGCGHSNEGLFVGAAGLLGLGGGLL 278

Query: 172 LGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLIIRDH---- 224
             P      SQ+ T+    FSYCL D+ S   S ++FG  ++   VV+ PL +R+H    
Sbjct: 279 ALP------SQLNTT---SFSYCLVDRDSDSASTVDFGTSLSPDAVVA-PL-LRNHQLDT 327

Query: 225 -YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNM 275
            YYL L  ISVG + L+   SS       +G I +D+G   T L  E +++L+ S +   
Sbjct: 328 FYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGT 387

Query: 276 IKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPSNLFRNISDEI-- 331
           +  +   GV          CYN+S++   + P V  HF G  +   P+  +    D +  
Sbjct: 388 LDLEKAAGVAMFD-----TCYNLSAKTTVEVPTVAFHFPGGKMLALPAKNYMIPVDSVGT 442

Query: 332 MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            C AF    +++ + G + Q    + +D+  +++ F  ++C
Sbjct: 443 FCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 110/367 (29%), Positives = 173/367 (47%), Gaps = 53/367 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP       VD+GS  TW QC PC  + C  Q  PL+DP+ SSTY ++ CS+
Sbjct: 108 YITRLGLGTPTTTYVMVVDSGSSLTWLQCAPC-AVSCHPQAGPLYDPRASSTYAAVPCSA 166

Query: 95  SQCAVVT------SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QCA +       S+CS  G C Y   YG G   SFS G L+ +T++ +S+       P 
Sbjct: 167 PQCAELQAATLNPSSCSGSGVCQYQASYGDG---SFSFGYLSKDTVSLSSSG----SFPG 219

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---IN 204
             +GCG  N+       +  G+IGL     SL+SQ+  S+   F+YCLP   ++    ++
Sbjct: 220 FYYGCGQDNVG---LFGRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAASAGYLS 276

Query: 205 FG--------GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTG 254
           FG        G  +   +VS+ L     Y++SL  +SV    L   SS  G++   +D+G
Sbjct: 277 FGSNSDNKNPGKYSYTSMVSSSL-DASLYFVSLAGMSVAGSPLAVPSSEYGSLPTIIDSG 335

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPE 306
            + T LP   ++ L             K VGA         Y+I         ++   P 
Sbjct: 336 TVITRLPTPVYTALS------------KAVGAALAAPSAPAYSILQTCFKGQVAKLPVPA 383

Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           V + F  GA ++L+P N+  ++++   C AF   ++  + G   Q  F + YD++ + + 
Sbjct: 384 VNMAFAGGATLRLTPGNVLVDVNETTTCLAFAPTDSTAIIGNTQQQTFSVVYDVKGSRIG 443

Query: 366 FKPSRCT 372
           F    C+
Sbjct: 444 FAAGGCS 450


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 176/376 (46%), Gaps = 58/376 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y +   +GTPP      VD+GSD  W QC PC +  C+ Q+ PL+ P  SST++ + C S
Sbjct: 64  YFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQ--CYAQDSPLYVPSNSSTFSPVPCLS 121

Query: 95  SQCAVVTSN----CS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S C ++ +     C     G C+Y +LY   A  S S G  A E+ T +      V +  
Sbjct: 122 SDCLLIPATEGFPCDFRYPGACAYEYLY---ADTSSSKGVFAYESATVDG-----VRIDK 173

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSSK 202
           V FGCG  N     S +   G++GLG G  S  SQ+G +   KF+YCL     P   SS 
Sbjct: 174 VAFGCGSDNQG---SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSS 230

Query: 203 INFGG--IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST--------GN 248
           + FG   I     +  TP++        YY+ +E ++VG + L    S+         G+
Sbjct: 231 LIFGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGS 290

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNIS--SQPKF 304
           IF     L    P  Y   L +  S +   +A+ V+G+         LC  ++   QP F
Sbjct: 291 IFDSGTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD--------LCVELTGVDQPSF 342

Query: 305 PEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG-----GNANIVYGRIMQINFLIGYD 358
           P  TI F  GA  +    N F +++  + C A  G     G  N + G ++Q NF + YD
Sbjct: 343 PSFTIEFDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTI-GNLLQQNFFVQYD 401

Query: 359 IEQAMVSFKPSRCTNY 374
            E+ ++ F P++C+++
Sbjct: 402 REENLIGFAPAKCSSH 417


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 109/302 (36%), Positives = 162/302 (53%), Gaps = 28/302 (9%)

Query: 91  SCSSSQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           SC S  C  + T  CS E  C+Y++ YG     S + G LA +T TF S +G  V +   
Sbjct: 20  SCDSPLCHKLDTGVCSPEKRCNYTYGYGDN---SLTKGVLAQDTATFTSNTGKLVSLSRF 76

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPD-----QGSSK 202
           +FGCGH N      +  + G+IGLG G +SLISQ+G    GK FS CL       + SS+
Sbjct: 77  LFGCGHNNTGG--FNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSR 134

Query: 203 INFG--GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS-STGNIFVDTGV 255
           ++FG    V G GVV+TPL+ R+     Y+++L  ISV +  L   S+   GN+ VD+G 
Sbjct: 135 MSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 194

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGAD 315
              +LP + +  +   + N +   P++ +  +P     LCY   +  K P +T HF GA+
Sbjct: 195 PPNILPQQLYDRVYVEVKNNV---PLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGAN 251

Query: 316 VKLSPSNLFRNISDE---IMCSAFRG-GNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           + L+P   F   + E   + C A     N+N  VYG   Q N+LIG+D+++ +VSFK + 
Sbjct: 252 LLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATD 311

Query: 371 CT 372
           CT
Sbjct: 312 CT 313


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC  + C++Q+  LFDP +SSTY ++SC++
Sbjct: 178 YVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVV-CYEQQEKLFDPVRSSTYANVSCAA 236

Query: 95  SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C+ +    CS G C Y   YG G+Y   S G  A +TLT +S       +    FGCG
Sbjct: 237 PACSDLNIHGCSGGHCLYGVQYGDGSY---SIGFFAMDTLTLSSYD----AVKGFRFGCG 289

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFGGIVAG 211
            +N        +  G++GLG G +SL  Q      G F++CLP +  G+  ++FG     
Sbjct: 290 ERNEG---LFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPA 346

Query: 212 AGV--VSTPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
           A    ++TP++  +    YY+ +  I VG Q L    S  +T    VD+G + T LP   
Sbjct: 347 AASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 406

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSP 320
           +S+L+   +  + A   +G    P  S +  CY+ +  SQ   P V++ F+ GA + +  
Sbjct: 407 YSSLRYAFAAAMAA---RGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDA 463

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +    S   +C AF     GG+  IV G      F + YDI + +V F P  C
Sbjct: 464 SGIMYAASASQVCLAFAANEDGGDVGIV-GNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 161/361 (44%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP + +  +D+GSD  W QC+PC +  C+ Q  P+FDP  S+++  + CSS
Sbjct: 142 YFIRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQ--CYHQTDPVFDPADSASFMGVPCSS 199

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  + C  G C Y  +YG G+Y   + G LA ETLTF  T      + NV  GCG
Sbjct: 200 SVCERIENAGCHAGGCRYEVMYGDGSY---TKGTLALETLTFGRTV-----VRNVAIGCG 251

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
           H+N       +   G+ G    + SL+ Q+G    G FSYCL  +G   +  + FG    
Sbjct: 252 HRNRGMFVGAAGLLGLGGG---SMSLVGQLGGQTGGAFSYCLVSRGTDSAGSLEFGRGAM 308

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PLI        YY+ L  + VG  ++       +      G + +DTG   T 
Sbjct: 309 PVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTR 368

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISS--QPKFPEVTIHFRG 313
           +P   +   +           +   G  P  S V     CYN++     + P V+ +F G
Sbjct: 369 IPTVAYVAFRDAF--------IGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAG 420

Query: 314 ADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
             +   P+  F    D++   C AF    + + + G I Q    I +D     V F P+ 
Sbjct: 421 GPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 480

Query: 371 C 371
           C
Sbjct: 481 C 481


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 115/358 (32%), Positives = 166/358 (46%), Gaps = 55/358 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PCP   CF Q  P FDP  SST +  SC S
Sbjct: 89  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 146

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
           + C  +          ++F+   GA AS                      +P V FGCG 
Sbjct: 147 TLCQGLPVASLPRSDKFTFV---GAGAS----------------------VPGVAFGCGL 181

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGSSKINFGG 207
            N  +    S +TGI G G G  SL SQ+     G FS+C       +P      +    
Sbjct: 182 FN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADL 236

Query: 208 IVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRL-----EF-VSSSTGNIFVDTGVL 256
              G G V T  +I++      YYLSL+ I+VG+ RL     EF + + TG   +D+G  
Sbjct: 237 FSNGQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTA 296

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADV 316
            T LP   +  ++   +  +K   V G   +P F   L   + ++P  P++ +HF GA +
Sbjct: 297 MTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVLHFEGATM 354

Query: 317 KLSPSNLFRNISD---EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N    + D    I+C A   G      G   Q N  + YD++ + +SF P++C
Sbjct: 355 DLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/373 (31%), Positives = 177/373 (47%), Gaps = 41/373 (10%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL-FDPKKS 84
           ++++S    YLM +++G+PP  +    DTGSD  W +C+           P   FDP +S
Sbjct: 92  SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151

Query: 85  STYNSISCSSSQC-AVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNS----T 138
           STY  +SC +  C A+  + C +G +C+Y + YG G   S ++G L+TET TF+      
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDG---SNTTGVLSTETFTFDDGGSGR 208

Query: 139 SGLPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL 195
           S   V +  V FGC      S P       G         SL++Q+G  TS+  +FSYCL
Sbjct: 209 SPRQVRVGGVKFGCSTATAGSFPADGLVGLGGG-----AVSLVTQLGGATSLGRRFSYCL 263

Query: 196 PDQ---GSSKINFGGI--VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSSTG 247
                  SS +NFG +  V   G  STPL+  D   +Y + L+++ VGN+ +   +SS  
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASAASS-- 321

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----P 302
            I VD+G   T L       +   +S  I   PV+     P     LCYN++ +      
Sbjct: 322 RIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ----SPDGLLQLCYNVAGREVEAGE 377

Query: 303 KFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYD 358
             P++T+ F  GA V L P N F  + +  +C A           + G + Q N  +GYD
Sbjct: 378 SIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYD 437

Query: 359 IEQAMVSFKPSRC 371
           ++   V+F  + C
Sbjct: 438 LDAGTVTFAGADC 450


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/361 (32%), Positives = 163/361 (45%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP   ++  +DTGSD  W QC PC +  C+ Q   +FDP KS TY  I C +
Sbjct: 118 YFTRIGVGTPARYVYMVLDTGSDVVWLQCAPCRK--CYTQTDHVFDPTKSRTYAGIPCGA 175

Query: 95  SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S  CS  +  C Y   YG G   SF+ G+ +TETLTF         +  V  G
Sbjct: 176 PLCRRLDSPGCSNKNKVCQYQVSYGDG---SFTFGDFSTETLTFRRN-----RVTRVALG 227

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGG 207
           CGH N    T  +   G+        S   Q G     KFSYCL D+ +S     + FG 
Sbjct: 228 CGHDNEGLFTGAAGLLGLGRG---RLSFPVQTGRRFNHKFSYCLVDRSASAKPSSVIFGD 284

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGV 255
                    TPLI    +   YYL L  ISVG   +  +S+S         G + +D+G 
Sbjct: 285 SAVSRTAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGT 344

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR 312
             T L    +  L+      I A  +K     P FS    C+++S  ++ K P V +HFR
Sbjct: 345 SVTRLTRPAYIALRDAFR--IGASHLK---RAPEFSLFDTCFDLSGLTEVKVPTVVLHFR 399

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           GADV L  +N    + +    C AF G  + + + G I Q  F I YD+  + V F P  
Sbjct: 400 GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRG 459

Query: 371 C 371
           C
Sbjct: 460 C 460


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 111/354 (31%), Positives = 166/354 (46%), Gaps = 33/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+FDPK SS+Y ++SCS+
Sbjct: 137 YVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPC-RVSCHRQSGPVFDPKTSSSYAAVSCST 195

Query: 95  SQC---AVVTSN---CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC   +  T N   CS  D C Y   YG    +SFS G L+ +T++F S S     +PN
Sbjct: 196 PQCNDLSTATLNPAACSSSDVCIYQASYGD---SSFSVGYLSKDTVSFGSNS-----VPN 247

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G++GL     SL+ Q+  ++   FSYCLP   SS     G
Sbjct: 248 FYYGCGQDNEG---LFGRSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSSSSSGYLSIG 304

Query: 208 IVAGAGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLP 261
                    TP++   + D  Y++ L  ++V  + L   SS   ++   +D+G + T LP
Sbjct: 305 SYNPGQYSYTPMVSSTLDDSLYFIKLSGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLP 364

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKL 318
              +  L   +     A  +KG      +S  D      +S  + P V++ F  GA +KL
Sbjct: 365 TTVYDALSKAV-----AGAMKGTKRADAYSILDTCFVGQASSLRVPAVSMAFSGGAALKL 419

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           S  NL  ++     C AF    +  + G   Q  F + YD++   + F    CT
Sbjct: 420 SAQNLLVDVDSSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSNRIGFAAGGCT 473


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 172/353 (48%), Gaps = 51/353 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+ ++IGTPP+ +   +DTGSD  WTQC+ PC    CF Q  PL+ P +S+TY ++SC 
Sbjct: 92  YLVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRR--CFPQPAPLYAPARSATYANVSCR 149

Query: 94  SSQCAVVT---SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S  C  +    S CS  D  C+Y F YG G   + + G LATET T  S +     +  V
Sbjct: 150 SPMCQALQSPWSRCSPPDTGCAYYFSYGDG---TSTDGVLATETFTLGSDTA----VRGV 202

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
            FGCG +NL S  + S   G++G+G G  SL+SQ+G +   +         S +      
Sbjct: 203 AFGCGTENLGSTDNSS---GLVGMGRGPLSLVSQLGVTRPRR---------SCRARAAAR 250

Query: 209 VAGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP 261
             GA   ++P          LE I+VG+  L              G + +D+G   T L 
Sbjct: 251 GGGAPTTTSP----------LEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALE 300

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKL- 318
                 L   +++ ++  P+   GA  G S  LC+  +S    + P + +HF GAD++L 
Sbjct: 301 ERAFVALARALASRVR-LPLAS-GAHLGLS--LCFAAASPEAVEVPRLVLHFDGADMELR 356

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             S +  + S  + C          V G + Q N  I YD+E+ ++SF+P++C
Sbjct: 357 RESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 115/361 (31%), Positives = 171/361 (47%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+P  D+    DTGSD TWTQCEPC    C++Q   +FDP  S +Y+++SC S
Sbjct: 147 YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGY-CYQQREHIFDPSTSLSYSNVSCDS 205

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  + S       CS   C Y   YG G+Y   S G  A E L+  ST        N 
Sbjct: 206 PSCEKLESATGNSPGCSSSTCLYGIRYGDGSY---SIGFFAREKLSLTSTD----VFNNF 258

Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKIN 204
            FGCG  N  L   T+     G++GL     SL+SQ        FSYCLP     +  ++
Sbjct: 259 QFGCGQNNRGLFGGTA-----GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSSTGYLS 313

Query: 205 FG-GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLR 257
           FG G      V  TP  +       Y+L +  ISVG ++L    S  ST    +D+G + 
Sbjct: 314 FGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDSGTVI 373

Query: 258 TLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
           + LP   +S+++ V   ++   P VKGV          CY++S     K P++ ++F  G
Sbjct: 374 SRLPPTVYSSVQKVFRELMSDYPRVKGVSILD-----TCYDLSKYKTVKVPKIILYFSGG 428

Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           A++ L+P  +   +    +C AF G + +    + G + Q    + YD  +  V F PS 
Sbjct: 429 AEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSG 488

Query: 371 C 371
           C
Sbjct: 489 C 489


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/350 (32%), Positives = 167/350 (47%), Gaps = 33/350 (9%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SSTY S+ CS+ QC+
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPKSSSTYASVGCSAQQCS 59

Query: 99  VV------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            +       S CS  + C Y   YG    +SFS G L+ +T++F STS     +PN  +G
Sbjct: 60  DLPSATLNPSACSSSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----LPNFYYG 111

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           CG  N        +  G+IGL     SL+ Q+  S+   F+YCLP   SS     G    
Sbjct: 112 CGQDNEG---LFGRSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNP 168

Query: 212 AGVVSTPLI---IRDH-YYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLLPLEYH 265
                TP++   + D  Y++ L  ++V    L   SS+  ++   +D+G + T LP   +
Sbjct: 169 GQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVY 228

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVKLSPSN 322
           S L   +     A  +KG      +S  D      +S+   P VT+ F  GA +KLS  N
Sbjct: 229 SALSKAV-----AAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQN 283

Query: 323 LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L  ++ D   C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 284 LLVDVDDSTTCLAFAPARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 173/366 (47%), Gaps = 45/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++++ +GTP  D+    DTGSD TWTQC+PC +  C+ Q+ P+FDP  S TY++ISC+S
Sbjct: 154 YIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVK-SCYAQQQPIFDPSASKTYSNISCTS 212

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C+ + S       CS  +C Y   YG    +SF+ G  A +TLT              
Sbjct: 213 TACSGLKSATGNSPGCSSSNCVYGIQYGD---SSFTVGFFAKDTLTLTQND----VFDGF 265

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSS-KINFG 206
           +FGCG  N        K  G+IGLG    S++ Q        FSYCLP  +GS+  + FG
Sbjct: 266 MFGCGQNNRG---LFGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYCLPTSRGSNGHLTFG 322

Query: 207 ---GI----VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLE---FVSSSTGNIFVDT 253
              G+        G+  TP         Y++ +  ISVG + L     +  + G I +D+
Sbjct: 323 NGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNAGTI-IDS 381

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
           G + T LP   + +LKS     +   P       P  S +  CY++S  +    P+++ +
Sbjct: 382 GTVITRLPSTVYGSLKSTFKQFMSKYPT-----APALSLLDTCYDLSNYTSISIPKISFN 436

Query: 311 FRG-ADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSF 366
           F G A+V L P+ +        +C AF G   +    ++G I Q    + YD+    + F
Sbjct: 437 FNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGF 496

Query: 367 KPSRCT 372
               C+
Sbjct: 497 GYKGCS 502


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 172/354 (48%), Gaps = 32/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD TW QC+PC    C++Q+ PLF P KS+TY +ISC+S
Sbjct: 165 YVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAY-CYQQKEPLFTPTKSATYANISCTS 223

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C+ + T  CS G C Y+  YG G+Y   + G  A +TLT    +     + +  FGCG
Sbjct: 224 SYCSDLDTRGCSGGHCLYAVQYGDGSY---TVGFYAQDTLTLGYDT-----VKDFRFGCG 275

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
            KN        K  G++GLG G +S+  Q     +G F+YC+P    G+  ++FG     
Sbjct: 276 EKNRG---LFGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYCIPATSSGTGFLDFGPGAPA 332

Query: 212 AGVVS-TPLIIRD---HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
           A     TP+++ +    YY+ +  I VG   L   ++  S     VD+G + T LP   +
Sbjct: 333 AANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAY 392

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP---KFPEVTIHFR-GADVKLSP 320
             L+S  +  ++     G    P FS +  CY+++        P V++ F+ GA + +  
Sbjct: 393 EPLRSAFAKGMEGL---GYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDA 449

Query: 321 SNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           S +         C AF   + +    + G   Q  + + YD+ + +V F P  C
Sbjct: 450 SGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 100/273 (36%), Positives = 145/273 (53%), Gaps = 37/273 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ L+IGTPP+     +DTGSD  WTQC PC  L C  Q  P FD KKS+TY ++ C S
Sbjct: 89  YLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRS 146

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S+CA ++S +C +  C Y + YG  A    ++G LA ET TF + +   V   N+ FGCG
Sbjct: 147 SRCASLSSPSCFKKMCVYQYYYGDTAS---TAGVLANETFTFGAANSTKVRATNIAFGCG 203

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFG---- 206
             N     + S   G++G G G  SL+SQ+G S   +FSYCL    S   S++ FG    
Sbjct: 204 SLNAGDLANSS---GMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYAN 257

Query: 207 ----GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFV 251
                  +G+ V STP +I     + Y+LSL+AIS+G + L             TG + +
Sbjct: 258 LSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVII 317

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
           D+G   T L  + +   ++V   ++ A P+  +
Sbjct: 318 DSGTSITWLQQDAY---EAVRRGLVSAIPLTAM 347


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 120/399 (30%), Positives = 188/399 (47%), Gaps = 71/399 (17%)

Query: 21  SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ--EPPL 78
           S+  QA++ +    Y M++S+GTPP+D    VDTGS+  W QC PC    CF +    P+
Sbjct: 77  SVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPCTR--CFPRPTPAPV 134

Query: 79  FDPKKSSTYNSISCSSSQCAVVTSNC------SEGDCSYSFLYGRGAYASFSSGNLATET 132
             P +SST++ + C+ S C  + ++       +   C+Y++ YG G    +++G LATET
Sbjct: 135 LQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG----YTAGYLATET 190

Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           LT    +      P V FGC  +N    +S     GI+GLG G  SL+SQ+     G+FS
Sbjct: 191 LTVGDGT-----FPKVAFGCSTENGVDNSS-----GIVGLGRGPLSLVSQLAV---GRFS 237

Query: 193 YCL----PDQGSSKINFGGIVA---GAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
           YCL     D G+S I FG +      + V STPL+         HYY++L  I+V +  L
Sbjct: 238 YCLRSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTEL 297

Query: 240 EFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGVGAE 287
               S+         G   VD+G   T L  + ++ +K    S M+N+ +  P  G    
Sbjct: 298 PVTGSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGA--- 354

Query: 288 PGFSDVLCYNISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDE------IMCSA 335
             +   LCY  S+       + P + + F  GA   +   N F  +  +      + C  
Sbjct: 355 -PYDLDLCYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLL 413

Query: 336 FRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                 ++   + G +MQ++  + YDI+  M SF P+ C
Sbjct: 414 VLPATDDLPISIIGNLMQMDMHLLYDIDGGMFSFAPADC 452


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 167/354 (47%), Gaps = 34/354 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD +W QC+PC   +C+KQ  PLFDP +S+TY+++ C +
Sbjct: 188 YIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCN--NCYKQHDPLFDPSQSTTYSAVPCGA 245

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            +C + +  CS G C Y  +YG     S + GNLA +TLT   +S    ++   +FGCG 
Sbjct: 246 QEC-LDSGTCSSGKCRYEVVYGD---MSQTDGNLARDTLTLGPSSD---QLQGFVFGCGD 298

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAGA 212
            +        +  G+ GLG    SL SQ        FSYCLP    ++  ++ G   A  
Sbjct: 299 DDTG---LFGRADGLFGLGRDRVSLASQAAARYGAGFSYCLPSSWRAEGYLSLGSAAAPP 355

Query: 213 GVVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
               T ++ R      YYL L  I V  + +     V  + G + +D+G + T LP   +
Sbjct: 356 HAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTV-IDSGTVITRLPSRAY 414

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSPS 321
           S L+S  +  ++          P  S +  CY+ + + K   P V + F  GA + L   
Sbjct: 415 SALRSSFAGFMRRYK-----RAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFG 469

Query: 322 NLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            +    +    C AF   G + ++ + G + Q  F + YD+    + F    C+
Sbjct: 470 GVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 167/370 (45%), Gaps = 50/370 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C++Q   +FDP++S +YN++ C++
Sbjct: 140 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRR--CYEQSGQVFDPRRSRSYNAVGCAA 197

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ ATETLTF   +     +  V  G
Sbjct: 198 PLCRRLDSGGCDLRRSACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARVALG 250

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--------I 203
           CGH N     + +   G+     G+ S  +Q+       FSYCL D+ SS         +
Sbjct: 251 CGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTV 307

Query: 204 NFGGIVAGAGVVS--TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGN 248
            FG    G+ V S  TP++    +   YY+ L  ISVG  R+  V         SS  G 
Sbjct: 308 TFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSGRGG 367

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KF 304
           + VD+G   T L    +S L+             G+   PG   +   CY++S +   K 
Sbjct: 368 VIVDSGTSVTRLARPAYSALRDAFRGA-----AAGLRLSPGGFSLFDTCYDLSGRKVVKV 422

Query: 305 PEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQ 361
           P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F + +D + 
Sbjct: 423 PTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 482

Query: 362 AMVSFKPSRC 371
             V+F P  C
Sbjct: 483 QRVAFTPKGC 492


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 116/380 (30%), Positives = 175/380 (46%), Gaps = 55/380 (14%)

Query: 35  YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL+HL IGTP P  +   +DTGSD  WTQC  C    CF Q  P+F    S T++ + CS
Sbjct: 94  YLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-CTV--CFDQPVPVFRASVSHTFSRVPCS 150

Query: 94  SSQCA----VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVEMPN 147
              C     +  S C+  D S  + YG   + S ++G +A +T TF +   +     +PN
Sbjct: 151 DPLCGHAVYLPLSGCAARDRSCFYAYGYMDH-SITTGKMAEDTFTFKAPDRADTAAAVPN 209

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--- 204
           + FGCG  N    T +  Q+GI G G G  SL SQ+      +FSYC      S+++   
Sbjct: 210 IRFGCGMMNYGLFTPN--QSGIAGFGTGPLSLPSQLKVR---RFSYCFTAMEESRVSPVI 264

Query: 205 FGG----IVAGA-----------GVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS---- 245
            GG    I A A           G    P+  +  Y+LSL  ++VG  RL F +S+    
Sbjct: 265 LGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALK 324

Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ- 301
              +G  F+D+G   T  P     +L+      +     KG   +P   ++LC+++ ++ 
Sbjct: 325 GDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGY-TDP--DNLLCFSVPAKK 381

Query: 302 --PKFPEVTIHFRGADVKLSPSNLFRNISDE-------IMCSAFRGGNAN-IVYGRIMQI 351
             P  P++ +H  GAD +L   N   +  D+       +       GN+N  + G   Q 
Sbjct: 382 KAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQ 441

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  I YD+E   + F P+RC
Sbjct: 442 NMHIVYDLESNKMVFAPARC 461


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 121/380 (31%), Positives = 168/380 (44%), Gaps = 61/380 (16%)

Query: 35  YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC- 92
           YL+H +IGTP P  +  ++DTGSD  WTQC PCP   CF Q  PLFDP  SST+ +++C 
Sbjct: 87  YLIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPV--CFDQPFPLFDPSVSSTFRAVACP 144

Query: 93  ----------SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL- 141
                     S S CA+ T  C    CSY          S ++G +  +T TF S +G  
Sbjct: 145 DPICRPSSGLSVSACALKTFRCFY-LCSY-------GDKSITAGYIFKDTFTFMSPNGEG 196

Query: 142 --PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
             PV +  + FGCG  N       S ++GI G G G  SL SQ+     G+FSYCL    
Sbjct: 197 APPVAVSGLAFGCGDYNTG--VFASNESGIAGFGRGPLSLPSQLRV---GRFSYCLTSHD 251

Query: 200 SSKINFGGIV------------AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVS 243
            ++ N    V            +     STP+I        YYLSLE I+VG  RL   S
Sbjct: 252 ETESNKTSAVFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDS 311

Query: 244 S-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           S        +G   +D+G   T  P      LK   +  +   P+          ++LC+
Sbjct: 312 SVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLK---NEFVAQLPLPRYDNTSEVGNLLCF 368

Query: 297 NI---SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANIVY-GRIMQI 351
                  Q   P++  H   AD+ L   N     +D  +MC    G   ++V  G   Q 
Sbjct: 369 QRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQ 428

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  I YD+E + + F  ++C
Sbjct: 429 NMHIVYDVENSKLLFASAQC 448


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 115/373 (30%), Positives = 173/373 (46%), Gaps = 48/373 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L++LSIG+PPV     VDTGS   W QC PC  ++CF+Q    FDP KS ++ ++ C  
Sbjct: 104 FLVNLSIGSPPVTQLVVVDTGSSLLWVQCLPC--INCFQQSTSWFDPLKSVSFKTLGCGF 161

Query: 95  SQCAVVTS-NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFN-------------STS 139
                +    C+      Y   Y  G     S G LA E+L F              ST 
Sbjct: 162 PGYNYINGYKCNRFNQAEYKLRYLGG---DSSQGILAKESLLFETLDEGRVFQYNAISTQ 218

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
              ++  N+ FGCGH N+ +  +D    G+ GLG         M T +  KFSYC+ D  
Sbjct: 219 ISKIKKSNITFGCGHMNIKT-NNDDAYNGVFGLGAYPH---ITMATQLGNKFSYCIGDIN 274

Query: 200 SSKINFGGIVAGAGVV----STPLIIR-DHYYLSLEAISVGNQRL-------EFVSSSTG 247
           +       +V G G      STPL I   HYY++L++ISVG++ L       +  S  +G
Sbjct: 275 NPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKIDPNAFKISSDGSG 334

Query: 248 NIFVDTGVLRTLLP---LE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
            + +D+G+  T L     E  +  +  +M  +++  P +       F  V+  ++     
Sbjct: 335 GVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEGLCFKGVVSRDLVG--- 391

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYD 358
           FP VT HF  GAD+ L   +LFR    +  C A    N+ +    V G + Q N+ +G+D
Sbjct: 392 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFD 451

Query: 359 IEQAMVSFKPSRC 371
           +EQ  V F+   C
Sbjct: 452 LEQMKVFFRRIDC 464


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  141 bits (356), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 172/374 (45%), Gaps = 49/374 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   +++G PP      +DTGSD  W QC PC    C++Q  PL+DP+ SST+  I C+S
Sbjct: 88  YFAVINVGDPPTRALVVIDTGSDLIWLQCVPCRH--CYRQVTPLYDPRSSSTHRRIPCAS 145

Query: 95  SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            +C  V      +   G C Y  +YG G   S SSG+LAT+ L F   +     + NV  
Sbjct: 146 PRCRDVLRYPGCDARTGGCVYMVVYGDG---SASSGDLATDRLVFPDDT----HVHNVTL 198

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKIN 204
           GCGH N+    S     G++G+G G  S  +Q+  +    FSYCL D+      GSS + 
Sbjct: 199 GCGHDNVGLLES---AAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLV 255

Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIFV 251
           FG          TPL         YY+ +   SVG +R+   S+++         G I V
Sbjct: 256 FGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVV 315

Query: 252 DTGVLRTLLPLEYHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNI------SSQPKF 304
           D+G   +    + ++ ++    S+   A  ++ +  +    D  CY++      ++  + 
Sbjct: 316 DSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDA-CYDLRGNGAPAAAVRV 374

Query: 305 PEVTIHFR-GADVKLSPSNLFRNIS----DEIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
           P + +HF  GAD+ L  +N    +         C   +  +  + V G + Q  F + +D
Sbjct: 375 PSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFD 434

Query: 359 IEQAMVSFKPSRCT 372
           +E+  + F P+ C+
Sbjct: 435 VERGRIGFTPNGCS 448


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 167/355 (47%), Gaps = 28/355 (7%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD +W QC+PC    C++Q  PLFDP +S+TY+++ C +
Sbjct: 138 YIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCD--GCYQQHDPLFDPSQSTTYSAVPCGA 195

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV--EMPNVIFG 151
            +C  + S +CS G C Y  +YG     S + GNLA +TLT   +S      ++   +FG
Sbjct: 196 QECRRLDSGSCSSGKCRYEVVYGD---MSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFG 252

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-GIVA 210
           CG  +        K  G+ GLG    SL SQ        FSYCLP   +++     G  A
Sbjct: 253 CGDDDTG---LFGKADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGYLSLGSAA 309

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ--RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
                 T ++ R      YYL+L  I V  +  R+      T    +D+G + T LP   
Sbjct: 310 PPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVITRLPSRA 369

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSP 320
           ++ L+S  + +++    K     P  S +  CY+ + + K   P V + F  GA + L  
Sbjct: 370 YAALRSSFAGLMRRYSYK---RAPALSILDTCYDFTGRNKVQIPSVALLFDGGATLNLGF 426

Query: 321 SNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             +    +    C AF   G + +I + G + Q  F + YD+    + F    C+
Sbjct: 427 GEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 110/363 (30%), Positives = 168/363 (46%), Gaps = 47/363 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  + +  +DTGSD  W QCEPC E  C+ Q  P+F+P  S++++++ C S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRE--CYSQADPIFNPSYSASFSTVGCDS 214

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + + +C  G C Y   YG G+Y   S+G+ ATETLTF +TS     + NV  GCG
Sbjct: 215 AVCSQLDAYDCHSGGCLYEASYGDGSY---STGSFATETLTFGTTS-----VANVAIGCG 266

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           HKN+      +   G+        S  +Q+GT     FSYCL D+    S  + FG    
Sbjct: 267 HKNVGLFIGAAGLLGLGAG---ALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSV 323

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLR 257
             G + TPL    H    YYLS+ AISVG   L+ +         +S  G   +D+G + 
Sbjct: 324 PVGSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVV 383

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
           T L    +  ++           V G G  P    V     CY++S       P V  HF
Sbjct: 384 TRLVTSAYDAVRDAF--------VAGTGQLPRTDAVSIFDTCYDLSGLQFVSVPTVGFHF 435

Query: 312 RGADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
                 + P+  +    D +   C AF    +++ + G   Q +  + +D   ++V F  
Sbjct: 436 SNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAF 495

Query: 369 SRC 371
            +C
Sbjct: 496 DQC 498


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 185/389 (47%), Gaps = 38/389 (9%)

Query: 6   KLPFYNDNETPKSPISIIYQAEIISV---DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQ 62
           K+ + +   TP S +  ++    ++       +L ++SIG PPV     +DTGSD TW  
Sbjct: 46  KIGYLHSKSTPASRLDNLWTVSHVTPIPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIH 105

Query: 63  CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV--VTSNCSEGDCSYSFLYGRGAY 120
           C PC    C+ Q  P F P +SSTY + SC S+  A+  +  +   G+C Y   Y     
Sbjct: 106 CLPC---KCYPQTIPFFHPSRSSTYRNASCVSAPHAMPQIFRDEKTGNCQYHLRYRD--- 159

Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
            S + G LA E LTF ++    +   N++FGCG  N    +  +K +G++GLGPG  S++
Sbjct: 160 FSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDN----SGFTKYSGVLGLGPGTFSIV 215

Query: 181 SQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----STPL-IIRDHYYLSLEAISVG 235
           ++   +   KFSYC     +       ++ G G       TPL I +D YYL L+AIS G
Sbjct: 216 TR---NFGSKFSYCFGSLTNPTYPHNILILGNGAKIEGDPTPLQIFQDRYYLDLQAISFG 272

Query: 236 NQRLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
            + L+          S G   +DTG   T+L  E +  L   + + +  + ++ V     
Sbjct: 273 EKLLDIEPGTFQRYRSQGGTVIDTGCSPTILAREAYETLSEEI-DFLLGEVLRRVKDWDQ 331

Query: 290 FSDVLCYNISSQPK---FPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRGGNAN-- 342
           ++   CY  + +     FP VT HF  GA++ L   +LF  + S +  C A      +  
Sbjct: 332 YT-TPCYEGNLKLDLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDM 390

Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V G + Q N+ +GY++    V F+ + C
Sbjct: 391 SVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  141 bits (355), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 106/356 (29%), Positives = 158/356 (44%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP   +  +D+GSD  W QC+PC +  C+ Q  P+FDP  S+++  +SCSS
Sbjct: 140 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQ--CYHQSDPVFDPADSASFTGVSCSS 197

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  + C  G C Y   YG G+Y   + G LA ETLTF  T      + +V  GCG
Sbjct: 198 SVCDRLENAGCHAGRCRYEVSYGDGSY---TKGTLALETLTFGRTM-----VRSVAIGCG 249

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
           H+N       +   G+ G    + S + Q+G    G FSYCL  +G   S  + FG    
Sbjct: 250 HRNRGMFVGAAGLLGLGGG---SMSFVGQLGGQTGGAFSYCLVSRGTDSSGSLVFGREAL 306

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
            AG    PL+        YY+ L  + VG  R+              G + +DTG   T 
Sbjct: 307 PAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 366

Query: 260 LP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKL 318
           LP L Y +   + ++         GV       D+L +      + P V+ +F G  +  
Sbjct: 367 LPTLAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGF---VSVRVPTVSFYFSGGPILT 423

Query: 319 SPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            P+  F    D+    C AF    + + + G I Q    I +D     V F P+ C
Sbjct: 424 LPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 479


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 113/373 (30%), Positives = 175/373 (46%), Gaps = 54/373 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           ++ ++ S+G PPV  F  +DTGS   W QC PC          P+F+P  SST+   SC 
Sbjct: 67  LFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCD 126

Query: 94  SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
              C      +CS   C Y  +Y  G   + S G LA E LTF + +G  V    + FGC
Sbjct: 127 DRFCRYAPNGHCSSNKCVYEQVYISG---TGSKGVLAKERLTFTTPNGNTVVTQPIAFGC 183

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG- 211
           GH+N      +S+ TGI+GLG   +SL  Q+G+    KFSYC+ D  +    +  +V G 
Sbjct: 184 GHEN--GEQLESEFTGILGLGAKPTSLAVQLGS----KFSYCIGDLANKNYGYNQLVLGE 237

Query: 212 -AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEF-------VSSSTGNIFVDTGVLRT 258
            A ++  P  I        YY++LE ISVG+++L           S TG + +DTG L T
Sbjct: 238 DADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTG-VILDTGTLYT 296

Query: 259 LLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHF 311
            L      E ++ +KS++   ++            F D LCY+     +   FP VT HF
Sbjct: 297 WLADIAYRELYNEIKSILDPKLE---------RFWFRDFLCYHGRVNEELIGFPVVTFHF 347

Query: 312 R-GADVKLSPSNLFRNISD-----EIMCSAFR-----GGNAN--IVYGRIMQINFLIGYD 358
             GA++ +  +++F  +++      + C + R     GG        G + Q  + I YD
Sbjct: 348 AGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYD 407

Query: 359 IEQAMVSFKPSRC 371
           +++  +  +   C
Sbjct: 408 LKERNIYLQRIDC 420


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 166/360 (46%), Gaps = 41/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTPP  ++  +DTGSD  W QC PC   +C+ Q  P+F+P KS ++  + C +
Sbjct: 129 YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCK--NCYSQTDPVFNPVKSGSFAKVLCRT 186

Query: 95  SQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  + S  C++   C Y   YG G+Y   ++G   TETLTF  T     ++  V  GC
Sbjct: 187 PLCRRLESPGCNQRQTCLYQVSYGDGSY---TTGEFVTETLTFRRT-----KVEQVALGC 238

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
           GH N       +   G+        S  SQ G +   KFSYCL D+ +    S + FG  
Sbjct: 239 GHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 295

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTGVL 256
                   TPL+    +   YY+ L  ISVG   +  +++S      TGN  + +D G  
Sbjct: 296 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTS 355

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRG 313
            T L    +  L+        A  +K   + P FS    CY++S +   K P V +HFRG
Sbjct: 356 VTRLNKPAYIALRDAF--RAGASSLK---SAPEFSLFDTCYDLSGKTTVKVPTVVLHFRG 410

Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           ADV L  SN    +      C AF G  + + + G I Q  F + YD+  + V F P  C
Sbjct: 411 ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 171/375 (45%), Gaps = 51/375 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +GTPP      +DTGSD  W QC PC  LDCF Q  P+FDP  S++Y +++C  
Sbjct: 150 YLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMASTSYRNVTCGD 207

Query: 95  SQCAVV-------TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           ++C +V       T   S  D C Y + YG     S ++G+LA E  T N T+     + 
Sbjct: 208 TRCGLVSPPAAPRTCRSSRSDPCPYYYWYGD---QSNTTGDLALEAFTVNLTASSSRRVD 264

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKI 203
            V+ GCGH+N       +   G+        S  SQ+       FSYCL D GS   SKI
Sbjct: 265 GVVLGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHAFSYCLVDHGSAVGSKI 321

Query: 204 NFGGIVAGAGVVSTPLI----------IRDHYYLSLEAISVGNQRLEFVSSS-------- 245
            FG       ++S P +              YY+ L+ I VG + L+  S++        
Sbjct: 322 VFGDDNV---LLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDG 378

Query: 246 TGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQP 302
           +G   +D+G   +  P   Y +  ++ +  M KA P+  +   P  S   CYN+S   + 
Sbjct: 379 SGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPL--IADFPVLSP--CYNVSGVERV 434

Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRG--GNANIVYGRIMQINFLIGYD 358
           + PE ++ F  GA       N F  +  E IMC A  G   +A  + G   Q NF + YD
Sbjct: 435 EVPEFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYD 494

Query: 359 IEQAMVSFKPSRCTN 373
           +    + F P RC  
Sbjct: 495 LHHNRLGFAPRRCAE 509


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 116/360 (32%), Positives = 167/360 (46%), Gaps = 41/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTP  + +  +DTGSD  W QCEPC E  C+ Q  P+F+P  S +++++ C S
Sbjct: 154 YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE--CYSQADPIFNPSSSVSFSTVGCDS 211

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + +N C  G C Y   YG G+Y   + G+ ATETLTF +TS     + NV  GCG
Sbjct: 212 AVCSQLDANDCHGGGCLYEVSYGDGSY---TVGSYATETLTFGTTS-----IQNVAIGCG 263

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
           H N+      +   G+      + S  +Q+GT     FSYCL D   + S  + FG    
Sbjct: 264 HDNVGLFVGAAGLLGLGAG---SLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESV 320

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST---------GNIFVDTGVLR 257
             G + TPL+    +   YYLS+ AISVG   L+ V S           G I +D+G   
Sbjct: 321 PIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAV 380

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHF-RG 313
           T L    +  L+       +  P        G S    CY++S+      P V  HF  G
Sbjct: 381 TRLQTSAYDALRDAFIAGTQHLP-----RADGISIFDTCYDLSALQSVSIPAVGFHFSNG 435

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A   L   N L    S    C AF   ++N+ + G I Q    + +D   ++V F   +C
Sbjct: 436 AGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 114/347 (32%), Positives = 170/347 (48%), Gaps = 38/347 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  D+    DTGSD TWTQCEPC    C+KQ+  +FDP KS++Y++I+C+S
Sbjct: 145 YFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCAR-SCYKQQDAIFDPSKSTSYSNITCTS 203

Query: 95  SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           + C  +++        + S   C Y   YG    +SFS G  + E L+  +T      + 
Sbjct: 204 TLCTQLSTATGNEPGCSASTKACIYGIQYGD---SSFSVGYFSRERLSVTATD----IVD 256

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--KIN 204
           N +FGCG  N       +   G+IGLG    S + Q        FSYCLP   SS  +++
Sbjct: 257 NFLFGCGQNNQGLFGGSA---GLIGLGRHPISFVQQTAAVYRKIFSYCLPATSSSTGRLS 313

Query: 205 FGGIVAGAGVVSTPL--IIR--DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
           F G    + V  TP   I R    Y L +  ISVG  +L   SS  STG   +D+G + T
Sbjct: 314 F-GTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGAIIDSGTVIT 372

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHFRGA-D 315
            LP   ++ L+S     +   P  G   E    D  CY++S    F  P++   F G   
Sbjct: 373 RLPPTAYTALRSAFRQGMSKYPSAG---ELSILDT-CYDLSGYEVFSIPKIDFSFAGGVT 428

Query: 316 VKLSPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDI 359
           V+L P  +    S + +C AF   G ++++ +YG + Q    + YD+
Sbjct: 429 VQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 166/360 (46%), Gaps = 41/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTPP  ++  +DTGSD  W QC PC   +C+ Q  P+F+P KS ++  + C +
Sbjct: 42  YFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPCK--NCYSQTDPVFNPVKSGSFAKVLCRT 99

Query: 95  SQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  + S  C++   C Y   YG G+Y   ++G   TETLTF  T     ++  V  GC
Sbjct: 100 PLCRRLESPGCNQRQTCLYQVSYGDGSY---TTGEFVTETLTFRRT-----KVEQVALGC 151

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGI 208
           GH N       +   G+        S  SQ G +   KFSYCL D+ +    S + FG  
Sbjct: 152 GHDNEGLFVGAAGLLGLGRG---GLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFGNS 208

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--IFVDTGVL 256
                   TPL+    +   YY+ L  ISVG   +  +++S      TGN  + +D G  
Sbjct: 209 AVSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTS 268

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRG 313
            T L    +  L+        A  +K   + P FS    CY++S +   K P V +HFRG
Sbjct: 269 VTRLNKPAYIALRDAF--RAGASSLK---SAPEFSLFDTCYDLSGKTTVKVPTVVLHFRG 323

Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           ADV L  SN    +      C AF G  + + + G I Q  F + YD+  + V F P  C
Sbjct: 324 ADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 167/360 (46%), Gaps = 41/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTP  + +  +DTGSD  W QCEPC E  C+ Q  P+F+P  S +++++ C S
Sbjct: 8   YFTRIGIGTPTREQYMVLDTGSDVVWIQCEPCRE--CYSQADPIFNPSSSVSFSTVGCDS 65

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + +N C  G C Y   YG G+Y   + G+ ATETLTF +TS     + NV  GCG
Sbjct: 66  AVCSQLDANDCHGGGCLYEVSYGDGSY---TVGSYATETLTFGTTS-----IQNVAIGCG 117

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
           H N+      +   G+      + S  +Q+GT     FSYCL D   + S  + FG    
Sbjct: 118 HDNVGLFVGAAGLLGLGAG---SLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESV 174

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSST---------GNIFVDTGVLR 257
             G + TPL+    +   YYLS+ AISVG   L+ V S           G I +D+G   
Sbjct: 175 PIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTAV 234

Query: 258 TLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RG 313
           T L    +  L+       +  P   G+          CY++S+      P V  HF  G
Sbjct: 235 TRLQTSAYDALRDAFIAGTQHLPRADGISIFD-----TCYDLSALQSVSIPAVGFHFSNG 289

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A   L   N L    S    C AF   ++N+ + G I Q    + +D   ++V F   +C
Sbjct: 290 AGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 112/359 (31%), Positives = 168/359 (46%), Gaps = 43/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+FDP  SSTY  ++C S
Sbjct: 161 YFTRVGVGNPARQFYMVLDTGSDINWLQCQPCT--DCYQQTDPIFDPTASSTYAPVTCQS 218

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC+ +  S+C  G C Y   YG G+Y   + G+ ATE+++F ++      + NV  GCG
Sbjct: 219 QQCSSLEMSSCRSGQCLYQVNYGDGSY---TFGDFATESVSFGNSG----SVKNVALGCG 271

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
           H N  L    +     G   L     SL +Q+    A  FSYCL ++   GSS ++F   
Sbjct: 272 HDNEGLFVGAAGLLGLGGGPL-----SLTNQLK---ATSFSYCLVNRDSAGSSTLDFNSA 323

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
             G   V+ PL+    I   YY+ L  +SVG Q +           S  G I VD G   
Sbjct: 324 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 383

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GA 314
           T L  + ++ L+     M   Q +K   A   F    CY++S Q   + P V+ HF  G 
Sbjct: 384 TRLQTQAYNPLRDAFVRM--TQNLKLTSAVALFD--TCYDLSGQASVRVPTVSFHFADGK 439

Query: 315 DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              L  +N    + S    C AF    +++ + G + Q    + +D+    + F P++C
Sbjct: 440 SWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 116/375 (30%), Positives = 171/375 (45%), Gaps = 50/375 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +GTPP      +DTGSD  W QC PC  LDCF+Q  P+FDP  SS+Y +++C  
Sbjct: 149 YLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNVTCGD 206

Query: 95  SQCAVVT--------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEM 145
            +C +V            +E  C Y + YG     S ++G+LA E+ T N T+ G    +
Sbjct: 207 QRCGLVAPPEAPRACRRPAEDSCPYYYWYGD---QSNTTGDLALESFTVNLTAPGASRRV 263

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SK 202
             V+FGCGH+N       +   G+        S  SQ+       FSYCL + GS   SK
Sbjct: 264 DGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVEHGSDAGSK 320

Query: 203 INFGG---IVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSS-------TG 247
           + FG    ++A   +  T            YY+ L+ + VG   L   S +       +G
Sbjct: 321 VVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSG 380

Query: 248 NIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS--SQP 302
              +D+G  L   +   Y    ++ +  M +  P+      P F  VL  CYN+S   +P
Sbjct: 381 GTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLI-----PDFP-VLNPCYNVSGVERP 434

Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
           + PE+++ F  GA       N F  +  D IMC A RG       + G   Q NF + YD
Sbjct: 435 EVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNFHVVYD 494

Query: 359 IEQAMVSFKPSRCTN 373
           ++   + F P RC  
Sbjct: 495 LQNNRLGFAPRRCAE 509


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  140 bits (354), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 174/365 (47%), Gaps = 49/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++    GTP  +    +DTGSD TW QC+PC   DC+ Q   +F+PK+SS+Y ++ C S
Sbjct: 137 YIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCA--DCYSQVDAIFEPKQSSSYKTLPCLS 194

Query: 95  SQCA-VVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  ++TS      C  G C Y   YG G   S S G+ + ETLT  S S       N 
Sbjct: 195 ATCTELITSESNPTPCLLGGCVYEINYGDG---SSSQGDFSQETLTLGSDS-----FQNF 246

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGSSK 202
            FGCGH N       S   G++GLG  + S  SQ  +   G+F+YCLPD       GS  
Sbjct: 247 AFGCGHTNTGLFKGSS---GLLGLGQNSLSFPSQSKSKYGGQFAYCLPDFGSSTSTGSFS 303

Query: 203 INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG-V 255
           +  G I A A  V TPL+        Y++ L  ISVG  RL    +    G+  VD+G V
Sbjct: 304 VGKGSIPASA--VFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGSTIVDSGTV 361

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR- 312
           +  LLP  Y++ LK+   +  +  P     A+P      CY++S  SQ + P +T HF+ 
Sbjct: 362 ITRLLPQAYNA-LKTSFRSKTRDLP----SAKPFSILDTCYDLSRHSQVRIPTITFHFQN 416

Query: 313 GADVKLSPSNLFRNISD--EIMCSAFRGGNA----NIVYGRIMQINFLIGYDIEQAMVSF 366
            ADV +S   +   + +    +C AF   +     NI+ G   Q    + +D     + F
Sbjct: 417 NADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNII-GNFQQQRMRVAFDTGAGRIGF 475

Query: 367 KPSRC 371
               C
Sbjct: 476 ASGSC 480


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 122/360 (33%), Positives = 181/360 (50%), Gaps = 38/360 (10%)

Query: 30  SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           S+D + YL+ + +G+P       +DTGSD +W QC+PC +  C  Q  PLFDP  SSTY+
Sbjct: 127 SLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYS 184

Query: 89  SISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
             SCSS+ CA +    + CS   C Y+  YG G   S ++G  +++TL   S +     +
Sbjct: 185 PFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDG---SSTTGTYSSDTLALGSNA-----V 236

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF 205
               FGC   N+ S  +D +  G++GLG G  SL+SQ   +    FSYCLP   SS   F
Sbjct: 237 RKFQFGC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTFGAAFSYCLPATSSSS-GF 292

Query: 206 GGIVAG-AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
             + AG +G V TP++    +   Y + ++AI VG ++L   +S  S G I +D+G + T
Sbjct: 293 LTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAGTI-MDSGTVLT 351

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GA 314
            LP   +S L S     +K  P     A P G  D  C++ S Q     P V + F  GA
Sbjct: 352 RLPPTAYSALSSAFKAGMKQYP----SAPPSGILDT-CFDFSGQSSVSIPTVALVFSGGA 406

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V ++   +    S+ I+C AF   + +    + G + Q  F + YD+    V FK   C
Sbjct: 407 VVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 168/374 (44%), Gaps = 48/374 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++SSTY  + CSS
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            QC  +      +   + G C Y   YG G   S S+G+LAT+ L F + +     + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGDLATDKLAFANDT----YVNNV 196

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKI 203
             GCG  N     S     G++G+G G  S+ +Q+  +    F YCL D+      SS +
Sbjct: 197 TLGCGRDNEGLFDS---AAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253

Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIF 250
            FG          T L+        YY+ +   SVG +R+   S+++         G + 
Sbjct: 254 VFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVV 313

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           VD+G   +    + ++ L+       +A  ++ +  E    D  CY++  +P    P + 
Sbjct: 314 VDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDA-CYDLRGRPAASAPLIV 372

Query: 309 IHFR-GADVKLSPSNLF-------RNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
           +HF  GAD+ L P N F       R  +    C  F   +  + V G + Q  F + +D+
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 432

Query: 360 EQAMVSFKPSRCTN 373
           E+  + F P  CT+
Sbjct: 433 EKERIGFAPKGCTS 446


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  140 bits (354), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 164/366 (44%), Gaps = 46/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M L +GTP  +++  +DTGSD  W QC PC    C+ Q   +FDPKKS T+ ++ C S
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQTDAIFDPKKSKTFATVPCGS 192

Query: 95  SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +  +S C       C Y   YG G   SF+ G+ +TETLTF+        + +V 
Sbjct: 193 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 244

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
            GCGH N       +   G+        S  SQ      GKFSYCL D+          S
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301

Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
            I FG        V TPL+    +   YYL L  ISVG  R+  VS S      TGN  +
Sbjct: 302 TIVFGNAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 361

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
            +D+G   T L    +  L+      + A  +K   +   F    C+++S  +  K P V
Sbjct: 362 IIDSGTSVTRLTQPAYVALRDAF--RLGATKLKRAPSYSLFDT--CFDLSGMTTVKVPTV 417

Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
             HF G +V L  SN    ++ E   C AF G   ++ + G I Q  F + YD+  + V 
Sbjct: 418 VFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 477

Query: 366 FKPSRC 371
           F    C
Sbjct: 478 FLSRAC 483


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 112/359 (31%), Positives = 168/359 (46%), Gaps = 43/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+FDP  SSTY  ++C S
Sbjct: 20  YFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTCQS 77

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC+ +  S+C  G C Y   YG G+Y   + G+ ATE+++F ++      + NV  GCG
Sbjct: 78  QQCSSLEMSSCRSGQCLYQVNYGDGSY---TFGDFATESVSFGNSG----SVKNVALGCG 130

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
           H N  L    +     G   L     SL +Q+    A  FSYCL ++   GSS ++F   
Sbjct: 131 HDNEGLFVGAAGLLGLGGGPL-----SLTNQLK---ATSFSYCLVNRDSAGSSTLDFNSA 182

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
             G   V+ PL+    I   YY+ L  +SVG Q +           S  G I VD G   
Sbjct: 183 QLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAI 242

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GA 314
           T L  + ++ L+     M   Q +K   A   F    CY++S Q   + P V+ HF  G 
Sbjct: 243 TRLQTQAYNPLRDAFVRM--TQNLKLTSAVALFD--TCYDLSGQASVRVPTVSFHFADGK 298

Query: 315 DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              L  +N    + S    C AF    +++ + G + Q    + +D+    + F P++C
Sbjct: 299 SWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/366 (32%), Positives = 165/366 (45%), Gaps = 46/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M L +GTP  +++  +DTGSD  W QC PC    C+ Q   +FDPKKS T+ ++ C S
Sbjct: 138 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQSDVIFDPKKSKTFATVPCGS 195

Query: 95  SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +  +S C       C Y   YG G   SF+ G+ +TETLTF+        + +V 
Sbjct: 196 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 247

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
            GCGH N       +   G+        S  SQ  +   GKFSYCL D+          S
Sbjct: 248 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPS 304

Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
            I FG        V TPL+    +   YYL L  ISVG  R+  VS S      TGN  +
Sbjct: 305 TIVFGNDAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 364

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
            +D+G   T L    +  L+      + A  +K   +   F    C+++S  +  K P V
Sbjct: 365 IIDSGTSVTRLTQSAYVALRDAF--RLGATKLKRAPSYSLFDT--CFDLSGMTTVKVPTV 420

Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
             HF G +V L  SN    ++ E   C AF G   ++ + G I Q  F + YD+  + V 
Sbjct: 421 VFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 480

Query: 366 FKPSRC 371
           F    C
Sbjct: 481 FLSRAC 486


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 118/369 (31%), Positives = 179/369 (48%), Gaps = 51/369 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M LSIGTPP  I   +DTGSD  W +C+ C   D       +F    SS+Y  + C+S
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 95  SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE---MP 146
           + C+ ++S      C E  C Y + YG G   S +SG++ ++ ++F S            
Sbjct: 65  THCSGMSSAGIGPRCEE-TCKYKYEYGDG---SRTSGDVGSDRISFRSHGAGEDHRSFFD 120

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSS 201
             +FGCG K L    + ++  G+IGLG  + SLI Q+G  +  KFSYCL     P    S
Sbjct: 121 GFLFGCGRK-LKGDWNFTQ--GLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177

Query: 202 KINFGGIVA--GAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSSTGN------ 248
            +  G   A  G  VVSTP++  DH     YY+ L++I+VG   +      +G+      
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237

Query: 249 -----IFVDTGVLRTLL-PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
                  +D+G   TLL P  Y +  KS+   +I    +  +G   G    LC+N S   
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI----LPTLGNSAGLD--LCFNSSGDT 291

Query: 303 K--FPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGY 357
              FP VT +F     + L   N+F+  S +++C +    GG+ +I+ G + Q NF I Y
Sbjct: 292 SYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSII-GNMQQQNFHILY 350

Query: 358 DIEQAMVSF 366
           D+  + +SF
Sbjct: 351 DLVASQISF 359


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  +     DTGSD TWTQCEPC +  C+KQ+ P  +P  S++Y +ISCSS
Sbjct: 119 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 177

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C +V S      +CS   C Y   YG G+Y   S G  ATETLT +S++       N 
Sbjct: 178 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 230

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
           +FGCG +N       +   G+        +L SQ   +    FSYCLP   SSK  ++ G
Sbjct: 231 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 287

Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
           G V+ + V  TPL         Y L +  +SVG ++L    S+ +    +D+G + T L 
Sbjct: 288 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 346

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
              +S L S   N++   P     +  G+S    CY+ S     + P+V + F+G    D
Sbjct: 347 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 401

Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
           + +S    P N  + +     C AF G + +    ++G + Q  + + YD  +  V F P
Sbjct: 402 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 456

Query: 369 SRCT 372
             C+
Sbjct: 457 GGCS 460


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 162/364 (44%), Gaps = 39/364 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  D++  VDTGSD TW QC PC   +C+KQ+  LF+P  SS++  + CSS
Sbjct: 16  YFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDCSS 73

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVEMPNVIFGC 152
           S C  +    C    C Y   YG G   SF+ G L T+ +  +   G   V + N+  GC
Sbjct: 74  SLCLNLDVMGCLSNKCLYQADYGDG---SFTMGELVTDNVVLDDAFGPGQVVLTNIPLGC 130

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           GH N  +  +     GI+GLG G  S  + +  S    FSYCLPD+ S   +   +V G 
Sbjct: 131 GHDNEGTFGT---AAGILGLGRGPLSFPNNLDASTRNIFSYCLPDRESDPNHKSTLVFGD 187

Query: 213 GVV------STPLI-------IRDHYYLSLEAISVGNQRL--------EFVSSSTGNIFV 251
             +      S   I       +  +YY+ +  ISVG   L        +  S   G    
Sbjct: 188 AAIPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIF 247

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEVTI 309
           D+G   T L    ++ ++    +  +A  +    A        CY+ +       P VT 
Sbjct: 248 DSGTTITRLEARAYTAVR----DAFRAATMHLTSAADFKIFDTCYDFTGMNSISVPTVTF 303

Query: 310 HFRG-ADVKLSPSNLFRNIS-DEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
           HF+G  D++L PSN    +S + I C AF       V G + Q +F + YD     +   
Sbjct: 304 HFQGDVDMRLPPSNYIVPVSNNNIFCFAFAASMGPSVIGNVQQQSFRVIYDNVHKQIGLL 363

Query: 368 PSRC 371
           P +C
Sbjct: 364 PDQC 367


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 119/407 (29%), Positives = 182/407 (44%), Gaps = 59/407 (14%)

Query: 2   QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
           ++S ++ F +D            S+ +QA + +    Y M++S+GTP +      DTGSD
Sbjct: 49  RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSD 108

Query: 58  CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFL 114
             WTQC PC +  CF+Q  P F P  SST++ + C+SS C  +      C+   C Y++ 
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166

Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
           YG G    +++G LATETL     S      P+V FGC  +N       +  +GI GLG 
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTENGVG----NSTSGIAGLGR 213

Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
           G  SLI Q+G    G+FSYCL      G+S I FG       G V     V+ P +   +
Sbjct: 214 GALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 270

Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
           YY++L  I+VG   L   +S+         G   VD+G   T L  + +  +K     + 
Sbjct: 271 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 328

Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP----KFPEVTIHFRGADVKLSPSNLFRNISDE-- 330
           +   V  V    G    LC+  +         P + + F G      P+      +D   
Sbjct: 329 QTADVTTVNGTRGLD--LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQG 386

Query: 331 ------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                 +M    +G     V G +MQ++  + YD++  + SF P+ C
Sbjct: 387 SVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 433


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  +     DTGSD TWTQCEPC +  C+KQ+ P  +P  S++Y +ISCSS
Sbjct: 131 YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 189

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C +V S      +CS   C Y   YG G+Y   S G  ATETLT +S++       N 
Sbjct: 190 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 242

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
           +FGCG +N       +   G+        +L SQ   +    FSYCLP   SSK  ++ G
Sbjct: 243 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 299

Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
           G V+ + V  TPL         Y L +  +SVG ++L    S+ +    +D+G + T L 
Sbjct: 300 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDSGTVITRLS 358

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
              +S L S   N++   P     +  G+S    CY+ S     + P+V + F+G    D
Sbjct: 359 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 413

Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
           + +S    P N  + +     C AF G + +    ++G + Q  + + YD  +  V F P
Sbjct: 414 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 468

Query: 369 SRCT 372
             C+
Sbjct: 469 GGCS 472


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 108/352 (30%), Positives = 165/352 (46%), Gaps = 31/352 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ ++IGTP V    S+DTGSD +W QC PC    C  Q+  LFDP  S+TY++ SC S
Sbjct: 129 YVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGS 188

Query: 95  SQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           +QCA +    + C +  C Y   YG G   S ++G   ++TL+  S+  +     +  FG
Sbjct: 189 AQCAQLGDEGNGCLKSQCQYIVKYGDG---SNTAGTYGSDTLSLTSSDAV----KSFQFG 241

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           C H+         +  G++GLG    SL+SQ   +    FSYCLP   SS   F  + A 
Sbjct: 242 CSHRAAGFV---GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGGFLTLGAA 298

Query: 212 AGVVS-----TPLI---IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPL 262
            G  S     TP++   +   Y + L+ I+V    L   +S  +G   VD+G + T LP 
Sbjct: 299 GGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGTVITQLPP 358

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLS 319
             +  L++     +KA P     A P  S   C++ S  +    P VT+ F RGA + L 
Sbjct: 359 TAYQALRTAFKKEMKAYP----SAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAMDLD 414

Query: 320 PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            S +          +A  G     + G + Q  F + +D+    + F+   C
Sbjct: 415 ISGILYAGCLAFTATAHDGDTG--ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 177/364 (48%), Gaps = 48/364 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  +     DTGSD TWTQCEPC +  C+KQ+ P  +P  S++Y +ISCSS
Sbjct: 71  YVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPSTSTSYKNISCSS 129

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C +V S      +CS   C Y   YG G+Y   S G  ATETLT +S++       N 
Sbjct: 130 ALCKLVASGKKFSQSCSSSTCLYQVQYGDGSY---SIGFFATETLTLSSSN----VFKNF 182

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
           +FGCG +N       +   G+        +L SQ   +    FSYCLP   SSK  ++ G
Sbjct: 183 LFGCGQQNNGLFGGAAGLLGLGRT---KLALPSQTAKTYKKLFSYCLPASSSSKGYLSLG 239

Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
           G V+ + V  TPL         Y L +  +SVG ++L    S+ +    +D+G + T L 
Sbjct: 240 GQVSKS-VKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSGTVITRLS 298

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA---D 315
              +S L S   N++   P     +  G+S    CY+ S     + P+V + F+G    D
Sbjct: 299 PTAYSELSSAFQNLMTDYP-----STSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVEMD 353

Query: 316 VKLS----PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKP 368
           + +S    P N  + +     C AF G + +    ++G + Q  + + YD  +  V F P
Sbjct: 354 IDVSGILYPVNGLKKV-----CLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAP 408

Query: 369 SRCT 372
             C+
Sbjct: 409 GGCS 412


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 98/267 (36%), Positives = 131/267 (49%), Gaps = 32/267 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL++GTPP  +  ++DTGSD  WTQC PC   DCF Q  PL DP  SSTY ++ C +
Sbjct: 86  YLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCGA 143

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF------NSTSGLPVEMPN 147
            +C A+  ++C    C Y + YG     S + G +AT+  TF      N    LP     
Sbjct: 144 PRCRALPFTSCGGRSCVYVYHYGD---KSVTVGKIATDRFTFGDNGRRNGDGSLPATR-R 199

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKIN 204
           + FGCGH N       S +TGI G G G  SL SQ+    A  FSYC     D  SS + 
Sbjct: 200 LTFGCGHFNKG--VFQSNETGIAGFGRGRWSLPSQLN---ATSFSYCFTSMFDSKSSIVT 254

Query: 205 FGGIVAG-------AGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
            GG  A          V +TPL         Y+LSL+ ISVG  RL    +   +  +D+
Sbjct: 255 LGGAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 314

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           G   T LP E +  +K+  +  +   P
Sbjct: 315 GASITTLPEEVYEAVKAEFAAQVGLPP 341


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 116/376 (30%), Positives = 178/376 (47%), Gaps = 51/376 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTPP  ++  +DTGSD +W QC+PC   DCF+Q  P ++P +SS+Y +ISC  
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE--- 144
            +C +V+S     +C   +  C Y + Y  G   S ++G+ A ET T N T     E   
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADG---SNTTGDFALETFTVNLTWPNGKEKFK 284

Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-----Q 198
            + +V+FGCGH N           G+     G  S  SQ+ +     FSYCL D      
Sbjct: 285 HVVDVMFGCGHWNKGFFHGAGGLLGLGR---GPLSFPSQLQSIYGHSFSYCLTDLFSNTS 341

Query: 199 GSSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRLE-------FV 242
            SSK+ FG    ++    +  T L+  +       YYL +++I VG + L+       + 
Sbjct: 342 VSSKLIFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWS 401

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-- 300
           S   G   +D+G   T  P   +  +K      IK Q +    A   F    CYN+S   
Sbjct: 402 SEGVGGTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQI----AADDFIMSPCYNVSGAM 457

Query: 301 QPKFPEVTIHF-RGADVKLSPSNLFRNIS-DEIMCSA-FRGGNAN--IVYGRIMQINFLI 355
           Q + P+  IHF  GA       N F     DE++C A  +  N +   + G ++Q NF I
Sbjct: 458 QVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHI 517

Query: 356 GYDIEQAMVSFKPSRC 371
            YD++++ + + P RC
Sbjct: 518 LYDVKRSRLGYSPRRC 533


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 119/406 (29%), Positives = 184/406 (45%), Gaps = 58/406 (14%)

Query: 2   QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
           ++S ++ F +D            S+ +QA + +    Y M++S+GTP +      DTGSD
Sbjct: 49  RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFPVVADTGSD 108

Query: 58  CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN---CSEGDCSYSFL 114
             WTQC PC +  CF+Q  P F P  SST++ + C+SS C  + ++   C+   C Y++ 
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166

Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
           YG G    +++G LATETL     S      P+V FGC  +N       +  +GI GLG 
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTENGVG----NSTSGIAGLGR 213

Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
           G  SLI Q+G    G+FSYCL      G+S I FG       G V     V+ P +   +
Sbjct: 214 GALSLIPQLG---VGRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 270

Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
           YY++L  I+VG   L   +S+         G   VD+G   T L  + +  +K     + 
Sbjct: 271 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 328

Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP---KFPEVTIHFRGADVKLSPSNLFRNISDE--- 330
           +   V  V    G    LC+  +        P + + F G      P+      +D    
Sbjct: 329 QTANVTTVNGTRGLD--LCFKSTGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQGS 386

Query: 331 -----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                +M    +G     V G +MQ++  + YD++  + SF P+ C
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADC 432


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 112/368 (30%), Positives = 171/368 (46%), Gaps = 48/368 (13%)

Query: 30  SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           SVD + Y++ + +GTP V     +DTGSD +W QC PC    C+ Q+ PLFDP +SSTY 
Sbjct: 114 SVDSLEYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYA 173

Query: 89  SISCSSSQCAVVT-----SNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNST 138
            I C++  C  +T     S+C+ G      C Y+  YG G   S ++G  + ETLT    
Sbjct: 174 PIPCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDG---SQTTGVYSNETLTMAPG 230

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-- 196
               V + +  FGCGH        + K  G++GLG    SL+ Q  +   G FSYCLP  
Sbjct: 231 ----VTVKDFHFGCGHDQDG---PNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAA 283

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS-TGNIFVD 252
            DQ         +   +G V TP++      Y +++  I+VG + ++   S+ +G + +D
Sbjct: 284 NDQAGFLALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIID 343

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIH 310
           +G + T L    ++ L++     + A P+      P      CYN +  S    P V + 
Sbjct: 344 SGTVVTELQHTAYAALQAAFRKAMAAYPLL-----PNGELDTCYNFTGHSNVTVPRVALT 398

Query: 311 FR-GADVKLSPSNLFRNISDEIM---CSAFRGG---NANIVYGRIMQINFLIGYDIEQAM 363
           F  GA V L       ++ D I+   C AF+     N   + G + Q    + YD+    
Sbjct: 399 FSGGATVDL-------DVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGR 451

Query: 364 VSFKPSRC 371
           V F    C
Sbjct: 452 VGFGADAC 459


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 113/347 (32%), Positives = 165/347 (47%), Gaps = 50/347 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA-VVTSNC--SEGD 108
           +DTGSD TW QC+PC   DC++Q  P+FDP  S++Y ++SC S +C  + T+ C  + G 
Sbjct: 3   LDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
           C Y   YG G+Y   + G+ ATETLT   ++  PV   NV  GCGH N       +    
Sbjct: 61  CLYEVAYGDGSY---TVGDFATETLTLGDST--PVG--NVAIGCGHDNEGLFVGAAGLLA 113

Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAGVVSTPLI----I 221
           + G      S IS      A  FSYCL D+ S   S + FG   A AG V+ PL+     
Sbjct: 114 LGGGPLSFPSQIS------ASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRT 167

Query: 222 RDHYYLSLEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
              YY++L  ISVG Q L           +S +G + VD+G   T L    ++ L+    
Sbjct: 168 STFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAF- 226

Query: 274 NMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHFRGAD-VKLSPSNLFRN 326
                  V+G  + P  S V     CY++S +   + P V++ F G   ++L   N    
Sbjct: 227 -------VQGAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIP 279

Query: 327 ISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +      C AF   NA + + G + Q    + +D  +  V F P++C
Sbjct: 280 VDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 116/386 (30%), Positives = 168/386 (43%), Gaps = 59/386 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM + +GTPP      +DTGSD  W QC PC  LDCF+Q  P+FDP  SS+Y +++C  
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNVTCGD 208

Query: 95  SQCAVV----------TSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-G 140
            +C  V             C    E  C Y + YG     S ++G+LA E+ T N T+ G
Sbjct: 209 HRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGD---QSNTTGDLALESFTVNLTAPG 265

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
               +  V+FGCGH+N       +   G+        S  SQ+       FSYCL D GS
Sbjct: 266 ASRRVDGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVDHGS 322

Query: 201 ---SKINFGGIVAGAGVVSTPLI--------------IRDHYYLSLEAISVGNQRLEFVS 243
              SK+ FG       + + P +                  YY+ L+ + VG + L   S
Sbjct: 323 DVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISS 382

Query: 244 SS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
            +       +G   +D+G  L   +   Y     + M  M ++ P+  V   P  S   C
Sbjct: 383 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPL--VPEFPVLSP--C 438

Query: 296 YNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDE---IMCSAFRG--GNANIVYGR 347
           YN+S   +P+ PE+++ F  GA       N F  +  +   IMC A  G       + G 
Sbjct: 439 YNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIGN 498

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
             Q NF + YD++   + F P RC  
Sbjct: 499 FQQQNFHVVYDLQNNRLGFAPRRCAE 524


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 117/366 (31%), Positives = 165/366 (45%), Gaps = 46/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M L +GTP  +++  +DTGSD  W QC PC    C+ Q  P+F+P KS T+ ++ C S
Sbjct: 136 YFMRLGVGTPATNMYMVLDTGSDVVWLQCSPCKV--CYNQSDPVFNPAKSKTFATVPCGS 193

Query: 95  SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +  +S C       C Y   YG G   SF+ G+ +TETLTF+        + +V 
Sbjct: 194 RLCRRLDDSSECVSRRSKACLYQVSYGDG---SFTVGDFSTETLTFHGA-----RVDHVA 245

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
            GCGH N       +   G+        S  SQ      GKFSYCL D+          S
Sbjct: 246 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 302

Query: 202 KINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS------TGN--I 249
            I FG        V TPL+    +   YYL L  ISVG  R+  VS S      TGN  +
Sbjct: 303 TIVFGNGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGV 362

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEV 307
            +D+G   T L    +  L+      + A  +K   +   F    C+++S  +  K P V
Sbjct: 363 IIDSGTSVTRLTQSAYVALRDAFR--LGATRLKRAPSYSLFDT--CFDLSGMTTVKVPTV 418

Query: 308 TIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
             HF G +V L  SN    ++++   C AF G   ++ + G I Q  F + YD+  + V 
Sbjct: 419 VFHFTGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVG 478

Query: 366 FKPSRC 371
           F    C
Sbjct: 479 FLSRAC 484


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 163/369 (44%), Gaps = 45/369 (12%)

Query: 31  VDDIYLMHLSIGTP---PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           V+  YL+HLSIG P   PV +  ++DTGSD  WTQCEPC E  CF Q  P FD   S+T 
Sbjct: 88  VNSEYLIHLSIGAPRSQPVVL--TLDTGSDVVWTQCEPCAE--CFTQPLPRFDTAASNTV 143

Query: 88  NSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS-TSGLPVEM 145
            S++CS   C A     C    C+Y   YG G   S S G+   ++ TF+    G  V +
Sbjct: 144 RSVACSDPLCNAHSEHGCFLHGCTYVSGYGDG---SLSFGHFLRDSFTFDDGKGGGKVTV 200

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSK 202
           P++ FGCG  N         +TGI G G G  SL SQ+      +FSYC     +  SS 
Sbjct: 201 PDIGFGCGMYNAGRFL--QTETGIAGFGRGPLSLPSQLKVR---QFSYCFTTRFEAKSSP 255

Query: 203 INFGG-----IVAGAGVVSTPLII-------RDHYYLSLEAISVGNQRL---EFVSSSTG 247
           +  GG       A   ++STP +          HY LS + ++VG  RL   E  +  +G
Sbjct: 256 VFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVPEIKADGSG 315

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFP 305
             F+D+G   T  P      LKS       A PV     E    D +C++   +     P
Sbjct: 316 ATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADE----DDICFSWDGKKTAAMP 370

Query: 306 EVTIHFRGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
           ++  H  GAD  L   N     R      +  +  G     + G   Q N  I YD+   
Sbjct: 371 KLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAG 430

Query: 363 MVSFKPSRC 371
            +   P++C
Sbjct: 431 KLLLVPAQC 439


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 169/356 (47%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD +W QC PC   DC++Q+ PLFDP +SSTY+++ C+S
Sbjct: 146 YVVSMGLGTPARDMTVVFDTGSDLSWVQCTPCS--DCYEQKDPLFDPARSSTYSAVPCAS 203

Query: 95  SQCAVVTSN-CS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            +C  + S  CS +  C Y  +YG     S + G LA +TLT   +  L    P  +FGC
Sbjct: 204 PECQGLDSRSCSRDKKCRYEVVYGD---QSQTDGALARDTLTLTQSDVL----PGFVFGC 256

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
           G ++        +  G++GLG    SL SQ  +     FSYCLP   S+   ++ GG  A
Sbjct: 257 GEQDTG---LFGRADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYLSLGG-PA 312

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ--RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
            A    T +  R      YY+ L  + V  +  R+  +  S     +D+G + T LP   
Sbjct: 313 PANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVITRLPPRV 372

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFR-GADVKLSP 320
           ++ L+S  +   ++    G    P  S +  CY+ +     + P V + F  GA V L  
Sbjct: 373 YAALRSAFA---RSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVGLDF 429

Query: 321 SNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           S +         C AF     G +A I+ G   Q    + YD+ +  + F  + C+
Sbjct: 430 SGVLYVAKVSQACLAFAPNGDGADAGII-GNTQQKTLAVVYDVARQKIGFGANGCS 484


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 174/356 (48%), Gaps = 34/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTPP       DTGSD TW QC PC  + C+KQ+  LFDP KSSTY ++SC+ 
Sbjct: 163 YVVPIGLGTPPSRFTVVFDTGSDTTWVQCRPC-VVSCYKQKDRLFDPAKSSTYANVSCAD 221

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             CA +  S C+ G C Y   YG G+Y   + G  A +TL     +     +    FGCG
Sbjct: 222 PACADLDASGCNAGHCLYGIQYGDGSY---TVGFFAKDTLAVAQDA-----IKGFKFGCG 273

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF---GGI 208
            KN        +  G++GLG G +S+  Q      G FSYCLP   ++   + F      
Sbjct: 274 EKNRG---LFGQTAGLLGLGRGPTSITVQAYEKYGGSFSYCLPASSAATGYLEFGPLSPS 330

Query: 209 VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL----EFVSSSTGNIFVDTGVLRTLLP 261
            +G+   +TP++       YY+ L  I VG ++L    E V S++G + VD+G + T LP
Sbjct: 331 SSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPESVFSNSGTL-VDSGTVITRLP 389

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKL 318
              ++ L S  +  + A   K   A        CY+ +  SQ   P V++ F+ GA + L
Sbjct: 390 DTAYAALSSAFAAAMAASGYKKAAAYSILD--TCYDFTGLSQVSLPTVSLVFQGGACLDL 447

Query: 319 SPSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             S +   IS   +C  F   G + ++ + G   Q  + + YD+ + +V F P  C
Sbjct: 448 DASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 164/370 (44%), Gaps = 50/370 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++S +Y ++ CS+
Sbjct: 142 YFTKIGVGTPATPALMVLDTGSDVVWLQCAPCRR--CYDQSGQVFDPRRSRSYGAVGCSA 199

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ ATETLTF   +     +  +  G
Sbjct: 200 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARIALG 252

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--------I 203
           CGH N     + +   G+     G+ S  +Q+       FSYCL D+ SS         +
Sbjct: 253 CGHDNEGLFVAAAGLLGLG---RGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTV 309

Query: 204 NFGGIVAGAGVVS--TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGN 248
            FG    G+ V +  TP++    +   YY+ L  ISVG  R+  V         SS  G 
Sbjct: 310 TFGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGG 369

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KF 304
           + VD+G   T L    +S L+             G+   PG   +   CY++S +   K 
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLSGRKVVKV 424

Query: 305 PEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQ 361
           P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F + +D + 
Sbjct: 425 PTVSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 484

Query: 362 AMVSFKPSRC 371
             V F P  C
Sbjct: 485 QRVGFVPKGC 494


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 112/368 (30%), Positives = 174/368 (47%), Gaps = 45/368 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTP   +F  VDTGSD  W QC+PC    C+KQ  P+FDP+ SS++  I C S
Sbjct: 129 YFVRLGVGTPARSLFMVVDTGSDLPWLQCQPCKS--CYKQADPIFDPRNSSSFQRIPCLS 186

Query: 95  SQC-AVVTSNCS-----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C A+   +CS        CSY   YG G   SFS G+ +++  T  + S    +  +V
Sbjct: 187 PLCKALEIHSCSGSRGATSRCSYQVAYGDG---SFSVGDFSSDLFTLGTGS----KAMSV 239

Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGS 200
            FGCG  N  L +  +     G   L   +    S   +S A  FSYCL D      + S
Sbjct: 240 AFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 299

Query: 201 SKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQR-------LEFVSSSTGNI 249
           S + FG     +    +PL+    +   YY ++  +SVG  +       L+   S +G +
Sbjct: 300 SSLIFGAAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGV 359

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPE 306
            +D+G   T  P   ++ ++    N     P     + P +S    CYN S +     P 
Sbjct: 360 IIDSGTSVTRFPTSVYATIRDAFRNATTNLP-----SAPRYSLFDTCYNFSGKASVDVPA 414

Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAM 363
           + +HF  GAD++L P+N    I+     C AF   +  + + G I Q +F IG+D++++ 
Sbjct: 415 LVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSH 474

Query: 364 VSFKPSRC 371
           ++F P +C
Sbjct: 475 LAFAPQQC 482


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 170/362 (46%), Gaps = 51/362 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG P  +++  +DTGSD  W QC PC   DC+ Q  P+F+P  SS+Y  +SC +
Sbjct: 151 YFTRVGIGNPAREVYMVLDTGSDVNWLQCTPCA--DCYHQTEPIFEPSSSSSYEPLSCDT 208

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC A+  S C    C Y   YG G+Y   + G+ ATETLT  ST      + NV  GCG
Sbjct: 209 PQCNALEVSECRNATCLYEVSYGDGSY---TVGDFATETLTIGST-----LVQNVAVGCG 260

Query: 154 HKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGG 207
           H N              G +   P      SQ+ T+    FSYCL D+ S   S + FG 
Sbjct: 261 HSNEGLFVGAAGLLGLGGGLLALP------SQLNTT---SFSYCLVDRDSDSASTVEFGT 311

Query: 208 IVAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGV 255
            +    VV+ PL +R+H     YYL L  ISVG + L+   SS       +G I +D+G 
Sbjct: 312 SLPPDAVVA-PL-LRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGT 369

Query: 256 LRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR 312
             T L    +++L+ S +      +   GV          CYN+S++   + P V  HF 
Sbjct: 370 AVTRLQTGIYNSLRDSFLKGTSDLEKAAGVAMFD-----TCYNLSAKTTIEVPTVAFHFP 424

Query: 313 GADVKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
           G  +   P+  +    D +   C AF    +++ + G + Q    + +D+  +++ F  +
Sbjct: 425 GGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSN 484

Query: 370 RC 371
           +C
Sbjct: 485 KC 486


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 116/369 (31%), Positives = 178/369 (48%), Gaps = 51/369 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M LSIGTPP  I   +DTGSD  W +C+ C   D       +F    SS+Y  + C+S
Sbjct: 5   YMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNS 64

Query: 95  SQCAVVTS-----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE---MP 146
           + C+ ++S      C E  C Y + YG G   S +SG++ ++ ++F S            
Sbjct: 65  THCSGMSSAGIGPRCEE-TCKYKYEYGDG---SRTSGDVGSDRISFRSHGAGEDHRSFFD 120

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGSS 201
             +FGC  K L    + ++  G+IGLG  + SLI Q+G  +  KFSYCL     P    S
Sbjct: 121 GFLFGCARK-LKGDWNFTQ--GLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177

Query: 202 KINFGGIVA--GAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSSTGN------ 248
            +  G   A  G  VVSTP++  DH     YY+ L++I++G   +      +G+      
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237

Query: 249 -----IFVDTGVLRTLL-PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
                  +D+G   TLL P  Y +  KS+   +I    +  +G   G    LC+N S   
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEAMRKSIEEQVI----LPTLGNSAGLD--LCFNSSGDT 291

Query: 303 K--FPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGY 357
              FP VT +F     + L   N+F+  S +++C +    GG+ +I+ G + Q NF I Y
Sbjct: 292 SYGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSII-GNMQQQNFHILY 350

Query: 358 DIEQAMVSF 366
           D+  + +SF
Sbjct: 351 DLVASQISF 359


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 169/359 (47%), Gaps = 38/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP   +    DTGSD TWTQC+PC    C+ Q+ P+F P +S+TY++ISCSS
Sbjct: 131 YIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARY-CYNQKDPVFVPSQSTTYSNISCSS 189

Query: 95  SQCAVVTS------NCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
             C+ + S       CS    C Y   YG     SFS G  A ETLT  ST      + N
Sbjct: 190 PDCSQLESGTGNQPGCSAARACIYGIQYGD---QSFSVGYFAKETLTLTSTD----VIEN 242

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF 205
            +FGCG  N     S +   G+IGLG    S++ Q        FSYCLP   SS   + F
Sbjct: 243 FLFGCGQNNRGLFGSAA---GLIGLGQDKISIVKQTAQKYGQVFSYCLPKTSSSTGYLTF 299

Query: 206 GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTL 259
           GG   G  +  TP+     + + Y + +  + VG  ++   SS  ST    +D+G + T 
Sbjct: 300 GGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAIIDSGTVITR 359

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRGA-D 315
           LP + +S LKS     +   P       P  S +  CY++S  S  + P+V   F+G  +
Sbjct: 360 LPPDAYSALKSAFEKGMAKYP-----KAPELSILDTCYDLSKYSTIQIPKVGFVFKGGEE 414

Query: 316 VKLSPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L    +    S   +C AF G    +   + G + Q    + YD+    + F  + C
Sbjct: 415 LDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYNGC 473


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 166/374 (44%), Gaps = 48/374 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++SSTY  + CSS
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            QC  +      +   + G C Y   YG G   S S+G LAT+ L F + +     + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGELATDKLAFANDT----YVNNV 196

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSSKI 203
             GCG  N     S     G++G+  G  S+ +Q+  +    F YCL D+      SS +
Sbjct: 197 TLGCGRDNEGLFDS---AAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253

Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GNIF 250
            FG          T L+        YY+ +   SVG +R+   S+++         G + 
Sbjct: 254 VFGRTPEPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVV 313

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           VD+G   +    + ++ L+       +A  ++ +  E    D  CY++  +P    P + 
Sbjct: 314 VDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDA-CYDLRGRPAASAPLIV 372

Query: 309 IHFR-GADVKLSPSNLF-------RNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
           +HF  GAD+ L P N F       R  +    C  F   +  + V G + Q  F + +D+
Sbjct: 373 LHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFDV 432

Query: 360 EQAMVSFKPSRCTN 373
           E+  + F P  CT+
Sbjct: 433 EKERIGFAPKGCTS 446


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 176/400 (44%), Gaps = 51/400 (12%)

Query: 13  NETPKSPISIIYQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
           N TP+  ++    A + S   +    YL+ L +GTPP      +DTGSD  W QC PC  
Sbjct: 126 NSTPRRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC-- 183

Query: 69  LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT-----SNCSE---GDCSYSFLYGRGAY 120
           LDCF+Q  P+FDP  S +Y +++C   +C +V        C       C Y + YG    
Sbjct: 184 LDCFEQRGPVFDPAASLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGD--- 240

Query: 121 ASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            S ++G+LA E  T N T+ G    + +V+FGCGH N       +   G+        S 
Sbjct: 241 QSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRG---ALSF 297

Query: 180 ISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG---------VVSTPLIIRDHYYL 227
            SQ+       FSYCL D GS   SKI FG   A  G           S        YY+
Sbjct: 298 ASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYV 357

Query: 228 SLEAISVGNQRLEFVSSS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQ 279
            L+ + VG ++L    S+       +G   +D+G  L       Y    ++ +  M KA 
Sbjct: 358 QLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAY 417

Query: 280 PVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSA 335
           P+  V   P  S   CYN+S   + + PE ++ F  GA       N F  +  D IMC A
Sbjct: 418 PL--VADFPVLSP--CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLA 473

Query: 336 FRG--GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             G   +A  + G   Q NF + YD++   + F P RC  
Sbjct: 474 VLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAE 513


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 110/358 (30%), Positives = 164/358 (45%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG P   ++  +DTGSD  W QC PC   DC+ Q  P+F+P  S++Y+ +SC +
Sbjct: 144 YFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPCA--DCYHQADPIFEPASSTSYSPLSCDT 201

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C    C Y   YG G+Y   + G+  TET+T  S S     + NV  GCG
Sbjct: 202 KQCQSLDVSECRNNTCLYEVSYGDGSY---TVGDFVTETITLGSAS-----VDNVAIGCG 253

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
           H N  L    +     G   L     S  SQ+    A  FSYCL D+ S   S + F   
Sbjct: 254 HNNEGLFIGAAGLLGLGGGKL-----SFPSQIN---ASSFSYCLVDRDSDSASTLEFNSA 305

Query: 209 VAGAGVVSTPLIIRD---HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRT 258
           +    + +  L  R+    YY+ +  +SVG + L       E   S  G I +D+G   T
Sbjct: 306 LLPHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVT 365

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADV 316
            L    ++ L+       K  PV    +E    D  CY++S +   + P VT H  G  V
Sbjct: 366 RLQTAAYNALRDAFVKGTKDLPVT---SEVALFDT-CYDLSRKTSVEVPTVTFHLAGGKV 421

Query: 317 KLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              P+   L    SD   C AF   ++ + + G + Q    +G+D+  ++V F+P +C
Sbjct: 422 LPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 124/400 (31%), Positives = 176/400 (44%), Gaps = 51/400 (12%)

Query: 13  NETPKSPISIIYQAEIISVDDI----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE 68
           N TP+  ++    A + S   +    YL+ L +GTPP      +DTGSD  W QC PC  
Sbjct: 126 NSTPRRALAERIVATVESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC-- 183

Query: 69  LDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT-----SNCSE---GDCSYSFLYGRGAY 120
           LDCF+Q  P+FDP  S +Y +++C   +C +V        C       C Y + YG    
Sbjct: 184 LDCFEQRGPVFDPATSLSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGD--- 240

Query: 121 ASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            S ++G+LA E  T N T+ G    + +V+FGCGH N       +   G+        S 
Sbjct: 241 QSNTTGDLALEAFTVNLTAPGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRG---ALSF 297

Query: 180 ISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG---------VVSTPLIIRDHYYL 227
            SQ+       FSYCL D GS   SKI FG   A  G           S        YY+
Sbjct: 298 ASQLRAVYGHAFSYCLVDHGSSVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYV 357

Query: 228 SLEAISVGNQRLEFVSSS-------TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQ 279
            L+ + VG ++L    S+       +G   +D+G  L       Y    ++ +  M KA 
Sbjct: 358 QLKGVLVGGEKLNISPSTWDVGKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAY 417

Query: 280 PVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSA 335
           P+  V   P  S   CYN+S   + + PE ++ F  GA       N F  +  D IMC A
Sbjct: 418 PL--VADFPVLSP--CYNVSGVERVEVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLA 473

Query: 336 FRG--GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             G   +A  + G   Q NF + YD++   + F P RC  
Sbjct: 474 VLGTPRSAMSIIGNFQQQNFHVLYDLQNNRLGFAPRRCAE 513


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 122/368 (33%), Positives = 170/368 (46%), Gaps = 40/368 (10%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
             +I   D Y++ + +GTP  D+    DTGS  TWTQCEPC    C+KQ+ P+FDP KSS
Sbjct: 132 GRLIGSADYYVV-VGLGTPKRDLSLIFDTGSYLTWTQCEPCAG-SCYKQQDPIFDPSKSS 189

Query: 86  TYNSISCSSSQCAVVTS-NCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           +Y +I C+SS C    S  CS   +  C Y   YG     S S G L+ E LT  +T   
Sbjct: 190 SYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDN---SISRGFLSQERLTITATD-- 244

Query: 142 PVEMPNVIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
              + + +FGCG  N  L   T+     G++GL     S + Q  +     FSYCLP   
Sbjct: 245 --IVHDFLFGCGQDNEGLFRGTA-----GLMGLSRHPISFVQQTSSIYNKIFSYCLPSTP 297

Query: 200 SS--KINFGGIVA-GAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSST---GNI 249
           SS   + FG   A  A +  TP          Y L +  ISVG  +L  VSSST   G  
Sbjct: 298 SSLGHLTFGASAATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGS 357

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF--PEV 307
            +D+G + T LP   ++ L+S     +   PV   G      D  CY+ S   +   P +
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPV-AYGTR--LLDT-CYDFSGYKEISVPRI 413

Query: 308 TIHFRGA-DVKLSPSNLFRNISDEIMCSAFRG-GNAN--IVYGRIMQINFLIGYDIEQAM 363
              F G   V+L    +    S + +C AF   GN N   ++G + Q    + YD+E   
Sbjct: 414 DFEFAGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGR 473

Query: 364 VSFKPSRC 371
           + F  + C
Sbjct: 474 IGFGAAGC 481


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 110/355 (30%), Positives = 169/355 (47%), Gaps = 39/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++  S+GTP +     VDTGSD +W QC+PC    C++Q+ PLFDP +SS+Y ++ C  
Sbjct: 137 YVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGR 196

Query: 95  SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           S CA   +  S CS   C Y   YG G   S ++G  +++TLT  + +     +   +FG
Sbjct: 197 SACAGLGIYASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLAANA----TVQGFLFG 249

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CGH    S    +   G++G G    SL+ Q   +  G FSYCLP + S+   +  GG  
Sbjct: 250 CGHAQ--SGGLFTGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTGYLTLGGPS 307

Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
             A   ST  ++       +Y + L  ISVG Q L   +S+      VDTG + T LP  
Sbjct: 308 GVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGTVITRLPPA 367

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN 322
            ++ L+S   + + + P     A P G  D  CY+ +        T++     +  S   
Sbjct: 368 AYAALRSAFRSGMASYP----SAPPIGILDT-CYSFAGYG-----TVNLTSVALTFSSGA 417

Query: 323 LFRNISDEIM---CSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                +D IM   C AF    ++    + G + Q +F +   I+ + V F+PS C
Sbjct: 418 TMTLGADGIMSFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 163/362 (45%), Gaps = 46/362 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L  GTP V     +DTGSD +W QC PC   +C+ Q+ PLFDP KSSTY  I+C +
Sbjct: 125 YMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGA 184

Query: 95  SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  +  +    C+ G   C Y   YG G   S + G  + ET+TF    G+ V+  + 
Sbjct: 185 DACNKLGDHYRNGCTSGGTQCGYRVEYGDG---SSTRGVYSNETITF--APGITVK--DF 237

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-G 207
            FGCGH          K  G++GLG    SL+ Q  +   G FSYCLP   S       G
Sbjct: 238 HFGCGHDQRG---PSDKFDGLLGLGGAPESLVVQTASVYGGAFSYCLPALNSEAGFLALG 294

Query: 208 IVAGAGVVSTPLIIRDHYYLSLEA---------ISVGNQRLEFVSSS-TGNIFVDTGVLR 257
           +   A   ++  +    ++L ++A         ISVG + L+   S+  G + +D+G + 
Sbjct: 295 VRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAFRGGMLIDSGTIV 354

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGA- 314
           T LP   ++ L + +     A P+    A   F    CYN +  S    P V + F G  
Sbjct: 355 TELPETAYNALNAALRKAFAAYPMV---ASEDFD--TCYNFTGYSNVTVPRVALTFSGGA 409

Query: 315 --DVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
             D+ +    L ++      C AFR    ++   + G + Q    + YD     V F+  
Sbjct: 410 TIDLDVPNGILVKD------CLAFRESGPDVGLGIIGNVNQRTLEVLYDAGHGKVGFRAG 463

Query: 370 RC 371
            C
Sbjct: 464 AC 465


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 116/365 (31%), Positives = 174/365 (47%), Gaps = 37/365 (10%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A I+     Y++ + +GTP  D   S DTGSD TWTQCEPC    CF Q  P FDP  S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLG-GCFPQNQPKFDPTTST 189

Query: 86  TYNSISCSSSQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           +Y ++SCSS  C ++        +C    C Y   YG G    ++ G LATETL   S+ 
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG----YTIGFLATETLAIASSD 245

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
                  N +FGC  +   S  + +  TG++GLG    +L SQ        FSYCLP   
Sbjct: 246 ----VFKNFLFGCSEE---SRGTFNGTTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASP 298

Query: 200 SS--KINFGGIVAGAGVVSTPL--IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
           SS   ++FG  V+ A   STP+   ++  Y L+   ISV  + L  ++ S     +D+G 
Sbjct: 299 SSTGHLSFGVEVSQAA-KSTPISPKLKQLYGLNTVGISVRGRELP-INGSISRTIIDSGT 356

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFPEVTIHF 311
             T LP   +S L S    M+    +        F    CY+ S+        P ++I F
Sbjct: 357 TFTFLPSPTYSALGSAFREMMANYTL--TNGTSSFQP--CYDFSNIGNGTLTIPGISIFF 412

Query: 312 RGA-DVKLSPSNLFRNISD-EIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
            G  +V++  S +   ++  + +C AF   G +++  ++G   Q  + + YD+ + MV F
Sbjct: 413 EGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGF 472

Query: 367 KPSRC 371
            P  C
Sbjct: 473 APKGC 477


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  137 bits (344), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 174/358 (48%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG PP  ++  +DTGSD +W QC PC E  C++Q  P+F+P  S+++ S+SC +
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAE--CYEQTDPIFEPTSSASFTSLSCET 208

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C  G C Y   YG G+Y   + G+  TET+T  STS     + N+  GCG
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSY---TVGDFVTETVTLGSTS-----LGNIAIGCG 260

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
           H N       +   G+ G    + S  SQ+    A  FSYCL D+ S   S ++F   + 
Sbjct: 261 HNNEGLFIGAAGLLGLGGG---SLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPIT 314

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-----EFVSSSTGN--IFVDTGVLRTL 259
               V+ PL     +   +YL L  +SVG   L      F  S  GN  I VD+G   T 
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 260 LPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHF-RGAD 315
           L    ++ L+ + + +    Q  +GV       D  CY++SS+ +   P V+ HF  G +
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGV----ALFDT-CYDLSSKSRVEVPTVSFHFANGNE 428

Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L   N    +  E   C AF   ++ + + G   Q    +G+D+  ++V F P++C
Sbjct: 429 LPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 115/376 (30%), Positives = 169/376 (44%), Gaps = 54/376 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  +++GTP V    ++DT SD TW QC+PC    C+ Q  P+FDP+ S++Y  ++  +
Sbjct: 134 YMAKIAVGTPAVQALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYGEMNYDA 191

Query: 95  SQCAVV----TSNCSEGDCSYSFLYGRG-AYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +      +   G C Y+  YG G    S S G+L  ETLTF       V    + 
Sbjct: 192 PDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG----VRQAYLS 247

Query: 150 FGCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMG-TSIAGKFSYCL------PDQG 199
            GCGH N     +P +     GI+GLG G  S+  Q+        FSYCL      P   
Sbjct: 248 IGCGHDNKGLFGAPAA-----GILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSP 302

Query: 200 SSKINFGGIVAGAGVVS-----TPLIIRDH----YYLSLEAISVGNQRLEFVS------- 243
           SS + FG   AGA   S     TP ++  +    YY+ L  +SVG  R+  V+       
Sbjct: 303 SSTLTFG---AGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDLQLD 359

Query: 244 --SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
             +  G + +D+G   T L    +   +        +      G   G  D  CY +  +
Sbjct: 360 PYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDT-CYTVGGR 418

Query: 302 P--KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGRIMQINFLI 355
              K P V++HF G  +V L P N    + S   +C AF G G+ ++ V G I+Q  F +
Sbjct: 419 AGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRV 478

Query: 356 GYDIEQAMVSFKPSRC 371
            YD+    V F P+ C
Sbjct: 479 VYDLAGQRVGFAPNNC 494


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 178/375 (47%), Gaps = 68/375 (18%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           ++ ++L  Y      K+P++   +         Y+M  SIG PP+ I+  VDTGSD  W 
Sbjct: 60  RSRRRLSVYTSGTGTKAPVTKSQKG------GKYIMQFSIGEPPLLIWAEVDTGSDLMWV 113

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA------VVTSNCSEGD--CSYSF 113
           +C PC    C     PL+DP +S +   + CSS  C       +++  CS+    C Y +
Sbjct: 114 KCSPCN--GCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCGYHY 171

Query: 114 LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ----TGI 169
            YG     S + G L TET TF    G      NV FG       S T D  Q     G+
Sbjct: 172 AYGHSGDHS-TQGVLGTETFTF----GDGYVANNVSFG------RSDTIDGSQFGGTAGL 220

Query: 170 IGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SKINFGGIVA----GAGVVSTPLII-- 221
           +GLG G+ SL+SQ+G   AG+F+YCL    +  S I FG + A       V STPL+   
Sbjct: 221 VGLGRGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVTNP 277

Query: 222 ---RD-HYYLSLEAISVGNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKS 270
              RD HYY++L+ ISVG  RL          S  +G +F D+G + T L    +  ++ 
Sbjct: 278 KPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQ 337

Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFPEVTIHF-RGADVKLSPSNLFRN 326
            +++ I+      +G + G  D  C+  ++Q    + P + +HF  GAD+ L+  N  + 
Sbjct: 338 AITSEIQR-----LGYDAG--DDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKT 390

Query: 327 I----SDEIMCSAFR 337
                S+ ++C A +
Sbjct: 391 STKGPSEVLVCMAIK 405


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 160/361 (44%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  +D+GSD  W QC+PC    C+KQ  P+FDP KS +Y  +SC S
Sbjct: 131 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKL--CYKQSDPVFDPAKSGSYTGVSCGS 188

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  S C  G C Y  +YG G+Y   + G LA ETLTF  T      + NV  GCG
Sbjct: 189 SVCDRIENSGCHSGGCRYEVMYGDGSY---TKGTLALETLTFAKTV-----VRNVAMGCG 240

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVA 210
           H+N       +   GI G    + S + Q+     G F YCL  +G+     + FG    
Sbjct: 241 HRNRGMFIGAAGLLGIGGG---SMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREAL 297

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PL+        YY+ L+ + VG  R+       +   +  G + +DTG   T 
Sbjct: 298 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 357

Query: 260 LP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-R 312
           LP    + +    KS  +N+ +A  V             CY++S     + P V+ +F  
Sbjct: 358 LPTAAYVAFRDGFKSQTANLPRASGVSIFDT--------CYDLSGFVSVRVPTVSFYFTE 409

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           G  + L   N    + D    C AF      + + G I Q    + +D     V F P+ 
Sbjct: 410 GPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNV 469

Query: 371 C 371
           C
Sbjct: 470 C 470


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 121/381 (31%), Positives = 169/381 (44%), Gaps = 57/381 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM + +GTPP      +DTGSD  W QC PC  LDCF+Q  P+FDP  SS+Y +++C  
Sbjct: 146 YLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAASSSYRNLTCGD 203

Query: 95  SQCAVVTSNCS----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPV 143
            +C  V    +          E  C Y + YG     S S+G+LA E+ T N T+ G   
Sbjct: 204 PRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGD---QSNSTGDLALESFTVNLTAPGASS 260

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS-- 200
            +  V+FGCGH+N       +   G+        S  SQ+     G  FSYCL D GS  
Sbjct: 261 RVDGVVFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGGHTFSYCLVDHGSDV 317

Query: 201 -SKINFGGIVAGAGVVSTPLI-----------IRDHYYLSLEAISVGNQRLEFVSSS--- 245
            SK+ FG   A A + + P +               YY+ L  + VG + L   S +   
Sbjct: 318 ASKVVFGEDDALA-LAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDA 376

Query: 246 ----TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNI 298
               +G   +D+G  L   +   Y    ++ +  M  + P       P F  VL  CYN+
Sbjct: 377 SEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYP-----PVPDFP-VLSPCYNV 430

Query: 299 S--SQPKFPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQIN 352
           S   +P+ PE+++ F  GA       N F R   D IMC A  G       + G   Q N
Sbjct: 431 SGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQN 490

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
           F + YD+    + F P RC  
Sbjct: 491 FHVAYDLHNNRLGFAPRRCAE 511


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 115/369 (31%), Positives = 171/369 (46%), Gaps = 48/369 (13%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
           ++   D  +L+ ++ GTPP      +DTGS  TWTQC+PC  + C K     FDP  S T
Sbjct: 154 KLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFDPSASLT 211

Query: 87  YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           Y+  SC  S              +Y+  YG     S S GN   +T+T   +       P
Sbjct: 212 YSLGSCIPSTVGN----------TYNMTYGD---KSTSVGNYGCDTMTLEHSD----VFP 254

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------- 199
              FGCG  N     S +   G++GLG G  S +SQ  +     FSYCLP++        
Sbjct: 255 KFQFGCGRNNEGDFGSGAD--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLF 312

Query: 200 -------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNI 249
                  SS + F  +V G G  ++ L    +Y++ L  ISVGN+RL   SS   S G I
Sbjct: 313 GEKATSQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTI 370

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FP 305
            +D+G + T LP   +S LK+     +   P+     + G  D+L  CYN+S +     P
Sbjct: 371 -IDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLP 427

Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           E+ +HF  GADV+L+   +        +C AF G +   + G   Q++  + YDI+   +
Sbjct: 428 EIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAGNSELTIIGNRQQVSLTVLYDIQGGRI 487

Query: 365 SFKPSRCTN 373
            F  + C+ 
Sbjct: 488 GFGGNGCSK 496


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 112/353 (31%), Positives = 171/353 (48%), Gaps = 33/353 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IG+P V    S+DTGSD +W QC+PC +  C  +   LFDP  SSTY+  SCSS
Sbjct: 131 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQ--CHSEVDSLFDPSASSTYSPFSCSS 188

Query: 95  SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C  ++ +     CS   C Y   Y  G   S ++G  +++TLT  S +     +    
Sbjct: 189 AACVQLSQSQQGNGCSSSQCQYIVSYVDG---SSTTGTYSSDTLTLGSNA-----IKGFQ 240

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGI 208
           FGC      S     +  G++GLG    SL+SQ   +    FSYCLP   GSS     G 
Sbjct: 241 FGCSQSE--SGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTPGSSGFLTLGA 298

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
            + +G V TP++    I  +Y + LEAI VG Q+L   +S  S G++ +D+G + T LP 
Sbjct: 299 ASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSV-MDSGTVITRLPP 357

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
             +S L S     +K  P     A+P      C++ S Q     P V + F  GA V L 
Sbjct: 358 TAYSALSSAFKAGMKKYPP----AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVNLD 413

Query: 320 PSNLFRNISDEIMCSAFRGGNANIVY-GRIMQINFLIGYDIEQAMVSFKPSRC 371
            + +   + +  +  A    ++++ + G + Q  F + YD+    V F+   C
Sbjct: 414 FNGIMLELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 110/313 (35%), Positives = 153/313 (48%), Gaps = 40/313 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+HL+IGTPP  +  ++DTGSD  WTQC+PCP   CF Q  P FDP  SST +  SC S
Sbjct: 82  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPA--CFDQALPYFDPSTSSTLSLTSCDS 139

Query: 95  SQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + C  +  ++C          C Y++ YG     S ++G L  +  TF    G    +P 
Sbjct: 140 TLCQGLPVASCGSPKFWPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPG 193

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-------PDQGS 200
           V FGCG  N  +    S +TGI G G G  SL SQ+     G FS+C        P    
Sbjct: 194 VAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVL 248

Query: 201 SKINFGGIVAGAGVV-STPLIIR----DHYYLSLEAISVGNQRL-----EF-VSSSTGNI 249
             +      +G G V STPLI        YYLSL+ I+VG+ RL     EF + + TG  
Sbjct: 249 LDLPADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGT 308

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            +D+G   T LP   +  ++   +  +K   V G   +P F   L   + ++P  P++ +
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYF--CLSAPLRAKPYVPKLVL 366

Query: 310 HFRGADVKLSPSN 322
           HF GA + L   N
Sbjct: 367 HFEGATMDLPREN 379


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 167/355 (47%), Gaps = 33/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +G+PP      +DTGS  +W QC+PC  + C  Q  PLF+P  S+TY  + CSS
Sbjct: 120 YYLKLGLGSPPKYYTMILDTGSSLSWLQCKPC-VVYCHSQVDPLFEPSASNTYRPLYCSS 178

Query: 95  SQCAVVTSNC-------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S+C+++ +         + G C Y+  YG    AS+S G L+ + LT   +  LP    +
Sbjct: 179 SECSLLKAATLNDPLCTASGVCVYTASYGD---ASYSMGYLSRDLLTLTPSQTLP----S 231

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        K  GI+GL     S+++Q+       FSYCLP   SS   F  
Sbjct: 232 FTYGCGQDN---EGLFGKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLS 288

Query: 208 I--VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTLL 260
           I  ++ +    TP+I        Y+L L AI+V  + +   ++       +D+G + T L
Sbjct: 289 IGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRL 348

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFRG-ADV 316
           P+  ++ L+     ++  +  +     P +S +  C+  ++ S    PE+ + F+G AD+
Sbjct: 349 PISIYAALREAFVKIMSRRYEQA----PAYSILDTCFKGSLKSMSGAPEIRMIFQGGADL 404

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N+       I C AF   N   + G   Q  + I YD+  + + F P  C
Sbjct: 405 SLRAPNILIEADKGIACLAFASSNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 173/358 (48%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG PP  ++  +DTGSD +W QC PC E  C++Q  P F+P  S+++ S+SC +
Sbjct: 151 YFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPCAE--CYEQTDPXFEPTSSASFTSLSCET 208

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C  G C Y   YG G+Y   + G+  TET+T  STS     + N+  GCG
Sbjct: 209 EQCKSLDVSECRNGTCLYEVSYGDGSY---TVGDFVTETVTLGSTS-----LGNIAIGCG 260

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
           H N       +   G+ G    + S  SQ+    A  FSYCL D+ S   S ++F   + 
Sbjct: 261 HNNEGLFIGAAGLLGLGGG---SLSFPSQLN---ASSFSYCLVDRDSDSTSTLDFNSPIT 314

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-----EFVSSSTGN--IFVDTGVLRTL 259
               V+ PL     +   +YL L  +SVG   L      F  S  GN  I VD+G   T 
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 260 LPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHF-RGAD 315
           L    ++ L+ + + +    Q  +GV       D  CY++SS+ +   P V+ HF  G +
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGV----ALFDT-CYDLSSKSRVEVPTVSFHFANGNE 428

Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L   N    +  E   C AF   ++ + + G   Q    +G+D+  ++V F P++C
Sbjct: 429 LPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 118/369 (31%), Positives = 170/369 (46%), Gaps = 42/369 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +GTPP      +DTGSD  W QC PC  LDCF+Q  P+FDP  S +Y +++C  
Sbjct: 149 YLVDVYLGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGD 206

Query: 95  SQCAVVT-------SNCSE---GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
            +C +V+         C       C Y + YG     S ++G+LA E  T N T      
Sbjct: 207 DRCRLVSPPAESAPRECRRPRSDPCPYYYWYGD---QSNTTGDLALEAFTVNLTQSGTRR 263

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS--- 200
           +  V FGCGH+N       +   G+        S  SQ+     G  FSYCL + GS   
Sbjct: 264 VDGVAFGCGHRNRGLFHGAAGLLGLGRG---PLSFASQLRGVYGGHAFSYCLVEHGSAAG 320

Query: 201 SKINFG---GIVAGAGVVST---PLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFV 251
           SKI FG    ++A   +  T   P    D  YYL L++I VG + +   S   S G   +
Sbjct: 321 SKIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTII 380

Query: 252 DTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           D+G   +  P   Y +  ++ +  M  + P+  +   P  S   CYN+S   K   PE++
Sbjct: 381 DSGTTLSYFPEPAYQAIRQAFIDRMSPSYPL--ILGFPVLSP--CYNVSGAEKVEVPELS 436

Query: 309 IHFR-GADVKLSPSNLFRNISDE-IMCSAFRG--GNANIVYGRIMQINFLIGYDIEQAMV 364
           + F  GA  +    N F  +  E IMC A  G   +   + G   Q NF + YD+E   +
Sbjct: 437 LVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRL 496

Query: 365 SFKPSRCTN 373
            F P RC +
Sbjct: 497 GFAPRRCAD 505


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 159/361 (44%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP D +  +D+GSD  W QC+PC    C+KQ  P+FDP KS +Y  +SC S
Sbjct: 132 YFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKL--CYKQSDPVFDPAKSGSYTGVSCGS 189

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  S C  G C Y  +YG G+Y   + G LA ETLTF  T      + NV  GCG
Sbjct: 190 SVCDRIENSGCHSGGCRYEVMYGDGSY---TKGTLALETLTFAKTV-----VRNVAMGCG 241

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS---KINFGGIVA 210
           H+N       +   GI G    + S + Q+     G F YCL  +G+     + FG    
Sbjct: 242 HRNRGMFIGAAGLLGIGGG---SMSFVGQLSGQTGGAFGYCLVSRGTDSTGSLVFGREAL 298

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PL+        YY+ L+ + VG  R+       +   +  G + +DTG   T 
Sbjct: 299 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 358

Query: 260 LPL----EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-R 312
           LP      +    KS  +N+ +A  V             CY++S     + P V+ +F  
Sbjct: 359 LPTGAYAAFRDGFKSQTANLPRASGVSIFDT--------CYDLSGFVSVRVPTVSFYFTE 410

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           G  + L   N    + D    C AF      + + G I Q    + +D     V F P+ 
Sbjct: 411 GPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNV 470

Query: 371 C 371
           C
Sbjct: 471 C 471


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 114/384 (29%), Positives = 169/384 (44%), Gaps = 64/384 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  +++GTP V+   ++DT SD TW QC+PC    C+ Q  P+FDP+ S++Y  ++  +
Sbjct: 141 YIAKIAVGTPAVEALLALDTASDLTWLQCQPCRR--CYPQSGPVFDPRHSTSYGEMNYDA 198

Query: 95  SQCAVV----TSNCSEGDCSYSFLYGRG---AYASFSSGNLATETLTFNSTSGLPVEMPN 147
             C  +      +   G C Y+ LYG G      S S G+L  ETLTF       V    
Sbjct: 199 PDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAGG----VRQAY 254

Query: 148 VIFGCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMG-TSIAGKFSYCL------PD 197
           +  GCGH N     +P +     GI+GL  G  S+  Q+        FSYCL      P 
Sbjct: 255 LSIGCGHDNKGLFGAPAA-----GILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPG 309

Query: 198 QGSSKINFGGIVAGAGVVS-------TPLIIRDH----YYLSLEAISVGNQRLEFVS--- 243
             SS + F     GAG V        TP ++  +    YY+ L  +SVG  R+  V+   
Sbjct: 310 SPSSTLTF-----GAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERD 364

Query: 244 ------SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
                 +  G + +D+G   T L    ++  +               G   G  D  CY 
Sbjct: 365 LQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDT-CYT 423

Query: 298 ISSQP------KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGR 347
           +  +       K P V++HF G  ++ L P N    + S   +C AF G G+ ++ V G 
Sbjct: 424 VGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGN 483

Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
           I+Q  F + YDI    V F P+ C
Sbjct: 484 ILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/368 (30%), Positives = 174/368 (47%), Gaps = 45/368 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTP   +F  VDTGSD  W QC+PC    C+KQ  P+FDP+ SS++  I C S
Sbjct: 54  YFVRLGLGTPARSLFMVVDTGSDLPWLQCQPCKS--CYKQADPIFDPRNSSSFQRIPCLS 111

Query: 95  SQC-AVVTSNCS-----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C A+   +CS        CSY   YG G   SFS G+ +++  T  + S    +  +V
Sbjct: 112 PLCKALEVHSCSGSRGATSRCSYQVAYGDG---SFSVGDFSSDLFTLGTGS----KAMSV 164

Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD------QGS 200
            FGCG  N  L +  +     G   L   +    S   +S A  FSYCL D      + S
Sbjct: 165 AFGCGFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSS 224

Query: 201 SKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQR-------LEFVSSSTGNI 249
           S + FG     +    +PL+    +   YY ++  +SVG  +       L+   S +G +
Sbjct: 225 SSLIFGVAAIPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGV 284

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPE 306
            +D+G   T  P   ++ ++    N     P     + P +S    CYN S +     P 
Sbjct: 285 IIDSGTSVTRFPTSVYATIRDAFRNATINLP-----SAPRYSLFDTCYNFSGKASVDVPA 339

Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAM 363
           + +HF  GAD++L P+N    I+     C AF   +  + + G I Q +F IG+D++++ 
Sbjct: 340 LVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSH 399

Query: 364 VSFKPSRC 371
           ++F P +C
Sbjct: 400 LAFAPQQC 407


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 161/356 (45%), Gaps = 52/356 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NCSEGD 108
           +DTGSD  W QC PC    C++Q  P+FDP++SS+Y ++ C ++ C  + S   +   G 
Sbjct: 3   LDTGSDVVWVQCAPCRR--CYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
           C Y   YG G   S ++G+  TETLTF   +     +  V  GCGH N     + +   G
Sbjct: 61  CMYQVAYGDG---SVTAGDFVTETLTFAGGA----RVARVALGCGHDNEGLFVAAAGLLG 113

Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------GSSKINFGGIVAGAGVVS 216
           +        S  +Q+       FSYCL D+             SS ++FG    GA   S
Sbjct: 114 LGRG---GLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSAS 170

Query: 217 -TPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLRTLLPL 262
            TP++    +   YY+ L  ISVG  R+  V         S+  G + VD+G   T L  
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLAR 230

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVK 317
             +S L+    +  +A    G+   PG   +   CY++  +   K P V++HF  GA+  
Sbjct: 231 ASYSALR----DAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAA 286

Query: 318 LSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L P N    + S    C AF G +  + + G I Q  F + +D +   V F P  C
Sbjct: 287 LPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 110/351 (31%), Positives = 165/351 (47%), Gaps = 31/351 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC+PC    C  Q   LFDP KSSTY+++ C +
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202

Query: 95  SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C+   +  + CS   C Y   YG G   S ++G   ++TL     +     +   +FG
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDG---SNTTGVYGSDTLALAPGN----TVGTFLFG 255

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CGH   A     +   G++ LG  + SL SQ   +  G FSYCLP + S+   +  GG  
Sbjct: 256 CGH---AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPT 312

Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
           + +G  +T L+        Y + L  ISVG Q++   +S+  G   VDTG + T LP   
Sbjct: 313 SASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTA 372

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPS 321
           ++ L+S     I         A  G  D  CY+ S       P V + F  GA + L   
Sbjct: 373 YAALRSAFRGAIAPYGYPSAPAN-GILDT-CYDFSRYGVVTLPTVALTFSGGATLALEAP 430

Query: 322 NLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +   +S   +  A  GG+ +  + G + Q +F + +D   + V F P  C
Sbjct: 431 GI---LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 119/380 (31%), Positives = 170/380 (44%), Gaps = 55/380 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM + +GTPP      +DTGSD  W QC PC  LDCF Q  P+FDP  SS+Y +++C  
Sbjct: 151 YLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSYRNVTCGD 208

Query: 95  SQCAVVT--------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEM 145
            +C +V             E  C Y + YG     S ++G+LA E+ T N T+ G    +
Sbjct: 209 QRCGLVAPPEPPRACRRPGEDSCPYYYWYGD---QSNTTGDLALESFTVNLTAPGASRRV 265

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SK 202
            +V+FGCGH N       +   G+        S  SQ+       FSYCL D GS   SK
Sbjct: 266 DDVVFGCGHWNRGLFHGAAGLLGLGRG---PLSFASQLRAVYGHTFSYCLVDHGSDVASK 322

Query: 203 INFGGIVAGAGVVSTPLI-----------IRDHYYLSLEAISVGNQRLEFVSSS------ 245
           + FG   A A   + P +               YY+ L+ + VG + L   S +      
Sbjct: 323 VVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEG 382

Query: 246 ---TGNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS 299
              +G   +D+G  L   +   Y    ++ +  M ++ P+      P F  VL  CYN+S
Sbjct: 383 EGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLI-----PDFP-VLSPCYNVS 436

Query: 300 --SQPKFPEVTIHFR-GADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQINF 353
              +P+ PE+++ F  GA       N F R   D IMC A  G       + G   Q NF
Sbjct: 437 GVDRPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 496

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
            + YD++   + F P RC  
Sbjct: 497 HVVYDLKNNRLGFAPRRCAE 516


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  135 bits (340), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 115/358 (32%), Positives = 171/358 (47%), Gaps = 42/358 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP V    ++DTGSD +W QC PCP   C  Q   LFDP KSSTY ++SC++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAA 186

Query: 95  SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           ++CA +         +  +C Y   YG G   S ++G  + +TLT    SG    +    
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDG---STTNGTYSRDTLTL---SGASDAVKGFQ 240

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGI 208
           FGC H  L S  SD +  G++GLG G  SL+SQ   +    FSYCL P  GSS     G 
Sbjct: 241 FGCSH--LESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGG 297

Query: 209 VAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
             GA    T  ++R       Y   L+ I+VG ++L    S  + G++ VD+G + T LP
Sbjct: 298 GGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLP 356

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
              +S L S     +K        + P  S +  C++ + Q +   P V + F  GA + 
Sbjct: 357 PTAYSALSSAFKAGMKQYR-----SAPARSILDTCFDFAGQTQISIPTVALVFSGGAAID 411

Query: 318 LSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L P+ +         C AF      G   I+ G + Q  F + YD+  + + F+   C
Sbjct: 412 LDPNGIMYG-----NCLAFAATGDDGTTGII-GNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 169/365 (46%), Gaps = 51/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  + +  +DTGSD  W QCEPC +  C+ Q  P+F+P  S++++++ C+S
Sbjct: 197 YFTRIGVGTPMREQYMVLDTGSDVVWIQCEPCSK--CYSQVDPIFNPSLSASFSTLGCNS 254

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + + NC  G C Y   YG G+Y   + G+ ATE LTF +TS     + NV  GCG
Sbjct: 255 AVCSYLDAYNCHGGGCLYKVSYGDGSY---TIGSFATEMLTFGTTS-----VRNVAIGCG 306

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIVA 210
           H N       +   G+        S  SQ+GT     FSYCL D   + S  + FG    
Sbjct: 307 HDNAGLFVGAAGLLGLGAG---LLSFPSQLGTQTGRAFSYCLVDRFSESSGTLEFGPESV 363

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTGNIFVDTGVLR 257
             G + TPL+    +   YY+ L +ISVG   L+ V         +S  G   VD+G   
Sbjct: 364 PLGSILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSGTAV 423

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
           T         L++ + + ++   V G    P    V     CY++S  P    P V  HF
Sbjct: 424 T--------RLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGLPLVNVPTVVFHF 475

Query: 312 -RGADVKLSPSNLFRNISDEIM---CSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
             GA + L   N    I  + M   C AF    +++ + G I Q    + +D   ++V F
Sbjct: 476 SNGASLILPAKNYM--IPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGF 533

Query: 367 KPSRC 371
              +C
Sbjct: 534 ALRQC 538


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 177/370 (47%), Gaps = 47/370 (12%)

Query: 30  SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           SVD + Y++ + +GTP V     +DTGSD +W QC+PC    C+ Q+ PLFDP KSSTY 
Sbjct: 118 SVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYA 177

Query: 89  SISCSSSQCAVVTSN-----CSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            I C++  C  +T +     C+ GD    C ++  YG G   S + G  + ETL      
Sbjct: 178 PIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDG---SQTRGVYSNETLALAPG- 233

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--- 196
              V + +  FGCGH       ++ K  G++GLG    SL+ Q  +   G FSYCLP   
Sbjct: 234 ---VAVKDFRFGCGHDQDG---ANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCLPALN 287

Query: 197 ------DQGSSKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS-TG 247
                   G      GG+V  +G V TP+I  +   Y +++  I+VG + ++   S+ +G
Sbjct: 288 NQVGFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSG 347

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFP 305
            + +D+G + T L    ++ L++     + A P+       G  D  CY+ S  S    P
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPL----VRNGELDT-CYDFSGYSNVTLP 402

Query: 306 EVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQ 361
           +V + F  GA + L   N    + D+  C AF+    +    + G + Q    + YD  +
Sbjct: 403 KVALTFSGGATIDLDVPNGI--LLDD--CLAFQESGPDDQPGILGNVNQRTLEVLYDAGR 458

Query: 362 AMVSFKPSRC 371
             V F+ + C
Sbjct: 459 GRVGFRAAVC 468


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 112/368 (30%), Positives = 173/368 (47%), Gaps = 67/368 (18%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y   +++G+PP D    +DTGSD TW +C+PC   DC       FD   S+TY +++C+
Sbjct: 2   VYYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCA 56

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
                            YS+ YG G   SF+ G+L+ +TL    + S    E P  +FGC
Sbjct: 57  D---------------DYSYGYGDG---SFTQGDLSVDTLKMAGAASDELEEFPGFVFGC 98

Query: 153 GH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS------SKIN 204
           G   K L      S + GI+ L PG+ S  SQ+G     KFSYCL  Q +      S + 
Sbjct: 99  GSLLKGLI-----SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMV 153

Query: 205 FGGIVA-----GAGVVS----TPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNI 249
           FG         G+G +     TP+     YY + L+ ISVGNQRL+     F++      
Sbjct: 154 FGEAAVELKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPT 213

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI--SSQPKF 304
             D+G   T+LP     ++K  +++M+       +KG+ A        C+ +  SS    
Sbjct: 214 IFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKGLDA--------CFRVPPSSGQGL 265

Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
           P++T HF  GAD    PSN   ++   + C  F   N   ++G + Q +F + +D++   
Sbjct: 266 PDITFHFNGGADFVTRPSNYVIDLG-SLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRR 324

Query: 364 VSFKPSRC 371
           + FK + C
Sbjct: 325 IGFKETDC 332


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 176/360 (48%), Gaps = 43/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSV-DTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISC 92
           YL  + +G P V +F  V DTGSD TW QC+PC  E  C+KQ  P+FDPK SS+Y+ +SC
Sbjct: 148 YLAQIGVGQP-VKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSC 206

Query: 93  SSSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           +S QC ++  +NC+   C Y   YG G   SF++G LATETL+F +++     +PN+  G
Sbjct: 207 NSQQCKLLDKANCNSDTCIYQVHYGDG---SFTTGELATETLSFGNSN----SIPNLPIG 259

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGI 208
           CGH N       +   G+ G   G  SL SQ+    A  FSYCL +     SS + F   
Sbjct: 260 CGHDNEGLFAGGAGLIGLGG---GAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFNSN 313

Query: 209 VAGAGVVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLR 257
           +    + S PL+  D +    Y+ +  ISVG +       R E   S  G I VD+G + 
Sbjct: 314 MPSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTII 372

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHF---RG 313
           + LP + + +L+     +  +     +   PG S    CYN S Q      TI F    G
Sbjct: 373 SRLPSDVYESLREAFVKLTSS-----LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEG 427

Query: 314 ADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             ++L   N    +      C AF    +++ + G   Q    + YD+  ++V F  ++C
Sbjct: 428 TSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFSTNKC 487


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 113/358 (31%), Positives = 171/358 (47%), Gaps = 42/358 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP V    ++DTGSD +W QC PCP   C+ Q   LFDP KSSTY ++SC++
Sbjct: 127 YVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAA 186

Query: 95  SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           ++CA +         +  +C Y   YG G   S ++G  + +TLT    SG    +    
Sbjct: 187 AECAQLEQQGNGCGATNYECQYGVQYGDG---STTNGTYSRDTLTL---SGASDAVKGFQ 240

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGI 208
           FGC H  + S  SD +  G++GLG G  SL+SQ   +    FSYCL P  GSS     G 
Sbjct: 241 FGCSH--VESGFSD-QTDGLMGLGGGAQSLVSQTAAAYGNSFSYCLPPTSGSSGFLTLGG 297

Query: 209 VAGAGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
             G     T  ++R       Y   L+ I+VG ++L    S  + G++ VD+G + T LP
Sbjct: 298 GGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLSPSVFAAGSV-VDSGTIITRLP 356

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
              +S L S     +K        + P  S +  C++ + Q +   P V + F  GA + 
Sbjct: 357 PTAYSALSSAFKAGMKQYR-----SAPARSILDTCFDFAGQTQISIPTVALVFSGGAAID 411

Query: 318 LSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L P+ +         C AF      G   I+ G + Q  F + YD+  + + F+   C
Sbjct: 412 LDPNGIMYG-----NCLAFAATGDDGTTGII-GNVQQRTFEVLYDVGSSTLGFRSGAC 463


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 116/359 (32%), Positives = 173/359 (48%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
           YL  + +G P    +   DTGSD TW QC+PC  E  C+KQ  P+FDPK SS+Y+ +SC+
Sbjct: 148 YLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCN 207

Query: 94  SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S QC ++  +NC+   C Y   YG G   SF++G LATETL+F +++     +PN+  GC
Sbjct: 208 SQQCKLLDKANCNSDTCIYQVHYGDG---SFTTGELATETLSFGNSN----SIPNLPIGC 260

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
           GH N       +   G+ G   G  SL SQ+    A  FSYCL +     SS + F   +
Sbjct: 261 GHDNEGLFAGGAGLIGLGG---GAISLSSQLK---ASSFSYCLVNLDSDSSSTLEFNSYM 314

Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRT 258
               + S PL+  D +    Y+ +  ISVG +       R E   S  G I VD+G + +
Sbjct: 315 PSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDSGTIIS 373

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHF---RGA 314
            LP + + +L+     +  +     +   PG S    CYN S Q      TI F    G 
Sbjct: 374 RLPSDVYESLREAFVKLTSS-----LSPAPGISVFDTCYNFSGQSNVEVPTIAFVLSEGT 428

Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            ++L   N    +      C AF    +++ + G   Q    + YD+  ++V F  ++C
Sbjct: 429 SLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFSTNKC 487


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 165/375 (44%), Gaps = 53/375 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G PP      +DTGSD  W QC PC    C++Q  PL+DP+ S T+  I C+S
Sbjct: 92  YFAVIGVGDPPTHALVVIDTGSDLIWLQCLPCRR--CYRQVTPLYDPRNSKTHRRIPCAS 149

Query: 95  SQCAVVTS----NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            QC  V      +   G C Y  +YG G   S SSG+LAT+TL     +     + NV  
Sbjct: 150 PQCRGVLRYPGCDARTGGCVYMVVYGDG---SASSGDLATDTLVLPDDT----RVHNVTL 202

Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN--- 204
           GCGH N   LAS        G++G G G  S  +Q+  +    FSYCL D+ S   N   
Sbjct: 203 GCGHDNEGLLAS------AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSS 256

Query: 205 ---FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---------GN 248
              FG          TPL         YY+ +   SVG +R+   S+++         G 
Sbjct: 257 YLVFGRTPELPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGG 316

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-----SSQPK 303
           + VD+G   +    + ++ ++    +   A  ++ +  +    D  CY++      +  +
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDT-CYDVHGNGPGTGVR 375

Query: 304 FPEVTIHF-RGADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
            P + +HF   AD+ L  +N    +         C   +  +  + V G + Q  F + +
Sbjct: 376 VPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVF 435

Query: 358 DIEQAMVSFKPSRCT 372
           D+E+  + F P+ C+
Sbjct: 436 DVERGRIGFTPNGCS 450


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 171/357 (47%), Gaps = 40/357 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+FDP+ SS++ S+ C S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPRSSSSFASLPCES 212

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC A+ TS C    C Y   YG G   SF+ G   TETLTF + SG+   + +V  GCG
Sbjct: 213 QQCQALETSGCRASKCLYQVSYGDG---SFTVGEFVTETLTFGN-SGM---INDVAVGCG 265

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           H N       +   G+ G      SL SQM    A  FSYCL D+    SS + F    A
Sbjct: 266 HDNEGLFVGSAGLLGLGGG---PLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNS-AA 318

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
            +  V+ PL+    +   YY+ L  +SVG Q L       +   S  G I VD+G   T 
Sbjct: 319 PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITR 378

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFRGAD-V 316
           L  + ++ L+     + +   +K       F    CY++SSQ +   P V+  F G   +
Sbjct: 379 LQTQAYNTLRDAF--VSRTPYLKKTNGFALFD--TCYDLSSQSRVTIPTVSFEFAGGKSL 434

Query: 317 KLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +L P N    + S    C AF    +++ + G + Q    + YD+  ++V F P +C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 167/356 (46%), Gaps = 41/356 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC+PC    C  Q   LFDP KSSTY+++ C +
Sbjct: 143 YVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGA 202

Query: 95  SQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C+   +  + CS   C Y   YG G   S ++G   ++TL     +     +   +FG
Sbjct: 203 DACSELRIYEAGCSGSQCGYVVSYGDG---SNTTGVYGSDTLALAPGN----TVGTFLFG 255

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CGH   A     +   G++ LG  + SL SQ   +  G FSYCLP + S+   +  GG  
Sbjct: 256 CGH---AQAGMFAGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPS 312

Query: 210 AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
           + +G  +T L+        Y + L  ISVG Q++   +S+  G   VDTG + T LP   
Sbjct: 313 SASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTA 372

Query: 265 HSNLKSVMSNMIK-----AQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
           ++ L+S     I      + P  G+       D  CY+ S       P V + F  GA +
Sbjct: 373 YAALRSAFRGAIAPCGYPSAPANGI------LDT-CYDFSRYGVVTLPTVALTFSGGATL 425

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L    +   +S   +  A  GG+ +  + G + Q +F + +D   + V F P  C
Sbjct: 426 ALEAPGI---LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 171/357 (47%), Gaps = 40/357 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+FDP+ SS++ S+ C S
Sbjct: 155 YFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPRSSSSFASLPCES 212

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC A+ TS C    C Y   YG G   SF+ G    ETLTF + SG+   + NV  GCG
Sbjct: 213 QQCQALETSGCRASKCLYQVSYGDG---SFTVGEFVIETLTFGN-SGM---INNVAVGCG 265

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           H N       +   G+ G    + SL SQM    A  FSYCL D+    SS + F    A
Sbjct: 266 HDNEGLFVGSAGLLGLGGG---SLSLTSQMK---ASSFSYCLVDRDSSSSSDLEFNS-AA 318

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
            +  V+ PL+    +   YY+ L  +SVG Q L       +   S  G I VD+G   T 
Sbjct: 319 PSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGTAITR 378

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFRGAD-V 316
           L  + ++ L+     + +   +K       F    CY++SSQ +   P V+  F G   +
Sbjct: 379 LQTQAYNTLRDAF--VSRTPYLKKTNGFALFD--TCYDLSSQSRVTIPTVSFEFAGGKSL 434

Query: 317 KLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +L P N    + S    C AF    +++ + G + Q    + YD+  ++V F P +C
Sbjct: 435 QLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSPHKC 491


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 162/359 (45%), Gaps = 40/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P  D    +DTGSD TW QCEPC   DC++Q  P+++P  SS+Y  + C +
Sbjct: 145 YFSRIGVGAPRRDQLMVLDTGSDVTWIQCEPCS--DCYQQSDPIYNPALSSSYKLVGCQA 202

Query: 95  SQCAVV-TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C  +  S CS  G C Y   YG G+Y   + GN ATETLT     G P++  NV  GC
Sbjct: 203 NLCQQLDVSGCSRNGSCLYQVSYGDGSY---TQGNFATETLTLG---GAPLQ--NVAIGC 254

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
           GH N       +   G+ G    + S  SQ+       FSYCL D   + SS + FG   
Sbjct: 255 GHDNEGLFVGAAGLLGLGGG---SLSFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAA 311

Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV-------SSSTGNIFVDTGVLRT 258
              G V  P++    +   YY+SL  ISVG + L          +S  G + VD+G   T
Sbjct: 312 VPNGAVLAPMLKNSRLDTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVT 371

Query: 259 LLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGAD 315
            L    + +L+       K  P   GV          CY++SS+     P V  HF G  
Sbjct: 372 RLQTAAYDSLRDAFRAGTKNLPSTDGVSLFD-----TCYDLSSKESVDVPTVVFHFSGGG 426

Query: 316 VKLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               P+  +    D +   C AF   ++++ + G I Q    + +D     V F  ++C
Sbjct: 427 SMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 117/385 (30%), Positives = 178/385 (46%), Gaps = 43/385 (11%)

Query: 12  DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
           DN+   SP  +    E       YLM  +IG P   + G +DT +   W QC  C    C
Sbjct: 59  DNDVSLSPTLVNEGGE-------YLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNS-QC 110

Query: 72  FKQEPPL---FDPKKSSTYNSISCSSSQCAVVTS----NCSEGDCSYSFLYGRGAYASFS 124
             ++  L   F   KS TY    C S+ C  +T     N S+  C Y  +YG       +
Sbjct: 111 EPEKRGLTTKFLSSKSFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKA---T 167

Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
           SG L++++  F+++ G+ V++  + FGC    L     +   TG +GL     SLISQ+G
Sbjct: 168 SGILSSDSFGFDTSDGMLVDVGFLNFGCSEAPLTG--DEQSYTGNVGLNQTPLSLISQLG 225

Query: 185 TSIAGKFSYCL---PDQGS-SKINFGGIVAGAGVVSTPLII--RDHYYLSLEAISVGNQR 238
                KFSYCL    + GS SK+ FG +   +G   TPL+    D YY+ +  IS+GN  
Sbjct: 226 IK---KFSYCLVPFNNLGSTSKMYFGSLPVTSG-GQTPLLYPNSDAYYVKVLGISIGNDE 281

Query: 239 LEF-----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
             F     V        +DTG+  + L  +      S+++  +  +       +P     
Sbjct: 282 PHFDGVFDVYEVRDGWIIDTGITYSSLETD---AFDSLLAKFLTLKDFPQRKDDPKERFE 338

Query: 294 LCY---NISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSA-FRGGNANIVYGRI 348
           LC+   N +    FP+VT+HF GAD+ L+  + F  I D+ I C A  R G+   + G  
Sbjct: 339 LCFELQNANDLESFPDVTVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNF 398

Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
              N+ +GYD+E  ++SF P  C +
Sbjct: 399 QLQNYHVGYDLEAQVISFAPVDCAD 423


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 167/361 (46%), Gaps = 48/361 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+PP  ++  VDTGSD  W QC PC   DC++Q  P+F+P  SS+Y  ++C +
Sbjct: 155 YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCA--DCYQQADPIFEPSFSSSYAPLTCET 212

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C    C Y   YG G+Y   + G+ ATET+T + ++ L     NV  GCG
Sbjct: 213 HQCKSLDVSECRNDSCLYEVSYGDGSY---TVGDFATETITLDGSASL----NNVAIGCG 265

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           H N       +   G+ G      S I+      A  FSYCL ++    +S + F   + 
Sbjct: 266 HDNEGLFVGAAGLLGLGGGSLSFPSQIN------ASSFSYCLVNRDTDSASTLEFNSPIP 319

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
              V + PL+  +     YYL +  I VG Q L       E   S  G I VD+G   T 
Sbjct: 320 SHSVTA-PLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVT- 377

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF-R 312
                   L+S + N ++   V+G    P  S V     CY++SS+   + P V+ HF  
Sbjct: 378 -------RLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFPD 430

Query: 313 GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           G  + L   N    + S    C AF    + + + G + Q    + YD+  ++V F P+ 
Sbjct: 431 GKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNG 490

Query: 371 C 371
           C
Sbjct: 491 C 491


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 111/377 (29%), Positives = 171/377 (45%), Gaps = 51/377 (13%)

Query: 30  SVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
           S D  YL+ L++GTPP  +   +DTGSD  WTQC PC    C  Q  P+F P  SS+Y  
Sbjct: 99  SGDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPCAS--CLPQPDPIFSPGASSSYEP 156

Query: 90  ISCSSSQC-AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTF------NSTSGL 141
           + C+   C  ++  +C   D C+Y + YG G   + + G  ATE  TF        T+ L
Sbjct: 157 MRCAGELCNDILHHSCQRPDTCTYRYSYGDG---TTTRGVYATERFTFSSSSSGGETTKL 213

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS- 200
              +    FGCG  N  S  + S   GI+G G    SL+SQ+      +FSYCL    S 
Sbjct: 214 SAPLG---FGCGTMNKGSLNNGS---GIVGFGRAPLSLVSQLAIR---RFSYCLTPYASG 264

Query: 201 --SKINFGGIVAG-----AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS--- 245
             S + FG +  G        V T  ++R       YY+    ++VG +RL    S+   
Sbjct: 265 RKSTLLFGSLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFAL 324

Query: 246 ----TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
               +G   VD+G   TL P    + +     + ++  P    G+  G  D +C+  ++ 
Sbjct: 325 RPDGSGGAIVDSGTALTLFPAPVLAEVVRAFRSQLRL-PFAANGSS-GPDDGVCFAAAAS 382

Query: 302 --PK---FPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFL 354
             P+    P +  H +GAD+ L   N +  +     +C      G++    G  +Q +  
Sbjct: 383 RVPRPAVVPRMVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMR 442

Query: 355 IGYDIEQAMVSFKPSRC 371
           + YD+E   +SF P++C
Sbjct: 443 VLYDLEADTLSFAPAQC 459


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 165/368 (44%), Gaps = 54/368 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+P   ++  +DTGSD TW QC PC   DC+ Q  PLFDP  SS+Y ++ C S
Sbjct: 196 YFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVPCDS 253

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
             C  + +     N + G+  C Y   YG G+Y   + G+ ATETLT        V   +
Sbjct: 254 PHCRALDASACHNNAANGNSSCVYEVAYGDGSY---TVGDFATETLTLGGDGSAAVH--D 308

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKIN 204
           V  GCGH N       +    + G      S IS      A +FSYCL D+ S   S + 
Sbjct: 309 VAIGCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATEFSYCLVDRDSPSASTLQ 362

Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFV--------SSSTGNIFVD 252
           FG   + +  V+ PL+        YY++L  ISVG + L  +           +G + VD
Sbjct: 363 FG--ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVD 420

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPE 306
           +G   T L    +S L+           V+G  A P  S V     CY+++  S  + P 
Sbjct: 421 SGTAVTRLQSSAYSALRDAF--------VRGTQALPRASGVSLFDTCYDLAGRSSVQVPA 472

Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRG-GNANIVYGRIMQINFLIGYDIEQAM 363
           V++ F  G ++KL   N    +      C AF   G A  + G + Q    + +D  +  
Sbjct: 473 VSLRFEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNT 532

Query: 364 VSFKPSRC 371
           V F P++C
Sbjct: 533 VGFSPNKC 540


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 110/355 (30%), Positives = 172/355 (48%), Gaps = 34/355 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+P+ SS+Y S+SCS+
Sbjct: 121 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-LVSCHRQSGPVFNPRSSSSYASVSCSA 179

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC  +T      S CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 180 PQCDALTTATLNPSTCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 231

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   SS      
Sbjct: 232 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSGYLSI 288

Query: 208 IVAGAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTLL 260
                G  S TP+    +    Y++ +  I+V  + L   +S+  ++   +D+G + T L
Sbjct: 289 GSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSASAYSSLPTIIDSGTVITRL 348

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFR-GADVK 317
           P + +S L   ++  +K  P         FS  D      +S+ + P+V++ F  GA +K
Sbjct: 349 PTDVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQASRLRVPQVSMAFAGGAALK 403

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           L  +NL  ++     C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 404 LKATNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 458


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 160/359 (44%), Gaps = 39/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +++ +  GTP        DTGSD +W QC PC    C+KQ  P+FDP KS+TY+ + C  
Sbjct: 135 FVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPC-SGHCYKQHDPIFDPTKSATYSVVPCGH 193

Query: 95  SQCAVVT-SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QCA    S CS G C Y   YG G   S S+G L+ ETL+  ST  L    P   FGCG
Sbjct: 194 PQCAAADGSKCSNGTCLYKVEYGDG---SSSAGVLSHETLSLTSTRAL----PGFAFGCG 246

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
             NL          G+IGLG G  SL SQ   S  G FSYCLP   ++   +  G     
Sbjct: 247 QTNLG---DFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPA 303

Query: 212 AG--VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
           +   V  T ++ +      Y++ L +I +G   L    +  +    F+D+G + T LP E
Sbjct: 304 SNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPE 363

Query: 264 YHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHFRGADVKLSPS 321
            ++ L+      M + +P       P +     CY+ + Q       + F+ +D  +   
Sbjct: 364 AYTALRDRFKFTMTQYKPA------PAYDPFDTCYDFTGQSAIFIPAVSFKFSDGSVFDL 417

Query: 322 NLF------RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + F       + +  I C  F    + +   + G + Q N  + YD+    + F  + C
Sbjct: 418 SFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 112/371 (30%), Positives = 163/371 (43%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           + + +SIGTPP      +DTGSD  WTQC+         +E PL+DP KSS++ +  C  
Sbjct: 89  HTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCDG 146

Query: 95  SQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C   +  T NCS   C Y++ YG    ++ + G LA+ET TF     + V +    FG
Sbjct: 147 RLCETGSFNTKNCSRNKCIYTYNYG----SATTKGELASETFTFGEHRRVSVSLD---FG 199

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG- 206
           CG     S    S   GI+G+ P   SL+SQ+      +FSYCL        +S I FG 
Sbjct: 200 CGKLTSGSLPGAS---GILGISPDRLSLVSQLQIP---RFSYCLTPFLDRNTTSHIFFGA 253

Query: 207 ----------GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS-------TGNI 249
                     G +    +V+ P     +YY+ L  ISVG +RL    SS       +G  
Sbjct: 254 MADLSKYRTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGT 313

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN--------ISSQ 301
           FVD+G    +LP      LK  M   +K   V     + G+   LC+         + + 
Sbjct: 314 FVDSGDTTGMLPSVVMEALKEAMVEAVKLPVVNAT--DHGYEYELCFQLPRNGGGAVETA 371

Query: 302 PKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
            + P +  HF  GA + L   +    +S   MC     G    + G   Q N  + +D+E
Sbjct: 372 VQVPPLVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGAIIGNYQQQNMHVLFDVE 431

Query: 361 QAMVSFKPSRC 371
               SF P++C
Sbjct: 432 NHEFSFAPTQC 442


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 116/390 (29%), Positives = 156/390 (40%), Gaps = 114/390 (29%)

Query: 8   PFYNDNETPKSPIS---------IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDC 58
           PFYN + TP   I+          + ++ +I  +  YLM L IGTPPV+     DTGSD 
Sbjct: 42  PFYNPSLTPSERITDAALSSNENKLPESILIPNNGEYLMRLYIGTPPVERLVIADTGSDF 101

Query: 59  TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRG 118
            W QC PC                                    NC    C Y  +Y   
Sbjct: 102 IWVQCSPC-----------------------------------QNC---QCVYLNIY--- 120

Query: 119 AYASFSSGNLATETLTFNSTSGL-PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNS 177
           A  SF+   + TETL+F+ST G   V  PN IFGCG  N  +  S  K TG++GL  G  
Sbjct: 121 ANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANNNLTFRSSDKATGLVGLVAGQL 180

Query: 178 SLISQMGTSIAGKFSYCLPDQGSSKINFG--GIVAGAGVVSTPLIIRDH---YYLSLEAI 232
           SL+SQ+G  I  KFSY         + FG   I+   GVVSTPLII+     Y+L+LE +
Sbjct: 181 SLVSQLGAQIGYKFSY---------LKFGSEAIITTNGVVSTPLIIKPSLPLYFLNLEVV 231

Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP---- 288
           ++G                                         K  P + +G E     
Sbjct: 232 TIGQ----------------------------------------KVVPTETLGVESVQDL 251

Query: 289 GFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNAN-----I 343
            F    C+        P +   F GA V L P NL   + D  M       +A+      
Sbjct: 252 PFPFKFCFPYRDNMTVPAIAFQFTGASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVIS 311

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           ++G I Q +F + YD++   VS  P+ CT 
Sbjct: 312 IFGIIAQFDFQVLYDLDGKKVSVAPTDCTK 341


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 113/360 (31%), Positives = 161/360 (44%), Gaps = 40/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTPP   +  +DTGSD  W QC PC +  C+ Q  PLF+P  SSTY  + C++
Sbjct: 153 YFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPCAK--CYGQTDPLFNPAASSTYRKVPCAT 210

Query: 95  SQCAVV-TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  +  S C     C Y   YG G   SF+ G+ +TETLTF         +  V  GC
Sbjct: 211 PLCKKLDISGCRNKRYCEYQVSYGDG---SFTVGDFSTETLTFRGQV-----IRRVALGC 262

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGGI 208
           GH N       +   G+     G+ S  SQ G   + +FSYCL D+     +S + FG  
Sbjct: 263 GHDNEGLFIGAAGLLGLG---RGSLSFPSQTGAQFSKRFSYCLVDRSASGTASSLIFGKA 319

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLE------FVSSSTGN--IFVDTGVL 256
                 + TPL+    +   YY+ L  ISVG +RL       F   +TGN  + +D+G  
Sbjct: 320 AIPKSAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTS 379

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-G 313
            T L    +S ++      +    +K  G    F    CY++S     K P +  HF+ G
Sbjct: 380 VTRLVDSAYSTMRDAFR--VGTGNLKSAGGFSLFDT--CYDLSGLKTVKVPTLVFHFQGG 435

Query: 314 ADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A + L  +N    + S    C AF G    + + G I Q  + + +D     V FK   C
Sbjct: 436 AHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 180/362 (49%), Gaps = 34/362 (9%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +I     +L +LSIG PP +++  +DTGSD  W QCEPC    C+KQ+ P+++  KS +Y
Sbjct: 86  LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDV--CYKQKDPIYNRTKSDSY 143

Query: 88  NSISCSSSQCAVV--TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
             + C+   C  +     CS+ G C Y   Y  GA    +SG L+ E + F S      +
Sbjct: 144 TEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGAR---TSGLLSYEKVAFTSHYSDEDK 200

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCL-----PD 197
              V FGCG +NL   TS+ +  G++GLGPG  SL+SQ+     ++  F+YC      P+
Sbjct: 201 TAQVGFGCGLQNLNFITSN-RDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNISNPN 259

Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAI--SVGNQRLEFVSSS-------TGN 248
            G   + FG      G + TP++I + YY++L  I   VG  RL+  SSS       +G 
Sbjct: 260 AGGFLV-FGDATYLNGDM-TPMVIAEFYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGG 317

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFP 305
           + +D+G   ++ P E +  +++ + + +K    KG    P  S   C+    +   P FP
Sbjct: 318 VIIDSGSTLSVFPPEVYEVVRNAVVDKLK----KGYNISPLTSSPDCFEGKIERDLPLFP 373

Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
            + ++     +     ++F    DE+ C  F  G    + G + Q ++  GY++E + +S
Sbjct: 374 TLVLYLESTGILNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLS 433

Query: 366 FK 367
            +
Sbjct: 434 IE 435


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 119/386 (30%), Positives = 180/386 (46%), Gaps = 61/386 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTP V++   +DTGSD +W QC PC   DC     P F+P+ SS++  + C+S
Sbjct: 138 YYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPCK--DCVPALRPPFNPRHSSSFFKLPCAS 195

Query: 95  SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTS---GLPVEM 145
           S C  V       CS     C +S  YG G   S SSG LA ET+  N+ +   G PV++
Sbjct: 196 STCTNVYQGVKPFCSPSGRTCLFSIQYGDG---SLSSGLLAMETIAGNTPNFGDGEPVKL 252

Query: 146 PNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
            N+  GC   +    PT  S   G++G+     S  SQ+ +  A KFS+C PD+  + +N
Sbjct: 253 SNITLGCADIDREGLPTGAS---GLLGMDRRPISFPSQLSSRYARKFSHCFPDK-IAHLN 308

Query: 205 FGGIV--AGAGVVS-----TPLIIR--------DHYYLSLEAISVGNQRLEF-------- 241
             G+V    + ++S     TPL+          D+YY+ L  ISV   RL          
Sbjct: 309 SSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDID 368

Query: 242 -VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
            V+ S G I +D+G   T L       ++     + +   +  V    GF+   CYNI+S
Sbjct: 369 KVTGSGGTI-IDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNSGFTP--CYNITS 423

Query: 301 QPK------FPEVTIHFRGA-DVKLSPSNLFRNIS----DEIMCSAFR--GGNANIVYGR 347
                     P +T+HFRG  DV L  +++   +S       +C AF+  G     + G 
Sbjct: 424 GTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGN 483

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
             Q N  + YD+E+  +   P++C  
Sbjct: 484 YQQQNLWVEYDLEKLRLGIAPAQCAT 509


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/338 (31%), Positives = 165/338 (48%), Gaps = 34/338 (10%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG---- 107
           VDT SD  W QC PCP   C  Q+ PL+DP KSST+  I C S  C  + S+   G    
Sbjct: 173 VDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPT 232

Query: 108 --DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
             +C Y   YG G     ++G   T+TLT + T    + + +  FGC H    S    ++
Sbjct: 233 TDECKYIVNYGDG---KATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGS--FSNQ 283

Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGGIVAGAGVVS-TPLIIRD 223
             GI+ LG G  SL+ Q   +    FSYC+P   S+  ++ GG V  +   S TPLI   
Sbjct: 284 NAGILALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNK 343

Query: 224 H----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK 277
           H    Y + LEAI V  ++L     + +TG + +D+G + T LP + ++ L++   + + 
Sbjct: 344 HAPTFYIVHLEAIIVAGKQLAVPPTAFATGAV-MDSGAVVTQLPPQVYAALRAAFRSAMA 402

Query: 278 AQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCS 334
           A    G  A P  +   CY+ +  P  K P+V++ F  GA + L P+++   I D  +  
Sbjct: 403 AY---GPLAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASI---ILDGCLAF 456

Query: 335 AFRGGNANIVY-GRIMQINFLIGYDIEQAMVSFKPSRC 371
           A   G  ++ + G + Q  + + YD+    V F+   C
Sbjct: 457 AATPGEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 157/358 (43%), Gaps = 39/358 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP   +  +D+GSD  W QC+PC +  C+ Q  PLFDP  S+++  +SCSS
Sbjct: 43  YFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQ--CYHQTDPLFDPADSASFMGVSCSS 100

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C  V  + C+ G C Y   YG G+Y   + G LA ETLTF  T      + NV  GCG
Sbjct: 101 AVCDRVENAGCNSGRCRYEVSYGDGSY---TKGTLALETLTFGRTV-----VRNVAIGCG 152

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N       +   G+ G    + S + Q+       FSYCL  +G++    + FG    
Sbjct: 153 HSNRGMFVGAAGLLGLGGG---SMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAM 209

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PL+        YY+ L  + VG+ R+       +     +G + +DTG   T 
Sbjct: 210 PVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTR 269

Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
            P   +   ++      +  P   GV          CYN+      + P V+ +F G  +
Sbjct: 270 FPTVAYEAFRNAFIEQTQNLPRASGVSIFD-----TCYNLFGFLSVRVPTVSFYFSGGPI 324

Query: 317 KLSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              P+N F    D+    C AF    + + + G I Q    I  D     V F P+ C
Sbjct: 325 LTIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 156/361 (43%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP   +  +D+GSD  W QC+PC E  C++Q  P+FDP  S+TY  ISC S
Sbjct: 137 YFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSE--CYQQSDPVFDPAGSATYAGISCDS 194

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  + C++G C Y   YG G+Y   + G LA ETLTF       V + N+  GCG
Sbjct: 195 SVCDRLDNAGCNDGRCRYEVSYGDGSY---TRGTLALETLTFGR-----VLIRNIAIGCG 246

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N       +   G+ G      S + Q+G    G FSYCL  +G+     + FG    
Sbjct: 247 HMNRGMFIGAAGLLGLGGG---AMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGRGAM 303

Query: 211 GAGVVSTPLIIRDH----YYLSLEA-------ISVGNQRLEFVSSSTGNIFVDTGVLRTL 259
             G    PLI        YY+ L         + +  Q  E      G + +DTG   T 
Sbjct: 304 PVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTR 363

Query: 260 LPLEYHSNLKSVM----SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRG 313
           LP   +   +       +N+ ++  V             CYN++     + P V+ +F G
Sbjct: 364 LPAPAYEAFRDTFIGQTANLPRSDRVSIFDT--------CYNLNGFVSVRVPTVSFYFSG 415

Query: 314 ADVKLSPS-NLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
             +   P+ N    +  E   C AF    + + + G I Q    I  D     V F P+ 
Sbjct: 416 GPILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTI 475

Query: 371 C 371
           C
Sbjct: 476 C 476


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 108/357 (30%), Positives = 166/357 (46%), Gaps = 36/357 (10%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + ++SIG PP+     +DTGSD  W  C PC   +C      LFDP  SST++ +    +
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPL--CKT 157

Query: 96  QCAVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            C      CS  D   ++  Y   A  S +SG    +T+ F +T      +P+V+FGCGH
Sbjct: 158 PCDF--KGCSRCDPIPFTVTY---ADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGCGH 212

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
            N+   T D    GI+GL  G  SL +++G     KFSYC+ D      N+  ++ G G 
Sbjct: 213 -NIGQDT-DPGHNGILGLNNGPDSLATKIGQ----KFSYCIGDLADPYYNYHQLILGEGA 266

Query: 215 ----VSTPLIIRD-HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
                STP  + +  YY+++E ISVG +RL       E   + TG + +DTG   T L  
Sbjct: 267 DLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVD 326

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFR-GADVKLSP 320
             H  L   + N++     +    +  +      +IS     FP VT HF  GAD+ L  
Sbjct: 327 SVHRLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGADLALDS 386

Query: 321 SNLFRNISDEIMC------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + F  ++D + C      S+    +   + G + Q ++ +GYD+    V F+   C
Sbjct: 387 GSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQRIDC 443


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 107/357 (29%), Positives = 153/357 (42%), Gaps = 92/357 (25%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AV 99
           +G P   ++G  DTGS+  W QC PC    C+ Q PP+FDP +S TY ++S  S  C AV
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQCLPCTH--CYNQTPPIFDPAESYTYETVSSDSPICNAV 120

Query: 100 VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
              +C EGD  C Y   YG G   + + G L+T+   F   +   VE+  + FGC H   
Sbjct: 121 RRISCREGDKSCCYQHTYGDG---TTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTK 177

Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGS-SKINFG--GIVAG 211
           A       Q G++GL    +SL+SQ+      KFSYC+    D GS S++ FG   ++ G
Sbjct: 178 ARL--KGHQAGVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILG 232

Query: 212 AGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
                TPL+  D  HY+++L+ IS                                    
Sbjct: 233 G---KTPLLKGDYSHYFVTLKGIS------------------------------------ 253

Query: 270 SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISD 329
                         VG E G SD L          P++T HF GAD  L+    +  +  
Sbjct: 254 --------------VGEEKGRSDEL------ASAGPDITFHFYGADFILTKXTTYVEVEK 293

Query: 330 EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVS---------FKPSRCTNY 374
            + C A    N+     + G I Q N+ +GYD+E   V+         F PS+ + Y
Sbjct: 294 GLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLEAQEVAQCFNQTPPIFDPSKSSTY 350



 Score = 61.2 bits (147), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 7/114 (6%)

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGN 127
           CF Q PP+FDP KSSTY+++   +  C        +  E DC Y   YG G+ +  + G 
Sbjct: 334 CFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTS--TEGT 391

Query: 128 LATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
           ++ +   F       V++ +++FGC   +  + T    + GI+GL   + SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGC--SDYTTGTFKGYEVGIVGLNQDSLSLVS 443


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 115/387 (29%), Positives = 175/387 (45%), Gaps = 66/387 (17%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+    IG PP      +DTGS+  WTQC  C    CF+Q  P +DP +S    ++ C+ 
Sbjct: 71  YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRC-RPTCFRQNLPYYDPSRSRAARAVGCND 129

Query: 95  SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + CA+ +      D   C+    YG G  A    G LATE LTF S      E  +++FG
Sbjct: 130 AACALGSETQCLSDNKTCAVVTGYGAGNIA----GTLATENLTFQS------ETVSLVFG 179

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           C      SP S +  +GIIGLG G  SL SQ+G +   +FSYCL       I    +V G
Sbjct: 180 CIVVTKLSPGSLNGASGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIEPSHMVVG 236

Query: 212 --AGVV-----STPL----IIRD--------HYYLSLEAISVGNQRLEFVSSS------- 245
             AG++     STP+     +R          YYL L  I+ G  +L   S++       
Sbjct: 237 ASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVA 296

Query: 246 ----TGNIFVDTGV-LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
               TG  F+D+G  L +L+ + Y + L++ ++  + A  V+ +    GF   LC  +  
Sbjct: 297 PGMWTGT-FIDSGAPLTSLVDVAYQA-LRAELARQLGAALVQPLAGTTGFD--LCVALKD 352

Query: 301 QPKF-PEVTIHF-----RGADVKLSPSNLFRNISDEIMCSAFRGG--------NANIVYG 346
             +  P + +HF      G D+ + P+N +  +     C              N   V G
Sbjct: 353 AERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIG 412

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCTN 373
             MQ N  + YD+   ++SF+P+ C++
Sbjct: 413 NYMQQNMHVLYDLAGGVLSFQPADCSS 439


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 162/361 (44%), Gaps = 39/361 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTPP      +DTGS  +W QC+PC  + C  Q  PL+DP  S TY  +SC+S
Sbjct: 125 YYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLSCAS 183

Query: 95  SQC-----AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            +C     A +     E D   C Y+  YG     SFS G L+ + LT  S+  L    P
Sbjct: 184 VECSRLKAATLNDPLCETDSNACLYTASYGD---TSFSIGYLSQDLLTLTSSQTL----P 236

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSK 202
              +GCG  N        +  GIIGL     S+++Q+ T     FSYCLP          
Sbjct: 237 QFTYGCGQDNQG---LFGRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGG 293

Query: 203 INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLR 257
               G ++      TP++        Y+L L AI+V  + L+  ++       +D+G + 
Sbjct: 294 FLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVI 353

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFR-G 313
           T LP+  ++ L+     ++  +  K     P +S +  C+  ++ S    PE+ + F+ G
Sbjct: 354 TRLPMSMYAALRQAFVKIMSTKYAKA----PAYSILDTCFKGSLKSISAVPEIKMIFQGG 409

Query: 314 ADVKLSPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           AD+ L   ++       I C AF G    N   + G   Q  + I YD+  + + F P  
Sbjct: 410 ADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGS 469

Query: 371 C 371
           C
Sbjct: 470 C 470


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 110/354 (31%), Positives = 154/354 (43%), Gaps = 40/354 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+P  D+    DTGSD TW +C               FDP KS++Y ++SCS+
Sbjct: 134 YIVSIGLGSPKKDLMLIFDTGSDLTWARCSAAET----------FDPTKSTSYANVSCST 183

Query: 95  SQCAVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C+ V       S C+   C Y   YG G+Y   S G L  E LT  ST        N 
Sbjct: 184 PLCSSVISATGNPSRCAASTCVYGIQYGDGSY---SIGFLGKERLTIGSTD----IFNNF 236

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
            FGCG           K  G++GLG    S++SQ        FSYCLP   S+     G 
Sbjct: 237 YFGCGQD---VDGLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFLSFGS 293

Query: 209 VAGAGVVSTPLII--RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEY 264
                   TPL       Y L L  I+VG Q+L    S  ST    +D+G + T LP   
Sbjct: 294 SQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRLPPAA 353

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGA-DVKLSPS 321
           +S L+S     + + P   +G      D  CY+ S     K P++ I F G  DV +  +
Sbjct: 354 YSALRSAFRKAMASYP---MGKPLSILDT-CYDFSKYKTIKVPKIVISFSGGVDVDVDQA 409

Query: 322 NLFRNISDEIMCSAFRGGNA---NIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            +F     + +C AF G        ++G   Q NF + YD+    V F P+ C+
Sbjct: 410 GIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 168/358 (46%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG PP   +  +DTGSD  W QC PC   DC++Q  P+F+P  S++++++SC++
Sbjct: 149 YFSRVGIGKPPSQAYLILDTGSDVNWVQCAPCA--DCYQQADPIFEPASSASFSTLSCNT 206

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C    C Y   YG G+Y   + G+  TET+T  S    PV+  NV  GCG
Sbjct: 207 RQCRSLDVSECRNDTCLYEVSYGDGSY---TVGDFVTETITLGSA---PVD--NVAIGCG 258

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVA 210
           H N       +   G+ G      S I+      A  FSYCL D+ S   S + F   + 
Sbjct: 259 HNNEGLFVGAAGLLGLGGGSLSFPSQIN------ATSFSYCLVDRDSESASTLEFNSTLP 312

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTL 259
               VS PL+   H    YY+ L  +SVG +         +   S  G + VD+G   T 
Sbjct: 313 -PNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITR 371

Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF-RGAD 315
           L  + +++L+       +  P   G+       D  CY++SS+   + P V+ HF  G +
Sbjct: 372 LQTDVYNSLRDAFVKRTRDLPSTNGI----ALFDT-CYDLSSKGNVEVPTVSFHFPDGKE 426

Query: 316 VKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L   N    +  E   C AF    +++ + G + Q    + YD+   +V F P++C
Sbjct: 427 LPLPAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 123/378 (32%), Positives = 179/378 (47%), Gaps = 56/378 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M L+IGTPP       DTGSD  WTQC PC E  CFKQ  PL++P  S T+  + CSS
Sbjct: 92  YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 150

Query: 95  S--QCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           +   CA             C+   C Y+  YG G    ++SG   +ET TF S+    V 
Sbjct: 151 ALNLCAAEARLAGATPPPGCA---CRYNQTYGTG----WTSGLQGSETFTFGSSPADQVR 203

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS--- 200
           +P + FGC +   AS    +   G++GLG G  SL+SQ+    AG FSYCL P Q +   
Sbjct: 204 VPGIAFGCSN---ASSDDWNGSAGLVGLGRGGLSLVSQLA---AGMFSYCLTPFQDTKSK 257

Query: 201 SKINFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EF 241
           S +  G       + G GV STP +       +  +YYL+L  ISVG   L         
Sbjct: 258 STLLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFAL 317

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--S 299
            +  TG + +D+G   T L    +  +++ + +++K     G  A  G    LC+ +  S
Sbjct: 318 RADGTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSS 374

Query: 300 SQP--KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLI 355
           S P    P +T+HF G    + P   +  +   + C A R      +   G   Q N  I
Sbjct: 375 SAPPATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHI 434

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD+++  +SF P++C+ 
Sbjct: 435 LYDVQKETLSFAPAKCST 452


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 115/391 (29%), Positives = 168/391 (42%), Gaps = 56/391 (14%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSS 85
           A +   +  Y+    IG PP      +DTGS+  WTQC  C    CF Q    +DP +S 
Sbjct: 62  APVHWAESQYIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSR 121

Query: 86  TYNSISCSSSQCAVVT-SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           T   ++C+ + CA+ + + C+  +  C+    YG G       G L TE  TF   S   
Sbjct: 122 TARPVACNDTACALGSETRCARDNKACAVLTAYGAGVIG----GVLGTEAFTFQPQS--- 174

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
            E  ++ FGC      +P S    +GIIGLG GN SL+SQ+G +   KFSYCL    S  
Sbjct: 175 -ENVSLAFGCIAATRLTPGSLDGASGIIGLGRGNLSLVSQLGDN---KFSYCLTPYFSQS 230

Query: 203 IN----FGGIVA-----GAGVVSTPLI-------IRDHYYLSLEAISVGNQRL------- 239
            N    F G  A     GA   S P +           YYL L  I+VG+ +L       
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290

Query: 240 EFVSSSTG---NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           +    +TG      +D+G   T L    +  L+  +   + A  V       G    LC 
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLD--LCA 348

Query: 297 NISS---QPKFPEVTIHF--RGADVKLSPSNLFRNISDEIMCS-AFRGG--------NAN 342
            ++        P + +HF   G DV + P N +  + D   C   F  G        N  
Sbjct: 349 AVAHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNET 408

Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            + G  MQ +  + YD+E+ M+SF+P+ C++
Sbjct: 409 TIIGNYMQQDMHLLYDLEKGMLSFQPADCSS 439


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 181/380 (47%), Gaps = 38/380 (10%)

Query: 11  NDNETPKSPISIIYQAEIISVDD---IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP 67
           +DN T + P+ +         DD    Y M  S+GTPP  +    DTGSD  W +C    
Sbjct: 73  SDNNTQRIPLRM---------DDSGGAYDMEFSMGTPPQKLTALADTGSDLIWAKCGGAC 123

Query: 68  ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYGRGAY- 120
              C  Q  P + P  SST+  + CS   C+++ S+       +  +C Y + YG G   
Sbjct: 124 TTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDD 183

Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
             ++ G LA ET T  + +     +P+V FGC     AS       +G++GLG G  SL+
Sbjct: 184 HHYTQGFLARETFTLGADA-----VPSVRFGC---TTASEGGYGSGSGLVGLGRGPLSLV 235

Query: 181 SQMGTSIAGKFSYCLPDQGS--SKINFGGI--VAGAGVVSTPLIIRDHYY-LSLEAISVG 235
           SQ+    A  F YCL    S  S + FG +  + GA V ST L+    +Y ++L +IS+G
Sbjct: 236 SQLN---ASTFMYCLTSDASKASPLLFGSLASLTGAQVQSTGLLASTTFYAVNLRSISIG 292

Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVL 294
           +     V    G +F D+G   T L    +S  K+   +      V+   G E  F    
Sbjct: 293 SATTPGVGEPEGVVF-DSGTTLTYLAEPAYSEAKAAFLSQTSLDQVEDTDGFEACFQKPA 351

Query: 295 CYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFL 354
              +S+    P + +HF GAD+ L  +N    + D ++C   +   +  + G IMQ+N+L
Sbjct: 352 NGRLSNA-AVPTMVLHFDGADMALPVANYVVEVEDGVVCWIVQRSPSLSIIGNIMQVNYL 410

Query: 355 IGYDIEQAMVSFKPSRCTNY 374
           + +D+ ++++SF+P+ C  Y
Sbjct: 411 VLHDVHRSVLSFQPANCDTY 430


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 111/372 (29%), Positives = 163/372 (43%), Gaps = 53/372 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q  P+FDP++SS+Y ++ C++
Sbjct: 140 YFTKIGVGTPSTPALMVLDTGSDVVWLQCAPCRR--CYDQSGPVFDPRRSSSYGAVDCAA 197

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ ATETLTF   +     +  V  G
Sbjct: 198 PLCRRLDSGGCDLRRRACLYQVAYGDG---SVTAGDFATETLTFAGGA----RVARVALG 250

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------------G 199
           CGH N     + +   G+     G+ S  +Q+       FSYCL D+             
Sbjct: 251 CGHDNEGLFVAAAGLLGLG---RGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSR 307

Query: 200 SSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS---------T 246
           SS + FG   A A    TP++    +   YY+ L  ISVG  R+  V+ S          
Sbjct: 308 SSTVTFGPPSASAASF-TPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGR 366

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP-- 302
           G + VD+G   T L    +S L+             G+   PG   +   CY++  +   
Sbjct: 367 GGVIVDSGTSVTRLARPSYSALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLGGRKVV 421

Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
           K P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F + +D 
Sbjct: 422 KVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDG 481

Query: 360 EQAMVSFKPSRC 371
           +   V F P  C
Sbjct: 482 DGQRVGFAPKGC 493


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 119/386 (30%), Positives = 179/386 (46%), Gaps = 61/386 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTP V++   +DTGSD +W QC PC   DC     P F+P+ SS++  + C+S
Sbjct: 139 YYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPCK--DCVPALRPPFNPRHSSSFFKLPCAS 196

Query: 95  SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTS---GLPVEM 145
           S C  V       CS     C +S  YG G   S SSG LA ET+  N+ +   G PV++
Sbjct: 197 STCTNVYQGVKPFCSPSGRTCLFSIQYGDG---SLSSGLLAMETIAGNTPNFGDGEPVKL 253

Query: 146 PNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
            N+  GC   +    PT  S   G++G+     S  SQ+ +  A KFS+C PD+  + +N
Sbjct: 254 SNITLGCADIDREGLPTGAS---GLLGMDRRPISFPSQLSSRYARKFSHCFPDK-IAHLN 309

Query: 205 FGGIV--AGAGVVS-----TPLIIR--------DHYYLSLEAISVGNQRLEF-------- 241
             G+V    + ++S     TPL+          D+YY+ L  ISV   RL          
Sbjct: 310 SSGLVFFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDID 369

Query: 242 -VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
            V+ S G I +D+G   T L       ++     + +   +  V    GF+   CYNI+S
Sbjct: 370 KVTGSGGTI-IDSGTAFTYLKKPAFQAMRREF--LARTSHLAKVDDNSGFTP--CYNITS 424

Query: 301 QPK------FPEVTIHFRGA-DVKLSPSNLFRNIS----DEIMCSAF--RGGNANIVYGR 347
                     P +T+HFRG  DV L  +++   +S       +C AF   G     + G 
Sbjct: 425 GTAALESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGN 484

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
             Q N  + YD+E+  +   P++C  
Sbjct: 485 YQQQNLWVEYDLEKLRLGIAPAQCAT 510


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 115/375 (30%), Positives = 170/375 (45%), Gaps = 50/375 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M L+IGTPP       DTGSD  WTQC PC E  CFKQ  PL++P  S T+  + CSS
Sbjct: 92  YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 150

Query: 95  ------SQCAVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
                 ++  +  +    G  C Y+  YG G    ++SG   +ET TF S+    V +P 
Sbjct: 151 ALNLCAAEARLAGATPPPGCACRYNQTYGTG----WTSGLQGSETFTFGSSPADQVRVPG 206

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS---SKI 203
           + FGC +       S     G  GL       +S +    AG FSYCL P Q +   S +
Sbjct: 207 IAFGCSNA------SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTL 260

Query: 204 NFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSS 244
             G       + G GV STP +       +  +YYL+L  ISVG   L          + 
Sbjct: 261 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 320

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQP 302
            TG + +D+G   T L    +  +++ + +++K     G  A  G    LC+ +  SS P
Sbjct: 321 GTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSSSAP 377

Query: 303 --KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLIGYD 358
               P +T+HF G    + P   +  +   + C A R      +   G   Q N  I YD
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYD 437

Query: 359 IEQAMVSFKPSRCTN 373
           +++  +SF P++C+ 
Sbjct: 438 VQKETLSFAPAKCST 452


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 115/362 (31%), Positives = 167/362 (46%), Gaps = 48/362 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L  GTP V     +DTGSD +W QC PC    C+ Q+ PLFDP KSSTY  I+C++
Sbjct: 131 YVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNT 190

Query: 95  SQCAVVTSN----CSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  +  +    C+ G   C YS  Y  G   S S G  + ETLT     G+ VE  + 
Sbjct: 191 DACRKLGDHYHNGCTSGGTQCGYSVEYADG---SHSRGVYSNETLTL--APGITVE--DF 243

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
            FGCG ++   P+   K  G++GLG    SL+ Q  +   G FSYCLP   S     G +
Sbjct: 244 HFGCG-RDQRGPS--DKYDGLLGLGGAPVSLVVQTSSVYGGAFSYCLPALNSEA---GFL 297

Query: 209 VAG-------AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVL 256
           V G       +  V TP+         Y +++  ISVG + L    S+  G + +D+G +
Sbjct: 298 VLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAFRGGMIIDSGTV 357

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-G 313
            T LP   ++ L++ +   +KA P+      P      CYN +  S    P V   F  G
Sbjct: 358 DTELPETAYNALEAALRKALKAYPLV-----PSDDFDTCYNFTGYSNITVPRVAFTFSGG 412

Query: 314 ADVKLS-PSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPS 369
           A + L  P+ +  N      C AF+    +    + G + Q    + YD  +  V F+  
Sbjct: 413 ATIDLDVPNGILVND-----CLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRGNVGFRAG 467

Query: 370 RC 371
            C
Sbjct: 468 AC 469


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 115/375 (30%), Positives = 170/375 (45%), Gaps = 50/375 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+M L+IGTPP       DTGSD  WTQC PC E  CFKQ  PL++P  S T+  + CSS
Sbjct: 97  YIMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGE-RCFKQPSPLYNPSSSPTFRVLPCSS 155

Query: 95  ------SQCAVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
                 ++  +  +    G  C Y+  YG G    ++SG   +ET TF S+    V +P 
Sbjct: 156 ALNLCAAEARLAGATPPPGCACRYNQTYGTG----WTSGLQGSETFTFGSSPADQVRVPG 211

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGS---SKI 203
           + FGC +       S     G  GL       +S +    AG FSYCL P Q +   S +
Sbjct: 212 IAFGCSNA------SSDDWNGSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTL 265

Query: 204 NFG-----GIVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRL-------EFVSS 244
             G       + G GV STP +       +  +YYL+L  ISVG   L          + 
Sbjct: 266 LLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRAD 325

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQP 302
            TG + +D+G   T L    +  +++ + +++K     G  A  G    LC+ +  SS P
Sbjct: 326 GTGGLIIDSGTTITSLVDAAYKRVRAAVRSLVKLPVTDGSNAT-GLD--LCFALPSSSAP 382

Query: 303 --KFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIV--YGRIMQINFLIGYD 358
               P +T+HF G    + P   +  +   + C A R      +   G   Q N  I YD
Sbjct: 383 PATLPSMTLHFGGGADMVLPVENYMILDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYD 442

Query: 359 IEQAMVSFKPSRCTN 373
           +++  +SF P++C+ 
Sbjct: 443 VQKETLSFAPAKCST 457


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 111/374 (29%), Positives = 174/374 (46%), Gaps = 52/374 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           ++L++ S+G PPV     +DTGS   W QC+PC          P+F+P  SST+   SC 
Sbjct: 95  LFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCD 154

Query: 94  SSQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
              C      +C S   C Y  +Y  G   + S G LA E LTF + +G  V    + FG
Sbjct: 155 DRFCRYAPNGHCGSSNKCVYEQVYISG---TGSKGVLAKERLTFTTPNGNTVVTQPIAFG 211

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           CG++N      +S  TGI+GLG   +SL  Q+G+    KFSYC+ D  +    +  +V G
Sbjct: 212 CGYEN--GEQLESHFTGILGLGAKPTSLAVQLGS----KFSYCIGDLANKNYGYNQLVLG 265

Query: 212 --AGVVSTPLIIRDH-----YYLSLEAISVGNQRLEF---VSSSTG---NIFVDTGVLRT 258
             A ++  P  I        YY++LE ISVG+ +L     V    G    + +D+G L T
Sbjct: 266 EDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVFKRRGPRTGVILDSGTLYT 325

Query: 259 LLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---FPEVTIHF 311
            L      E ++ +KS++   ++            F D LCY+     +   FP VT HF
Sbjct: 326 WLADIAYRELYNEIKSILDPKLE---------RFWFRDFLCYHGRVSEELIGFPVVTFHF 376

Query: 312 R-GADVKLSPSNLFRNISD----EIMCSAFR-----GGNAN--IVYGRIMQINFLIGYDI 359
             GA++ +  +++F  +S+     + C + +     GG        G + Q  + IGYD+
Sbjct: 377 AGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLMAQQYYNIGYDL 436

Query: 360 EQAMVSFKPSRCTN 373
           ++  +  +   C  
Sbjct: 437 KEKNIYLQRIDCVQ 450


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 168/359 (46%), Gaps = 47/359 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ-EPPLFDPKKSSTYNSISC 92
           ++L++ S+G PPV     +DTGS   W QC PC    C +Q   P+FDP  SSTY+S+SC
Sbjct: 101 LFLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKS--CSQQIIGPMFDPSISSTYDSLSC 158

Query: 93  SSSQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            +  C    S  C S   C Y+  Y  G     S G +ATE L F S+      + NV+F
Sbjct: 159 KNIICRYAPSGECDSSSQCVYNQTYVEGLP---SVGVIATEQLIFGSSDEGRNAVNNVLF 215

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GC H+N      D + TG+ GLG G +S+++QMG+    KFSYC+ +      ++  +V 
Sbjct: 216 GCSHRN--GNYKDRRFTGVFGLGSGITSVVNQMGS----KFSYCIGNIADPDYSYNQLVL 269

Query: 211 GAGV----VSTPLIIRD-HYYLSLEAISVGNQRLEFVSSS------TGNIFVDTGVLRTL 259
             GV     STPL + D HY + LE ISVG  RL    S+         + +D+G   T 
Sbjct: 270 SEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTW 329

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCYNISSQPK---FPEVTIHF-RGA 314
           L    +  L+  + N++           P   +  LCY          FP VT HF  GA
Sbjct: 330 LAENEYRALEREVRNLLDR------FLTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEGA 383

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           D+          +  E+  ++  G +     V G + Q  + + YD+ +  + F+   C
Sbjct: 384 DLV---------VDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 110/369 (29%), Positives = 179/369 (48%), Gaps = 53/369 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G   + +   VDTGSD TW QC+PC    C+ Q+ PL+DP  SS+Y ++ C+S
Sbjct: 138 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 193

Query: 95  SQCA-VVTSNCSEG-----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S C  +V +  + G            C Y   YG G+Y   + G+LA+E++    T    
Sbjct: 194 STCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSY---TRGDLASESIVLGDT---- 246

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
            ++ N++FGCG  N       S   G++GLG  + SL+SQ   +  G FSYCLP   D  
Sbjct: 247 -KLENLVFGCGRNNKGLFGGAS---GLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGA 302

Query: 200 SSKINFGGIVA----GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
           S  ++FG   +       V  TPL+    +R  Y L+L   S+G   L+ +S   G I +
Sbjct: 303 SGTLSFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRG-ILI 361

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVT 308
           D+G + T LP   +   K+V +  +K     G  + PG+S +  C+N++S      P + 
Sbjct: 362 DSGTVITRLPPSIY---KAVKTEFLKQ--FSGFPSAPGYSILDTCFNLTSYEDISIPTIK 416

Query: 309 IHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQA 362
           + F G A++++  + +F  +  +  ++C A       N   + G   Q N  + YD  Q 
Sbjct: 417 MIFEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQE 476

Query: 363 MVSFKPSRC 371
            +      C
Sbjct: 477 RLGIAGENC 485


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 111/354 (31%), Positives = 160/354 (45%), Gaps = 38/354 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IGTP  D+    DTGSD TWTQCEPC    C+ Q+ P F+P  SSTY ++SCSS
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSTYQNVSCSS 190

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             C    S CS  +C YS +YG     SF+ G LA E  T  ++  L     +V FGCG 
Sbjct: 191 PMCEDAES-CSASNCVYSIVYGD---KSFTQGFLAKEKFTLTNSDVL----EDVYFGCGE 242

Query: 155 KN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
            N                    P      +Q  T+    FSYCLP      +  + FG  
Sbjct: 243 NNQGLFDGVAGLLGLGPGKLSLP------AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA 296

Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFV--SSSTGNIFVDTGVLRTLLPLE 263
                V  TP+       +Y + +  ISVG++ L     S ST    +D+G + T LP +
Sbjct: 297 GISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 356

Query: 264 YHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-VKLS 319
            ++ L+SV    M   +   G G         CY+ +      +P +   F G+  V+L 
Sbjct: 357 VYAELRSVFKEKMSSYKSTSGYGLFD-----TCYDFTGLDTVTYPTIAFSFAGSTVVELD 411

Query: 320 PSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            S +   I    +C AF  GN ++  ++G + Q    + YD+    V F P+ C
Sbjct: 412 GSGISLPIKISQVCLAF-AGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 106/358 (29%), Positives = 163/358 (45%), Gaps = 43/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+FDP  SS+YN ++C +
Sbjct: 157 YFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPCS--DCYQQSDPIFDPTASSSYNPLTCDA 214

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC  +  S C  G C Y   YG G   SF+ G   TET++F + S     +  V  GCG
Sbjct: 215 QQCQDLEMSACRNGKCLYQVSYGDG---SFTVGEYVTETVSFGAGS-----VNRVAIGCG 266

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N       +   G+ G     +S I       A  FSYCL D+ S K   + F     
Sbjct: 267 HDNEGLFVGSAGLLGLGGGPLSLTSQIK------ATSFSYCLVDRDSGKSSTLEFNSPRP 320

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRTL 259
           G  VV+ PL+    +   YY+ L  +SVG        +      S  G + VD+G   T 
Sbjct: 321 GDSVVA-PLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITR 379

Query: 260 LPLEYHSNLKSVMSNMI-KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
           L  + +++++          +P +GV       D  CY++SS    + P V+ HF G   
Sbjct: 380 LRTQAYNSVRDAFKRKTSNLRPAEGV----ALFDT-CYDLSSLQSVRVPTVSFHFSGDRA 434

Query: 317 KLSPSNLFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              P+  +    D     C AF    +++ + G + Q    + +D+  ++V F P++C
Sbjct: 435 WALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 108/356 (30%), Positives = 165/356 (46%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP       VDTGS  TW QC PC  + C +Q  P+FDP+ S TY ++ CSS
Sbjct: 131 YVTRLGLGTPATSYVMVVDTGSSLTWLQCSPC-SVSCHRQAGPVFDPRASGTYAAVQCSS 189

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           S+C  +       S CS  + C Y   YG    +S+S G L+ +T++F S S      P 
Sbjct: 190 SECGELQAATLNPSACSVSNVCIYQASYGD---SSYSVGYLSKDTVSFGSGS-----FPG 241

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INF 205
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   ++   ++ 
Sbjct: 242 FYYGCGQDNEGL---FGRSAGLIGLAKNKLSLLYQLAPSLGYAFSYCLPTSSAAAGYLSI 298

Query: 206 GGIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
           G    G     TP+    +    Y+++L  ISV    L    S   ++   +D+G + T 
Sbjct: 299 GSYNPGQ-YSYTPMASSSLDASLYFVTLSGISVAGAPLAVPPSEYRSLPTIIDSGTVITR 357

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP-KFPEVTIHFRG-ADV 316
           LP     N+ + +S  + A         P +S +  C+  S+   + P V + F G A +
Sbjct: 358 LP----PNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVDMAFAGGATL 413

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            LSP N+  ++ D   C AF       + G   Q  F + YD+ Q+ + F    C+
Sbjct: 414 ALSPGNVLIDVDDSTTCLAFAPTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 469


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 125/376 (33%), Positives = 181/376 (48%), Gaps = 50/376 (13%)

Query: 30  SVDDI-YLMHLSIGTPP-VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           S+D + Y++ + +G+PP       +DTGSD +W +C+PC +  C  Q  PLFDP  SSTY
Sbjct: 134 SLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCWQ-QCRPQVDPLFDPSLSSTY 192

Query: 88  NSISCSSSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           +  SCSS+ CA +      + C S G C Y  +YG G+    ++G  +++TL   S S  
Sbjct: 193 SPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVG--TTGTYSSDTLALGSNSNT 250

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGII-------GLGPGNSSLISQ-MGTSIAGKFSY 193
            V +    FGC H           +TGI        GLG G  SL+SQ  GT     FSY
Sbjct: 251 -VVVSKFRFGCSH----------AETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSY 299

Query: 194 CLPDQGSSK--INFGGI-VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
           CLP   SS   +  G    + AG V TP++    +   Y + LEAI VG ++L   ++  
Sbjct: 300 CLPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF 359

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--K 303
           +  + +D+G + T LP   +S+L S     +K  P     A  GF D  C+++S Q    
Sbjct: 360 SAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDT-CFDMSGQSSVS 418

Query: 304 FPEVTIHFRGAD---VKLSPSNLFRNI-SDEIMCSAFRG----GNANIVYGRIMQINFLI 355
            P V + F GA    V L  S +   + +  I C AF      G+  I+ G + Q  F +
Sbjct: 419 MPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGII-GNVQQRTFQV 477

Query: 356 GYDIEQAMVSFKPSRC 371
            YD+    V FK   C
Sbjct: 478 LYDVAGGAVGFKAGAC 493


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 170/382 (44%), Gaps = 60/382 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTPP      +DTGSD  W QC PC E  CF+Q  P +DP +SS+Y +I C  
Sbjct: 181 YFIDVFVGTPPKHFSLILDTGSDLNWIQCVPCYE--CFEQNGPHYDPGQSSSYRNIGCHD 238

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST--SGLP--V 143
           S+C +V+S             C Y + YG    +S ++G+ A ET T N T  SG P   
Sbjct: 239 SRCHLVSSPDPPQPCKAENQTCPYYYWYGD---SSNTTGDFALETFTVNLTMSSGKPELR 295

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-- 201
            + NV+FGCGH N       +   G+        S  SQ+ +     FSYCL D+ S   
Sbjct: 296 RVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDAN 352

Query: 202 -----------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-------NQ 237
                            ++NF  +VAG      P  +   YY+ +++I VG        +
Sbjct: 353 VSSKLIFGEDKDLLSHPELNFTTLVAGK---ENP--VDTFYYVQIKSIVVGGEVVNIPEE 407

Query: 238 RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
           + +  +  +G   +D+G   +      +  +K      +K  PV  V   P      CYN
Sbjct: 408 KWQIATDGSGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPV--VKDFPVLEP--CYN 463

Query: 298 IS--SQPKFPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRGG--NANIVYGRIMQI 351
           ++   QP  P+  I F  GA       N F  I   E++C A  G   +A  + G   Q 
Sbjct: 464 VTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQ 523

Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
           NF I YD +++ + F P++C +
Sbjct: 524 NFHILYDTKKSRLGFAPTKCAD 545


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 114/364 (31%), Positives = 169/364 (46%), Gaps = 53/364 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G+P   ++  +DTGSD TW QC+PC   DC++Q  P+FDP  S++Y S++C +
Sbjct: 167 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACDN 224

Query: 95  SQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            +C    A    N S G C Y   YG G+Y   + G+ ATETLT   ++  PV   +V  
Sbjct: 225 PRCHDLDAAACRN-STGACLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--SVAI 276

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGG 207
           GCGH N       +    + G      S IS      A  FSYCL D+    SS + FG 
Sbjct: 277 GCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGD 330

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVL 256
             A    V+ PLI        YY+ L  +SVG Q L      F   ST  G + VD+G  
Sbjct: 331 --AADAEVTAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTA 388

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
            T L    ++ L+           V+G  + P  S V     CY++S +   + P V++ 
Sbjct: 389 VTRLQSSAYAALRDAF--------VRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 440

Query: 311 FR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           F  G +++L   N    +      C AF   NA + + G + Q    + +D  ++ V F 
Sbjct: 441 FAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFT 500

Query: 368 PSRC 371
            ++C
Sbjct: 501 TNKC 504


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 108/362 (29%), Positives = 178/362 (49%), Gaps = 34/362 (9%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY 87
           +I     +L +LSIG PP +++  +DTGSD  W QCEPC    C+KQ+ P+++  KS +Y
Sbjct: 99  LIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDV--CYKQKDPIYNRTKSDSY 156

Query: 88  NSISCSSSQCAVV--TSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
             + C+   C  +     CS+ G C Y   Y  G   S +SG L+ E + F S      +
Sbjct: 157 TEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADG---SRTSGLLSYEKVAFTSHYSDEDK 213

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSSK 202
              V FGCG +NL   TS S+  G++GLGPG  SL+SQ+     ++  F+YC  +   S 
Sbjct: 214 TAQVGFGCGLQNLNFVTS-SRDGGVLGLGPGLVSLVSQLSAIGKVSKSFAYCFGNL--SN 270

Query: 203 INFGGIVAGAGVV-----STPLIIRDHYYLSLEAISVGNQ--RLEFVSSS-------TGN 248
            N GG +            TP++I + YY++L  I +G +  RL+  SSS       +G 
Sbjct: 271 PNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGG 330

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFP 305
           + +D+G   ++ P E +  +++ + + +K    KG    P  S   C+        P FP
Sbjct: 331 VIIDSGSTLSIFPPEVYEVVRNAVVDKLK----KGYNISPLTSSPDCFEGKIGRDLPLFP 386

Query: 306 EVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
            + ++     +     ++F    DE+ C  F  G    + G + Q ++  GY++E + +S
Sbjct: 387 TLVLYLESTGILNDRWSIFLQRYDELFCLGFTSGEGLSIIGTLAQQSYKFGYNLELSTLS 446

Query: 366 FK 367
            +
Sbjct: 447 IE 448


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 161/371 (43%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP+ S +Y ++ C++
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRR--CYDQSGQMFDPRASHSYGAVDCAA 204

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ ATETLTF S +     +P V  G
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFASGA----RVPRVALG 257

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---------QGSSK 202
           CGH N     + +   G+     G+ S  SQ+       FSYCL D           SS 
Sbjct: 258 CGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314

Query: 203 INFGGIVAG--AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFV---------SSSTG 247
           + FG    G  A    TP++    +   YY+ L  ISVG  R+  V         S+  G
Sbjct: 315 VTFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRG 374

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
            + VD+G   T L    ++ L+             G+   PG   +   CY++S     K
Sbjct: 375 GVIVDSGTSVTRLARPAYAALRDAFRAA-----AAGLRLSPGGFSLFDTCYDLSGLKVVK 429

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
            P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F + +D +
Sbjct: 430 VPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGD 489

Query: 361 QAMVSFKPSRC 371
              + F P  C
Sbjct: 490 GQRLGFVPKGC 500


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 119/387 (30%), Positives = 177/387 (45%), Gaps = 57/387 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP-------CPELDCFKQEPPLFDPKKSSTY 87
           YL+ ++ GTPP ++    DTGSD  W QC         CP+  C ++  P F   KS+T 
Sbjct: 54  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 111

Query: 88  NSISCSSSQCAVVTS------NCSEGD---CSYSFLYGRGAYASFSSGNLATETLTF-NS 137
           + + CS++QC +V +      +CS      C Y++ Y  G   S ++G LA +T T  N 
Sbjct: 112 SVVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADG---SSTTGFLARDTATISNG 168

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
           TSG    +  V FGCG +N     S S   G+IGLG G  S  +Q G+  A  FSYCL D
Sbjct: 169 TSG-GAAVRGVAFGCGTRNQGG--SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLD 225

Query: 198 -------QGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
                  + SS +  G     A    TPL+        YY+ + AI VGN+ L    S  
Sbjct: 226 LEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEW 285

Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI 298
                  G   +D+G   T L L  + +L S  +  +    +    +   F  + LCYN+
Sbjct: 286 AIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIP--SSATFFQGLELCYNV 343

Query: 299 SSQPK-------FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGR 347
           SS          FP +TI F +G  ++L   N   +++D++ C A R      A  V G 
Sbjct: 344 SSSSSLAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGN 403

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTNY 374
           +MQ  + + +D   A + F  + C  +
Sbjct: 404 LMQQGYHVEFDRASARIGFARTECVAH 430


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 168/364 (46%), Gaps = 53/364 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G+P   ++  +DTGSD TW QC+PC   DC++Q  P+FDP  S++Y S++C +
Sbjct: 163 YFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACDN 220

Query: 95  SQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            +C    A    N S G C Y   YG G+Y   + G+ ATETLT   ++  PV   +V  
Sbjct: 221 PRCHDLDAAACRN-STGACLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--SVAI 272

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGG 207
           GCGH N       +    + G      S IS      A  FSYCL D+    SS + FG 
Sbjct: 273 GCGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGD 326

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVL 256
             A    V+ PLI        YY+ L  ISVG Q L    S+        G + VD+G  
Sbjct: 327 --AADAEVTAPLIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTA 384

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIH 310
            T L    ++ L+           V+G  + P  S V     CY++S +   + P V++ 
Sbjct: 385 VTRLQSSAYAALRDAF--------VRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 436

Query: 311 FR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
           F  G +++L   N    +      C AF   NA + + G + Q    + +D  ++ V F 
Sbjct: 437 FAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFT 496

Query: 368 PSRC 371
            ++C
Sbjct: 497 SNKC 500


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 175/381 (45%), Gaps = 70/381 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G+PP  +   +DTGS+ +W  C+  P L        +FDP +SS+Y+ I C+S  C 
Sbjct: 67  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPTCR 120

Query: 99  VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
             T +      C +    ++ +    +YA  SS  GNLA++T    +++     +P  IF
Sbjct: 121 TRTRDFSIPVSCDKKKLCHAII----SYADASSIEGNLASDTFHIGNSA-----IPATIF 171

Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
           GC     +S +  DSK TG+IG+  G+ S ++QMG     KFSYC+  Q SS I   G  
Sbjct: 172 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILLFGES 228

Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
           + + +          +STPL   D   Y + LE I V N  L+   S         G   
Sbjct: 229 SFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTM 288

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQP 302
           VD+G   T L    ++ LK+      KA     V  +P F    +  LCY +     + P
Sbjct: 289 VDSGTQFTFLLGPVYTALKNEFVRQTKAS--LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQ 350
             P VT+ FRGA++ +S   L   +      SD + C  F  GN+ +      + G   Q
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF--GNSELLGVESYIIGHHHQ 404

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            N  + +D+ ++ V F   RC
Sbjct: 405 QNVWMEFDLAKSRVGFAEVRC 425


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 111/361 (30%), Positives = 158/361 (43%), Gaps = 39/361 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG P    +  +DTGSD TW QC PC    C+ Q  P++DP  SS+Y  + C S
Sbjct: 12  YFARMGIGNPQRSYYLELDTGSDVTWIQCAPCSS--CYSQVDPIYDPSNSSSYRRVYCGS 69

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C A+  S C    CSY  +YG    +S SSG+L  E+      S     M N+ FGCG
Sbjct: 70  ALCQALDYSACQGMGCSYRVVYGD---SSASSGDLGIESFYLGPNSS--TAMRNIAFGCG 124

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKINFGG 207
           H N      ++   G+ G      S  SQ+  SI   FSYCL D+       SS + FG 
Sbjct: 125 HSNSGLFRGEAGLLGMGGG---TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGR 181

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVL 256
                    TPL+    I   YY  L  ISVG   L           + TG   +D+G  
Sbjct: 182 TAIPFAARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTS 241

Query: 257 RT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-R 312
            T ++P  Y     +  +      P  GV     +    C+N    P  + P + +HF  
Sbjct: 242 VTRVVPPAYAVLRDAYRAASRNLPPAPGV-----YLLDTCFNFQGLPTVQIPSLVLHFDN 296

Query: 313 GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           G D+ L   N+   +      C AF   +  I V G + Q  F IG+D+++++++  P  
Sbjct: 297 GVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPRE 356

Query: 371 C 371
           C
Sbjct: 357 C 357


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  130 bits (328), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 175/381 (45%), Gaps = 70/381 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G+PP  +   +DTGS+ +W  C+  P L        +FDP +SS+Y+ I C+S  C 
Sbjct: 60  LTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPIPCTSPTCR 113

Query: 99  VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
             T +      C +    ++ +    +YA  SS  GNLA++T    +++     +P  IF
Sbjct: 114 TRTRDFSIPVSCDKKKLCHAII----SYADASSIEGNLASDTFHIGNSA-----IPATIF 164

Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
           GC     +S +  DSK TG+IG+  G+ S ++QMG     KFSYC+  Q SS I   G  
Sbjct: 165 GCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILLFGES 221

Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
           + + +          +STPL   D   Y + LE I V N  L+   S         G   
Sbjct: 222 SFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTM 281

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQP 302
           VD+G   T L    ++ LK+      KA     V  +P F    +  LCY +     + P
Sbjct: 282 VDSGTQFTFLLGPVYTALKNEFVRQTKAS--LKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339

Query: 303 KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQ 350
             P VT+ FRGA++ +S   L   +      SD + C  F  GN+ +      + G   Q
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTF--GNSELLGVESYIIGHHHQ 397

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            N  + +D+ ++ V F   RC
Sbjct: 398 QNVWMEFDLAKSRVGFAEVRC 418


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 122/408 (29%), Positives = 177/408 (43%), Gaps = 58/408 (14%)

Query: 7   LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
           L F   N T KSP+  I  A   S    Y + + +GTPP  +    DTGSD  W +C  C
Sbjct: 64  LLFSRPNPTLKSPL--ISGASTGSGQ--YFVDIRLGTPPQSLLLVADTGSDLVWVKCSAC 119

Query: 67  PELDCFKQEPP--LFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYS-------FLYGR 117
              +C    PP   F P+ SS+++   C    C ++  +     C+++       FLY  
Sbjct: 120 --RNC-SHHPPSSAFLPRHSSSFSPFHCFDPHCRLL-PHAPHHLCNHTRLHSPCRFLYSY 175

Query: 118 GAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ----TGIIGLG 173
            A  S SSG  + ET T  S SG  + +  + FGCG + ++ P+    Q     G++GLG
Sbjct: 176 -ADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGCGFR-ISGPSVSGAQFNGARGVMGLG 233

Query: 174 PGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPL-------------- 219
            G+ S  SQ+G     KFSYCL D   S      ++ G G+ S PL              
Sbjct: 234 RGSISFSSQLGRRFGNKFSYCLMDYTLSPPPTSFLMIGGGLHSLPLTNATKISYTPLQIN 293

Query: 220 -IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLL-PLEYHSNLKS 270
            +    YY+++ +I++   +L       E      G   VD+G   T L    Y   LKS
Sbjct: 294 PLSPTFYYITIHSITIDGVKLPINPAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKS 353

Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV-TIHFR---GADVKLSPSNLFRN 326
           V     + +        PGF   LC N S + + P +  + FR   GA     P N F  
Sbjct: 354 VRR---RVKLPNAAELTPGFD--LCVNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLE 408

Query: 327 ISDEIMCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             + +MC A R    GN   V G +MQ  FL+ +D E++ + F    C
Sbjct: 409 TEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGFTRRGC 456


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 116/353 (32%), Positives = 167/353 (47%), Gaps = 39/353 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P       +D+GSD +W QC+PC  L C  Q  PLFDP  SSTY+  SCSS
Sbjct: 131 YLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQCHSQVDPLFDPSLSSTYSPFSCSS 188

Query: 95  SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + CA +  +     S   C Y   Y  G   S ++G  +++TL   S +     + N  F
Sbjct: 189 AACAQLGQDGNGCSSSSQCQYIVRYADG---SSTTGTYSSDTLALGSNT-----ISNFQF 240

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GC H  + S  +D    G++GLG G  SL SQ   +    FSYCLP   SS   F  + A
Sbjct: 241 GCSH--VESGFNDLTD-GLMGLGGGAPSLASQTAGTFGTAFSYCLPPTPSSS-GFLTLGA 296

Query: 211 G-AGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEY 264
           G +G V TP++    +   Y + LEAI VG  +L   +S  +  + +D+G + T LP   
Sbjct: 297 GTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFSAGMVMDSGTIITRLPRTA 356

Query: 265 HSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPS 321
           +S L S   + M + +P     A P      C++ S Q   + P V + F G  V     
Sbjct: 357 YSALSSAFKAGMKQYRP-----APPRSIMDTCFDFSGQSSVRLPSVALVFSGGAVV---- 407

Query: 322 NLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           NL  N      C AF   + +    + G + Q  F + YD+    V FK   C
Sbjct: 408 NLDANGIILGNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVGFKAGAC 460


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 159/358 (44%), Gaps = 39/358 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP + +  +D+GSD  W QCEPC +  C+ Q  P+F+P  SS+++ +SC+S
Sbjct: 136 YFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQ--CYHQSDPVFNPADSSSFSGVSCAS 193

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ V  + C EG C Y   YG G+Y   + G LA ET+TF  T      + NV  GCG
Sbjct: 194 TVCSHVDNAACHEGRCRYEVSYGDGSY---TKGTLALETITFGRT-----LIRNVAIGCG 245

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGIVA 210
           H N       +   G+ G      S + Q+G    G FSYCL  +G   S  + FG    
Sbjct: 246 HHNQGMFVGAAGLLGLGGG---PMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEFGREAM 302

Query: 211 GAGVVSTPLI----IRDHYYLSLEA-------ISVGNQRLEFVSSSTGNIFVDTGVLRTL 259
             G    PLI     +  YY+ L         +S+     +      G + +DTG   T 
Sbjct: 303 PVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVTR 362

Query: 260 LP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
           LP + Y +     ++         GV          CY++      + P V+ +F G  +
Sbjct: 363 LPTVAYEAFRDGFIAQTTNLPRASGVSIFD-----TCYDLFGFVSVRVPTVSFYFSGGPI 417

Query: 317 KLSPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              P+  F    D++   C AF   ++ + + G I Q    I  D     V F P+ C
Sbjct: 418 LTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPNVC 475


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 103/353 (29%), Positives = 154/353 (43%), Gaps = 48/353 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP   +  +D+GSD  W QC+PC +  C+ Q  P+FDP  S+++  +SCSS
Sbjct: 201 YFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQ--CYHQSDPVFDPADSASFTGVSCSS 258

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  +  + C  G C Y   YG G+Y   + G LA ETLTF  T      + +V  GCG
Sbjct: 259 SVCDRLENAGCHAGRCRYEVSYGDGSY---TKGTLALETLTFGRTM-----VRSVAIGCG 310

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
           H+N       +   G+ G    + S + Q+G    G FSYCL             V+ A 
Sbjct: 311 HRNRGMFVGAAGLLGLGGG---SMSFVGQLGGQTGGAFSYCL-------------VSAAW 354

Query: 214 VVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP- 261
           V   PL+        YY+ L  + VG  R+              G + +DTG   T LP 
Sbjct: 355 V---PLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPT 411

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPS 321
           L Y +   + ++         GV       D+L +      + P V+ +F G  +   P+
Sbjct: 412 LAYQAFRDAFLAQTANLPRATGVAIFDTCYDLLGF---VSVRVPTVSFYFSGGPILTLPA 468

Query: 322 NLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             F    D+    C AF    + + + G I Q    I +D     V F P+ C
Sbjct: 469 RNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 173/366 (47%), Gaps = 47/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L IGTP V     +DTGSD +W QC+PC   DC+ Q+ PLFDP KSST+ +I C+S
Sbjct: 125 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCAS 184

Query: 95  SQCAVV--------TSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
             C  +         +N + G    C Y+  YG GA    + G  +TETL   S++    
Sbjct: 185 DACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGA---ITEGVYSTETLALGSSA---- 237

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
            + +  FGCG           K  G++GLG    SL+SQ  +   G FSYCLP  + G+ 
Sbjct: 238 VVKSFRFGCGSDQHG---PYDKFDGLLGLGGAPESLVSQTASVYGGAFSYCLPPLNSGAG 294

Query: 202 KINFGGI----VAGAGVVSTPL-----IIRDHYYLSLEAISVGNQRLEFVSS--STGNIF 250
            +  G       + +G V TP+      I   Y ++L  ISVG + L+   +  + GNI 
Sbjct: 295 FLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVFAKGNI- 353

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
           VD+G + T +P   +  L++   + +   P+      P  S +  CYN +       P+V
Sbjct: 354 VDSGTVITGIPTTAYKALRTAFRSAMAEYPL----LPPADSALDTCYNFTGHGTVTVPKV 409

Query: 308 TIHF-RGADVKLS-PSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
            + F  GA V L  PS +   + ++ +  A  G  +  + G +      + YD  +  + 
Sbjct: 410 ALTFVGGATVDLDVPSGV---LVEDCLAFADAGDGSFGIIGNVNTRTIEVLYDSGKGHLG 466

Query: 366 FKPSRC 371
           F+   C
Sbjct: 467 FRAGAC 472


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/348 (29%), Positives = 167/348 (47%), Gaps = 35/348 (10%)

Query: 52  VDTGSDCTWTQCEPCPELD--CFKQEPPLFDPKKSSTYNSISCSS-SQCAVVTSNCSEGD 108
           +DTG++ +W QCE C      CF  + P +   +S +Y  +SC+  S C    + C EG 
Sbjct: 105 IDTGNELSWIQCEGCQNKGNMCFPHKDPPYTSSQSKSYKPVSCNQHSFCE--PNQCKEGL 162

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKNL--ASPTSDS 164
           C+Y+  YG G+Y   +SGNLA ET TF S  G    + ++ FGC    +N+  A     +
Sbjct: 163 CAYNVTYGPGSY---TSGNLANETFTFYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKN 219

Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLIIR 222
             +G++G+G G  S ++Q+G+   GKFSYC+   +  ++ + FG  V  +  + T  I++
Sbjct: 220 PVSGVLGMGWGPRSFLAQLGSISHGKFSYCITANNTHNTYLRFGKHVVKSKNLQTTKIMQ 279

Query: 223 ----DHYYLSLEAISVGNQRLEFVSS--------STGNIFVDTGVLRTLLPLEYHSNLKS 270
                 Y+++L  ISV   +L    +        S G I +D G L TLL       L +
Sbjct: 280 VKPSAAYHVNLLGISVNGVKLNITKTDLAVRKDGSRGCI-IDAGTLATLLVKPIFDTLHT 338

Query: 271 VMSNMIKA-QPVKGVGAEPGFSDVLCYNISS---QPKFPEVTIHFRGADVKLSPSN--LF 324
            +SN + + Q +K         D LCY   S   +   P VT H   AD+++ P    LF
Sbjct: 339 ALSNHLSSNQNLKRWVIHKLHKD-LCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLF 397

Query: 325 RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           R    + + C +    ++  + G   Q+     YD +  ++SF P  C
Sbjct: 398 REFEGKNVFCLSMLSDDSKTIIGAYQQMKQKFVYDTKARVLSFGPEDC 445


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 163/356 (45%), Gaps = 35/356 (9%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + ++SIG PP+     +DTGSD  W  C PC   +C      LFDP KSST++ +    +
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPL--CKT 157

Query: 96  QCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
            C      C     + ++     A  S +SG    +T+ F +T      + +V+FGCGH 
Sbjct: 158 PCDFEGCRCDPIPFTVTY-----ADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH- 211

Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV- 214
           N+   T D    GI+GL  G  SL++++G     KFSYC+ +      N+  ++ G G  
Sbjct: 212 NIGHDT-DPGHNGILGLNNGPDSLVTKLGQ----KFSYCIGNLADPYYNYHQLILGEGAD 266

Query: 215 ---VSTPL-IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLE 263
               STP  +    YY+++E ISVG +RL       E   +  G + +DTG   T L   
Sbjct: 267 LEGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDS 326

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-PKFPEVTIHFR-GADVKLSPS 321
            H  L   + N++     +    +  +      +IS     FP VT HF  GAD+ L   
Sbjct: 327 VHKLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLALDSG 386

Query: 322 NLFRNISDEIMC------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + F  ++D + C      S+    +   + G + Q ++ +GYD+    V F+   C
Sbjct: 387 SFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 110/381 (28%), Positives = 180/381 (47%), Gaps = 30/381 (7%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           Q  + +   +  E  KS +     ++I + D  Y++++ IGTP  ++    DTGS   WT
Sbjct: 101 QARRSMNLTSSVEHMKSSVPFYGLSKITASD--YIVNVGIGTPKKEMPLIFDTGSGLIWT 158

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYA 121
           QC+PC    C+ + P +FDP KS+++  + CSS  C  +   CS   C+Y   Y      
Sbjct: 159 QCKPCKA--CYPKVP-VFDPTKSASFKGLPCSSKLCQSIRQGCSSPKCTYLTAY---VDN 212

Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
           S S+G LATET++F   S L  +  N++ GC  +       +S   GI+GL     SL S
Sbjct: 213 SSSTGTLATETISF---SHLKYDFKNILIGCSDQVSGESLGES---GIMGLNRSPISLAS 266

Query: 182 QMGTSIAGKFSYCLPDQGSS--KINFGGIVAGAGVVS--TPLIIRDHYYLSLEAISVGNQ 237
           Q        FSYC+P    S   + FGG V      S  +       Y + +  ISVG +
Sbjct: 267 QTANIYDKLFSYCIPSTPGSTGHLTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGR 326

Query: 238 RLEFVSSSTGNIF--VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
           +L  + +S   I   +D+G + T LP + +S L+SV   M+K  P+     +  F D  C
Sbjct: 327 KL-LIDASAFKIASTIDSGAVLTRLPPKAYSALRSVFREMMKGYPLLD---QDDFLDT-C 381

Query: 296 YNIS--SQPKFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQ 350
           Y+ S  S    P +++ F G  ++ +  S +   +   ++ C AF   +  + ++G   Q
Sbjct: 382 YDFSNYSTVAIPSISVFFEGGVEMDIDVSGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQ 441

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
             + + +D  +  + F P  C
Sbjct: 442 KTYTVVFDGAKERIGFAPGGC 462


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 172/371 (46%), Gaps = 50/371 (13%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           +   +   D  +L+ ++ GTP  +I   +DTGS  TWTQC+ C  ++C +     FD   
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSA 174

Query: 84  SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           SSTY+  SC  S    V +N       Y+  YG     S S GN   +T+T   +     
Sbjct: 175 SSTYSFGSCIPS---TVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD---- 217

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
                 FGCG  N       S   G++GLG G  S +SQ  +     FSYCLP++     
Sbjct: 218 VFQKFQFGCGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGS 275

Query: 200 ----------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---ST 246
                     SS + F  +V G G +        +Y+++L  ISVGN+RL   SS   S 
Sbjct: 276 LLFGEKATSQSSSLKFTSLVNGPGTLQES----GYYFVNLSDISVGNERLNIPSSVFASP 331

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK- 303
           G I +D+  + T LP   +S LK+     +   P+     + G  D+L  CYN+S +   
Sbjct: 332 GTI-IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDV 388

Query: 304 -FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
             PE+ +HF  GADV+L+ +N+        +C AF G +   + G   Q++  + YDI+ 
Sbjct: 389 LLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAGTSELTIIGNRQQLSLTVLYDIQG 448

Query: 362 AMVSFKPSRCT 372
             + F  + C+
Sbjct: 449 RRIGFGGNGCS 459


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 170/370 (45%), Gaps = 55/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+  + +   VDTGSD TW QCEPC    C+ Q  PLF P  S +Y  I C+S
Sbjct: 122 YIVTMGLGSQNMSVI--VDTGSDLTWVQCEPCR--SCYNQNGPLFKPSTSPSYQPILCNS 177

Query: 95  SQCAVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  +       D      C Y   YG G+Y   +SG L  E L F   S     + N 
Sbjct: 178 TTCQSLELGACGSDPSTSATCDYVVNYGDGSY---TSGELGIEKLGFGGIS-----VSNF 229

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFG 206
           +FGCG  N       S   G++GLG    S+ISQ   +  G FSYCLP  DQ  +    G
Sbjct: 230 VFGCGRNNKGLFGGAS---GLMGLGRSELSMISQTNATFGGVFSYCLPSTDQAGAS---G 283

Query: 207 GIVAG--AGVVS--TP---------LIIRDHYYLSLEAISVGNQRLEFVSSSTGN--IFV 251
            +V G  +GV    TP         L + + Y L+L  I VG   L   +SS GN  + +
Sbjct: 284 SLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSFGNGGVIL 343

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
           D+G + + L    +  LK+         P     + PGFS +  C+N++   Q   P ++
Sbjct: 344 DSGTVISRLAPSVYKALKAKFLEQFSGFP-----SAPGFSILDTCFNLTGYDQVNIPTIS 398

Query: 309 IHFRG-ADVKLSPSNLFRNISDEI--MCSAFRGGNANI---VYGRIMQINFLIGYDIEQA 362
           ++F G A++ +  + +F  + ++   +C A    +      + G   Q N  + YD + +
Sbjct: 399 MYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLS 458

Query: 363 MVSFKPSRCT 372
            V F    CT
Sbjct: 459 QVGFAKEPCT 468


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 111/354 (31%), Positives = 158/354 (44%), Gaps = 38/354 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IGTP  D+    DTGSD TWTQCEPC    C+ Q+ P F+P  SSTY ++SCSS
Sbjct: 132 YIVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSTYQNVSCSS 190

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             C    S CS  +C YS  YG     SF+ G LA E  T  ++  L     +V FGCG 
Sbjct: 191 PMCEDAES-CSASNCVYSIGYGD---KSFTQGFLAKEKFTLTNSDVL----EDVYFGCGE 242

Query: 155 KN---LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINFGGI 208
            N                    P      +Q  T+    FSYCLP      +  + FG  
Sbjct: 243 NNQGLFDGVAGLLGLGPGKLSLP------AQTTTTYNNIFSYCLPSFTSNSTGHLTFGSA 296

Query: 209 VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFV--SSSTGNIFVDTGVLRTLLPLE 263
                V  TP+       +Y + +  ISVG++ L     S ST    +D+G + T LP +
Sbjct: 297 GISESVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTK 356

Query: 264 YHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGAD-VKLS 319
            ++ L+SV    M   +   G G         CY+ +      +P +   F G   V+L 
Sbjct: 357 VYAELRSVFKEKMSSYKSTSGYGLFD-----TCYDFTGLDTVTYPTIAFSFAGGTVVELD 411

Query: 320 PSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            S +   I    +C AF  GN ++  ++G + Q    + YD+    V F P+ C
Sbjct: 412 GSGISLPIKISQVCLAF-AGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 163/362 (45%), Gaps = 41/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+P    +  +DTGSD TW QC PC    C+ Q  P++DP  SS+Y  + C S
Sbjct: 45  YFARMGIGSPQRSYYLELDTGSDVTWIQCAPCSS--CYSQVDPIYDPSNSSSYRRVYCGS 102

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C A+  S C    CSY  +YG    +S SSG+L  E+      S     M N+ FGCG
Sbjct: 103 ALCQALDYSACQGMGCSYRVVYGD---SSASSGDLGIESFYLGPNSS--TAMRNIAFGCG 157

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------GSSKINFGG 207
           H N      ++   G+ G      S  SQ+  SI   FSYCL D+       SS + FG 
Sbjct: 158 HSNSGLFRGEAGLLGMGGG---TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGR 214

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGV- 255
                    TPL+    I   YY  L  ISVG   L           + TG   +D+G  
Sbjct: 215 TAIPFAARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTS 274

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHF- 311
           +  ++P  Y     +V+ +  +A   + +   PG   +  C+N    P  + P + +HF 
Sbjct: 275 VTRVVPAAY-----AVLRDAYRAAS-RNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFD 328

Query: 312 RGADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPS 369
              D+ L   N+   +      C AF   +  I V G + Q  F IG+D+++++++  P 
Sbjct: 329 NDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPR 388

Query: 370 RC 371
            C
Sbjct: 389 EC 390


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 171/374 (45%), Gaps = 59/374 (15%)

Query: 40  SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA- 98
           S G+P  ++   VDTGSD TW QC+PC    C+ Q  PLFDP  S+TY ++ C++S CA 
Sbjct: 153 SSGSPAANLTVIVDTGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAD 210

Query: 99  -----------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
                        ++      C Y+  YG G   SFS G LAT+T+     S     +  
Sbjct: 211 SLRAATGTPGSCGSTGAGSEKCYYALAYGDG---SFSRGVLATDTVALGGAS-----LGG 262

Query: 148 VIFGCG--HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP------DQG 199
            +FGCG  ++ L   T+     G++GLG    SL+SQ  +   G FSYCLP        G
Sbjct: 263 FVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASG 317

Query: 200 SSKINFGGIVAGAGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIF 250
           S  +  G   A +   +TP+    +I D      Y+L++   +VG   L        N+ 
Sbjct: 318 SLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVL 377

Query: 251 VDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPE 306
           +D+G V+  L P  Y    ++V +  ++     G  A PGFS +  CY+++   + K P 
Sbjct: 378 IDSGTVITRLAPSVY----RAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPL 433

Query: 307 VTIHFR-GADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
           +T+    GADV +  + +   +  +     +  ++    +   + G   Q N  + YD  
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTL 493

Query: 361 QAMVSFKPSRCTNY 374
            + + F    C NY
Sbjct: 494 GSRLGFADEDC-NY 506


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 171/373 (45%), Gaps = 54/373 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y +  S+GTP       VDTGSD  + QC PC +L C++Q+ PL+ P  SST+  + C S
Sbjct: 34  YFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC-DL-CYEQDGPLYQPSNSSTFTPVPCDS 91

Query: 95  SQC----AVVTSNCS--------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           ++C    A V + CS        +G CSY + YG     S + G  A ET T        
Sbjct: 92  AECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDN---SSTVGVFAYETATVGG----- 143

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
           + + +V FGCG++N  S  S     G++GLG G  S  SQ G +   KF+YCL    S  
Sbjct: 144 IRVNHVAFGCGNRNQGSFVSAG---GVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPT 200

Query: 203 INFGGIVAGAGVVS-------TPLIIR----DHYYLSLEAISVGNQRLEFVSSS------ 245
             F  ++ G  ++S       TPL+        YY+ +  I  G + L    S+      
Sbjct: 201 SVFSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSV 260

Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNIS--SQ 301
             G    D+G   T    + ++    +++   K+ P       P G    LC N+S    
Sbjct: 261 GNGGTIFDSGTTVTYWSPQAYAR---IIAAFEKSVPYPRAPPSPQGLP--LCVNVSGIDH 315

Query: 302 PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYD 358
           P +P  TI F +GA  + +  N F  +S  I C A    +++   V G I+Q N+L+ YD
Sbjct: 316 PIYPSFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYD 375

Query: 359 IEQAMVSFKPSRC 371
            E+  + F  + C
Sbjct: 376 REEHRIGFAHANC 388


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 154/361 (42%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP   +  +D+GSD  W QC+PC +  C+ Q  PLFDP  S+++  +SCSS
Sbjct: 43  YFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQ--CYHQTDPLFDPADSASFMGVSCSS 100

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C  V  + C+ G C Y   YG G   S + G LA ETLT   T      + NV  GCG
Sbjct: 101 AVCDQVDNAGCNSGRCRYEVSYGDG---SSTKGTLALETLTLGRTV-----VQNVAIGCG 152

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           H N       +   G+ G    + S + Q+       FSYCL  + ++    + FG    
Sbjct: 153 HMNQGMFVGAAGLLGLGGG---SMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAM 209

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTL 259
             G    PLI   H    YY+ L  + VG+ ++       E      G + +DTG   T 
Sbjct: 210 PVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTR 269

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISS--QPKFPEVTIHFRG 313
            P   +   +           +   G  P  S V     CYN+      + P V+ +F G
Sbjct: 270 FPTVAYEAFRDAF--------IDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSG 321

Query: 314 ADVKLSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSR 370
             +   P+N F    D+    C AF    + + + G I Q    I  D     V F P+ 
Sbjct: 322 GPILTLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNV 381

Query: 371 C 371
           C
Sbjct: 382 C 382


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 118/414 (28%), Positives = 184/414 (44%), Gaps = 62/414 (14%)

Query: 3   NSQKLPFYNDNETPKSPISIIYQAEIIS---------VDDIYLMHLSIGTPPVDIFGSVD 53
           NS +L   ND     S  S   +  + S         V  I L   S G+P  ++   VD
Sbjct: 149 NSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSSGSPAANLTVIVD 208

Query: 54  TGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNCSE 106
           TGSD TW QC+PC    C+ Q  PLFDP  S+TY ++ C++S CA           +C  
Sbjct: 209 TGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAASLKAATGTPGSCGG 266

Query: 107 GD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKNLASPTS 162
           G+  C Y+  YG G   SFS G LAT+T+     S     +   +FGCG  ++ L   T+
Sbjct: 267 GNERCYYALAYGDG---SFSRGVLATDTVALGGAS-----LDGFVFGCGLSNRGLFGGTA 318

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIVAGAGVVSTP 218
                G++GLG    SL+SQ      G FSYCLP       S  ++ GG  A +   +TP
Sbjct: 319 -----GLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGDASGSLSLGGD-ASSYRNTTP 372

Query: 219 L----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
           +    +I D      Y+L++   +VG   L        N+ +D+G + T L    +  ++
Sbjct: 373 VAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLAPSVYRGVR 432

Query: 270 SVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHFR-GADVKLSPSNLFR 325
           +  +    A    G    PGFS +  CY+++   + K P +T+    GA+V +  + +  
Sbjct: 433 AEFTRQFAA---AGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDAAGMLF 489

Query: 326 NISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
            +  +     +  ++    +   + G   Q N  + YD   + + F    C NY
Sbjct: 490 VVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC-NY 542


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 92/232 (39%), Positives = 121/232 (52%), Gaps = 32/232 (13%)

Query: 22  IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
           + +Q  + +    Y M+LSIGTPPV      DTGS   WTQC PC E  C  +  P F P
Sbjct: 77  VSFQTLLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPCTE--CAARPAPPFQP 134

Query: 82  KKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
             SST++ + C+SS C  +TS    C+   C Y + YG G    F++G LATETL     
Sbjct: 135 ASSSTFSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG----FTAGYLATETLHVGGA 190

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
           S      P V FGC  +N    +S    +GI+GLG    SL+SQ+G +   +FSYCL   
Sbjct: 191 S-----FPGVTFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGVA---RFSYCLRSN 238

Query: 196 PDQGSSKINFGGI--VAGAGVVSTPLIIR------DHYYLSLEAISVGNQRL 239
            D G S I FG +  V G  V STPL+         +YY++L  I+VG   L
Sbjct: 239 ADAGDSPILFGSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDL 290


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 177/370 (47%), Gaps = 56/370 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+  + +   +DTGSD TW QCEPC  + C+ Q+ P+F P  SS+Y S+SC+S
Sbjct: 65  YIVTMGLGSTNMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 95  SQCAVV------TSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           S C  +      T  C      C+Y   YG G+Y   ++G L  E L+F       V + 
Sbjct: 121 STCQSLQFATGNTGACGSNPSTCNYVVNYGDGSY---TNGELGVEQLSFGG-----VSVS 172

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKIN 204
           + +FGCG  N       S   G++GLG    SL+SQ   +  G FSYCLP  + G+S   
Sbjct: 173 DFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLPTTESGAS--- 226

Query: 205 FGGIVAG--AGVVS--TPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
            G +V G  + V    TP+          + + Y L+L  I V    L+  S   G + +
Sbjct: 227 -GSLVMGNESSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVPSFGNGGVLI 285

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVT 308
           D+G + T LP   +  LK++        P     + PGFS +  C+N++   +   P ++
Sbjct: 286 DSGTVITRLPSSVYKALKALFLKQFTGFP-----SAPGFSILDTCFNLTGYDEVSIPTIS 340

Query: 309 IHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
           +HF G A++K+  +  F  + ++     +  ++        + G   Q N  + YD +Q+
Sbjct: 341 MHFEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQS 400

Query: 363 MVSFKPSRCT 372
            V F    C+
Sbjct: 401 KVGFAEESCS 410


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 169/389 (43%), Gaps = 48/389 (12%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           + TP +    +     +S   +Y+ + +IGTPP  +   VD   +  WTQC PC    CF
Sbjct: 35  DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
           +Q+ PLFDP KSST+  + C S  C  +   + NC+   C    +Y     A  + G   
Sbjct: 93  EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGKAG 148

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
           T+T    +          + FGC         +    +GI+GLG    SL++QM  +   
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199

Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
            FSYCL  + S  +  G     +AG    STP +I+            +Y + L  I  G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTG 259

Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
              L+  SSS   + +DT    + L    +  LK  ++  +  QPV    A P     LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315

Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----------V 344
           +  +     PE+   F  GA + + P+N      +  +C    G +A++          +
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI-GSSASLNLTGELEGASI 374

Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            G + Q N  + +D+++  +SFKP+ C++
Sbjct: 375 LGSLQQENVHVLFDLKEETLSFKPADCSS 403


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 170/373 (45%), Gaps = 50/373 (13%)

Query: 30  SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           SVD + Y++ L IGTP V     +DTGSD +W QC+PC   +C+ Q+ PLFDP  SS+Y 
Sbjct: 112 SVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYA 171

Query: 89  SISCSSSQCAVVTS-----NCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           S+ C S  C  + +      C+ G    C Y   YG  A    ++G  +TETLT      
Sbjct: 172 SVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG-- 226

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQ 198
             V + +  FGCG           K  G++GLG    SL+SQ  +   G FSYCLP    
Sbjct: 227 --VVVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSG 281

Query: 199 GSSKINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TG 247
           G+  +  G         A AG + TP+     +   Y ++L  ISVG   L    S+ + 
Sbjct: 282 GAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAFSS 341

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK-- 303
            + +D+G + T LP   ++ L+S   + +    +      P    VL  CY+ +      
Sbjct: 342 GMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLL----PPSNGAVLDTCYDFTGHTNVT 397

Query: 304 FPEVTIHFR-GADVKLS-PSNLFRNISDEIMCSAFRGGNANIVYGRIMQIN---FLIGYD 358
            P + + F  GA + L+ P+ +  +      C AF G   +   G I  +N   F + YD
Sbjct: 398 VPTIALTFSGGATIDLATPAGVLVD-----GCLAFAGAGTDDTIGIIGNVNQRTFEVLYD 452

Query: 359 IEQAMVSFKPSRC 371
             +  V F+   C
Sbjct: 453 SGKGTVGFRAGAC 465


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 170/389 (43%), Gaps = 48/389 (12%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           + TP +    +     +S   +Y+ + +IGTPP  +   VD   +  WTQC PC    CF
Sbjct: 35  DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
           +Q+ PLFDP KSST+  + C S  C  +   + NC+   C    +Y     A  + G   
Sbjct: 93  EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGMAG 148

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
           T+T    +          + FGC         +    +GI+GLG    SL++QM  +   
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199

Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
            FSYCL  + S  +  G     +AG    STP +I+            +Y + L  I  G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAG 259

Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
              L+  SSS   + +DT    + L    +  LK  ++  +  QPV    A P     LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315

Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI----------V 344
           ++ +     PE+   F  GA + + P+N      +  +C    G +A++          +
Sbjct: 316 FSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVCLTI-GSSASLNLTGELEGASI 374

Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            G + Q N  + +D+++  +SFKP+ C++
Sbjct: 375 LGSLQQENVHVLFDLKEETLSFKPADCSS 403


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 171/371 (46%), Gaps = 40/371 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+P  + +  VDTGSD  W  C     CP+      +  L+DP  S T N++
Sbjct: 71  LYYTKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAV 130

Query: 91  SCSSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            C    C        S C +   C YS  YG G   S +SG+   ++LTF+  SG     
Sbjct: 131 PCGDGFCTDTYSGPISGCKQDMSCPYSITYGDG---STTSGSFVNDSLTFDEVSGNLHTK 187

Query: 146 PN---VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
           P+   VIFGCG K   S +S+S +   GIIG G  NSS++SQ+  S  +   FS+CL   
Sbjct: 188 PDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSH 247

Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
               I   G V      +TPL+ R  HY + L+ + V  + +      F S S     +D
Sbjct: 248 HGGGIFSIGQVMEPKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIID 307

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQ--PKFPEVTI 309
           +G     LPL  ++ L   +  ++  QP +K +  E  F+   C++ S +    FP V  
Sbjct: 308 SGTTLAYLPLSIYNQL---LPKVLGRQPGLKLMIVEDQFT---CFHYSDKLDEGFPVVKF 361

Query: 310 HFRGADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQA 362
           HF G  + + P +      ++I C  +       + G   I+ G ++  N L+ YD+E  
Sbjct: 362 HFEGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENM 421

Query: 363 MVSFKPSRCTN 373
           ++ +    C++
Sbjct: 422 VIGWTNFNCSS 432


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 171/362 (47%), Gaps = 46/362 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D     DTGS  TWTQC+PC    C+ Q+   FDP KS++YN++SCSS
Sbjct: 135 YVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLG-SCYPQKEQKFDPTKSTSYNNVSCSS 193

Query: 95  SQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C ++ ++   CS  +  C Y  +YG  +Y   S G  ATETLT +S+        N +
Sbjct: 194 ASCNLLPTSERGCSASNSTCLYQIIYGDQSY---SQGFFATETLTISSSD----VFTNFL 246

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPG-------NSSLISQMGTSIAGKFSYCLPDQGSSK 202
           FGCG  N           G+ G   G       + SL SQ       +FSYCLP   SS 
Sbjct: 247 FGCGQSN----------NGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSST 296

Query: 203 --INFGGIVA-GAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLR 257
             +NFGG V+  AG           Y + +  ISV   +L    S  +T    +D+G + 
Sbjct: 297 GYLNFGGKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVI 356

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRGA- 314
           T LP   +  LK      +   P K  G E    D  CY+ S  +   FP+V++ F+G  
Sbjct: 357 TRLPPTAYKALKEAFDEKMSNYP-KTNGDE--LLDT-CYDFSNYTTVSFPKVSVSFKGGV 412

Query: 315 DVKLSPSNLFRNISD-EIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           +V +  S +   ++  +++C AF     +    ++G   Q  + + YD  + M+ F    
Sbjct: 413 EVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGA 472

Query: 371 CT 372
           C+
Sbjct: 473 CS 474


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 112/383 (29%), Positives = 178/383 (46%), Gaps = 62/383 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
           Y + L IGTPP  +    DTGSD  W +C PC   +C  + P   F  + S+TY++I C 
Sbjct: 86  YFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAIHCY 143

Query: 94  SSQCAVV----TSNCSE----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           S QC +V     + C+       C Y + Y   A +S ++G  + E LT N+++G   ++
Sbjct: 144 SPQCQLVPHPHPNPCNRTRLHSPCRYQYTY---ADSSTTTGFFSKEALTLNTSTGKVKKL 200

Query: 146 PNVIFGCGHK----NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
             + FGCG +    +L   + +  Q G++GLG    S  SQ+G     KFSYCL D    
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEGAQ-GVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLS 259

Query: 200 ---SSKINFGG----IVAGAGVVS-TPLIIR----DHYYLSLEAISVGNQRLEFVSS--- 244
              +S +  GG     V+  G++S TPL+I       YY++++ + V   +L    S   
Sbjct: 260 PPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319

Query: 245 ----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK----AQPVKGVGAEPGFSDVLCY 296
                 G   +D+G   T +    ++ +       +K    A+P       PGF   LC 
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPT------PGFD--LCM 371

Query: 297 NIS--SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFR-----GGNANIVYGRI 348
           N+S  ++P  P ++ +  G  V    P N F    D+I C A +     GG +  V G +
Sbjct: 372 NVSGVTRPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFS--VLGNL 429

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
           MQ  FL+ +D +++ + F    C
Sbjct: 430 MQQGFLLEFDRDKSRLGFTRRGC 452


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 114/363 (31%), Positives = 168/363 (46%), Gaps = 53/363 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG PP   +  +DTGSD +W QC PC E  C++Q  P+FDP  S++Y+ I C +
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSE--CYQQSDPIFDPVSSNSYSPIRCDA 206

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C  G C Y   YG G+Y   + G  ATET+T  + +     + NV  GCG
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYGDGSY---TVGEFATETVTLGTAA-----VENVAIGCG 258

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
           H N  L    +     G   L     S  +Q+    A  FSYCL ++ S   S + F   
Sbjct: 259 HNNEGLFVGAAGLLGLGGGKL-----SFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSP 310

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
           +    VV+ PL     +   YYL L+ ISVG + L       E  +   G I +D+G   
Sbjct: 311 LP-RNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAV 369

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQP--KFPEVTIHF 311
           T L  E +  L+           VKG    P  + V     CY++SS+   + P V+ HF
Sbjct: 370 TRLRSEVYDALRDAF--------VKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHF 421

Query: 312 -RGADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
             G ++ L   N    + S    C AF    +++ + G + Q    +G+DI  ++V F  
Sbjct: 422 PEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSA 481

Query: 369 SRC 371
             C
Sbjct: 482 DSC 484


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 162/365 (44%), Gaps = 53/365 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +G+PP + +  +D+GSD  W QC+PC    C++Q  P+FDP  SS++  +SC S
Sbjct: 143 YFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSR--CYQQSDPVFDPADSSSFAGVSCGS 200

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  +  + C+ G C Y   YG G+Y   + G LA ETLT        V + +V  GCG
Sbjct: 201 DVCDRLENTGCNAGRCRYEVSYGDGSY---TKGTLALETLTVGQ-----VMIRDVAIGCG 252

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-----KINFGGI 208
           H N       +   G+ G    + S I Q+G    G FSYCL  +G+      +   G +
Sbjct: 253 HTNQGMFIGAAGLLGLGGG---SMSFIGQLGGQTGGAFSYCLVSRGTGSTGALEFGRGAL 309

Query: 209 VAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVL 256
             GA  +S   +IR+      YY+ L  I VG  R+       +     T  + +DTG  
Sbjct: 310 PVGATWIS---LIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTA 366

Query: 257 RTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTI 309
            T  P    + +  +  +  SN+ +A         PG S    CY+++     + P V+ 
Sbjct: 367 VTRFPTAAYVAFRDSFTAQTSNLPRA---------PGVSIFDTCYDLNGFESVRVPTVSF 417

Query: 310 HFRGADVKLSPSNLFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
           +F    V   P+  F    D     C AF    + + + G I Q    I +D     V F
Sbjct: 418 YFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGF 477

Query: 367 KPSRC 371
            P+ C
Sbjct: 478 GPNIC 482


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 117/387 (30%), Positives = 175/387 (45%), Gaps = 57/387 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP-------CPELDCFKQEPPLFDPKKSSTY 87
           YL+ ++ GTPP ++    DTGSD  W QC         CP+  C ++  P F   KS+T 
Sbjct: 53  YLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRR--PAFVASKSATL 110

Query: 88  NSISCSSSQCAVVTSNCSEG---------DCSYSFLYGRGAYASFSSGNLATETLTF-NS 137
           + + CS++QC +V +    G          C Y++ Y  G   S ++G LA +T T  N 
Sbjct: 111 SVVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADG---SSTTGFLARDTATISNG 167

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD 197
           TSG    +  V FGCG +N     S S   G+IGLG G  S  +Q G+  A  FSYCL D
Sbjct: 168 TSG-GAAVRGVAFGCGTRNQGG--SFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLD 224

Query: 198 -------QGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
                  + SS +  G     A    TPL+        YY+ + AI VGN+ L    S  
Sbjct: 225 LEGGRRGRSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEW 284

Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNI 298
                  G   +D+G   T L L  + +L S  +  +    +    +   F  + LCYN+
Sbjct: 285 AIDVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIP--SSATFFQGLELCYNV 342

Query: 299 S-------SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGR 347
           S       +   FP +TI F +G  ++L   N   +++D++ C A R      A  V G 
Sbjct: 343 SSSSSSAPANGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGN 402

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTNY 374
           +MQ  + + +D   A + F  + C  +
Sbjct: 403 LMQQGYHVEFDRASARIGFARTECVAH 429


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 168/359 (46%), Gaps = 32/359 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  L +GTP  ++   +DTGSD +W QC+PC   DC++Q  P+FDP  SSTY+++ C +
Sbjct: 139 YVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASSTYSAVPCGA 196

Query: 95  SQCAVVTSNCSEG--------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-- 144
            +C  + S+ S          +C Y   Y      S + G+LA +TLT + +        
Sbjct: 197 RECQELASSSSSRNCSSDNNKNCPYEVSYDDD---SHTVGDLARDTLTLSPSPSPSPADT 253

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
           +P  +FGCGH N     +  +  G++GLG G +SL SQ+       FSYCLP   S+   
Sbjct: 254 VPGFVFGCGHSNAG---TFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSAAGY 310

Query: 203 INFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLEFVSSS---TGNIFVDTGVL 256
           ++FGG  A A    T ++       YYL+L  I V  + ++  +S+        +D+G  
Sbjct: 311 LSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDSGTA 370

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-G 313
            + LP   ++ L+S   + +     K   + P F    CY+ +     + P V + F  G
Sbjct: 371 FSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFD--TCYDFTGHETVRIPAVELVFADG 428

Query: 314 ADVKLSPSNLFRNISDEIM-CSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A V L PS +    +D    C AF   +   + G   Q    + YD+    + F    C
Sbjct: 429 ATVHLHPSGVLYTWNDVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKGC 487


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 111/363 (30%), Positives = 164/363 (45%), Gaps = 51/363 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P   ++  +DTGSD TW QC+PC   DC+ Q  P++DP  S++Y ++ C S
Sbjct: 163 YFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATVGCDS 220

Query: 95  SQCAVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            +C  + +     S G C Y   YG G+Y   + G+ ATETLT   ++  PV   NV  G
Sbjct: 221 PRCRDLDAAACRNSTGSCLYEVAYGDGSY---TVGDFATETLTLGDSA--PVS--NVAIG 273

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
           CGH N       +    + G      S IS      A  FSYCL D+    SS + FG  
Sbjct: 274 CGHDNEGLFVGAAGLLALGGGPLSFPSQIS------ATTFSYCLVDRDSPSSSTLQFGDS 327

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
              A  V+ PLI        YY++L  ISVG + L   SS+       +G + VD+G   
Sbjct: 328 EQPA--VTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAV 385

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHF 311
           T L    +  L+           V+G  + P  S V     CY+++  S  + P V + F
Sbjct: 386 TRLQSGAYGALREAF--------VQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWF 437

Query: 312 R-GADVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
             G ++KL   N    +      C AF G +  + + G + Q    + +D  +  V F  
Sbjct: 438 EGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTA 497

Query: 369 SRC 371
            +C
Sbjct: 498 DKC 500


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 114/384 (29%), Positives = 166/384 (43%), Gaps = 64/384 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + IGTPP      +DTGSD  W QC PC   DCF Q  P +DPK+SS++ +I C  
Sbjct: 192 YFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGCHD 249

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---- 143
            +C +V+S             C Y + YG    +S ++G+ A ET T N TS  P     
Sbjct: 250 PRCHLVSSPDPPQPCKAENQTCPYFYWYGD---SSNTTGDFALETFTVNLTS--PAGKSE 304

Query: 144 --EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS 201
              + NV+FGCGH N       +   G+        S  SQ+ +     FSYCL D+ S 
Sbjct: 305 FKRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSD 361

Query: 202 -------------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFV 242
                              ++NF  +VAG      P  +   YY+ +++I VG + L+  
Sbjct: 362 TNVSSKLIFGEDKDLLNHPEVNFTSLVAGK---ENP--VDTFYYVQIKSIMVGGEVLKIP 416

Query: 243 SSS-------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
             +        G   VD+G   +      +  +K      +K  PV  +   P      C
Sbjct: 417 EETWHLSPEGAGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV--IKDFPILDP--C 472

Query: 296 YNISSQPK--FPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIM 349
           YN+S   K   PE  I F  GA       N F  +  +EI+C A  G   +A  + G   
Sbjct: 473 YNVSGVEKMELPEFRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQ 532

Query: 350 QINFLIGYDIEQAMVSFKPSRCTN 373
           Q NF I YD +++ + + P +C +
Sbjct: 533 QQNFHILYDTKKSRLGYAPMKCAD 556


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 175/355 (49%), Gaps = 36/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IG+P V    S+DTGSD +W QC+PC +  C  +   LFDP  SSTY+  SCSS
Sbjct: 122 YVITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQ--CHSEVDSLFDPSSSSTYSPFSCSS 179

Query: 95  SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + CA ++ +     C    C Y   YG  +  + +    +++TLT  S++     M +  
Sbjct: 180 APCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTY---SSDTLTLGSSA-----MTDFQ 231

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
           FGC      S   + +  G++GLG G  SL SQ   +    FSYCLP    S        
Sbjct: 232 FGCSQSE--SGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLTLGT 289

Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLE 263
             +G V TP++    I  +Y + LE+I VG+Q+L   +S  S G++ +D+G + T LP  
Sbjct: 290 GSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSL-MDSGTIITRLPPT 348

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
            +S L S     ++  P     A P G  D  C++ S Q     P VT+ F  GA V L+
Sbjct: 349 AYSALSSAFKAGMQQYPP----ATPSGILDT-CFDFSGQSSISIPTVTLVFSGGAAVDLA 403

Query: 320 PSNLFRNISDEIMCSAF--RGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              +   IS  I C AF   G ++++ + G + Q  F + YD+    V FK   C
Sbjct: 404 FDGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 167/359 (46%), Gaps = 44/359 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G P    +  +DTGSD  W QC+PC   DC++Q  P+F P  SS+Y+ ++C S
Sbjct: 159 YFTRVGVGNPAKSYYMVLDTGSDINWIQCQPCS--DCYQQSDPIFTPAASSSYSPLTCDS 216

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S+C  G C Y   YG G   SF+ G+  TET++F  +      + ++  GCG
Sbjct: 217 QQCNSLQMSSCRNGQCRYQVNYGDG---SFTFGDFVTETMSFGGSG----TVNSIALGCG 269

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
           H N  L    +     G   L     SL SQ+    A  FSYCL ++    SS ++F   
Sbjct: 270 HDNEGLFVGAAGLLGLGGGPL-----SLTSQLK---ATSFSYCLVNRDSAASSTLDFNSA 321

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
             G  V++ PL+    I   YY+ L  +SVG + L       +   S  G + VD G   
Sbjct: 322 PVGDSVIA-PLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAI 380

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGAD 315
           T L  E +++L+    +M  ++ ++       F    CY++S Q   K P V+ HF G  
Sbjct: 381 TRLQSEAYNSLRDSFVSM--SRHLRSTSGVALFD--TCYDLSGQSSVKVPTVSFHFDGGK 436

Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               P+   L    S    C AF    +++ + G + Q    + +D+    V F  ++C
Sbjct: 437 SWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  127 bits (319), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 173/377 (45%), Gaps = 53/377 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTPP      VDTGSD  W     C  CP       +  L+DPK SS+ +++
Sbjct: 82  LYYTEIEIGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTV 141

Query: 91  SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
           SC    CA         C++   C YS +YG G   S ++G   +++L +N  SG     
Sbjct: 142 SCDQKFCAATYGGKLPGCAKNIPCEYSVMYGDG---SSTTGYFVSDSLQYNQVSGDGQTR 198

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
               +VIFGCG +      S ++   GIIG G  N+S++SQ+  +  +   FS+CL    
Sbjct: 199 HANASVIFGCGAQQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCL---- 254

Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
              I  GGI A   VV     STPL+    HY ++LE+I+VG   L+     F +     
Sbjct: 255 -DTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLESINVGGTTLQLPSHMFETGEKKG 313

Query: 249 IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQPKFP 305
             +D+G   T LP L Y    K V++ +    P     +     D LC  Y  S    FP
Sbjct: 314 TIIDSGTTLTYLPELVY----KDVLAAVFAKHPDTTFHS---VQDFLCIQYFQSVDDGFP 366

Query: 306 EVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
           ++T HF   D+ L+  P + F    D + C  F+ G          ++ G ++  N ++ 
Sbjct: 367 KITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKDMVLLGDLVLSNKVVV 425

Query: 357 YDIEQAMVSFKPSRCTN 373
           YD+E  +V +    C++
Sbjct: 426 YDLENQVVGWTDYNCSS 442


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 109/360 (30%), Positives = 165/360 (45%), Gaps = 46/360 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG P    +  +DTGSD  W QC+PC   DC++Q  P+FDP  SS+++ + C +
Sbjct: 160 YFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPCD--DCYQQVDPIFDPASSSSFSRLGCQT 217

Query: 95  SQCA-VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC  +    C    C Y   YG G+Y   + G+ ATET++F ++      +  V  GCG
Sbjct: 218 PQCRNLDVFACRNDSCLYQVSYGDGSY---TVGDFATETVSFGNSG----SVDKVAIGCG 270

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGI 208
           H N  L    +     G   L     SL SQ+    A  FSYCL ++    SS + F   
Sbjct: 271 HDNEGLFVGAAGLIGLGGGPL-----SLTSQIK---ASSFSYCLVNRDSVDSSTLEFNS- 321

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLR 257
              +  V+ P+     +   YY+ +  +SVG ++L       E   S  G I VD G   
Sbjct: 322 AKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAV 381

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGA 314
           T L  + ++ L+     + K  P     +  GF+    CYN+SS+   + P V   F G 
Sbjct: 382 TRLQTQAYNALRDTFVKLTKDLP-----STSGFALFDTCYNLSSRTSVRVPTVAFLFDGG 436

Query: 315 D-VKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             + L PSN    + S    C AF    A++ + G + Q    + YD+  + VSF   +C
Sbjct: 437 KSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 170/381 (44%), Gaps = 59/381 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +GTPP      +DTGSD  W QC PC  + CF+Q  P +DPK SS++ +ISC  
Sbjct: 197 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISCHD 254

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-----NSTSGLP 142
            +C +V++             C Y + YG G   S ++G+ A ET T      N TS L 
Sbjct: 255 PRCQLVSAPDPPKPCKAENQSCPYFYWYGDG---SNTTGDFALETFTVNLTTPNGTSELK 311

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
             + NV+FGCGH N       +   G+        S  SQM +     FSYCL D+    
Sbjct: 312 -HVENVMFGCGHWNRGLFHGAAGLLGLGKG---PLSFASQMQSLYGQSFSYCLVDRNSNA 367

Query: 200 --SSKINFGGIVAGAGVVSTPLI------------IRDHYYLSLEAISVGNQRLE----- 240
             SSK+ FG       ++S P +            +   YY+ ++++ V ++ L+     
Sbjct: 368 SVSSKLIFG---EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEET 424

Query: 241 --FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDVLCYN 297
               S   G   +D+G   T      +  +K      IK  Q V+G+   P      CYN
Sbjct: 425 WHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGL---PPLKP--CYN 479

Query: 298 ISSQPK--FPEVTIHFRGADVKLSP-SNLFRNISDEIMCSAFRGG--NANIVYGRIMQIN 352
           +S   K   P+  I F    V   P  N F  I  E++C A  G   +A  + G   Q N
Sbjct: 480 VSGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQN 539

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
           F I YD++++ + + P +C +
Sbjct: 540 FHILYDMKKSRLGYAPMKCAD 560


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 119/372 (31%), Positives = 175/372 (47%), Gaps = 50/372 (13%)

Query: 30  SVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           +VD + Y++ L IGTP V     +DTGSD +W QC+PC    C+ Q+ PL+DP  SSTY 
Sbjct: 121 AVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYA 180

Query: 89  SISCSSSQCAVVTSNCSEGDCS---------YSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + C S  C  +  +  +  C+         Y   YG       + G  +TETLT +   
Sbjct: 181 PVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGN---RDTTVGVYSTETLTLSPQ- 236

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
              V + +  FGCG   L    +     G++GLG    SL+SQ   +  G FSYCLP  G
Sbjct: 237 ---VSVKDFGFGCG---LVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLP-PG 289

Query: 200 SSKINFGGIVA------GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGN 248
           +S   F  + A       AG + TPL         Y ++L  +SVG + L+   +  +G 
Sbjct: 290 NSTTGFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGG 349

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNIS--SQPKF 304
           + +D+G + T LP   +S L++     + A P+      P   DVL  CYN +  +    
Sbjct: 350 MIIDSGTIITGLPDTAYSALRTAFRTAMSAYPL----LPPNNDDVLDTCYNFTGIANVTV 405

Query: 305 PEVTIHFR-GADVKLS-PSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDI 359
           P V + F  GA + L  PS +   I D   C AF GG ++    + G + Q  F + YD 
Sbjct: 406 PTVALTFDGGATIDLDVPSGVL--IQD---CLAFAGGASDGDVGIIGNVNQRTFEVLYDS 460

Query: 360 EQAMVSFKPSRC 371
            +  V F+P  C
Sbjct: 461 GRGHVGFRPGAC 472


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 166/373 (44%), Gaps = 52/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + IG+PP++     DTGSD  W QC PC   DC+ Q  PLFDP  S++++ + C+S
Sbjct: 123 YLVRVGIGSPPLEQHLVADTGSDVIWVQCSPCS--DCYAQGDPLFDPANSASFSPVPCNS 180

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C               G+C Y   YG     S+++G LA ETLT +  +    E+  V
Sbjct: 181 GVCRAAARYSSSSCGGGGGECEYKVSYGD---KSYTNGVLALETLTLDGGT----EVQGV 233

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
             GCGH+N       ++  G++GLG G  SL+ Q+G +  G FSYCL    S + +  G 
Sbjct: 234 AMGCGHENRG---LFAEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGS 290

Query: 209 V-------AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF-------VSSSTGNIF 250
           +       A  G V  PL+        YY+ +  + V  +RL+             G + 
Sbjct: 291 LVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVV 350

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEV 307
           +DTG   T LP E ++ L+   +   +    +G    PG S    CY++S  +  + P V
Sbjct: 351 MDTGTAVTRLPAEAYAALRGAFAGAFE----EGAPRAPGVSLFDTCYDLSGYASVRVPTV 406

Query: 308 TIHF-------RGADVKLSPSNLFRNISD-EIMCSAFRG-GNANIVYGRIMQINFLIGYD 358
            ++F         A + L   NL   + D    C AF    +   + G I Q    I  D
Sbjct: 407 ALYFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVD 466

Query: 359 IEQAMVSFKPSRC 371
                V F P+ C
Sbjct: 467 SASGYVGFGPATC 479


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 159/368 (43%), Gaps = 39/368 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD +W QC PC    C+ Q+ PLF P  SST++++ C  
Sbjct: 85  YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGE 144

Query: 95  SQCAVVTSNCSE--GD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV------E 144
            +C     +CS   GD  C Y  +YG     S + G+L  +TLT  +T           +
Sbjct: 145 PECPRARQSCSSSPGDDRCPYEVVYGD---KSRTVGHLGNDTLTLGTTPSTNASENNSNK 201

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
           +P  +FGCG  N        K  G+ GLG G  SL SQ        FSYCLP   S+   
Sbjct: 202 LPGFVFGCGENNTG---LFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHG 258

Query: 205 FGGI----VAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSST---GNIFVDT 253
           +  +     A A    TP++ R +    YY+ L  I V  + ++  S        + VD+
Sbjct: 259 YLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDS 318

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK----FPEVT 308
           G + T L    +S L++     + A    G    P  S +  CY+ ++        P V 
Sbjct: 319 GTVITRLAPRAYSALRTA---FLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVA 375

Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMV 364
           + F  GA + +  S +         C AF     G +  + G   Q    + YD+ +  +
Sbjct: 376 LVFAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKI 435

Query: 365 SFKPSRCT 372
            F    C+
Sbjct: 436 GFAAKGCS 443


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 178/373 (47%), Gaps = 59/373 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTY-----NS 89
           Y++ + +G   + +   VDTGSD +W QC+PC    C+ Q+ P+F+P KS +Y     NS
Sbjct: 66  YIVTVELGGRKMTVI--VDTGSDLSWVQCQPCNR--CYNQQDPVFNPSKSPSYRTVLCNS 121

Query: 90  ISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           ++C S Q A   S     +   C+Y   YG G+Y   +SG +  E L   +T+     + 
Sbjct: 122 LTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSY---TSGEVGMEHLNLGNTT-----VN 173

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKI 203
           N IFGCG KN       S   G++GLG  + SLISQ+     G FSYCLP    + S  +
Sbjct: 174 NFIFGCGRKNQGLFGGAS---GLVGLGRTDLSLISQISPMFGGVFSYCLPTTEAEASGSL 230

Query: 204 NFGGI---------VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
             GG          ++   ++  PL+    Y+L+L  I+VG   ++  S     + +D+G
Sbjct: 231 VMGGNSSVYKNTTPISYTRMIHNPLL--PFYFLNLTGITVGGVEVQAPSFGKDRMIIDSG 288

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHF 311
            + + LP   +  LK+         P     + P F  +  C+N+S   + K P++ ++F
Sbjct: 289 TVISRLPPSIYQALKAEFVKQFSGYP-----SAPSFMILDSCFNLSGYQEVKIPDIKMYF 343

Query: 312 RG-ADVKLSPSNLFRNISDEI--MCSAFRGGNANIVY-------GRIMQINFLIGYDIEQ 361
            G A++ +  + +F ++  +   +C A     A++ Y       G   Q N  I YD + 
Sbjct: 344 EGSAELNVDVTGVFYSVKTDASQVCLAI----ASLPYEDEVGIIGNYQQKNQRIIYDTKG 399

Query: 362 AMVSFKPSRCTNY 374
           +M+ F    C+ Y
Sbjct: 400 SMLGFAEEACSFY 412


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 115/359 (32%), Positives = 168/359 (46%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
           Y   + +G P    F   DTGSD +W QC+PC  E  C+KQ  P+FDPK SS+Y+ +SC 
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243

Query: 94  SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S QC ++  + C    C Y   YG G   SF+ G LATET +F  ++ +    PN+  GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDG---SFTVGELATETFSFRHSNSI----PNLPIGC 296

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
           GH N       +   G+ G      SL SQ+    A  FSYCL D   + SS ++F    
Sbjct: 297 GHDNEGLFVGAAGLIGLGGG---AISLSSQLE---ATSFSYCLVDLDSESSSTLDFNADQ 350

Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
               + S PL+  D +    Y+ +  +SVG + L   SSS       +G I VD+G   T
Sbjct: 351 PSDSLTS-PLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGAD 315
            +P + +  L+     + K  P       PG S    CY++SSQ   + P +     G +
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLP-----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGEN 464

Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               P+   LF+  S    C AF      + + G + Q    + YD+  ++V F   +C
Sbjct: 465 SLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 165/370 (44%), Gaps = 40/370 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G PP D +  VDTGSD  W     C+ CP       +  L+DP+ S++   I
Sbjct: 81  LYFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRI 140

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
            C    CA   +   +G      C YS +YG G   S ++G    + L F+  +G L   
Sbjct: 141 YCDDDFCAATYNGVLQGCTKDLPCQYSVVYGDG---SSTAGFFVKDNLQFDRVTGNLQTS 197

Query: 145 MPN--VIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
             N  VIFGCG K      TS     GI+G G  NSS+ISQ+  + AGK    F++CL +
Sbjct: 198 SANGSVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQL--AAAGKVKRVFAHCLDN 255

Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
                I   G V    V +TP++    HY + ++ I VG   LE     F +       +
Sbjct: 256 VKGGGIFAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPTDIFDTGDRRGTII 315

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
           D+G     LP   +   +S+M+ ++  QP +K    E  F+    Y  +    FP V  H
Sbjct: 316 DSGTTLAYLPEVVY---ESMMTKIVSEQPGLKLHTVEEQFT-CFQYTGNVNEGFPVVKFH 371

Query: 311 FRGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQA 362
           F G+  + ++P +    I +E+ C  ++        GR M +       N L+ YD+E  
Sbjct: 372 FNGSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQ 431

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 432 AIGWTDYNCS 441


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 168/377 (44%), Gaps = 53/377 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+PP      +DTGSD  W QC PC   DCF+Q  P +DPK S ++ +I+C+ 
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITCND 253

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP----- 142
            +C +V+S             C Y + YG    +S ++G+ A ET T N TS        
Sbjct: 254 PRCQLVSSPDPPRPCKFETQSCPYFYWYGD---SSNTTGDFALETFTVNLTSSTTGKSEF 310

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
             + NV+FGCGH N       +   G+        S  SQ+ +     FSYCL D+    
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRDSDT 367

Query: 200 --SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------F 241
             SSK+ FG    ++    +  T LI      +   YYL +++I VG ++L+        
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS- 299
            +   G   +D+G       L Y S+    +      + VKG      F  +  CYN+S 
Sbjct: 428 SADGAGGTIIDSGTT-----LSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSG 482

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRG--GNANIVYGRIMQINFL 354
             +  FPE  I F  GA       N F  I   +I+C A  G   +A  + G   Q NF 
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFH 542

Query: 355 IGYDIEQAMVSFKPSRC 371
           I YD + + + + P RC
Sbjct: 543 ILYDTKNSRLGYAPMRC 559


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 168/377 (44%), Gaps = 53/377 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+PP      +DTGSD  W QC PC   DCF+Q  P +DPK S ++ +I+C+ 
Sbjct: 196 YFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITCND 253

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP----- 142
            +C +V+S             C Y + YG    +S ++G+ A ET T N TS        
Sbjct: 254 PRCQLVSSPDPPRPCKFETQSCPYFYWYGD---SSNTTGDFALETFTVNLTSSTTGKSEF 310

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
             + NV+FGCGH N       +   G+        S  SQ+ +     FSYCL D+    
Sbjct: 311 RRVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRDSDT 367

Query: 200 --SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------F 241
             SSK+ FG    ++    +  T LI      +   YYL +++I VG ++L+        
Sbjct: 368 SVSSKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNL 427

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS- 299
            +   G   +D+G       L Y S+    +      + VKG      F  +  CYN+S 
Sbjct: 428 SADGAGGTIIDSGTT-----LSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSG 482

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISD-EIMCSAFRG--GNANIVYGRIMQINFL 354
             +  FPE  I F  GA       N F  I   +I+C A  G   +A  + G   Q NF 
Sbjct: 483 TDELNFPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFH 542

Query: 355 IGYDIEQAMVSFKPSRC 371
           I YD + + + + P RC
Sbjct: 543 ILYDTKNSRLGYAPMRC 559


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/381 (28%), Positives = 170/381 (44%), Gaps = 58/381 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
           Y + L +GTPP  +    DTGSD  W +C  C   +C +  P   F  + S+T++   C 
Sbjct: 89  YFVDLRLGTPPQKLLLVADTGSDLVWVKCSAC--RNCTRHTPGSAFLARHSTTFSPNHCY 146

Query: 94  SSQCAVV----TSNCSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            S C +V       C+       C Y + YG G   S +SG  + ET T N++SG   ++
Sbjct: 147 DSACQLVPLPKHHRCNHARLHSPCRYEYSYGDG---SKTSGFFSKETTTLNTSSGREAKL 203

Query: 146 PNVIFGCGHKNLASPT----SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS 201
             + FGC  + ++ P+    S +   G++GLG G  SL SQ+G     KFSYCL D   S
Sbjct: 204 KGIAFGCAFR-ISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDIS 262

Query: 202 KINFGGIVAGA----------GVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--- 244
                 ++ G+           +  TPL I       YY+ +E++SV   +L    S   
Sbjct: 263 PSPTSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWA 322

Query: 245 ----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK----AQPVKGVGAEPGFSDVLCY 296
                 G   VD+G   T LP   +  + +V+   ++    A+P       PGF   LC 
Sbjct: 323 LDELGNGGTIVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPT------PGFD--LCV 374

Query: 297 NIS--SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIMQ 350
           N+S    P+ P+++    G  V    P N F +  +++ C A +     +   V G +MQ
Sbjct: 375 NVSEIEHPRLPKLSFKLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQ 434

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
             FL+ +D ++  + F    C
Sbjct: 435 QGFLLEFDKDRTRLGFSRHGC 455


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/366 (30%), Positives = 166/366 (45%), Gaps = 52/366 (14%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL-FDPKKS 84
           ++++S    YLM +++G+PP  +    DTGSD  W +C+           P   FDP +S
Sbjct: 92  SKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRS 151

Query: 85  STYNSISCSSSQC-AVVTSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNS----T 138
           STY  +SC +  C A+  + C +G +C+Y + YG G   S ++G L+TET TF+      
Sbjct: 152 STYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDG---SNTTGVLSTETFTFDDGGAGR 208

Query: 139 SGLPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL 195
           S   V +  V FGC      S P       G         SL++Q+G  TS+  +FSYCL
Sbjct: 209 SPRQVRIGGVKFGCSTATAGSFPADGLVGLGGG-----AVSLVTQLGGATSLGRRFSYCL 263

Query: 196 PDQ---GSSKINFGGI--VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
                  SS +NFG +  V   G  STPL              VGN+ +   S+++  I 
Sbjct: 264 VPHSVNASSALNFGALADVTEPGAASTPL--------------VGNKTV--ASAASSRII 307

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFP 305
           VD+G   T L       +   +S  I   PV+     P     LCYN++ +        P
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ----SPDGLLQLCYNVAGREVEAGESIP 363

Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQ 361
           ++T+ F  GA V L P N F  + +  +C A           + G + Q N  +GYD++ 
Sbjct: 364 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDA 423

Query: 362 AMVSFK 367
             V  K
Sbjct: 424 GTVGNK 429



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/157 (28%), Positives = 69/157 (43%), Gaps = 15/157 (9%)

Query: 224 HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           H    L+A +VGN+ +   +SS   I VD+G   T L       +   +S  I   PV+ 
Sbjct: 416 HVGYDLDAGTVGNKTVASAASS--RIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQ- 472

Query: 284 VGAEPGFSDVLCYNISSQP-----KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFR 337
               P     LCYN++ +        P++T+ F G A V L P N F  + +  +C A  
Sbjct: 473 ---SPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIV 529

Query: 338 GGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                    + G + Q N  +GYD++   V+F  + C
Sbjct: 530 ATTEQQPVSILGNLAQQNIHVGYDLDAGTVTFAVADC 566


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/381 (28%), Positives = 176/381 (46%), Gaps = 57/381 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M   IGTPP ++   VDT S+ TW Q   C   +C   + P F+P  SS++ S  C+SS 
Sbjct: 1   MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58

Query: 97  CAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           C   +        N S G CS+   Y  G+ A    G +A E  +  S  G    + +VI
Sbjct: 59  CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEA---YGVIAREIFSLQSWDGAASTLGDVI 115

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS----IAGKFSYCLPDQGSSKINF 205
           FGC  K+L  P   S  +G +GL  G+ S  +Q+G+     ++ +FSYC P++     + 
Sbjct: 116 FGCASKDLQRPVDFS--SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCFPNRAEHLNSS 173

Query: 206 GGIVAGAGVV-----------STPLI--IRDHYYLSLEAISVGNQRLEFVSSS------- 245
           G I+ G   +             P I  I D YY+ L+ ISVG + L    S+       
Sbjct: 174 GVIIFGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLG 233

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---- 301
            G  + D+G   + L    H+ L       +     +  G++  F+  LCY++++     
Sbjct: 234 NGGTYFDSGTTVSFLVEPAHTALVEAFGRRV-LHLNRTSGSD--FTKELCYDVAAGDARL 290

Query: 302 PKFPEVTIHFR-GADVKLSPSNLF----RNISDEIMCSAF------RGGNANIVYGRIMQ 350
           P  P VT+HF+   D++L  ++++    R      +C AF        G  N++ G   Q
Sbjct: 291 PTAPLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVI-GNYQQ 349

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            ++LI +D+E++ + F P+ C
Sbjct: 350 QDYLIEHDLERSRIGFAPANC 370


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 170/370 (45%), Gaps = 45/370 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IGTP  +     DTGSD TW QC+PC +  C++Q+ PLFDP KSSTY  + C +
Sbjct: 126 YVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTD-SCYQQQEPLFDPSKSSTYVDVPCGT 184

Query: 95  SQCAVVTSN---CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            QC +       C    C YS  YG     S + GNLA E  T  S S  P     V+FG
Sbjct: 185 PQCKIGGGQDLTCGGTTCEYSVKYGD---QSVTRGNLAQEAFTL-SPSAPPAA--GVVFG 238

Query: 152 CGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGSSKINFGG 207
           C H+    +     +    G++GLG G+SS++SQ     +G  FSYCLP +GSS    G 
Sbjct: 239 CSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGSSA---GY 295

Query: 208 IVAGAG------VVSTPLIIRDH-----YYLSLEAISVGNQRLEFVSSS--TGNIFVDTG 254
           +  GA       +  TPL+  +      Y ++L  ISV    L   +S+   G + +D+G
Sbjct: 296 LTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV-IDSG 354

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF- 311
            + T +P   +  L+      +    +   G         CY+++       P V + F 
Sbjct: 355 TVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHVESLDT--CYDVTGHDVVTAPPVALEFG 412

Query: 312 RGADVKLSPSNLFRNISDE-------IMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
            GA + +  S +    + +       + C AF   N    ++ G + Q  + + +D+E  
Sbjct: 413 GGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAYNVVFDVEGR 472

Query: 363 MVSFKPSRCT 372
            + F  + C+
Sbjct: 473 RIGFGANGCS 482


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 163/355 (45%), Gaps = 41/355 (11%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCSSSQC 97
           + +G P    F  +DTGSD TW QC PC   + C++Q  P+FDP+ SS+YN +SC S QC
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60

Query: 98  AVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
            ++  + C+   C Y   YG G   SF+ G LATETLTF  ++ +    PN+  GCGH N
Sbjct: 61  QLLDEAGCNVNSCIYKVEYGDG---SFTIGELATETLTFVHSNSI----PNISIGCGHDN 113

Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIVAGAG 213
                      G+ G      S+ SQ+    A  FSYCL D  S   S ++F        
Sbjct: 114 EGLFVGADGLIGLGGG---AISISSQLK---ASSFSYCLVDIDSPSFSTLDFNTDPPSDS 167

Query: 214 VVSTPLIIRDHY----YLSLEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPL 262
           ++S PL+  D +    Y+ +  +SVG +       R E   S  G I VD+G   T LP 
Sbjct: 168 LIS-PLVKNDRFPSFRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPS 226

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGADVKLS 319
           + +  L+     +    P       P  S    CY++SSQ   + P +     G +    
Sbjct: 227 DVYEVLREAFLGLTTNLP-----PAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQL 281

Query: 320 PSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           P+   L +  S    C AF      + + G   Q    + YD+  ++V F  ++C
Sbjct: 282 PAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 94  SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
              CA +     S CS   C Y   YG G   S ++G  +++TLT +++S     +    
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
           FGCGH   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGL 309

Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
           GG    A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T 
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVITR 369

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
           LP   ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V L    +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 426 TVMLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 115/414 (27%), Positives = 175/414 (42%), Gaps = 55/414 (13%)

Query: 1   AQNSQKLPFYNDNETP----KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGS 56
           A ++++L F +    P    KSP+     +   S    Y + L IG PP  +    DTGS
Sbjct: 50  ALDTRRLHFLSLRRKPIPFVKSPVV----SGAASGSGQYFVDLRIGQPPQSLLLIADTGS 105

Query: 57  DCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCSSSQCAVVTSN-----CSE---- 106
           D  W +C  C   +C    P  +F P+ SST++   C    C +V        C+     
Sbjct: 106 DLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPKPDRAPICNHTRIH 163

Query: 107 GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSD 163
             C Y + Y  G   S +SG  A ET +  ++SG    + +V FGCG +      S TS 
Sbjct: 164 STCHYEYGYADG---SLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRISGQSVSGTSF 220

Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG------VVST 217
           +   G++GLG G  S  SQ+G     KFSYCL D   S      ++ G G      +  T
Sbjct: 221 NGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNGGDGISKLFFT 280

Query: 218 PLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLEYHS 266
           PL+        YY+ L+++ V   +L       E   S  G   VD+G     L    + 
Sbjct: 281 PLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNGGTVVDSGTTLAFLAEPAYR 340

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKL-SPS 321
           ++ + +   +K      +   PGF   LC N+S   K     P +   F G  V +  P 
Sbjct: 341 SVIAAVRRRVKLPIADAL--TPGFD--LCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPR 396

Query: 322 NLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N F    ++I C A +  +  +   V G +MQ  FL  +D +++ + F    C 
Sbjct: 397 NYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 450


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 111/380 (29%), Positives = 168/380 (44%), Gaps = 57/380 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +GTPP      +DTGSD  W QC PC  + CF+Q  P +DPK SS++ +ISC  
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISCHD 252

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
            +C +V+S             C Y + YG G   S ++G+ A ET T N T+        
Sbjct: 253 PRCQLVSSPDPPNPCKAENQSCPYFYWYGDG---SNTTGDFALETFTVNLTTPNGKSELK 309

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
            + NV+FGCGH N       +   G+        S  SQM +     FSYCL D+     
Sbjct: 310 HVENVMFGCGHWNRGLFHGAAGLLGLGKG---PLSFASQMQSLYGQSFSYCLVDRNSNAS 366

Query: 200 -SSKINFGGIVAGAGVVSTPLI------------IRDHYYLSLEAISVGNQRLE------ 240
            SSK+ FG       ++S P +            +   YY+ + ++ V ++ L+      
Sbjct: 367 VSSKLIFG---EDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETW 423

Query: 241 -FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDVLCYNI 298
              S   G   +D+G   T      +  +K      IK  + V+G+   P      CYN+
Sbjct: 424 HLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGL---PPLKP--CYNV 478

Query: 299 SSQPK--FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGG--NANIVYGRIMQINF 353
           S   K   P+  I F  GA       N F  I  +++C A  G   +A  + G   Q NF
Sbjct: 479 SGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNF 538

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
            I YD++++ + + P +C +
Sbjct: 539 HILYDMKKSRLGYAPMKCAD 558


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 163/366 (44%), Gaps = 38/366 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD +W QC PC    C+KQ+ PLF P  SST++++ C +
Sbjct: 154 YVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGA 213

Query: 95  SQCAVVTS-NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTF------NSTSGLPVEM 145
            +C    S   S GD  C Y  +YG     S + G+L  +TLT       N+++    ++
Sbjct: 214 RECRARQSCGGSPGDDRCPYEVVYGD---KSRTQGHLGNDTLTLGTMAPANASAENDNKL 270

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
           P  +FGCG  N        +  G+ GLG G  SL SQ        FSYCLP   SS    
Sbjct: 271 PGFVFGCGENNTG---LFGQADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAPGY 327

Query: 203 INFGGIV-AGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTG-V 255
           ++ G  V A A    TP++ R      YY+ L  I V  + +   S      + VD+G V
Sbjct: 328 LSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSGTV 387

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK----FPEVTIH 310
           +  L P  Y +   + +S M K     G    P  S +  CY+ ++        P V + 
Sbjct: 388 ITRLAPRAYRALRAAFLSAMGK----YGYKRAPRLSILDTCYDFTAHANATVSIPAVALV 443

Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           F  GA + +  S +         C AF     G +  + G   Q    + YD+ +  + F
Sbjct: 444 FAGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGF 503

Query: 367 KPSRCT 372
               C+
Sbjct: 504 AAKGCS 509


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 115/379 (30%), Positives = 168/379 (44%), Gaps = 54/379 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + IGTPP      +DTGSD  W QC PC   DCF+Q  P +DPK+SS++ +I C  
Sbjct: 90  YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPCH--DCFEQNGPYYDPKESSSFRNIGCHD 147

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV---- 143
            +C +V+S             C Y + YG    +S ++G+ ATET T N TS  P     
Sbjct: 148 PRCHLVSSPDPPLPCKAENQTCPYFYWYGD---SSNTTGDFATETFTVNLTS--PTGKSE 202

Query: 144 --EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG-- 199
              + NV+FGCGH N       S   G+        S  SQ+ +     FSYCL D+   
Sbjct: 203 FKRVENVMFGCGHWNRGLFHGASGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSD 259

Query: 200 ---SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRL-------E 240
              SSK+ FG    ++    +  T L+      +   YY+ +++I VG + L        
Sbjct: 260 TNVSSKLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWN 319

Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
             S   G   VD+G   +      +  +K      +K  P+  V   P      CYN+S 
Sbjct: 320 MTSDGVGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPI--VQDFPILDP--CYNVSG 375

Query: 301 QPK--FPEVTIHF-RGADVKLSPSNLF-RNISDEIMCSAFRG--GNANIVYGRIMQINFL 354
             K   P+  I F  GA       N F R   +E++C A  G   +A  + G   Q NF 
Sbjct: 376 VEKIDLPDFGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFH 435

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD +++ + + P  C +
Sbjct: 436 VLYDTKKSRLGYAPMNCAD 454


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 180/370 (48%), Gaps = 41/370 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + +  VDTGSD  W  C PCP+            L+D K SST  ++
Sbjct: 76  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNV 135

Query: 91  SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C  + C+ +  +  C ++  CSY  +YG G   S S G+   + +T +  +G     P 
Sbjct: 136 GCEDAFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFVKDNITLDQVTGNLRTAPL 192

Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
              V+FGCG KN +     ++S   GI+G G  N+S+ISQ+  G S+   FS+CL +   
Sbjct: 193 AQEVVFGCG-KNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNG 251

Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF---VSSSTGN--IFVDTG 254
             I   G V    V +TPL+    HY + L+ + V  + ++    ++S+ G+    +D+G
Sbjct: 252 GGIFAIGEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSG 311

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
                LP   ++   S++  +   Q VK    +  F+   C++ +S     FP V +HF 
Sbjct: 312 TTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 365

Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
            + +KLS  P +   ++ +++ C  ++ G          I+ G ++  N L+ YD+E  +
Sbjct: 366 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 424

Query: 364 VSFKPSRCTN 373
           + +    C++
Sbjct: 425 IGWADHNCSS 434


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 175/380 (46%), Gaps = 59/380 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           Y   + IGTPP      VDTGSD  W     C+ CP       +  L+DPK SS+ +++S
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146

Query: 92  CSSSQCAVVTSN------CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           C +  CA    +      C+ G  C Y   YG G   S ++G+  +++L +N  SG    
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDG---SSTAGSFVSDSLQYNQLSGNAQT 203

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
                NVIFGCG +      S ++   GIIG G  N+S +SQ+ ++  +   FS+CL   
Sbjct: 204 RHAKANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCL--- 260

Query: 199 GSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTG 247
               I  GGI A   VV     STPL+    HY ++L++I V    L+     F +S   
Sbjct: 261 --DTIKGGGIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKR 318

Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLC--YNISSQP 302
              +D+G   T LP L Y   L +V      I  + ++G          LC  Y+ S   
Sbjct: 319 GTIIDSGTTLTYLPELVYKDILAAVFQKHQDITFRTIQG---------FLCFEYSESVDD 369

Query: 303 KFPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINF 353
            FP++T HF   D+ L+  P + F    D + C  F+ G          ++ G ++  N 
Sbjct: 370 GFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNK 428

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
           ++ YD+E+ ++ +    C++
Sbjct: 429 VVVYDLEKQVIGWTDYNCSS 448


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 38/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP  D     DTGSD TWTQCEPC +  C+ Q+  +F+P +S++Y +ISC S
Sbjct: 153 YFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVK-SCYNQKEAIFNPSQSTSYANISCGS 211

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + C  + S      NC+   C Y   YG    +SFS G    E L+  +T        + 
Sbjct: 212 TLCDSLASATGNIFNCASSTCVYGIQYGD---SSFSIGFFGKEKLSLTATD----VFNDF 264

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
            FGCG  N       +   G+        SL+SQ        FSYCLP   SS   + FG
Sbjct: 265 YFGCGQNNKGLFGGAAGLLGLG---RDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFLTFG 321

Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G  + +    TPL         Y L L  ISVG ++L    S  ST    +D+G + T L
Sbjct: 322 GSTSKSASF-TPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVITRL 380

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKF--PEVTIHFRGA-DV 316
           P   +S L S    ++   P     A P  S +  C++ S+      P++ + F G   V
Sbjct: 381 PPAAYSALSSTFRKLMSQYP-----AAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVV 435

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            +  + +F       +C AF G +      ++G + Q    + YD     V F P+ C+
Sbjct: 436 DIDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 105/389 (26%), Positives = 180/389 (46%), Gaps = 48/389 (12%)

Query: 11  NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
           N +++  +PI +     + +++  Y++ + +G   + +   VDTGSD +W QC+PC    
Sbjct: 113 NIDDSVDAPIPLTSGIRLQTLN--YIVTVELGGRKMTVI--VDTGSDLSWVQCQPCKR-- 166

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEG--DCSYSFLYGRGAYAS 122
           C+ Q+ P+F+P  S +Y ++ CSS  C  + S       C      C+Y   YG G+Y  
Sbjct: 167 CYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSY-- 224

Query: 123 FSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
            + G L TE L   +++ +     N IFGCG  N       S   G++GLG  + SLISQ
Sbjct: 225 -TRGELGTEHLDLGNSTAVN----NFIFGCGRNNQGLFGGAS---GLVGLGRSSLSLISQ 276

Query: 183 MGTSIAGKFSYCLP---DQGSSKINFGG---IVAGAGVVSTPLIIRD----HYYLSLEAI 232
                 G FSYCLP    + S  +  GG   +      +S   +I +     Y+L+L  I
Sbjct: 277 TSAMFGGVFSYCLPITETEASGSLVMGGNSSVYKNTTPISYTRMIPNPQLPFYFLNLTGI 336

Query: 233 SVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD 292
           +VG+  ++  S     + +D+G + T LP   +  LK          P     + P F  
Sbjct: 337 TVGSVAVQAPSFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFP-----SAPAFMI 391

Query: 293 V-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANI 343
           +  C+N+S   + + P + +HF G A++ +  + +F  +  +     +  ++    N   
Sbjct: 392 LDTCFNLSGYQEVEIPNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVG 451

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           + G   Q N  + YD + +M+ F    CT
Sbjct: 452 IIGNYQQKNQRVIYDTKGSMLGFAAEACT 480


>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
          Length = 739

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 93/243 (38%), Positives = 135/243 (55%), Gaps = 22/243 (9%)

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----PDQ 198
           V  P +  GCG  N  + T DSK  GI+GLG G  SLIS +G SI  K+SYCL       
Sbjct: 56  VSFPKIPIGCGLNN--AGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFN 113

Query: 199 GSSKINFG--GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSST-----GN 248
            +SKINFG   +V G G VSTP+I       YYL LE +SVG++R++FV +ST     GN
Sbjct: 114 STSKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGN 173

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPE 306
           I +D+G   T+L   +++ L++ +   I  + V            LCY    ++  + P 
Sbjct: 174 IIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILS----LCYKSPPNNAIEVPI 229

Query: 307 VTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           +T HF G D+ L+  N F ++ D+ M  AF    +  ++G + Q+N L+GYD+ +  VSF
Sbjct: 230 ITTHFAGVDIVLNSLNTFVSVFDDAMWFAFAPVASGSIFGNLAQMNHLVGYDLLRKTVSF 289

Query: 367 KPS 369
           KP+
Sbjct: 290 KPT 292


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 164/356 (46%), Gaps = 37/356 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD +W QC+PC   DC++Q+ PLFDP  SSTY +++C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPLFDPSLSSTYAAVACGA 206

Query: 95  SQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            +C  +  S C S+  C Y   YG     S + GNL  +TLT +++      +P  +FGC
Sbjct: 207 PECQELDASGCSSDSRCRYEVQYGD---QSQTDGNLVRDTLTLSASD----TLPGFVFGC 259

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
           G +N        +  G+ GLG    SL SQ   S    F+YCLP   S +  ++ GG   
Sbjct: 260 GDQNAG---LFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGG-AP 315

Query: 211 GAGVVSTPL---IIRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEY 264
            A    T L        YY+ L  I VG + +       ++ G   +D+G + T LP   
Sbjct: 316 PANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRA 375

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GADVKLSP 320
           ++ L++  +  + AQ  K     P  S +  CY+ +     + P V + F  GA V L  
Sbjct: 376 YAPLRAAFARSM-AQYKK----APALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDF 430

Query: 321 SNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           + +         C AF   NA+     + G   Q  F + YD+    + F    C+
Sbjct: 431 TGVLYVSKVSQACLAF-APNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 175/378 (46%), Gaps = 57/378 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G PP      +DTGSD TW QC+PC    CF Q  P+FDP +S+++  I C++
Sbjct: 87  YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKA--CFDQSGPVFDPSQSTSFKIIPCNA 144

Query: 95  SQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP--VE 144
           + C +V        +S  S   C Y + YG    +S +SG+LA E+L+  S S  P  +E
Sbjct: 145 AACDLVVHDECRDNSSKTSPKTCKYFYWYGD---SSRTSGDLALESLSV-SLSDHPSSLE 200

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQG---- 199
           + +++ GCGH N           G+     G  S  SQ+ +S  G+ FSYCL D+     
Sbjct: 201 IRDMVIGCGHSNKGLFQGAGGLLGLGQ---GALSFPSQLRSSPIGQSFSYCLVDRTNNLS 257

Query: 200 -SSKINFGGIVAGA----GVVSTPLI-----IRDHYYLSLEAISVGN-------QRLEFV 242
            SS I+FG   A +     +  TP +     +   YYL ++ I +         +R    
Sbjct: 258 VSSAISFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIA 317

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS 299
           ++ +G   +D+G   T L  + +  ++S     I   +A P   +G        +CYN +
Sbjct: 318 TNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDILG--------ICYNAT 369

Query: 300 SQPK--FPEVTIHFR-GADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFL 354
            +    FP ++I F+ GA++ L   N F   +  +   C A    +   + G   Q N  
Sbjct: 370 GRAAVPFPALSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIH 429

Query: 355 IGYDIEQAMVSFKPSRCT 372
             YD++ A + F  + C+
Sbjct: 430 FLYDVQHARLGFANTDCS 447


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 108/376 (28%), Positives = 173/376 (46%), Gaps = 51/376 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + IGTPP      +DTGSD  W QC PC  + CF+Q  P +DPK+SS++ +I+C  
Sbjct: 192 YFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITCHD 249

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
            +C +V+S      C + +  C Y + YG    +S ++G+ A ET T N T+        
Sbjct: 250 PRCKLVSSPDPPKPCKDENQTCPYFYWYGD---SSNTTGDFALETFTVNLTTPNGKSEQK 306

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
            + NV+FGCGH N       +   G+        S  SQ+ +     FSYCL D+     
Sbjct: 307 HVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFASQLQSIYGHSFSYCLVDRNSDTS 363

Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSST--- 246
            SSK+ FG    +++   +  T  +      +   YY+ +++I V  + L+    +    
Sbjct: 364 VSSKLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLS 423

Query: 247 ----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL-CYNIS-- 299
               G   +D+G   T      +  +K         + +KG     GF  +  CYN+S  
Sbjct: 424 KEGGGGTIIDSGTTLTYFAEPAYEIIKEAF-----MKKIKGYELVEGFPPLKPCYNVSGI 478

Query: 300 SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLIG 356
            + + P+  I F  GA       N F  I  +++C A  G   +A  + G   Q NF I 
Sbjct: 479 EKMELPDFGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHIL 538

Query: 357 YDIEQAMVSFKPSRCT 372
           YD++++ + + P +CT
Sbjct: 539 YDMKKSRLGYAPMKCT 554


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 99/344 (28%), Positives = 154/344 (44%), Gaps = 39/344 (11%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-----AVVTSNCSE 106
           +DTGS  +W QC+PC  + C  Q  PL+DP  S TY  +SC+S +C     A +     E
Sbjct: 3   LDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLCE 61

Query: 107 GD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
            D   C Y+  YG     SFS G L+ + LT  S+  L    P   +GCG  N       
Sbjct: 62  TDSNACLYTASYGD---TSFSIGYLSQDLLTLTSSQTL----PQFTYGCGQDNQG---LF 111

Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGGIVAGAGVVSTPL 219
            +  GIIGL     S+++Q+ T     FSYCLP              G ++      TP+
Sbjct: 112 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPM 171

Query: 220 IIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTLLPLEYHSNLKSVMSN 274
           +        Y+L L AI+V  + L+  ++       +D+G + T LP+  ++ L+     
Sbjct: 172 LTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVK 231

Query: 275 MIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDE 330
           ++  +  K     P +S +  C+  ++ S    PE+ + F+ GAD+ L   ++       
Sbjct: 232 IMSTKYAKA----PAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKG 287

Query: 331 IMCSAFRGG---NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           I C AF G    N   + G   Q  + I YD+  + + F P  C
Sbjct: 288 ITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 110/378 (29%), Positives = 170/378 (44%), Gaps = 55/378 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  L+DPK SST + +
Sbjct: 3   LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 62

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    CA        G      C YS  YG G   S ++G   ++ L F+  SG     
Sbjct: 63  SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 119

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P    V FGCG +      +S+    GIIG G  N+S++SQ+  S AGK    F++CL  
Sbjct: 120 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 175

Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
                IN GGI A   VV     +TPL+    HY ++L++I VG   L+     F +   
Sbjct: 176 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 232

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
               +D+G   T LP   +  +  +++   K + +     +    + LC+    +    F
Sbjct: 233 KGTIIDSGTTLTYLPEIVYKEI--MLAVFAKHKDITFHNVQ----EFLCFQYVGRVDDDF 286

Query: 305 PEVTIHFRGADVKLS--PSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
           P++T HF   D+ L+  P + F    D + C  F       + G   ++ G ++  N L+
Sbjct: 287 PKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLV 345

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD+E  ++ +    C++
Sbjct: 346 VYDLENQVIGWTEYNCSS 363


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 111/378 (29%), Positives = 166/378 (43%), Gaps = 55/378 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  L+DPK SST + +
Sbjct: 88  LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 147

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    CA        G      C YS  YG G   S ++G   ++ L F+  SG     
Sbjct: 148 SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 204

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P    V FGCG +      +S+    GIIG G  N+S++SQ+  S AGK    F++CL  
Sbjct: 205 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 260

Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
                IN GGI A   VV     +TPL+    HY ++L++I VG   L+     F +   
Sbjct: 261 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 317

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKF 304
               +D+G   T LP   +  +   +    K      V       + LC+    +    F
Sbjct: 318 KGTIIDSGTTLTYLPEIVYKEIMLAVFAKHKDITFHNV------QEFLCFQYVGRVDDDF 371

Query: 305 PEVTIHFRGADVKLS--PSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
           P++T HF   D+ L+  P + F    D + C  F       + G   ++ G ++  N L+
Sbjct: 372 PKITFHFEN-DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLV 430

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD+E  ++ +    C++
Sbjct: 431 VYDLENQVIGWTEYNCSS 448


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 104/373 (27%), Positives = 160/373 (42%), Gaps = 41/373 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
           Y + L IG PP  +    DTGSD  W +C  C   +C    P  +F P+ SST++   C 
Sbjct: 83  YFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCY 140

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAY------ASFSSGNLATETLTFNSTSGLPVEMPN 147
              C +V        C+++ ++    Y       S +SG  A ET +  ++SG   ++ +
Sbjct: 141 DPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKS 200

Query: 148 VIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
           V FGCG +      S TS +   G++GLG G  S  SQ+G     KFSYCL D   S   
Sbjct: 201 VAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPP 260

Query: 205 FGGIVAGAG------VVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTG 247
              ++ G G      +  TPL+        YY+ L+++ V   +L       E   S  G
Sbjct: 261 TSYLIIGDGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDDSGNG 320

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---- 303
              +D+G     L    +  + + +   IK      +   PGF   LC N+S   K    
Sbjct: 321 GTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADEL--TPGFD--LCVNVSGVTKPEKI 376

Query: 304 FPEVTIHFRGADVKL-SPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDI 359
            P +   F G  V +  P N F    ++I C A +  +  +   V G +MQ  FL  +D 
Sbjct: 377 LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDR 436

Query: 360 EQAMVSFKPSRCT 372
           +++ + F    C 
Sbjct: 437 DRSRLGFSRRGCA 449


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 170/366 (46%), Gaps = 51/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+P    +  +DTGSD  W QC PC    C+KQ   +FDP+ SS++  +SCS+
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKS--CYKQNDAVFDPRASSSFRRLSCST 71

Query: 95  SQCAVV-TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            QC ++    C+  D  C Y   YG G   SF+ G+LA+++ + +     P     V+FG
Sbjct: 72  PQCKLLDVKACASTDNRCLYQVSYGDG---SFTVGDLASDSFSVSRGRTSP-----VVFG 123

Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQG---SSKIN 204
           CGH N  L    +     G   L     S  SQ+ +    KFSYCL   D G   SS + 
Sbjct: 124 CGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSSR---KFSYCLVSRDNGVRASSALL 175

Query: 205 FG--GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IF 250
           FG   +   A    T L+    +   YY  L  IS+G   L        +SSSTG   + 
Sbjct: 176 FGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVI 235

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVT 308
           +D+G   T LP   ++ ++    +  +  P     A+    D  CY+ S  +    P V+
Sbjct: 236 IDSGTSVTRLPTYAYTVMRDAFRSATQKLP---RAADFSLFDT-CYDFSALTSVTIPTVS 291

Query: 309 IHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
            HF  GA V+L PSN    + +    C AF   + ++ + G I Q    +  D++ + V 
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351

Query: 366 FKPSRC 371
           F P +C
Sbjct: 352 FAPRQC 357


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 166/359 (46%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCP-ELDCFKQEPPLFDPKKSSTYNSISCS 93
           Y   + +G P    F   DTGSD +W QC+PC  E  C+KQ  P+FDPK SS+Y+ +SC 
Sbjct: 184 YFARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCD 243

Query: 94  SSQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S QC ++  + C    C Y   YG G   SF+ G LATET +F  ++ +    PN+  GC
Sbjct: 244 SEQCHLLDEAACDANSCIYEVEYGDG---SFTVGELATETFSFRHSNSI----PNLPIGC 296

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSSKINFGGIV 209
           GH N           G+ G      SL SQ+    A  FSYCL D   + SS ++F    
Sbjct: 297 GHDNEGLFVGADGLIGLGGG---AISLSSQLE---ATSFSYCLVDLDSESSSTLDFNADQ 350

Query: 210 AGAGVVSTPLIIRDHY----YLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLRT 258
               + S PL+  D +    Y+ +  +SVG + L   SSS       +G I VD+G   T
Sbjct: 351 PSDSLTS-PLVKNDRFPTFRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEVTIHFRGAD 315
            +P + +  L+     + K  P       PG S    CY++SSQ   + P +     G +
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLP-----PAPGVSPFDTCYDLSSQSNVEVPTIAFILPGEN 464

Query: 316 VKLSPSN--LFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               P+   L +  S    C AF      + + G + Q    + YD+  ++V F   +C
Sbjct: 465 SLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 114/369 (30%), Positives = 165/369 (44%), Gaps = 52/369 (14%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D  +L+ ++ GTPP      +DTGS  TWTQC+ C  + C K     FD   SSTY+  S
Sbjct: 124 DGNFLVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGS 181

Query: 92  CSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           C  S              +Y+  YG     S S GN   +T+T   +           FG
Sbjct: 182 CIPSTVG----------NTYNMTYGD---KSTSVGNYGCDTMTLEPSDVF----QKFQFG 224

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------------ 199
           CG  N     S +   G++GLG G  S +SQ  +     FSYCLP++             
Sbjct: 225 CGRNNEGDFGSGAD--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKAT 282

Query: 200 --SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTG 254
             SS + F  +V G G  ++ L    +Y++ L  ISVGN+RL   SS   S G I +D+G
Sbjct: 283 SQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLDISVGNKRLNIPSSVFASPGTI-IDSG 339

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPV-KGVGAEPGFSDVLCYNISSQPK--FPEVTIHF 311
            + T LP   +S LK+     +   P+  G   E    D  CYN+S +     PE  +HF
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDT-CYNLSGRKDVLLPEXVLHF 398

Query: 312 -RGADVKLSPSNLFRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMV 364
             GADV+L+   +        +C AF G + +       + G   Q++  + YDI    +
Sbjct: 399 GDGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRI 458

Query: 365 SFKPSRCTN 373
            F  + C+N
Sbjct: 459 GFGGNGCSN 467


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 120/409 (29%), Positives = 180/409 (44%), Gaps = 73/409 (17%)

Query: 17  KSPISI--IYQAEIISVDDIYLMH-------LSIGTPPVDIFGSVDTGSDCTWTQCEPCP 67
           ++PIS   ++     + D +   H       L+ GTP  +I   +DTGS+ +W  C+  P
Sbjct: 40  RTPISTPRLFSTTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP 99

Query: 68  ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGD-CSYSFLYGRGAY 120
             +       +F+P  S TY  I CSS  C   T +      C     C +   Y   A 
Sbjct: 100 NFNS------IFNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISY---AD 150

Query: 121 ASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSL 179
           AS   GNLA ET    S +G     P  +FGC     +S +  D+K TG++G+  G+ S 
Sbjct: 151 ASSVEGNLAFETFRVGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSF 205

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV----------VSTPLIIRDH--YYL 227
           ++QMG     KFSYC+ D+ SS +   G  + + +          +STPL   D   Y +
Sbjct: 206 VNQMGFR---KFSYCISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSV 262

Query: 228 SLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
            LE I V ++ L      FV   T  G   VD+G   T L    +S LK     +++ + 
Sbjct: 263 QLEGIRVSDKVLSLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEF--LLQTKG 320

Query: 281 VKGVGAEPGF----SDVLCYNI----SSQPKFPEVTIHFRGADVKLSPSNLFRNI----- 327
           V  V  EP +    +  LCY I    ++ P  P V + FRGA++ +S   L   +     
Sbjct: 321 VLRVLNEPRYVFQGAMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVR 380

Query: 328 -SDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             D + C  F      G  + V G   Q N  + YD+E++ + F   RC
Sbjct: 381 GKDSVWCFTFGNSDSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 115/379 (30%), Positives = 170/379 (44%), Gaps = 58/379 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  L+DP  SS+   +
Sbjct: 80  LYFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGV 139

Query: 91  SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
           +C    C      V+ S      C YS  YG G   S ++G   T+ L +N  SG     
Sbjct: 140 TCGQDFCVATHGGVIPSCVPAAPCQYSISYGDG---SSTTGFFVTDFLQYNQVSGNSQTT 196

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           +   ++ FGCG K      S S+   GI+G G  NSS++SQ+  + AGK    F++CL  
Sbjct: 197 LANTSITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQL--AAAGKVRKVFAHCL-- 252

Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF------VSSS 245
                IN GGI A   VV     +TPL+    HY ++LEAI VG  +L+       +  S
Sbjct: 253 ---DTINGGGIFAIGDVVQPKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGES 309

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
            G I +D+G     LP   ++ + S +       P+K         D  C+  S      
Sbjct: 310 KGTI-IDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKND------QDFQCFRYSGSVDDG 362

Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFL 354
           FP +T HF G   + + P + LF+N   E+ C  F+ G      G+ M +       N L
Sbjct: 363 FPIITFHFEGGLPLNIHPHDYLFQN--GELYCMGFQTGGLQTKDGKDMVLLGDLAFSNRL 420

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD+E  ++ +    C++
Sbjct: 421 VLYDLENQVIGWTDYNCSS 439


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/363 (31%), Positives = 164/363 (45%), Gaps = 53/363 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG PP   +  +DTGSD +W QC PC E  C++Q  P+FDP  S++Y+ I C  
Sbjct: 149 YFLRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSE--CYQQSDPIFDPISSNSYSPIRCDE 206

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C  G C Y   YG G+Y   + G  ATET+T  S +     + NV  GCG
Sbjct: 207 PQCKSLDLSECRNGTCLYEVSYGDGSY---TVGEFATETVTLGSAA-----VENVAIGCG 258

Query: 154 HKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGI 208
           H N  L    +     G   L     S  +Q+    A  FSYCL ++ S   S + F   
Sbjct: 259 HNNEGLFVGAAGLLGLGGGKL-----SFPAQVN---ATSFSYCLVNRDSDAVSTLEFNSP 310

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
           +      + PL+    +   YYL L+ ISVG + L    SS        G I +D+G   
Sbjct: 311 LP-RNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAV 369

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQPKFPEVTIHFR- 312
           T L  E +  L+           VKG    P  + V     CY++SS+      T+ FR 
Sbjct: 370 TRLRSEVYDALRDAF--------VKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRF 421

Query: 313 --GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKP 368
             G ++ L   N    + S    C AF    +++ + G + Q    +G+DI  ++V F  
Sbjct: 422 PEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSV 481

Query: 369 SRC 371
             C
Sbjct: 482 DSC 484


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 164/356 (46%), Gaps = 37/356 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP        DTGSD +W QC+PC   DC++Q+ PLFDP  SSTY +++C +
Sbjct: 149 YVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPLFDPSLSSTYAAVACGA 206

Query: 95  SQCAVV-TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            +C  +  S C S+  C Y   YG     S + GNL  +TLT +++      +P  +FGC
Sbjct: 207 PECQELDASGCSSDSRCRYEVQYGD---QSQTDGNLVRDTLTLSASD----TLPGFVFGC 259

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVA 210
           G +N        +  G+ GLG    SL SQ   S    F+YCLP   S +  ++ GG   
Sbjct: 260 GDQNAG---LFGQVDGLFGLGREKVSLPSQGAPSYGPGFTYCLPSSSSGRGYLSLGG-AP 315

Query: 211 GAGVVSTPL---IIRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEY 264
            A    T L        YY+ L  I VG + +       ++ G   +D+G + T LP   
Sbjct: 316 PANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDSGTVITRLPPRA 375

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GADVKLSP 320
           ++ L++  +  + AQ  K     P  S +  CY+ +     + P V + F  GA V L  
Sbjct: 376 YAPLRAAFARSM-AQYKK----APALSILDTCYDFTGHRTAQIPTVELAFAGGATVSLDF 430

Query: 321 SNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           + +         C AF   NA+     + G   Q  F + YD+    + F    C+
Sbjct: 431 TGVLYVSKVSQACLAF-APNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 56/372 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L IGTP V     +DTGSD +W QC+PC   +C+ Q+ PLFDP  SS+Y S+ C S
Sbjct: 171 YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 230

Query: 95  SQCAVVT-----------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
             C  +            S  +   C Y   YG  A    ++G  +TETLT        V
Sbjct: 231 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG----V 283

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
            + +  FGCG           K  G++GLG    SL+SQ  +   G FSYCLP    G+ 
Sbjct: 284 VVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 340

Query: 202 KINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIF 250
            +  G         A +G+  TP+     +   Y ++L  ISVG   L    S+ +  + 
Sbjct: 341 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 400

Query: 251 VDTGVLRTLLPLEYHSNL----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
           +D+G + T LP   ++ L    +S MS      P  G     G  D  CY+ +       
Sbjct: 401 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-----GVLDT-CYDFTGHANVTV 454

Query: 305 PEVTIHFR-GADVKL-SPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDI 359
           P +++ F  GA + L +P+ +  +      C AF G    NA  + G + Q  F + YD 
Sbjct: 455 PTISLTFSGGATIDLAAPAGVLVD-----GCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 509

Query: 360 EQAMVSFKPSRC 371
            +  V F+   C
Sbjct: 510 GKGTVGFRAGAC 521


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 180/379 (47%), Gaps = 66/379 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++GTPP ++   +DTGS+ +W +C    +   F+     FDP +SS+Y+ + CSS  C 
Sbjct: 89  LTVGTPPQNVSMVLDTGSELSWLRCN---KTQTFQTT---FDPNRSSSYSPVPCSSLTCT 142

Query: 99  VVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             T      ++C      ++ L    A AS S GNLA++T    ++     +MP  IFGC
Sbjct: 143 DRTRDFPIPASCDSNQLCHAIL--SYADASSSEGNLASDTFYIGNS-----DMPGTIFGC 195

Query: 153 GHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI------NF 205
              + ++ T  DSK TG++G+  G+ S +SQM      KFSYC+ D   S +      NF
Sbjct: 196 MDSSFSTNTEEDSKNTGLMGMNRGSLSFVSQMDFP---KFSYCISDSDFSGVLLLGDANF 252

Query: 206 GGIVAGAGV----VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFVD 252
             ++         +STPL   D   Y + LE I V ++ L      FV   T  G   VD
Sbjct: 253 SWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVD 312

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQPKF 304
           +G   T L    +S L++   N  +   +  V  +P +       LCY +    +S P  
Sbjct: 313 SGTQFTFLLGPVYSALRNEFLN--QTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPWL 370

Query: 305 PEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQIN 352
           P V++ FRGA++K+S   L   +      SD + C  F  GN+++      V G   Q N
Sbjct: 371 PTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTF--GNSDLLAVEAYVIGHHHQQN 428

Query: 353 FLIGYDIEQAMVSFKPSRC 371
             + +D+E++ + F   +C
Sbjct: 429 VWMEFDLEKSRIGFAQVQC 447


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 45/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P       +DTGSD +W QC+PC +  C  Q  PLFDP  SSTY+  SC S
Sbjct: 52  YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 109

Query: 95  SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + CA +        S   C Y   YG G   S ++G  +++TL   S++     + +  F
Sbjct: 110 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 161

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
           GC   N+ S  +D +  G++GLG G  SL+SQ   ++   FSYCLP   SS     +   
Sbjct: 162 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 218

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G    +G V TP++    +   Y + L+AI VG ++L   +S  S G + +D+G + T L
Sbjct: 219 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 277

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
           P   +S L S     +K  P     A+P G  D  C++ S Q     P V + F  GA V
Sbjct: 278 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 332

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L  S +  +      C AF G + +    + G + Q  F + YD+ + +V F+   C
Sbjct: 333 SLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 113/372 (30%), Positives = 166/372 (44%), Gaps = 56/372 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L IGTP V     +DTGSD +W QC+PC   +C+ Q+ PLFDP  SS+Y S+ C S
Sbjct: 91  YVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDS 150

Query: 95  SQCAVVT-----------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
             C  +            S  +   C Y   YG  A    ++G  +TETLT        V
Sbjct: 151 DACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRAT---TTGVYSTETLTLKPG----V 203

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
            + +  FGCG           K  G++GLG    SL+SQ  +   G FSYCLP    G+ 
Sbjct: 204 VVADFGFGCGDHQHG---PYEKFDGLLGLGGAPESLVSQTSSQFGGPFSYCLPPTSGGAG 260

Query: 202 KINFGG------IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-TGNIF 250
            +  G         A +G+  TP+     +   Y ++L  ISVG   L    S+ +  + 
Sbjct: 261 FLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAFSSGMV 320

Query: 251 VDTGVLRTLLPLEYHSNL----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
           +D+G + T LP   ++ L    +S MS      P  G     G  D  CY+ +       
Sbjct: 321 IDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-----GVLDT-CYDFTGHANVTV 374

Query: 305 PEVTIHFR-GADVKL-SPSNLFRNISDEIMCSAFRGG---NANIVYGRIMQINFLIGYDI 359
           P +++ F  GA + L +P+ +  +      C AF G    NA  + G + Q  F + YD 
Sbjct: 375 PTISLTFSGGATIDLAAPAGVLVD-----GCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 429

Query: 360 EQAMVSFKPSRC 371
            +  V F+   C
Sbjct: 430 GKGTVGFRAGAC 441


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 116/361 (32%), Positives = 167/361 (46%), Gaps = 42/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQC+PC    C+KQ+  +FDP +S++Y +ISCSS
Sbjct: 149 YIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCAR-SCYKQKEQIFDPSQSTSYTNISCSS 207

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C  +TS       C+   C Y   YG    +SFS G   TE LT  ST        N+
Sbjct: 208 SICNSLTSATGNTPGCASSACVYGIQYGD---SSFSVGFFGTEKLTLTSTDAF----NNI 260

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
            FGCG  N       +   G+        S++SQ        FSYCLP   SS   + FG
Sbjct: 261 YFGCGQNNQGLFGGSAGLLGLG---RDKLSVVSQTAQKYNKIFSYCLPSSSSSTGFLTFG 317

Query: 207 GIVAGAGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G  A      TPL         Y L    ISVG ++L   +S  ST    +D+G + T L
Sbjct: 318 G-SASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAIIDSGTVITRL 376

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKF--PEVTIHF-RGAD 315
           P   +S L++   N++   P+    +      +L  CY+ SS      P++   F  G +
Sbjct: 377 PPAAYSALRASFRNLMSKYPMTKALS------ILDTCYDFSSYTTISVPKIGFSFSSGIE 430

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           V +  + +    S   +C AF  GN++     ++G + Q    + YD     V F P  C
Sbjct: 431 VDIDATGILYASSLSQVCLAF-AGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489

Query: 372 T 372
           +
Sbjct: 490 S 490


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 162/357 (45%), Gaps = 43/357 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +  GTP   ++  +DTGSD  W  C+ C          P+FDP KSS+Y   +C S
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC---HSTAPIFDPAKSSSYKPFACDS 171

Query: 95  SQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  ++ NC     C +  LYG G       G LA++ +T  S       +PN  FGC 
Sbjct: 172 QPCQEISGNCGGNSKCQFEVLYGDGTQV---DGTLASDAITLGSQ-----YLPNFSFGCA 223

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG-- 211
            ++L+  T  S     +G G  +    +       G FSYCLP   +S  +   +V G  
Sbjct: 224 -ESLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKE 279

Query: 212 AGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTL 259
           A V S+ L    +I+D      Y+++L+AISVGN R+   ++   S G   +D+G   T 
Sbjct: 280 AAVSSSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITY 339

Query: 260 LPLEYHSNLKSVMSNM---IKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHF-RGA 314
           L    + +L+         ++  PV+ +          CY++SS     P +T+H  R  
Sbjct: 340 LVPSAYKDLRDAFRQQLSSLQPTPVEDMDT--------CYDLSSSSVDVPTITLHLDRNV 391

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           D+ L   N+       + C AF   ++  + G + Q N+ I +D+  + V F   +C
Sbjct: 392 DLVLPKENILITQESGLSCLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 110/357 (30%), Positives = 160/357 (44%), Gaps = 38/357 (10%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           L++LSIG P +     +DTGSD  W  C PC   +C      LFDP  SST++ +    +
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCT--NCDNHLGLLFDPSMSSTFSPL--CKT 157

Query: 96  QCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
            C      C     + S++    A     SG    + L F +T     ++ +VI GCGH 
Sbjct: 158 PCGFKGCKCDPIPFTISYVDNSSA-----SGTFGRDILVFETTDEGTSQISDVIIGCGHN 212

Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV- 214
                 SD    GI+GL  G +SL +Q+G     KFSYC+ +      N+  +  G G  
Sbjct: 213 --IGFNSDPGYNGILGLNNGPNSLATQIGR----KFSYCIGNLADPYYNYNQLRLGEGAD 266

Query: 215 ---VSTPL-IIRDHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPLE 263
               STP  +    YY+++E ISVG +RL       E   + TG + +D+G   T L   
Sbjct: 267 LEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVDS 326

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-RGADVKLSP 320
            H  L + + N++K    + V  E     +  Y I S+    FP VT HF  GAD+ L  
Sbjct: 327 AHKLLYNEVRNLLKWS-FRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385

Query: 321 SNLFRNISDEIMCSAFRGG---NANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + F    D+I C         N  I   V G + Q ++ +GYD+    V F+   C
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 112/208 (53%), Gaps = 21/208 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP  + +  +DTGSD  W QCEPC E  C+ Q  P+F+P  S++++++ C S
Sbjct: 157 YFTRIGVGTPTREQYMVLDTGSDVAWIQCEPCRE--CYSQADPIFNPSYSASFSTVGCDS 214

Query: 95  SQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + C+ + + +C  G C Y   YG G+Y   S+G+ ATETLTF +TS     + NV  GCG
Sbjct: 215 AVCSQLDAYDCHSGGCLYEASYGDGSY---STGSFATETLTFGTTS-----VANVAIGCG 266

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---INFGGIVA 210
           HKN+      +   G+        S  +Q+GT     FSYCL D+ S     + FG    
Sbjct: 267 HKNVGLFIGAAGLLGLGAG---ALSFPNQIGTQTGHTFSYCLVDRESDSSGPLQFGPKSV 323

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISV 234
             G + TPL    H    YYLS+ AIS+
Sbjct: 324 PVGSIFTPLEKNPHLPTFYYLSVTAISI 351


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 169/366 (46%), Gaps = 51/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+P    +  +DTGSD  W QC PC    C+KQ   +FDP+ SS++  +SCS+
Sbjct: 14  YFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKS--CYKQNDAVFDPRASSSFRRLSCST 71

Query: 95  SQCAVV-TSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
            QC ++    C+  D  C Y   YG G   SF+ G+LA+++   +     P     V+FG
Sbjct: 72  PQCKLLDVKACASTDNRCLYQVSYGDG---SFTVGDLASDSFLVSRGRTSP-----VVFG 123

Query: 152 CGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQG---SSKIN 204
           CGH N  L    +     G   L     S  SQ+ +    KFSYCL   D G   SS + 
Sbjct: 124 CGHDNEGLFVGAAGLLGLGAGKL-----SFPSQLSSR---KFSYCLVSRDNGVRASSALL 175

Query: 205 FG--GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IF 250
           FG   +   A    T L+    +   YY  L  IS+G   L        +SSSTG   + 
Sbjct: 176 FGDSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVI 235

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVT 308
           +D+G   T LP   ++ ++    +  +  P     A+    D  CY+ S  +    P V+
Sbjct: 236 IDSGTSVTRLPTYAYTVMRDAFRSATQKLP---RAADFSLFDT-CYDFSALTSVTIPTVS 291

Query: 309 IHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
            HF  GA V+L PSN    + +    C AF   + ++ + G I Q    +  D++ + V 
Sbjct: 292 FHFEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVG 351

Query: 366 FKPSRC 371
           F P +C
Sbjct: 352 FAPRQC 357


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 173/378 (45%), Gaps = 57/378 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G PP      +DTGSD TW QC+PC    CF Q  P+FDP +S+++  I C++
Sbjct: 171 YFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPCKA--CFDQSGPVFDPSQSTSFKIIPCNA 228

Query: 95  SQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP--VE 144
           + C +V        +S  S   C Y + YG    +S +SG+LA E+L+  S S  P  +E
Sbjct: 229 AACDLVVHDECRDNSSKTSPKTCKYFYWYGD---SSRTSGDLALESLSV-SLSDHPSSLE 284

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQG---- 199
           + +++ GCGH N           G+        S  SQ+ +S  G+ FSYCL D+     
Sbjct: 285 IRDMVIGCGHSNKGLFQGAGGLLGLGQG---ALSFPSQLRSSPIGQSFSYCLVDRTNNLS 341

Query: 200 -SSKINFGGIVAGA----GVVSTPLI-----IRDHYYLSLEAISVGN-------QRLEFV 242
            SS I+FG   A +     +  TP +     +   YYL ++ I +         +R    
Sbjct: 342 VSSAISFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIA 401

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNIS 299
            + +G   +D+G   T L  + +  ++S     I   +A P   +G        +CYN +
Sbjct: 402 PNGSGGTIIDSGTTLTYLNRDAYRAVESAFLARISYPRADPFDILG--------ICYNAT 453

Query: 300 SQPK--FPEVTIHFR-GADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFL 354
            +    FP ++I F+ GA++ L   N F   +  +   C A    +   + G   Q N  
Sbjct: 454 GRTAVPFPTLSIVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIH 513

Query: 355 IGYDIEQAMVSFKPSRCT 372
             YD++ A + F  + C+
Sbjct: 514 FLYDVQHARLGFANTDCS 531


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 171/379 (45%), Gaps = 57/379 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP   +  VDTGSD  W     C+ CP       +  L+DPK SST +++
Sbjct: 87  LYYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTV 146

Query: 91  SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            C    CA         CS    C YS  YG G   S + G+   + L F+  +G     
Sbjct: 147 MCDQGFCADTFGGRLPKCSANVPCEYSVTYGDG---SSTVGSFVNDALQFDQVTGDGQTQ 203

Query: 146 P---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P   +VIFGCG +      S S+   GI+G G  N+S++SQ+ T  AGK    F++CL  
Sbjct: 204 PANASVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLAT--AGKVKKIFAHCL-- 259

Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
                I  GGI A   VV     +TPL+  + HY ++L+ I VG   LE     F     
Sbjct: 260 ---DTIKGGGIFAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEK 316

Query: 247 GNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQPK 303
               +D+G   T LP L +   + +V +   K Q +     +    D LC  Y+ S    
Sbjct: 317 RGTIIDSGTTLTYLPELVFKKVMLAVFN---KHQDITFHDVQ----DFLCFEYSGSVDDG 369

Query: 304 FPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFL 354
           FP +T HF   D+ L   P   F    +++ C  F+ G          ++ G ++  N L
Sbjct: 370 FPTLTFHFE-DDLALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKL 428

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD+E  ++ +    C++
Sbjct: 429 VVYDLENRVIGWTDYNCSS 447


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/358 (32%), Positives = 175/358 (48%), Gaps = 45/358 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P       +DTGSD +W QC+PC +  C  Q  PLFDP  SSTY+  SC S
Sbjct: 198 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 255

Query: 95  SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + CA +        S   C Y   YG G   S ++G  +++TL   S++     + +  F
Sbjct: 256 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 307

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
           GC   N+ S  +D +  G++GLG G  SL+SQ   ++   FSYCLP   SS     +   
Sbjct: 308 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 364

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G    +G V TP++    +   Y + L+AI VG ++L   +S  S G + +D+G + T L
Sbjct: 365 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 423

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
           P   +S L S     +K  P     A+P G  D  C++ S Q     P V + F  GA V
Sbjct: 424 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 478

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L  S +  +      C AF G + +    + G + Q  F + YD+ + +V F+   C
Sbjct: 479 SLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 114/357 (31%), Positives = 173/357 (48%), Gaps = 43/357 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P       +DTGSD +W QC+PC +  C  Q  PLFDP  SSTY+  SC S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 185

Query: 95  SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + CA +        S   C Y   YG G   S ++G  +++TL   S++     + +  F
Sbjct: 186 ADCAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VRSFQF 237

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
           GC   N+ S  +D +  G++GLG G  SL+SQ   ++   FSYCLP   SS     +   
Sbjct: 238 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 294

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G    +G V TP++    +   Y + L+AI VG ++L   +S  S G + +D+G + T L
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 353

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVK 317
           P   +S L S     +K  P     A+P      C++ S Q     P V + F  GA V 
Sbjct: 354 PPTAYSALSSAFKAGMKQYPP----AQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVS 409

Query: 318 LSPSNLFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L  S +  +      C AF G + +    + G + Q  F + YD+ + +V F+   C
Sbjct: 410 LDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 160/359 (44%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP   ++   DTGSD +W QC PC +  C++Q+ P+F+P  SS++  ++C+S
Sbjct: 81  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRK--CYRQQDPIFNPSLSSSFKPLACAS 138

Query: 95  SQCAVVT-SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S C  +    CS + +C Y   YG G   SF+ G+ +TETL+F   +   V M     GC
Sbjct: 139 SICGKLKIKGCSRKNECMYQVSYGDG---SFTVGDFSTETLSFGEHAVRSVAM-----GC 190

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
           G  N       +   G+        S  SQ GTS A  FSYCLP + S   + + FG   
Sbjct: 191 GRNNQGLFHGAAGLLGLGRG---PLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA 247

Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRT 258
                  T L+    +  +YY+ L  I V               S  TG + VD+G   +
Sbjct: 248 VPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAIS 307

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GA 314
            L    ++ L+    +++         + PG S    CY++SS      P V + F  GA
Sbjct: 308 RLTTPAYTALRDAFRSLVT------FPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGA 361

Query: 315 DVKLSPSNLFRNISDE-IMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + L    +  N+ DE   C AF     A  + G + Q  F I  D ++  +   P +C
Sbjct: 362 SMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/373 (29%), Positives = 175/373 (46%), Gaps = 60/373 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+  + +   +DTGSD TW QCEPC  + C+ Q+ P+F P  SS+Y S+SC+S
Sbjct: 65  YIVTMGLGSKNMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 95  SQCAVV------TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           S C  +      T  C   +   C+Y   YG G+Y   ++G L  E L+F       V +
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSY---TNGELGVEALSFGG-----VSV 172

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKI 203
            + +FGCG  N       S   G++GLG    SL+SQ   +  G FSYCLP  + GSS  
Sbjct: 173 SDFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSS-- 227

Query: 204 NFGGIVAG--------------AGVVSTPLIIRDHYYLSLEAISVGNQRLEF-VSSSTGN 248
             G +V G                ++S P  + + Y L+L  I VG   L+  +S   G 
Sbjct: 228 --GSLVMGNESSVFKNANPITYTRMLSNPQ-LSNFYILNLTGIDVGGVALKAPLSFGNGG 284

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFP 305
           I +D+G + T LP   +  LK+         P     + PGFS +  C+N++   +   P
Sbjct: 285 ILIDSGTVITRLPSSVYKALKAEFLKKFTGFP-----SAPGFSILDTCFNLTGYDEVSIP 339

Query: 306 EVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDI 359
            +++ F G A + +  +  F  + ++     +  ++        + G   Q N  + YD 
Sbjct: 340 TISLRFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDT 399

Query: 360 EQAMVSFKPSRCT 372
           +Q+ V F    C+
Sbjct: 400 KQSKVGFAEEPCS 412


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 116/382 (30%), Positives = 173/382 (45%), Gaps = 61/382 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       +  L+DP  S++  ++
Sbjct: 88  LYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTV 147

Query: 91  SCSSSQCAVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           +C    CA  T      S  +   C YS  YG G   S ++G    + L ++  SG    
Sbjct: 148 TCGQEFCATATNGGVPPSCAANSPCQYSITYGDG---SSTTGFFVADFLQYDQVSGDGQT 204

Query: 142 PVEMPNVIFGCGHK-NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLP 196
            +   +V FGCG K   A  +S+    GI+G G  NSS++SQ+  + AGK    FS+CL 
Sbjct: 205 NLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQL--TSAGKVTKIFSHCL- 261

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF------VSS 244
                 +N GGI A   VV     +TPL+    HY + L+ I VG   L+       +  
Sbjct: 262 ----DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGG 317

Query: 245 STGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
            +    +D+G     LP + Y + L +V SN      +K V       D LC+  S    
Sbjct: 318 GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVT-LKNV------QDFLCFQYSGSVD 370

Query: 304 --FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQI 351
             FPEVT HF G D+ L   P + LF+N  D + C  F+ G          ++ G +   
Sbjct: 371 NGFPEVTFHFDG-DLPLVVYPHDYLFQNTED-VYCVGFQSGGVQSKDGKDMVLLGDLALS 428

Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
           N L+ YD+E  ++ +    C++
Sbjct: 429 NKLVVYDLENQVIGWTNYNCSS 450


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 175/372 (47%), Gaps = 55/372 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +G     +   VDT S+ TW QC+PC    C  Q+ PLFDP  S +Y ++ C+S
Sbjct: 120 YVATVGLGAAEATVV--VDTASELTWVQCQPCES--CHDQQDPLFDPSSSPSYAAVPCNS 175

Query: 95  SQC-------AVVTSNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S C       A  TS C++ +     CSY+  Y  G+Y   S G LA + L     +G  
Sbjct: 176 SSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSY---SRGVLARDKLRL---AGQD 229

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGS 200
           +E    +FGCG  N  +P   +  +G++GLG  + SL+SQ      G FSYCLP  + GS
Sbjct: 230 IE--GFVFGCGTSNQGAPFGGT--SGLMGLGRSHVSLVSQTMDQFGGVFSYCLPMRESGS 285

Query: 201 SKINFGGIVAGAGVVSTPLI----IRDH-------YYLSLEAISVGNQRLEFVSSSTGNI 249
           S     G  + A   STP++    + D        Y+L+L  I+VG Q +E    S G +
Sbjct: 286 SGSLVLGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFSAGRV 345

Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFP 305
            +D+G ++ TL+P  Y++     +S + +          P FS +  C+N++   + + P
Sbjct: 346 IIDSGTIITTLVPSVYNAVRAEFLSQLAEYPQA------PAFSILDTCFNLTGLKEVQVP 399

Query: 306 EVTIHFRGA-DVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDI 359
            +   F G+ +V++    +   +S +     +  ++ +      + G   Q N  + +D 
Sbjct: 400 SLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDT 459

Query: 360 EQAMVSFKPSRC 371
             + + F    C
Sbjct: 460 LGSQIGFAQETC 471


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 97/356 (27%), Positives = 154/356 (43%), Gaps = 47/356 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + IG+P +  +  +D+GSD  W QCEPC +  C+ Q  P+F+P  S+++  ++CSS
Sbjct: 129 YFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQ--CYNQTDPIFNPATSASFIGVACSS 186

Query: 95  SQCAVVTSN--CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           + C  +  +  C +G C Y   YG G+Y   + G LA ET+T   T      + +   GC
Sbjct: 187 NVCNQLDDDVACRKGRCGYQVAYGDGSY---TKGTLALETITIGRTV-----IQDTAIGC 238

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           GH N       +   G+ G      S + Q+G    G F YCL  +              
Sbjct: 239 GHWNEGMFVGAAGLLGLGGG---PMSFVGQLGAQTGGAFGYCLVSRAMP----------V 285

Query: 213 GVVSTPLIIR----DHYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLP 261
           G +  PLI        YY+SL  ++VG  R+       +     TG + +DTG   T LP
Sbjct: 286 GAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLP 345

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVTIHFRGADVKL 318
              ++  +          P       PG S    CY+++     + P V+ +F G  +  
Sbjct: 346 TVAYNAFRDAFIAQTTNLP-----RAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILT 400

Query: 319 SPSNLFRNISDEI--MCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            P+  F   +D++   C AF    + + + G I Q    +  D     V F P+ C
Sbjct: 401 FPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 109/379 (28%), Positives = 168/379 (44%), Gaps = 57/379 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP   +  VDTGSD  W     CE CP       +   +DPK SS+ +++
Sbjct: 83  LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTV 142

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    CA        G      C YS +YG G   S ++G   T+ L F+  +G     
Sbjct: 143 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDG---SSTTGFFVTDALQFDQVTGDGQTQ 199

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P    V FGCG +      +S+    GI+G G  N+S++SQ+  + AGK    F++CL  
Sbjct: 200 PGNATVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQL--AAAGKVKKIFAHCL-- 255

Query: 198 QGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
                I  GGI A   VV     +TPL+    HY ++L++I VG   L+     F +   
Sbjct: 256 ---DTIKGGGIFAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGER 312

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMI-KAQPVKGVGAEPGFSDVLCYNI--SSQPK 303
               +D+G   T LP       K VM+ +  K Q +          D +C+    S    
Sbjct: 313 KGTIIDSGTTLTYLP---ELVFKEVMAAIFNKHQDI----VFHNVQDFMCFQYPGSVDDG 365

Query: 304 FPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFL 354
           FP +T HF   D+ L   P   F    +++ C  F+ G          ++ G ++  N L
Sbjct: 366 FPTITFHFE-DDLALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKL 424

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD+E  ++ +    C++
Sbjct: 425 VIYDLENQVIGWTDYNCSS 443


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 172/369 (46%), Gaps = 41/369 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + +  VDTGSD  W  C PCP+            L+D K SST  ++
Sbjct: 73  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 132

Query: 91  SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C    C+ +  +  C ++  CSY  +YG G   S S G+   + +T    +G     P 
Sbjct: 133 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFIKDNITLEQVTGNLRTAPL 189

Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
              V+FGCG KN +     +DS   GI+G G  N+S+ISQ+  G S    FS+CL +   
Sbjct: 190 AQEVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 248

Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF-----VSSSTGNIFVDTG 254
             I   G V    V +TP++    HY + L+ + V    ++       ++  G   +D+G
Sbjct: 249 GGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 308

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
                LP   ++   S++  +   Q VK    +  F+   C++ +S     FP V +HF 
Sbjct: 309 TTLAYLPQNLYN---SLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 362

Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
            + +KLS  P +   ++ +++ C  ++ G          I+ G ++  N L+ YD+E  +
Sbjct: 363 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 421

Query: 364 VSFKPSRCT 372
           + +    C+
Sbjct: 422 IGWADHNCS 430


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 109/359 (30%), Positives = 165/359 (45%), Gaps = 47/359 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +SIGTP +     +DTGSD +W  C               FDP KSSTY   SCSS
Sbjct: 125 YVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDPGKSSTYTPFSCSS 180

Query: 95  SQCAVVT---SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + C  +    + CS    C Y+  YG G   S ++G   ++TL  NST     ++ N  F
Sbjct: 181 AACTRLEGRDNGCSLNSTCQYTVRYGDG---SNTTGTYGSDTLALNSTE----KVENFQF 233

Query: 151 GCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
           GC   +      D  QT G++GLG G  SL+SQ   +    FSYCLP   + S  +  G 
Sbjct: 234 GCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATTRSSGFLTLGA 293

Query: 208 IVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
               +G V+TP+         Y++ L+ I+VG   +    +  + G+I +D+G + T LP
Sbjct: 294 STGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSI-MDSGTIITRLP 352

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
              +S L +     ++  P         FS +  C++ + Q     P V + F  GA V 
Sbjct: 353 PRAYSALSAAFRAGMRRYP-----RARAFSILDTCFDFTGQDNVSIPAVELVFSGGAVVD 407

Query: 318 LSPSNLFRNISDEIM---CSAFRGGNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L         +D IM   C AF      I  + G + Q  F + +D+ Q+++ F+P  C
Sbjct: 408 LD--------ADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRPGAC 458


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 109/367 (29%), Positives = 168/367 (45%), Gaps = 52/367 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
           Y++ + +G+P +     +DTGSD +W QCEPCP    C      LFDP  SSTY + +CS
Sbjct: 135 YVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 194

Query: 94  SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           ++ CA +      + C ++  C Y   YG G   S ++G  +++ LT + +      +  
Sbjct: 195 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG---SNTTGTYSSDVLTLSGSD----VVRG 247

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
             FGC H  L +   D K  G+IGLG    SL+SQ        FSYCLP   +S      
Sbjct: 248 FQFGCSHAELGAGM-DDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTL 306

Query: 203 --INFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG 254
                GG    +   +TP++    +  +Y+ +LE I+VG ++L    S  + G++ VD+G
Sbjct: 307 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSG 365

Query: 255 VLRTLLPLEYHSNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
            + T LP   ++ L S     M+   +A+P+       G  D  C+N +   K   P V 
Sbjct: 366 TVITRLPPAAYAALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVA 417

Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMV 364
           + F  GA V L    +         C AF   R   A    G + Q  F + YD+   + 
Sbjct: 418 LVFAGGAVVDLDAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGVF 472

Query: 365 SFKPSRC 371
            F+   C
Sbjct: 473 GFRAGAC 479


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 163/368 (44%), Gaps = 63/368 (17%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y   +++G+PP D    +DTGSD TW +C+PC   DC       FD   S+TY +++C+
Sbjct: 123 VYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCA 177

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC 152
                         D     L  R     F SG    +TL    + S    E P  +FGC
Sbjct: 178 D-------------DLRLPVLL-RLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGC 223

Query: 153 GH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGIV 209
           G   K L      S + GI+ L PG+ S  SQ+G     KFSYCL  Q   + +    +V
Sbjct: 224 GSLLKGLI-----SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMV 278

Query: 210 AGAGVVS--------------TPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNI 249
            G   V               TP+     YY + L+ ISVGNQRL+     F++      
Sbjct: 279 FGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSTFLNGQDKPT 338

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ---PVKGVGAEPGFSDVLCYNI--SSQPKF 304
             D+G   T+LP     ++K  +++M+       +KG+ A        C+ +  SS    
Sbjct: 339 IFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKGLDA--------CFRVPPSSGQGL 390

Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
           P++T HF  GAD    PSN   ++   + C  F   N   ++G + Q +F + +D++   
Sbjct: 391 PDITFHFNGGADFVTRPSNYVIDLG-SLQCLIFVPTNEVSIFGNLQQQDFFVLHDMDNRR 449

Query: 364 VSFKPSRC 371
           + FK + C
Sbjct: 450 IGFKETDC 457


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/386 (29%), Positives = 175/386 (45%), Gaps = 44/386 (11%)

Query: 16  PKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE 75
           P  P  ++   EI   +  + M +S+GTPPV    +VDTGS  +W  C+ C ++ C    
Sbjct: 58  PAEPSPVVGNHEIH--EGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRC-QISCHTTA 114

Query: 76  PP---LFDPKKSSTYNSISCSSSQCAVVTSN------CSE--GDCSYSFLYGRGAYASFS 124
           P    +FDP KS+TY  + CSS  CA V  +      C E    C YS  YG G    +S
Sbjct: 115 PEAGSVFDPDKSTTYELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYS 174

Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
           +G L T+ LT  S+S +   +   IFGC   +    +    ++G+IG G  N S  +Q+ 
Sbjct: 175 AGRLGTDKLTLASSSSI---IDGFIFGCSGDD----SFKGYESGVIGFGGANFSFFNQVA 227

Query: 185 TSIAGK-FSYCLPDQGSSKINFGGIVAGA----GVVSTPLI--IRDHYYLSLEAIS--VG 235
                + FSYC P   +++   G +  GA     +V T LI    D    SL+ I   V 
Sbjct: 228 RQTNYRAFSYCFPGDHTAE---GFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVD 284

Query: 236 NQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV--KGVGAEPGFS 291
             RL+   S  +   + VD+G + T L           M++ ++A+      VG E  F 
Sbjct: 285 GNRLQVDQSEYTKRMMVVDSGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFR 344

Query: 292 DVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI--SDEIMCSAFRGGNANI----VY 345
                ++ S    P V + F G  +KL P N+F ++  S + +C AF+   A +    + 
Sbjct: 345 PNGGDSVDSG-DLPTVEMRFIGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRNVQIL 403

Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
           G     +F + YD++     F+   C
Sbjct: 404 GNKATXSFRVVYDLQAMYFGFQAGAC 429


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 110/377 (29%), Positives = 170/377 (45%), Gaps = 50/377 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +GTPP      +DTGSD  W QC PC    CF+Q  P +DPK SS++ +I+C  
Sbjct: 195 YFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITCHD 252

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS--GLPVE- 144
            +C +V+S             C Y + YG    +S ++G+ A ET T N T+  G P   
Sbjct: 253 PRCQLVSSPDPPQPCKGETQSCPYFYWYGD---SSNTTGDFALETFTVNLTTPEGKPELK 309

Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
            + NV+FGCGH N       +   G+        S  +Q+ +     FSYCL D+     
Sbjct: 310 IVENVMFGCGHWNRGLFHGAAGLLGLGRG---PLSFATQLQSLYGHSFSYCLVDRNSNSS 366

Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSST--- 246
            SSK+ FG    +++   +  T  +      +   YY+ +++I VG + L+    +    
Sbjct: 367 VSSKLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLS 426

Query: 247 ----GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
               G   +D+G   T      +  +K      IK  P+  V   P      CYN+S   
Sbjct: 427 AQGGGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPL--VETFPPLKP--CYNVSGVE 482

Query: 303 K--FPEVTIHFR-GADVKLSPSNLFRNIS-DEIMCSAFRG--GNANIVYGRIMQINFLIG 356
           K   PE  I F  GA       N F  I  ++++C A  G   +A  + G   Q NF I 
Sbjct: 483 KMELPEFAILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHIL 542

Query: 357 YDIEQAMVSFKPSRCTN 373
           YD++++ + + P +C +
Sbjct: 543 YDLKKSRLGYAPMKCAD 559


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 112/407 (27%), Positives = 174/407 (42%), Gaps = 75/407 (18%)

Query: 2   QNSQKLPFYNDNETPKSPI----SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
           ++S ++ F +D            S+ +QA + +    Y M++S+GTP +      DTGSD
Sbjct: 49  RDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSD 108

Query: 58  CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFL 114
             WTQC PC +  CF+Q  P F P  SST++ + C+SS C  +      C+   C Y++ 
Sbjct: 109 LIWTQCAPCTK--CFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYK 166

Query: 115 YGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGP 174
           YG G    +++G LATETL     S      P+V FGC  +N              GLG 
Sbjct: 167 YGSG----YTAGYLATETLKVGDAS-----FPSVAFGCSTEN--------------GLG- 202

Query: 175 GNSSLISQMGTSIAGKFSYCLPD---QGSSKINFG-------GIVAGAGVVSTPLIIRDH 224
                  Q+   + G+FSYCL      G+S I FG       G V     V+ P +   +
Sbjct: 203 -------QLDLGV-GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHPSY 254

Query: 225 YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
           YY++L  I+VG   L   +S+         G   VD+G   T L  + +  +K     + 
Sbjct: 255 YYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAF--LS 312

Query: 277 KAQPVKGVGAEPGFSDVLCYNISSQP----KFPEVTIHFRGADVKLSPSNLFRNISDE-- 330
           +   V  V    G    LC+  +         P + + F G      P+      +D   
Sbjct: 313 QTADVTTVNGTRGLD--LCFKSTGGGGGGIAVPSLVLRFDGGAEYAVPTYFAGVETDSQG 370

Query: 331 ------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                 +M    +G     V G +MQ++  + YD++  + SF P+ C
Sbjct: 371 SVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFAPADC 417


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 173/370 (46%), Gaps = 41/370 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + +  VDTGSD  W  C PCP+            L+D K SST  ++
Sbjct: 77  LYFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNV 136

Query: 91  SCSSSQCAVV--TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C    C+ +  +  C ++  CSY  +YG G   S S G+   + +T    +G     P 
Sbjct: 137 GCEDDFCSFIMQSETCGAKKPCSYHVVYGDG---STSDGDFIKDNITLEQVTGNLRTAPL 193

Query: 147 --NVIFGCGHKNLASP--TSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGS 200
              V+FGCG KN +     +DS   GI+G G  N+S+ISQ+  G S    FS+CL +   
Sbjct: 194 AQEVVFGCG-KNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNG 252

Query: 201 SKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEF-----VSSSTGNIFVDTG 254
             I   G V    V +TP++    HY + L+ + V    ++       ++  G   +D+G
Sbjct: 253 GGIFAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSG 312

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR 312
                LP   +++L   +  +   Q VK    +  F+   C++ +S     FP V +HF 
Sbjct: 313 TTLAYLPQNLYNSL---IEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFE 366

Query: 313 GADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAM 363
            + +KLS  P +   ++ +++ C  ++ G          I+ G ++  N L+ YD+E  +
Sbjct: 367 DS-LKLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEV 425

Query: 364 VSFKPSRCTN 373
           + +    C++
Sbjct: 426 IGWADHNCSS 435


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 160/366 (43%), Gaps = 36/366 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    VDTGSD  W  C+PCPE            LFD   SST   +
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKV 132

Query: 91  SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C    C+ ++ + S      CSY  +Y   A  S S GN   + LT    +G     P 
Sbjct: 133 GCDDDFCSFISQSDSCQPAVGCSYHIVY---ADESTSEGNFIRDKLTLEQVTGDLQTGPL 189

Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
              V+FGCG         SDS   G++G G  N+S++SQ+  +   K  FS+CL +    
Sbjct: 190 GQEVVFGCGSDQSGQLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249

Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
            I   G+V    V +TP++    HY + L  + V    L+   S    G   VD+G    
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTALDLPPSIMRNGGTIVDSGTTLA 309

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
             P   + +L   +  ++  QPVK    E  F    C++ S      FP V+  F  + V
Sbjct: 310 YFPKVLYDSL---IETILARQPVKLHIVEDTFQ---CFSFSENVDVAFPPVSFEFEDS-V 362

Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFK 367
           KL+  P +    +  E+ C  ++ G          I+ G ++  N L+ YD+E  ++ + 
Sbjct: 363 KLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLENEVIGWA 422

Query: 368 PSRCTN 373
              C++
Sbjct: 423 DHNCSS 428


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 167/378 (44%), Gaps = 64/378 (16%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L+IGTPP +I   +DTGS+ +W +C+  P          +F+P  S TY  I CSS  C 
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRCKKEPNFTS------IFNPLASKTYTKIPCSSQTCK 124

Query: 99  VVTSN------CSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             TS+      C     C +   Y   A AS   G+LA ET  F S     +  P  +FG
Sbjct: 125 TRTSDLTLPVTCDPAKLCHFIISY---ADASSVEGHLAFETFRFGS-----LTRPATVFG 176

Query: 152 CGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           C     +S T  D+K TG++G+  G+ S ++QMG     KFSYC+    S+     G   
Sbjct: 177 CMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSYCISGLDSTGFLLLGEAR 233

Query: 211 GAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFV 251
            + +          +STPL   D   Y + LE I V N+ L      FV   T  G   V
Sbjct: 234 YSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMV 293

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI----SSQPK 303
           D+G   T L    +S L+     +++   V  V  EP +    +  LCY I    S+ P 
Sbjct: 294 DSGTQFTFLLGPVYSALRKEF--LLQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPN 351

Query: 304 FPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAF----RGGNANIVYGRIMQINF 353
            P V + FRGA++ +S   L   +       D + C  F      G ++ + G   Q N 
Sbjct: 352 LPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNV 411

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + YD+E + + F   RC
Sbjct: 412 WMEYDLENSRIGFAELRC 429


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 114/355 (32%), Positives = 176/355 (49%), Gaps = 39/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P       +DTGSD +W QC+PC +  C  Q  PLFDP  SSTY+  SC S
Sbjct: 128 YLITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQ--CHSQADPLFDPSSSSTYSPFSCGS 185

Query: 95  SQCAVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           + CA +        S   C Y   YG G   S ++G  +++TL   S++     + +  F
Sbjct: 186 AACAQLGQEGNGCSSSSQCQYIVTYGDG---SSTTGTYSSDTLALGSSA-----VKSFQF 237

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
           GC   N+ S  +D +  G++GLG G  SL+SQ   ++   FSYCLP   SS     +   
Sbjct: 238 GC--SNVESGFND-QTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAA 294

Query: 207 GIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
           G    +G V TP++    +   Y + L+AI VG ++L   +S  S G + +D+G + T L
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTV-MDSGTVITRL 353

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADV 316
           P   +S L S     +K  P     A+P G  D  C++ S Q     P V + F  GA V
Sbjct: 354 PPTAYSALSSAFKAGMKQYPP----AQPSGILDT-CFDFSGQSSVSIPSVALVFSGGAVV 408

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L  S +   +S+ +  +A    ++  + G + Q  F + YD+ + +V F+   C
Sbjct: 409 SLDASGII--LSNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/355 (29%), Positives = 160/355 (45%), Gaps = 39/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +  GTP   ++  +DTGSD  W  C+ C          P+FDP KSS+Y   +C S
Sbjct: 115 YIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGC---HSTAPIFDPAKSSSYKPFACDS 171

Query: 95  SQCAVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
             C  ++ NC     C +   YG G       G LA++ +T  S       +PN  FGC 
Sbjct: 172 QPCQEISGNCGGNSKCQFEVSYGDGTQV---DGTLASDAITLGSQ-----YLPNFSFGCA 223

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG-- 211
            ++L+  TS S     +G G  +    +       G FSYCLP   +S  +   +V G  
Sbjct: 224 -ESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGS---LVLGKE 279

Query: 212 AGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRT- 258
           A V S+ L    +I+D      Y+++L+AISVGN R+    +   S G   +D+G   T 
Sbjct: 280 AAVSSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITH 339

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-KFPEVTIHF-RGADV 316
           L+P  Y +   +    +   QP       P      CY++SS     P +T+H  R  D+
Sbjct: 340 LVPSAYTALRDAFRQQLSSLQPT------PVEDMDTCYDLSSSSVDVPTITLHLDRNVDL 393

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N+       + C AF   ++  + G + Q N+ I +D+  + V F   +C
Sbjct: 394 VLPKENILITQESGLACLAFSSTDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/385 (27%), Positives = 172/385 (44%), Gaps = 66/385 (17%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G+PP      +DTGSD  W QC PC   DCF+Q    +DPK S++Y +I+C+ 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227

Query: 95  SQCAVVTS-------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMP 146
            +C +V+S             C Y + YG    +S ++G+ A ET T N +T+G   E+ 
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTNGGSSELY 284

Query: 147 NV---IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS-- 201
           NV   +FGCGH N       +   G+        S  SQ+ +     FSYCL D+ S   
Sbjct: 285 NVENMMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTN 341

Query: 202 -----------------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL----- 239
                             +NF   VAG        ++   YY+ +++I V  + L     
Sbjct: 342 VSSKLIFGEDKDLLSHPNLNFTSFVAGK-----ENLVDTFYYVQIKSILVAGEVLNIPEE 396

Query: 240 --EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL- 294
                S   G   +D+G   +      +  +K+ ++        K  G  P + D  +L 
Sbjct: 397 TWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILD 449

Query: 295 -CYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRI 348
            C+N+S     + PE+ I F  GA       N F  ++++++C A  G   +A  + G  
Sbjct: 450 PCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNY 509

Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
            Q NF I YD +++ + + P++C +
Sbjct: 510 QQQNFHILYDTKRSRLGYAPTKCAD 534


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 167/370 (45%), Gaps = 37/370 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +G+PP D +  VDTGSD  W    +C  CP       +  L+DPK S T + +
Sbjct: 69  LYFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVV 128

Query: 91  SCSSSQCAVV----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    C+         C SE  C YS  YG G   S ++G    + LT+N  +G     
Sbjct: 129 SCDQDFCSATFDGPIPGCKSEIPCPYSITYGDG---SATTGYYVQDYLTYNRINGNLRTS 185

Query: 146 P---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
           P   ++IFGCG     +  S S++   GIIG G  NSS++SQ+  S  +   FS+CL + 
Sbjct: 186 PQNSSIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNV 245

Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
               I   G V    V +TPL+ R  HY + L++I V    L+     F S +     +D
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVID 305

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
           +G     LP   +  L   +  ++  QP +K    E  F   L Y  +    FP V +HF
Sbjct: 306 SGTTLAYLPDIVYDEL---IQKVLARQPGLKLYLVEQQFRCFL-YTGNVDRGFPVVKLHF 361

Query: 312 RGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQAM 363
           + +  + + P +      D I C  ++   A    G+ M +       N L+ YD+E  +
Sbjct: 362 KDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMV 421

Query: 364 VSFKPSRCTN 373
           + +    C++
Sbjct: 422 IGWTDYNCSS 431


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 159/359 (44%), Gaps = 41/359 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP   ++   DTGSD +W QC PC +  C++Q+ P+F+P  SS++  ++C+S
Sbjct: 14  YFARIGVGTPARSVYMVADTGSDVSWLQCSPCRK--CYRQQDPIFNPSLSSSFKPLACAS 71

Query: 95  SQCAVVT-SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S C  +    CS  + C Y   YG G   SF+ G+ +TETL+F   +   V M     GC
Sbjct: 72  SICGKLKIKGCSRKNKCMYQVSYGDG---SFTVGDFSTETLSFGEHAVRSVAM-----GC 123

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGGIV 209
           G  N       +   G+        S  SQ GTS A  FSYCLP + S   + + FG   
Sbjct: 124 GRNNQGLFHGAAGLLGLGRG---PLSFPSQTGTSYASVFSYCLPRRESAIAASLVFGPSA 180

Query: 210 AGAGVVSTPLI----IRDHYYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVLRT 258
                  T L+    +  +YY+ L  I V               S  TG + VD+G   +
Sbjct: 181 VPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGTAIS 240

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQ--PKFPEVTIHFR-GA 314
            L    ++ L+    +++         + PG S    CY++SS      P V + F  GA
Sbjct: 241 RLTTPAYTALRDAFRSLVT------FPSAPGISLFDTCYDLSSMKTATLPAVVLDFDGGA 294

Query: 315 DVKLSPSNLFRNISDE-IMCSAFR-GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + L    +  N+ DE   C AF     A  + G + Q  F I  D ++  +   P +C
Sbjct: 295 SMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 107/377 (28%), Positives = 172/377 (45%), Gaps = 44/377 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
           Y + + +G+PP  +    DTGSD TW +C  C + +C    P   F  + S+T++   C 
Sbjct: 83  YFVSIRLGSPPQTLLLVADTGSDLTWVRCSAC-KTNCSIHPPGSTFLARHSTTFSPTHCF 141

Query: 94  SSQCAVV----TSNCS----EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SS C +V     + C+       C Y ++Y  G   S +SG  + ET T N++SG  +++
Sbjct: 142 SSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDG---SKTSGFFSKETTTLNTSSGREMKL 198

Query: 146 PNVIFGCG-HKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
            ++ FGCG H +  S    S    +G++GLG G  S  SQ+G      FSYCL D     
Sbjct: 199 KSIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSP 258

Query: 200 --SSKINFGGIVA-----GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS---- 244
             +S +  G +V+      + +  TPL+I       YY+S++ + V   +L    S    
Sbjct: 259 PPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWSL 318

Query: 245 ---STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-- 299
                G   +D+G   T L    +  + S     +K       GA       LC N++  
Sbjct: 319 DELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKLPSPTPGGASTRSGFDLCVNVTGV 378

Query: 300 SQPKFPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRGGNAN----IVYGRIMQINFL 354
           S+P+FP +++   G  +    P N F +IS+ I C A +   A      V G +MQ  FL
Sbjct: 379 SRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFL 438

Query: 355 IGYDIEQAMVSFKPSRC 371
           + +D  ++ + F    C
Sbjct: 439 LEFDRGKSRLGFSRRGC 455


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 164/362 (45%), Gaps = 39/362 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L  GTP V     +DTGSD +W QC+PC    C+ Q+ P+FDP  SSTY  + C S
Sbjct: 122 YVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGS 181

Query: 95  SQC--------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
             C        A   +N S G   C Y   YG G     + G  +TETLT +  +   V 
Sbjct: 182 EACRDLDPDSYANGCTNSSSGASLCQYGIQYGNG---DTTVGVYSTETLTLSPEAATVVN 238

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-- 202
             N  FGCG   L          G++GLG    SL+SQ   +  G FSYCLP   S+   
Sbjct: 239 --NFSFGCG---LVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGF 293

Query: 203 INFGGIVAG----AGVVSTPL-IIRDHYYL-SLEAISVGNQRLEFVSSS-TGNIFVDTGV 255
           +  G    G    AG   TPL ++   +YL  L  ISVG ++L+   +   G + +D+G 
Sbjct: 294 LALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSGT 353

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA- 314
           + T LP   +S L++   + + A P+     +        +  ++    P V + F G  
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGV 413

Query: 315 --DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPS 369
             D+ + PS +  +      C AF  G ++    + G + Q  F + YD  +  V F+  
Sbjct: 414 TIDLDV-PSGVLLD-----GCLAFVAGASDGDTGIIGNVNQRTFEVLYDSARGHVGFRAG 467

Query: 370 RC 371
            C
Sbjct: 468 AC 469


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 177/380 (46%), Gaps = 56/380 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G+PP      +DTGSD  W QC PC   DCF+Q    +DPK S++Y +I+C+ 
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPCH--DCFQQNGAFYDPKASASYKNITCND 212

Query: 95  SQCAVVT-----SNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMP 146
            +C +V+       C   +  C Y + YG    +S ++G+ A ET T N +TSG   E+ 
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTSGGSSELY 269

Query: 147 NV---IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
           NV   +FGCGH N       +   G+        S  SQ+ +     FSYCL D+     
Sbjct: 270 NVENMMFGCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTN 326

Query: 200 -SSKINFG---GIVAGAGVVSTPLIIRDH------YYLSLEAISVGNQRL-------EFV 242
            SSK+ FG    +++   +  T  + R        YY+ +++I V  + L          
Sbjct: 327 VSSKLIFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNIS 386

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNI 298
           S   G   +D+G   +      +  +K+ ++        K  G  P + D  +L  C+N+
Sbjct: 387 SDGAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILDPCFNV 439

Query: 299 S--SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINF 353
           S     + PE+ I F  GA       N F  ++++++C A  G   +A  + G   Q NF
Sbjct: 440 SGIDSIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNF 499

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
            I YD +++ + + P++C +
Sbjct: 500 HILYDTKRSRLGYAPTKCAD 519


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 116/405 (28%), Positives = 186/405 (45%), Gaps = 59/405 (14%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDI-YLMHLSIGTPPVDIFGSVDTGSDCTW 60
           QN  +      N + +S    I  A  I+++ + Y++ + +G   + +   +DTGSD TW
Sbjct: 99  QNRIRAKVSGHNSSEQSSEIQIPLASGINLETLNYIVTIGLGNQNMTVI--IDTGSDLTW 156

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV------TSNCSEGD---CSY 111
            QC+PC  + C+ Q+ P+F+P  SS+YNS+ C+SS C  +      T  C   +   C++
Sbjct: 157 VQCDPC--MSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNH 214

Query: 112 SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIG 171
           +  YG G   SF+ G L  E L+F   S     + N +FGCG  N       S   GI+G
Sbjct: 215 TVSYGDG---SFTDGELGVEHLSFGGIS-----VSNFVFGCGRNNKGLFGGVS---GIMG 263

Query: 172 LGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG--------------AGVV 215
           LG  N S+ISQ  T+  G FSYCLP  D G+S    G +V G                +V
Sbjct: 264 LGRSNLSMISQTNTTFGGVFSYCLPTTDSGAS----GSLVIGNESSLFKNLTPIAYTSMV 319

Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM 275
           S P  + + Y L+L  I VG   ++  S   G I +D+G + T L    ++ LK+     
Sbjct: 320 SNPQ-LSNFYVLNLTGIDVGGVAIQDTSFGNGGILIDSGTVITRLAPSLYNALKAEFLKQ 378

Query: 276 IKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDE- 330
               P+      P  S +  C+N++   +   P +++HF    D+ +    +     D  
Sbjct: 379 FSGYPIA-----PALSILDTCFNLTGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGS 433

Query: 331 ---IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              +  ++    N   + G   Q N  + YD +Q+ + F    C+
Sbjct: 434 QVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAREDCS 478


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 39/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTPP  +    DTGSD  W QC PC    C+ Q  PLF+P  SST+ SI+C S
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQS--CYGQTDPLFNPSFSSTFQSITCGS 138

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  ++   C    C Y   YG G   SF+ G  +TETL+F S +     + +V  GCG
Sbjct: 139 SLCQQLLIRGCRRNQCLYQVSYGDG---SFTVGEFSTETLSFGSNA-----VNSVAIGCG 190

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           H N    T  +   G+        S  SQ+G      FSYCLP +   GS  + FG    
Sbjct: 191 HNNQGLFTGAAGLLGLGKG---LLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAV 247

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IFVDTGVLRT 258
            +    T L+    +   YY+ +  I VG   +        + SSTGN  + +D+G   T
Sbjct: 248 ASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAVT 307

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
            L    ++ ++    +  +A          GFS    CY++S +     P V+  F  GA
Sbjct: 308 RLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGA 363

Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + L   N+   + +    C AF   + N  + G I Q +F + +D     V    ++C
Sbjct: 364 TMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 111/385 (28%), Positives = 175/385 (45%), Gaps = 60/385 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPE-----LDCFKQEPPLFDPKKSST 86
           D  + + + IGTPP      VDTGSD  WTQC              +Q  PL++P++SS+
Sbjct: 81  DQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSS 140

Query: 87  YNSISCSSSQC---AVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTF--NSTSG 140
           +  + CS   C        NC+  + C Y  LYG    ++ + G LA+ET TF  N+   
Sbjct: 141 FAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYG----SAEAGGVLASETFTFGVNAKVS 196

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PD 197
           LP+      FGCG  +       S   G++GL PG  SL+SQ+      +FSYCL    +
Sbjct: 197 LPLG-----FGCGALSAGDLVGAS---GLMGLSPGIMSLVSQLSVP---RFSYCLTPFAE 245

Query: 198 QGSSKINFGGIV-----AGAGVVSTPLIIRD------HYYLSLEAISVGNQRLEFVSSST 246
           + +S + FG +         G V T  I+R+      +YY+ L  +S+G +RL+  ++S 
Sbjct: 246 RKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSL 305

Query: 247 GNI--------FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYN 297
           G I         VD+G   + L       +K  +   ++  PV   G +  + D  LC+ 
Sbjct: 306 GMIKPDGSGGTIVDSGSTMSYLEETAFRAVKKAVVEAVRL-PVAN-GTDEDYDDYELCFA 363

Query: 298 ISS-----QPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFR---GGNANIVYGRI 348
           + +       K P + +HF  GA + L   N F+     +MC A      G    + G +
Sbjct: 364 LPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNV 423

Query: 349 MQINFLIGYDIEQAMVSFKPSRCTN 373
            Q N  + +D+     SF P++C +
Sbjct: 424 QQQNMHVLFDVRNQKFSFAPTKCDD 448


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 109/359 (30%), Positives = 160/359 (44%), Gaps = 39/359 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + L +GTPP  +    DTGSD  W QC PC    C+ Q  PLF+P  SST+ SI+C S
Sbjct: 81  YFVSLGVGTPPRTVNMVADTGSDVLWLQCLPCQS--CYGQTDPLFNPSFSSTFQSITCGS 138

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           S C  ++   C    C Y   YG G   SF+ G  +TETL+F S +     + +V  GCG
Sbjct: 139 SLCQQLLIRGCRRNQCLYQVSYGDG---SFTVGEFSTETLSFGSNA-----VNSVAIGCG 190

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---GSSKINFGGIVA 210
           H N    T  +   G+        S  SQ+G      FSYCLP +   GS  + FG    
Sbjct: 191 HNNQGLFTGAAGLLGLGKG---LLSFPSQVGQLYGSVFSYCLPTRESTGSVPLIFGNQAV 247

Query: 211 GAGVVSTPLI----IRDHYYLSLEAISVGNQRLEF------VSSSTGN--IFVDTGVLRT 258
            +    T L+    +   YY+ +  I VG   +        + SSTGN  + +D+G   T
Sbjct: 248 ASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAVT 307

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
            L    ++ ++    +  +A          GFS    CY++S +     P V+  F  GA
Sbjct: 308 RLVTSAYNPMR----DAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGA 363

Query: 315 DVKLSPSNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + L   N+   + +    C AF   + N  + G I Q +F + +D     V    ++C
Sbjct: 364 TMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 169/360 (46%), Gaps = 48/360 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M  S+GTPP  +    DTGSD  W +C  C    C  +    + P KSS+++ + CSS
Sbjct: 81  YDMTFSMGTPPQTLSALADTGSDLIWAKCGACKR--CAPRGSASYYPTKSSSFSKLPCSS 138

Query: 95  SQCAVVTS---------NCSEGDCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVE 144
           + C  + S               CSY + YG  +    ++ G + +ET T  S +     
Sbjct: 139 ALCRTLESQSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTLGSDA----- 193

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSK 202
           +  + FGC      S       +G++GLG G  SL+ Q+     G FSYCL      SS 
Sbjct: 194 VQGIGFGC---TTMSEGGYGSGSGLVGLGRGKLSLVRQLK---VGAFSYCLTSDPSTSSP 247

Query: 203 INFG-GIVAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVSSSTGN--IFVDTGVL 256
           + FG G + G GV STPL+       Y ++L++IS+G  +    +  TG   I  D+G  
Sbjct: 248 LLFGAGALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAK----TPGTGRHGIIFDSGTT 303

Query: 257 RTLLPLEYHS----NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR 312
            T L    ++     L S  +N+ +     G          +C+  S    FP + +HF 
Sbjct: 304 LTFLAEPAYTLAEAGLLSQTTNLTRVPGTDGY--------EVCFQTSGGAVFPSMVLHFD 355

Query: 313 GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           G D+ L   N F  ++D + C   +   + + + G IMQ+++ I YD++++++SF+P+ C
Sbjct: 356 GGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 174/371 (46%), Gaps = 59/371 (15%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           + + +GTPP      +D GSD  WTQC         KQ  P+FD  +SS+++ + C S  
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGP--TAKQLEPVFDAARSSSFSVLPCDSKL 166

Query: 97  CAVVT---SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           C   T     C++  C+Y   YG       ++G LATET TF +  G+     N+ FGCG
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYG----IMTATGVLATETFTFGAHHGVSA---NLTFGCG 219

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFG---- 206
              LA+ T  ++ +GI+GL PG  S++ Q+  +   KFSYCL    D+ +S + FG    
Sbjct: 220 --KLANGTI-AEASGILGLSPGPLSMLKQLAIT---KFSYCLTPFADRKTSPVMFGAMAD 273

Query: 207 -GIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-------TGNIFVDT 253
            G     G V T  ++++     +YY+ +  +SVG++RL+    +       TG   +D+
Sbjct: 274 LGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDS 333

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPV--KGVGAEPGFSDVLCYNIS-----SQPKFPE 306
                 L     + LK  +   IK  PV  + V   P     +C+ +         + P 
Sbjct: 334 ATTLAYLVEPAFTELKKAVMEGIKL-PVANRSVDDYP-----VCFELPRGMSMEGVQVPP 387

Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSA-----FRGGNANIVYGRIMQINFLIGYDIE 360
           + +HF G A++ L   N F+  S  +MC A     F G  A  V G + Q N  + YD+ 
Sbjct: 388 LVLHFDGDAEMSLPRDNYFQEPSPGMMCLAVMQAPFEG--APNVIGNVQQQNMHVLYDVG 445

Query: 361 QAMVSFKPSRC 371
               S+ P++C
Sbjct: 446 NRKFSYAPTKC 456


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 116/389 (29%), Positives = 174/389 (44%), Gaps = 67/389 (17%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           ++ M L IG+   ++   +DTGS+    QC          +  P+FDP  S +Y  + C 
Sbjct: 99  LFSMQLGIGSLQKNLSAIIDTGSEAVLVQCG--------SRSRPVFDPAASQSYRQVPCI 150

Query: 94  SSQCAVVTSNCSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNST--SGL 141
           S  C  V    S G           C+YS  YG    +  S+G+ + + +  NST  SG 
Sbjct: 151 SQLCLAVQQQTSNGSSQPCVNSSATCTYSLSYGD---SRNSTGDFSQDVIFLNSTNSSGQ 207

Query: 142 PVEMPNVIFGCGHKNLASPTS---DSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLPD 197
            V+  +V FGC H    SP     D    GI+G   GN SL SQ+   + G KFSYC P 
Sbjct: 208 AVQFRDVAFGCAH----SPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPS 263

Query: 198 QGSSKINFGGIVAGAGVVS------TPLI------IRDH-YYLSLEAISVGNQRLEFVSS 244
           Q       G I  G   +S      TPL+       R   YY+ L +ISV  + L    S
Sbjct: 264 QPWQPRATGVIFLGDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPES 323

Query: 245 S--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           +         G   +D+G   T +  + ++  ++  +   ++   K VGA  GF D  CY
Sbjct: 324 AFKLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CY 381

Query: 297 NI---SSQPKFPEVTIHFR-GADVKLSPSNLFRNIS---DEI-----MCSAFRGGNANI- 343
           NI   SS P  PEV +  +    ++L   +LF  +S   +E+     + S+ + G   I 
Sbjct: 382 NISAGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKIN 441

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           V G   Q N+L+ YD E++ V F+ + C+
Sbjct: 442 VLGNYQQSNYLVEYDNERSRVGFERADCS 470


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 162/366 (44%), Gaps = 36/366 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTP  D    VDTGSD  W  C     CP      +  P +D   SST  S+
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSV 142

Query: 91  SCSSSQCAVVT--SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LPVE 144
           SCS + C+ V   S C  G  C Y  LYG G   S ++G L  + +  +  +G       
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVILYGDG---SSTNGYLVRDVVHLDLVTGNRQTGST 199

Query: 145 MPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSS 201
              +IFGCG K       S +   GI+G G  NSS ISQ+ +   +   F++CL +    
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259

Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
            I   G V    V +TP++ +  HY ++L AI VGN  L+     F S     + +D+G 
Sbjct: 260 GIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGT 319

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQPKFPEVTIHF-RG 313
               LP   ++ L + +  +   Q +     +  F+   C++ I    +FP VT  F + 
Sbjct: 320 TLVYLPDAVYNPLMNQI--LASHQELNLHTVQDSFT---CFHYIDRLDRFPTVTFQFDKS 374

Query: 314 ADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
             + + P      + ++  C  +       +GG +  + G +   N L+ YDIE  ++ +
Sbjct: 375 VSLAVYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGW 434

Query: 367 KPSRCT 372
               C+
Sbjct: 435 TNHNCS 440


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 164/370 (44%), Gaps = 37/370 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +G+PP D +  VDTGSD  W    +C  CP       +  L+DPK S T   I
Sbjct: 69  LYFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELI 128

Query: 91  SCSSSQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    C+         C SE  C YS  YG G   S ++G    + LT+N  +      
Sbjct: 129 SCDQEFCSATYDGPIPGCKSEIPCPYSITYGDG---SATTGYYVQDYLTYNHVNDNLRTA 185

Query: 146 P---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
           P   ++IFGCG     + +S S++   GIIG G  NSS++SQ+  S  +   FS+CL + 
Sbjct: 186 PQNSSIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNI 245

Query: 199 GSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
               I   G V    V +TPL+ R  HY + L++I V    L+     F S +     +D
Sbjct: 246 RGGGIFAIGEVVEPKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSGNGKGTIID 305

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEVTIHF 311
           +G     LP   +  L   +  ++  QP +K    E  FS    Y  +    FP V +HF
Sbjct: 306 SGTTLAYLPAIVYDEL---IPKVMARQPRLKLYLVEQQFS-CFQYTGNVDRGFPVVKLHF 361

Query: 312 RGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQAM 363
             +  + + P +      D I C  ++   A    G+ M +       N L+ YD+E   
Sbjct: 362 EDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLENMA 421

Query: 364 VSFKPSRCTN 373
           + +    C++
Sbjct: 422 IGWTDYNCSS 431


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 110/377 (29%), Positives = 168/377 (44%), Gaps = 53/377 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP   +  VDTGSD  W     CE CP       +  L+DPK SST + +
Sbjct: 85  LYYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMV 144

Query: 91  SCSSSQCAVVTS----NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            C  + CA         C     C YS  YG G   S + G+  T+ L F+  +      
Sbjct: 145 MCDQAFCAATFGGKLPKCGANVPCEYSVTYGDG---SSTIGSFVTDALQFDQVTRDGQTQ 201

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P   +VIFGCG +      +S+    GI+G G  N+S++SQ+ T  AGK    F++CL  
Sbjct: 202 PANASVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTT--AGKVKKIFAHCLDT 259

Query: 198 QGSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
                I   G V    V +TPL+  + HY ++L+ I VG   L+     F         +
Sbjct: 260 IKGGGIFSIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTII 319

Query: 252 DTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV---LCYNI--SSQPKFP 305
           D+G   T LP L +   + +V +   K Q +        F DV   LC+    S    FP
Sbjct: 320 DSGTTLTYLPELVFKEVMLAVFN---KHQDIT-------FHDVQGFLCFQYPGSVDDGFP 369

Query: 306 EVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
            +T HF   D+ L   P   F    +++ C  F+ G +        ++ G ++  N L+ 
Sbjct: 370 TITFHFE-DDLALHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVI 428

Query: 357 YDIEQAMVSFKPSRCTN 373
           YD+E  ++ +    C++
Sbjct: 429 YDLENRVIGWTDYNCSS 445


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 106/377 (28%), Positives = 175/377 (46%), Gaps = 55/377 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G+P  +    VDTGS+ TW QC PC    C      ++D  +S++Y  ++C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLQCLPCKV--CAPSVDTIYDAARSASYRPVTCNN 157

Query: 95  SQCAVVTSN-----CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNST-SGLPVEMPN 147
           SQ    +S      C+ G  C ++  YG G   SFS G+L+T+TL   +   G PV + +
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDG---SFSYGSLSTDTLIMETVVGGKPVTVQD 214

Query: 148 VIFGCGHKNLA-SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
             FGC   +L   PT  S   GI+GL  G  +L  Q+G     KFS+C PD+ SS +N  
Sbjct: 215 FAFGCAQGDLELVPTGAS---GILGLNAGKMALPMQLGQRFGWKFSHCFPDR-SSHLNST 270

Query: 207 GIVAGAG-------------VVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
           G+V                  ++   + R  Y+++L+ +S+ +  L F+   +  + +D+
Sbjct: 271 GVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGS-VVILDS 329

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP--VKGVGAEPGFSDV-LCYNISS------QPKF 304
           G   +     +HS L+      +K +P  +K +  +  F D+  C+ +S+          
Sbjct: 330 GSSFSSFVRPFHSQLREA---FLKHRPPSLKHLEGD-SFGDLGTCFKVSNDDIDELHRTL 385

Query: 305 PEVTIHFR-GADVK------LSPSNLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLI 355
           P +++ F  G  +       L P   F+N     MC AF  G  N   V G   Q N  +
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARFQNHVK--MCFAFEDGGPNPVNVIGNYQQQNLWV 443

Query: 356 GYDIEQAMVSFKPSRCT 372
            YDI+++ V F  + C 
Sbjct: 444 EYDIQRSRVGFARASCV 460


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 165/367 (44%), Gaps = 38/367 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTP  D    VDTGSD  W  C     CP      +  P +D   SST  S+
Sbjct: 84  LYFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSV 142

Query: 91  SCSSSQCAVVT--SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LPVE 144
           SCS + C+ V   S C  G  C Y  +YG G   S ++G L  + +  +  +G       
Sbjct: 143 SCSDNFCSYVNQRSECHSGSTCQYVIMYGDG---SSTNGYLVKDVVHLDLVTGNRQTGST 199

Query: 145 MPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLPDQGSS 201
              +IFGCG K       S +   GI+G G  NSS ISQ+ +   +   F++CL +    
Sbjct: 200 NGTIIFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGG 259

Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
            I   G V    V +TP++ +  HY ++L AI VGN  LE     F S     + +D+G 
Sbjct: 260 GIFAIGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGT 319

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVG-AEPGFSDVLCYNISSQ-PKFPEVTIHF-R 312
               LP   ++ L   ++ ++ + P   +   +  F+   C++ + +  +FP VT  F +
Sbjct: 320 TLVYLPDAVYNPL---LNEILASHPELTLHTVQESFT---CFHYTDKLDRFPTVTFQFDK 373

Query: 313 GADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQAMVS 365
              + + P      + ++  C  +       +GG +  + G +   N L+ YDIE  ++ 
Sbjct: 374 SVSLAVYPREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIG 433

Query: 366 FKPSRCT 372
           +    C+
Sbjct: 434 WTNHNCS 440


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 111/344 (32%), Positives = 156/344 (45%), Gaps = 39/344 (11%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-------TSNC 104
           VDT SD  W QC PCP+  C+ Q   L+DP KS       CSS QC  +       T   
Sbjct: 178 VDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTGAG 237

Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
           + G C Y  LY  G   S +SG   ++ LT N+     V      FGC H  L   + ++
Sbjct: 238 NTGTCQYRVLYPDG---SGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNN 292

Query: 165 KQTGIIGLGPGNSSLISQM-GTSIAGK-FSYCLPDQGSSK--INFG-GIVAGAGVVSTPL 219
           K  G + LG G  SL SQ  GT   G  FSYCLP  GS K  ++ G    A +    TP+
Sbjct: 293 KTAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPM 352

Query: 220 IIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
           +        Y + L  I V  QRL    +    N  +D+  + T LP   +  L++    
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
            ++A   + V A  G  D  CY+ +  P  + P+VT+ F R A V+L PS +  +     
Sbjct: 413 QMRA--YRAV-APKGQLDT-CYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD----- 463

Query: 332 MCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            C AF   NAN     + G + Q    + Y+++ A V F+ + C
Sbjct: 464 SCLAF-APNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 152/370 (41%), Gaps = 44/370 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+    +G PP      +DTGS   WTQC  C    C +Q+ P F+   S ++  + C  
Sbjct: 86  YIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQD 145

Query: 95  SQCA--VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             CA   +     +G C++   YG G    F    L T+  TF S          + FGC
Sbjct: 146 KACAGNYLHFCALDGTCTFRVTYGAGGIIGF----LGTDAFTFQSGGA------TLAFGC 195

Query: 153 -GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGG 207
                 A+P      +G+IGLG G  SL SQ G   A +FSYCL     + G+S   F G
Sbjct: 196 VSFTRFAAPDVLHGASGLIGLGRGRLSLASQTG---AKRFSYCLTPYFHNNGASSHLFVG 252

Query: 208 IVA----GAGVVSTPLII---RDH-----YYLSLEAISVGNQRLEFVSSS---------- 245
             A    G G V +   +   +D+     YYL L  I+VG  +L   S++          
Sbjct: 253 AAASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGF 312

Query: 246 -TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
             G + +D+G   T L  + +  L   ++  +    V   G + G   +           
Sbjct: 313 WEGGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVV 372

Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
           P + +HF  GAD+ L P N +  +     C A   G    + G   Q N  I +D+    
Sbjct: 373 PTLVLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQSIIGNFQQQNMHILFDVGGGR 432

Query: 364 VSFKPSRCTN 373
           +SF+ + C+ 
Sbjct: 433 LSFQNADCST 442


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 177/383 (46%), Gaps = 60/383 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +GTPP      +DTGSD  W QC PC   DCF Q    +DPK S+++ +I+C+ 
Sbjct: 162 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITCND 219

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN--STSGLPVE- 144
            +C++++S      C   +  C Y + YG     S ++G+ A ET T N  +T G   E 
Sbjct: 220 PRCSLISSPEPPVQCKSDNQSCPYFYWYGD---RSNTTGDFAVETFTVNLTTTEGRSSEY 276

Query: 145 -MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
            + N++FGCGH N    +  S   G+       S   SQ+ +     FSYCL D+     
Sbjct: 277 KVENMMFGCGHWNRGLFSGASGLLGLGRGPLSFS---SQLQSLYGHSFSYCLVDRNSDTN 333

Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
            SSK+ FG    ++    +  T  +      +   YY+ +++I VG + L+ +   T NI
Sbjct: 334 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALD-IPEETWNI 392

Query: 250 --------FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYN 297
                    +D+G   +      +  +K+  +  +K   +        F D  VL  C+N
Sbjct: 393 SPDGAGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLV-------FRDFPVLDPCFN 445

Query: 298 IS----SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANI--VYGRIMQ 350
           +S    +    PE+ I F  GA       N F  +S++++C A  G   +   + G   Q
Sbjct: 446 VSGIEENNIHLPELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQ 505

Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
            NF I YD + + + F P++C +
Sbjct: 506 QNFHILYDTKMSRLGFTPTKCAD 528


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 160/371 (43%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++S +Y ++ C +
Sbjct: 128 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 185

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ A+ETLTF   +     +  V  G
Sbjct: 186 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 238

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
           CGH N     + S   G+        S  SQ+  S    FSYCL D+          SS 
Sbjct: 239 CGHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSST 295

Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
           + FG     A   +  TP+     +   YY+ L   SVG  R++ VS S          G
Sbjct: 296 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 355

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
            + +D+G   T L    +  ++    +  +A  V G+   PG   +   CYN+S +   K
Sbjct: 356 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 410

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
            P V++H   GA V L P N    + +    C A  G +  + + G I Q  F + +D +
Sbjct: 411 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 470

Query: 361 QAMVSFKPSRC 371
              V F P  C
Sbjct: 471 AQRVGFVPKSC 481


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 108/371 (29%), Positives = 160/371 (43%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++S +Y ++ C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 179

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ A+ETLTF   +     +  V  G
Sbjct: 180 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 232

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
           CGH N     + S   G+        S  SQ+  S    FSYCL D+          SS 
Sbjct: 233 CGHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSST 289

Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
           + FG     A   +  TP+     +   YY+ L   SVG  R++ VS S          G
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 349

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
            + +D+G   T L    +  ++    +  +A  V G+   PG   +   CYN+S +   K
Sbjct: 350 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 404

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
            P V++H   GA V L P N    + +    C A  G +  + + G I Q  F + +D +
Sbjct: 405 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 464

Query: 361 QAMVSFKPSRC 371
              V F P  C
Sbjct: 465 AQRVGFVPKSC 475


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 160/356 (44%), Gaps = 54/356 (15%)

Query: 45  PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC 104
           P +I   ++  S  TWTQC+PC  + C K     FDP  S TY+  SC  S         
Sbjct: 86  PQEILAEMNPDS-ITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPSTVGN----- 137

Query: 105 SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS 164
                +Y+  YG     S S GN   +T+T   +       P   FGCG  N     S +
Sbjct: 138 -----TYNMTYGD---KSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185

Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------SKINFGGIVAG 211
              G++GLG G  S +SQ  +     FSYCLP++ S             S + F  +V G
Sbjct: 186 D--GMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNG 243

Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNIFVDTGVLRTLLPLEYHSNL 268
            G  ++ L    +Y++ L  ISVGN+RL   SS   S G I +D+G + T LP   +S L
Sbjct: 244 PG--TSGLEESGYYFVKLLDISVGNKRLNVPSSVFASPGTI-IDSGTVITCLPQRAYSAL 300

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FPEVTIHF-RGADVKLSPSNL 323
            +     +   P+     + G  D+L  CYN+S +     PE+ +HF  GADV+L+   +
Sbjct: 301 TAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV 358

Query: 324 FRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
                   +C AF G + +       + G   Q++  + YDI+   + F  + C+ 
Sbjct: 359 IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 51/368 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSIS 91
           Y M +S+GTPPV    ++DTGS  +W QC+ C ++ C+ Q      +F+P  SSTY+ + 
Sbjct: 25  YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVG 83

Query: 92  CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           CS+  C        V   C E D  C YS  YG G Y   S G L  + LT  S   +  
Sbjct: 84  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI-- 138

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGS 200
              N IFGCG  NL +  +     GIIG G  + S  +Q+   +    FSYC P   +  
Sbjct: 139 --DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 192

Query: 201 SKINFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
             +  G       ++ T LI  DH   Y +    + V   RLE     ++S  T    VD
Sbjct: 193 GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVD 249

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCY----NISSQPKFPEV 307
           +G   T +       L   M+  ++A+     G   G+ +  +C+      ++   FP V
Sbjct: 250 SGTADTYILSPVFDALDKAMTKEMQAK-----GYTRGWDERRICFISNSGSANWNDFPTV 304

Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAM 363
            +    + +KL   N F   S+ ++CS F   +A +    + G     +F + +DI+   
Sbjct: 305 EMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMN 364

Query: 364 VSFKPSRC 371
             FK   C
Sbjct: 365 FGFKARAC 372


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 161/368 (43%), Gaps = 51/368 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSIS 91
           Y M +S+GTPPV    ++DTGS  +W QC+ C ++ C+ Q      +F+P  SSTY+ + 
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVG 64

Query: 92  CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           CS+  C        V   C E D  C YS  YG G Y   S G L  + LT  S   +  
Sbjct: 65  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI-- 119

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGS 200
              N IFGCG  NL +  +     GIIG G  + S  +Q+   +    FSYC P   +  
Sbjct: 120 --DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENE 173

Query: 201 SKINFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
             +  G       ++ T LI  DH   Y +    + V   RLE     ++S  T    VD
Sbjct: 174 GSLTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVD 230

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD-VLCY----NISSQPKFPEV 307
           +G   T +       L   M+  ++A+     G   G+ +  +C+      ++   FP V
Sbjct: 231 SGTADTYILSPVFDALDKAMTKEMQAK-----GYTRGWDERRICFISNSGSANWNDFPTV 285

Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAM 363
            +    + +KL   N F   S+ ++CS F   +A +    + G     +F + +DI+   
Sbjct: 286 EMKLIRSTLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMN 345

Query: 364 VSFKPSRC 371
             FK   C
Sbjct: 346 FGFKARAC 353


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 116/381 (30%), Positives = 174/381 (45%), Gaps = 61/381 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  ++DP+ S +   +
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148

Query: 91  SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +C    C      V+ S  S   C YS  YG G   S ++G   T+ L +N  SG     
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205

Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
           P   +V FGCG K   +L S  S+    GI+G G  NSS++SQ+  + AGK    F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261

Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
                  +N GGI A   VV     +TPL+    HY + L+ I VG   L      F S 
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSG 316

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
           ++    +D+G     +P   +  L +++ +  +   V+ +       D  C  Y+ S   
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370

Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQIN 352
            FPEVT HF G DV L  SP + LF+N    + C  F+ G          ++ G ++  N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSN 428

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
            L+ YD+E   + +    C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 116/381 (30%), Positives = 174/381 (45%), Gaps = 61/381 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  ++DP+ S +   +
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148

Query: 91  SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +C    C      V+ S  S   C YS  YG G   S ++G   T+ L +N  SG     
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205

Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
           P   +V FGCG K   +L S  S+    GI+G G  NSS++SQ+  + AGK    F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261

Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLE-----FVSS 244
                  +N GGI A   VV     +TPL+    HY + L+ I VG   L      F S 
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSG 316

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
           ++    +D+G     +P   +  L +++ +  +   V+ +       D  C  Y+ S   
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370

Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNAN-------IVYGRIMQIN 352
            FPEVT HF G DV L  SP + LF+N    + C  F+ G          ++ G ++  N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSN 428

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
            L+ YD+E   + +    C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 164/373 (43%), Gaps = 58/373 (15%)

Query: 35  YLMHLSIGTP-----PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
           Y+  +++GTP       +   S D GSD TW QC PC    C+ Q  P+++  KSS+ + 
Sbjct: 125 YIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVYNRLKSSSASD 182

Query: 90  ISCSSSQCAVVTSN--CSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           + C +  C  + S+  C +   +C Y   YG G   S S+G+   ETLTF       V +
Sbjct: 183 VGCYAPACRALGSSGGCVQFLNECQYKVEYGDG---SSSAGDFGVETLTFPPG----VRV 235

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
           P V  GCG  N       +   GI+GLG G+ S  SQ+       FSYCL  QG    SS
Sbjct: 236 PGVAIGCGSDNQG--LFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQGTGGRSS 293

Query: 202 KINFGGIVAGAGVVS---------TPLIIRDHYYLSLEAISVGNQRLEFVSSST------ 246
            + FG   +     +         T   +   YY+ L  ISVG  R+  V+ S       
Sbjct: 294 TLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTESDLRLDPS 353

Query: 247 ---GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA-EPG----FSDVLCYNI 298
              G + VD+G   T L    ++  +    +  +   VK +G   PG    F D    ++
Sbjct: 354 TGHGGVIVDSGTAVTRLSGPAYAAFR----DAFRVAAVKELGWPSPGGPFAFFDTCYSSV 409

Query: 299 SSQ--PKFPEVTIHFRGA-DVKLSPSNLFRNISDE--IMCSAFRG-GNANI-VYGRIMQI 351
             +   K P V++HF G  +VKL P N    +      MC AF G G+  + + G I   
Sbjct: 410 RGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQ 469

Query: 352 NFLIGYDIEQAMV 364
            F + YD++   V
Sbjct: 470 GFRVVYDVDGQRV 482


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 109/360 (30%), Positives = 169/360 (46%), Gaps = 50/360 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S GTP V     +DTGSD +W QC+PC    CF Q+ PL+DP  SSTY+++ C+S
Sbjct: 79  YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 138

Query: 95  SQCAVVT-----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  +      S C+ G  C ++  Y  G   + + G  + + LT     G  V+  N 
Sbjct: 139 DVCKKLAADAYGSGCTSGKQCGFAISYADG---TSTVGAYSQDKLTL--APGAIVQ--NF 191

Query: 149 IFGCGH-KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
            FGCGH K+      D    G++GLG     L   +G    G FSYCLP   SSK  F  
Sbjct: 192 YFGCGHGKHAVRGLFD----GVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLA 242

Query: 208 IVAG---AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
           + AG   +G V TP+           ++L  I+VG ++L+   S+ +G + VD+G + T 
Sbjct: 243 LGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITG 302

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADV 316
           L    +  L+S     ++A  +      P      CYN++       P++ + F  GA +
Sbjct: 303 LQSTAYRALRSAFRKAMEAYRL-----LPNGDLDTCYNLTGYKNVVVPKIALTFTGGATI 357

Query: 317 KLS-PSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L  P+ +  N      C AF      G+A ++ G + Q  F + +D   +   F+   C
Sbjct: 358 NLDVPNGILVN-----GCLAFAESGPDGSAGVL-GNVNQRAFEVLFDTSTSKFGFRAKAC 411


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 104/388 (26%), Positives = 166/388 (42%), Gaps = 59/388 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP   +  VDTGSD  W     C  CP       +   +DPK SS+ +++
Sbjct: 86  LYFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTV 145

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    CA        G      C YS +YG G   S ++G   T+ L F+  +G     
Sbjct: 146 SCDQGFCAATYGGKLPGCTANVPCEYSVMYGDG---SSTTGFFITDALQFDQVTGDGQTQ 202

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPD-Q 198
           P    + FGCG +       S+    GI+G G  N+S++SQ+  +   K  F++CL   +
Sbjct: 203 PGNATITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIK 262

Query: 199 GSSKINFGGIVA---------GAGVVSTPLII-------RDHYYLSLEAISVGNQRLE-- 240
           G      G +V            G+++ PL +       R HY ++L++I VG   L+  
Sbjct: 263 GGGIFAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLP 322

Query: 241 ---FVSSSTGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
              F +       +D+G   T LP L +   +  V S   K + +    A     D LC+
Sbjct: 323 AHVFETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFS---KHRDI----AFHNLQDFLCF 375

Query: 297 NISS--QPKFPEVTIHFRGADVKLS--PSNLFRNISDEIMCSAFRGGNAN-------IVY 345
             S      FP +T HF   D+ L   P   F    ++I C  F+ G          ++ 
Sbjct: 376 QYSGSVDDGFPTITFHFE-DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLM 434

Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           G ++  N L+ YD+E  ++ +    C++
Sbjct: 435 GDLVLSNKLVVYDLENQVIGWTDYNCSS 462


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 161/370 (43%), Gaps = 54/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC-- 92
           Y + + +G+P       VDTGS  +W QC+PC  + C  QE P+F+P  S TY ++ C  
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPC-TIYCHIQEDPVFNPSASKTYKTVPCSS 161

Query: 93  ----SSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
               S     +    CS+    C Y   YG    +SFS G L+ + LT   +  L     
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGD---SSFSLGYLSQDVLTLTPSQTL----S 214

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---------PD 197
           + ++GCG  N        +  GIIGL     S++SQ+       FSYCL         P 
Sbjct: 215 SFVYGCGQDNQG---LFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK 271

Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVD 252
           +G   I    +   +    TPL+   +    Y++ LE+I+V  + L   +SS      +D
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIID 331

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGV--------GAEPGFSDVLCYNISSQP 302
           +G + T LP   ++ LK+    ++  K Q   G+        G+  G S+V         
Sbjct: 332 SGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA-------- 383

Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
             P++ I F+ GAD++L   N    +   I C A  G ++  + G   Q    + YD+  
Sbjct: 384 --PDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGN 441

Query: 362 AMVSFKPSRC 371
           + V F P  C
Sbjct: 442 SRVGFAPGGC 451


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 102/413 (24%), Positives = 179/413 (43%), Gaps = 87/413 (21%)

Query: 22  IIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDP 81
           ++ +A ++S    YL+ L +GTP      ++DT SD  WTQC+PC  + C+KQ  P+F+P
Sbjct: 75  VVAEAPVLSAGGEYLVKLGLGTPQHCFTAAIDTASDLIWTQCQPC--VKCYKQLDPVFNP 132

Query: 82  KKSSTYNSISCSSSQC-AVVTSNCS-EGD------CSYSFLYGRGAYASFSSGNLATETL 133
             S++Y  + C+S  C  + T  C+ +GD      C Y++ YG  A    + G LA + L
Sbjct: 133 VASTSYAVVPCNSDTCDELDTHRCARDGDSDDEDACQYTYSYGGNAT---TRGILAVDRL 189

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
                         V+FGC   ++  P    + +G++GLG G  SL+SQ+      +F Y
Sbjct: 190 AIGDDV-----FRGVVFGCSSSSVGGPP--PQVSGVVGLGRGALSLVSQLSVR---RFMY 239

Query: 194 CLPDQGSSKINFGGIVAGAGVVSTPLIIRD-----------------HYYLSLEAISVGN 236
           CLP   S   + G +V GA   +T   +R+                 +YYL+L+ IS+G+
Sbjct: 240 CLPPPVSR--SAGRLVLGADAAAT---VRNASERVVVPMSTGSRYPSYYYLNLDGISIGD 294

Query: 237 QRLEFVSSSTGN--------------------------------IFVDTGVLRTLLPLEY 264
           + + F S +  N                                + +D     T L    
Sbjct: 295 RAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESL 354

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS-----SQPKFPEVTIHFRGADVKLS 319
           +  +   +   I+    +G G++ G    LC+ +      S+   P V++ F G  ++L 
Sbjct: 355 YEEMVDDLEEEIRLP--RGSGSDLGLD--LCFILPEGVPMSRVYAPPVSLAFEGVWLRLD 410

Query: 320 PSNLF-RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              +F  + +  +MC      +   + G   Q N  + Y++ +  ++F  + C
Sbjct: 411 KEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 161/370 (43%), Gaps = 54/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISC-- 92
           Y + + +G+P       VDTGS  +W QC+PC  + C  QE P+F+P  S TY ++ C  
Sbjct: 103 YYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPC-TIYCHIQEDPVFNPSASKTYKTVPCSS 161

Query: 93  ----SSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
               S     +    CS+    C Y   YG    +SFS G L+ + LT   +  L     
Sbjct: 162 SQCSSLKSATLNEPTCSKQSNACVYKASYGD---SSFSLGYLSQDVLTLTPSQTL----S 214

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---------PD 197
           + ++GCG  N        +  GIIGL     S++SQ+       FSYCL         P 
Sbjct: 215 SFVYGCGQDNQG---LFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPK 271

Query: 198 QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSSTG-NIFVD 252
           +G   I    +   +    TPL+   +    Y++ LE+I+V  + L   +SS      +D
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIID 331

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGV--------GAEPGFSDVLCYNISSQP 302
           +G + T LP   ++ LK+    ++  K Q   G+        G+  G S+V         
Sbjct: 332 SGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVA-------- 383

Query: 303 KFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
             P++ I F+ GAD++L   N    +   I C A  G ++  + G   Q    + YD+  
Sbjct: 384 --PDIRIIFKGGADLQLKGHNSLVELETGITCLAMAGSSSIAIIGNYQQQTVKVAYDVGN 441

Query: 362 AMVSFKPSRC 371
           + V F P  C
Sbjct: 442 SRVGFAPGGC 451


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 113/381 (29%), Positives = 183/381 (48%), Gaps = 66/381 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G+PP +I   +DTGS+ +W  C+  P L        +F+P  SSTY+ + CSS  C 
Sbjct: 65  LAVGSPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 118

Query: 99  VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
             T +    +  D    F +   +YA  +S  GNLA +T    S     V  P  +FGC 
Sbjct: 119 TRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGS-----VTRPGTLFGCM 173

Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
              L+S +  D+K TG++G+  G+ S ++Q+G S   KFSYC+    SS I   G  + +
Sbjct: 174 DSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGILLLGDASYS 230

Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
               +  TPL++         R  Y + LE I VG++ L      FV   T  G   VD+
Sbjct: 231 WLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 290

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI--SSQPKF--- 304
           G   T L    ++ LK+    + + + V  +  +P F    +  LCY +  S++P F   
Sbjct: 291 GTQFTFLMGPVYTALKNEF--IAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTRPNFTGL 348

Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
           P +++ FRGA++ +S   L   ++       +E+ C  F  GN+++      V G   Q 
Sbjct: 349 PVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 406

Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
           N  + +D+ ++ V F  + RC
Sbjct: 407 NVWMEFDLAKSRVGFAGNVRC 427


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 167/368 (45%), Gaps = 52/368 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IG   + +   VDTGSD TW QC+PC    C+ Q+ PLF+P  S +Y +I C+S
Sbjct: 67  YIVTVEIGGRNMTVI--VDTGSDLTWVQCQPCRL--CYNQQDPLFNPSGSPSYQTILCNS 122

Query: 95  SQC----------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S C           V  SN     C+Y   YG G+Y   + G+L  E L   +T      
Sbjct: 123 STCQSLQYATGNLGVCGSNTPT--CNYVVNYGDGSY---TRGDLGMEQLNLGTT-----H 172

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---QGSS 201
           + N IFGCG  N       S   G++GLG  + SL+SQ      G FSYCLP      S 
Sbjct: 173 VSNFIFGCGRNNKGLFGGAS---GLMGLGKSDLSLVSQTSAIFEGVFSYCLPTTAADASG 229

Query: 202 KINFGG---IVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
            +  GG   +      +S   +I +      Y+L+L  IS+G   L+  +     I +D+
Sbjct: 230 SLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYRQSGILIDS 289

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIH 310
           G + T LP   + +LK+         P     + P FS +  C+N++   +   P + + 
Sbjct: 290 GTVITRLPPPVYRDLKAEFLKQFSGFP-----SAPPFSILDTCFNLNGYDEVDIPTIRMQ 344

Query: 311 FRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           F G A++ +  + +F  +  +     +  ++    +   + G   Q N  + Y+ +++ +
Sbjct: 345 FEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404

Query: 365 SFKPSRCT 372
            F    C+
Sbjct: 405 GFAAEACS 412


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 176/382 (46%), Gaps = 58/382 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +GTPP      +DTGSD  W QC PC   DCF Q    +DPK S+++ +I+C+ 
Sbjct: 160 YFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITCND 217

Query: 95  SQCAVVTS-----NCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS----GLPV 143
            +C++++S      C   +  C Y + YG     S ++G+ A ET T N T+        
Sbjct: 218 PRCSLISSPDPPVQCESDNQSCPYFYWYGD---RSNTTGDFAVETFTVNLTTTEGGSSEY 274

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
           ++ N++FGCGH N    +  S   G+       S   SQ+ +     FSYCL D+     
Sbjct: 275 KVGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFS---SQLQSLYGHSFSYCLVDRNSNTN 331

Query: 200 -SSKINFG---GIVAGAGVVSTPLI------IRDHYYLSLEAISVGNQRLE-------FV 242
            SSK+ FG    ++    +  T  +      +   YY+ +++I VG + L+         
Sbjct: 332 VSSKLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNIS 391

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNI 298
           S   G   +D+G   +      +  +K+  +  +K          P F D  VL  C+N+
Sbjct: 392 SDGDGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKEN-------YPIFRDFPVLDPCFNV 444

Query: 299 S----SQPKFPEVTIHFRGADVKLSPS-NLFRNISDEIMCSAFRGGNANI--VYGRIMQI 351
           S    +    PE+ I F    V   P+ N F  +S++++C A  G   +   + G   Q 
Sbjct: 445 SGIEENNIHLPELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQ 504

Query: 352 NFLIGYDIEQAMVSFKPSRCTN 373
           NF I YD +++ + F P++C +
Sbjct: 505 NFHILYDTKRSRLGFTPTKCAD 526


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 107/371 (28%), Positives = 160/371 (43%), Gaps = 51/371 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++S +Y ++ C +
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWLQCAPCRH--CYAQSGRVFDPRRSRSYAAVDCVA 179

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ A+ETLTF   +     +  V  G
Sbjct: 180 PICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIG 232

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------GSSK 202
           CGH N     + S   G+        S  +Q+  S    FSYCL D+          SS 
Sbjct: 233 CGHDNEGLFIAASGLLGLGRG---RLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSST 289

Query: 203 INFGGIVAGAGVVS--TPL----IIRDHYYLSLEAISVGNQRLEFVSSS---------TG 247
           + FG     A   +  TP+     +   YY+ L   SVG  R++ VS S          G
Sbjct: 290 VTFGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRG 349

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--K 303
            + +D+G   T L    +  ++    +  +A  V G+   PG   +   CYN+S +   K
Sbjct: 350 GVILDSGTSVTRLARPVYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVK 404

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
            P V++H   GA V L P N    + +    C A  G +  + + G I Q  F + +D +
Sbjct: 405 VPTVSMHLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGD 464

Query: 361 QAMVSFKPSRC 371
              V F P  C
Sbjct: 465 AQRVGFVPKSC 475


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 160/366 (43%), Gaps = 36/366 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    VDTGSD  W  C+PCP+            LFD   SST   +
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132

Query: 91  SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C    C+ ++ + S      CSY  +Y   A  S S G    + LT    +G     P 
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVY---ADESTSDGKFIRDMLTLEQVTGDLKTGPL 189

Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
              V+FGCG          DS   G++G G  N+S++SQ+  +   K  FS+CL +    
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249

Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
            I   G+V    V +TP++    HY + L  + V    L+   S    G   VD+G    
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLA 309

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
             P   +    S++  ++  QPVK    E  F    C++ S+     FP V+  F  + V
Sbjct: 310 YFPKVLYD---SLIETILARQPVKLHIVEETFQ---CFSFSTNVDEAFPPVSFEFEDS-V 362

Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSFK 367
           KL+  P +    + +E+ C  ++ G          I+ G ++  N L+ YD++  ++ + 
Sbjct: 363 KLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGWA 422

Query: 368 PSRCTN 373
              C++
Sbjct: 423 DHNCSS 428


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 173/355 (48%), Gaps = 43/355 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ + +G+P V     +DTGSD +W QC+PC +  C  Q   LFDP  SSTY++ SC+S
Sbjct: 127 YLITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQ--CHSQADSLFDPSSSSTYSAFSCTS 184

Query: 95  SQCAVVTSN-CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           + CA +    CS   C Y+  YG G   S  SG  +++TL   S++     + N  FGC 
Sbjct: 185 AACAQLRQRGCSSSQCQYTVKYGDG---STGSGTYSSDTLALGSST-----VENFQFGCS 236

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-GSSKINFGGIVAGA 212
                +   D +  G++GLG G  SL +Q   +    FSYCLP   GSS     G     
Sbjct: 237 QSESGNLLQD-QTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPGSSGFLTLGASTSG 295

Query: 213 GVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHS 266
            VV TP++    +  +Y + L+AI VG ++L   +S  S G+I +D+G + T LP   +S
Sbjct: 296 FVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSI-MDSGTIITRLPRTAYS 354

Query: 267 NLKSVMSNMIKAQPVKGVGAEP-GFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSN 322
            L S     +K  P     A+P G  D  C++ S Q     P V + F  GA V L+   
Sbjct: 355 ALSSAFKAGMKQYPP----AQPMGIFDT-CFDFSGQSSVSIPTVALVFSGGAVVDLA--- 406

Query: 323 LFRNISDEIM---CSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                SD I+   C AF   + +    + G + Q  F + YD+    V FK   C
Sbjct: 407 -----SDGIILGSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 167/359 (46%), Gaps = 48/359 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S GTP V     +DTGSD +W QC+PC    CF Q+ PL+DP  SSTY+++ C+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172

Query: 95  SQCAVVT-----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  +      S C+ G  C ++  Y  G   + + G  + + LT     G  V+  N 
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADG---TSTVGAYSQDKLTL--APGAIVQ--NF 225

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
            FGCGH   A         G++GLG     L   +G    G FSYCLP   SSK  F  +
Sbjct: 226 YFGCGHGKHA---VRGLFDGVLGLG----RLRESLGARYGGVFSYCLPSV-SSKPGFLAL 277

Query: 209 VAG---AGVVSTPLII----RDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLL 260
            AG   +G V TP+           ++L  I+VG ++L+   S+ +G + VD+G + T L
Sbjct: 278 GAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGL 337

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADVK 317
               +  L+S     ++A  +      P      CYN++       P++ + F  GA + 
Sbjct: 338 QSTAYRALRSAFRKAMEAYRLL-----PNGDLDTCYNLTGYKNVVVPKIALTFTGGATIN 392

Query: 318 LS-PSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L  P+ +  N      C AF      G+A ++ G + Q  F + +D   +   F+   C
Sbjct: 393 LDVPNGILVN-----GCLAFAESGPDGSAGVL-GNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 173/364 (47%), Gaps = 49/364 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+ +++GTP + +  ++DTGSD TWTQCEPC    C++Q    FDP+KSS+Y ++SCSS
Sbjct: 45  YLVKMALGTPKLSLSLALDTGSDITWTQCEPCVG-SCYRQAQTKFDPRKSSSYKNVSCSS 103

Query: 95  SQCAVVTSN-----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           S C ++T +     C    C Y   YG G+Y   S G  ATE LT + +      + N +
Sbjct: 104 SSCRIITDSGGARGCVSSTCIYKVQYGDGSY---SVGFFATEKLTISPSD----VISNFL 156

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSS---K 202
           FGCG +N       + + G I    G       +    + K    F+YCLP   SS    
Sbjct: 157 FGCGQQN-------AGRFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGH 209

Query: 203 INFGGIVAGAGVVSTPL--IIRD--HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVL 256
           +  GG V  + V  TPL    ++   Y + ++ +SVG   L   +S  S     +D+G +
Sbjct: 210 LTLGGQVPKS-VKFTPLSPAFKNTPFYGIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTV 268

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKF--PEVTIHFRG 313
            T L    +S L S    ++K  P        GFS +  CY+ S       P ++  F+G
Sbjct: 269 ITRLQPTVYSALSSKFQQLMKDYP-----KTDGFSILDTCYDFSGNESISVPRISFFFKG 323

Query: 314 A---DVKLSPSNLFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFK 367
               D+K        N  D++ C AF   + +   +V+G   Q  + + +D+ +  + F 
Sbjct: 324 GVEVDIKFFGILTVINAWDKV-CLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFA 382

Query: 368 PSRC 371
           PS C
Sbjct: 383 PSGC 386


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 170/379 (44%), Gaps = 57/379 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--PPLFDPKKSSTYNS 89
           D  + + + IGTPP      VDTGSD  WTQC+         +   PP++DP +SST+  
Sbjct: 88  DQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAF 147

Query: 90  ISCSSSQC---AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           + CS   C        NC S+  C Y  +YG  A    + G LA+ET TF +   + + +
Sbjct: 148 LPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA----AVGVLASETFTFGARRAVSLRL 203

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSK 202
               FGCG  +  S       TGI+GL P + SLI+Q+      +FSYCL    D+ +S 
Sbjct: 204 G---FGCGALSAGSLIG---ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSP 254

Query: 203 INFGGI-----------VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST----- 246
           + FG +           +    +VS P +   +YY+ L  IS+G++RL   ++S      
Sbjct: 255 LLFGAMADLSRHKTTRPIQTTAIVSNP-VKTVYYYVPLVGISLGHKRLAVPAASLAMRPD 313

Query: 247 --GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-- 302
             G   VD+G     L       +K  + ++++  PV     E      LC+ +  +   
Sbjct: 314 GGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAA 369

Query: 303 ------KFPEVTIHFRGADVKLSPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQIN 352
                 + P + +HF G    + P  N F+     +MC A      G+   + G + Q N
Sbjct: 370 AAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQN 429

Query: 353 FLIGYDIEQAMVSFKPSRC 371
             + +D++    SF P++C
Sbjct: 430 MHVLFDVQHHKFSFAPTQC 448


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 168/367 (45%), Gaps = 47/367 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP       VDTGS  +W QC+PC  + C  Q  P+F P  S TY ++ CSS
Sbjct: 113 YYVKIGLGTPAKYFSMIVDTGSSLSWLQCQPC-VIYCHVQVDPIFTPSTSKTYKALPCSS 171

Query: 95  SQCAVVTS------NCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SQC+ + S       CS   G C Y   YG     SFS G L+ + LT       P E P
Sbjct: 172 SQCSSLKSSTLNAPGCSNATGACVYKASYGD---TSFSIGYLSQDVLTLT-----PSEAP 223

Query: 147 N--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
           +   ++GCG  N        + +GIIGL     S++ Q+       FSYCLP   S+  +
Sbjct: 224 SSGFVYGCGQDNQG---LFGRSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNS 280

Query: 205 F---GGIVAGAGVVS------TPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNI-- 249
               G +  GA  ++      TPL+    I   Y+L L  I+V  + L  VS+S+ N+  
Sbjct: 281 SSLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLG-VSASSYNVPT 339

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPE 306
            +D+G + T LP+  ++ LK     ++     K     PGFS +  C+  ++      PE
Sbjct: 340 IIDSGTVITRLPVAVYNALKKSFVLIMS----KKYAQAPGFSILDTCFKGSVKEMSTVPE 395

Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
           + I FR GA ++L   N    I     C A    +  I + G   Q  F + YD+    +
Sbjct: 396 IQIIFRGGAGLELKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKI 455

Query: 365 SFKPSRC 371
            F P  C
Sbjct: 456 GFAPGGC 462


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 170/375 (45%), Gaps = 49/375 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W     C+ CP       +  L+D K S+T +++
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213

Query: 91  SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            C  + C++       C  G  C YS LYG G   S ++G    + + +N  SG     P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 270

Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
               V+FGCG+K      S S+   GI+G G  NSS++SQ+ +S  +   FS+CL +   
Sbjct: 271 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 327

Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
             ++ GGI A   VV      TPL+  + HY + ++ I VG   L+     F S      
Sbjct: 328 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 385

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
            +D+G      P E +  L   +  ++  QP ++    E  F+   C++ +      FP 
Sbjct: 386 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 439

Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
           VT+HF +   + + P      + +   C  ++   A    G+ + +       N L+ YD
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYD 499

Query: 359 IEQAMVSFKPSRCTN 373
           +E+  + +    C++
Sbjct: 500 LEKQGIGWVEYNCSS 514


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 103/375 (27%), Positives = 170/375 (45%), Gaps = 49/375 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W     C+ CP       +  L+D K S+T +++
Sbjct: 73  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 132

Query: 91  SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            C  + C++       C  G  C YS LYG G   S ++G    + + +N  SG     P
Sbjct: 133 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 189

Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
               V+FGCG+K      S S+   GI+G G  NSS++SQ+ +S  +   FS+CL +   
Sbjct: 190 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 246

Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
             ++ GGI A   VV      TPL+  + HY + ++ I VG   L+     F S      
Sbjct: 247 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 304

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
            +D+G      P E +  L   +  ++  QP ++    E  F+   C++ +      FP 
Sbjct: 305 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 358

Query: 307 VTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
           VT+HF +   + + P      + +   C  ++   A    G+ + +       N L+ YD
Sbjct: 359 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYD 418

Query: 359 IEQAMVSFKPSRCTN 373
           +E+  + +    C++
Sbjct: 419 LEKQGIGWVEYNCSS 433


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 123/393 (31%), Positives = 172/393 (43%), Gaps = 65/393 (16%)

Query: 31  VDDIYLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNS 89
           +D  YL+HLSIGTP P  +  ++DTGSD  WTQC  C    CF Q  P FD   S T  +
Sbjct: 96  IDSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-CHV--CFAQPFPTFDALASQTTLA 152

Query: 90  ISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG------ 140
           + CS   C       S C+  D +  +LY   A  S +SG +  +T TF S  G      
Sbjct: 153 VPCSDPICTSGKYPLSGCTFNDNTCFYLYDY-ADKSITSGRIVEDTFTFRSPQGNNGSKA 211

Query: 141 -LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC---LP 196
              V +PNV FGCG  N       S ++GI G   G  SL SQ+  +   +FS+C   + 
Sbjct: 212 HAGVAVPNVRFGCGQYNKG--IFKSNESGIAGFSRGPMSLPSQLKVA---RFSHCFTAIA 266

Query: 197 DQGSSKINFGGI--------VAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEF----- 241
           D  +S +  GG          A   V STP    +   YYL+L+ I+VG  RL       
Sbjct: 267 DARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAF 326

Query: 242 ----VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
                 S +G   +D+G     LP   + +L++     +K  PV    A    S  LC+ 
Sbjct: 327 AGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAES-TLCFE 384

Query: 298 ISSQPKFP---------EVTIHFRGADVKL-SPSNLFRNISDE---------IMCSAFRG 338
            +     P         +V +H  GAD  L   S +   + DE         +M SA  G
Sbjct: 385 AARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSA--G 442

Query: 339 GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +   + G   Q N  + YD+E+  + F P+RC
Sbjct: 443 DSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARC 475


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 108/353 (30%), Positives = 166/353 (47%), Gaps = 41/353 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IG+P V     +DTGSD +W +C     L        LFDP KS+TY   SCSS
Sbjct: 129 YVITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGL-------TLFDPSKSTTYAPFSCSS 181

Query: 95  SQCAVVTSN---CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + CA + +N   CS   C Y   YG G   S ++G  +++TL  +++      + +  FG
Sbjct: 182 AACAQLGNNGDGCSNSGCQYRVQYGDG---SNTTGTYSSDTLALSASD----TVTDFHFG 234

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIV 209
           C H          K  G++GLG    SL+SQ   +    FSYCLP  ++ S  + FG   
Sbjct: 235 CSHHE--EDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYCLPPTNRTSGFLTFGAPN 292

Query: 210 A-GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPL 262
               G V+TP++        Y + L+ ISVG   L    S  S G++ +D+G + T LP 
Sbjct: 293 GTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVLSNGSV-MDSGTVITWLPR 351

Query: 263 EYHSNLKSVM-SNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFR-GADVKL 318
             +S L S   S+M + +  +   A  G  D  CY+ +       P V++    GA V L
Sbjct: 352 RAYSALSSAFRSSMTRLRHQR--AAPLGILDT-CYDFTGLVNVSIPAVSLVLDGGAVVDL 408

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             + +   I D   C AF   + + + G + Q  F + +D+ Q +  F+   C
Sbjct: 409 DGNGIM--IQD---CLAFAATSGDSIIGNVQQRTFEVLHDVGQGVFGFRSGAC 456


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 134/303 (44%), Gaps = 39/303 (12%)

Query: 44  PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV--- 100
           P V     +DT SD  W QC PCP   C+ Q   L+DP KS +  S +CSS  C  +   
Sbjct: 178 PGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPY 237

Query: 101 -----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
                +S+ S G C Y   Y  G   S +SG L  + L+ + TS    ++P   FGC H 
Sbjct: 238 ANGCSSSSNSAGQCQYRVRYPDG---STTSGTLVADQLSLSPTS----QVPKFEFGCSHA 290

Query: 156 NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINF--------GG 207
              S  S SK  GI+ LG G  SL+SQ  T     FSYC P   S K  F          
Sbjct: 291 ARGS-FSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRSSS 349

Query: 208 IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
             A   ++ TP++    Y + LEAI+V  QRL+   +  + G       V+  L P  Y 
Sbjct: 350 RYAVTPMLKTPML----YQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTVITRLPPTAYQ 405

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHF--RGADVKLSPS 321
           +   +    M   +P     A  G  D  CY+ +  S    P +++ F   GA V+L PS
Sbjct: 406 ALRSAFRDKMSMYRPA----AANGQLDT-CYDFTGVSSIMLPTISLVFDRTGAGVQLDPS 460

Query: 322 NLF 324
            + 
Sbjct: 461 GVL 463


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 114/375 (30%), Positives = 174/375 (46%), Gaps = 53/375 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  +++GTP V+   ++DTGSD TW QC+PC    C+ Q  P+FDP+ S++Y  +   +
Sbjct: 134 YMAKIAVGTPAVEALLAMDTGSDITWLQCQPCRR--CYPQSGPVFDPRHSTSYREMGYDA 191

Query: 95  SQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C  +      +     C Y+  YG     S + G+   ETLTF       V++P++  
Sbjct: 192 PDCQALGRSGGGDAKRMTCVYAVGYGDD--GSTTVGDFIEETLTFAGG----VQVPHMSI 245

Query: 151 GCGHKN---LASPTSDSKQTGIIGLGPGNSSLISQ---MGTSIAGKFSYCLPD------- 197
           GCGH N    A+P +     GI+GLG G  S  SQ   +G ++   FSYCL D       
Sbjct: 246 GCGHDNKGLFAAPAA-----GILGLGRGQISCPSQIAALGYNVT-SFSYCLADFFLSSPG 299

Query: 198 -QGSSKINFG-GIVAGAGVVS-TPLI----IRDHYYLSLEAISVGNQRLEFVS------- 243
              SS +  G G  AG+   S TP +    +   YY+ L  +SVG  R+  V+       
Sbjct: 300 RSVSSTLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTEDDLKLD 359

Query: 244 --SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ 301
             +  G + +D+G   T L    +   +              +G   GF D  CY +  +
Sbjct: 360 PYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDT-CYTMGGR 418

Query: 302 P-KFPEVTIHFRGA-DVKLSPSNLFRNI-SDEIMCSAFRG-GNANI-VYGRIMQINFLIG 356
             K P V++HF G  ++ L P N    + S   +C AF G G+ ++ + G I Q  F + 
Sbjct: 419 AMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVV 478

Query: 357 YDIEQAMVSFKPSRC 371
           Y+I    V F P+ C
Sbjct: 479 YNIGGGRVGFAPNSC 493


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 159/363 (43%), Gaps = 44/363 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +++ +  G+P  +   S+DTGSD +W QC PC    C+KQ  P+FDP KS+TY+++ C  
Sbjct: 161 FVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPC-SGHCYKQHDPVFDPTKSATYSAVPCGH 219

Query: 95  SQCAVVTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QCA     CS  G C Y   YG G   S ++G L+ ETL+ +ST     ++P   FGCG
Sbjct: 220 PQCAAAGGKCSNSGTCLYKVTYGDG---SSTAGVLSHETLSLSSTR----DLPGFAFGCG 272

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVAG 211
             NL          G+     G  SL SQ   +    FSYCLP  D     +  G     
Sbjct: 273 QTNLGEFGGVDGLVGLGR---GALSLPSQAAATFGATFSYCLPSYDTTHGYLTMGSTTPA 329

Query: 212 AG-----VVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
           A      V  T +I ++     Y++ + +I +G   L     V +  G +F D+G + T 
Sbjct: 330 ASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLF-DSGTILTY 388

Query: 260 LPLEYHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA 314
           LP E +++L+      M + +P       P +     CY+ +       P V   F  GA
Sbjct: 389 LPPEAYASLRDRFKFTMTQYKPA------PAYDPFDTCYDFTGHNAIFMPAVAFKFSDGA 442

Query: 315 DVKLSPSNLF---RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
              LSP  +     + +    C AF    + +   + G   Q    + YD+    + F  
Sbjct: 443 VFDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQ 502

Query: 369 SRC 371
             C
Sbjct: 503 FTC 505


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/360 (30%), Positives = 158/360 (43%), Gaps = 40/360 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +++ +  GTP        DTGSD +W QC PC    C+KQ  P+FDP KS+TY+++ C  
Sbjct: 120 FVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPC-SGHCYKQHDPIFDPTKSATYSAVPCGH 178

Query: 95  SQCAVVTSNCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QCA     CS  G C Y   YG G   S ++G L+ ETL+  S   L    P   FGCG
Sbjct: 179 PQCAAAGGKCSSNGTCLYKVQYGDG---SSTAGVLSHETLSLTSARAL----PGFAFGCG 231

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAG 211
             NL          G+IGLG G  SL SQ   S    FSYCLP   +S   +  G     
Sbjct: 232 ETNLG---DFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPA 288

Query: 212 A---GVVSTPLIIRDH----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPL 262
           +   GV  T +I +      Y++ L +I VG   L    +  +     +D+G + T LP 
Sbjct: 289 SGSDGVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPP 348

Query: 263 EYHSNLKSVMS-NMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVK 317
           E ++ L+      M + +P       P +     CY+ + Q     P V+  F  G+   
Sbjct: 349 EAYTALRDRFKFTMTQYKPA------PAYDPFDTCYDFAGQNAIFMPLVSFKFSDGSSFD 402

Query: 318 LSPSNLF---RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           LSP  +     + +    C AF    + +   + G   Q N  + YD+    + F    C
Sbjct: 403 LSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 117/408 (28%), Positives = 171/408 (41%), Gaps = 67/408 (16%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           Q  Q+L    D   P    +  Y AE +           IG PP      +DTGS+  WT
Sbjct: 62  QQQQQLRASGDVSAPVHLATRQYIAEYL-----------IGDPPQRAAALIDTGSNLIWT 110

Query: 62  QC-EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ--CAV--VTSNCSEGDCSYSFLYG 116
           QC   C    C KQ+ P ++  +SST+ ++ C+ S   CA   V     +G C+++  YG
Sbjct: 111 QCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLCGLDGSCTFAASYG 170

Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
            G+      G+L TE  TF S +        + FGC      +  + +  +G+IGLG G 
Sbjct: 171 AGSV----FGSLGTEAFTFQSGA------AKLGFGCVSLTRITKGALNGASGLIGLGRGR 220

Query: 177 SSLISQMGTSIAGKFSYCLP----DQGSSKINFGGIVA-----GAGVVSTPLI------- 220
            SL+SQ G   A KFSYCL     + G+S   F G  A     G  V S P +       
Sbjct: 221 LSLVSQTG---ATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYP 277

Query: 221 IRDHYYLSLEAISVGNQRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLK 269
               YYL L  ISVG  +L   S++           +G + +DTG   T L    +S L 
Sbjct: 278 YSTFYYLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALS 337

Query: 270 SVMSNMIK---AQPVKGVGAEPGFSDVLCYNISSQPK-FPEVTIHF-RGADVKLSPSNLF 324
             ++  +     QP    G +      LC       K  P +  HF  GAD+ +S  + +
Sbjct: 338 DEVARQLNRSLVQPPADTGLD------LCVARQDVDKVVPVLVFHFGGGADMAVSAGSYW 391

Query: 325 RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             +     C     G    V G   Q +  + YDI +  +SF+ + C+
Sbjct: 392 GPVDKSTACMLIEEGGYETVIGNFQQQDVHLLYDIGKGELSFQTADCS 439


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 160/357 (44%), Gaps = 57/357 (15%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC------- 104
           VDTGSD TW QCEPCP   C+ Q  PLFDP  S T+ ++ C S  CA    +        
Sbjct: 198 VDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASLKDATGAPGSC 257

Query: 105 ------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG--HKN 156
                 SE  C Y+  YG G   SFS G LA +TL   +T+    ++   +FGCG  ++ 
Sbjct: 258 ARSAGNSEQRCYYALSYGDG---SFSRGVLAQDTLGLGTTT----KLDGFVFGCGLSNRG 310

Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVS 216
           L   T+     G++GLG  + SL+SQ      G FSYCLP   ++  + G +  G G  S
Sbjct: 311 LFGGTA-----GLMGLGRTDLSLVSQTAARFGGVFSYCLP---ATTTSTGSLSLGPGPSS 362

Query: 217 T------PLIIRD-----HYYLSLE-AISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEY 264
           +        +I D      Y++++  A   G   L       GN+ VD+G + T L    
Sbjct: 363 SFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDSGTVITRLAPSV 422

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADVKLSP 320
           +  +++  +   +        A PGFS +  CY+++ + +   P +T+    GA V +  
Sbjct: 423 YKAVRAEFARRFEYP------AAPGFSILDACYDLTGRDEVNVPLLTLTLEGGAQVTVDA 476

Query: 321 SNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           + +   +  +     +  ++    +   + G   Q N  + YD   + + F    CT
Sbjct: 477 AGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADEDCT 533


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 116/381 (30%), Positives = 173/381 (45%), Gaps = 61/381 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  ++DP+ S +   +
Sbjct: 89  LYFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELV 148

Query: 91  SCSSSQC-----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +C    C      V+ S  S   C YS  YG G   S ++G   T+ L +N  SG     
Sbjct: 149 TCDQQFCVANYGGVLPSCTSTSPCEYSISYGDG---SSTAGFFVTDFLQYNQVSGDGQTT 205

Query: 146 P---NVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCL 195
           P   +V FGCG K   +L S  S+    GI+G G  NSS++SQ+  + AGK    F++CL
Sbjct: 206 PANASVSFGCGAKLGGDLGS--SNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCL 261

Query: 196 PDQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
                  +N GGI A   VV     +TPL+    HY + L+ I VG   L      F S 
Sbjct: 262 -----DTVNGGGIFAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSG 316

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC--YNISSQP 302
           ++    +D+G     +P   +  L +++ +  +   V+ +       D  C  Y+ S   
Sbjct: 317 NSKGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTL------QDFSCFQYSGSVDD 370

Query: 303 KFPEVTIHFRGADVKL--SPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------N 352
            FPEVT HF G DV L  SP + LF+N    + C  F+ G      G+ + +       N
Sbjct: 371 GFPEVTFHFEG-DVSLIVSPHDYLFQN-GKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSN 428

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
            L+ YD+E   + +    C++
Sbjct: 429 KLVLYDLENQAIGWADYNCSS 449


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E   +  +++ +  I+     YL+ L IGTPP     ++DT SD  WTQC+PC    C+ 
Sbjct: 68  EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125

Query: 74  QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
           Q  P+F+P+ SSTY ++ CSS  C  +    C   D   C Y++ Y   A    + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
            + L     +        V FGC   +        + +G++GLG G  SL+SQ+      
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233

Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
           +F+YCLP   S    K+  G     A   +  + +   RD     +YYL+L+ + +G++ 
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRA 293

Query: 239 LEF---------------------------VSSSTGN---IFVDTGVLRTLLPLEYHSNL 268
           +                             V+    N   + +D     T L    +  L
Sbjct: 294 MSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDEL 353

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-----PEVTIHFRGADVKLSPSNL 323
            + +   I+    +G G+  G    LC+ +     F     P V + F G  ++L  + L
Sbjct: 354 VNDLEVEIRLP--RGTGSSLGLD--LCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARL 409

Query: 324 FRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F    +     +M      G+ +I+ G   Q N  + Y++ +  V+F  S C
Sbjct: 410 FAEDRESGMMCLMVGRAEAGSVSIL-GNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/412 (25%), Positives = 171/412 (41%), Gaps = 73/412 (17%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E   +  +++ +  I+     YL+ L IGTPP     ++DT SD  WTQC+PC    C+ 
Sbjct: 68  EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125

Query: 74  QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
           Q  P+F+P+ SSTY ++ CSS  C  +    C   D   C Y++ Y   A    + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
            + L     +        V FGC   +        + +G++GLG G  SL+SQ+      
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233

Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
           +F+YCLP   S    K+  G     A   +  + +   RD     +YYL+L+ + +G++ 
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRT 293

Query: 239 LEF---------------------------VSSSTGN---IFVDTGVLRTLLPLEYHSNL 268
           +                             V+    N   + +D     T L    +  L
Sbjct: 294 MSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRYGMIIDIASTITFLEASLYDEL 353

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF-----PEVTIHFRGADVKLSPSNL 323
            + +   I+    +G G+  G    LC+ +     F     P V + F G  ++L  + L
Sbjct: 354 VNDLEVEIRLP--RGTGSSLGLD--LCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARL 409

Query: 324 FRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F    +     +M      G+ +I+ G   Q N  + Y++ +  V+F  S C
Sbjct: 410 FAEDRESGMMCLMVGRAEAGSVSIL-GNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 105/389 (26%), Positives = 171/389 (43%), Gaps = 49/389 (12%)

Query: 11  NDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD 70
           N  + P S ISII        D +Y+M  +IG+PPV+ +   DTGS+  W QC      +
Sbjct: 92  NSRKYPVSRISII--------DKVYVMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTN 143

Query: 71  CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--------GDCSYSFLYGRGAYAS 122
           C+KQ+ PLF+P KSSTY    C   +C        E          C Y   Y      S
Sbjct: 144 CYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKSSVQVCRYHISYED---HS 200

Query: 123 FSSGNLATETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ---TGIIGLGPGNSS 178
           FS G ++T+ +TF    +        + FGCG+ N  +P  D       G++GLG   +S
Sbjct: 201 FSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMAS 260

Query: 179 LISQMGTSIAGKFSYCL--PD----QGSSKINFGGIVAGAGVVSTPLIIRDHYYL--SLE 230
           L+ Q+     G+FSYC+  PD     G+ +I FG   + +G  +      + +Y+  +++
Sbjct: 261 LVGQL---TLGQFSYCISTPDVQKPNGTIEIRFGLAASISGHSTALANNLEGWYIFQNVD 317

Query: 231 AISVGNQRL--------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
            I V + ++        +F     G + +D+G   T L       L   +   I+  P  
Sbjct: 318 GIYVDDTKVKGYPEWVFQFAEGGIGGLIMDSGTTYTELYFSALDALIGELKEQIELAPDT 377

Query: 283 GVGAEPGFSDVLCYNISS--QPKFPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFR 337
              +   +S  LCYN ++      P + + F   + A    +  N + +  ++  C A  
Sbjct: 378 QDHSNSNYS--LCYNAANFLLTYVPAIELKFTDNKEAYFPFTLRNAWIDNGNDQYCLAMF 435

Query: 338 GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           G +   + G     +  IGYD++  +VSF
Sbjct: 436 GTSGISIIGIYQHRDIKIGYDLKYNLVSF 464


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 115/385 (29%), Positives = 170/385 (44%), Gaps = 67/385 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M L IG+   ++   +DTGS+    QC          +  P+FDP  S +Y  + C S  
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLVQCG--------SRSRPVFDPAASQSYRQVPCISQL 52

Query: 97  CAVVTSNCSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNST--SGLPVE 144
           C  V    S G           C+YS  YG    +  S+G+ + + +  NST  S   V+
Sbjct: 53  CLAVQQQTSNGSSQPCVNSSAACTYSLSYGD---SRNSTGDFSQDVIFLNSTNSSSQAVQ 109

Query: 145 MPNVIFGCGHKNLASPTS---DSKQTGIIGLGPGNSSLISQMGTSIAG-KFSYCLPDQGS 200
             +V FGC H    SP     D    GI+G   GN SL SQ+   + G KFSYC P Q  
Sbjct: 110 FRDVAFGCAH----SPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPW 165

Query: 201 SKINFGGIVAG------AGVVSTPLI------IRDH-YYLSLEAISVGNQRLEFVSSS-- 245
                G I  G      + V  TPL+       R   YY+ L +ISV  + L    S+  
Sbjct: 166 QPRATGVIFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFK 225

Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI- 298
                  G   +D+G   T +  + ++  ++  +   ++   K VGA  GF D  CYNI 
Sbjct: 226 LDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAGFDD--CYNIS 283

Query: 299 --SSQPKFPEVTIHFR-GADVKLSPSNLFRNIS---DEI-----MCSAFRGGNANI-VYG 346
             SS P  PEV +  +    ++L   +LF  +S   +E+     + S+ + G   I V G
Sbjct: 284 AGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLG 343

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
              Q N+L+ YD E++ V F+ + C
Sbjct: 344 NYQQSNYLVEYDNERSRVGFERADC 368


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQC-EPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           Y+++L+IGTPP  +   +D G +  WTQC + C    CFKQ+ PLFD   SST+    C 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRR--CFKQDLPLFDTNASSTFRPEPCG 108

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           ++ C  + +    GD   +  Y        + G + T+ +   + +        + FGC 
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAA-----TARLAFGCA 163

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFGGIV-- 209
             +       S  +G +GLG  N SL +QM    A  FSYCL  PD G S   F G    
Sbjct: 164 VASEMDTMWGS--SGSVGLGRTNLSLAAQMN---ATAFSYCLAPPDTGKSSALFLGASAK 218

Query: 210 ---AGAGVVSTPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGN-IFVDTGVL 256
              AG G  +TP +         +   Y L LEAI  GN  +    S  GN I V T   
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQS--GNTIMVSTATP 276

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISSQPKFPEVTIHFR-GA 314
            T L    + +L+  +++ + A PV      P  +  LC+   S+    P++ + F+ GA
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVP----PPVQNYDLCFPKASASGGAPDLVLAFQGGA 332

Query: 315 DVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           ++ +  S+   +  ++  C A  G    G  +I+ G + Q+N  + +D+++  +SF+P+ 
Sbjct: 333 EMTVPVSSYLFDAGNDTACVAILGSPALGGVSIL-GSLQQVNIHLLFDLDKETLSFEPAD 391

Query: 371 CT 372
           C+
Sbjct: 392 CS 393


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SS+Y S+SCS+
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYTSVSCSA 187

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC+ +T      ++CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 188 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 239

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   SS   +  
Sbjct: 240 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 296

Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
           I +   G  S TP+    +    Y++ +  I V  + L   SS+  ++   +D+G + T 
Sbjct: 297 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
           LP   +S L   ++  +K  P         FS  D      +++ + PEVT+ F G    
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411

Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              + NL  ++     C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SS+Y S+SCS+
Sbjct: 129 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYTSVSCSA 187

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC+ +T      ++CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 188 QQCSDLTTATLSPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 239

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   SS   +  
Sbjct: 240 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 296

Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
           I +   G  S TP+    +    Y++ +  I V  + L   SS+  ++   +D+G + T 
Sbjct: 297 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 356

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
           LP   +S L   ++  +K  P         FS  D      +++ + PEVT+ F G    
Sbjct: 357 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 411

Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              + NL  ++     C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 412 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 155/361 (42%), Gaps = 41/361 (11%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSISCS 93
           M +S+GTPPV    ++DTGS  +W QC+ C ++ C+ Q      +F+P  SSTY+ + CS
Sbjct: 1   MGISLGTPPVFNLVTIDTGSTLSWVQCKNC-QIKCYDQAAKAGQIFNPYNSSTYSKVGCS 59

Query: 94  SSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +  C        V   C E D  C YS  YG G Y   S G L  + LT  S   +    
Sbjct: 60  TEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEY---SVGYLGKDRLTLASNRSI---- 112

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLP--DQGSSK 202
            N IFGCG  NL +  +     GIIG G  + S  +Q+   +    FSYC P   +    
Sbjct: 113 DNFIFGCGEDNLYNGVN----AGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGS 168

Query: 203 INFGGIVAGAGVVSTPLIIRDH---YYLSLEAISVGNQRLE-----FVSSSTGNIFVDTG 254
           +  G       ++ T LI  DH   Y +    + V   RLE     ++S  T    VD+G
Sbjct: 169 LTIGPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMT---IVDSG 225

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
              T +       L   M+  ++A+       E     +     ++   FP V +    +
Sbjct: 226 TADTYILSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRS 285

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSR 370
            +KL   N F   S+ ++CS F   +A +    + G     +F + +DI+     FK   
Sbjct: 286 TLKLPVENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARA 345

Query: 371 C 371
           C
Sbjct: 346 C 346


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 107/374 (28%), Positives = 165/374 (44%), Gaps = 38/374 (10%)

Query: 30  SVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSST 86
           +V  +Y   + +G+P  D +  VDTGSD  W    +C  CP          L+DPK+S T
Sbjct: 64  TVTGLYFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKT 123

Query: 87  YNSISCSSSQCA-----VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
              +SC  + C+      +    +E  C YS  YG G   S ++G    + LTFN  +G 
Sbjct: 124 SEFVSCEHNFCSSTYEGRILGCKAENPCPYSISYGDG---SATTGYYVQDYLTFNRVNGN 180

Query: 142 P---VEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYC 194
           P    +  ++IFGCG     +  S S++   GIIG G  NSS++SQ+  S  +   FS+C
Sbjct: 181 PHTATQNSSIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHC 240

Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGN 248
           L       I   G V    V +TPL+    HY + L+ I V    L+     F S +   
Sbjct: 241 LDTNVGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKG 300

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
             +D+G     LP   +  L   MS ++  QP +K    E  +S    Y  +    FP V
Sbjct: 301 TVIDSGTTLAYLPRIVYDQL---MSKVLAKQPRLKVYLVEEQYS-CFQYTGNVDSGFPIV 356

Query: 308 TIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYD 358
            +HF  +  + + P + LF    D   C  ++   +    G+ M +       N L+ YD
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYD 416

Query: 359 IEQAMVSFKPSRCT 372
           +E   + +    C+
Sbjct: 417 LENMTIGWTDYNCS 430


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 172/371 (46%), Gaps = 58/371 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ L  GTPP   +  +DTGS+  W  C PC      KQ+P  F+P KSSTYN ++C+S
Sbjct: 124 YIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSS-KQQP--FEPSKSSTYNYLTCAS 180

Query: 95  SQCAVVTSNCSEGD----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            QC ++   C++ D    CS +  YG     S     L++ETL+  S      ++ N +F
Sbjct: 181 QQCQLLRV-CTKSDNSVNCSLTQRYGD---QSEVDEILSSETLSVGSQ-----QVENFVF 231

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GC +   A+     +   ++G G    S +SQ  T     FSYCLP   SS    G ++ 
Sbjct: 232 GCSN---AARGLIQRTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFT-GSLLL 287

Query: 211 GAGVVS------TPLIIRDH----YYLSLEAISVGNQRLEF------VSSSTGN-IFVDT 253
           G   +S      TPL+        YY+ L  ISVG + +        +  STG    +D+
Sbjct: 288 GKEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDS 347

Query: 254 G-VLRTLLPLEYHS---NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI-SSQPKFPEVT 308
           G V+  L+   Y++   + +S +SN+  A P              CYN  S   +FP +T
Sbjct: 348 GTVITRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDT--------CYNRPSGDVEFPLIT 399

Query: 309 IHF-RGADVKLSPSNLFRNISDE--IMCSAFR---GGNANIV--YGRIMQINFLIGYDIE 360
           +HF    D+ L   N+    +D+  ++C AF    GG  +++  +G   Q    I +D+ 
Sbjct: 400 LHFDDNLDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVA 459

Query: 361 QAMVSFKPSRC 371
           ++ +      C
Sbjct: 460 ESRLGIASENC 470


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 172/356 (48%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SS+Y S+SCS+
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYASVSCSA 185

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC+ +T      ++CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 186 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 237

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   SS   +  
Sbjct: 238 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 294

Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
           I +   G  S TP+    +    Y++ +  I V  + L   SS+  ++   +D+G + T 
Sbjct: 295 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
           LP   +S L   ++  +K  P         FS  D      +++ + PEVT+ F G    
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409

Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              + NL  ++     C AF    +  + G   Q  F + YD++ + + F  + C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 109/378 (28%), Positives = 169/378 (44%), Gaps = 55/378 (14%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           L +  IG  P D +  VDTGSD  W     C  CP+      +  L+DP  S T  ++ C
Sbjct: 75  LYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKAVPC 134

Query: 93  SSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
               C        S C++G  C YS  YG G   S +SG+   + LTF+   G    +P+
Sbjct: 135 DDEFCTSTYDGQISGCTKGMSCPYSITYGDG---STTSGSYIKDDLTFDRVVGDLRTVPD 191

Query: 148 ---VIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
              VIFGCG K     S T+D+   GIIG G  NSS++SQ+  + AGK    FS+CL   
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL--AAAGKVKRIFSHCL--- 246

Query: 199 GSSKINFGGIVAGAGVV-----STPLII-RDHYYLSLEAISVGNQRLEF----VSSSTGN 248
               I+ GGI A   VV     +TPL+    HY + L+ I V    ++     + SS+G 
Sbjct: 247 --DSISGGGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGR 304

Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK---- 303
              +D+G     LP+  +  L   +  + +   +K    E  F+   C++ S +      
Sbjct: 305 GTIIDSGTTLAYLPVSIYDQLLEKI--LAQRSGMKLYLVEDQFT---CFHYSDEESVDDL 359

Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLI 355
           FP V   F  G  +   P +      +++ C  +       + G   I+ G ++  N L+
Sbjct: 360 FPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLV 419

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD++   + +    C++
Sbjct: 420 VYDLDNMAIGWADYNCSS 437


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 168/362 (46%), Gaps = 43/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQC-EPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           Y+++L+IGTPP  +   +D G +  WTQC + C    CFKQ+ PLFD   SST+    C 
Sbjct: 51  YVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHCRR--CFKQDLPLFDTNASSTFRPEPCG 108

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           ++ C  + +    GD   +  Y        + G + T+ +   + +        + FGC 
Sbjct: 109 AAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGTAA-----TARLAFGCA 163

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--PDQGSSKINFGGIV-- 209
             +       S  +G +GLG  N SL +QM    A  FSYCL  PD G S   F G    
Sbjct: 164 VASEMDTMWGS--SGSVGLGRTNLSLAAQMN---ATAFSYCLAPPDTGKSSALFLGASAK 218

Query: 210 ---AGAGVVSTPLI---------IRDHYYLSLEAISVGNQRLEFVSSSTGN-IFVDTGVL 256
              AG G  +TP +         +   Y L LEAI  GN  +    S  GN I V T   
Sbjct: 219 LAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQS--GNTITVSTATP 276

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-NISSQPKFPEVTIHFR-GA 314
            T L    + +L+  +++ + A PV      P  +  LC+   S+    P++ + F+ GA
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVP----PPVQNYDLCFPKASASGGAPDLVLAFQGGA 332

Query: 315 DVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           ++ +  S+   +  ++  C A  G    G  +I+ G + Q+N  + +D+++  +SF+P+ 
Sbjct: 333 EMTVPVSSYLFDAGNDTACVAILGSPALGGVSIL-GSLQQVNIHLLFDLDKETLSFEPAD 391

Query: 371 CT 372
           C+
Sbjct: 392 CS 393


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 93/339 (27%), Positives = 144/339 (42%), Gaps = 37/339 (10%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           + TP +    +     +S   +Y+ + +IGTPP  +   VD   +  WTQC PC    CF
Sbjct: 35  DATPPAAGGAVAVPIYLSSQGLYVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CF 92

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLA 129
           +Q+ PLFDP KSST+  + C S  C  +   + NC+   C    +Y     A  + G   
Sbjct: 93  EQDLPLFDPTKSSTFRGLPCGSHLCESIPESSRNCTSDVC----IYEAPTKAGDTGGKAG 148

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
           T+T    +          + FGC         +    +GI+GLG    SL++QM  +   
Sbjct: 149 TDTFAIGAAK------ETLGFGCVVMTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT--- 199

Query: 190 KFSYCLPDQGSSKINFGGI---VAGAGVVSTPLIIRD-----------HYYLSLEAISVG 235
            FSYCL  + S  +  G     +AG    STP +I+            +Y + L  I  G
Sbjct: 200 AFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTG 259

Query: 236 NQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
              L+  SSS   + +DT    + L    +  LK  ++  +  QPV    A P     LC
Sbjct: 260 GAPLQAASSSGSTVLLDTVSRASYLADGAYKALKKALTAAVGVQPV----ASPPKPYDLC 315

Query: 296 YNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMC 333
           +  +     PE+   F  GA + + P+N      +  +C
Sbjct: 316 FPKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTVC 354


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/343 (30%), Positives = 146/343 (42%), Gaps = 42/343 (12%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG 107
           S+DT  D  W QC PCP  +C+ Q+  LFDP++S T  ++ C S+ C  +    + CS  
Sbjct: 165 SIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNN 224

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSDS 164
            C Y   YG G     +SG    + LT N ++   V M N  FGC H    N ++ TS  
Sbjct: 225 QCQYFVDYGDG---RATSGTYMVDALTLNPST---VVM-NFRFGCSHAVRGNFSASTS-- 275

Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI---------NFGGIVAGAGVV 215
              G + LG G  SL+SQ   +    FSYC+PD  SS              G  A   +V
Sbjct: 276 ---GTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLV 332

Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
             P II   Y + L  I VG +RL        G   +D+ V+ T LP   +  L+    +
Sbjct: 333 RNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRS 392

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEI 331
            + A P +  G   G     CY+    +    P V++ F  GA V+L    +        
Sbjct: 393 AMAAYP-RVAGGRAGLDT--CYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVE----- 444

Query: 332 MCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            C AF    G  A    G + Q    + YD+    V F+   C
Sbjct: 445 GCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 171/356 (48%), Gaps = 35/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       VDTGS  TW QC PC  + C +Q  P+F+PK SS+Y S+SCS+
Sbjct: 127 YVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPC-VVSCHRQSGPVFNPKASSSYASVSCSA 185

Query: 95  SQCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            QC+ +T      ++CS  + C Y   YG    +SFS G L+ +T++F STS     +PN
Sbjct: 186 QQCSDLTTATLNPASCSTSNVCIYQASYGD---SSFSVGYLSKDTVSFGSTS-----VPN 237

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             +GCG  N        +  G+IGL     SL+ Q+  S+   FSYCLP   SS   +  
Sbjct: 238 FYYGCGQDNEG---LFGQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPTSSSSSSGYLS 294

Query: 208 IVA-GAGVVS-TPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGNI--FVDTGVLRTL 259
           I +   G  S TP+    +    Y++ +  I V  + L   SS+  ++   +D+G + T 
Sbjct: 295 IGSYNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITR 354

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVLCYNISSQPKFPEVTIHFRGADVK 317
           LP   +S L   ++  +K  P         FS  D      +++ + PEVT+ F G    
Sbjct: 355 LPTGVYSALSKAVAGAMKGTPRAS-----AFSILDTCFQGQAARLRVPEVTMAFAGGAAL 409

Query: 318 LSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              + NL  ++     C AF    +  + G   Q  F + YD++ + + F    C+
Sbjct: 410 KLAARNLLVDVDSATTCLAFAPARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 109/378 (28%), Positives = 165/378 (43%), Gaps = 55/378 (14%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSISC 92
           L +  IG  P D +  VDTGSD  W     C  CP+      E  L+DP  S T   + C
Sbjct: 76  LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135

Query: 93  SSSQCAVV----TSNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
               C        S C +   C YS  YG G   S +SG+   + LTF+   G    +P+
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDG---STTSGSYIKDDLTFDRVVGDLRTVPD 192

Query: 148 ---VIFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
              VIFGCG K     S T+D+   GIIG G  NSS++SQ+  + AGK    FS+CL   
Sbjct: 193 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL--AAAGKVKRVFSHCL--- 247

Query: 199 GSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTG 247
               +N GGI A   VV     +TPL+ R  HY + L+ I V    ++     F S+S  
Sbjct: 248 --DTVNGGGIFAIGEVVQPKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPTDIFDSTSGR 305

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP----K 303
              +D+G     LP+  +  L  +   + +   ++    E  F+   C++ S +      
Sbjct: 306 GTIIDSGTTLAYLPVSIYDQL--LEKTLAQRSGMELYLVEDQFT---CFHYSDEKSLDDA 360

Query: 304 FPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLI 355
           FP V   F  G  +   P +      +++ C  ++   A        I+ G ++  N L 
Sbjct: 361 FPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLF 420

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD++   + +    C++
Sbjct: 421 IYDLDNMSIGWTDYNCSS 438


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 105/359 (29%), Positives = 157/359 (43%), Gaps = 51/359 (14%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK-SSTYNSISCSSSQCAV 99
           +GTPP  +   ++ G++  W    P PE  CF+Q  P F+P   S      SC S +   
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPE--CFEQAFPYFEPLTFSRGLPFASCGSPKF-- 56

Query: 100 VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLAS 159
                    C Y++ YG     S ++G L  +  TF    G    +P V FGCG  N  +
Sbjct: 57  ----WPNQTCVYTYSYGD---KSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFN--N 104

Query: 160 PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC-------LPDQGSSKINFGGIVAGA 212
               S +TGI G G G  SL SQ+     G FS+C       +P      +       G 
Sbjct: 105 GVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQ 161

Query: 213 GVV-STPLIIRDH-------YYLSLEAISVGNQRLEF------VSSSTGNIFVDTGVLRT 258
           G V +TPLI           YYLSL+ I+VG+ RL        +++ TG   +D+G   T
Sbjct: 162 GAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSIT 221

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHFRGADV 316
            LP + +  ++   +  IK   V G           C++  SQ  P  P++ +HF GA +
Sbjct: 222 SLPPQVYQVVRDEFAAQIKLPVVPGNAT----GHYTCFSAPSQAKPDVPKLVLHFEGATM 277

Query: 317 KLSPSNLFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   N    + D+    I+C A   G+   + G   Q N  + YD++  M+SF  ++C
Sbjct: 278 DLPRENYVFEVPDDAGNSIICLAINKGDETTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 104/343 (30%), Positives = 146/343 (42%), Gaps = 42/343 (12%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG 107
           S+DT  D  W QC PCP  +C+ Q+  LFDP++S T  ++ C S+ C  +    + CS  
Sbjct: 149 SIDTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNN 208

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK---NLASPTSDS 164
            C Y   YG G     +SG    + LT N ++   V M N  FGC H    N ++ TS  
Sbjct: 209 QCQYFVDYGDG---RATSGTYMVDALTLNPST---VVM-NFRFGCSHAVRGNFSASTS-- 259

Query: 165 KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI---------NFGGIVAGAGVV 215
              G + LG G  SL+SQ   +    FSYC+PD  SS              G  A   +V
Sbjct: 260 ---GTMSLGGGRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLV 316

Query: 216 STPLIIRDHYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSN 274
             P II   Y + L  I VG +RL        G   +D+ V+ T LP   +  L+    +
Sbjct: 317 RNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRS 376

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEI 331
            + A P +  G   G     CY+    +    P V++ F  GA V+L    +        
Sbjct: 377 AMAAYP-RVAGGRAGLDT--CYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMVE----- 428

Query: 332 MCSAFR---GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            C AF    G  A    G + Q    + YD+    V F+   C
Sbjct: 429 GCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 104/355 (29%), Positives = 159/355 (44%), Gaps = 53/355 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---------S 102
           VDTGSD TW QC PC    C+ Q+ PLF+P  SS++ S+ C+S  C  +          S
Sbjct: 81  VDTGSDLTWVQCLPCRL--CYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCS 138

Query: 103 NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
           N +   C Y   YG G+Y   S G L  E LT   T     E+ N IFGCG  N      
Sbjct: 139 NKNSTSCDYQIDYGDGSY---SRGELGFEKLTLGKT-----EIDNFIFGCGRNNKGLFGG 190

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGI----------V 209
            S   G++GL     SL+SQ  +     FSYCLP  G   S  +  GG           +
Sbjct: 191 AS---GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI 247

Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF--VSSSTGNI-FVDTGVLRTLLPLEYHS 266
           +   ++  P  + + Y+L+L  IS+G   L    +SS+ G +  +D+G + T L    + 
Sbjct: 248 SYTRMIQNPQ-MSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 306

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
             K+        +   G    PGFS +  C+N++   +   P V   F G A++ +    
Sbjct: 307 AFKAEFE-----KQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 361

Query: 323 LFRNISDEI--MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +F  +  +   +C AF      +  ++ G   Q N  + Y+ +++ V F    C+
Sbjct: 362 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 416


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G   + +   VDTGSD TW QC+PC    C+ Q+ PL+DP  SS+Y ++ C+S
Sbjct: 87  YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCRS--CYNQQGPLYDPSVSSSYKTVFCNS 142

Query: 95  SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S C      TSN   C   +      C Y   YG G+Y   + G+LA+E++    T    
Sbjct: 143 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 195

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
            ++ N +FGCG  N       S   G+      + SL+SQ   +  G FSYCLP   D  
Sbjct: 196 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 251

Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
           S  ++FG           V  TPL+    +R  Y L+L   S+G   +E  SSS G  I 
Sbjct: 252 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 309

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEV 307
           +D+G + T LP   +   K+V    +K     G    PG+S +  C+N++S      P +
Sbjct: 310 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 364

Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
            + F+G A++++  + +F  +  +  ++C A       N   + G   Q N  + YD  Q
Sbjct: 365 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 424

Query: 362 AMVSFKPSRC 371
             +      C
Sbjct: 425 ERLGIVGENC 434


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 157/359 (43%), Gaps = 36/359 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC---FKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    VDTGSD  W  C+PCP+            LFD   SST   +
Sbjct: 73  LYFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKV 132

Query: 91  SCSSSQCAVVTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP- 146
            C    C+ ++ + S      CSY  +Y   A  S S G    + LT    +G     P 
Sbjct: 133 GCDDDFCSFISQSDSCQPALGCSYHIVY---ADESTSDGKFIRDMLTLEQVTGDLKTGPL 189

Query: 147 --NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQGSS 201
              V+FGCG          DS   G++G G  N+S++SQ+  +   K  FS+CL +    
Sbjct: 190 GQEVVFGCGSDQSGQLGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGG 249

Query: 202 KINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRT 258
            I   G+V    V +TP++    HY + L  + V    L+   S    G   VD+G    
Sbjct: 250 GIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLA 309

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHFRGADV 316
             P   +    S++  ++  QPVK    E  F    C++ S+     FP V+  F  + V
Sbjct: 310 YFPKVLYD---SLIETILARQPVKLHIVEETFQ---CFSFSTNVDEAFPPVSFEFEDS-V 362

Query: 317 KLS--PSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIGYDIEQAMVSF 366
           KL+  P +    + +E+ C  ++ G          I+ G ++  N L+ YD++  ++ +
Sbjct: 363 KLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLDNEVIGW 421


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/355 (29%), Positives = 159/355 (44%), Gaps = 53/355 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVT---------S 102
           VDTGSD TW QC PC    C+ Q+ PLF+P  SS++ S+ C+S  C  +          S
Sbjct: 160 VDTGSDLTWVQCLPCRL--CYNQQEPLFNPSNSSSFLSLPCNSPTCVALQPTAGSSGLCS 217

Query: 103 NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS 162
           N +   C Y   YG G+Y   S G L  E LT   T     E+ N IFGCG  N      
Sbjct: 218 NKNSTSCDYQIDYGDGSY---SRGELGFEKLTLGKT-----EIDNFIFGCGRNNKGLFGG 269

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFGGI----------V 209
            S   G++GL     SL+SQ  +     FSYCLP  G   S  +  GG           +
Sbjct: 270 AS---GLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLGGADFSNFKNISPI 326

Query: 210 AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF--VSSSTGNI-FVDTGVLRTLLPLEYHS 266
           +   ++  P  + + Y+L+L  IS+G   L    +SS+ G +  +D+G + T L    + 
Sbjct: 327 SYTRMIQNPQ-MSNFYFLNLTGISIGGVNLNVPRLSSNEGVLSLLDSGTVITRLSPSIYK 385

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
             K+        +   G    PGFS +  C+N++   +   P V   F G A++ +    
Sbjct: 386 AFKAEFE-----KQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFEGNAEMIVDVEG 440

Query: 323 LFRNISDEI--MCSAFRG---GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +F  +  +   +C AF      +  ++ G   Q N  + Y+ +++ V F    C+
Sbjct: 441 VFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGFAGEPCS 495


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G   + +   VDTGSD TW QC+PC    C+ Q+ PL+DP  SS+Y ++ C+S
Sbjct: 135 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 190

Query: 95  SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S C      TSN   C   +      C Y   YG G+Y   + G+LA+E++    T    
Sbjct: 191 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 243

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
            ++ N +FGCG  N       S   G+      + SL+SQ   +  G FSYCLP   D  
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 299

Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
           S  ++FG           V  TPL+    +R  Y L+L   S+G   +E  SSS G  I 
Sbjct: 300 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 357

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
           +D+G + T LP   +   K+V    +K     G    PG+S +  C+N++S      P +
Sbjct: 358 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 412

Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
            + F+G A++++  + +F  +  +  ++C A       N   + G   Q N  + YD  Q
Sbjct: 413 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQ 472

Query: 362 AMVSFKPSRC 371
             +      C
Sbjct: 473 ERLGIVGENC 482


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 110/382 (28%), Positives = 183/382 (47%), Gaps = 59/382 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP--PLFDPKKSSTYNSISC 92
           Y + L +GTP       VDTGSD TW QC P P        P  P +D   SS+Y  I C
Sbjct: 59  YFVELRVGTPAKKFPLIVDTGSDLTWIQCNP-PNTTANSSSPPAPWYDKSSSSSYREIPC 117

Query: 93  SSSQC----AVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSG--- 140
           +  +C    A + S+C   S   C Y++ Y   +  S ++G LA ET++  S   SG   
Sbjct: 118 TDDECQFLPAPIGSSCSITSPSPCDYTYGY---SDQSRTTGILAYETISMKSRKRSGKRA 174

Query: 141 -----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ-MGTSIAGKFSYC 194
                  + + NV  GC  +++ +  S    +G++GLG G  SL +Q   T++ G FSYC
Sbjct: 175 GNHKTRRIRIKNVALGCSRESVGA--SFLGASGVLGLGQGPISLATQTRHTALGGIFSYC 232

Query: 195 LPD--QGSSKINF--GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
           L D  +GS+  +F   G      +  TP++     +  YY+++  ++V  + ++ ++SS 
Sbjct: 233 LVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSD 292

Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVL 294
                    G IF D+G   + L    +S +   ++  I   +AQ +       GF   L
Sbjct: 293 WGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEI-----PEGFE--L 344

Query: 295 CYNISSQPK-FPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIM 349
           CYN++   K  P++ + F+G  V +L  +N    +++ + C A +     N + + G ++
Sbjct: 345 CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLL 404

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q +  I YD+ +A + FK S C
Sbjct: 405 QQDHHIEYDLAKARIGFKWSPC 426


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 103/352 (29%), Positives = 152/352 (43%), Gaps = 70/352 (19%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+P  D+    DTGSD TWTQCEPC    C++Q   +FDP  S +Y+++SC S
Sbjct: 89  YVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGY-CYQQREHIFDPSTSLSYSNVSCDS 147

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  + S       CS   C Y   YG G+Y   S G  A E L+  ST        N 
Sbjct: 148 PSCEKLESATGNSPGCSSSTCLYGIRYGDGSY---SIGFFAREKLSLTSTD----VFNNF 200

Query: 149 IFGCGHKN--LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
            FGCG  N  L   T+     G++GL     SL+SQ        FSYCLP   SS     
Sbjct: 201 QFGCGQNNRGLFGGTA-----GLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST---- 251

Query: 207 GIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
                              YLS  +    ++ ++F                  LP   +S
Sbjct: 252 ------------------GYLSFGSGDGDSKAVKFTPR---------------LPPTVYS 278

Query: 267 NLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSN 322
           +++ V   ++   P VKGV          CY++S     K P++ ++F  GA++ L+P  
Sbjct: 279 SVQKVFRELMSDYPRVKGVSILD-----TCYDLSKYKTVKVPKIILYFSGGAEMDLAPEG 333

Query: 323 LFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +   +    +C AF G + +    + G + Q    + YD  +  V F PS C
Sbjct: 334 IIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 168/363 (46%), Gaps = 48/363 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +++ +  G+P        DTGSD +W QC+PC    C+KQ  P+FDP KSS+Y  + C +
Sbjct: 112 FVVVVGFGSPAQTSATMFDTGSDLSWIQCQPC-SGHCYKQHDPVFDPAKSSSYAVVPCGT 170

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
           ++CA     C+   C Y   YG G   S ++G LA ETLTF+S+S    E    IFGCG 
Sbjct: 171 TECAAAGGECNGTTCVYGVEYGDG---SSTTGVLARETLTFSSSS----EFTGFIFGCGE 223

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
            NL       +  G++GLG G+ SL SQ   +  G FSYCLP   ++    G +  GA  
Sbjct: 224 TNLG---DFGEVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTP---GYLSIGATP 277

Query: 215 VSTPLIIR-----------DHYYLSLEAISVGNQRL-----EFVSSSTGNIFVDTGVLRT 258
           V+  + ++             Y++ L +I++G   L     EF  + T    +D+G + T
Sbjct: 278 VTGQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGT---LLDSGTILT 334

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPKFPEVTIHFRGADVK 317
            LP   ++ L+      +     +G    P + ++  CY+ + Q       + F  +D  
Sbjct: 335 YLPPPAYTALRDRFKFTM-----QGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGA 389

Query: 318 LSPSNLF------RNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKP 368
           +   N F       +    + C AF    A++   V G   Q +  + YD+    + F P
Sbjct: 390 VFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIP 449

Query: 369 SRC 371
           + C
Sbjct: 450 ASC 452


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 152/340 (44%), Gaps = 38/340 (11%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
           +DT SD TW QC PCP   C+ Q+  L+DP KSS+    SC+S  C  +    + C+  +
Sbjct: 173 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNN 232

Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
            C Y   Y  G   + ++G   ++ LT    + +     +  FGC H    S +  S   
Sbjct: 233 QCQYRVRYPDG---TSTAGTYISDLLTITPATAV----RSFQFGCSHGVQGSFSFGSSAA 285

Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI--VAGAGVVSTPL-----I 220
           GI+ LG G  SL+SQ   +    FS+C P          G+  VA    V TP+     I
Sbjct: 286 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 345

Query: 221 IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA 278
               Y + LEAI+V  QR+    +  + G        +  L P  Y +  ++    M   
Sbjct: 346 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 405

Query: 279 QPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHF-RGADVKLSPSN-LFRNISDEIMCS 334
           QP     A P      CY+++    F  P +T+ F + A V+L PS  LF+       C 
Sbjct: 406 QP-----APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG------CL 454

Query: 335 AFRGGNANIVYGRI--MQINFL-IGYDIEQAMVSFKPSRC 371
           AF  G  + V G I  +Q+  L + Y+I  A+V F+ + C
Sbjct: 455 AFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 102/340 (30%), Positives = 152/340 (44%), Gaps = 38/340 (11%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
           +DT SD TW QC PCP   C+ Q+  L+DP KSS+    SC+S  C  +    + C+  +
Sbjct: 148 LDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNN 207

Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
            C Y   Y  G   + ++G   ++ LT    + +     +  FGC H    S +  S   
Sbjct: 208 QCQYRVRYPDG---TSTAGTYISDLLTITPATAV----RSFQFGCSHGVQGSFSFGSSAA 260

Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI--VAGAGVVSTPL-----I 220
           GI+ LG G  SL+SQ   +    FS+C P          G+  VA    V TP+     I
Sbjct: 261 GIMALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 320

Query: 221 IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA 278
               Y + LEAI+V  QR+    +  + G        +  L P  Y +  ++    M   
Sbjct: 321 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 380

Query: 279 QPVKGVGAEPGFSDVLCYNISSQPKF--PEVTIHF-RGADVKLSPSN-LFRNISDEIMCS 334
           QP     A P      CY+++    F  P +T+ F + A V+L PS  LF+       C 
Sbjct: 381 QP-----APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQG------CL 429

Query: 335 AFRGGNANIVYGRI--MQINFL-IGYDIEQAMVSFKPSRC 371
           AF  G  + V G I  +Q+  L + Y+I  A+V F+ + C
Sbjct: 430 AFTAGPNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 111/370 (30%), Positives = 173/370 (46%), Gaps = 55/370 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G   + +   VDTGSD TW QC+PC    C+ Q+ PL+DP  SS+Y ++ C+S
Sbjct: 135 YIVTVELGGKNMSLI--VDTGSDLTWVQCQPCR--SCYNQQGPLYDPSVSSSYKTVFCNS 190

Query: 95  SQC---AVVTSN---CSEGD------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           S C      TSN   C   +      C Y   YG G+Y   + G+LA+E++    T    
Sbjct: 191 STCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSY---TRGDLASESILLGDT---- 243

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQG 199
            ++ N +FGCG  N       S   G+      + SL+SQ   +  G FSYCLP   D  
Sbjct: 244 -KLENFVFGCGRNNKGLFGGSSGLMGLG---RSSVSLVSQTLKTFNGVFSYCLPSLEDGA 299

Query: 200 SSKINFGG----IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGN-IF 250
           S  ++FG           V  TPL+    +R  Y L+L   S+G   +E  SSS G  I 
Sbjct: 300 SGSLSFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGG--VELKSSSFGRGIL 357

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQP--KFPEV 307
           +D+G + T LP   +   K+V    +K     G    PG+S +  C+N++S      P +
Sbjct: 358 IDSGTVITRLPPSIY---KAVKIEFLKQ--FSGFPTAPGYSILDTCFNLTSYEDISIPII 412

Query: 308 TIHFRG-ADVKLSPSNLFRNISDE--IMCSAFRG---GNANIVYGRIMQINFLIGYDIEQ 361
            + F+G A++++  + +F  +  +  ++C A       N   + G   Q N  + YD  Q
Sbjct: 413 KMIFQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQ 472

Query: 362 AMVSFKPSRC 371
             +      C
Sbjct: 473 ERLGIVGENC 482


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 169/367 (46%), Gaps = 47/367 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + + +GTP       VDTGS  +W QC+PC  + C  Q  P+F P  S TY ++SCSS
Sbjct: 107 YYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPC-VIYCHVQVDPIFTPSVSKTYKALSCSS 165

Query: 95  SQCAVVTS------NCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SQC+ + S       CS   G C Y   YG     SFS G L+ + LT       P   P
Sbjct: 166 SQCSSLKSSTLNAPGCSNATGACVYKASYGD---TSFSIGYLSQDVLTLT-----PSAAP 217

Query: 147 N--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
           +   ++GCG  N        +  GIIGL     S++ Q+       FSYCLP   S++ N
Sbjct: 218 SSGFVYGCGQDNQG---LFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPN 274

Query: 205 ---FGGIVAGAGVVS------TPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNI-- 249
               G +  GA  +S      TPL+    I   Y+L L  I+V  + L  VS+S+ N+  
Sbjct: 275 SSVSGFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLG-VSASSYNVPT 333

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCY--NISSQPKFPE 306
            +D+G + T LP+  ++ LK     ++     K     PGFS +  C+  ++      PE
Sbjct: 334 IIDSGTVITRLPVAIYNALKKSFVMIMS----KKYAQAPGFSILDTCFKGSVKEMSTVPE 389

Query: 307 VTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
           + I FR GA ++L   N    I     C A    +  I + G   Q  F + YD+  + +
Sbjct: 390 IRIIFRGGAGLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKI 449

Query: 365 SFKPSRC 371
            F P  C
Sbjct: 450 GFAPGGC 456


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 109/382 (28%), Positives = 183/382 (47%), Gaps = 59/382 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP--PLFDPKKSSTYNSISC 92
           Y + L +GTP       +DTGSD TW QC P P        P  P +D   SS+Y  I C
Sbjct: 27  YFVELRVGTPAKKFPLIIDTGSDLTWIQCNP-PNTTANSSSPPAPWYDKSSSSSYREIPC 85

Query: 93  SSSQC----AVVTSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNS--TSG--- 140
           +  +C    A + S+C   S   C Y++ Y   +  S ++G LA ET++  S   SG   
Sbjct: 86  TDDECLFLPAPIGSSCSIKSPSPCDYTYGY---SDQSRTTGILAYETISMKSRKRSGKRA 142

Query: 141 -----LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ-MGTSIAGKFSYC 194
                  + + NV  GC  +++ +  S    +G++GLG G  SL +Q   T++ G FSYC
Sbjct: 143 GNHKTRTIRIKNVALGCSRESVGA--SFLGASGVLGLGQGPISLATQTRHTALGGIFSYC 200

Query: 195 LPD--QGSSKINF--GGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS- 245
           L D  +GS+  +F   G      +  TP++     +  YY+++  ++V  + ++ ++SS 
Sbjct: 201 LVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGIASSD 260

Query: 246 --------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVL 294
                    G IF D+G   + L    +S +   ++  I   +AQ +       GF   L
Sbjct: 261 WGIDGDGNKGTIF-DSGTTLSYLREPAYSKVLGALNASIYLPRAQEI-----PEGFE--L 312

Query: 295 CYNISSQPK-FPEVTIHFRGADV-KLSPSNLFRNISDEIMCSAFRG---GNANIVYGRIM 349
           CYN++   K  P++ + F+G  V +L  +N    +++ + C A +     N + + G ++
Sbjct: 313 CYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLL 372

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q +  I YD+ +A + FK S C
Sbjct: 373 QQDHHIEYDLAKARIGFKWSPC 394


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/381 (29%), Positives = 179/381 (46%), Gaps = 66/381 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G PP +I   +DTGS+ +W  C+  P L        +F+P  SSTY+ + CSS  C 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 122

Query: 99  VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
             T +    +  D      +   +YA  +S  GNLA ET    S     V  P  +FGC 
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGS-----VTRPGTLFGCM 177

Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
              L+S +  D+K TG++G+  G+ S ++Q+G S   KFSYC+    SS     G  + +
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGFLLLGDASYS 234

Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
               +  TPL++         R  Y + LE I VG++ L      FV   T  G   VD+
Sbjct: 235 WLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 294

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS--QPKF--- 304
           G   T L    ++ LK+    + + + V  +  +P F    +  LCY + S  +P F   
Sbjct: 295 GTQFTFLMGPVYTALKNEF--ITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL 352

Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
           P V++ FRGA++ +S   L   ++       +E+ C  F  GN+++      V G   Q 
Sbjct: 353 PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 410

Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
           N  + +D+ ++ V F  + RC
Sbjct: 411 NVWMEFDLAKSRVGFAGNVRC 431


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 113/381 (29%), Positives = 179/381 (46%), Gaps = 66/381 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G PP +I   +DTGS+ +W  C+  P L        +F+P  SSTY+ + CSS  C 
Sbjct: 69  LAVGDPPQNISMVLDTGSELSWLHCKKSPNLGS------VFNPVSSSTYSPVPCSSPICR 122

Query: 99  VVTSNC---SEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCG 153
             T +    +  D      +   +YA  +S  GNLA ET    S     V  P  +FGC 
Sbjct: 123 TRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGS-----VTRPGTLFGCM 177

Query: 154 HKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
              L+S +  D+K TG++G+  G+ S ++Q+G S   KFSYC+    SS     G  + +
Sbjct: 178 DSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSVFLLLGDASYS 234

Query: 213 G---VVSTPLII---------RDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDT 253
               +  TPL++         R  Y + LE I VG++ L      FV   T  G   VD+
Sbjct: 235 WLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQTMVDS 294

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS--QPKF--- 304
           G   T L    ++ LK+    + + + V  +  +P F    +  LCY + S  +P F   
Sbjct: 295 GTQFTFLMGPVYTALKNEF--ITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPNFSGL 352

Query: 305 PEVTIHFRGADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI------VYGRIMQI 351
           P V++ FRGA++ +S   L   ++       +E+ C  F  GN+++      V G   Q 
Sbjct: 353 PMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTF--GNSDLLGIEAFVIGHHHQQ 410

Query: 352 NFLIGYDIEQAMVSFKPS-RC 371
           N  + +D+ ++ V F  + RC
Sbjct: 411 NVWMEFDLAKSRVGFAGNVRC 431


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/375 (27%), Positives = 172/375 (45%), Gaps = 51/375 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G+P  +    VDTGS+ TW +C PC    C      ++D  +S +Y  ++C++
Sbjct: 100 YYTSIKLGSPGQEAILIVDTGSELTWLKCLPCKV--CAPSVDTIYDAARSVSYKPVTCNN 157

Query: 95  SQCAVVTSN-----CSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNST-SGLPVEMPN 147
           SQ    +S      C+ G  C ++  YG G   SFS G+L+T+TL   +   G PV + +
Sbjct: 158 SQLCSNSSQGTYAYCARGSQCQFAAFYGDG---SFSYGSLSTDTLIMETVVGGKPVTVQD 214

Query: 148 VIFGCGHKNLA-SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG 206
             FGC   +L   PT  S   GI+GL  G  +L  Q+G     KFS+C PD+ SS +N  
Sbjct: 215 FAFGCAQGDLELVPTGAS---GILGLNAGKMALPMQLGQRFGWKFSHCFPDR-SSHLNST 270

Query: 207 GIV-------------AGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDT 253
           G+V               +  ++   + R  Y+++L+ +S+ +  L  +   +  + +D+
Sbjct: 271 GVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGS-VVILDS 329

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQP--VKGVGAEPGFSDV-LCYNISS------QPKF 304
           G   +     +HS L+      +K +P  +K +  +  F D+  C+ +S+          
Sbjct: 330 GSSFSSFVRPFHSQLREA---FLKHRPPSLKHLEGD-SFGDLGTCFKVSNDDIDELHRTL 385

Query: 305 PEVTIHFRGADVKLSPS-----NLFRNISDEIMCSAFRGGNANI--VYGRIMQINFLIGY 357
           P +++ F        PS      + R  +   MC AF  G  N   V G   Q N  + Y
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARYQNHVKMCFAFEDGGPNPVNVIGNYQQQNLWVEY 445

Query: 358 DIEQAMVSFKPSRCT 372
           DI+++ V F  + C 
Sbjct: 446 DIQRSRVGFARASCV 460


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 155/345 (44%), Gaps = 59/345 (17%)

Query: 70  DCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSG 126
           +C  +  P F P  SST++ + C+SS C  +TS    C+   C Y + YG G    F++G
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG----FTAG 142

Query: 127 NLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
            LATETL     S      P V FGC  +N    +S    +GI+GLG    SL+SQ+G  
Sbjct: 143 YLATETLHVGGAS-----FPGVAFGCSTENGVGNSS----SGIVGLGRSPLSLVSQVGV- 192

Query: 187 IAGKFSYCL---PDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGN 236
             G+FSYCL    D G S I FG +    G  S+P I+ +       +YY++L  I+VG 
Sbjct: 193 --GRFSYCLRSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGA 250

Query: 237 QRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV 284
             L   S++            G   VD+G   T L  E ++ +K + +S M  A     V
Sbjct: 251 TDLPVTSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTV 310

Query: 285 -GAEPGFSDVLCYNIS-----SQPKFPEVTIHFRGADVKLSPSNLFRNISD-------EI 331
            G   GF   LC++ +     S    P + + F G          +  + +        +
Sbjct: 311 NGTRFGFD--LCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAV 368

Query: 332 MCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            C      +  +   + G +MQ++  + YD++  M SF P+ C N
Sbjct: 369 ECLLVLPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/354 (29%), Positives = 161/354 (45%), Gaps = 52/354 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
           Y++ + +G+P V     +DTGSD +W QCEPCP    C      LFDP  SSTY + +CS
Sbjct: 108 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 167

Query: 94  SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           ++ CA +      + C ++  C Y   YG G   S ++G  +++ LT + +      +  
Sbjct: 168 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG---SNTTGTYSSDVLTLSGSD----VVRG 220

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
             FGC H  L +   D K  G+IGLG    S +SQ        F YCLP   +S      
Sbjct: 221 FQFGCSHAELGAGM-DDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTL 279

Query: 203 --INFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTG 254
                GG    +   +TP++    +  +Y+ +LE I+VG ++L    S  + G++ VD+G
Sbjct: 280 GAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSG 338

Query: 255 VLRTLLPLEYHSNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
            + T LP   ++ L S     M+   +A+P+       G  D  C+N +   K   P V 
Sbjct: 339 TVITRLPPAAYAALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVA 390

Query: 309 IHFR-GADVKLSPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYD 358
           + F  GA V L    +         C AF   R   A    G + Q  F + YD
Sbjct: 391 LVFAGGAVVDLDAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 153/356 (42%), Gaps = 34/356 (9%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP V     +DTGS  TW QC+PC    C+ Q  PLFDP  SS+Y+ + C S
Sbjct: 129 YVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDS 188

Query: 95  SQCAVVTSN------CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            +C  + +        S+GD  C+Y   YG GA     +G  +T+ LT     G    + 
Sbjct: 189 QECRALAAGIDGDGCTSDGDWGCAYEIHYGSGAT---PAGEYSTDALTL----GPGAIVK 241

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGSSK--I 203
              FGCGH         +   G++GLG    SL  Q      G  FS+CLP  G S   +
Sbjct: 242 RFHFGCGHHQQRGKFDMAD--GVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGVSTGFL 299

Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRT 258
             G     +  V TPL+  D     Y L   AISV  Q L+   +     +  D+G + +
Sbjct: 300 ALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITDSGTVLS 359

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GAD 315
            L    ++ L++   + +   P+    A P      C+N +       P V++ FR GA 
Sbjct: 360 ALQETAYTALRTAFRSAMAEYPL----APPVGHLDTCFNFTGYDNVTVPTVSLTFRGGAT 415

Query: 316 VKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           V L  S+    + D  +     G     + G + Q    + YD+    V F+   C
Sbjct: 416 VHLDASSGV--LMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 112/387 (28%), Positives = 171/387 (44%), Gaps = 79/387 (20%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++GTPP ++   +DTGS+ +W  C               F+P  SS+Y+ I CSSS C 
Sbjct: 77  LTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSST---FNPVWSSSYSPIPCSSSTCT 133

Query: 99  VVTSNCS-EGDC-SYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             T +      C S  F +   +YA  SS  GNLAT+T    S+      +PNV+FGC  
Sbjct: 134 DQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSSG-----IPNVVFGCMD 188

Query: 155 KNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
              +S +  DSK TG++G+  G+ S +SQMG     KFSYC+     S+ +F G++    
Sbjct: 189 SIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCI-----SEYDFSGLLLLGD 240

Query: 214 V---------------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNI 249
                           +STPL   D   Y + LE I V ++ L    S         G  
Sbjct: 241 ANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGAGQT 300

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV---------LCYNISS 300
            VD+G   T L    ++ L+    N       K  G+   + D          LCY + +
Sbjct: 301 MVDSGTQFTFLLGPAYTALRDHFLN-------KTAGSLRVYEDSNFVFQGAMDLCYRVPT 353

Query: 301 Q----PKFPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------V 344
                P  P VT+ FRGA++ ++   +       R  +D I C  F  GN+++      V
Sbjct: 354 NQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTF--GNSDLLGVEAFV 411

Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRC 371
            G + Q N  + +D++++ +     RC
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRC 438


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 173/374 (46%), Gaps = 48/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W     C+ CP       +  L+D K S+T +++
Sbjct: 154 LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 213

Query: 91  SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            C  + C++       C  G  C YS LYG G   S ++G    + + +N  SG     P
Sbjct: 214 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 270

Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
               V+FGCG+K      S S+   GI+G G  NSS++SQ+ +S  +   FS+CL +   
Sbjct: 271 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 327

Query: 201 SKINFGGIVAGAGVVS-----TPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
             ++ GGI A   VV      TPL+  + HY + ++ I VG   L+     F S      
Sbjct: 328 --VDGGGIFAIGEVVEPKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGT 385

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS--QPKFPE 306
            +D+G      P E +  L   +  ++  QP ++    E  F+   C++ +      FP 
Sbjct: 386 IIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTGNVDDGFPT 439

Query: 307 VTIHF-RGADVKLSPSN-LFRN-----ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDI 359
           VT+HF +   + + P   LF++     I  +   +  + G    + G ++  N L+ YD+
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499

Query: 360 EQAMVSFKPSRCTN 373
           E+  + +    C++
Sbjct: 500 EKQGIGWVEYNCSS 513


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 158/359 (44%), Gaps = 58/359 (16%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG---- 107
           VDT S+ TW QC PC    C  Q+ PLFDP  S +Y ++ C S  C  +    + G    
Sbjct: 158 VDTASELTWVQCAPCES--CHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAG 215

Query: 108 ----------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNL 157
                      CSY+  Y  G+Y   S G LA + L+          +   +FGCG  N 
Sbjct: 216 APPCDAGRPAACSYALSYRDGSY---SRGVLAHDRLSLAGEV-----IDGFVFGCGTSNQ 267

Query: 158 ASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--------------DQGSSKI 203
             P   +  +G++GLG    SL+SQ      G FSYCLP              D  S+  
Sbjct: 268 GPPFGGT--SGLMGLGRSQLSLVSQTVDQFGGVFSYCLPLSRESDASGSLVLGDDPSAYR 325

Query: 204 NFGGIVAGAGVV-STPLIIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTLLP 261
           N   +V  + V  S PL+    Y ++L  I+VG Q +E    S   I VD+G V+ +L+P
Sbjct: 326 NSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVESTGFSARAI-VDSGTVITSLVP 384

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFR-GADVK 317
             Y++     MS + +          PGFS +  C+N++   + + P +T+ F  GA+V+
Sbjct: 385 SVYNAVRAEFMSQLAEYPQA------PGFSILDTCFNMTGLKEVQVPSLTLVFDGGAEVE 438

Query: 318 LSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +    +   +S +     +  ++ +  +   + G   Q N  + +D   + V F    C
Sbjct: 439 VDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVVFDTSASQVGFAQETC 497


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/386 (27%), Positives = 160/386 (41%), Gaps = 57/386 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+    IG PP      +DTGS+  WTQC  C    CF Q+   +DP +S T   ++C+ 
Sbjct: 84  YIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACND 143

Query: 95  SQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
           + C + +      D   C+    YG GA   F    L TE  TF         + ++ FG
Sbjct: 144 TACLLGSETRCARDGKACAVLTAYGAGAIGGF----LGTEVFTFGHGQSSENNV-SLAFG 198

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG 211
           C   +  +P S    +GIIGLG G  SL SQ+G +   KFSYCL    S   N   +  G
Sbjct: 199 CITASRLTPGSLDGASGIIGLGRGKLSLPSQLGDN---KFSYCLTPYFSDAANTSTLFVG 255

Query: 212 AG---------VVSTPLI-------IRDHYYLSLEAISVGNQRL----------EFVSSS 245
           A            S P +           YYL L  I+VG  +L          E   + 
Sbjct: 256 ASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAK 315

Query: 246 TGNIFVDTGV-LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQ 301
            G   +D+G    +L+ + Y + L+  +   + A  V       G  D+    ++   + 
Sbjct: 316 WGGTLIDSGSPFTSLIDVAYQA-LRDELVRQLGASVVPPPAGAEGL-DLCVGGVAPGDAG 373

Query: 302 PKFPEVTIHF-----RGADVKLSPSNLFRNISDEIMCS-AFRGG--------NANIVYGR 347
              P + +HF      G DV + P N +  + D   C   F  G        N   + G 
Sbjct: 374 KLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433

Query: 348 IMQINFLIGYDIEQAMVSFKPSRCTN 373
            MQ +  + YD+ Q ++SF+P+ C++
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCSS 459


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 46/376 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P       VDTGSD  W  C P   CP          ++DP++SST + +
Sbjct: 1   LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 60

Query: 91  SCSSSQCA----VVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN--STSGLP 142
           SCS   C        + CS+   +C Y F YG G   S S G    + + +N  S++GL 
Sbjct: 61  SCSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDG---STSEGYYVRDAMQYNVISSNGLA 117

Query: 143 VEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLP-DQ 198
                V+FGC  +      TS     GIIG G    S+ +Q+    +I   FS+CL  ++
Sbjct: 118 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEK 177

Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
               I   G +A  G+  TPL+    HY + L  ISV + RL     +F S++   + +D
Sbjct: 178 RGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMD 237

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIH 310
           +G      P   ++     +     A PV+  G      D  C+ +S +    FP VT++
Sbjct: 238 SGTTLAYFPSGAYNVFVQAIREATSATPVRVQGM-----DTQCFLVSGRLSDLFPNVTLN 292

Query: 311 FRGADVKLSPSNLFR------NISDEIMCSAF---------RGGNANIVYGRIMQINFLI 355
           F G  ++L P N           + ++ C  +         + G+   + G I+  + L+
Sbjct: 293 FEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLV 352

Query: 356 GYDIEQAMVSFKPSRC 371
            YD++ + + +    C
Sbjct: 353 VYDLDNSRIGWMSYNC 368


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 94/323 (29%), Positives = 141/323 (43%), Gaps = 47/323 (14%)

Query: 84  SSTYNSISCSSSQC----AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNS 137
           SST+ +++C    C     V  S C+  +  C Y   YG     S ++G++  +T TF S
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGD---RSITAGHIFKDTFTFMS 58

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-- 195
            +G+PV +  + FGCG  N     S+  ++GI G G G  SL SQ+     G+FSYCL  
Sbjct: 59  PNGVPVAVSELAFGCGDYNTGLFVSN--ESGIAGFGRGPQSLPSQLK---VGRFSYCLTL 113

Query: 196 --------------PDQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEF 241
                         PD    + +  G      ++  PL I   YYLSLE I+VG  RL F
Sbjct: 114 VTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPL-IPTFYYLSLEGITVGKTRLPF 172

Query: 242 VSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
             S        +G   +D+G   T LP    +  + +   ++   P+      P   D L
Sbjct: 173 DKSVFALKKDGSGGTVIDSGTSLTTLP---EAVFELLQEELVAQFPLPRYDNTPEVGDRL 229

Query: 295 CYNI---SSQPKFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFRGGNAN--IVYGRI 348
           C+       Q   P++ +H  GAD+ L   N F    D  +MC    G      ++ G  
Sbjct: 230 CFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNF 289

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
            Q N  + YD+E   + F P++C
Sbjct: 290 QQQNMHVVYDVENNKLLFAPAQC 312


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 161/359 (44%), Gaps = 57/359 (15%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQE--PPLFDPKKSSTYNSISCSSSQC---AVVTSNC-S 105
           VDTGSD  WTQC+         +   PP++DP +SST+  + CS   C        NC S
Sbjct: 30  VDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNCTS 89

Query: 106 EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
           +  C Y  +YG  A    + G LA+ET TF +   + + +    FGCG  +  S      
Sbjct: 90  KNRCVYEDVYGSAA----AVGVLASETFTFGARRAVSLRLG---FGCGALSAGSLIG--- 139

Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGGI-----------VAG 211
            TGI+GL P + SLI+Q+      +FSYCL    D+ +S + FG +           +  
Sbjct: 140 ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQT 196

Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPLEY 264
             +VS P +   +YY+ L  IS+G++RL   ++S        G   VD+G     L    
Sbjct: 197 TAIVSNP-VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAA 255

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--------KFPEVTIHFRGADV 316
              +K  + ++++  PV     E      LC+ +  +         + P + +HF G   
Sbjct: 256 FEAVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAA 311

Query: 317 KLSPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            + P  N F+     +MC A      G+   + G + Q N  + +D++    SF P++C
Sbjct: 312 MVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 370


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 162/363 (44%), Gaps = 46/363 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCS 93
           +++ + +GTP        DTGSD +W QC+PC     C  Q+ PLFDP KSSTY ++ C 
Sbjct: 144 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 203

Query: 94  SSQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             QCA     CSE +  C Y   YG G   S ++G L+ +TL   S+  L        FG
Sbjct: 204 EPQCAAAGDLCSEDNTTCLYLVRYGDG---SSTTGVLSRDTLALTSSRAL----TGFPFG 256

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CG +NL       +  G++GLG G  SL SQ   S    FSYCLP   S+   +  G   
Sbjct: 257 CGTRNLG---DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATP 313

Query: 210 A-GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
           A   G      ++R       Y++ L +I +G   L    +  + G   +D+G + T LP
Sbjct: 314 ATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFTRGGTLLDSGTVLTYLP 373

Query: 262 LEYHSNLKS----VMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKFPEVTIHFR--- 312
            + ++ L+      M     A P          +DVL  CY+ + + +     + FR   
Sbjct: 374 AQAYALLRDRFRLTMERYTPAPP----------NDVLDACYDFAGESEVVVPAVSFRFGD 423

Query: 313 GADVKLSPSNLFRNISDEIMCSAFR----GGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
           GA  +L    +   + + + C AF     GG    + G   Q +  + YD+    + F P
Sbjct: 424 GAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVP 483

Query: 369 SRC 371
           + C
Sbjct: 484 ASC 486


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 46/376 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P       VDTGSD  W  C P   CP          ++DP++SST + +
Sbjct: 28  LYFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLV 87

Query: 91  SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFN--STSGLP 142
           SCS   C        + CS+   +C Y F YG G   S S G    + + +N  S++GL 
Sbjct: 88  SCSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDG---STSEGYYVRDAMQYNVISSNGLA 144

Query: 143 VEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGT--SIAGKFSYCLP-DQ 198
                V+FGC  +      TS     GIIG G    S+ +Q+    +I   FS+CL  ++
Sbjct: 145 NTTSQVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEK 204

Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTGNIFVD 252
               I   G +A  G+  TPL+    HY + L  ISV + RL     +F S++   + +D
Sbjct: 205 RGGGILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMD 264

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIH 310
           +G      P   ++     +     A PV+  G      D  C+ +S +    FP VT++
Sbjct: 265 SGTTLAYFPSGAYNVFVQAIREATSATPVRVQGM-----DTQCFLVSGRLSDLFPNVTLN 319

Query: 311 FRGADVKLSPSNLFR------NISDEIMCSAF---------RGGNANIVYGRIMQINFLI 355
           F G  ++L P N           + ++ C  +         + G+   + G I+  + L+
Sbjct: 320 FEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLV 379

Query: 356 GYDIEQAMVSFKPSRC 371
            YD++ + + +    C
Sbjct: 380 VYDLDNSRIGWMSYNC 395


>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
          Length = 449

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 161/383 (42%), Gaps = 50/383 (13%)

Query: 29  ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           I  D +YL  + IG      +  +DTGS   WTQC+ CP   C   + P +   +S T+ 
Sbjct: 76  IYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPH--CHIGDVPPYGRSQSRTFQ 133

Query: 89  SISCS-----------SSQCAV----VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            +SC            +S C        + C  G C +  LY          G ++ +T 
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193

Query: 134 TFNSTSGLPVEMP-NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
            F        +    ++FGC H+     T+  + TGI+GLG G++S + Q G +   KFS
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT---KFS 250

Query: 193 YCLPD-------QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVG-NQRLEFV- 242
           YC+P        +  S + FG     +G    PL++R   YYL L AI+   N+ +  V 
Sbjct: 251 YCVPPRMPGYSYRRHSWLRFGSHAQISG-KKVPLVMRWGKYYLPLTAITYTYNELMSPVP 309

Query: 243 ------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV-KGVGAEPGFSDVLC 295
                      ++ VDTG     LP   H +L   M  +IK++ + +G    P      C
Sbjct: 310 IIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKH----C 365

Query: 296 YNIS-SQPKFPEVTIHFRGA-DVKLSPSNLF---RNISDEIMCSAFR--GGNANIVYGRI 348
           Y  +  + K   VT+ F G  D++L  S LF          +C A      ++  + G  
Sbjct: 366 YKRTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMF 425

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
            Q N  +GYD+    ++  P RC
Sbjct: 426 AQTNINVGYDLLSREIAMDPIRC 448


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 156/379 (41%), Gaps = 40/379 (10%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           ++  I   D  YL+ + IG+P V ++   DTGS   WTQCEPC     F+Q PP+F+   
Sbjct: 80  FRLRISQDDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRR--FRQLPPIFNSTA 137

Query: 84  SSTYNSISCSSSQCAVVTS--NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           S TY  + C    C    +   C +  C Y   Y  G   S ++G  A + L       +
Sbjct: 138 SRTYRDLPCQHQFCTNNQNVFQCRDDKCVYRIAYAGG---SATAGVAAQDILQSAENDRI 194

Query: 142 PVEMPNVIFGCGH--KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
           P       FGC    +N ++  S  K  GIIGL     SL+ QM      +FSYCL    
Sbjct: 195 P-----FYFGCSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFD 249

Query: 196 ---PDQGSSKINFGGIVAGA--GVVSTPLII---RDHYYLSLEAISVGNQRLE------- 240
              P   +S + FG  +  +    +STP +      +Y+L+L  +SV   R++       
Sbjct: 250 LSSPSHATSLLRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFA 309

Query: 241 FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
                TG   +D+G   T +    +  + +   N       + V  +   S  +CY    
Sbjct: 310 LKPDGTGGTIIDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQ--LSGYICYKQQG 367

Query: 301 QP--KFPEVTIHFRGADVKLSPSNLFRNISDE-IMCSAFR--GGNANIVYGRIMQINFLI 355
                +P +  HF+GAD  + P  ++  + D    C A +        + G + Q N   
Sbjct: 368 HTFHNYPSMAFHFQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQF 427

Query: 356 GYDIEQAMVSFKPSRCTNY 374
            YD     + F P  C ++
Sbjct: 428 IYDAANRQLLFTPENCQDH 446


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 117/415 (28%), Positives = 183/415 (44%), Gaps = 76/415 (18%)

Query: 2   QNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT 61
           Q    LP     +TP  P  + +Q  +        + L+IG+PP ++   +DTGS+ +W 
Sbjct: 33  QKPLLLPLKTQTQTP--PRKLAFQHNVT-----LTISLTIGSPPQNVTMVLDTGSELSWL 85

Query: 62  QCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS---EGDCSYSFLYGRG 118
            C+  P L+        F+P  SS+Y    C+SS C   T + +     D +    +   
Sbjct: 86  HCKKLPNLNS------TFNPLLSSSYTPTPCNSSVCMTRTRDLTIPASCDPNNKLCHVIV 139

Query: 119 AYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS----DSKQTGIIGL 172
           +YA  SS  G LA ET +    +      P  +FGC   + A  TS    D+K TG++G+
Sbjct: 140 SYADASSAEGTLAAETFSLAGAA-----QPGTLFGC--MDSAGYTSDINEDAKTTGLMGM 192

Query: 173 GPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG---------------VVST 217
             G+ SL++QM   +  KFSYC+    S +  FG ++ G G                 S+
Sbjct: 193 NRGSLSLVTQM---VLPKFSYCI----SGEDAFGVLLLGDGPSAPSPLQYTPLVTATTSS 245

Query: 218 PLIIRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHSNLKS 270
           P   R  Y + LE I V  + L+     FV   T  G   VD+G   T L    +++LK 
Sbjct: 246 PYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYNSLKD 305

Query: 271 VMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQPKFPEVTIHFRGADVKLSPSNLFR 325
                 K   V     +P F    +  LCY+  +S    P VT+ F GA++++S   L  
Sbjct: 306 EFLEQTKG--VLTRIEDPNFVFEGAMDLCYHAPASLAAVPAVTLVFSGAEMRVSGERLLY 363

Query: 326 NIS---DEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            +S   D + C  F  GN+++      V G   Q N  + +D+ ++ V F  + C
Sbjct: 364 RVSKGRDWVYCFTF--GNSDLLGIEAYVIGHHHQQNVWMEFDLVKSRVGFTETTC 416


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/339 (28%), Positives = 155/339 (45%), Gaps = 37/339 (10%)

Query: 63  CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----TSNCSEG-DCSYSFLYGR 117
           C  CP+      +  L+DP  S T N++ C    C        S C +   C YS  YG 
Sbjct: 33  CTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGD 92

Query: 118 GAYASFSSGNLATETLTFNSTSGLPVEMPN---VIFGCGHKNLASPTSDSKQT--GIIGL 172
           G   S +SG+   ++LTF+  SG     P+   VIFGCG K   S +S+S +   GIIG 
Sbjct: 93  G---STTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGF 149

Query: 173 GPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSL 229
           G  NSS++SQ+  S  +   FS+CL       I   G V      +TPL+ R  HY + L
Sbjct: 150 GQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIFSIGQVMEPKFNTTPLVPRMAHYNVIL 209

Query: 230 EAISVGNQRL-----EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKG 283
           + + V  + +      F S S     +D+G     LPL  ++ L   +  ++  QP +K 
Sbjct: 210 KDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIYNQL---LPKVLGRQPGLKL 266

Query: 284 VGAEPGFSDVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDEIMCSAF----- 336
           +  E  F+   C++ S +    FP V  HF G  + + P +      ++I C  +     
Sbjct: 267 MIVEDQFT---CFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSST 323

Query: 337 --RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             + G   I+ G ++  N L+ YD+E  ++ +    C++
Sbjct: 324 QTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCSS 362


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 177/388 (45%), Gaps = 78/388 (20%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           ++ L++GTPP ++   +DTGS+ +W  C            P  FDP +S++Y +I CSS 
Sbjct: 32  IVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTIPCSSP 85

Query: 96  QCAVVT------SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            C   T      ++C   + C  +  Y   A AS S GNLA++     S+     ++  +
Sbjct: 86  TCTNRTQDFPIPASCDSNNLCHATLSY---ADASSSDGNLASDVFHIGSS-----DISGL 137

Query: 149 IFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
           +FGC     +S +  DSK TG++G+  G+ S +SQ+G     KFSYC+     S  +F G
Sbjct: 138 VFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCI-----SGTDFSG 189

Query: 208 IVAGAGV---------------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSSS----- 245
           ++                    +STPL   D   Y + LE I V ++ L    S+     
Sbjct: 190 LLLLGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDH 249

Query: 246 --TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNIS 299
              G   VD+G   T L    ++ L+S   N  +   V  V  +P F    +  LCY + 
Sbjct: 250 TGAGQTMVDSGTQFTFLLGPVYNALRSAFLN--QTSSVLRVLEDPDFVFQGAMDLCYLVP 307

Query: 300 -SQ---PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------ 343
            SQ   P  P VT+ FRGA++ +S   +   +      +D + C +F  GN+++      
Sbjct: 308 LSQRVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSF--GNSDLLGVEAY 365

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           V G   Q N  + +D+E++ +     RC
Sbjct: 366 VIGHHHQQNVWMEFDLEKSRIGLAQVRC 393


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/374 (29%), Positives = 158/374 (42%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP       E  L++ K S +   +
Sbjct: 85  LYYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLV 144

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
            C    C  V     S C+    C Y  +YG G   S ++G    + + ++  SG L   
Sbjct: 145 PCDEEFCYEVNGGPLSGCTANMSCPYLEIYGDG---SSTAGYFVKDVVQYDRVSGDLQTT 201

Query: 145 MPN--VIFGCGHKNLAS--PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK--FSYCLPDQ 198
             N  VIFGCG +      PTS+    GI+G G  NSS+ISQ+  +   K  F++CL   
Sbjct: 202 SSNGSVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL--- 258

Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRL-----EFVSSSTG 247
               IN GGI A   VV      TPLI    HY +++ A+ VG   L     EF +    
Sbjct: 259 --DGINGGGIFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRK 316

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
              +D+G     LP   +  L   +S +I  QP   V           Y+ S    FP V
Sbjct: 317 GAIIDSGTTLAYLPEIVYEPL---VSKIISQQPDLKVHIVRDEYTCFQYSGSVDDGFPNV 373

Query: 308 TIHFRGAD-VKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
           T HF  +  +K+ P        + + C  +       R      + G ++  N L+ YD+
Sbjct: 374 TFHFENSVFLKVHPHEYLFPF-EGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDL 432

Query: 360 EQAMVSFKPSRCTN 373
           E   + +    C++
Sbjct: 433 ENQAIGWTEYNCSS 446


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 70/378 (18%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP      VDTGSD  W  C P   CP     K     +D K S++ + +
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94

Query: 91  SCSSSQCAVVT----SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTF--NSTSGLPV 143
            CS   C ++T    S C+ +  C YSF YG G   S + G L  + L +  N+T+    
Sbjct: 95  PCSDPSCTLITQISESGCNDQNQCGYSFQYGDG---SGTLGYLVEDVLHYMVNATA---- 147

Query: 144 EMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
               VIFGCG K      TS+    GIIG G  + S  SQ+     GK    F++CL D 
Sbjct: 148 ---TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DG 201

Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRL----EFVSSST-- 246
           G      GGI+    V+      TPL+    HY + L++ISV N  L    +  S+    
Sbjct: 202 GERG---GGILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQ 258

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PK 303
           G IF D+G     LP E +      +S ++                +LC    S+     
Sbjct: 259 GTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPF-------------LLCDTRLSRFIYKL 304

Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFRG-GNAN-----IVYGRIMQINF 353
           FP V ++F GA + L+P+  L R  S     I C  ++  G+A       ++G ++  N 
Sbjct: 305 FPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNK 364

Query: 354 LIGYDIEQAMVSFKPSRC 371
           L+ YD+E+  + ++P  C
Sbjct: 365 LVVYDLERGRIGWRPFDC 382


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/361 (29%), Positives = 164/361 (45%), Gaps = 45/361 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP   +F  +DT  D  W  C      DC     P F P  SSTY S+ CS 
Sbjct: 99  YVVRVKLGTPGQLMFMVLDTSRDAAWVPCA-----DCAGCSSPTFSPNTSSTYASLQCSV 153

Query: 95  SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFG 151
            QC  V   +C + G  +  F    G  +SFS+  L+ ++L      GL V+ +P+  FG
Sbjct: 154 PQCTQVRGLSCPTTGTAACFFNQTYGGDSSFSA-MLSQDSL------GLAVDTLPSYSFG 206

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SSKINFGG 207
           C   N  S ++   Q G++GLG G  SL+SQ G+  +G FSYC P       S  +  G 
Sbjct: 207 C--VNAVSGSTLPPQ-GLLGLGRGPMSLLSQSGSLYSGVFSYCFPSFKSYYFSGSLRLGP 263

Query: 208 IVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTGNIFVDTGVL 256
           +     + +TPL+   H    YY++L  +SVG        + L F  ++     +D+G +
Sbjct: 264 LGQPKNIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTV 323

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADV 316
            T     +   + + + +  + Q VKG  A  G  D  C+  +++   P VT HF G D+
Sbjct: 324 IT----RFVEPVYAAIRDEFRKQ-VKGPFATIGAFDT-CFAATNEDIAPPVTFHFTGMDL 377

Query: 317 KLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSR 370
           KL   N L  + +  + C A      N+     V   + Q N  I +D+  + +      
Sbjct: 378 KLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIAREL 437

Query: 371 C 371
           C
Sbjct: 438 C 438


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 171/380 (45%), Gaps = 59/380 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+PP   +  VDTGSD  W    +C+ CP       E   +DP  S T  ++
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140

Query: 91  SCSSSQCAV-----VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
            C    C       V   C  +   C +   YG G   S ++G   T+ + +N  SG   
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDG---STTTGFYVTDFVQYNQVSGNGQ 197

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
                 ++ FGCG + L      S Q   GI+G G  +SS++SQ+  +  +   F++CL 
Sbjct: 198 TTTSNASITFGCGAQ-LGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL- 255

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
                 +  GGI A   VV     +TPL+    HY ++L+ ISVG   L+     F S  
Sbjct: 256 ----DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
           +    +D+G     LP E +  L + + +  +  P+        + D +C+  S      
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLH------NYQDFVCFQFSGSIDDG 365

Query: 304 FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
           FP +T  F+G D+ L+  P + LF+N +D + C  F       + G   ++ G ++  N 
Sbjct: 366 FPVITFSFKG-DLTLNVYPDDYLFQNRND-LYCMGFLDGGVQTKDGKDMLLLGDLVLSNK 423

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
           L+ YD+E+ ++ +    C++
Sbjct: 424 LVVYDLEKEVIGWTDYNCSS 443


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 114/378 (30%), Positives = 169/378 (44%), Gaps = 70/378 (18%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP      VDTGSD  W  C P   CP     K     +D K S++ + +
Sbjct: 35  LYFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKV 94

Query: 91  SCSSSQCAVVT----SNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTF--NSTSGLPV 143
            CS   C ++T    S C+ +  C YSF YG G   S + G L  + L +  N+T+    
Sbjct: 95  PCSDPSCTLITQISESGCNDQNQCGYSFQYGDG---SGTLGYLVEDVLHYMVNATA---- 147

Query: 144 EMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQ 198
               VIFGCG K      TS+    GIIG G  + S  SQ+     GK    F++CL D 
Sbjct: 148 ---TVIFGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DG 201

Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRL----EFVSSST-- 246
           G      GGI+    V+      TPL+    HY + L++ISV N  L    +  S+    
Sbjct: 202 GERG---GGILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQ 258

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---PK 303
           G IF D+G     LP E +      +S ++                +LC    S+     
Sbjct: 259 GTIF-DSGTTLAYLPDEAYQAFTQAVSLVVAPF-------------LLCDTRLSRFIYKL 304

Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFRG-GNAN-----IVYGRIMQINF 353
           FP V ++F GA + L+P+  L R  S     I C  ++  G+A       ++G ++  N 
Sbjct: 305 FPNVVLYFEGASMTLTPAEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNK 364

Query: 354 LIGYDIEQAMVSFKPSRC 371
           L+ YD+E+  + ++P  C
Sbjct: 365 LVVYDLERGRIGWRPFDC 382


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 25/177 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G   + +   +DTGSD TW QCEPC  + C+ Q+ P+F P  SS+Y SI C+S
Sbjct: 145 YIVTMELGGQDMTVI--IDTGSDLTWVQCEPC--MSCYNQQGPVFKPSTSSSYQSIPCNS 200

Query: 95  SQCA---VVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           S C    + T N         +CSY+  YG G+Y   ++G L  E L+F   S     + 
Sbjct: 201 STCQSLQLTTGNAGACESNPSNCSYAVNYGDGSY---TNGELGAEHLSFGGIS-----VS 252

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSS 201
           N +FGCG  N       S   G++GLG  N SLISQ  ++  G FSYCLP  D G+S
Sbjct: 253 NFVFGCGKNNKGLFGGVS---GLMGLGRSNLSLISQTNSTFGGVFSYCLPPTDAGAS 306


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 165/361 (45%), Gaps = 37/361 (10%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           D+ Y + + IGTPP       DT SD TWTQC      D  KQ  PLFDP KSS++  ++
Sbjct: 88  DEGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLFN--DTAKQVEPLFDPAKSSSFAFVT 145

Query: 92  CSSSQCAVV---TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           CSS  C      T  CS   C Y + Y     +  ++G LA E+ T  S +   + M + 
Sbjct: 146 CSSKLCTEDNPGTKRCSNKTCRYVYPY----VSVEAAGVLAYESFTL-SDNNQHICM-SF 199

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---DQGSSKINF 205
            FGCG     +    S   GI+G+ P   S++SQ+      KFSYCL    D+ SS + F
Sbjct: 200 GFGCGALTDGNLLGAS---GILGMSPAILSMVSQLAIP---KFSYCLTPYTDRKSSPLFF 253

Query: 206 GGIVAGAGVVSTPLIIRD---HYYLSLEAISVGNQRLEFVSSS----TGNIFVDTGVLRT 258
           G         +T  I +    +YY+ L  +S+G +RL+  +++     G   VD G    
Sbjct: 254 GAWADLGRYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVDLGCTVG 313

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRG 313
            L     + LK  + + +   P+     +      +C+ + S       + P + ++F G
Sbjct: 314 QLAEPAFTALKEAVLHTLNL-PLTNRTVK---DYKVCFALPSGVAMGAVQTPPLVLYFDG 369

Query: 314 -ADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            AD+ L   N F+  +  +MC A   G    + G + Q NF + +D+  +   F P+ C 
Sbjct: 370 GADMVLPRDNYFQEPTAGLMCLALVPGGGMSIIGNVQQQNFHLLFDVHDSKFLFAPTICD 429

Query: 373 N 373
           +
Sbjct: 430 D 430


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 48  YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 107

Query: 94  SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
              CA +     S CS   C Y   YG G   S ++G  +++TLT +++S     +    
Sbjct: 108 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 160

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
           FGCGH   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  
Sbjct: 161 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 217

Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
           GG    A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T 
Sbjct: 218 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 277

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
           LP   ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA
Sbjct: 278 LPPTAYAALRSAFRSGMAS---YGYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 333

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V L    +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 334 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 386


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 161/363 (44%), Gaps = 46/363 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCS 93
           +++ + +GTP        DTGSD +W QC+PC     C  Q+ PLFDP KSSTY ++ C 
Sbjct: 149 FVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCG 208

Query: 94  SSQCAVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             QCA     CSE +  C Y   YG G   S ++G L+ +TL   S+  L        FG
Sbjct: 209 EPQCAAAGGLCSEDNTTCLYLVHYGDG---SSTTGVLSRDTLALTSSRAL----AGFPFG 261

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIV 209
           CG +NL       +  G++GLG G  SL SQ   S    FSYCLP   S+   +  G   
Sbjct: 262 CGTRNLG---DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCLPSSNSTTGYLTIGATP 318

Query: 210 A-GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLP 261
           A   G      ++R       Y++ L +I +G   L    +  + G   +D+G + T LP
Sbjct: 319 ATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYLP 378

Query: 262 LEYHSNLKS----VMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKFPEVTIHFR--- 312
            + +  L+      M     A P          +DVL  CY+ + + +     + FR   
Sbjct: 379 AQAYELLRDRFRLTMERYTPAPP----------NDVLDACYDFAGESEVIVPAVSFRFGD 428

Query: 313 GADVKLSPSNLFRNISDEIMCSAF----RGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
           GA  +L    +   + + + C AF     GG    + G   Q +  + YD+    + F P
Sbjct: 429 GAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVP 488

Query: 369 SRC 371
           + C
Sbjct: 489 ASC 491


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 94  SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
              CA +     S CS   C Y   YG G   S ++G  +++TLT +++S     +    
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
           FGCGH   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 309

Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
           GG    A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T 
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 369

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
           LP   ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V L    +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 426 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 165/382 (43%), Gaps = 69/382 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
           L++GTPP ++   +DTGS+ +W  C P    + F      F P+ SST+ ++ C+S+QC 
Sbjct: 89  LAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAVPCASAQCR 146

Query: 98  -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +    + +   CS S  Y  G   S S G LAT+   F   SG P+      FGC
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADG---SSSDGALATD--VFAVGSGPPLR---AAFGC 198

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
                 S        G++G+  G  S +SQ  T    +FSYC+ D+  + +         
Sbjct: 199 MSSAFDSSPDGVASAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 255

Query: 204 -----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFV 251
                N+  +   A  +  P   R  Y + L  I VG + L   +S         G   V
Sbjct: 256 TFLPLNYTPMYQPA--LPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMV 313

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI---SSQP- 302
           D+G   T L  + +S LK+  +   +A+P+     +P F+     D  C+ +    S P 
Sbjct: 314 DSGTQFTFLLGDAYSALKAEFTR--QARPLLPALDDPSFAFQEAFDT-CFRVPQGRSPPT 370

Query: 303 -KFPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIM 349
            + P VT+ F GA++ ++   L       R   D + C  F  GNA++      V G   
Sbjct: 371 ARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPIMAYVIGHHH 428

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q+N  + YD+E+  V   P RC
Sbjct: 429 QMNVWVEYDLERGRVGLAPVRC 450


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 38/357 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 94  SSQCAVV----TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
              CA +     S CS   C Y   YG G   S ++G  +++TLT +++S     +    
Sbjct: 200 GPVCAGLGIYAASACSAAQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFF 252

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INF 205
           FGCGH   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  
Sbjct: 253 FGCGH---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGV 309

Query: 206 GGIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTL 259
           GG    A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T 
Sbjct: 310 GGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTR 369

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGA 314
           LP   ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA
Sbjct: 370 LPPTAYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGA 425

Query: 315 DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            V L    +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 426 TVTLGADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 101/375 (26%), Positives = 167/375 (44%), Gaps = 48/375 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTPP + +  VDTGSD  W    QC+ CP       +  L+D K+SS+   +
Sbjct: 82  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLV 141

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
            C    C  +      G      C Y  +YG G   S ++G    + + ++  SG L  +
Sbjct: 142 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDG---SSTAGYFVKDIVLYDQVSGDLKTD 198

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  ++FGCG +     +S +++   GI+G G  NSS+ISQ+ +S  +   F++CL   
Sbjct: 199 SANGSIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL--- 255

Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLEFVSSSTGN---- 248
             + +N GGI A   VV      TPL+  + HY +++ A+ VG+  L   + ++      
Sbjct: 256 --NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRK 313

Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
              +D+G     LP   +  L   +  MI   P   V           Y+ S    FP V
Sbjct: 314 GTIIDSGTTLAYLPEGIYEPL---VYKMISQHPDLKVQTLHDEYTCFQYSESVDDGFPAV 370

Query: 308 TIHFR-GADVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYD 358
           T  F  G  +K+ P + LF +++    C  +       R      + G ++  N L+ YD
Sbjct: 371 TFFFENGLSLKVYPHDYLFPSVN--FWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYD 428

Query: 359 IEQAMVSFKPSRCTN 373
           +E   + +    C++
Sbjct: 429 LENQAIGWAEYNCSS 443


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 103/380 (27%), Positives = 170/380 (44%), Gaps = 59/380 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+PP   +  VDTGSD  W    +C+ CP       E   +DP  S T  ++
Sbjct: 83  LYYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TV 140

Query: 91  SCSSSQCAV-----VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
            C    C       V   C  +   C +   YG G   S ++G   T+ + +N  SG   
Sbjct: 141 GCEQEFCVANSAGGVPPTCPSTSSPCQFRITYGDG---STTTGFYVTDFVQYNQVSGNGQ 197

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
                 ++ FGCG + L      S Q   GI+G G  +SS++SQ+  +  +   F++CL 
Sbjct: 198 TTTSNASITFGCGAQ-LGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCL- 255

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
                 +  GGI A   VV     +TPL+    HY ++L+ ISVG   L+     F S  
Sbjct: 256 ----DTVRGGGIFAIGNVVQPKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGD 311

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
           +    +D+G     LP E +  L + + +  +  P+        + D +C+  S      
Sbjct: 312 SKGTIIDSGTTLAYLPREVYRTLLAAVFDKYQDLPLH------NYQDFVCFQFSGSIDDG 365

Query: 304 FPEVTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
           FP +T  F G D+ L+  P + LF+N +D + C  F       + G   ++ G ++  N 
Sbjct: 366 FPVITFSFEG-DLTLNVYPDDYLFQNRND-LYCMGFLDGGVQTKDGKDMLLLGDLVLSNK 423

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
           L+ YD+E+ ++ +    C++
Sbjct: 424 LVVYDLEKEVIGWTDYNCSS 443


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 174/380 (45%), Gaps = 69/380 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G+PP  +   +DTGS+ +W  C+  P L        +F+P  SS+Y+ I CSS  C 
Sbjct: 44  LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS------VFNPLSSSSYSPIPCSSPVCR 97

Query: 99  VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
             T +      C      ++ +    +YA  SS  GNLA++     S++     +P  +F
Sbjct: 98  TRTRDLPNPVTCDPKKLCHAIV----SYADASSLEGNLASDNFRIGSSA-----LPGTLF 148

Query: 151 GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
           GC     +S +  D+K TG++G+  G+ S ++Q+G     KFSYC+  +        G S
Sbjct: 149 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLLFGDS 205

Query: 202 KINFGGIVAGAGVV--STPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
            +++ G +    +V  STPL   D   Y + L+ I VGN+ L    S         G   
Sbjct: 206 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 265

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISSQPKFPE 306
           VD+G   T L    ++ L++      K   V     +P F    +  LCY + +  K PE
Sbjct: 266 VDSGTQFTFLLGPVYTALRNEFLEQTKG--VLAPLGDPNFVFQGAMDLCYRVPAGGKLPE 323

Query: 307 ---VTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQI 351
              V++ FRGA++ +    L   +       + + C  F  GN+++      V G   Q 
Sbjct: 324 LPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTF--GNSDLLGIEAFVIGHHHQQ 381

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  + +D+ ++ V F  +RC
Sbjct: 382 NVWMEFDLVKSRVGFVETRC 401


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/373 (28%), Positives = 163/373 (43%), Gaps = 46/373 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP+      E  L+D K+S T   +
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156

Query: 91  SCSSSQCAVVT----SNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  +     S C +   CSY+ +Y  G   S S G    + + ++  SG L   
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADG---SSSFGYFVRDIVQYDQVSGDLETT 213

Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
             N  VIFGC        +S+    GI+G G  N+S+ISQ+ +S  +   F++CL     
Sbjct: 214 SANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----- 268

Query: 201 SKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGN 248
             +N GGI A   +V     +TPL+  + HY ++++A+ VG   L        V    G 
Sbjct: 269 DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGT 328

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVT 308
           I +D+G     LP   +  L   +S +   Q    V           Y+ S    FP VT
Sbjct: 329 I-IDSGTTLAYLPEVVYDQL---LSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVT 384

Query: 309 IHFRGA-DVKLSPSNLFRNISDEIMCSAFRGG------NANI-VYGRIMQINFLIGYDIE 360
            HF  +  +K+ P     +  D + C  ++          NI + G +   N L+ YD+E
Sbjct: 385 FHFENSLYLKVHPHEYLFSY-DGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443

Query: 361 QAMVSFKPSRCTN 373
             ++ +    C++
Sbjct: 444 NQVIGWTEYNCSS 456


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 118/419 (28%), Positives = 184/419 (43%), Gaps = 79/419 (18%)

Query: 1   AQNSQK---LPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSD 57
           +Q +QK   LP     +TP   +S  +   +        + L++G+PP ++   +DTGS+
Sbjct: 30  SQLTQKPLLLPLKTQTQTPSRKLSFHHNVTLT-------VSLTVGSPPQNVTMVLDTGSE 82

Query: 58  CTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS---EGDCSYSFL 114
            +W  C+  P L+        F+P  SS+Y    C+SS C   T + +     D +    
Sbjct: 83  LSWLHCKKLPNLNS------TFNPLLSSSYTPTPCNSSICTTRTRDLTIPASCDPNNKLC 136

Query: 115 YGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTS----DSKQTG 168
           +   +YA  SS  G LA ET +    +      P  +FGC   + A  TS    DSK TG
Sbjct: 137 HVIVSYADASSAEGTLAAETFSLAGAA-----QPGTLFGC--MDSAGYTSDINEDSKTTG 189

Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG--------------- 213
           ++G+  G+ SL++QM      KFSYC+    S +   G ++ G G               
Sbjct: 190 LMGMNRGSLSLVTQMSLP---KFSYCI----SGEDALGVLLLGDGTDAPSPLQYTPLVTA 242

Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLE-----FVSSST--GNIFVDTGVLRTLLPLEYHS 266
             S+P   R  Y + LE I V  + L+     FV   T  G   VD+G   T L    +S
Sbjct: 243 TTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQTMVDSGTQFTFLLGSVYS 302

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQPKFPEVTIHFRGADVKLSPS 321
           +LK       K   V     +P F    +  LCY+  +S    P VT+ F GA++++S  
Sbjct: 303 SLKDEFLEQTKG--VLTRIEDPNFVFEGAMDLCYHAPASFAAVPAVTLVFSGAEMRVSGE 360

Query: 322 NLFRNI---SDEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L   +   SD + C  F  GN+++      V G   Q N  + +D+ ++ V F  + C
Sbjct: 361 RLLYRVSKGSDWVYCFTF--GNSDLLGIEAYVIGHHHQQNVWMEFDLLKSRVGFTQTTC 417


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 163/364 (44%), Gaps = 52/364 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++  +IGTP   +  ++DT +D  W  C  C  + C      LFDP KSS+  ++ C +
Sbjct: 88  YIVRANIGTPAQAMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPSKSSSSRTLQCEA 143

Query: 95  SQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            QC    + +C+    C ++  YG  A  ++    L  +TLT  +       +PN  FGC
Sbjct: 144 PQCKQAPNPSCTVSKSCGFNMTYGGSAIEAY----LTQDTLTLATD-----VIPNYTFGC 194

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
            +K  AS TS   Q G++GLG G  SLISQ        FSYCLP+  SS  NF G +   
Sbjct: 195 INK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKSS--NFSGSLRLG 249

Query: 213 ------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTG 254
                  + +TPL+        YY++L  I VGN+ ++  +S+         G IF D+G
Sbjct: 250 PKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIF-DSG 308

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGA 314
            + T L    +  +++     +K      +G   GF    CY  S    FP VT  F G 
Sbjct: 309 TVYTRLVEPAYVAMRNEFRRRVKNANATSLG---GFDT--CY--SGSVVFPSVTFMFAGM 361

Query: 315 DVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKP 368
           +V L P NL   + +  + C A      N+     V   + Q N  +  D+  + +    
Sbjct: 362 NVTLPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISR 421

Query: 369 SRCT 372
             CT
Sbjct: 422 ETCT 425


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 148/353 (41%), Gaps = 48/353 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTPP      +DTGSD  W QC PC +  C+ Q   +FDP++S +Y ++ C +
Sbjct: 142 YFASVGVGTPPTPALLVLDTGSDVVWLQCAPCRQ--CYAQSGRVFDPRRSRSYAAVRCGA 199

Query: 95  SQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C           +   G C Y   YG G   S ++G+LATETL F   +     +P V
Sbjct: 200 PPCRGLDAGGGGGCDRRRGTCLYQVAYGDG---SVTAGDLATETLWFARGA----RVPRV 252

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
             GCGH N     + +   G+        SL +Q       +FSYC   QGS        
Sbjct: 253 AVGCGHDNEGLFVAAAGLLGLGRG---RLSLPTQTARRYGRRFSYCF--QGSD------- 300

Query: 209 VAGAGVVSTPLIIRD-HYYLSLEAIS-VGNQRLEF-VSSSTGNIFVDTGVLRTLLPLEYH 265
                 +    IIR  H ++    +  VG + L    S+  G + +D+G   T L    +
Sbjct: 301 ------LDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVY 354

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVKLSP 320
             ++             G+   PG   +   CY++  +   K P V++H   GA+V L P
Sbjct: 355 VAVREAFRAA-----AGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPP 409

Query: 321 SNLFRNISDE-IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            N    +      C A  G +  + + G I Q  F + +D ++  V+  P  C
Sbjct: 410 ENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 161/371 (43%), Gaps = 46/371 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP+      E  L+D K+S T   +
Sbjct: 97  LYYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLV 156

Query: 91  SCSSSQCAVVT----SNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  +     S C +   CSY+ +Y  G   S S G    + + ++  SG L   
Sbjct: 157 SCDQDFCYAINGGPPSYCIANMSCSYTEIYADG---SSSFGYFVRDIVQYDQVSGDLETT 213

Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
             N  VIFGC        +S+    GI+G G  N+S+ISQ+ +S  +   F++CL     
Sbjct: 214 SANGSVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCL----- 268

Query: 201 SKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGN 248
             +N GGI A   +V     +TPL+  + HY ++++A+ VG   L        V    G 
Sbjct: 269 DGLNGGGIFAIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGT 328

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVT 308
           I +D+G     LP   +  L   +S +   Q    V           Y+ S    FP VT
Sbjct: 329 I-IDSGTTLAYLPEVVYDQL---LSKIFSWQSDLKVHTIHDQFTCFQYSESLDDGFPAVT 384

Query: 309 IHFRGA-DVKLSPSNLFRNISDEIMCSAFRGG------NANI-VYGRIMQINFLIGYDIE 360
            HF  +  +K+ P     +  D + C  ++          NI + G +   N L+ YD+E
Sbjct: 385 FHFENSLYLKVHPHEYLFSY-DGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLE 443

Query: 361 QAMVSFKPSRC 371
             ++ +    C
Sbjct: 444 NQVIGWTEYNC 454


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 169/370 (45%), Gaps = 39/370 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPPV+    +DTGSD  W  C     CP+    + +   FDP  SST + I
Sbjct: 77  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMI 136

Query: 91  SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGL 141
           +CS  +C      +  T +     CSY+F YG G   S +SG   ++ +  N+    S  
Sbjct: 137 ACSDQRCNNGKQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSMT 193

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
                 V+FGC ++     T SD    GI G G    S+ISQ+ +  IA + FS+CL  D
Sbjct: 194 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
                I   G +    +V T L+  + HY L+L++ISV  Q L+     F +S++    V
Sbjct: 254 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIV 313

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
           D+G     L  E +    S ++  I  Q V+ V +        CY I+S     FP+V++
Sbjct: 314 DSGTTLAYLAEEAYDPFVSAITAAIP-QSVRTVVSRGN----QCYLITSSVTDVFPQVSL 368

Query: 310 HFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
           +F  GA + L P +     +      + C  F+   G    + G ++  + ++ YD+   
Sbjct: 369 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQ 428

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 429 RIGWANYDCS 438


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 171/366 (46%), Gaps = 53/366 (14%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + + +IGTPP      +D   +  WTQC  C  + CFKQ+ P+F P  SST+    C + 
Sbjct: 55  VANFTIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPEPCGTD 112

Query: 96  QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            C ++ T  C+   C+Y  + G G +   + G +AT+T    + +  P  +    FGC  
Sbjct: 113 VCKSIPTPKCASDVCAYDGVTGLGGH---TVGIVATDTFAIGTAA--PASLG---FGC-- 162

Query: 155 KNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVA 210
             + +   D+    +G IGLG    SL++QM  +   +FSYCL   D G +   F G  A
Sbjct: 163 --VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASA 217

Query: 211 --GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
               G   TP +       +  +Y + LE I  G+  +  +      + V T V+R  L 
Sbjct: 218 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT-MPRGRNTVLVQTAVVRVSLL 276

Query: 262 LE--YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKL 318
           ++  Y    K+VM+++  A     VGA   F   +C+  +     P++   F+ GA + +
Sbjct: 277 VDSVYQEFKKAVMASVGAAPTATPVGAP--FE--VCFPKAGVSGAPDLVFTFQAGAALTV 332

Query: 319 SPSNLFRNISDEIMC-----------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
            P+N   ++ ++ +C           +A  G N   + G   Q N  + +D+++ M+SF+
Sbjct: 333 PPANYLFDVGNDTVCLSVMSIALLNITALDGLN---ILGSFQQENVHLLFDLDKDMLSFE 389

Query: 368 PSRCTN 373
           P+ C++
Sbjct: 390 PADCSS 395


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 157/371 (42%), Gaps = 47/371 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK-QEPPLFDPKKSSTYNSISCS 93
           Y+    +GTPP  +  ++D  +D  W  C  C  L C      P FDP +SSTY  + C 
Sbjct: 100 YVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRCG 157

Query: 94  SSQCAVV---TSNCSEG---DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + QCA V   T +C  G    C+++  Y     +S     L  + L+ + ++G  V   +
Sbjct: 158 APQCAQVPPATPSCPAGPGASCAFNLSYA----SSTLHAVLGQDALSLSDSNGAAVPDDH 213

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             FGC      S  S   Q G++G G G  S +SQ   +    FSYCLP   SS  NF G
Sbjct: 214 YTFGCLRVVTGSGGSVPPQ-GLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS--NFSG 270

Query: 208 I--VAGAG----VVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNI 249
              +  AG    + +TPL+   H    YY+++  + V  + +   +S+         G  
Sbjct: 271 TLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            VD G + T L    ++ L++     + A     +G   GF    CY ++     P V  
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSAPAAPALG---GFDT--CYYVNGTKSVPAVAF 385

Query: 310 HFR-GADVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQINFLIGYDIEQ 361
            F  GA V L   N +  + S  + C A   G     NA + V   + Q N  + +D+  
Sbjct: 386 VFAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGN 445

Query: 362 AMVSFKPSRCT 372
             V F    CT
Sbjct: 446 GRVGFSRELCT 456


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 168/373 (45%), Gaps = 44/373 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTPP + +  VDTGSD  W    QC+ CP       +  L+D K+SS+   +
Sbjct: 84  LYYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFV 143

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
            C    C  +      G      C Y  +YG G   S ++G    + + ++  SG L  +
Sbjct: 144 PCDQEFCKEINGGLLTGCTANISCPYLEIYGDG---SSTAGYFVKDIVLYDQVSGDLKTD 200

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  ++FGCG +     +S +++   GI+G G  NSS+ISQ+ +S  +   F++CL   
Sbjct: 201 SANGSIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL--- 257

Query: 199 GSSKINFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLEFV--SSSTGN-- 248
             + +N GGI A   VV      TPL+  + HY +++ A+ VG+  L     +S+ G+  
Sbjct: 258 --NGVNGGGIFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRK 315

Query: 249 -IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEV 307
              +D+G     LP   +  L   +  +I   P   V           Y+ S    FP V
Sbjct: 316 GTIIDSGTTLAYLPEGIYEPL---VYKIISQHPDLKVRTLHDEYTCFQYSESVDDGFPAV 372

Query: 308 TIHFR-GADVK------LSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIE 360
           T +F  G  +K      L PS  F  I  +   +  R      + G ++  N L+ YD+E
Sbjct: 373 TFYFENGLSLKVYPHDYLFPSGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLE 432

Query: 361 QAMVSFKPSRCTN 373
             ++ +    C++
Sbjct: 433 NQVIGWTEYNCSS 445


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 165/374 (44%), Gaps = 55/374 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M   IGTP   + G  DTGSD  WT+C  C    C  +  P + P  SS+   ++C  
Sbjct: 92  YAMSFGIGTPATGLSGEADTGSDLIWTKCGACAR--CSPRGSPSYYPTSSSSAAFVACGD 149

Query: 95  SQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSGNLATETLTFNSTSGLPVE 144
             C          V       G+CSY + YG       ++ G L TET TF   +     
Sbjct: 150 RTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDA---AA 206

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SK 202
            P + FGC    L S       +G++GLG G  SL++Q+       F Y L    S  S 
Sbjct: 207 FPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSP 260

Query: 203 INFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAISVGNQRLEF--------VS 243
           I+FG +       G   +STPL    +++D   YY+ L  ISVG + ++          S
Sbjct: 261 ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRS 320

Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQ 301
           +  G +  D+G   T+LP   ++ ++  ++S M   +P      +    D++C+   SS 
Sbjct: 321 TGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDD----DLICFTGGSST 376

Query: 302 PKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
             FP + +HF  GAD+ LS  N       +N       S  +   A  + G IMQ++F +
Sbjct: 377 TTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHV 436

Query: 356 GYDIE-QAMVSFKP 368
            +D+   A + F+P
Sbjct: 437 VFDLSGNARMLFQP 450


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 106/396 (26%), Positives = 171/396 (43%), Gaps = 51/396 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  +L F +     KS + I    +I+     Y++   IGTP   +  ++DT +D  W
Sbjct: 63  AKDQARLQFLSSLVARKSVVPIASGRQIVQ-SPTYIVRAKIGTPAQTMLLAMDTSNDAAW 121

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-TSNCSEGDCSYSFLYGRGA 119
             C  C  + C      +F+  KS+T+ ++ C + QC  V  S C    C+++  YG  +
Sbjct: 122 IPCSGC--VGC---SSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSSS 176

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDS-KQTGIIGLGPGNSS 178
            A+    NL+ + +T  + S     +P+  FGC    L   T  S    G++GLG G  S
Sbjct: 177 IAA----NLSQDVVTLATDS-----IPSYTFGC----LTEATGSSIPPQGLLGLGRGPMS 223

Query: 179 LISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLE 230
           L+SQ        FSYCLP       S  +  G +     + +TPL+        YY++L 
Sbjct: 224 LLSQTQNLYQSTFSYCLPSFRSLNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLM 283

Query: 231 AISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK 282
           AI VG + ++   S+         G IF D+G + T L    ++ ++      +    V 
Sbjct: 284 AIRVGRRVVDIPPSALAFNPTTGAGTIF-DSGTVFTRLVAPAYTAVRDAFRKRVGNATVT 342

Query: 283 GVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNA 341
            +G   GF    CY  +S    P +T  F G +V L P NL   + +  I C A      
Sbjct: 343 SLG---GFDT--CY--TSPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPD 395

Query: 342 NI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N+     V   + Q N  I +D+  + +      CT
Sbjct: 396 NVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 157/352 (44%), Gaps = 55/352 (15%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----------T 101
           VDT S+ TW QC PC    C  Q+ PLFDP  S +Y  + C+SS C  +           
Sbjct: 142 VDTASELTWVQCAPCA--SCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGAC 199

Query: 102 SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT 161
               +  CSY+  Y  G+Y   S G LA + L+          +   +FGCG  N   P 
Sbjct: 200 GGGEQPSCSYTLSYRDGSY---SQGVLAHDKLSLAGEV-----IDGFVFGCGTSN-QGPF 250

Query: 162 SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----ST 217
             +  +G++GLG    SLISQ      G FSYCLP + S   + G +V G        ST
Sbjct: 251 GGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESE--SSGSLVLGDDTSVYRNST 306

Query: 218 PLI----IRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
           P++    + D      Y+++L  I++G Q +E   SS G + VD+G + T L    ++ +
Sbjct: 307 PIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE---SSAGKVIVDSGTIITSLVPSVYNAV 363

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLF 324
           K+   +     P       PGFS +  C+N++   + + P +   F G  +V++  S + 
Sbjct: 364 KAEFLSQFAEYP-----QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVL 418

Query: 325 RNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             +S +     +  ++ +      + G   Q N  + +D   + + F    C
Sbjct: 419 YFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 165/374 (44%), Gaps = 55/374 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M   IGTP   + G  DTGSD  WT+C  C    C  +  P + P  SS+   ++C  
Sbjct: 92  YAMSFGIGTPATGLSGEADTGSDLIWTKCGACAR--CSPRGSPSYYPTSSSSAAFVACGD 149

Query: 95  SQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSGNLATETLTFNSTSGLPVE 144
             C          V       G+CSY + YG       ++ G L TET TF   +     
Sbjct: 150 RTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTFGDDA---AA 206

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS--SK 202
            P + FGC    L S       +G++GLG G  SL++Q+       F Y L    S  S 
Sbjct: 207 FPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSP 260

Query: 203 INFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAISVGNQRLEF--------VS 243
           I+FG +       G   +STPL    +++D   YY+ L  ISVG + ++          S
Sbjct: 261 ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRS 320

Query: 244 SSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYN-ISSQ 301
           +  G +  D+G   T+LP   ++ ++  ++S M   +P      +    D++C+   SS 
Sbjct: 321 TGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDD----DLICFTGGSST 376

Query: 302 PKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLI 355
             FP + +HF  GAD+ LS  N       +N       S  +   A  + G IMQ++F +
Sbjct: 377 TTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHV 436

Query: 356 GYDIE-QAMVSFKP 368
            +D+   A + F+P
Sbjct: 437 VFDLSGNARMLFQP 450


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 171/379 (45%), Gaps = 57/379 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+P    +  VDTGSD  W    +C+ CP       E   +DP  S T  ++
Sbjct: 84  LYYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TV 141

Query: 91  SCSSSQCAVVTSN-------CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
            C    C   + N        +   C +   YG G   S ++G   ++++ +N  SG   
Sbjct: 142 GCDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDG---SSTTGFYVSDSVQYNQVSGNGQ 198

Query: 144 EMP---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
             P   ++ FGCG + L      S Q   GI+G G  +SS++SQ+  +  +   F++CL 
Sbjct: 199 TTPSNASITFGCGAQ-LGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL- 256

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSS 245
                 ++ GGI A   VV     +TPL+    HY ++L+ ISVG   L+     F S  
Sbjct: 257 ----DTVHGGGIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGD 312

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
           +    +D+G     LP E +  L + + +  K Q +    A   + D +C+  S      
Sbjct: 313 SKGTIIDSGTTLAYLPREVYRTLLTAVFD--KYQDL----ALHNYQDFVCFQFSGSIDDG 366

Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFL 354
           FP VT  F G   + + P + LF+N +D + C  F       + G   ++ G ++  N L
Sbjct: 367 FPVVTFSFEGEITLNVYPHDYLFQNEND-LYCMGFLDGGVQTKDGKDMVLLGDLVLSNKL 425

Query: 355 IGYDIEQAMVSFKPSRCTN 373
           + YD+E+ ++ +    C++
Sbjct: 426 VVYDLEKQVIGWADYNCSS 444


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 168/376 (44%), Gaps = 50/376 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTG+D  W    QC+ CP       +  L++ K+SS+   +
Sbjct: 72  LYYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLV 131

Query: 91  SCSSSQCAVVTSNCSEG-------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LP 142
            C    C  +      G        C Y  +YG G   S ++G    + + F+  SG L 
Sbjct: 132 PCDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDG---SSTAGYFVKDVVLFDQVSGDLK 188

Query: 143 VEMPN--VIFGCGHKNLA--SPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
               N  VIFGCG +     S +++    GI+G G  N S+ISQ+ +S  +   F++CL 
Sbjct: 189 TASANGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL- 247

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLEFVS------S 244
               + +N GGI A   VV     +TPL+  + HY +++ AI VG+  L   +       
Sbjct: 248 ----NGVNGGGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRD 303

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKF 304
           S G I +D+G     LP   +  L   +  ++  QP   V           Y+ S    F
Sbjct: 304 SKGTI-IDSGTTLAYLPDGIYQPL---VYKILSQQPNLKVQTLHDEYTCFQYSGSVDDGF 359

Query: 305 PEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNAN-------IVYGRIMQINFLIG 356
           P VT +F  G  +K+ P + +  +S+ + C  ++   A         + G ++  N L+ 
Sbjct: 360 PNVTFYFENGLSLKVYPHD-YLFLSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVF 418

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+E  ++ +    C+
Sbjct: 419 YDLENQVIGWTEYNCS 434


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 97/344 (28%), Positives = 150/344 (43%), Gaps = 42/344 (12%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-------TSN 103
           ++DT  D  W QC PCP   C+ Q  PLFDP  SST  ++ C S  C  +       ++ 
Sbjct: 151 AIDTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNR 210

Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSD 163
            +  +C Y   Y        ++G   T+TLT + T+ +     N  FGC H  +    SD
Sbjct: 211 SANAECRYLIEYSDD---RATAGTYMTDTLTISGTTAV----RNFRFGCSHA-VRGRFSD 262

Query: 164 SKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGG--IVAGAGVVSTPLI 220
               G + LG G  SL++Q   S+   FSYC+P   +S  ++ GG        V +T  +
Sbjct: 263 -LTAGTMSLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPL 321

Query: 221 IRDH-----YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
           +R       Y + L+ I V  +RL    V+ S G +   + V+  L P  Y + L+    
Sbjct: 322 VRSAINPSLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRA-LRRAFR 380

Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
           N ++A P  G     G  D  CY+    +  + P V++ F  GA V L P  +       
Sbjct: 381 NAMRAYPRSGA---TGTLDT-CYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG---- 432

Query: 331 IMCSAFRGGNANIVY---GRIMQINFLIGYDIEQAMVSFKPSRC 371
             C AF   ++++     G + Q    + YD+    V F+   C
Sbjct: 433 -GCLAFTATSSDLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 157/352 (44%), Gaps = 55/352 (15%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----------T 101
           VDT S+ TW QC PC    C  Q+ PLFDP  S +Y  + C+SS C  +           
Sbjct: 141 VDTASELTWVQCAPCA--SCHDQQGPLFDPASSPSYAVLPCNSSSCDALQVATGSAAGAC 198

Query: 102 SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT 161
               +  CSY+  Y  G+Y   S G LA + L+          +   +FGCG  N   P 
Sbjct: 199 GGGEQPSCSYTLSYRDGSY---SQGVLAHDKLSLAGEV-----IDGFVFGCGTSN-QGPF 249

Query: 162 SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVV----ST 217
             +  +G++GLG    SLISQ      G FSYCLP + S   + G +V G        ST
Sbjct: 250 GGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPLKESE--SSGSLVLGDDTSVYRNST 305

Query: 218 PLI----IRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNL 268
           P++    + D      Y+++L  I++G Q +E   SS G + VD+G + T L    ++ +
Sbjct: 306 PIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE---SSAGKVIVDSGTIITSLVPSVYNAV 362

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRG-ADVKLSPSNLF 324
           K+   +     P       PGFS +  C+N++   + + P +   F G  +V++  S + 
Sbjct: 363 KAEFLSQFAEYP-----QAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVDSSGVL 417

Query: 325 RNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             +S +     +  ++ +      + G   Q N  + +D   + + F    C
Sbjct: 418 YFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 100/377 (26%), Positives = 171/377 (45%), Gaps = 51/377 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+PP   +  VDTGSD  W     C+ CP       E   +DP  S T  ++
Sbjct: 84  LYYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TV 141

Query: 91  SCSSSQCAV------VTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
            C    C        V   C  +   C +   YG G   S ++G   T+ + +N  SG  
Sbjct: 142 GCEQEFCVANSAASGVPPACPSAASPCQFRITYGDG---SSTTGFYVTDFVQYNQVSGNG 198

Query: 143 VEMP---NVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
              P   ++ FGCG + L      S Q   GI+G G  ++S++SQ+  +  +   F++CL
Sbjct: 199 QTTPSNVSITFGCGAQ-LGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCL 257

Query: 196 PD-QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVGNQRLE-----FVSSSTGN 248
              +G      G +V    V +TPL+    HY ++L+ ISVG   L+     F S  +  
Sbjct: 258 DTVRGGGIFAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKG 317

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
             +D+G     LP E +   +++++ +    P   V     + D +C+  S     +FP 
Sbjct: 318 TIIDSGTTLAYLPREVY---RTLLTAVFDKHPDLAV---RNYEDFICFQFSGSLDEEFPV 371

Query: 307 VTIHFRGADVKLS--PSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIG 356
           +T  F G D+ L+  P + LF+N  +++ C  F       + G   ++ G ++  N L+ 
Sbjct: 372 ITFSFEG-DLTLNVYPHDYLFQN-GNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVV 429

Query: 357 YDIEQAMVSFKPSRCTN 373
           YD+E+ ++ +    C++
Sbjct: 430 YDLEKQVIGWTDYNCSS 446


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 164/379 (43%), Gaps = 47/379 (12%)

Query: 26  AEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKS 84
           A++      Y+    IG+PP      +DTGSD  WTQC   C    C KQ  P ++  +S
Sbjct: 77  AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136

Query: 85  STYNSISCSSSQ--CAV--VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           ST+  + C+     CA   V     +G C++   YG G       G+L TE+  F S + 
Sbjct: 137 STFVPVPCADKAGFCAANGVHLCGLDGSCTFIASYGAGRVI----GSLGTESFAFESGT- 191

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP---- 196
                 ++ FGC      +  + +  +G+IGLG G  SL+SQ+G   A +FSYCL     
Sbjct: 192 -----TSLAFGCVSLTRITSGALNDASGLIGLGRGRLSLVSQIG---ATRFSYCLTPYFH 243

Query: 197 DQGSSKINF--GGIVAGAGVVSTPLII--RDH-----YYLSLEAISVGNQRLEFVSSST- 246
             G+S   F       G G  S P +   +D+     YYL LE I+VG  RL  V+S+T 
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTF 303

Query: 247 -----------GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLC 295
                      G + +DTG   T L    +  LK  ++  +    +     + G    LC
Sbjct: 304 QLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLE--LC 361

Query: 296 YNISS-QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
                 Q   P +  HF  GAD+ +  ++ +  +     C     G  + + G   Q + 
Sbjct: 362 VAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDSIIGNFQQQDM 421

Query: 354 LIGYDIEQAMVSFKPSRCT 372
            + YD+ +   SF+ + CT
Sbjct: 422 HLLYDLRRGRFSFQTADCT 440


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 164/376 (43%), Gaps = 52/376 (13%)

Query: 23  IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           I     I     Y++  +IGTP   +  ++DT +D  W  C  C  + C      LFDP 
Sbjct: 76  IASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPS 131

Query: 83  KSSTYNSISCSSSQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           KSS+  ++ C + QC    + +C+    C ++  YG     ++    L  +TLT  S   
Sbjct: 132 KSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAY----LTQDTLTLASD-- 185

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
               +PN  FGC +K  AS TS   Q G++GLG G  SLISQ        FSYCLP+  S
Sbjct: 186 ---VIPNYTFGCINK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKS 239

Query: 201 SKINFGGIVAGA------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----- 245
           S  NF G +          + +TPL+        YY++L  I VGN+ ++  +S+     
Sbjct: 240 S--NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297

Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
               G IF D+G + T L    +  +++     +K      +G   GF    CY  S   
Sbjct: 298 ATGAGTIF-DSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG---GFDT--CY--SGSV 349

Query: 303 KFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIG 356
            FP VT  F G +V L P NL   + +  + C A      N+     V   + Q N  + 
Sbjct: 350 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 409

Query: 357 YDIEQAMVSFKPSRCT 372
            D+  + +      CT
Sbjct: 410 IDVPNSRLGISRETCT 425


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  107 bits (267), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 164/376 (43%), Gaps = 52/376 (13%)

Query: 23  IYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPK 82
           I     I     Y++  +IGTP   +  ++DT +D  W  C  C  + C      LFDP 
Sbjct: 76  IASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC--VGC--SSSVLFDPS 131

Query: 83  KSSTYNSISCSSSQCAVVTS-NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG 140
           KSS+  ++ C + QC    + +C+    C ++  YG     ++    L  +TLT  S   
Sbjct: 132 KSSSSRTLQCEAPQCKQAPNPSCTVSKSCGFNMTYGGSTIEAY----LTQDTLTLASD-- 185

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS 200
               +PN  FGC +K  AS TS   Q G++GLG G  SLISQ        FSYCLP+  S
Sbjct: 186 ---VIPNYTFGCINK--ASGTSLPAQ-GLMGLGRGPLSLISQSQNLYQSTFSYCLPNSKS 239

Query: 201 SKINFGGIVAGA------GVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS----- 245
           S  NF G +          + +TPL+        YY++L  I VGN+ ++  +S+     
Sbjct: 240 S--NFSGSLRLGPKNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDP 297

Query: 246 ---TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
               G IF D+G + T L    +  +++     +K      +G   GF    CY  S   
Sbjct: 298 ATGAGTIF-DSGTVYTRLVEPAYVAVRNEFRRRVKNANATSLG---GFDT--CY--SGSV 349

Query: 303 KFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINFLIG 356
            FP VT  F G +V L P NL   + +  + C A      N+     V   + Q N  + 
Sbjct: 350 VFPSVTFMFAGMNVTLPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVL 409

Query: 357 YDIEQAMVSFKPSRCT 372
            D+  + +      CT
Sbjct: 410 IDVPNSRLGISRETCT 425


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/344 (29%), Positives = 148/344 (43%), Gaps = 49/344 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
           +D+ SD  W QC PCP   C  Q    +DP +S T  + SCSS  C  +    + C+   
Sbjct: 33  LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANNQ 92

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
           C Y   Y  G   S +SG    + LT    N+ SG         FGC H    S   D++
Sbjct: 93  CQYLVRYPDG---SSTSGAYIADLLTLDAGNAVSGFK-------FGCSHAEQGS--FDAR 140

Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV---AGAGVVSTPLI-- 220
             GI+ LG G  SL+SQ  +     FSYC+P   S    F   V   A +  V TP++  
Sbjct: 141 AAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRF 200

Query: 221 --IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
                 Y + L  I+VG QRL    +  + G++      +  L P  Y +   +  S+M 
Sbjct: 201 RQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMT 260

Query: 277 --KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
             ++ P K      G+ D  CY+ +     + P++++ F R A + L PS +  N     
Sbjct: 261 MYRSAPPK------GYLDT-CYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND---- 309

Query: 332 MCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            C AF   NA+     V G + Q    + YD+    V F+   C
Sbjct: 310 -CLAFT-SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 72/243 (29%), Positives = 115/243 (47%), Gaps = 29/243 (11%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E   +  +++ +  I+     YL+ L IGTPP     ++DT SD  WTQC+PC    C+ 
Sbjct: 68  EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPC--TGCYH 125

Query: 74  QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
           Q  P+F+P+ SSTY ++ CSS  C  +    C   D   C Y++ Y   A    + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNA---TTEGTLA 182

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
            + L     +        V FGC   +        + +G++GLG G  SL+SQ+      
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQLSVR--- 233

Query: 190 KFSYCLPDQGSS---KINFGGIVAGAGVVSTPLII---RD-----HYYLSLEAISVGNQR 238
           +F+YCLP   S    K+  G     A   +  + +   RD     +YYL+L+ + +G++ 
Sbjct: 234 RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRT 293

Query: 239 LEF 241
           +  
Sbjct: 294 MSL 296


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/382 (28%), Positives = 180/382 (47%), Gaps = 69/382 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCSSSQC 97
           +++GTPP ++   +DTGS+ +W  C      +     P P F+P  SS+Y  ISCSS  C
Sbjct: 70  ITVGTPPQNMSMVIDTGSELSWLHC----NTNTTATIPYPFFNPNISSSYTPISCSSPTC 125

Query: 98  AVVT------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
              T      ++C   +  ++ L    A AS S GNLA++T  F S+       P ++FG
Sbjct: 126 TTRTRDFPIPASCDSNNLCHATL--SYADASSSEGNLASDTFGFGSSFN-----PGIVFG 178

Query: 152 CGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSSK 202
           C + + ++ + SDS  TG++G+  G+ SL+SQ+      KFSYC+           G S 
Sbjct: 179 CMNSSYSTNSESDSNTTGLMGMNLGSLSLVSQLKIP---KFSYCISGSDFSGILLLGESN 235

Query: 203 INFGGIVAGAGVV--STPL--IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFV--DTGVL 256
            ++GG +    +V  STPL    R  Y + LE I + ++ L      +GN+FV   TG  
Sbjct: 236 FSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNI----SGNLFVPDHTGAG 291

Query: 257 RTLLPL-EYHSNLKSVMSNMIKAQPVKGVGA------EPGF----SDVLCYNI----SSQ 301
           +T+  L    S L   + N ++ + +           +P F    +  LCY +    S  
Sbjct: 292 QTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRALDDPNFVFQIAMDLCYRVPVNQSEL 351

Query: 302 PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIM 349
           P+ P V++ F GA++++    L   +      +D + C  F  GN+++      + G   
Sbjct: 352 PELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTF--GNSDLLGVEAFIIGHHH 409

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q +  + +D+ +  V    +RC
Sbjct: 410 QQSMWMEFDLVEHRVGLAHARC 431


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 113/356 (31%), Positives = 168/356 (47%), Gaps = 38/356 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +S+GTP V     VDTGSD +W QC PC    C+ Q+  LFDP KSS+Y+++ C++
Sbjct: 500 YVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLFDPAKSSSYSAVPCAA 559

Query: 95  SQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C+ +++    C+ G  C Y   YG G   S ++G   ++TLT      +       +F
Sbjct: 560 DACSELSTYGHGCAAGSQCGYVVSYGDG---SNTTGVYGSDTLTLTDADAV----TGFLF 612

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM-GTSIAGKFSYCLPDQGSSK--INFGG 207
           GCGH   A     +   G++ LG    SL SQ  G    G FSYCLP   SS   +  GG
Sbjct: 613 GCGH---AQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPSSTGFLTLGG 669

Query: 208 IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS--TGNIFVDTGVLRTLLP 261
             + +G  +T L+    +   Y + L  I VG Q+L  V +S   G   VDTG + T LP
Sbjct: 670 PSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVVDTGTVITRLP 729

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHFR-GADV 316
               +   ++ +    A    G  A P  G  D  CYN +       P V++ F  GA +
Sbjct: 730 ---PTAYAALRAAFRAAMAPYGYPAAPATGILDT-CYNFTDYGTVTLPTVSLTFSGGATL 785

Query: 317 KLSPSNLFRNISDEIMCSAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           KL        +S   +  A   G+ +  + G + Q +F + +D   + V F P  C
Sbjct: 786 KLDAPGF---LSSGCLAFATNSGDGDPAILGNVQQRSFAVRFD--GSSVGFMPHSC 836


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/358 (29%), Positives = 170/358 (47%), Gaps = 38/358 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +++ +  GTP       +DTGSD +W QC+PC    C++Q  P FDP KSS+Y ++ C +
Sbjct: 137 FVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAVPCGT 195

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             CA     C+   C Y   YG G   S ++G L+ +TLTFNS+S    +     FGCG 
Sbjct: 196 PVCAAAGGMCNGTTCLYGVQYGDG---SSTTGVLSRDTLTFNSSS----KFTGFTFGCGE 248

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGI--VA 210
           KN+       +  G++GLG G  SL SQ   S  G FSYCLP   ++   +N G     +
Sbjct: 249 KNIG---DFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTS 305

Query: 211 GAGVVSTPLIIRDH----YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
              V  T +I +      Y++ L +I++G   L     V + TG + +D+G + T LP  
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKTGTL-LDSGTILTYLPPP 364

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GA--DVK 317
            +++L+      +     +G    P +  +  CY+ + Q     P V+ +F  GA  D+ 
Sbjct: 365 AYTSLRDRFKFTM-----QGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLD 419

Query: 318 LSPSNLFRNISDEIM-CSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                +F + +  ++ C AF    A +   + G   Q    + YD+    + F P  C
Sbjct: 420 FYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/404 (24%), Positives = 171/404 (42%), Gaps = 79/404 (19%)

Query: 21  SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFD 80
           ++  +A ++     YL+ L  GTP      ++DT SD  W QC+PC  + C++Q  P+F+
Sbjct: 78  AVASEAPLVPGGGEYLVKLGTGTPQHFFSAAIDTASDLVWMQCQPC--VSCYRQLDPVFN 135

Query: 81  PKKSSTYNSISCSSSQCAVVTSN-CSEGD---CSYSFLY-GRGAYASFSSGNLATETLTF 135
           PK SS+Y  + C+S  CA +  + C E D   C Y++ Y G G     + G LA + L  
Sbjct: 136 PKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGV----TKGTLAIDKLAI 191

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
                       V+FGC   ++  P   ++ +G++GLG G  SL+SQ+      +F YCL
Sbjct: 192 GGDV-----FHAVVFGCSDSSVGGPA--AQASGLVGLGRGPLSLVSQLSVH---RFMYCL 241

Query: 196 PDQGSSKINFGGIVAGAGV-------------VSTPLIIRDHYYLSLEAISVGNQRLEFV 242
           P   S     G +V GAG              +S+      +YYL+L+ ++VG+Q     
Sbjct: 242 PPPMSR--TSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTT 299

Query: 243 SSSTG--------------------------NIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
            ++T                            + VD     + L    +  L   +   I
Sbjct: 300 RNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLEEEI 359

Query: 277 ---KAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLFRNIS 328
              +A P   +G +      LC+ +       +   P V++ F G  ++L    LF  ++
Sbjct: 360 RLPRATPSLRLGLD------LCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLF--VT 411

Query: 329 D-EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           D  +MC      +   + G     N  + +++ +  ++F  + C
Sbjct: 412 DGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 455


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 169/370 (45%), Gaps = 39/370 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPPV+    +DTGSD  W  C     CP+    + +   FDP  SST + I
Sbjct: 74  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 133

Query: 91  SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           +CS  +C      +  T +     CSY+F YG G   S +SG   ++ +  N+     V 
Sbjct: 134 ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSVT 190

Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
             +   V+FGC ++     T SD    GI G G    S+ISQ+ +  IA + FS+CL  D
Sbjct: 191 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 250

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
                I   G +    +V T L+  + HY L+L++I+V  Q L+     F +S++    V
Sbjct: 251 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIV 310

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
           D+G     L  E +    S ++  I  Q V  V +        CY I+S     FP+V++
Sbjct: 311 DSGTTLAYLAEEAYDPFVSAITASIP-QSVHTVVSRGN----QCYLITSSVTEVFPQVSL 365

Query: 310 HFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQA 362
           +F  GA + L P +     +      + C  F+   G    + G ++  + ++ YD+   
Sbjct: 366 NFAGGASMILRPQDYLIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQ 425

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 426 RIGWANYDCS 435


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/347 (27%), Positives = 148/347 (42%), Gaps = 50/347 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
           +D+GSD +W QC+PCP   C +Q  PLFDP  S+TY ++ C+S+ CA +      CS   
Sbjct: 172 IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 231

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C +   YG         G+ AT T +F+  +  P + +    FGC H +  S   D   
Sbjct: 232 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 282

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
            G + LG G+ SL+ Q  T     FSYCLP   SS    G +V G            VST
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 339

Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
           PL+        Y + L AI V  + L    +  S  ++   + ++  L P  Y    +  
Sbjct: 340 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 399

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNI 327
           +S M+    A PV  +     F+ V    +      P + + F  GA V L  + +    
Sbjct: 400 RSAMTMYRAAPPVSILDTCYDFTGVRSITL------PSIALVFDGGATVNLDAAGILLG- 452

Query: 328 SDEIMCSAFRGGNANIV---YGRIMQINFLIGYDIEQAMVSFKPSRC 371
                C AF    ++ +    G + Q    + YD+    + F+ + C
Sbjct: 453 ----SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 103/382 (26%), Positives = 168/382 (43%), Gaps = 68/382 (17%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +G+P   +  ++DT +D TW  C PC    C      LF P  SS+Y S+ CSS
Sbjct: 79  YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGT--CPSSS--LFAPANSSSYASLPCSS 134

Query: 95  SQCAVVTSNCSE-----GD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           S C +            GD          C++S  +   A ASF +  LA++TL     +
Sbjct: 135 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPF---ADASFQAA-LASDTLRLGKDA 190

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
                +PN  FGC   ++  PT++  + G++GLG G  +L+SQ G+   G FSYCLP   
Sbjct: 191 -----IPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR 244

Query: 200 SSKINFGGIVAGAG------VVSTPLIIRDH----YYLSLEAISVGNQRLE-------FV 242
           S   + G +  GAG      V  TP++   H    YY+++  +SVG+  ++       F 
Sbjct: 245 SYYFS-GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFD 303

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNI 298
           +++     VD+G + T      ++ L+            + V A  G++ +     C+N 
Sbjct: 304 AATGAGTVVDSGTVITRWTAPVYAALREEFR--------RQVAAPSGYTSLGAFDTCFNT 355

Query: 299 S--SQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIM 349
              +    P VT+H  G  D+ L   N L  + +  + C A      N+     V   + 
Sbjct: 356 DEVAAGGAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQ 415

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q N  + +D+  + V F    C
Sbjct: 416 QQNIRVVFDVANSRVGFAKESC 437


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 79/243 (32%), Positives = 115/243 (47%), Gaps = 44/243 (18%)

Query: 40  SIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA- 98
           S G+P  ++   VDTGSD TW QC+PC    C+ Q  PLFDP  S+TY ++ C++S CA 
Sbjct: 101 SSGSPAANLTVIVDTGSDLTWVQCKPCSA--CYAQRDPLFDPAGSATYAAVRCNASACAD 158

Query: 99  -----------VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
                        ++      C Y+  YG G   SFS G LAT+T+     S     +  
Sbjct: 159 SLRAATGTPGSCGSTGAGSEKCYYALAYGDG---SFSRGVLATDTVALGGAS-----LGG 210

Query: 148 VIFGCG--HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP------DQG 199
            +FGCG  ++ L   T+     G++GLG    SL+SQ  +   G FSYCLP        G
Sbjct: 211 FVFGCGLSNRGLFGGTA-----GLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASG 265

Query: 200 SSKINFGGIVAGAGVVSTPL----IIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIF 250
           S  +  G   A +   +TP+    +I D      Y+L++   +VG   L        N+ 
Sbjct: 266 SLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVL 325

Query: 251 VDT 253
           +D+
Sbjct: 326 IDS 328


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/366 (27%), Positives = 172/366 (46%), Gaps = 53/366 (14%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + + +IGTPP      +D   +  WTQC  C  + CFKQ+ P+F P  SST+    C + 
Sbjct: 25  VANFTIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPEPCGTD 82

Query: 96  QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            C ++ T  C+   C++  + G G +   + G +AT+T    + +  P  +    FGC  
Sbjct: 83  VCKSIPTPKCASDVCAFDGVTGLGGH---TVGIVATDTFAIGTAA--PASLG---FGC-- 132

Query: 155 KNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGGIVA 210
             + +   D+    +G IGLG    SL++QM  +   +FSYCL   D G +   F G  A
Sbjct: 133 --VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASA 187

Query: 211 --GAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLP 261
               G   TP +       +  +Y + LE I  G+  +  +      + V T V+R  L 
Sbjct: 188 KLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATIT-MPRGRNTVLVQTAVVRVSLL 246

Query: 262 LE--YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKL 318
           ++  Y    K+VM+++  A     VG EP F   +C+  +     P++   F+ GA + +
Sbjct: 247 VDSVYQEFKKAVMASVGAAPTATPVG-EP-FE--VCFPKAGVSGAPDLVFTFQAGAALTV 302

Query: 319 SPSNLFRNISDEIMC-----------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
            P+N   ++ ++ +C           +A  G N   + G   Q N  + +D+++ M+SF+
Sbjct: 303 PPANYLFDVGNDTVCLSVMSIALLNITALDGLN---ILGSFQQENVHLLFDLDKDMLSFE 359

Query: 368 PSRCTN 373
           P+ C++
Sbjct: 360 PADCSS 365


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 163/367 (44%), Gaps = 70/367 (19%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNC 104
           VDTGSD TW QC+PC    C+ Q  PLFDP  S++Y ++ C++S C         V  +C
Sbjct: 181 VDTGSDLTWVQCKPCSV--CYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 238

Query: 105 S----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG- 153
           +             C YS  YG G   SFS G LAT+T+     S     +   +FGCG 
Sbjct: 239 ATVGGGGGGGKSERCYYSLAYGDG---SFSRGVLATDTVALGGAS-----VDGFVFGCGL 290

Query: 154 -HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
            ++ L   T+     G++GLG    SL+SQ      G FSYCLP       +  ++ GG 
Sbjct: 291 SNRGLFGGTA-----GLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 345

Query: 209 VAG---AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTL 259
            +    A  VS   +I D      Y++++   SVG   +        N+ +D+G V+  L
Sbjct: 346 TSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 405

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAE-----PGFSDV-LCYNIS--SQPKFPEVTIHF 311
            P  Y +         ++A+  +  GAE     P FS +  CYN++   + K P +T+  
Sbjct: 406 APSVYRA---------VRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 456

Query: 312 R-GADVKLSPSN-LFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
             GAD+ +  +  LF    D     +  ++    +   + G   Q N  + YD   + + 
Sbjct: 457 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 516

Query: 366 FKPSRCT 372
           F    C+
Sbjct: 517 FADEDCS 523


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/373 (27%), Positives = 167/373 (44%), Gaps = 51/373 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP  + G VD   +  WTQC  C    CFKQE P+FDP  S+TY +  C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 95  SQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C ++ T NCS +G+C Y    ++G       + G  +T+ +   +  G       + F
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD------TFGIASTDAIAIGNAEG------RLAF 169

Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-----IN 204
           GC   +  S   +    +G +GLG    SL+ Q   +    FSYCL   G  K     + 
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCLAPHGPGKKSALFLG 226

Query: 205 FGGIVAGAGVVS--TPLIIR----------DHYY-LSLEAISVGNQRLEFVSSSTGNIFV 251
               +AGAG  +  TPL+ +          D YY + LE I  G+  +   SS  G I +
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITI 286

Query: 252 DTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
               L T  PL Y  +     +  ++ A       A P     LC+  ++    P++   
Sbjct: 287 LQ--LETFRPLSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFT 344

Query: 311 FRGADVKLSPSN---LFRNISDEIMC----SAFRGGNAN---IVYGRIMQINFLIGYDIE 360
           F+G     +P +   L     +  +C    S+ R  +A+    + G ++Q N    +D+E
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404

Query: 361 QAMVSFKPSRCTN 373
           +  +SF+P+ C++
Sbjct: 405 KETLSFEPADCSS 417


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 163/367 (44%), Gaps = 70/367 (19%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV-------VTSNC 104
           VDTGSD TW QC+PC    C+ Q  PLFDP  S++Y ++ C++S C         V  +C
Sbjct: 180 VDTGSDLTWVQCKPCSV--CYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGSC 237

Query: 105 S----------EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG- 153
           +             C YS  YG G   SFS G LAT+T+     S     +   +FGCG 
Sbjct: 238 ATVGGGGGGGKSERCYYSLAYGDG---SFSRGVLATDTVALGGAS-----VDGFVFGCGL 289

Query: 154 -HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFGGI 208
            ++ L   T+     G++GLG    SL+SQ      G FSYCLP       +  ++ GG 
Sbjct: 290 SNRGLFGGTA-----GLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGDAAGSLSLGGD 344

Query: 209 VAG---AGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSSTGNIFVDTG-VLRTL 259
            +    A  VS   +I D      Y++++   SVG   +        N+ +D+G V+  L
Sbjct: 345 TSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRL 404

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAE-----PGFSDV-LCYNIS--SQPKFPEVTIHF 311
            P  Y +         ++A+  +  GAE     P FS +  CYN++   + K P +T+  
Sbjct: 405 APSVYRA---------VRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRL 455

Query: 312 R-GADVKLSPSN-LFRNISDE----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
             GAD+ +  +  LF    D     +  ++    +   + G   Q N  + YD   + + 
Sbjct: 456 EGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLG 515

Query: 366 FKPSRCT 372
           F    C+
Sbjct: 516 FADEDCS 522


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 174/381 (45%), Gaps = 50/381 (13%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTY 87
           V  +Y   + +GTPP   +  +DTGSD  W  C+P   CP           FDP+ SST 
Sbjct: 37  VAGLYYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTA 96

Query: 88  NSISCSSSQCA----VVTSNC-SEGDCSYSFLYGRGA-----YAS--FSSGNLATETLTF 135
           + +SC  S+C     +  S C ++  C YSF YG G+     Y S  F       + +T 
Sbjct: 97  SPLSCIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTN 156

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTS-DSKQTGIIGLGPGNSSLISQMGTS-IAGK-FS 192
           N+++        + FGC +      T  D    GI G G  + S++SQ+ +  +A K FS
Sbjct: 157 NASA-------KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFS 209

Query: 193 YCL--PDQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSS 244
           +CL   D G   +  G I    G+V TP++  + HY L+L+ I+V  Q+L      F ++
Sbjct: 210 HCLEGADPGGGILVLGEITE-PGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATT 268

Query: 245 STGNIFVDTGVLRTLLPLE-YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
           +T    +D G     L  E Y   + ++++ + ++     +   P F  V     S    
Sbjct: 269 NTRGTIIDCGTTLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFLTVH----SIDEI 324

Query: 304 FPEVTIHFRGADVKLSPSN-LFRNISDE---IMCSAFR--GGNAN-----IVYGRIMQIN 352
           FP VT++F GA + L P + L + +S +   + C  ++  G  A       + G ++  +
Sbjct: 325 FPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKD 384

Query: 353 FLIGYDIEQAMVSFKPSRCTN 373
            +  YD+E   + +    C++
Sbjct: 385 KVFVYDLENQRIGWTSFDCSS 405


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 103/337 (30%), Positives = 153/337 (45%), Gaps = 42/337 (12%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTY 87
           V  +Y   L +GTPP D +  VDTGSD  W  C     CP+    + +   FDP  S T 
Sbjct: 77  VVGLYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTA 136

Query: 88  NSISCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
           + ISCS  +C+       S CS  +  C+Y+F YG G   S +SG   ++ L F+   G 
Sbjct: 137 SPISCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGS 193

Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCL 195
             +P     V+FGC          SD    GI G G    S+ISQ+ +  IA + FS+CL
Sbjct: 194 SLVPNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL 253

Query: 196 P-DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
             + G   I   G +    +V TPL+  + HY ++L +ISV  Q L      F +S+   
Sbjct: 254 KGENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQG 313

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--F 304
             +DTG     L    +      ++N +    +PV   G +       CY I++     F
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIF 366

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
           P V+++F G       +++F N  D ++     GG A
Sbjct: 367 PPVSLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 152/334 (45%), Gaps = 42/334 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +GTPP D +  VDTGSD  W  C     CP+    + +   FDP  S T + I
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139

Query: 91  SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           SCS  +C+       S CS  +  C+Y+F YG G   S +SG   ++ L F+   G   +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
           P     V+FGC          SD    GI G G    S+ISQ+ +  IA + FS+CL  +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
            G   I   G +    +V TPL+  + HY ++L +ISV  Q L      F +S+     +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
           DTG     L    +      ++N +    +PV   G +       CY I++     FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIFPPV 369

Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
           +++F G       +++F N  D ++     GG A
Sbjct: 370 SLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 172/377 (45%), Gaps = 59/377 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP  + G VD   +  WTQC  C    CFKQE P+FDP  S+TY +  C S
Sbjct: 62  YVANFTIGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGS 121

Query: 95  SQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C ++ T NCS +G+C Y    ++G       + G  +T+ +   +  G       + F
Sbjct: 122 PLCKSIPTRNCSGDGECGYEAPSMFGD------TFGIASTDAIAIGNAEG------RLAF 169

Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-----IN 204
           GC   +  S   +    +G +GLG    SL+ Q   +    FSYCL   G  K     + 
Sbjct: 170 GCVVASDGSIDGAMDGPSGFVGLGRTPWSLVGQSNVT---AFSYCLALHGPGKKSALFLG 226

Query: 205 FGGIVAGAGVVS--TPLIIR----------DHYY-LSLEAISVGNQRLEFVSSSTGNIFV 251
               +AGAG  +  TPL+ +          D YY + LE I  G+  +   SS  G I V
Sbjct: 227 ASAKLAGAGKSNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAGDVAVAAASSGGGAITV 286

Query: 252 DTGVLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
               L T  PL Y     +  L+ V++  + +  +    A P     LC+  ++    P+
Sbjct: 287 LQ--LETFRPLSYLPDAAYQALEKVVTAALGSPSM----ANPPEPFDLCFQNAAVSGVPD 340

Query: 307 VTIHFR-GADVKLSPSN--LFRNISDEIMC----SAFRGGNAN---IVYGRIMQINFLIG 356
           +   F+ GA +   PS   L     +  +C    S+ R  +A+    + G ++Q N    
Sbjct: 341 LVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFL 400

Query: 357 YDIEQAMVSFKPSRCTN 373
           +D+E+  +SF+P+ C++
Sbjct: 401 FDLEKETLSFEPADCSS 417


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 166/367 (45%), Gaps = 45/367 (12%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + + +IGTPP      +D   +  WTQC  C    CFKQ+ PLF P  SST+    C + 
Sbjct: 44  VANFTIGTPPQPASAIIDVAGELVWTQCSRCSR--CFKQDLPLFIPNASSTFRPEPCGTD 101

Query: 96  QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            C +  TSNCS   C+Y            + G + TET    + +       ++ FGC  
Sbjct: 102 ACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA------SLAFGC-- 153

Query: 155 KNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFG--GI 208
             +AS       T G IGLG    SL++QM  +   KFSYCL  +G   SS++  G    
Sbjct: 154 -VVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAK 209

Query: 209 VAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLL 260
           +AG    ST   I+        HYY LSL+AI  GN  +   + S G + + T    +LL
Sbjct: 210 LAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIA-TAQSGGILVMHTVSPFSLL 268

Query: 261 PLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHFRG-ADV 316
               +   K  ++  +     + +   P   D LC+  +   S+   P++   F+G A +
Sbjct: 269 VDSAYRAFKKAVTEAVGGAAEQPMATPPQPFD-LCFKKAAGFSRATAPDLVFTFQGAAAL 327

Query: 317 KLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIEQAMVSF 366
            + P+    ++ +E    C+A        R G   + V G + Q +    YD+++  +SF
Sbjct: 328 TVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSF 387

Query: 367 KPSRCTN 373
           +P+ C++
Sbjct: 388 EPADCSS 394


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 41/370 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W  C     CP       +   FD   S T  S+
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158

Query: 91  SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +CS   C+ V     + CSE + C YSF YG G   S +SG   T+T  F++  G  +  
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215

Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
            +   ++FGC        T SD    GI G G G  S++SQ+ +       FS+CL   G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275

Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
           S    F  G +   G+V +PL+  + HY L+L +I V  Q L      F +S+T    VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 335

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
           TG   T L  E +    + +SN +     P+   G +       CY +S+     FP V+
Sbjct: 336 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 388

Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
           ++F  GA + L P + LF   I D   + C  F+       + G ++  + +  YD+ + 
Sbjct: 389 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 448

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 449 RIGWASYDCS 458


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 96/400 (24%), Positives = 166/400 (41%), Gaps = 73/400 (18%)

Query: 21  SIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFD 80
           +++ +A ++     YL+ L IGTP      ++DT SD  W QC+PC  + C++Q  P+F+
Sbjct: 74  AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAIDTASDLVWLQCQPC--VSCYRQLDPIFN 131

Query: 81  PKKSSTYNSISCSSSQCAVVTSN-CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFN 136
           P+ SS+Y  + CSS  C+ +  + C E D   C Y++ Y   A    ++G LA + L   
Sbjct: 132 PRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQACRYNYKYSGNA---VTNGTLAIDKLAVG 188

Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
                      V+ GC   ++  P    + +G++GL  G  SL+SQ+      +F YCLP
Sbjct: 189 GNV-----FHAVVLGCSDSSVGGPP--PQASGLVGLARGPLSLLSQLSVR---RFMYCLP 238

Query: 197 DQGSSKINFGGIVAGAGV---------------VSTPLIIRDHYYLSLEAISVGNQRLEF 241
              S     G +V GAG                +S+      +YYL+ + ++VG+Q    
Sbjct: 239 PPMSR--TPGKLVLGAGAGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGT 296

Query: 242 VSSSTG----------------------NIFVDTGVLRTLLPLEYHSNLKSVMSNMI--- 276
           +   T                        + VD     + L    +  L   +   I   
Sbjct: 297 IRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTISFLEASLYDELADDLEEEIRLP 356

Query: 277 KAQPVKGVGAEPGFSDVLCYNISS-----QPKFPEVTIHFRGADVKLSPSNLFRNISDEI 331
           +A P   +G +      LC+ +       +   P V++ F G  ++L    LF      +
Sbjct: 357 RATPSTRLGLD------LCFILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLE-DGRM 409

Query: 332 MCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           MC      +   + G   Q N  + Y++ +  ++F  + C
Sbjct: 410 MCLMIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 159/371 (42%), Gaps = 52/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
           Y + L +GTP  +     DTGSD TW +C            PP  +F PK S ++  I C
Sbjct: 116 YFVKLRVGTPVQEFTLVADTGSDLTWVKCA--------GASPPGRVFRPKTSRSWAPIPC 167

Query: 93  SSSQCAV----VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SS  C +      +NCS     C+Y + Y  G  ++ + G + TE+ T     G   ++ 
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEG--SAGARGIVGTESATIALPGGKVAQLK 225

Query: 147 NVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
           +V+ GC   H   +  ++D    G++ LG    S  +Q      G FSYCL D  + +  
Sbjct: 226 DVVLGCSSSHDGQSFRSAD----GVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNA 281

Query: 205 FGGIVAGAGVV------STPLIIRDH---YYLSLEAISVGNQRL----EFVSSSTGNIFV 251
            G +  G G V       T L +      Y + ++AI V  + L    E   + +G + +
Sbjct: 282 TGYLAFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVIL 341

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-QPKFPEV--- 307
           D+G   T+L    +  + + +S  +   P       P F    CYN ++ +P  PE+   
Sbjct: 342 DSGNTLTVLAAPAYKAVVAALSKHLDGVPKVSF---PPFEH--CYNWTARRPGAPEIIPK 396

Query: 308 -TIHFRGADVKLSP--SNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
             + F G+  +L P   +   ++   + C   + G      V G IMQ   L  +D++  
Sbjct: 397 LAVQFAGS-ARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNM 455

Query: 363 MVSFKPSRCTN 373
            V FK S CT 
Sbjct: 456 QVRFKQSNCTR 466


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 159/363 (43%), Gaps = 53/363 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W     C+ CP       +  L+D K S+T +++
Sbjct: 77  LYFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAV 136

Query: 91  SCSSSQCAVVTS---NCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            C  + C++       C  G  C YS LYG G   S ++G    + + +N  SG     P
Sbjct: 137 GCDDNFCSLYDGPLPGCKPGLQCLYSVLYGDG---SSTTGYFVQDFVQYNRISGNFQTTP 193

Query: 147 ---NVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
               V+FGCG+K      S S+   GI+G G  NSS++SQ+ +S  +   FS+CL +   
Sbjct: 194 TNGTVVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDN--- 250

Query: 201 SKINFGGIVAGAGVVSTPL--------------IIRDHYYLSLEAISVGNQRLE-----F 241
             ++ GGI A   VV   +              + R HY + ++ I VG   L+     F
Sbjct: 251 --VDGGGIFAIGEVVEPKVRFLLMNSVMIVVLFLSRAHYNVVMKEIEVGGDPLDVPSDAF 308

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISS 300
            S       +D+G      P E +  L   +  ++  QP ++    E  F+   C++ + 
Sbjct: 309 ESGDRKGTIIDSGTTLAYFPQEVYVPL---IEKILSQQPDLRLHTVEQAFT---CFDYTG 362

Query: 301 --QPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGY 357
                FP VT+HF +   + + P      + +   C  ++   A    G+ +    L+G 
Sbjct: 363 NVDDGFPTVTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSGAQTKDGKDLT---LLGE 419

Query: 358 DIE 360
           D +
Sbjct: 420 DAQ 422


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 167/369 (45%), Gaps = 41/369 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W   + C  CP       +   FD   S T  S+
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 158

Query: 91  SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +CS   C+ V     + CSE + C YSF YG G   S +SG   T+T  F++  G  +  
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215

Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
            +   ++FGC        T SD    GI G G G  S++SQ+ +       FS+CL   G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275

Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
           S    F  G +   G+V +PL+  + HY L+L +I V  Q L      F +S+T    VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 335

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
           TG   T L  E +    + +SN +     P+   G +       CY +S+     FP V+
Sbjct: 336 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 388

Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
           ++F  GA + L P + LF   I D   + C  F+       + G ++  + +  YD+ + 
Sbjct: 389 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 448

Query: 363 MVSFKPSRC 371
            + +    C
Sbjct: 449 RIGWASYDC 457


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 167/370 (45%), Gaps = 41/370 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W  C     CP       +   FD   S T  S+
Sbjct: 104 LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSV 163

Query: 91  SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +CS   C+ V     + CSE + C YSF YG G   S +SG   T+T  F++  G  +  
Sbjct: 164 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 220

Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
            +   ++FGC        T SD    GI G G G  S++SQ+ +       FS+CL   G
Sbjct: 221 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 280

Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
           S    F  G +   G+V +PL+  + HY L+L +I V  Q L      F +S+T    VD
Sbjct: 281 SGGGVFVLGEILVPGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVD 340

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
           TG   T L  E +    + +SN +     P+   G +       CY +S+     FP V+
Sbjct: 341 TGTTLTYLVKEAYDLFLNAISNSVSQLVTPIISNGEQ-------CYLVSTSISDMFPSVS 393

Query: 309 IHFR-GADVKLSPSN-LFR-NISD--EIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQA 362
           ++F  GA + L P + LF   I D   + C  F+       + G ++  + +  YD+ + 
Sbjct: 394 LNFAGGASMMLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQ 453

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 454 RIGWASYDCS 463


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 110/374 (29%), Positives = 170/374 (45%), Gaps = 47/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +GTPP D +  +DTGSD  W     C  CP           FDP  S T + I
Sbjct: 51  LYYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLI 110

Query: 91  SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           SCS  +C++      S CS  +  C Y+F YG G   S +SG   ++ L F++  G  V 
Sbjct: 111 SCSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDG---SGTSGYYVSDLLHFDTVLGGSV- 166

Query: 145 MPN----VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP- 196
           M N    ++FGC        T SD    GI G G  + S++SQ+ +  I+ + FS+CL  
Sbjct: 167 MNNSSAPIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKG 226

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            D G   +  G IV    +V TPL+  + HY L++++ISV  Q L      F +SS+   
Sbjct: 227 DDSGGGILVLGEIVE-PNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGT 285

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
            +D+G     L    +    S +++++    +P    G         CY ISS     FP
Sbjct: 286 IIDSGTTLAYLAEAAYDPFISAITSIVSPSVRPYLSKGNH-------CYLISSSINDIFP 338

Query: 306 EVTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
           +V+++F  GA + L P +     S      + C  F+   G    + G ++  + +  YD
Sbjct: 339 QVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYD 398

Query: 359 IEQAMVSFKPSRCT 372
           I    + +    C+
Sbjct: 399 IANQRIGWANYDCS 412


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 164/377 (43%), Gaps = 49/377 (12%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSST 86
           V  +Y   + +GTPPV  +  VDTGSD TW  C PC       Q P +    +DP +SST
Sbjct: 33  VTGLYYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSST 92

Query: 87  YNSISCSSSQC-AVVTSN----CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST-SG 140
             ++SC  S C A + SN     S G C+YS  YG G   S + G    + +TF    + 
Sbjct: 93  DGALSCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDG---SSTQGYFIQDVMTFQEIHNN 149

Query: 141 LPVE-MPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGT--SIAGKFSYCLP 196
             V    +V FGCG     +    S+   G+IG G    S+ SQ+ +   +  +F++CL 
Sbjct: 150 TQVNGTASVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQ 209

Query: 197 --DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL------EFVSSSTGN 248
             +QG   I  G  V+   +  TP++ R+HY + ++ I+V  + +      +  S+S G 
Sbjct: 210 GDNQGGGTIVIGS-VSEPNISYTPIVSRNHYAVGMQNIAVNGRNVTTPASFDTTSTSAGG 268

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI---SSQPKFP 305
           + +D+G     L    ++   + +S    +            S   C  +   S Q  FP
Sbjct: 269 VIMDSGTTLAYLVDPAYTQFVNAVSTFESSMFS---------SHSQCLQLAWCSLQADFP 319

Query: 306 EVTIHF-RGADVKLSPSNLFRNI----SDEIMCSAFRGGNANIVY------GRIMQINFL 354
            V + F  GA + L+P N   +          C  ++       Y      G I+  + L
Sbjct: 320 TVKLFFDAGAVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHL 379

Query: 355 IGYDIEQAMVSFKPSRC 371
           + YD +  +V +K   C
Sbjct: 380 VVYDNDNRVVGWKSFDC 396


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 78/247 (31%), Positives = 120/247 (48%), Gaps = 26/247 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + +  G+P       VDTGS  +W QC+PC  + C  Q  PLFDP  S TY S+SC+S
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 176

Query: 95  SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SQC+ +            S   C Y+  YG    +S+S G L+ + LT   +      +P
Sbjct: 177 SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 229

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INF 205
             ++GCG     S     +  GI+GLG    S++ Q+ +     FSYCLP +G    ++ 
Sbjct: 230 GFVYGCGQD---SDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSI 286

Query: 206 G-GIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTG-NIFVDTGVLRTL 259
           G   +AG+    TP+         Y+L L AI+VG + L   ++       +D+G + T 
Sbjct: 287 GKASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITR 346

Query: 260 LPLEYHS 266
           LP+  ++
Sbjct: 347 LPMSVYT 353


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 148/345 (42%), Gaps = 51/345 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEGD 108
           +D+ SD  W QC PCP   C  Q    +DP +S +    SCSS  C  +    + C+   
Sbjct: 163 LDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANNQ 222

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTF---NSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
           C Y   Y  G   S +SG    + LT    N+ SG         FGC H    S   D++
Sbjct: 223 CQYLVRYPDG---SSTSGAYIADLLTLDAGNAVSGF-------KFGCSHAEQGS--FDAR 270

Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV---AGAGVVSTPLI-- 220
             GI+ LG G  SL+SQ  +     FSYC+P   S    F   V   A +  V TP++  
Sbjct: 271 AAGIMALGGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRF 330

Query: 221 --IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNLKSVMS 273
                 Y + L  I+VG QRL    +  + G++      +  L P  Y    S  +S M+
Sbjct: 331 RQAATFYGVLLRTITVGGQRLGVAPAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMT 390

Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
            M ++ P K      G+ D  CY+ +     + P++++ F R A + L PS +  N    
Sbjct: 391 -MYRSAPPK------GYLDT-CYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFND--- 439

Query: 331 IMCSAFRGGNANI----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             C AF   NA+     V G + Q    + YD+    V F+   C
Sbjct: 440 --CLAFT-SNADDRMPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 4   LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 63

Query: 91  SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
           +CS  +C            TSN     C Y+F YG G   S +SG   ++T+ F +  G 
Sbjct: 64  TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 120

Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
                   +++FGC +      T +D    GI G G    S+ISQ+ +  ++ K FS+CL
Sbjct: 121 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 180

Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
              D G   +  G IV   G+V TPL+  + HY L+LE+I+V  Q+L      F +S+T 
Sbjct: 181 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 239

Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
              VD+G     L       + S + + +S  +++   KG       S     + S    
Sbjct: 240 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 292

Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
           FP VT++F G   + + P N L +  S +   + C  ++   G    + G ++  + +  
Sbjct: 293 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 352

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+    + +    C+
Sbjct: 353 YDLANMRMGWADYDCS 368


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 163/371 (43%), Gaps = 61/371 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP      +D   +  WTQC+ C    CF+Q+ PLFDP  S+TY +  C +
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSR--CFEQDTPLFDPTASNTYRAEPCGT 108

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   NCS   C+Y         A  + G + T+T    +         ++ FG
Sbjct: 109 PLCESIPSDSRNCSGNVCAYQ----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158

Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
           C    + +   D+    +GI+GLG    SL++Q G +    FSYCL   D G +   F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGKNSALFLG 211

Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
               +  G    STP +        + ++Y + LE +  G+  +    S +      T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
           L T  P+ +   L       +K      VGA P  + V    LC+  S +    P++   
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322

Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
           FR GA + ++ SN   +  +  +C A    +A +       + G + Q N    +D+++ 
Sbjct: 323 FRGGAAMTVAASNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381

Query: 363 MVSFKPSRCTN 373
            +SF+P+ CT 
Sbjct: 382 TLSFEPADCTK 392


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 163/376 (43%), Gaps = 56/376 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +G+P   +  ++DT +D TW  C PC    C      LF P  SS+Y S+ CSS
Sbjct: 81  YVVRAGLGSPSQQLLLALDTSADATWAHCSPCGT--CPSSS--LFAPANSSSYASLPCSS 136

Query: 95  SQCAVVTSNCSE-----GD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
           S C +            GD          C++S  +   A ASF +  LA++TL     +
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPF---ADASFQAA-LASDTLRLGKDA 192

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
                +PN  FGC   ++  PT++  + G++GLG G  +L+SQ G+   G FSYCLP   
Sbjct: 193 -----IPNYTFGC-VSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCLPSYR 246

Query: 200 SSKINFGGIVAGAG------VVSTPLIIRDH----YYLSLEAISVGNQRLE-------FV 242
           S   + G +  GAG      V  TP++   H    YY+++  +SVG   ++       F 
Sbjct: 247 SYYFS-GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFD 305

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
           +++     VD+G + T      ++ L+      + A    G  +   F      +  +  
Sbjct: 306 AATGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAG 363

Query: 303 KFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLI 355
             P VT+H  G  D+ L   N L  + +  + C A      N+     V   + Q N  +
Sbjct: 364 GAPAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRV 423

Query: 356 GYDIEQAMVSFKPSRC 371
            +D+  + + F    C
Sbjct: 424 VFDVANSRIGFAKESC 439


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 88  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 147

Query: 91  SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
           +CS  +C            TSN     C Y+F YG G   S +SG   ++T+ F +  G 
Sbjct: 148 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 204

Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
                   +++FGC +      T +D    GI G G    S+ISQ+ +  ++ K FS+CL
Sbjct: 205 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 264

Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
              D G   +  G IV   G+V TPL+  + HY L+LE+I+V  Q+L      F +S+T 
Sbjct: 265 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 323

Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
              VD+G     L       + S + + +S  +++   KG       S     + S    
Sbjct: 324 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 376

Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
           FP VT++F G   + + P N L +  S +   + C  ++   G    + G ++  + +  
Sbjct: 377 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 436

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+    + +    C+
Sbjct: 437 YDLANMRMGWADYDCS 452


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/376 (27%), Positives = 168/376 (44%), Gaps = 48/376 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 90  LYFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRI 149

Query: 91  SCSSSQCAV---------VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
           +CS  +C            TSN     C Y+F YG G   S +SG   ++T+ F +  G 
Sbjct: 150 TCSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDG---SGTSGYYVSDTMFFETVMGN 206

Query: 141 --LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCL 195
                   +++FGC +      T +D    GI G G    S+ISQ+ +  ++ K FS+CL
Sbjct: 207 EQTANSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL 266

Query: 196 P--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
              D G   +  G IV   G+V TPL+  + HY L+LE+I+V  Q+L      F +S+T 
Sbjct: 267 KGSDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQ 325

Query: 248 NIFVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
              VD+G     L       + S + + +S  +++   KG       S     + S    
Sbjct: 326 GTIVDSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKG-------SQCFITSSSVDSS 378

Query: 304 FPEVTIHFRGA-DVKLSPSN-LFRNISDE---IMCSAFR--GGNANIVYGRIMQINFLIG 356
           FP VT++F G   + + P N L +  S +   + C  ++   G    + G ++  + +  
Sbjct: 379 FPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFV 438

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+    + +    C+
Sbjct: 439 YDLANMRMGWADYDCS 454


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 102/339 (30%), Positives = 155/339 (45%), Gaps = 37/339 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +GTPP D +  VDTGSD  W  C     CP+    + +   FDP  S T + I
Sbjct: 80  LYYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPI 139

Query: 91  SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           SCS  +C+       S CS  +  C+Y+F YG G   S +SG   ++ L F+   G   +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
           P     V+FGC          SD    GI G G    S+ISQ+ +  IA + FS+CL  +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCLKGE 256

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
            G   I   G +    +V TPL+  + HY ++L +ISV  Q L      F +S+     +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
           DTG     L    +      ++N +    +PV   G +       CY I++     FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVITTSVGDIFPPV 369

Query: 308 TIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVY 345
           +++F  GA + L+P +     ++      F G   ++V+
Sbjct: 370 SLNFAGGASMFLNPQDYLIQQNNVASALCFLGRYCSVVH 408


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 104/369 (28%), Positives = 161/369 (43%), Gaps = 44/369 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       VDTGS+ TW  C         K    +F   +S ++ ++ C +
Sbjct: 106 YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRAR---GKDNRRVFRADESKSFKTVGCLT 162

Query: 95  SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
             C V   N             CSY + Y  G+ A    G  A ET+T   T+G    +P
Sbjct: 163 QTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRMARLP 219

Query: 147 NVIFGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
             + GC     +S T  S Q   G++GL   + S  S   +    KFSYCL D  S+K  
Sbjct: 220 GHLIGCS----SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNV 275

Query: 205 FGGIVAG-------AGVVSTPLIIRD---HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
              ++ G       A   +TPL +      Y +++  IS+G   L+     + ++S G  
Sbjct: 276 SNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGT 335

Query: 250 FVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFPE 306
            +D+G   TLL    Y   +  +   +++ + VK  GV  E  FS    +N+S   K P+
Sbjct: 336 ILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS---KLPQ 392

Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAM 363
           +T H +G A  +    +   + +  + C  F   G  A  V G IMQ N+L  +D+  + 
Sbjct: 393 LTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMAST 452

Query: 364 VSFKPSRCT 372
           +SF PS CT
Sbjct: 453 LSFAPSACT 461


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 169/383 (44%), Gaps = 59/383 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF-----------------KQEPP 77
           YL  +++GTPPV      DTGSD  W +C      +                    +   
Sbjct: 82  YLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPPEAVV 141

Query: 78  LFDPKKSSTYNSISCSSSQCAVVTSNCS-EGD---CSYSFLYGRGAYASFSSGNLATETL 133
            F+P  SS+Y+ + C    C  + +N S  GD   C + + Y  GA A   +G LA +T 
Sbjct: 142 YFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASA---TGLLAADTF 198

Query: 134 TF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           TF  + +       ++ FGC      +   + +  G++GLG G  SL SQ+G     KFS
Sbjct: 199 TFGGNINNDTTSTASIDFGCAT---GTAGREFQADGMVGLGAGPLSLASQLGR----KFS 251

Query: 193 YCLP----DQGSSKINFG--GIVAGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEF 241
           +CL     D  SS +NFG   +V+  G  +TPLI        +Y +S++++ V  Q +  
Sbjct: 252 FCLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPG 311

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV--LCYNIS 299
            ++S   + VDTG + T   L+  + L  +  ++ +     G+   P   +   LCY++S
Sbjct: 312 -TTSVSKVIVDTGTVLTF--LDRAALLAPLTESLARVMDGAGLPRAPPPDETLELCYDVS 368

Query: 300 SQPK----FPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFRGGNANI----VYGRI 348
                    P+VT+      G +V+L+    F  + + ++C A    +  +    V G +
Sbjct: 369 RVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPELQPLSVLGNV 428

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
              +  +G D++    +F  + C
Sbjct: 429 ALQDLHVGIDLDARTATFATANC 451


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 114/403 (28%), Positives = 182/403 (45%), Gaps = 64/403 (15%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           N  P+SP  + ++  I        + L++GTPP ++   +DTGS+ +W  C    +    
Sbjct: 14  NSFPRSPNKLPFRHNIS-----LTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTT 65

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS-EGDC-SYSFLYGRGAYASFSS--GNL 128
              P  F+  +S +Y  I CSSS C   T + S    C S S  +   +YA  SS  GNL
Sbjct: 66  TSYPTTFNQTRSISYRPIPCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNL 125

Query: 129 ATETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSI 187
           A++T    ++     ++P ++FGC     +S +  DSK TG++G+  G+ S +SQMG   
Sbjct: 126 ASDTFHMGAS-----DIPGMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFP- 179

Query: 188 AGKFSYCLPDQGSSKI------NFGGIV----AGAGVVSTPLIIRDH--YYLSLEAISVG 235
             KFSYC+     S +      NF   V         +STPL   D   Y + LE I V 
Sbjct: 180 --KFSYCISGTDFSGMLLLGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVS 237

Query: 236 NQRLEFVSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
           ++ L    S         G   VD+G   T L    ++ L+S   N         V  +P
Sbjct: 238 DRLLPIPKSVFEPDHTGAGQTMVDSGTQFTFLLGPAYTALRSEFLNQTTG--FLRVLEDP 295

Query: 289 GF----SDVLCYNIS-SQ---PKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCS 334
            F    +  LCY +  SQ   P+ P V++ F GA++ ++   +   +      +D + C 
Sbjct: 296 DFVFQGAMDLCYRVPISQRVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCL 355

Query: 335 AFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +F  GN+++      V G   Q N  + +D+E++ +     RC
Sbjct: 356 SF--GNSDLLGVEAYVIGHHHQQNVWMEFDLERSRIGLAQVRC 396


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 162/374 (43%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+PP D    VDTGSD  W     C  CP+      +  L++PK SST   I
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLI 131

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
           +C    C+        G      C Y  +YG G   S ++G    + +      G     
Sbjct: 132 TCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDG---SATAGYFVNDYIQLQRAVGNHKTS 188

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
               +++FGCG K      S S+   GI+G G  NSS+ISQ+  +  +   F++CL    
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244

Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
              I+ GGI A   VV     +TP++  + HY + L  + VG+  L+     F +S    
Sbjct: 245 -DSISGGGIFAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRG 303

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
             +D+G     LP    S    +M  ++ AQP +K    +  F+    ++ +    FP V
Sbjct: 304 AIIDSGTTLAYLP---ESIYLPLMEKILGAQPDLKLRTVDDQFT-CFVFDKNVDDGFPTV 359

Query: 308 TIHFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
           T  F  + +  + P      I D++ C  +       + GN   + G ++  N L+ Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419

Query: 360 EQAMVSFKPSRCTN 373
           E   + +    C++
Sbjct: 420 ENQTIGWTEYNCSS 433


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 162/374 (43%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IG+PP D    VDTGSD  W     C  CP+      +  L++PK SST   I
Sbjct: 72  LYYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLI 131

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
           +C    C+        G      C Y  +YG G   S ++G    + +      G     
Sbjct: 132 TCDQPFCSATYDAPIPGCKPDLLCQYKVIYGDG---SATAGYFVNDYIQLQRAVGNHKTS 188

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
               +++FGCG K      S S+   GI+G G  NSS+ISQ+  +  +   F++CL    
Sbjct: 189 ETNGSIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCL---- 244

Query: 200 SSKINFGGIVAGAGVV-----STPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
              I+ GGI A   VV     +TP++  + HY + L  + VG+  L+     F +S    
Sbjct: 245 -DSISGGGIFAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRG 303

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP-VKGVGAEPGFSDVLCYNISSQPKFPEV 307
             +D+G     LP    S    +M  ++ AQP +K    +  F+    ++ +    FP V
Sbjct: 304 AIIDSGTTLAYLP---DSIYLPLMEKILGAQPDLKLRTVDDQFT-CFVFDKNVDDGFPTV 359

Query: 308 TIHFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDI 359
           T  F  + +  + P      I D++ C  +       + GN   + G ++  N L+ Y++
Sbjct: 360 TFKFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNL 419

Query: 360 EQAMVSFKPSRCTN 373
           E   + +    C++
Sbjct: 420 ENQTIGWTEYNCSS 433


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 104/369 (28%), Positives = 161/369 (43%), Gaps = 44/369 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       VDTGS+ TW  C         K    +F   +S ++ ++ C +
Sbjct: 84  YFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRAR---GKDNRRVFRADESKSFKTVGCLT 140

Query: 95  SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
             C V   N             CSY + Y  G+ A    G  A ET+T   T+G    +P
Sbjct: 141 QTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRMARLP 197

Query: 147 NVIFGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKIN 204
             + GC     +S T  S Q   G++GL   + S  S   +    KFSYCL D  S+K  
Sbjct: 198 GHLIGCS----SSFTGQSFQGADGVLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSNKNV 253

Query: 205 FGGIVAG-------AGVVSTPLIIRD---HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
              ++ G       A   +TPL +      Y +++  IS+G   L+     + ++S G  
Sbjct: 254 SNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDATSGGGT 313

Query: 250 FVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFPE 306
            +D+G   TLL    Y   +  +   +++ + VK  GV  E  FS    +N+S   K P+
Sbjct: 314 ILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPIEYCFSFTSGFNVS---KLPQ 370

Query: 307 VTIHFRG-ADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLIGYDIEQAM 363
           +T H +G A  +    +   + +  + C  F   G  A  V G IMQ N+L  +D+  + 
Sbjct: 371 LTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMAST 430

Query: 364 VSFKPSRCT 372
           +SF PS CT
Sbjct: 431 LSFAPSACT 439


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 99/352 (28%), Positives = 151/352 (42%), Gaps = 52/352 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCS-EG 107
           +DT SD  W QC PCP   C  Q   L+DP KSS+  +  CSS  C  +    + C+  G
Sbjct: 160 IDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPAG 219

Query: 108 D-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI----FGCGHKNLASPTS 162
           D C Y   Y  G   S S+G   ++ LT N     P +  + I    FGC H  L   + 
Sbjct: 220 DQCQYRVQYPDG---SASAGTYISDVLTLN-----PAKPASAISEFRFGCSHALLQPGSF 271

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVS------ 216
            +K +GI+ LG G  SL +Q   +    FSYCLP    + ++ G  + G   V+      
Sbjct: 272 SNKTSGIMALGRGAQSLPTQTKATYGDVFSYCLP---PTPVHSGFFILGVPRVAASRYAV 328

Query: 217 TPLIIRDH----YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSV 271
           TP++        Y + L AI V  +RL    +       +D+  + T LP   +  L++ 
Sbjct: 329 TPMLRSKAAPMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAA 388

Query: 272 MSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-------KFPEVTIHFRGAD--VKLSPSN 322
               ++A       A P      CY+ S          K P++T+ F G +  V+L PS 
Sbjct: 389 FVAEMRAY----RAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSG 444

Query: 323 LFRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +  +      C AF     +    + G + Q    + Y+++ A V F+   C
Sbjct: 445 VLLD-----GCLAFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 173/388 (44%), Gaps = 70/388 (18%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFK--------QEPPLFDPKKSS 85
           Y +  S+GTPP  +   +DTGS   WT C  P     C           + P++   KSS
Sbjct: 74  YSVIFSLGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSS 133

Query: 86  TYNSISCSSSQCAVVTS---NCSEGD-CSYSFL-YGRGAYASFSSGNLATETLTFNSTSG 140
           T  S+ C S +C  V     NCS    C Y  L YG G+    ++G L ++ L  +  + 
Sbjct: 134 TVQSLPCRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGS----TTGQLVSDVLGLSKLN- 188

Query: 141 LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL----- 195
               +P+ +FGC      S  S+ +  GI G G G +S+ +Q+G +   KFSYCL     
Sbjct: 189 ---RIPDFLFGC------SLVSNRQPEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRF 236

Query: 196 ---PDQGSSKINFGGIVAGA---GVVSTPLI-------IRDHYYLSLEAISVGNQRLE-- 240
              P  G   ++ G   A A   GV   P           ++YY+SL  I VG + +   
Sbjct: 237 DDTPQSGDLVLHRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIP 296

Query: 241 ---FVSSSTGN--IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
               V S  G+  + VD+G   T +  + +    + +  +M K +  K +    G     
Sbjct: 297 PRYLVPSKEGDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGP-- 354

Query: 295 CYNISSQPK--FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF-----RGGNAN---I 343
           CYNI+ Q +   P++T  F+ GA++ L  ++ F  ++D ++C          G+     I
Sbjct: 355 CYNITGQSEVDVPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAI 414

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + G   Q NF I YD+++    FKP +C
Sbjct: 415 ILGNYQQQNFYIEYDLKKQRFGFKPQQC 442


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 108/370 (29%), Positives = 164/370 (44%), Gaps = 50/370 (13%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + + +IGTPP      +D   +  WTQC  C    CFKQ+ PLF P  SST+    C + 
Sbjct: 44  VANFTIGTPPQPASAIIDVAGELVWTQCSRCSR--CFKQDLPLFIPNASSTFRPEPCGTD 101

Query: 96  QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            C +  TSNCS   C+Y            + G + TET    + +       ++ FGC  
Sbjct: 102 ACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATA------SLAFGC-- 153

Query: 155 KNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKINFG--GI 208
             +AS       T G IGLG    SL++QM  +   KFSYCL  +G   SS++  G    
Sbjct: 154 -VVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSSAK 209

Query: 209 VAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTGV--LRT 258
           +AG    ST   I+        HYY LSL+AI  GN  +   ++ +G I V   V     
Sbjct: 210 LAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--ATAQSGGILVMHTVSPFSL 267

Query: 259 LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHFRGAD 315
           L+   Y +  K+V   +  A           F   LC+  +   S+   P++   F+G  
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFD--LCFKKAAGFSRATAPDLVFTFQGGG 325

Query: 316 VKLS--PSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIEQAM 363
             L+  P+    ++ +E    C+A        R G   + V G + Q N    YD+++  
Sbjct: 326 AALTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKET 385

Query: 364 VSFKPSRCTN 373
           +SF+P+ C++
Sbjct: 386 LSFEPADCSS 395


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 162/371 (43%), Gaps = 61/371 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP      +D   +  WTQC+ C    CF+Q+ PLFDP  S+TY +  C +
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCSR--CFEQDTPLFDPTASNTYRAEPCGT 108

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   NCS   C+Y         A  + G + T+T    +         ++ FG
Sbjct: 109 PLCESIPSDSRNCSGNVCAYQ----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158

Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
           C    + +   D+    +GI+GLG    SL++Q G +    FSYCL   D G +   F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGRNSALFLG 211

Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
               +  G    STP +        + ++Y + LE +  G+  +    S +      T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
           L T  P+ +   L       +K      VGA P  + V    LC+  S +    P++   
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322

Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
           FR GA + +  +N   +  +  +C A    +A +       + G + Q N    +D+++ 
Sbjct: 323 FRGGAAMTVPATNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381

Query: 363 MVSFKPSRCTN 373
            +SF+P+ CT 
Sbjct: 382 TLSFEPADCTK 392


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 172/374 (45%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 91  SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
            CS  +C  A+ TS   C   D   C Y+F YG G   S +SG   ++T+ F+S  G   
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDSVMGNEQ 206

Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
                 +++FGC +      T +D    GI G G    S++SQ+ +  ++ K FS+CL  
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            D G   +  G IV   G+V TPL+  + HY L+LE+I V  Q+L      F +S+T   
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325

Query: 250 FVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
            VD+G     L       + + + + +S  +++   KG       +     + S    FP
Sbjct: 326 IVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFP 378

Query: 306 EVTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYD 358
            V+++F G   + + P N L +  S   + + C  ++   G    + G ++  + +  YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 359 IEQAMVSFKPSRCT 372
           +    + +    C+
Sbjct: 439 LANMRMGWTDYDCS 452


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 166/376 (44%), Gaps = 66/376 (17%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           + + +IGTPP      +D   +  WTQC  C    CFKQ+ PLF P  SST+    C + 
Sbjct: 68  VANFTIGTPPQPASAIIDVAGELVWTQCSMCSR--CFKQDLPLFVPNASSTFRPEPCGTD 125

Query: 96  QC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC-- 152
            C ++ TSNCS   C+Y            + G +AT+T    + +       ++ FGC  
Sbjct: 126 ACKSIPTSNCSSNMCTYEGTI-NSKLGGHTLGIVATDTFAIGTATA------SLGFGCVV 178

Query: 153 --GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS---SKINFGG 207
             G   +  P      +G+IGLG   SSL+SQM  +   KFSYCL    S   S++  G 
Sbjct: 179 ASGIDTMGGP------SGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGS 229

Query: 208 ---IVAGAGVVSTPLI-------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
              +  G    +TP +       +  +Y + L+ I  G+  +    S  GN    T +++
Sbjct: 230 SAKLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPS--GN----TVLVQ 283

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS--SQPKFPEVTIHF 311
           TL P+ +   L       +K +  K VGA P  + +    LC+  +  S    P++   F
Sbjct: 284 TLAPMSF---LVDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTF 340

Query: 312 R--GADVKLSPSNLFRNISDE--IMCSAFRGG----------NANIVYGRIMQINFLIGY 357
           +   A + + P     ++ +E   +C A              N NI+ G + Q N     
Sbjct: 341 QQGAAALTVPPPKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNIL-GSLQQENTHFLL 399

Query: 358 DIEQAMVSFKPSRCTN 373
           D+E+  +SF+P+ C++
Sbjct: 400 DLEKKTLSFEPADCSS 415


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 113/418 (27%), Positives = 167/418 (39%), Gaps = 98/418 (23%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEP---------- 76
           V D YL+ L+IGTPP  I   +DTGSD TW  C      C E D ++             
Sbjct: 78  VRDGYLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYS 137

Query: 77  ----------PLFDPKKSSTYNSISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASF 123
                     P      SS     +C+ + C+   +V + CS    S+++ YG G     
Sbjct: 138 SSSYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVV-- 195

Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
            +G L  +TL  N +S G+  E+P   FGC       P       GI G G G  S++SQ
Sbjct: 196 -TGILTRDTLRVNGSSPGVAKEIPKFCFGCVGSAYREP------IGIAGFGRGTLSMVSQ 248

Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
           +G    G FS+C               A    +S+PL++ D                   
Sbjct: 249 LGFLQKG-FSHCF---------LAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPM 298

Query: 224 ---HYYLSLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
               YY+ LEAI+VGN           EF S   G + +D+G   T LP  ++S + S++
Sbjct: 299 YPNFYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSIL 358

Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHF-RGADVKLSPSNL 323
            + I      G+  + GF   LCY +        +S    P +T HF     + L   N 
Sbjct: 359 QSTINYPRDTGMEMQTGFD--LCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNH 416

Query: 324 FRNISDE-----IMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F  +S       + C  F+  +        V+G   Q N  + YD+E+  + F+P  C
Sbjct: 417 FYPVSAPGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 65/380 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
           L++GTPP ++   +DTGS+ +W  C P        +    F P+ S T+ S+ C S+QC 
Sbjct: 70  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQCR 129

Query: 98  -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +    + +   C  S  Y  G   S S G LATE  T     G P+      FGC
Sbjct: 130 SRDLPSPPACDGASKQCRVSLSYADG---SSSDGALATEVFTVG--QGPPLR---AAFGC 181

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
                 +        G++G+  G  S +SQ  T    +FSYC+ D+  + +         
Sbjct: 182 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 238

Query: 204 ----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFVD 252
               N+  +   A  +  P   R  Y + L  I VG + L   +S         G   VD
Sbjct: 239 FLPLNYTPLYQPA--MPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 296

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI----SSQPK 303
           +G   T L  + +S LK+  S   K  P      +P F+     D  C+ +    +   +
Sbjct: 297 SGTQFTFLLGDAYSALKAEFSRQTK--PWLPALNDPNFAFQEAFDT-CFRVPQGRAPPAR 353

Query: 304 FPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQI 351
            P VT+ F GA + ++   L       R   D + C  F  GNA++      V G   Q+
Sbjct: 354 LPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPITAYVIGHHHQM 411

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  + YD+E+  V   P RC
Sbjct: 412 NVWVEYDLERGRVGLAPIRC 431


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 149/372 (40%), Gaps = 51/372 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+    IG PP      +DTGSD  WTQC  C    C +Q  P ++   SST+  + C++
Sbjct: 90  YVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAA 149

Query: 95  SQCAV---VTSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             CA    +   C     CS    YG G  A    G L TE   F S +        + F
Sbjct: 150 RICAANDDIIHFCDLAAGCSVIAGYGAGVVA----GTLGTEAFAFQSGTA------ELAF 199

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP----DQGSSKINFG 206
           GC         +    +G+IGLG G  SL+SQ G   A KFSYCL     + G++   F 
Sbjct: 200 GCVTFTRIVQGALHGASGLIGLGRGRLSLVSQTG---ATKFSYCLTPYFHNNGATGHLFV 256

Query: 207 GIVA---GAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-----------TG 247
           G  A   G G V T   ++       YYL L  ++VG  RL   ++            +G
Sbjct: 257 GASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSG 316

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VLCYNISSQPK-F 304
            + +D+G   T L  + +  L S ++  +    V    A P  +D   LC       +  
Sbjct: 317 GVIIDSGSPFTSLVHDAYDALASELAARLNGSLV----APPPDADDGALCVARRDVGRVV 372

Query: 305 PEVTIHFRGADVKLSPSNLFRNISDEIMCSAFRGG----NANIVYGRIMQINFLIGYDIE 360
           P V  HFRG      P+  +    D+                 V G   Q N  + YD+ 
Sbjct: 373 PAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLA 432

Query: 361 QAMVSFKPSRCT 372
               SF+P+ C+
Sbjct: 433 NGDFSFQPADCS 444


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/380 (26%), Positives = 155/380 (40%), Gaps = 65/380 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
           L++GTPP ++   +DTGS+ +W  C P        +    F P+ S T+ S+ C S+QC 
Sbjct: 69  LAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQCR 128

Query: 98  -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +    + +   C  S  Y  G   S S G LATE  T     G P+      FGC
Sbjct: 129 SRDLPSPPACDGASKQCRVSLSYADG---SSSDGALATEVFTVG--QGPPLR---AAFGC 180

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKI--------- 203
                 +        G++G+  G  S +SQ  T    +FSYC+ D+  + +         
Sbjct: 181 MATAFDTSPDGVATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHSDLP 237

Query: 204 ----NFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIFVD 252
               N+  +   A  +  P   R  Y + L  I VG + L   +S         G   VD
Sbjct: 238 FLPLNYTPLYQPA--MPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMVD 295

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI----SSQPK 303
           +G   T L  + +S LK+  S   K  P      +P F+     D  C+ +    +   +
Sbjct: 296 SGTQFTFLLGDAYSALKAEFSRQTK--PWLPALNDPNFAFQEAFDT-CFRVPQGRAPPAR 352

Query: 304 FPEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQI 351
            P VT+ F GA + ++   L       R   D + C  F  GNA++      V G   Q+
Sbjct: 353 LPAVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF--GNADMVPITAYVIGHHHQM 410

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  + YD+E+  V   P RC
Sbjct: 411 NVWVEYDLERGRVGLAPIRC 430


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 166/384 (43%), Gaps = 79/384 (20%)

Query: 44  PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN 103
           PP +I   +DTGS+ +W +C      +        FDP +SS+Y+ I CSS  C   T +
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 104 C-------SEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC-GH 154
                   S+  C  +  Y   A AS S GNLA E   F NST+       N+IFGC G 
Sbjct: 138 FLIPASCDSDKLCHATLSY---ADASSSEGNLAAEIFHFGNSTND-----SNLIFGCMGS 189

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------- 200
            + + P  D+K TG++G+  G+ S ISQMG     KFSYC+                   
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYCISGTDDFPGFLLLGDSNFTW 246

Query: 201 -SKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
            + +N+  ++     +STPL   D   Y + L  I V  + L    S         G   
Sbjct: 247 LTPLNYTPLIR----ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTM 302

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS------ 300
           VD+G   T L    ++ L+S   N  +   +  V  +P F    +  LCY IS       
Sbjct: 303 VDSGTQFTFLLGPVYTALRSHFLN--RTNGILTVYEDPDFVFQGTMDLCYRISPVRIRSG 360

Query: 301 -QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGR 347
              + P V++ F GA++ +S   L   +      +D + C  F  GN+++      V G 
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTF--GNSDLMGMEAYVIGH 418

Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
             Q N  I +D++++ +   P  C
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVEC 442


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 175/386 (45%), Gaps = 65/386 (16%)

Query: 29  ISVDD------IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----P 77
           IS DD      +Y   + +GTPP   +  VDTGSD  W  C PC   +C +         
Sbjct: 36  ISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDTGSDVAWVNCVPC--TNCKRASNVALPIS 93

Query: 78  LFDPKKSSTYNSISCSSSQCAVVT-SNCS--EGDCSYSFLYGRGAYASFSSGNLATETLT 134
           +FDP+KS++  SISC+  +C + + S CS     C YS LYG G   S ++G L  + L+
Sbjct: 94  IFDPEKSTSKTSISCTDEECYLASNSKCSFNSMSCPYSTLYGDG---SSTAGYLINDVLS 150

Query: 135 FN--------STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
           FN        +TSG       + FGCG     +  +D    G++G G    SL SQ+   
Sbjct: 151 FNQVPSGNSTATSG----TARLTFGCGSNQTGTWLTD----GLVGFGQAEVSLPSQLSKQ 202

Query: 187 --IAGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRL-- 239
                 F++CL   ++GS  +  G I    G+V TP++ +  HY + L  I V    +  
Sbjct: 203 NVSVNIFAHCLQGDNKGSGTLVIGHIRE-PGLVYTPIVPKQSHYNVELLNIGVSGTNVTT 261

Query: 240 --EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN 297
              F  S++G + +D+G   T L    +   ++ + + +++      G  P      C  
Sbjct: 262 PTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCMRS------GVLPVAFQFFC-- 313

Query: 298 ISSQPKFPEVTIHFR-GADVKLSPSN-LFRN-ISDEIMCSAFRGGNANIVYGRIMQINF- 353
            + +  FP VT++F  GA + LSPS+ L++  ++  +    F    +  VYG +    F 
Sbjct: 314 -TIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFSWLESTSVYGYLSYTIFG 372

Query: 354 -------LIGYDIEQAMVSFKPSRCT 372
                  L+ YD     + +K   CT
Sbjct: 373 DNVLKDQLVVYDNVNNRIGWKNFDCT 398


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 165/365 (45%), Gaps = 48/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IG+PP +    VDTGS  T+  C  C  + C   + P F P+ SSTY  + C++
Sbjct: 89  YTTRLWIGSPPQEFALIVDTGSTVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNA 146

Query: 95  SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                   NC E    C+Y   Y   A  S SSG LA + ++F   S L  +    +FGC
Sbjct: 147 D------CNCDENGVQCTYERRY---AEMSTSSGVLAEDVMSFGKESELVPQ--RAVFGC 195

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
                +      +  GI+GLG G  S++ Q+     ++  FS C    G   +  G +V 
Sbjct: 196 ETME-SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY---GGMDVGGGAMVL 251

Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTL 259
           G G+ S P ++  H        Y + L+ I V  + L+    +    +   +D+G     
Sbjct: 252 G-GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAY 310

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
            P + +   K  +  M K   +K + G +P F D+ C+     +++  PK FPEV + F 
Sbjct: 311 FPEKAYYAFKDAI--MKKISFLKQISGPDPNFKDI-CFSGAGRDVTELPKVFPEVDMVFA 367

Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFK 367
            G  + LSP N LFR+  +S       F+ GN    + G I+  N L+ Y+ E + + F 
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFW 427

Query: 368 PSRCT 372
            + C+
Sbjct: 428 KTNCS 432


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 122/431 (28%), Positives = 181/431 (41%), Gaps = 88/431 (20%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL--DC 71
           + P S + ++ +  +  V D YL+ L+IGTPP  +   +DTGSD TW    PC  L  DC
Sbjct: 63  KKPLSSVDVVMEP-LREVRDGYLITLNIGTPPQAVQVYLDTGSDLTWV---PCGNLSFDC 118

Query: 72  FK---------QEPPLFDPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYG 116
            +         + P +F P  SST    SC+SS C  + S+      C+   CS S L  
Sbjct: 119 IECYDLKNNDLKSPSVFSPLHSSTSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLK 178

Query: 117 RG--------AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
                     AY ++  G L +  LT +       ++P   FGC       P       G
Sbjct: 179 STCVRPCPSFAY-TYGEGGLISGILTRDILKARTRDVPRFSFGCVTSTYREP------IG 231

Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS---------T 217
           I G G G  SL SQ+G    G FS+C LP +  +  N    ++ GA  +S         T
Sbjct: 232 IAGFGRGLLSLPSQLGFLEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFT 290

Query: 218 PLI----IRDHYYLSLEAISVGNQ---------RLEFVSSSTGNIFVDTGVLRTLLPLEY 264
           P++      + YY+ LE+I++G             +F S   G + VD+G   T LP  +
Sbjct: 291 PMLNTPMYPNSYYIGLESITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPF 350

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------NISSQPK-----FPEVTIHF- 311
           +S L + + + I         +  GF   LCY       N++S        FP +T HF 
Sbjct: 351 YSQLLTTLQSTITYPRATETESRTGFD--LCYKVPCPNNNLTSLENDVMMIFPSITFHFL 408

Query: 312 RGADVKLSPSNLFRNISDE-----IMCSAFRG------GNANIVYGRIMQINFLIGYDIE 360
             A + L   N F  +S       + C  F+       G A  V+G   Q N  + YD+E
Sbjct: 409 NNATLLLPQGNSFYAMSAPSDGSVVQCLLFQNMEDGDYGPAG-VFGSFQQQNVKVVYDLE 467

Query: 361 QAMVSFKPSRC 371
           +  + F+   C
Sbjct: 468 KERIGFQAMDC 478


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/364 (30%), Positives = 164/364 (45%), Gaps = 57/364 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ +SIG+P V     +DTGSD +W +C           +  L+DP  SSTY   SCS+
Sbjct: 131 YVITVSIGSPAVAXTMFIDTGSDVSWLRC-----------KSRLYDPGTSSTYAPFSCSA 179

Query: 95  SQCAVV---TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             CA +    + CS G  C YS  YG G   S ++G   ++TLT   TS  P+ +    F
Sbjct: 180 PACAQLGRRGTGCSSGSTCVYSVKYGDG---SNTTGTYGSDTLTLAGTS-EPL-ISGFQF 234

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFG 206
           GC    +     +    G++GLG    S +SQ   +    FSYCLP   +S     +   
Sbjct: 235 GC--SAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCLPPTWNSSGFLTLGAP 292

Query: 207 GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLL 260
                A   +TP++        Y L L  ISVG + LE  SS  S G+I VD+G + T L
Sbjct: 293 SSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFSAGSI-VDSGTVITRL 351

Query: 261 PLEYHSNLKSVMSNMI---KAQPVKGVGAEPGFSDVLCYNISSQPK-----FPEVTIHFR 312
           P   +  L +   + +   + QP     A  G  D  C++ +   +      P V +   
Sbjct: 352 PPTAYGALSAAFRDGMARYQYQPA----APRGLLDT-CFDFTGHGEGNNFTVPSVALVLD 406

Query: 313 -GADVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRIMQINFLIGYDIEQAMVSFK 367
            GA V L P+ + ++      C AF      G   I+ G + Q  F + YD+ Q++  F+
Sbjct: 407 GGAVVDLHPNGIVQD-----GCLAFAATDDDGRTGII-GNVQQRTFEVLYDVGQSVFGFR 460

Query: 368 PSRC 371
           P  C
Sbjct: 461 PGAC 464


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 165/365 (45%), Gaps = 48/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IG+PP +    VDTGS  T+  C  C  + C   + P F P+ SSTY  + C++
Sbjct: 89  YTTRLWIGSPPQEFALIVDTGSTVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNA 146

Query: 95  SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                   NC E    C+Y   Y   A  S SSG LA + ++F   S L  +    +FGC
Sbjct: 147 D------CNCDENGVQCTYERRY---AEMSTSSGVLAEDVMSFGKESELVPQ--RAVFGC 195

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
                +      +  GI+GLG G  S++ Q+     ++  FS C    G   +  G +V 
Sbjct: 196 ETME-SGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCY---GGMDVGGGAMVL 251

Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTL 259
           G G+ S P ++  H        Y + L+ I V  + L+    +    +   +D+G     
Sbjct: 252 G-GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFDGKYGAILDSGTTYAY 310

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
            P + +   K  +  M K   +K + G +P F D+ C+     +++  PK FPEV + F 
Sbjct: 311 FPEKAYYAFKDAI--MKKISFLKQISGPDPNFKDI-CFSGAGRDVTELPKVFPEVDMVFA 367

Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFK 367
            G  + LSP N LFR+  +S       F+ GN    + G I+  N L+ Y+ E + + F 
Sbjct: 368 NGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIGFW 427

Query: 368 PSRCT 372
            + C+
Sbjct: 428 KTNCS 432


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/384 (28%), Positives = 166/384 (43%), Gaps = 79/384 (20%)

Query: 44  PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN 103
           PP +I   +DTGS+ +W +C      +        FDP +SS+Y+ I CSS  C   T +
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNN----FDPTRSSSYSPIPCSSPTCRTRTRD 137

Query: 104 C-------SEGDCSYSFLYGRGAYASFSSGNLATETLTF-NSTSGLPVEMPNVIFGC-GH 154
                   S+  C  +  Y   A AS S GNLA E   F NST+       N+IFGC G 
Sbjct: 138 FLIPASCDSDKLCHATLSY---ADASSSEGNLAAEIFHFGNSTND-----SNLIFGCMGS 189

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS-------------- 200
            + + P  D+K TG++G+  G+ S ISQMG     KFSYC+                   
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYCISGTDDFPGFLLLGDSNFTW 246

Query: 201 -SKINFGGIVAGAGVVSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
            + +N+  ++     +STPL   D   Y + L  I V  + L    S         G   
Sbjct: 247 LTPLNYTPLIR----ISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTM 302

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISS------ 300
           VD+G   T L    ++ L+S   N      +  V  +P F    +  LCY IS       
Sbjct: 303 VDSGTQFTFLLGPVYTALRSDFLNQTNG--ILTVYEDPEFVFQGTMDLCYRISPFRIRTG 360

Query: 301 -QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGR 347
              + P V++ F GA++ +S   L   +      +D + C  F  GN+++      V G 
Sbjct: 361 ILHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTF--GNSDLMGMEAYVIGH 418

Query: 348 IMQINFLIGYDIEQAMVSFKPSRC 371
             Q N  I +D++++ +   P +C
Sbjct: 419 HHQQNMWIEFDLQRSRIGLAPVQC 442


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 104/383 (27%), Positives = 175/383 (45%), Gaps = 79/383 (20%)

Query: 39   LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
            L++G+PP  +   +DTGS+ +W  C+  P L        +F+P  SS+Y+ I CSS  C 
Sbjct: 1004 LTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTS------VFNPLSSSSYSPIPCSSPICR 1057

Query: 99   VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
              T +      C      ++ +    +YA  SS  GNLA++     S++     +P  +F
Sbjct: 1058 TRTRDLPNPVTCDPKKLCHAIV----SYADASSLEGNLASDNFRIGSSA-----LPGTLF 1108

Query: 151  GCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
            GC     +S +  D+K TG++G+  G+ S ++Q+G     KFSYC+  + SS +   G +
Sbjct: 1109 GCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLLFGDL 1165

Query: 210  AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLEFVSS-------STGNIF 250
              + +          +STPL   D   Y + L+ I VGN+ L    S         G   
Sbjct: 1166 HLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTM 1225

Query: 251  VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGA---EPGF----SDVLCYNISS--- 300
            VD+G   T L    ++ L++        +  KGV A   +P F    +  LCY++++   
Sbjct: 1226 VDSGTQFTFLLGPVYTALRNEF-----LEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGK 1280

Query: 301  QPKFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRI 348
             P  P V++ FRGA++ +    L   +      ++ + C  F  GN+++      V G  
Sbjct: 1281 LPTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTF--GNSDLLGIEAFVIGHH 1338

Query: 349  MQINFLIGYDIEQAMVSFKPSRC 371
             Q N  + +D    +V+F    C
Sbjct: 1339 HQQNVWMEFD----LVAFAADLC 1357


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 87/255 (34%), Positives = 118/255 (46%), Gaps = 37/255 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W     C+ CP       E  L+DPK SST + +
Sbjct: 32  LYYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKV 91

Query: 91  SCSSSQCAVVTSNCSEG-----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           SC    CA        G      C YS  YG G   S ++G   ++ L F+  SG     
Sbjct: 92  SCDQGFCAATYGGLLPGCTTSLPCEYSVTYGDG---SSTTGYFVSDLLQFDQVSGDGQTR 148

Query: 146 P---NVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGK----FSYCLPD 197
           P    V FGCG +      +S+    GIIG G  N+S++SQ+  S AGK    F++CL  
Sbjct: 149 PANSTVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCL-- 204

Query: 198 QGSSKINFGGIVAGAGVV-----STPLIIR-DHYYLSLEAISVGNQRLE-----FVSSST 246
                IN GGI A   VV     +TPL+    HY ++L++I VG   L+     F +   
Sbjct: 205 ---DTINGGGIFAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEK 261

Query: 247 GNIFVDTGVLRTLLP 261
               +D+G   T LP
Sbjct: 262 KGTIIDSGTTLTYLP 276


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 116/401 (28%), Positives = 177/401 (44%), Gaps = 53/401 (13%)

Query: 4   SQKLPFYNDNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDC 58
           S + PF ++    +   S +  A +   DD+     Y   L IGTPP +    VDTGS  
Sbjct: 52  SHRKPFTSNYHRRQLHNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTV 111

Query: 59  TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--GDCSYSFLYG 116
           T+  C  C +  C K + P F P+ SSTY  + C+ S       NC +    C+Y   Y 
Sbjct: 112 TYVPCSTCEQ--CGKHQDPRFQPESSSTYKPMQCNPS------CNCDDEGKQCTYERRY- 162

Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
             A  S SSG LA + L+F + S L  +    IFGC         S  +  GI+GLG G 
Sbjct: 163 --AEMSSSSGLLAEDVLSFGNESELTPQ--RAIFGCETVETGELFS-QRADGIMGLGRGP 217

Query: 177 SSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH--------YY 226
            S++ Q+     +   FS C    G   +  G +V G  +   P ++  H        Y 
Sbjct: 218 LSVVDQLVIKEVVGNSFSLCY---GGMDVVGGAMVLG-NIPPPPDMVFAHSDPYRSAYYN 273

Query: 227 LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           + L+ + V  +RL+    V        +D+G     LP E     K  +   IK   +K 
Sbjct: 274 IELKELHVAGKRLKLNPRVFDGKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKF--LKQ 331

Query: 284 V-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIM 332
           + G +P ++D+ C+     ++S   K FPEV + F  G  + LSP N LFR+  +S    
Sbjct: 332 IHGPDPSYNDI-CFSGAGRDVSQLSKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYC 390

Query: 333 CSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              F+ G +   + G I+  N L+ YD +   + F  + C+
Sbjct: 391 LGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIGFWKTNCS 431


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/314 (30%), Positives = 146/314 (46%), Gaps = 33/314 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPPV+    +DTGSD  W  C     CP+    + +   FDP  SST + I
Sbjct: 24  LYYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMI 83

Query: 91  SCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           +CS  +C      +  T +     CSY+F YG G   S +SG   ++ +  N+     V 
Sbjct: 84  ACSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDG---SGTSGYYVSDMMHLNTIFEGSVT 140

Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
             +   V+FGC ++     T SD    GI G G    S+ISQ+ +  IA + FS+CL  D
Sbjct: 141 TNSTAPVVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGD 200

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
                I   G +    +V T L+  + HY L+L++I+V  Q L+     F +S++    V
Sbjct: 201 SSGGGILVLGEIVEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIV 260

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTI 309
           D+G     L  E +    S ++  I       V          CY I+S     FP+V++
Sbjct: 261 DSGTTLAYLAEEAYDPFVSAITASIPQSVHTAVSRGN-----QCYLITSSVTEVFPQVSL 315

Query: 310 HFR-GADVKLSPSN 322
           +F  GA + L P +
Sbjct: 316 NFAGGASMILRPQD 329


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/374 (27%), Positives = 160/374 (42%), Gaps = 83/374 (22%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
           ++   D  +L+ ++ GTPP +    +DTGS  TWTQC+ C                    
Sbjct: 120 KLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKAC-------------------- 159

Query: 87  YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
                        V +N       Y+  YG     S S GN   +T+T   +        
Sbjct: 160 ------------TVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD----VFQ 193

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG------- 199
              FG G  N       S   G++GLG G  S +SQ  +     FSYCLP++        
Sbjct: 194 KFQFGRGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLF 251

Query: 200 -------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---STGNI 249
                  SS + F  +V G G +        +Y+++L  ISVGN+RL   SS   S G I
Sbjct: 252 GEKATSQSSSLKFTSLVNGPGTLQE----SGYYFVNLSDISVGNERLNIPSSVFASPGTI 307

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPK--FP 305
            +D+  + T LP   +S LK+     +   P+     + G  D+L  CYN+S +     P
Sbjct: 308 -IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNLSGRKDVLLP 364

Query: 306 EVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGNAN------IVYGRIMQINFLIGYD 358
           E+ +HF  GADV+L+ +N+     +  +C AF G + +       + G   Q++  + YD
Sbjct: 365 EIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424

Query: 359 IEQAMVSFKPSRCT 372
           I+   + F+ + C+
Sbjct: 425 IQGGRIGFRSNGCS 438


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 103/374 (27%), Positives = 172/374 (45%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 91  SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
            CS  +C  A+ TS   C   D   C Y+F YG G   S +SG   ++T+ F++  G   
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQ 206

Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
                 +++FGC +      T +D    GI G G    S++SQ+ +  ++ K FS+CL  
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            D G   +  G IV   G+V TPL+  + HY L+LE+I V  Q+L      F +S+T   
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325

Query: 250 FVDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
            VD+G     L       + + + + +S  +++   KG       +     + S    FP
Sbjct: 326 IVDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFP 378

Query: 306 EVTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYD 358
            V+++F G   + + P N L +  S   + + C  ++   G    + G ++  + +  YD
Sbjct: 379 TVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYD 438

Query: 359 IEQAMVSFKPSRCT 372
           +    + +    C+
Sbjct: 439 LANMRMGWTDYDCS 452


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 158/379 (41%), Gaps = 46/379 (12%)

Query: 17  KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
           K P   I     I     Y++  +IGTP   +  ++DT +D  W  C  C  + C     
Sbjct: 73  KKPSVPIASGRAIVQSPTYIVRANIGTPAQPMLVALDTSNDAAWVPCSGC--VGCASSV- 129

Query: 77  PLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGD-CSYSFLYGRGAYASFSSGNLATETLT 134
            LFDP KSS+  ++ C + QC    +  C+ G  C ++  YG     S    +L  +TLT
Sbjct: 130 -LFDPSKSSSSRNLQCDAPQCKQAPNPTCTAGKSCGFNMTYG----GSTIEASLTQDTLT 184

Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
             +       + +  FGC  K  A+ TS   Q G++GLG G  SLISQ        FSYC
Sbjct: 185 LAND-----VIKSYTFGCISK--ATGTSLPAQ-GLMGLGRGPLSLISQTQNLYMSTFSYC 236

Query: 195 LPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RL 239
           LP+  SS     +  G       + +TPL+        YY++L  I VGN+        L
Sbjct: 237 LPNSKSSNFSGSLRLGPKYQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSAL 296

Query: 240 EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
            F +S+      D+G + T L    +  +++     IK      +G   GF    CY  S
Sbjct: 297 AFDASTGAGTIFDSGTVFTRLVEPAYVAVRNEFRRRIKNANATSLG---GFDT--CY--S 349

Query: 300 SQPKFPEVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI-----VYGRIMQINF 353
               +P VT  F G +V L P NL   + S    C A      N+     V   + Q N 
Sbjct: 350 GSVVYPSVTFMFAGMNVTLPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNH 409

Query: 354 LIGYDIEQAMVSFKPSRCT 372
            +  D+  + +      CT
Sbjct: 410 RVLIDLPNSRLGISRETCT 428


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 161/363 (44%), Gaps = 42/363 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VD+GS  T+  C  C +  C   + P F P  SSTY+ + CS 
Sbjct: 85  YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCS- 141

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
              A  T +  +  C+Y   Y   A  S SSG L  + ++F + S L  +    +FGC +
Sbjct: 142 ---ADCTCDSDKSQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 193

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGA 212
                  S     GI+GLG G  S++ Q+     I   FS C    G   I  G +V GA
Sbjct: 194 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCY---GGMDIGGGAMVLGA 249

Query: 213 GVVSTPLI------IRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPL 262
                 ++      +R  YY + L+ I V  + L     +  S     +D+G     LP 
Sbjct: 250 MPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDSGTTYAYLPE 309

Query: 263 EYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGA 314
           +     K  +++  K +P+K + G +P + D+ C+     N+S   + FP+V + F  G 
Sbjct: 310 QAFVAFKDAVTS--KVRPLKKIRGPDPNYKDI-CFAGAGRNVSQLSQAFPDVDMVFGDGQ 366

Query: 315 DVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
            + LSP N LFR+  +        F+ G +   + G I+  N L+ YD     + F  + 
Sbjct: 367 KLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTN 426

Query: 371 CTN 373
           C+ 
Sbjct: 427 CSE 429


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 156/371 (42%), Gaps = 42/371 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP+      +  L++  +S T   +
Sbjct: 77  LYYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLV 136

Query: 91  SCSSSQCAVV--------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-L 141
            C    C  +        T+N S   C Y  +YG G   S ++G    + + +   SG L
Sbjct: 137 PCDQEFCYEINGGQLPGCTANMS---CPYLEIYGDG---SSTAGYFVKDVVQYARVSGDL 190

Query: 142 PVEMPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
                N  VIFGCG +      S +++   GI+G G  NSS+ISQ+  +  +   F++CL
Sbjct: 191 KTTAANGSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL 250

Query: 196 PDQGSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNI 249
                  I   G V    V  TPLI    HY +++ A+ VG++ L      F +      
Sbjct: 251 DGTNGGGIFVIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGA 310

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTI 309
            +D+G     LP   +   K ++S +I  QP   V           Y+ S    FP VT 
Sbjct: 311 IIDSGTTLAYLPEMVY---KPLVSKIISQQPDLKVHTVRDEYTCFQYSDSLDDGFPNVTF 367

Query: 310 HFRGADV-KLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
           HF  + + K+ P        + + C  +       R      + G ++  N L+ YD+E 
Sbjct: 368 HFENSVILKVYPHEYLFPF-EGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLEN 426

Query: 362 AMVSFKPSRCT 372
             + +    C+
Sbjct: 427 QAIGWTEYNCS 437


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 60/141 (42%), Positives = 77/141 (54%), Gaps = 15/141 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L +GTPP  ++  +DTGSD  W QC PC +  C+ Q  P+FDPKKS +++SISC S
Sbjct: 174 YFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRK--CYSQTDPVFDPKKSGSFSSISCRS 231

Query: 95  SQCAVVTS-NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C  + S  C S   C Y   YG G   SF+ G  +TETLTF  T      +P V  GC
Sbjct: 232 PLCLRLDSPGCNSRQSCLYQVAYGDG---SFTFGEFSTETLTFRGT-----RVPKVALGC 283

Query: 153 GHKNLASPTSDSKQTGIIGLG 173
           GH N       +   G++GLG
Sbjct: 284 GHDNEGLFVGAA---GLLGLG 301


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 96/347 (27%), Positives = 147/347 (42%), Gaps = 49/347 (14%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC-SEG 107
           +D+GSD  W QC+PCP L C  Q  PLFDP  S+TY ++ CSS+ CA +      C +  
Sbjct: 85  IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLANS 144

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C +   Y  GA A   +G  +++ LT       P + +   +FGC H +  S T     
Sbjct: 145 QCQFGITYANGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADQGS-TFSYDV 195

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
            G + LG G+ S + Q  +  +  FSYC+P   SS   FG I+ G            VST
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS---FGFIMFGVPPQRAALVPTFVST 252

Query: 218 PLIIRD-----HYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYHSNLKS 270
           PL+         Y + L +I V  + L    +  S  ++     V+  + P  Y +   +
Sbjct: 253 PLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQALRAA 312

Query: 271 VMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI 327
             S M   +P     A P      CY+ S       P + + F  GA V L  + +    
Sbjct: 313 FRSAMTMYRP-----APPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQ- 366

Query: 328 SDEIMCSAFRGGNANIV---YGRIMQINFLIGYDIEQAMVSFKPSRC 371
                C AF    ++ +    G + Q    + YD+    + F+ + C
Sbjct: 367 ----GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 95/352 (26%), Positives = 149/352 (42%), Gaps = 37/352 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PLFDPKKSSTYNSISCS 93
           ++  +  G+P    F  +DTGS  TWTQC PC   DC+ Q+  P + P  S TY    C 
Sbjct: 58  FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCS--DCYAQKIYPKYRPAASITYRDAMCE 115

Query: 94  SSQCAV---VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
            S          +     C+Y   Y      +   G LA E +T ++  G    +  V F
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHY---LDETNIKGTLAQEMITVDTHDGGFKRVHGVYF 172

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVA 210
           GC   N  S  S    TGI+GLG G  S+I + G+    KFS+CL +    K +   I+ 
Sbjct: 173 GC---NTLSDGSYFTGTGILGLGVGKYSIIGEFGS----KFSFCLGEISEPKASHNLILG 225

Query: 211 -GAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHS 266
            GA V   P +I     H    LE+I VG    E        +FVDTG   + L    + 
Sbjct: 226 DGANVQGHPTVINITEGHTIFQLESIIVGE---EITLDDPVQVFVDTGSTLSHLSTNLYY 282

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR---GADVKLSPSNL 323
                  ++I ++P+        +   LCY   +  +  ++ + F+   GA++ ++  N+
Sbjct: 283 KFVDAFDDLIGSRPLS-------YEPTLCYKADTIERLEKMDVGFKFDVGAELSVNIHNI 335

Query: 324 F-RNISDEIMCSAFRGGN---ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F +    EI C A +      ++++ G I    + +GYD+           C
Sbjct: 336 FIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 164/372 (44%), Gaps = 42/372 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W    QC+ CP       E  L++  +S +   +
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  ++    S C     C Y  +YG G   S ++G    + + ++S +G L  +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  VIFGCG +      S +++   GI+G G  NSS+ISQ+ +S  +   F++CL  +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255

Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
               I   G V    V  TPL+    HY +++ A+ VG + L      F         +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIID 315

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
           +G     LP   +  L   +++   A  V  V       D  C+  S +    FP VT H
Sbjct: 316 SGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-----KDYKCFQYSGRVDEGFPNVTFH 370

Query: 311 FRGAD-VKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
           F  +  +++ P + LF +  + + C  +       R      + G ++  N L+ YD+E 
Sbjct: 371 FENSVFLRVYPHDYLFPH--EGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLEN 428

Query: 362 AMVSFKPSRCTN 373
            ++ +    C++
Sbjct: 429 QLIGWTEYNCSS 440


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 80/253 (31%), Positives = 115/253 (45%), Gaps = 35/253 (13%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
           +D+GSD +W QC+PCP   C +Q  PLFDP  S+TY ++ C+S+ CA +      CS   
Sbjct: 81  IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 140

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C +   YG         G+ AT T +F+  +  P + +    FGC H +  S   D   
Sbjct: 141 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 191

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
            G + LG G+ SL+ Q  T     FSYCLP   SS    G +V G            VST
Sbjct: 192 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 248

Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
           PL+        Y + L AI V  + L    +  S  ++   + ++  L P  Y    +  
Sbjct: 249 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 308

Query: 269 KSVMSNMIKAQPV 281
           +S M+    A PV
Sbjct: 309 RSAMTMYRAAPPV 321


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 159/371 (42%), Gaps = 44/371 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +    +DTGSD  W  C     CP+      +   FD   SST   +
Sbjct: 80  LYFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLV 139

Query: 91  SCS----SSQCAVVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
            CS    +SQ     + C      CSY+F YG G   S +SG   ++T  F++  G   +
Sbjct: 140 PCSHPICTSQIQTTATQCPPQSNQCSYAFQYGDG---SGTSGYYVSDTFYFDAVLGESLI 196

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP-- 196
                 ++FGC        T +D    GI G G G  S+ISQ+ +       FS+CL   
Sbjct: 197 ANSSAAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGE 256

Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL-----EFVSSSTGNIF 250
           D G   +  G I+   G+V +PL+  + HY L L++I+V  Q L      F +SS     
Sbjct: 257 DSGGGILVLGEILE-PGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTI 315

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISSQPK--FPE 306
           +DTG     L  E +    S ++  +   A P    G +       CY +S+     FP 
Sbjct: 316 IDTGTTLAYLVEEAYDPFVSAITAAVSQLATPTINKGNQ-------CYLVSNSVSEVFPP 368

Query: 307 VTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
           V+ +F  GA + L P      +++     + C  F+     I + G ++  + +  YD+ 
Sbjct: 369 VSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLA 428

Query: 361 QAMVSFKPSRC 371
              + +    C
Sbjct: 429 HQRIGWANYDC 439


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/334 (29%), Positives = 151/334 (45%), Gaps = 42/334 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP D +  VDTGSD  W  C     CP+    + +   FDP  S T   +
Sbjct: 80  LYYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPV 139

Query: 91  SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           SCS  +C+       S CS  +  C+Y+F YG G   S +SG   ++ L F+   G   +
Sbjct: 140 SCSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDG---SGTSGFYVSDVLQFDMIVGSSLV 196

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-D 197
           P     V+FGC          SD    GI G G    S+ISQ+ +  +A + FS+CL  +
Sbjct: 197 PNSTAPVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCLKGE 256

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
            G   I   G +    +V TPL+  + HY ++L +ISV  Q L      F +S+     +
Sbjct: 257 NGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPINPSVFSTSNGQGTII 316

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEV 307
           DTG     L    +      ++N +    +PV   G +       CY I++     FP V
Sbjct: 317 DTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ-------CYVIATSVADIFPPV 369

Query: 308 TIHFRGADVKLSPSNLFRNISDEIMCSAFRGGNA 341
           +++F G       +++F N  D ++     GG A
Sbjct: 370 SLNFAGG------ASMFLNPQDYLIQQNNVGGTA 397


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 80/253 (31%), Positives = 115/253 (45%), Gaps = 35/253 (13%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE-G 107
           +D+GSD +W QC+PCP   C +Q  PLFDP  S+TY ++ C+S+ CA +      CS   
Sbjct: 172 IDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANA 231

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C +   YG         G+ AT T +F+  +  P + +    FGC H +  S   D   
Sbjct: 232 QCQFGINYG--------DGSTATGTYSFDDLTLGPYDVIRGFRFGCAHADRGS-AFDYDV 282

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
            G + LG G+ SL+ Q  T     FSYCLP   SS    G +V G            VST
Sbjct: 283 AGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS---LGFLVLGVPPERAQLIPSFVST 339

Query: 218 PLI----IRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH---SNL 268
           PL+        Y + L AI V  + L    +  S  ++   + ++  L P  Y    +  
Sbjct: 340 PLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRAAF 399

Query: 269 KSVMSNMIKAQPV 281
           +S M+    A PV
Sbjct: 400 RSAMTMYRAAPPV 412


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 155/371 (41%), Gaps = 52/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
           Y + + +GTP  +     DTGS+ TW +C            PP  +F P+ S ++  + C
Sbjct: 91  YFVKVLVGTPAQEFTLVADTGSELTWVKC-------AGGASPPGLVFRPEASKSWAPVPC 143

Query: 93  SSSQCAV----VTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SS  C +      +NCS     CSY + Y  G+  +   G + T++ T     G   ++ 
Sbjct: 144 SSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGAL--GVVGTDSATIALPGGKVAQLQ 201

Query: 147 NVIFGCGHKNLASPTSDSKQ----TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
           +V+ GC      S T D +      G++ LG    S  S+      G FSYCL D  + +
Sbjct: 202 DVVLGC------SSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPR 255

Query: 203 INFGGIVAGAGVV------STPLII---RDHYYLSLEAISVGNQRL----EFVSSSTGNI 249
              G +  G G V       T L +      Y + ++A+ V  Q L    E     +G +
Sbjct: 256 NATGYLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGV 315

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS----QPKFP 305
            +D+G   T+L    +  + + ++ ++   P       P F    CYN ++     P+ P
Sbjct: 316 ILDSGTTLTVLATPAYKAVVAALTKLLAGVPKVDF---PPFEH--CYNWTAPRPGAPEIP 370

Query: 306 EVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYDIEQA 362
           ++ + F G      P+  +  ++   + C   + G      V G IMQ   L  +D++  
Sbjct: 371 KLAVQFTGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNM 430

Query: 363 MVSFKPSRCTN 373
            V F PS CT 
Sbjct: 431 EVRFMPSTCTR 441


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 76/124 (61%), Gaps = 13/124 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM +SIGTPPVD  G  DTGSD  W QC PC  L C+KQ  P+FDP KS++++ + C+S
Sbjct: 92  YLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCYKQSRPIFDPLKSTSFSHVPCNS 149

Query: 95  SQC-AVVTSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C A+  S+C ++G C YS+ YG   Y   + G+L  E +T  S+S         + GC
Sbjct: 150 QNCKAIDDSHCGAQGVCDYSYTYGDQTY---TKGDLGFEKITIGSSS------VKSVIGC 200

Query: 153 GHKN 156
           GH++
Sbjct: 201 GHES 204


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 162/363 (44%), Gaps = 58/363 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y+    IGTPP  + G++D  SD  WT C               F+P +S+T   + C+
Sbjct: 99  MYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADVPCT 148

Query: 94  SSQC---AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
              C   A  T      +C+Y+++YG G  A+ ++G L TE  TF  T      +  V+F
Sbjct: 149 DDACQQFAPQTCGAGASECAYTYMYGGG--AANTTGLLGTEAFTFGDT-----RIDGVVF 201

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQG---SSKINFG 206
           GCG KN+      S  +G+IGLG GN SL+SQ+      +FSY   PD      S I FG
Sbjct: 202 GCGLKNVG---DFSGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFILFG 255

Query: 207 --GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVD 252
                  +  +ST L+  D     YY+ L  I V  + L   S +        +G +F+ 
Sbjct: 256 DDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLS 315

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISS--QPKFPEVT 308
              L T+L    +  L+  +++ I    V G  +G +      LCY   S  + K P + 
Sbjct: 316 ITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLD------LCYTGESLAKAKVPSMA 369

Query: 309 IHFRGADV-KLSPSNLF-RNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIEQAMV 364
           + F G  V +L   N F  + +  + C      +A    V G ++Q+   + YDI  + +
Sbjct: 370 LVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKL 429

Query: 365 SFK 367
            F+
Sbjct: 430 VFE 432


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 84/251 (33%), Positives = 124/251 (49%), Gaps = 28/251 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 90  LYFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKI 149

Query: 91  SCSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG--- 140
            CS  +C  A+ TS   C   D   C Y+F YG G   S +SG   ++T+ F++  G   
Sbjct: 150 PCSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQ 206

Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP- 196
                 +++FGC +      T +D    GI G G    S++SQ+ +  ++ K FS+CL  
Sbjct: 207 TANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKG 266

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            D G   +  G IV   G+V TPL+  + HY L+LE+I V  Q+L      F +S+T   
Sbjct: 267 SDNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGT 325

Query: 250 FVDTGVLRTLL 260
            VD+G     L
Sbjct: 326 IVDSGTTLAYL 336


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 161/378 (42%), Gaps = 44/378 (11%)

Query: 17  KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
           K PIS      +   D  Y+M  SIG+P VD +   D+GS   W QC      +C++Q+ 
Sbjct: 88  KYPIS-----RMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKI 142

Query: 77  PLFDPKKSSTYNSISCSSSQCAVVTSN----CSEGD--CSYSFLYGRGAYASFSSGNLAT 130
           PLF+P KS TY    C++++C V   +    C + +  C Y   Y   +Y   + G ++T
Sbjct: 143 PLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHEDYLDDSY---TEGVIST 199

Query: 131 ETLTF-NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
           +  TF    SG       +IFGCG+ N  S        G++GL    +SL+ QM      
Sbjct: 200 DIFTFPEHISGFGNYTLRIIFGCGYNN--SDPQHFYPPGLVGLTNNKASLVGQMDVD--- 254

Query: 190 KFSYCLPD------QGSSKINFGGIVAGAGVVSTPLIIRDHYYL--SLEAISVGNQRLE- 240
           +FSYC+        +GS +I FG   + +G  +  +   D +Y+  +++ I V    +E 
Sbjct: 255 QFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVPNSDGWYIFKNVDGIYVNEFEVEG 314

Query: 241 -------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV 293
                  +     G + +DTG   T L       L  ++   I   P K   +  GF   
Sbjct: 315 YPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDY-SNSGFE-- 371

Query: 294 LCY--NISSQPKFPEVTIHF---RGADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRI 348
           LCY  +       P++ + F   +      +  N +       MC A    N   + G  
Sbjct: 372 LCYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNGMSIIGMH 431

Query: 349 MQINFLIGYDIEQAMVSF 366
              +  IGYD+   +VSF
Sbjct: 432 QLRDIKIGYDLHHNIVSF 449


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/373 (27%), Positives = 171/373 (45%), Gaps = 46/373 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSIS 91
           Y   + +G+PP + F  +DTGSD  W  C P   CP       +   F+P  SST + I 
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 92  CSSSQC--AVVTSN--CSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           CS  +C  A+ TS   C   D   C Y+F YG G   S +SG   ++T+ F++  G    
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDG---SGTSGYYVSDTMYFDTVMGNEQT 233

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGT-SIAGK-FSYCLP-- 196
                +++FGC +      T +D    GI G G    S++SQ+ +  ++ K FS+CL   
Sbjct: 234 ANSSASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCLKGS 293

Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIF 250
           D G   +  G IV   G+V TPL+  + HY L+LE+I V  Q+L      F +S+T    
Sbjct: 294 DNGGGILVLGEIVE-PGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTI 352

Query: 251 VDTGVLRTLLP----LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
           VD+G     L       + + + + +S  +++   KG       +     + S    FP 
Sbjct: 353 VDSGTTLAYLADGAYDPFVNAITAAVSPSVRSLVSKG-------NQCFVTSSSVDSSFPT 405

Query: 307 VTIHFRGA-DVKLSPSN-LFRNIS---DEIMCSAFR--GGNANIVYGRIMQINFLIGYDI 359
           V+++F G   + + P N L +  S   + + C  ++   G    + G ++  + +  YD+
Sbjct: 406 VSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDL 465

Query: 360 EQAMVSFKPSRCT 372
               + +    C+
Sbjct: 466 ANMRMGWTDYDCS 478


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 159/340 (46%), Gaps = 38/340 (11%)

Query: 52  VDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----TSNCSE 106
           VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C    CA +     S CS 
Sbjct: 3   VDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSA 62

Query: 107 GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQ 166
             C Y   YG G   S ++G  +++TLT +++S     +    FGCGH   A     +  
Sbjct: 63  AQCGYVVSYGDG---SNTTGVYSSDTLTLSASS----AVQGFFFGCGH---AQSGLFNGV 112

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIVAGAGVVSTPLIIR 222
            G++GLG    SL+ Q   +  G FSYCLP + S+     +  GG    A   ST  ++ 
Sbjct: 113 DGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLP 172

Query: 223 D-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI 276
                 +Y + L  ISVG Q+L   +S+  G   VDTG + T LP   ++ L+S   + +
Sbjct: 173 SPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGM 232

Query: 277 KAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKLSPSNLFRNISDEI 331
            +    G    P  G  D  CYN +       P V + F  GA V L    +        
Sbjct: 233 AS---YGYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTLGADGILSFGCLAF 288

Query: 332 MCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 289 APSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 324


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 157/370 (42%), Gaps = 62/370 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  D +  VDTGSD  W     C+ CP       +  L+DP  S +   +
Sbjct: 26  LYFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRV 85

Query: 91  SCSSSQCAVVTS----NC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C    +    +C  E  C Y+ +YG G   S ++G   ++ + F   +G L   
Sbjct: 86  SCDDDFCTSTYNGLLPDCKKELPCQYNVVYGDG---SSTAGYFVSDAVQFERVTGNLQTG 142

Query: 145 MPN--VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
           + N  V FGCG          ++Q+G  GLG    +L       I G F++CL       
Sbjct: 143 LSNGTVTFGCG----------AQQSG--GLGTSGEAL-----DGILGAFAHCL-----DN 180

Query: 203 INFGGIVAGAGVVS-----TPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
           +N GGI A   +VS     TP++  + HY + ++ I VG   LE     F S       +
Sbjct: 181 VNGGGIFAIGELVSPKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTII 240

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G     LP   +    S+M+ +   QP  G+         +C+  S      FP++  
Sbjct: 241 DSGTTLAYLPEVVYD---SMMNEIRSQQP--GLSLHTVEEQFICFKYSGNVDDGFPDIKF 295

Query: 310 HFRGA-DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQI-------NFLIGYDIEQ 361
           HF+ +  + + P +    IS++I C  ++ G      GR M +       N L+ YDIE 
Sbjct: 296 HFKDSLTLTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIEN 355

Query: 362 AMVSFKPSRC 371
             + +    C
Sbjct: 356 QAIGWTEYNC 365


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/366 (27%), Positives = 170/366 (46%), Gaps = 51/366 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC 
Sbjct: 81  LYVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 94  SSQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           +S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPS 189

Query: 148 VIFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
             FGC   NL S  ++      G++G+G G  S++ Q      G FSYCLP Q S +   
Sbjct: 190 FTFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFF 245

Query: 203 ------INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNI 249
                  + G +     V  T ++ R    + +++ L AISV  +RL     + S  G +
Sbjct: 246 SKTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVV 305

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
           F D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +
Sbjct: 306 F-DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAI 359

Query: 308 TIHF-RGADVKLSPSNLF--RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAM 363
           ++HF  GA   L    +F  R++ ++ + C AF    +  + G +MQ +  + YD+++ +
Sbjct: 360 SLHFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQL 419

Query: 364 VSFKPS 369
           +   PS
Sbjct: 420 IGIGPS 425


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 108/377 (28%), Positives = 173/377 (45%), Gaps = 62/377 (16%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++G+PP ++   +DTGS+ +W  C+    L+       +F+P  S TY+ + C S  C 
Sbjct: 73  LTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNS------VFNPLSSKTYSKVPCLSPTCK 126

Query: 99  VVTSNCS-EGDCSYSFL-YGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             T + +    C  + L +   +YA  +S  GNLA ET    S +      P  IFGC  
Sbjct: 127 TRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK-----PATIFGCMD 181

Query: 155 KNLASPT-SDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAG 213
              +S +  DSK TG+IG+  G+ S ++QMG     KFSYC+    S+ +   G  +   
Sbjct: 182 SGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCISGFDSAGVLLLGNASFPW 238

Query: 214 V----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIFVDTG 254
           +          +STPL   D   Y + LE I V N+ L      FV   T  G   VD+G
Sbjct: 239 LKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMVDSG 298

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNI-SSQP---KFPE 306
              T L    ++ LK+    + + + +  V  +  F    +  LCY + SS+P     P 
Sbjct: 299 TQFTFLLGPVYTALKNEF--LSQTRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNLPV 356

Query: 307 VTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIMQINFL 354
           V++ F+GA++ +S   L   +       D + C  F  GN+++      V G   Q N  
Sbjct: 357 VSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTF--GNSDLLGVEAFVIGHHHQQNVW 414

Query: 355 IGYDIEQAMVSFKPSRC 371
           + +D+E++ +     RC
Sbjct: 415 MEFDLEKSRIGLADVRC 431


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 167/369 (45%), Gaps = 52/369 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + IG     +   VDT S+ TW QCEPC    C  Q+ PLFDP  S +Y ++ C+S
Sbjct: 113 YVATVGIGGGEATVI--VDTASELTWVQCEPCDA--CHDQQEPLFDPSSSPSYAAVPCNS 168

Query: 95  SQCAVVT-------SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           S C  +          C +    CSY+  Y  G+Y   S G LA + L+         ++
Sbjct: 169 SSCDALRVATGMSGQACDDQPAACSYTLSYRDGSY---SRGVLAHDRLSLAGE-----DI 220

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKI 203
              +FGCG  N   P   +  +G++GLG    SLISQ      G FSYCLP  + GSS  
Sbjct: 221 QGFVFGCGTSN-QGPFGGT--SGLMGLGRSQLSLISQTMDQFGGVFSYCLPPKESGSSGS 277

Query: 204 NFGGIVAGAGVVSTPLI--------IRDHYYLS-LEAISVGNQRLE---FVSSSTGNIFV 251
              G  A     STP++        ++  +YL+ L  I+VG + ++   F +   G   V
Sbjct: 278 LVLGDDASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSPGFSAGGGGKAIV 337

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--QPKFPEVT 308
           D+G + T L    ++ +++   + +   P     A P FS +  C++++   + + P + 
Sbjct: 338 DSGTIITSLVPSVYAAVRAEFVSQLAEYPQ----AAP-FSILDTCFDLTGLREVQVPSLK 392

Query: 309 IHFR-GADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
           + F  GA+V++    +   ++ +     +  ++ +      + G   Q N  + +D   +
Sbjct: 393 LVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGS 452

Query: 363 MVSFKPSRC 371
            + F    C
Sbjct: 453 QIGFAQETC 461


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 163/372 (43%), Gaps = 42/372 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W    QC+ CP       E  L++  +S +   +
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  ++    S C     C Y  +YG G   S ++G    + + ++S +G L  +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  VIFGCG +      S +++   GI+G G  NSS+ISQ+ +S  +   F++CL  +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255

Query: 199 GSSKINFGGIVAGAGVVSTPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
               I   G V    V  TPL+    HY +++ A+ VG + L      F         +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIID 315

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
           +G     LP   +  L   +++   A  V  V       D  C+  S +    FP VT H
Sbjct: 316 SGTTLAYLPEIIYEPLVKKITSQEPALKVHIVD-----KDYKCFQYSGRVDEGFPNVTFH 370

Query: 311 FRGAD-VKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGYDIEQ 361
           F  +  +++ P + LF    + + C  +       R      + G ++  N L+ YD+E 
Sbjct: 371 FENSVFLRVYPHDYLFP--YEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLEN 428

Query: 362 AMVSFKPSRCTN 373
            ++ +    C++
Sbjct: 429 QLIGWTEYNCSS 440


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 161/371 (43%), Gaps = 61/371 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP      +D   +  WTQC+ C    CF+Q  PLFDP  S+TY +  C +
Sbjct: 51  YVANFTIGTPPQPASAVIDLAGELVWTQCKQCGR--CFEQGTPLFDPTASNTYRAEPCGT 108

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   NCS   C+Y         A  + G + T+T    +         ++ FG
Sbjct: 109 PLCESIPSDVRNCSGNVCAYE----ASTNAGDTGGKVGTDTFAVGTAKA------SLAFG 158

Query: 152 CGHKNLASPTSDSK--QTGIIGLGPGNSSLISQMGTSIAGKFSYCLP--DQGSSKINFGG 207
           C    + +   D+    +GI+GLG    SL++Q G +    FSYCL   D G +   F G
Sbjct: 159 C----VVASDIDTMGGPSGIVGLGRTPWSLVTQTGVA---AFSYCLAPHDAGKNSALFLG 211

Query: 208 ----IVAGAGVVSTPLI--------IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGV 255
               +  G    STP +        + ++Y + LE +  G+  +    S +      T +
Sbjct: 212 SSAKLAGGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGS------TVL 265

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNIS-SQPKFPEVTIH 310
           L T  P+ +   L       +K      VGA P  + V    LC+  S +    P++   
Sbjct: 266 LDTFSPISF---LVDGAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFT 322

Query: 311 FR-GADVKLSPSNLFRNISDEIMCSAFRGGNANI-------VYGRIMQINFLIGYDIEQA 362
           FR GA + +  +N   +  +  +C A    +A +       + G + Q N    +D+++ 
Sbjct: 323 FRGGAAMTVPATNYLLDYKNGTVCLAML-SSARLNSTTELSLLGSLQQENIHFLFDLDKE 381

Query: 363 MVSFKPSRCTN 373
            +SF+P+ CT 
Sbjct: 382 TLSFEPADCTK 392


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 103/378 (27%), Positives = 150/378 (39%), Gaps = 64/378 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP+ S +Y ++ C++
Sbjct: 147 YFTKIGVGTPVTPALMVLDTGSDVVWLQCAPCRR--CYDQSGQMFDPRASHSYGAVDCAA 204

Query: 95  SQCAVVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFG 151
             C  + S   +     C Y   YG G   S ++G+ ATETLTF S +     +P V  G
Sbjct: 205 PLCRRLDSGGCDLRRKACLYQVAYGDG---SVTAGDFATETLTFASGA----RVPRVALG 257

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD---------QGSSK 202
           CGH N     + +   G+     G+ S  SQ+       FSYCL D           SS 
Sbjct: 258 CGHDNEGLFVAAAGLLGLG---RGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSST 314

Query: 203 INFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFV-------------------- 242
           + FG    GA       + R   +   E    G+  L                       
Sbjct: 315 VTFGSGARGA-------LGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPP 367

Query: 243 --SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNI 298
             S+  G + VD+G           +   +  S    A    G+   PG   +   CY++
Sbjct: 368 DPSTGRGGVIVDSGRPSPAWARAGRTPPCATRSRAAAA----GLRLSPGGFSLFDTCYDL 423

Query: 299 SSQP--KFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINF 353
           S     K P V++HF  GA+  L P N    + S    C AF G +  + + G I Q  F
Sbjct: 424 SGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGF 483

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + +D +   + F P  C
Sbjct: 484 RVVFDGDGQRLGFVPKGC 501


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 42/371 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + +  +DTGSD  W     C  CP+          FDP  SST + I
Sbjct: 82  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 141

Query: 91  SCSSSQCAVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
           SCS  +C++   +   G       C Y+F YG G   S +SG   ++ L F++  G  V 
Sbjct: 142 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDG---SGTSGYYVSDLLNFDAIVGSSVT 198

Query: 144 -EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
               +++FGC        T SD    GI G G  + S+ISQM +  I  K FS+CL   G
Sbjct: 199 NSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 258

Query: 200 SSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
                     +    +V +PL+  + HY L+L++ISV  + L      F +S+     VD
Sbjct: 259 GGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVD 318

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           +G     L  E +    S ++  +    +P+   G +       CY I+S  K  FP V+
Sbjct: 319 SGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-------CYLITSSVKGIFPTVS 371

Query: 309 IHFRGA-DVKLSPSNLF---RNISD-EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
           ++F G   + L P +      +I D  + C  F+   G    + G ++  + +  YD+  
Sbjct: 372 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 431

Query: 362 AMVSFKPSRCT 372
             + +    C+
Sbjct: 432 QRIGWANYDCS 442


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 166/373 (44%), Gaps = 61/373 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y+ +L+IGTPP      +    +  WTQC PC    CFKQ+ PLF+   SSTY    C 
Sbjct: 27  LYMANLTIGTPPQPASAIIHLAGEFVWTQCSPCRR--CFKQDLPLFNRSASSTYRPEPCG 84

Query: 94  SSQC-AVVTSNCS-EGDCSYSF--LYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           ++ C +V  S CS +G CSY    ++G       +SG   T+T    + +       ++ 
Sbjct: 85  TALCESVPASTCSGDGVCSYEVETMFGD------TSGIGGTDTFAIGTATA------SLA 132

Query: 150 FGCGHKNLASPTSDSKQ----TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----S 201
           FGC         S+ KQ    +G++GLG    SL+ QM    A  FSYCL   G+    S
Sbjct: 133 FGCAMD------SNIKQLLGASGVVGLGRTPWSLVGQMN---ATAFSYCLAPHGAAGKKS 183

Query: 202 KINFGG---IVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
            +  G    +  G    +TPL+        Y + LE I  G+  +    + +  + VDT 
Sbjct: 184 ALLLGASAKLAGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGS-VVLVDTI 242

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-------NISSQPKFPEV 307
              + L       +K  ++  + A P+    A P     LC+         +S    P+V
Sbjct: 243 FGVSFLVDAAFQAIKKAVTVAVGAAPM----ATPTKPFDLCFPKAAAAAGANSSLPLPDV 298

Query: 308 TIHFRG-ADVKLSPSNLFRNISDEIMCSAFR-GGNANI-----VYGRIMQINFLIGYDIE 360
            + F+G A + + PS    +  +  +C A       N+     + GR+ Q N    +D++
Sbjct: 299 VLTFQGAAALTVPPSKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLD 358

Query: 361 QAMVSFKPSRCTN 373
           +  +SF+P+ C++
Sbjct: 359 KETLSFEPADCSS 371


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 102/371 (27%), Positives = 165/371 (44%), Gaps = 42/371 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP + +  +DTGSD  W     C  CP+          FDP  SST + I
Sbjct: 67  LYFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLI 126

Query: 91  SCSSSQCAVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
           SCS  +C++   +   G       C Y+F YG G   S +SG   ++ L F++  G  V 
Sbjct: 127 SCSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDG---SGTSGYYVSDLLNFDAIVGSSVT 183

Query: 144 -EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
               +++FGC        T SD    GI G G  + S+ISQM +  I  K FS+CL   G
Sbjct: 184 NSSASIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDG 243

Query: 200 SSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
                     +    +V +PL+  + HY L+L++ISV  + L      F +S+     VD
Sbjct: 244 GGGGILVLGEIVEEDIVYSPLVPSQPHYNLNLQSISVNGKSLAIDPEVFATSTNRGTIVD 303

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           +G     L  E +    S ++  +    +P+   G +       CY I+S  K  FP V+
Sbjct: 304 SGTTLAYLAEEAYDPFVSAITEAVSQSVRPLLSKGTQ-------CYLITSSVKGIFPTVS 356

Query: 309 IHFRGA-DVKLSPSNLF---RNISD-EIMCSAFRG--GNANIVYGRIMQINFLIGYDIEQ 361
           ++F G   + L P +      +I D  + C  F+   G    + G ++  + +  YD+  
Sbjct: 357 LNFAGGVSMNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAG 416

Query: 362 AMVSFKPSRCT 372
             + +    C+
Sbjct: 417 QRIGWANYDCS 427


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 122/446 (27%), Positives = 175/446 (39%), Gaps = 115/446 (25%)

Query: 12  DNETPKSPISIIYQAE-IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PC 66
           DN   K P  +I   E +  V D YL+ L++GTPP  I   +DTGSD TW  C      C
Sbjct: 5   DNGLTKKPSGMIDMMEPLREVRDGYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDC 64

Query: 67  PELDCFKQEP--------------------PLFDPKKSSTYNSISCSSSQCA---VVTSN 103
            + + ++                       PL     SS  +   C+ + C+   +V   
Sbjct: 65  MDCNDYRNNKLMSTYSPSYSSSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGT 124

Query: 104 CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTS 162
           C     S+++ YG G       G L  +TLT + +S     E+PN  FGC       P  
Sbjct: 125 CPRPCPSFAYTYGAGGVV---IGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP-- 179

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR 222
                GI G G G  SL SQ+G    G FS+C           G   A    +S+PL+I 
Sbjct: 180 ----IGIAGFGRGVLSLPSQLGFLQKG-FSHCF---------LGFKFANNPNISSPLVIG 225

Query: 223 D----------------------HYYLSLEAISVGNQRL--------EFVSSSTGNIFVD 252
           D                      +YY+ LEAI+VGN           EF S   G + +D
Sbjct: 226 DLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIID 285

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKF 304
           +G   T LP  +++ L S++ ++I     +   A  GF   LCY I              
Sbjct: 286 SGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTGFD--LCYRIPCPNNVVTDHDHLL 343

Query: 305 PEVTIHFRGADVKL------------SPSN-------LFRNISDEIMCSAFRGGNANIVY 345
           P ++ HF   +V L            +PSN       L +N+ D         G A  V+
Sbjct: 344 PSISFHFSN-NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDS------DSGPAG-VF 395

Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
           G   Q N  + YD+E+  + F+P  C
Sbjct: 396 GSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 163/382 (42%), Gaps = 52/382 (13%)

Query: 19  PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
           P+ I    +I+S+ + Y+    +GTP   +  ++D  +D  W  C  C    C    P  
Sbjct: 68  PVPIAPGRQILSIPN-YIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGCAASSP-S 123

Query: 79  FDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAY-ASFSSGNLATETL 133
           F P +SSTY ++ C S QCA V S +C  G    C ++  Y    + A     +LA E  
Sbjct: 124 FSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENN 183

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
              S +          FGC    + S  S   Q G+IG G G  S +SQ   +    FSY
Sbjct: 184 VVVSYT----------FGC--LRVVSGNSVPPQ-GLIGFGRGPLSFLSQTKDTYGSVFSY 230

Query: 194 CLPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
           CLP+  SS     +  G I     + +TPL+   H    YY+++  I VG++ ++   S+
Sbjct: 231 CLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSA 290

Query: 246 ------TGN-IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
                 TG+   +D G + T L    ++ ++      ++      +G   GF    CYN+
Sbjct: 291 LAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLG---GFDT--CYNV 345

Query: 299 SSQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQ 350
           +     P VT  F GA  V L   N +  + S  + C A   G     NA + V   + Q
Sbjct: 346 TV--SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQ 403

Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
            N  + +D+    V F    CT
Sbjct: 404 QNQRVLFDVANGRVGFSRELCT 425


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 156/363 (42%), Gaps = 64/363 (17%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-------------- 97
           VDT S+ TW QC PC    C  Q+ PLFDP  S +Y ++ C+SS C              
Sbjct: 168 VDTASELTWVQCAPCES--CHDQQDPLFDPSSSPSYAAVPCNSSSCDALQLATGGTSGGA 225

Query: 98  -AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
            A    + S   CSY+  Y  G+Y   S G LA + L+          +   +FGCG  N
Sbjct: 226 AACQGQDQSAAACSYTLSYRDGSY---SRGVLAHDRLSLAGEV-----IDGFVFGCGTSN 277

Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAG----- 211
              P   +  +G++GLG    SL+SQ      G FSYCLP + S   + G +V G     
Sbjct: 278 QGPPFGGT--SGLMGLGRSQLSLVSQTMDQFGGVFSYCLPLKESD--SSGSLVIGDDSSV 333

Query: 212 ---------AGVVSTPLIIRDHYYLSLEAISVGNQRLE----FVSSSTGNIFVDTG-VLR 257
                    A +VS PL     Y+++L  I+VG Q +E          G   +D+G V+ 
Sbjct: 334 YRNSTPIVYASMVSDPL-QGPFYFVNLTGITVGGQEVESSGFSSGGGGGKAIIDSGTVIT 392

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNIS--SQPKFPEVTIHFRGA 314
           +L+P  Y++     +S   +          PGFS +  C+N++   + + P + + F G 
Sbjct: 393 SLVPSIYNAVKAEFLSQFAEYPQA------PGFSILDTCFNMTGLREVQVPSLKLVFDGG 446

Query: 315 -DVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
            +V++    +   +S +     +  +  +      + G   Q N  + +D   + V F  
Sbjct: 447 VEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQVGFAQ 506

Query: 369 SRC 371
             C
Sbjct: 507 ETC 509


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 159/361 (44%), Gaps = 60/361 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YLM L + TPPV +    DTGS   W +C          + P    P  SS+Y  + C +
Sbjct: 76  YLMALDVSTPPVRMLALADTGSSLVWLKC----------KLPAAHTP-ASSSYARLPCDA 124

Query: 95  SQCAVV--TSNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
             C  +   ++C         C Y + +  G   S ++G +  +  TF++          
Sbjct: 125 FACKALGDAASCRATGSGNNICVYRYAFADG---SCTAGPVTVDAFTFST---------R 172

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG--TSIAGKFSYCL-----PDQGS 200
           + FGC  +       D    G++GL  G  SL+SQ+   T  A KFSYCL      +  S
Sbjct: 173 LDFGCATRTEGLSVPDD---GLVGLANGPISLVSQLSAKTPFAHKFSYCLVPYSSSETVS 229

Query: 201 SKINFG--GIVAGA-GVVSTPLII---RDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
           S +NFG   IV+ + G  +TPL+    +  Y ++L++I V  + +   +++T  + VD+G
Sbjct: 230 SSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT-KLIVDSG 288

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP------KFPEVT 308
            + T LP      L + ++  IK   VK     P     +CY++  +         P+VT
Sbjct: 289 TMLTYLPKAVLDPLVAALTAAIKLPRVK----SPETLYAVCYDVRRRAPEDVGKSIPDVT 344

Query: 309 IHF-RGADVKLSPSNLF--RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           +    G +V+L   N F   N    +  +         + G + Q N  +G+D+E+  VS
Sbjct: 345 LVLGGGGEVRLPWGNTFVVENKGTTVCLALVESHLPEFILGNVAQQNLHVGFDLERRTVS 404

Query: 366 F 366
           F
Sbjct: 405 F 405


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 158/378 (41%), Gaps = 56/378 (14%)

Query: 35  YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           YL+ L IGTP   I   +   DTGSD +WTQCEPC     F   PP  DP KS T+  +S
Sbjct: 102 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 160

Query: 92  CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
           C    C    AVV        C +   YG G      SG L ++   F +     G  +E
Sbjct: 161 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 217

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
             +V FGC H    S       TGI+ LG G  S ++Q+G     +FSYC+P        
Sbjct: 218 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 272

Query: 197 -----DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSSST 246
                ++ +S + FG      G  +        Y + L+++        NQ+        
Sbjct: 273 DDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 332

Query: 247 GN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
           G        + VD+G     LP      L+  +   I       +   P    + CY  N
Sbjct: 333 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYLGN 388

Query: 298 ISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINF 353
           ++       VT+ F  GAD++L  ++LF    N++++ +C A   GN  I+ G   Q N 
Sbjct: 389 MTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQRNI 446

Query: 354 LIGYDIEQAMVSFKPSRC 371
            +GYD+    ++F   +C
Sbjct: 447 NVGYDLSTMEIAFDRDQC 464


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 111/402 (27%), Positives = 160/402 (39%), Gaps = 88/402 (21%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTW---------TQCEPCPELDCFKQEPPLFDPKKSS 85
           Y + LS GTPP  +   +DTGSD  W           C         + +P  F PK+SS
Sbjct: 67  YSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQP--FIPKESS 124

Query: 86  TYNSISCSSSQCAVVTS---NCSEGDCS-----------YSFLYGRGAYASFSSGNLATE 131
           +   + C + +C+ +     NC + DCS           Y   YG G     + G   +E
Sbjct: 125 SSKLLGCKNPKCSWIHHSNINCDQ-DCSIKSCLNQTCPPYMIFYGSGT----TGGVALSE 179

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
           TL  +S S      PN + GC      S  S  +  GI G G G SSL SQ+G    GKF
Sbjct: 180 TLHLHSLS-----KPNFLVGC------SVFSSHQPAGIAGFGRGLSSLPSQLGL---GKF 225

Query: 192 SYCL--------PDQGSSKI----NFGGIVAGAGVVSTPLI----------IRDHYYLSL 229
           SYCL          + SS +              +V TP +             +YYL L
Sbjct: 226 SYCLLSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGL 285

Query: 230 EAISVGNQRLE----FVS---SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIK-AQPV 281
             I+VG   ++    ++S      G + +D+G   T +  E    L       IK  + V
Sbjct: 286 RRITVGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRV 345

Query: 282 KGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAF-- 336
           K +    G     C+N+S      FPE+ ++F+G ADV L   N F  +  E+ C     
Sbjct: 346 KEIEDAIGLRP--CFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVT 403

Query: 337 -------RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                  R G   ++ G     NF + YD+    + FK  +C
Sbjct: 404 DGVAGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 163/382 (42%), Gaps = 52/382 (13%)

Query: 19  PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
           P+ I    +I+S+ + Y+    +GTP   +  ++D  +D  W  C  C    C    P  
Sbjct: 87  PVPIAPGRQILSIPN-YIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGCAASSP-S 142

Query: 79  FDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAY-ASFSSGNLATETL 133
           F P +SSTY ++ C S QCA V S +C  G    C ++  Y    + A     +LA E  
Sbjct: 143 FSPTQSSTYRTVPCGSPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENN 202

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
              S +          FGC    + S  S   Q G+IG G G  S +SQ   +    FSY
Sbjct: 203 VVVSYT----------FGC--LRVVSGNSVPPQ-GLIGFGRGPLSFLSQTKDTYGSVFSY 249

Query: 194 CLPDQGSSK----INFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS 245
           CLP+  SS     +  G I     + +TPL+   H    YY+++  I VG++ ++   S+
Sbjct: 250 CLPNYRSSNFSGTLKLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSA 309

Query: 246 ------TGN-IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI 298
                 TG+   +D G + T L    ++ ++      ++      +G   GF    CYN+
Sbjct: 310 LAFNPVTGSGTIIDAGTMFTRLAAPVYAAVRDAFRGRVRTPVAPPLG---GFDT--CYNV 364

Query: 299 SSQPKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANI-VYGRIMQ 350
           +     P VT  F GA  V L   N +  + S  + C A   G     NA + V   + Q
Sbjct: 365 TV--SVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQ 422

Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
            N  + +D+    V F    CT
Sbjct: 423 QNQRVLFDVANGRVGFSRELCT 444


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 104/364 (28%), Positives = 165/364 (45%), Gaps = 47/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VDTGS  T+  C  C    C K + P F P +SSTY+ + C+ 
Sbjct: 88  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSDCEH--CGKHQDPRFQPDESSTYHPVKCN- 144

Query: 95  SQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +  NC     +C Y   Y   A  S SSG L  + ++F + S +  +    +FGC
Sbjct: 145 -----MDCNCDHDGVNCVYERRY---AEMSSSSGVLGEDIISFGNQSEVVPQ--RAVFGC 194

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLPDQGSSKINFGGIVA 210
            +       S  +  GI+GLG G  S++ Q+     I   FS C    G   +  G +V 
Sbjct: 195 ENVETGDLYSQ-RADGIMGLGRGQLSIVDQLVDKNVINDSFSLCY---GGMHVGGGAMVL 250

Query: 211 GAGVVSTPLII-------RDHYY-LSLEAISVGNQRLEFVSSS---TGNIFVDTGVLRTL 259
           G G+   P ++       R  YY + L+ I V  + L+   S+        +D+G     
Sbjct: 251 G-GIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPLKLSPSTFDRKHGTVLDSGTTYAY 309

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF- 311
           LP E     +  +  + K+  +K + G +P ++D+ C+     ++S   K FPEV + F 
Sbjct: 310 LPEEAFVAFRDAI--IKKSHNLKQIHGPDPNYNDI-CFSGAGRDVSQLSKAFPEVDMVFS 366

Query: 312 RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKP 368
            G  + L+P N LF++  +        FR G++  + G I+  N L+ YD E   + F  
Sbjct: 367 NGQKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWK 426

Query: 369 SRCT 372
           + C+
Sbjct: 427 TNCS 430


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 157/361 (43%), Gaps = 40/361 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP      VDTGS  T+  C  C +  C + + P FDP+ SSTY  I C+ 
Sbjct: 83  YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFDPESSSTYKPIKCN- 139

Query: 95  SQCAVVTSNC-SEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +   C S+G  C Y   Y   A  S SSG L  + ++F + S L  +    +FGC
Sbjct: 140 -----IDCICDSDGVQCVYERQY---AEMSTSSGVLGEDVISFGNQSELIPQ--RAVFGC 189

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
            +       S  +  GI+GLG G+ SL+ Q+    +I   FS C    D G   +  GGI
Sbjct: 190 ENMETGDLFS-QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGI 248

Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLP 261
              + ++   S P  +R  YY + L+ I V  ++L   S      +   +D+G     LP
Sbjct: 249 SPPSDMIFTYSDP--VRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLP 306

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP-----KFPEVTIHFR-GAD 315
            E  S  K  + + I +   K  G +P F D+      S       KFP V + F  G  
Sbjct: 307 AEAFSAFKDAIMDEIHSLK-KIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQK 365

Query: 316 VKLSPSNLF---RNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L+P N F     +        F  GN    + G I+  N L+ YD   + + F  + C
Sbjct: 366 LSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTNC 425

Query: 372 T 372
           +
Sbjct: 426 S 426


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 108/362 (29%), Positives = 159/362 (43%), Gaps = 42/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP      VDTGS  T+  C  C +  C + + P FDP+ SSTY  I C+ 
Sbjct: 83  YTTRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFDPESSSTYKPIKCN- 139

Query: 95  SQCAVVTSNC-SEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                +   C S+G  C Y   Y   A  S SSG L  + ++F + S L  +    +FGC
Sbjct: 140 -----IDCICDSDGVQCVYERQY---AEMSTSSGVLGEDVISFGNQSELIPQ--RAVFGC 189

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
            +       S  +  GI+GLG G+ SL+ Q+    +I   FS C    D G   +  GGI
Sbjct: 190 ENMETGDLFS-QRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGI 248

Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLP 261
              + ++   S P  +R  YY + L+ I V  ++L   S      +   +D+G     LP
Sbjct: 249 SPPSDMIFTYSDP--VRSPYYNVDLKEIHVAGKKLPLSSGIFDGRYGAVLDSGTTYAYLP 306

Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQP-----KFPEVTIHFR-GA 314
            E  S  K  + + I +  +K + G +P F D+      S       KFP V + F  G 
Sbjct: 307 AEAFSAFKDAIMDEIHS--LKKIDGPDPNFKDICFSGAGSDAAELSNKFPTVDMVFENGQ 364

Query: 315 DVKLSPSNLF---RNISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSR 370
            + L+P N F     +        F  GN    + G I+  N L+ YD   + + F  + 
Sbjct: 365 KLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIGFWKTN 424

Query: 371 CT 372
           C+
Sbjct: 425 CS 426


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 107/378 (28%), Positives = 158/378 (41%), Gaps = 56/378 (14%)

Query: 35  YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           YL+ L IGTP   I   +   DTGSD +WTQCEPC     F   PP  DP KS T+  +S
Sbjct: 123 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 181

Query: 92  CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
           C    C    AVV        C +   YG G      SG L ++   F +     G  +E
Sbjct: 182 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 238

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
             +V FGC H    S       TGI+ LG G  S ++Q+G     +FSYC+P        
Sbjct: 239 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 293

Query: 197 -----DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSSST 246
                ++ +S + FG      G  +        Y + L+++        NQ+        
Sbjct: 294 DDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVA 353

Query: 247 GN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
           G        + VD+G     LP      L+  +   I       +   P    + CY  N
Sbjct: 354 GEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYLGN 409

Query: 298 ISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQINF 353
           ++       VT+ F  GAD++L  ++LF    N++++ +C A   GN  I+ G   Q N 
Sbjct: 410 MTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQRNI 467

Query: 354 LIGYDIEQAMVSFKPSRC 371
            +GYD+    ++F   +C
Sbjct: 468 NVGYDLSTMEIAFDRDQC 485


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 96/376 (25%), Positives = 161/376 (42%), Gaps = 60/376 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +G+P   I  ++DT +D TW  C PC           LF P  S++Y  + CSS
Sbjct: 77  YVVRAGLGSPAQPILLALDTSADATWAHCSPC---GTCPSSGSLFAPANSTSYAPLPCSS 133

Query: 95  SQCAVVTSN-CSEGD----------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           + C V+    C   D          C+++  +   A ASF + +LA++ L     +    
Sbjct: 134 TMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPF---ADASFQA-SLASDWLHLGKDA---- 185

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
            +PN  FGC    ++ PT++  + G++GLG G  +L+SQ+G    G FSYCLP       
Sbjct: 186 -IPNYAFGC-VSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYF 243

Query: 200 SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGN 248
           S  +  G      GV  TP++   +    YY+++  +SVG   ++       F  ++   
Sbjct: 244 SGSLRLGAAGQPRGVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAG 303

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV----LCYNISSQPK- 303
             VD+G + T      ++ L+            + V A  G++ +     C+N       
Sbjct: 304 TVVDSGTVITRWTPPVYAALREEFR--------RHVAAPSGYTSLGAFDTCFNTDEVAAG 355

Query: 304 -FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLI 355
             P VT+H  G  D+ L   N L  + +  + C A      N+     V   + Q N  +
Sbjct: 356 VAPAVTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRV 415

Query: 356 GYDIEQAMVSFKPSRC 371
            +D+  + V F    C
Sbjct: 416 VFDVANSRVGFARESC 431


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 175/397 (44%), Gaps = 51/397 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  ++ +++     KS + I    +II     Y++    GTPP  +  ++DT SD  W
Sbjct: 64  AKDQARMQYFSSLVARKSVVPIASARQIIQ-SPTYIVKAKFGTPPQTLLLALDTSSDAAW 122

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C  + C   +P  F P KS+++ ++SC S  C  V +  C    C+++F YG  +
Sbjct: 123 IPCSGC--VGCSTSKP--FAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSS 178

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    ++  +TLT  +       +P   FGC +K      S + Q G++GLG G  SL
Sbjct: 179 IAA----SVVQDTLTLAAD-----PIPGYTFGCVNKTTG---SSAPQQGLLGLGRGPLSL 226

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGG------IVAGAGVVSTPLIIRDH----YYLSL 229
           +SQ        FSYCLP   S  INF G      +     +  TPL+        YY++L
Sbjct: 227 LSQSQNLYKSTFSYCLPSFKS--INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNL 284

Query: 230 EAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ-PV 281
            AI VG +        L F  ++      D+G + T L    ++ +++     +  + PV
Sbjct: 285 VAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPV 344

Query: 282 KGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGGN 340
             +G   GF    CYN+      P +T  F G +V L P N+   + +    C A  G  
Sbjct: 345 TTLG---GFDT--CYNVPI--VVPTITFLFSGMNVALPPDNIVIHSTAGSTTCLAMAGAP 397

Query: 341 ANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            N+     V   + Q N  + +D+  + +      CT
Sbjct: 398 DNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 96/370 (25%), Positives = 158/370 (42%), Gaps = 40/370 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       VDTGS+ TW  C          +   +F  ++S ++ ++ C +
Sbjct: 88  YFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFT 147

Query: 95  SQCAVVTSNC--------SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
             C V   N             CSY + Y  G+ A    G  A ET+T   T+G    + 
Sbjct: 148 QTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQ---GVFAKETITVGLTNGRKARLR 204

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            ++ GC           +   G++GL   + S  S   +    K SYCL D  S+K    
Sbjct: 205 GLLVGCSSSFSGQSFQGAD--GVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISN 262

Query: 203 -INFG-----GIVAGAGVVSTPL---IIRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
            + FG          A   +TPL   +I   Y +++  IS+G+  L+     + +++ G 
Sbjct: 263 YLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGG 322

Query: 249 IFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK--GVGAEPGFSDVLCYNISSQPKFP 305
             +D+G   TLL    Y   +  +   +++ + VK  G+  E  FS    +N   + K P
Sbjct: 323 TILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFN---ESKLP 379

Query: 306 EVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFR--GGNANIVYGRIMQINFLIGYDIEQA 362
           ++T H +G A  +    +   + +  + C  F   G  A  V G IMQ N+L  +D+  +
Sbjct: 380 QLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMAS 439

Query: 363 MVSFKPSRCT 372
            +SF PS CT
Sbjct: 440 TLSFAPSTCT 449


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 102/379 (26%), Positives = 163/379 (43%), Gaps = 58/379 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP       E   +D ++S+T   +
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  V     S C+    C Y  +YG G   S ++G    + + +N  SG L   
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDG---SSTAGYFVKDYVQYNRVSGDLETT 202

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  + FGCG +      S  ++   GI+G G  NSS+ISQ+ ++  +   F++CL   
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL--- 259

Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTG 247
                N GGI A   VV      TPL+    HY +++  + VG+  L      F +    
Sbjct: 260 --DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRK 317

Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNISSQPK- 303
              +D+G     LP L Y   +  ++S    ++ Q + G        +  C+  S +   
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG--------EYKCFQYSERVDD 369

Query: 304 -FPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAF-------RGGNANIVYGRIMQINF 353
            FP V  HF  +  +K+ P   LF+   + + C  +       R      ++G ++  N 
Sbjct: 370 GFPPVIFHFENSLLLKVYPHEYLFQ--YENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNK 427

Query: 354 LIGYDIEQAMVSFKPSRCT 372
           L+ YD+E   + +    C+
Sbjct: 428 LVLYDLENQTIGWTEYNCS 446


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 112/361 (31%), Positives = 157/361 (43%), Gaps = 39/361 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTPP +    VDTGS  T+  C  C    C   + P F P  SS+Y  + C  
Sbjct: 35  YTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSCTH--CGNHQDPRFSPALSSSYKPLECG- 91

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRG-AYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
                  S CS G C  S  Y R  A  S SSG L  + + F+++S L  +   ++FGC 
Sbjct: 92  -------SECSTGFCDGSRKYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQ--RLVFGCE 142

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLP--DQGSSKINFGGIV 209
                    D    GIIGLG G  S+I Q+    ++   FS C    D+G   +  GG  
Sbjct: 143 TAETGD-LYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGMDEGGGAMILGGFQ 201

Query: 210 AGAGVVSTPLI-IRDHYY-LSLEAISVGNQRL----EFVSSSTGNIFVDTGVLRTLLPLE 263
               +V T     R  YY L L+ I VG   L    E      G + +D+G      P  
Sbjct: 202 PPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFDGKYGTV-LDSGTTYAYFPGA 260

Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
                KS +   + +  +K V G +  F D+ CY     N+S+  + FP V   F  G  
Sbjct: 261 AFQAFKSAVKEQVGS--LKEVPGPDEKFKDI-CYAGAGTNVSNLSQFFPSVDFVFGDGQS 317

Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           V LSP N LFR+  IS       F  G+   + G I+  N L+ Y+  +A + F  ++C 
Sbjct: 318 VTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKTKCN 377

Query: 373 N 373
           +
Sbjct: 378 D 378


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)

Query: 35  YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           YL+ L IGTP   I   +   DTGSD +WTQCEPC     F   PP  DP KS T+  +S
Sbjct: 101 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 159

Query: 92  CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
           C    C    AVV        C +   YG G      SG L ++   F +     G  +E
Sbjct: 160 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 216

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
             +V FGC H    S       TGI+ LG G  S ++Q+G     +FSYC+P        
Sbjct: 217 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 271

Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
                  ++ +S + FG      G  +        Y + L+++        NQ+      
Sbjct: 272 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 331

Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
             G        + VD+G     LP      L+  +   I       +   P    + CY 
Sbjct: 332 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 387

Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
            N++       VT+ F  GAD++L  ++LF    N++++ +C A   GN  I+ G   Q 
Sbjct: 388 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 445

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  +GYD+    ++F   +C
Sbjct: 446 NINVGYDLSTMEIAFDRDQC 465


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 107/397 (26%), Positives = 175/397 (44%), Gaps = 51/397 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  ++ +++     KS + I    +II     Y++    GTPP  +  ++DT SD  W
Sbjct: 64  AKDQARMQYFSSLVARKSVVPIASARQIIQ-SPTYIVKAKFGTPPQTLLLALDTSSDAAW 122

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C  + C   +P  F P KS+++ ++SC S  C  V +  C    C+++F YG  +
Sbjct: 123 IPCSGC--VGCSTSKP--FAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSS 178

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    ++  +TLT  +       +P   FGC +K      S + Q G++GLG G  SL
Sbjct: 179 IAA----SVVQDTLTLATD-----PIPGYTFGCVNKTTG---SSAPQQGLLGLGRGPLSL 226

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGG------IVAGAGVVSTPLIIRDH----YYLSL 229
           +SQ        FSYCLP   S  INF G      +     +  TPL+        YY++L
Sbjct: 227 LSQSQNLYKSTFSYCLPSFKS--INFSGSLRLGPVYQPKRIKYTPLLRNPRRSSLYYVNL 284

Query: 230 EAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ-PV 281
            AI VG +        L F  ++      D+G + T L    ++ +++     +  + PV
Sbjct: 285 VAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRLAEPVYTAVRNEFRRRVGPKLPV 344

Query: 282 KGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGGN 340
             +G   GF    CYN+      P +T  F G +V L P N+   + +    C A  G  
Sbjct: 345 TTLG---GFDT--CYNVPI--VVPTITFLFSGMNVTLPPDNIVIHSTAGSTTCLAMAGAP 397

Query: 341 ANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            N+     V   + Q N  + +D+  + +      CT
Sbjct: 398 DNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)

Query: 35  YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           YL+ L IGTP   I   +   DTGSD +WTQCEPC     F   PP  DP KS T+  +S
Sbjct: 104 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 162

Query: 92  CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
           C    C    AVV        C +   YG G      SG L ++   F +     G  +E
Sbjct: 163 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 219

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
             +V FGC H    S       TGI+ LG G  S ++Q+G     +FSYC+P        
Sbjct: 220 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 274

Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
                  ++ +S + FG      G  +        Y + L+++        NQ+      
Sbjct: 275 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 334

Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
             G        + VD+G     LP      L+  +   I       +   P    + CY 
Sbjct: 335 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 390

Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
            N++       VT+ F  GAD++L  ++LF    N++++ +C A   GN  I+ G   Q 
Sbjct: 391 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 448

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  +GYD+    ++F   +C
Sbjct: 449 NINVGYDLSTMEIAFDRDQC 468


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 158/380 (41%), Gaps = 58/380 (15%)

Query: 35  YLMHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           YL+ L IGTP   I   +   DTGSD +WTQCEPC     F   PP  DP KS T+  +S
Sbjct: 122 YLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTFRRLS 180

Query: 92  CSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST---SGLPVE 144
           C    C    AVV        C +   YG G      SG L ++   F +     G  +E
Sbjct: 181 CFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAV---SGELVSDVFHFGAAGDGGGYQLE 237

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-------- 196
             +V FGC H    S       TGI+ LG G  S ++Q+G     +FSYC+P        
Sbjct: 238 R-DVAFGCAHVE-DSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYCIPASEITDDD 292

Query: 197 -------DQGSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVG-----NQRLEFVSS 244
                  ++ +S + FG      G  +        Y + L+++        NQ+      
Sbjct: 293 DDDDDDEERSASFLRFGSHARMTGKRAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPVY 352

Query: 245 STGN-------IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY- 296
             G        + VD+G     LP      L+  +   I       +   P    + CY 
Sbjct: 353 VAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDL-THP---SLYCYL 408

Query: 297 -NISSQPKFPEVTIHF-RGADVKLSPSNLF---RNISDEIMCSAFRGGNANIVYGRIMQI 351
            N++       VT+ F  GAD++L  ++LF    N++++ +C A   GN  I+ G   Q 
Sbjct: 409 GNMTDVEAV-SVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRAIL-GVYPQR 466

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           N  +GYD+    ++F   +C
Sbjct: 467 NINVGYDLSTMEIAFDRDQC 486


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 84/355 (23%), Positives = 151/355 (42%), Gaps = 41/355 (11%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS-QCAVVTSNCSEGDC 109
            +D  +  +W QC PC    C  Q  P+FDP KS T+  +S  ++  C        +G C
Sbjct: 119 EMDMAAGFSWMQCAPC--HPCLPQLNPVFDPAKSPTFRPVSGHNAVLCRPPYHPLQDGRC 176

Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGI 169
            +   Y  GA A   +G LA +T +F +       +P ++FGC ++ +A   +     G+
Sbjct: 177 GFGIAYRNGASA---AGYLARDTFSFPTGDNNFQHLPGIVFGCANR-IARFDTHGALAGV 232

Query: 170 IGLGPGN-----SSLISQMGTSIAGKFSYCLPDQGSSKINF------------GGIVAGA 212
           +G+G G      +  + Q+  +  G+FSYC    G++  +F             G+   +
Sbjct: 233 LGMGMGAEGKPLTGFMRQLYHNGGGRFSYCPIVPGTTAYSFLRFGNDIPSQPPAGVHRQS 292

Query: 213 GVVSTPLIIRDHYYLSLEAISVGNQRLEFVS--------SSTGNIFVDTGVLRTLLPLEY 264
             V  P    + YY+ L  ISVG  R+  V+           G   +D G   T +    
Sbjct: 293 MAVLAPTTTSEAYYVKLAGISVGALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTA 352

Query: 265 HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGAD-VKLSPSNL 323
           ++++++ +   ++    + V   PG    +    + + + P +T+HF G   +++ P +L
Sbjct: 353 YAHVEAAVRGHLQRNRARFV-QSPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHL 411

Query: 324 FRNISD-----EIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ--AMVSFKPSRC 371
           F  +       E +C          V G + QI+    +D+     +VSF P  C
Sbjct: 412 FLVVGSPTGGGEYLCLGLVPDAEMTVIGAMQQIDTRFIFDLHNNIPIVSFNPEDC 466


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 166/368 (45%), Gaps = 37/368 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W  C     CP       +   FD   S T  S+
Sbjct: 99  LYFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSV 158

Query: 91  SCSSSQCAVV----TSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +CS   C+ V     + CSE + C YSF YG G   S +SG   T+T  F++  G  +  
Sbjct: 159 TCSDPICSSVFQTTAAQCSENNQCGYSFRYGDG---SGTSGYYMTDTFYFDAILGESLVA 215

Query: 146 PN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQG 199
            +   ++FGC        T SD    GI G G G  S++SQ+ +       FS+CL   G
Sbjct: 216 NSSAPIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDG 275

Query: 200 SSKINFG-GIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
           S    F  G +   G+V +PL+  + HY L+L +I V  Q L      F +S+T    VD
Sbjct: 276 SGGGVFVLGEILVPGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVD 335

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIH 310
           TG   T L  E +    + +SN + +Q V  + +    +   CY +S+     FP V+++
Sbjct: 336 TGTTLTYLVKEAYDPFLNAISNSV-SQLVTLIIS----NGEQCYLVSTSISDMFPPVSLN 390

Query: 311 FR-GADVKLSPSNLFRNI----SDEIMCSAF-RGGNANIVYGRIMQINFLIGYDIEQAMV 364
           F  GA + L P +   +        + C  F +      + G ++  + +  YD+ +  +
Sbjct: 391 FAGGASMMLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRI 450

Query: 365 SFKPSRCT 372
            +    C+
Sbjct: 451 GWANYDCS 458


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 166/374 (44%), Gaps = 55/374 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M LS+GTPP  +  ++   S  +W  C     ++C      LF P  S+++  + C S  
Sbjct: 1   MDLSLGTPPQPLNFTLAVDSGFSWVACSSSCAINC--TTASLFQPGLSTSHTKLPCGSPS 58

Query: 97  C----AVVTSNCSEGDCSYSFLYGRGAYASFSS-GNLATETLTFNSTSGLPVEMPNVIFG 151
           C    AV TS      CSY+  YG     +FSS G+L ++  T +S     V   N+  G
Sbjct: 59  CSAFSAVSTSCGPSSSCSYNTSYG----TNFSSAGDLVSDIATMDSVRNRKVA-ANLSLG 113

Query: 152 CGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT-SIAGKFSYCLP-DQGSSKINFGGIV 209
           CG ++          +G +G   GN S + Q+       KF YCLP D    K+  G   
Sbjct: 114 CG-RDSGGLLELLDTSGFVGFDKGNVSFMGQLSALGYRSKFIYCLPSDTFRGKLVIGNYK 172

Query: 210 AGAGVVS-----TPLIIR----DHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
                +S     TP+I      + Y+++L  IS+   + +     F+S+ TG   +DT  
Sbjct: 173 LRNASISSSMAYTPMITNPQAAELYFINLSTISIDKNKFQVPIQGFLSNGTGGTVIDTTT 232

Query: 256 LRTLLPLEYHSNL----KSVMSNMIKAQP--VKGVGAEPGFSDVLCYNISSQPKFP---E 306
             + L  ++++ L    K+  +N+++        +G E      LCYNIS+   FP    
Sbjct: 233 FLSYLTSDFYTQLVQAIKNYTTNLVEVSSSVADALGVE------LCYNISANSDFPPPAT 286

Query: 307 VTIHFR-GADVKLSPSNLFRNISDEI---MCSAF-----RGGNANIVYGRIMQINFLIGY 357
           +T HF  GA V++S   L  + SD +   +C A       G N N++ G   Q++  + Y
Sbjct: 287 LTYHFLGGAGVEVSTWFLLDD-SDSVNNTICMAIGRSESVGPNLNVI-GTYQQLDLTVEY 344

Query: 358 DIEQAMVSFKPSRC 371
           D+EQ    F    C
Sbjct: 345 DLEQMRYGFGAQGC 358


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 67/160 (41%), Positives = 86/160 (53%), Gaps = 12/160 (7%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + IGTP  DI    DTGSD TWTQCEPC    C+ Q+ P F+P  SS+Y+++SCSS
Sbjct: 134 YIVTIGIGTPKHDISLMFDTGSDLTWTQCEPCLG-SCYSQKEPKFNPSSSSSYHNVSCSS 192

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
             C    S CS  +C Y   YG G   S + G LA E  T  ++  L     ++ FGCG 
Sbjct: 193 PMCGNPES-CSASNCLYGIGYGDG---SVTVGFLAKEKFTLTNSDVL----DDIYFGCGE 244

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
            N       +   GI+GLGPG  S   Q  T+    FSYC
Sbjct: 245 NNKGVFIGSA---GILGLGPGKFSFPLQTTTTYNNIFSYC 281


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 172/418 (41%), Gaps = 99/418 (23%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPL--FDPKKS 84
           V D YL+ LSIGTPP  I   +DTGSD TW  C      C E D ++    +  F P  S
Sbjct: 76  VRDGYLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHS 135

Query: 85  STYNSISCSSSQCAVVTSN------CSEGDCS---------------YSFLYGRGAYASF 123
           S+ +  SC+S  C  V S+      C+   CS               +++ YG G     
Sbjct: 136 SSSHRDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVV-- 193

Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
            +G L  +TL  +  + G+  E+P   FGC   +   P       GI G G G  SL SQ
Sbjct: 194 -TGTLTRDTLRVHGRNLGVTQEIPRFCFGCVASSYREP------IGIAGFGRGALSLPSQ 246

Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
           +G    G FS+C               A    +S+PLII D                   
Sbjct: 247 LGFLRKG-FSHCF---------LAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPM 296

Query: 224 ---HYYLSLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
              +YY+ LEAI+VGN           EF S   G + VD+G   T LP  ++S + SV+
Sbjct: 297 YPNYYYVGLEAITVGNVSATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVL 356

Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-------FPEVTIHF-RGADVKLSPSNLF 324
            ++I       +    GF   LCY +  Q          P +T HF   A + LS  + F
Sbjct: 357 QSIINYPRATDMEMRTGFD--LCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHF 414

Query: 325 RNISDE-----IMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             +S       + C  F+       G A ++ G   Q +  + YD+E+  + F+P  C
Sbjct: 415 YAMSAPSNSTVVKCLLFQSMDDGDYGPAGVL-GSFQQQDVEVVYDMEKERIGFRPMDC 471


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 168/360 (46%), Gaps = 49/360 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  +S GTP V     +DTGSD TW QC+PC    C  Q+ PLFDP  SSTY+++ C+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171

Query: 95  SQCAVVT-----SNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            +C  +      S CS G  C ++  Y  G   + + G    + LT     G  V+  + 
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDG---TSTVGVYGKDKLTL--APGAIVK--DF 224

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFG 206
            FGCGH   + P       G+  L   + SL +Q G      FSYCLP   S    + FG
Sbjct: 225 YFGCGHSKSSLPGLFDGLLGLGRL---SESLGAQYGGGGG--FSYCLPAVNSKPGFLAFG 279

Query: 207 GIVAGAGVVSTPL--IIRDHYY--LSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLP 261
                +G V TP+  +     +  ++L  I+VG ++L+   S+ +G + VD+G + T+L 
Sbjct: 280 AGRNPSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQ 339

Query: 262 LEYHSNLKSVMSNMIKA-QPVKGVGAEPGFSDV-LCYNISSQPK--FPEVTIHFR-GADV 316
              +  L++     +KA + V G        D+  CY+++       P++ + F  GA +
Sbjct: 340 STVYRALRAAFREAMKAYRLVHG--------DLDTCYDLTGYKNVVVPKIALTFSGGATI 391

Query: 317 KLS-PSNLFRNISDEIMCSAF----RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            L  P+ +  N      C AF    + G A ++ G + Q  F + +D   +   F+   C
Sbjct: 392 NLDVPNGILVN-----GCLAFAETGKDGTAGVL-GNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/364 (27%), Positives = 166/364 (45%), Gaps = 47/364 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC 
Sbjct: 81  LYVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCG 136

Query: 94  SSQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           +S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P 
Sbjct: 137 TSMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPG 189

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----- 202
             FGC   +  +        G++G+G G  S++ Q   +    FSYCLP Q S +     
Sbjct: 190 FSFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDC-FSYCLPLQKSERGFFSK 247

Query: 203 ----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIFV 251
                + G +     V  T ++ R    + +++ L AISV  +RL     V S  G +F 
Sbjct: 248 TTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF- 306

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   +K   AE   S+  CY++ S  +   P +++
Sbjct: 307 DSGSELSYIP----DRALSVLSQRIRELLLKRGAAEEE-SERNCYDMRSVDEGDMPAISL 361

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
           HF  GA   L    +F  R++ ++ + C AF    +  + G +MQ +  + YD+++ ++ 
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTESVSIIGSLMQTSKEVVYDLKRQLIG 421

Query: 366 FKPS 369
             PS
Sbjct: 422 IGPS 425


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 110/401 (27%), Positives = 173/401 (43%), Gaps = 53/401 (13%)

Query: 4   SQKLPFYNDNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDC 58
           + ++PF           S +  A +   DD+     Y   L IGTPP +    VDTGS  
Sbjct: 41  AHRMPFDGHYSRRHLQNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTV 100

Query: 59  TWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSE--GDCSYSFLYG 116
           T+  C  C +  C K + P F P  SSTY  + C+ S       NC +    C+Y   Y 
Sbjct: 101 TYVPCSSCEQ--CGKHQDPRFQPDLSSTYRPVKCNPS------CNCDDEGKQCTYERRY- 151

Query: 117 RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGN 176
             A  S SSG +A + ++F + S L  +    +FGC +       S  +  GI+GLG G 
Sbjct: 152 --AEMSSSSGVIAEDVVSFGNESELKPQ--RAVFGCENVETGDLYS-QRADGIMGLGRGR 206

Query: 177 SSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH--------YY 226
            S++ Q+     I   FS C    G   +  G +V G  +   P ++  H        Y 
Sbjct: 207 LSVVDQLVDKGVIGDSFSLCY---GGMDVGGGAMVLGQ-ISPPPNMVFSHSNPYRSPYYN 262

Query: 227 LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVK 282
           + L+ + V  + L+    V        +D+G      P   +H+   ++M  +   + + 
Sbjct: 263 IELKELHVAGKPLKLKPKVFDEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIP 322

Query: 283 GVGAEPGFSDVLCYN-----ISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIM 332
             G +P + D+ C++     +S   K FPEV + F  G  + LSP N LFR+  +S    
Sbjct: 323 --GPDPNYHDI-CFSGAGREVSHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYC 379

Query: 333 CSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
              F+ GN    + G I+  N L+ YD E   + F  + C+
Sbjct: 380 LGIFQNGNDLTTLLGGIVVRNTLVTYDRENDKIGFWKTNCS 420


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/381 (25%), Positives = 163/381 (42%), Gaps = 42/381 (11%)

Query: 12  DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
           D +T   PI+   Q   I+    Y++ + +GTP   +F  +DT +D  W  C  C     
Sbjct: 78  DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGFSS 134

Query: 72  FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
                  F P  S+T  S+ CS +QC+ V         S + L+ +    S+   +  T 
Sbjct: 135 TT-----FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLTA 185

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
           TL  ++ +     +P   FGC   N  S  S   Q G++GLG G  SLISQ G   +G F
Sbjct: 186 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 242

Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
           SYCLP       S  +  G +     + +TPL+   H    YY++L  +SVG       +
Sbjct: 243 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 302

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           ++L F  ++     +D+G + T      +  ++      +   P+  +GA   F    C+
Sbjct: 303 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 356

Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQ 350
             +++ + P +T+HF G ++ L   N L  + S  + C +      N+     V   + Q
Sbjct: 357 AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            N  I +D   + +      C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 94/374 (25%), Positives = 152/374 (40%), Gaps = 45/374 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPLFDPKKSSTYNSI 90
           Y +   +GTP        DTGSD TW +C       P+       P +F P  S ++  I
Sbjct: 110 YFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLA-SPRVFRPANSKSWAPI 168

Query: 91  SCSSSQCAVVT----SNCSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
            CSS  C        +NCS G      C Y + Y   + A    G  A       S S  
Sbjct: 169 PCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDR 228

Query: 142 PVEMPNVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
             ++  V+ GC   +   +  +SD    G++ LG  N S  S+      G+FSYCL    
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSD----GVLSLGNSNISFASRAAARFGGRFSYCLVDHL 284

Query: 196 -PDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVS-----SS 245
            P   +S + FG + A      TPL++       Y ++++A+SV  + L   +       
Sbjct: 285 APRNATSYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKK 344

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ---P 302
            G   +D+G   T+L    +  + + +S  +   P   V  +P F    CYN ++    P
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQLARVP--RVTMDP-FE--YCYNWTATRRPP 399

Query: 303 KFPEVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYDI 359
             P + + F G+     P+  +  + +  + C   + G      V G I+Q   L  +D+
Sbjct: 400 AVPRLEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDL 459

Query: 360 EQAMVSFKPSRCTN 373
               + F+ SRC +
Sbjct: 460 ANRWLRFQESRCAH 473


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 80/136 (58%), Gaps = 11/136 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           QA + + +  +LM L+IG P +     +DTGSD TWTQC PC   DC+KQ  P++DP  S
Sbjct: 11  QAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPCS--DCYKQPTPIYDPSLS 68

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           STY ++SC SS C A+  S C    C Y  LY  G Y+S + G L+ ET T +S S    
Sbjct: 69  STYGTVSCKSSLCLALPASACISATCEY--LYTYGDYSS-TQGILSYETFTLSSQS---- 121

Query: 144 EMPNVIFGCGHKNLAS 159
            +P++ FGCG  N  S
Sbjct: 122 -IPHIAFGCGQDNEGS 136


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 161/394 (40%), Gaps = 65/394 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---------LFDPKKSS 85
           Y +   +GTP        DTGSD TW +C P                      F P+KS 
Sbjct: 95  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSK 154

Query: 86  TYNSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF---- 135
           T+  I C+S  C      ++ T       C+Y + Y  G+ A    G + TE+ T     
Sbjct: 155 TWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAA---RGTVGTESATIALSS 211

Query: 136 ----NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
               +       ++  ++ GC   +   P+ ++   G++ LG  N S  S   +   G+F
Sbjct: 212 SSSSSKNKVKKAKLQGLVLGC-TGSYTGPSFEASD-GVLSLGYSNVSFASHAASRFGGRF 269

Query: 192 SYCL-----PDQGSSKINFG---------GIVAGAGVVSTPLII----RDHYYLSLEAIS 233
           SYCL     P   +S + FG            AG G   TPL++    R  Y +S++AIS
Sbjct: 270 SYCLVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAIS 329

Query: 234 VGNQRLE-----FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP 288
           V  + L+     +     G + VD+G   T+L    +  + + +   +   P   V  +P
Sbjct: 330 VDGELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPR--VAMDP 387

Query: 289 GFSDVLCYNISSQPK------FPEVTIHFRGADVKLSPSNLFR-NISDEIMCSAFRGGNA 341
            F    CYN +S  +       P++ +HF G+     PS  +  + +  + C   + G  
Sbjct: 388 -FE--YCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPW 444

Query: 342 N--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
               V G I+Q   L  +D++   + FK SRCT+
Sbjct: 445 PGISVIGNILQQEHLWEFDLKNRRLRFKRSRCTH 478


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 110/385 (28%), Positives = 174/385 (45%), Gaps = 46/385 (11%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           +E+ + P + +   + + ++  Y   L IGTPP      VDTGS  T+  C  C +  C 
Sbjct: 62  SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CG 119

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLAT 130
           + + P F P+ SSTY  + C+      +  NC      C Y   Y   A  S SSG L  
Sbjct: 120 RHQDPKFQPESSSTYQPVKCT------IDCNCDSDRMQCVYERQY---AEMSTSSGVLGE 170

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTSI 187
           + ++F + S L  +    +FGC  +N+ +    S+   GI+GLG G+ S++ Q+     I
Sbjct: 171 DLISFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVI 226

Query: 188 AGKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF 241
           +  FS C    D G   +  GGI   + +    S P  +R  YY + L+ I V  +RL  
Sbjct: 227 SDSFSLCYGGMDVGGGAMVLGGISPPSDMAFAYSDP--VRSPYYNIDLKEIHVAGKRLPL 284

Query: 242 ---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY- 296
              V        +D+G     LP       K  +  + + Q +K + G +P ++D+ C+ 
Sbjct: 285 NANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI--VKELQSLKKISGPDPNYNDI-CFS 341

Query: 297 ----NISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYG 346
               ++S   K FP V + F  G    LSP N +FR+  +        F+ GN    + G
Sbjct: 342 GAGIDVSQLSKSFPVVDMVFENGQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLG 401

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRC 371
            I+  N L+ YD EQ  + F  + C
Sbjct: 402 GIIVRNTLVVYDREQTKIGFWKTNC 426


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 161/370 (43%), Gaps = 41/370 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W     C  CP       +   FD   SST   +
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQV 124

Query: 91  SCSSSQC--AVVT--SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
            CS   C  AV T  + CS     CSY+F YG G   S +SG   ++TL F++  G  + 
Sbjct: 125 RCSDPICTSAVQTTATQCSSQTDQCSYTFQYGDG---SGTSGYYVSDTLYFDAILGQSLI 181

Query: 144 --EMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
                 ++FGC        T +D    GI G G G  S+ISQ+ T       FS+CL   
Sbjct: 182 DNSSALIVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGD 241

Query: 199 GS-SKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL-----EFVSSSTGNIFV 251
           GS   I   G +   G+V +PL+  + HY L+L +I+V  Q L      F +S++    V
Sbjct: 242 GSGGGILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIV 301

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
           D+G     L  E +    S ++ ++     P+   G +       CY +S+     FP  
Sbjct: 302 DSGTTLAYLVAEAYDPFVSAVNAIVSPSVTPITSKGNQ-------CYLVSTSVSQMFPLA 354

Query: 308 TIHFR-GADVKLSPSNLF----RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA 362
           + +F  GA + L P +       +    + C  F+      + G ++  + +  YD+ + 
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDKIFVYDLVRQ 414

Query: 363 MVSFKPSRCT 372
            + +    C+
Sbjct: 415 RIGWANYDCS 424


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 106/395 (26%), Positives = 168/395 (42%), Gaps = 48/395 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A+++ +L F +     KS + I    +II     Y++   IGTPP  +  ++DT +D  W
Sbjct: 45  AKDTTRLQFLDSLVARKSVVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAMDTSNDAAW 103

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C    C      LF P+KS+T+ ++SC++ +C  V +  C    C+++  YG  +
Sbjct: 104 IPCTACD--GCAST---LFAPEKSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSS 158

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    NL  +T+T  +       +P+  FGC  K     TS   Q  +     G  SL
Sbjct: 159 IAA----NLVQDTITLATD-----PVPSYTFGCVSKTTG--TSAPPQGLLGLGR-GPLSL 206

Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
           +SQ        FSYCLP       S  +  G +     +  TPL+        YY++LEA
Sbjct: 207 LSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEA 266

Query: 232 ISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           I VG + ++          ++  G IF D+G + T L    +  ++       +  P   
Sbjct: 267 IRVGRKVVDIPPAALAFNPTTGAGTIF-DSGTVFTRLVAPVYVAVRDEFRR--RVGPKLT 323

Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN 342
           V +  GF    CYN+      P +T  F G +V L   N L  + +    C A  G   N
Sbjct: 324 VTSLGGFDT--CYNVPI--VVPTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDN 379

Query: 343 I-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +     V   + Q N  + YD+  + V      CT
Sbjct: 380 VNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  + F  +DTGSD  W  C P   CP       +   F+P  SST + I
Sbjct: 88  LYFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRI 147

Query: 91  SCSSSQCAVVTSN----CSEGD-----CSYSFLYGRGAYASFSSGNLATETLTFNSTSG- 140
            CS  +C          C   D     C Y+F YG G   S +SG   ++T+ F++  G 
Sbjct: 148 PCSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDG---SGTSGFYVSDTMYFDTVMGN 204

Query: 141 --LPVEMPNVIFGCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQ---MGTSIAGKFSYC 194
                   +V+FGC +        +D    GI G G    S++SQ   +G S    FS+C
Sbjct: 205 EQTANSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVS-PKTFSHC 263

Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSST 246
           L   D G   +  G IV   G+V TPL+  + HY L+LE+I+V  Q+L      F +S+T
Sbjct: 264 LKGSDNGGGILVLGEIVE-PGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNT 322

Query: 247 GNIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPK 303
               VD+G  L  L+   Y   + ++      A  V           + C+  +S     
Sbjct: 323 QGTIVDSGTTLVYLVDGAYDPFINAI------AAAVSPSVRSVVSKGIQCFVTTSSVDSS 376

Query: 304 FPEVTIHFRGA-DVKLSPSNLFRNI----SDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
           FP  T++F+G   + + P N         ++ + C  ++      + G ++  + +  YD
Sbjct: 377 FPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYD 436

Query: 359 IEQAMVSFKPSRCT 372
           +    + +    C+
Sbjct: 437 LANMRMGWADYDCS 450


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 110/363 (30%), Positives = 160/363 (44%), Gaps = 44/363 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP      VDTGS  T+  C  C    C   + P F P+ S TY  + C +
Sbjct: 93  YTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCKH--CGSHQDPKFRPEASETYQPVKC-T 149

Query: 95  SQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            QC     NC +    C+Y   Y   A  S SSG L  + ++F + S L  +    IFGC
Sbjct: 150 WQC-----NCDDDRKQCTYERRY---AEMSTSSGVLGEDVVSFGNQSELSPQ--RAIFGC 199

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
            +        + +  GI+GLG G+ S++ Q+     I+  FS C      G   +  GGI
Sbjct: 200 ENDETGD-IYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYGGMGVGGGAMVLGGI 258

Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP 261
              A +V   S P  +R  YY + L+ I V  +RL     V        +D+G     LP
Sbjct: 259 SPPADMVFTHSDP--VRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316

Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RG 313
                  K  +  M +   +K + G +P ++D+ C+     N+S   K FP V + F  G
Sbjct: 317 ESAFLAFKHAI--MKETHSLKRISGPDPHYNDI-CFSGAEINVSQLSKSFPVVEMVFGNG 373

Query: 314 ADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
             + LSP N LFR+  +        F  GN    + G I+  N L+ YD E + + F  +
Sbjct: 374 HKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKIGFWKT 433

Query: 370 RCT 372
            C+
Sbjct: 434 NCS 436


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 92/199 (46%), Gaps = 26/199 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M L +GTP  +++  +DTGSD  W QC PC    C+ Q   +FDPKKS T+ ++ C S
Sbjct: 135 YFMRLGVGTPATNVYMVLDTGSDVVWLQCSPCKA--CYNQTDAIFDPKKSKTFATVPCGS 192

Query: 95  SQCAVV--TSNC---SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
             C  +  +S C       C Y   YG G   SF+ G+ +TETLTF+        + +V 
Sbjct: 193 RLCRRLDDSSECVTRRSKTCLYQVSYGDG---SFTEGDFSTETLTFHG-----ARVDHVP 244

Query: 150 FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSS 201
            GCGH N       +   G+        S  SQ      GKFSYCL D+          S
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRG---GLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301

Query: 202 KINFGGIVAGAGVVSTPLI 220
            I FG        V TPL+
Sbjct: 302 TIVFGNAAVPKTSVFTPLL 320


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 99/381 (25%), Positives = 164/381 (43%), Gaps = 42/381 (11%)

Query: 12  DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
           D +T   PI+   Q   I+    Y++ + +GTP   +F  +DT +D  W  C  C    C
Sbjct: 78  DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 132

Query: 72  FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
                  F P  S+T  S+ CS +QC+ V         S + L+ +    S+   +  T 
Sbjct: 133 SSTT---FLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLTA 185

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
           TL  ++ +     +P   FGC   N  S  S   Q G++GLG G  SLISQ G   +G F
Sbjct: 186 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 242

Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
           SYCLP       S  +  G +     + +TPL+   H    YY++L  +SVG       +
Sbjct: 243 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 302

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           ++L F  ++     +D+G + T      +  ++      +   P+  +GA   F    C+
Sbjct: 303 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 356

Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQ 350
             +++ + P +T+HF G ++ L   N L  + S  + C +      N+     V   + Q
Sbjct: 357 AATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQ 416

Query: 351 INFLIGYDIEQAMVSFKPSRC 371
            N  I +D   + +      C
Sbjct: 417 QNLRIMFDTTNSRLGIARELC 437


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 162/367 (44%), Gaps = 62/367 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           +Y+    IGTPP  + G++D  SD  WT C               F+P +S+T   + C+
Sbjct: 99  MYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADVPCT 148

Query: 94  SSQC-AVVTSNCSEG------DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
              C       C  G      +C+Y+++YG G  A+ ++G L TE  TF  T      + 
Sbjct: 149 DDACQQFAPQTCGAGAGAGSSECAYTYMYGGG--AANTTGLLGTEAFTFGDT-----RID 201

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQG---SSK 202
            V+FGCG +N+      S  +G+IGLG GN SL+SQ+      +FSY   PD      S 
Sbjct: 202 GVVFGCGLQNVG---DFSGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSF 255

Query: 203 INFG--GIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGN 248
           I FG       +  +ST L+  D     YY+ L  I V  + L   S +        +G 
Sbjct: 256 ILFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGG 315

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG--VGAEPGFSDVLCYNISS--QPKF 304
           +F+    L T+L    +  L+  +++ I    V G  +G +      LCY   S  + K 
Sbjct: 316 VFLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLD------LCYTGESLAKAKV 369

Query: 305 PEVTIHFRGADV-KLSPSNLF-RNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIE 360
           P + + F G  V +L   N F  + +  + C      +A    V G ++Q+   + YDI 
Sbjct: 370 PSMALVFAGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDIN 429

Query: 361 QAMVSFK 367
            + + F+
Sbjct: 430 GSKLVFE 436


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 86/302 (28%), Positives = 134/302 (44%), Gaps = 35/302 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP    +  VDTGSD  W    QC+ CP       E  L++  +S +   +
Sbjct: 79  LYYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLV 138

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  ++    S C     C Y  +YG G   S ++G    + + ++S +G L  +
Sbjct: 139 SCDDDFCYQISGGPLSGCKANMSCPYLEIYGDG---SSTAGYFVKDVVQYDSVAGDLKTQ 195

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  VIFGCG +      S +++   GI+G G  NSS+ISQ+ +S  +   F++CL  +
Sbjct: 196 TANGSVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGR 255

Query: 199 GSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVD 252
               I   G V    V  TPL+  + HY +++ A+ VG + L      F         +D
Sbjct: 256 NGGGIFAIGRVVQPKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIID 315

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIH 310
           +G     LP       + +   ++K +P   V       D  C+  S +    FP VT H
Sbjct: 316 SGTTLAYLP-------EIIYEPLVKKEPALKVHIVD--KDYKCFQYSGRVDEGFPNVTFH 366

Query: 311 FR 312
           F 
Sbjct: 367 FE 368


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 163/366 (44%), Gaps = 54/366 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----PLFDPKKSSTYN 88
           +Y++  S+GTPP  + G +D  SD  W QC  C    C    P     P F    SST  
Sbjct: 96  MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACAT--CGADAPAATSAPPFYAFLSSTIR 153

Query: 89  SISCSSSQCA-VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            + C++  C  +V   CS  D  C YS++YG GA A+ ++G LA +   F +     V  
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGA-ANTTAGLLAVDAFAFAT-----VRA 207

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN 204
             VIFGC      +  ++    G+IGLG G  SL+SQ+     G+FSY L PD      +
Sbjct: 208 DGVIFGC------AVATEGDIGGVIGLGRGELSLVSQLQI---GRFSYYLAPDDAVDVGS 258

Query: 205 FGGIVAGAG-----VVSTPLII----RDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
           F   +  A       VSTPL+     R  YY+ L  I V  + L       +  +  +G 
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
           + +   +  T L    +  ++  M++ I  +     G+E G    LCY   S    K P 
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIGLRAAD--GSELGLD--LCYTSESLATAKVPS 374

Query: 307 VTIHFRGADV-KLSPSNLF-RNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQ 361
           + + F G  V +L   N F  + +  + C        G+ +++ G ++Q+   + YDI  
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLL-GSLIQVGTHMIYDISG 433

Query: 362 AMVSFK 367
           + + F+
Sbjct: 434 SRLVFE 439


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/136 (44%), Positives = 80/136 (58%), Gaps = 11/136 (8%)

Query: 25  QAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKS 84
           QA + + +  +LM L+IG P +     +DTGSD TWTQC PC   DC+KQ  P++DP  S
Sbjct: 11  QAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPCS--DCYKQPTPIYDPSLS 68

Query: 85  STYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           STY ++SC SS C A+  S C    C Y  LY  G Y+S + G L+ ET T +S S    
Sbjct: 69  STYGTVSCKSSLCLALPASACISATCEY--LYTYGDYSS-TQGILSYETFTLSSQS---- 121

Query: 144 EMPNVIFGCGHKNLAS 159
            +P++ FGCG  N  S
Sbjct: 122 -IPHIAFGCGQDNEGS 136


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 169/367 (46%), Gaps = 59/367 (16%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           + L IGTP +++    DT SD  WTQC+PC  L C  Q   ++DP K+ TY +++ S   
Sbjct: 90  VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSS--- 144

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
                        SY++ Y +    SF+SG  ATET    +     V + N+ FGCG +N
Sbjct: 145 -------------SYNYTYSK---QSFTSGYFATETFALGN-----VTVANITFGCGTRN 183

Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGG------ 207
                + +   G+   G G  SL++Q+G     +FSYC       GSS +  GG      
Sbjct: 184 QGYYDNVAGVFGVGRGGRGGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELAT 240

Query: 208 -----IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGN-----IFVDTGVLR 257
                  A   +V+ P +++  Y++ L  ++VG   ++   +S+       + +D+    
Sbjct: 241 NATTTPAASTPMVADP-VLKSGYFVKLVGVTVGATLVDVAGASSAEGGGRALVIDSTSPV 299

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEV--TIHFR 312
           T+L    +  ++  +   +         A  G    LC+ ++   + P  P V  T+HF 
Sbjct: 300 TVLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFD 359

Query: 313 G--ADVKLSPSN-LFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFK 367
           G  AD+ L P++ L ++ +  ++C      ++N   V G    ++ L+ YD+ + +VSF+
Sbjct: 360 GGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQ 419

Query: 368 PSRCTNY 374
           P  C  +
Sbjct: 420 PLDCAAF 426


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 106/411 (25%), Positives = 165/411 (40%), Gaps = 104/411 (25%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPL---FDPKKSSTYN 88
           Y + L+ GTPP ++    DTGS   W  C     C        +P     F PK SS+  
Sbjct: 132 YSVSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVK 191

Query: 89  SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            + C + +CA +               +  CS+    Y   YG GA A    G L +ETL
Sbjct: 192 VVGCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGSGATA----GILLSETL 247

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
              +       +P+ + GC   ++  P       GI G G G  SL SQM      +FS+
Sbjct: 248 DLENK-----RVPDFLVGCSVMSVHQPA------GIAGFGRGPESLPSQMRLK---RFSH 293

Query: 194 CLPDQGSSKINFGGIVAGAGVVSTPLII------------------------------RD 223
           CL  +G               VS+PL++                              R+
Sbjct: 294 CLVSRGFDD----------SPVSSPLVLDSGSESDESKTKSFIYAPFRENPSVSNAAFRE 343

Query: 224 HYYLSLEAISVGNQRLEF-----VSSSTGN--IFVDTGVLRTLL--PLEYHSNLKSVMSN 274
           +YYLSL  I +G + ++F     V  STGN    +D+G   T L  P+ + +    +   
Sbjct: 344 YYYLSLRRILIGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPI-FEAIADELEKQ 402

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNISSQ---PKFPEVTIHFRGA-DVKLSPSNLFRNISDE 330
           ++K    K V A+ G     C+NI  +    +FP+V + F+G   + L+  N    ++DE
Sbjct: 403 LVKYPRAKDVEAQSGLRP--CFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDE 460

Query: 331 -IMCSAFRGGNAN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            ++C       A         I+ G   Q N L+ YD+ +  + F+  +CT
Sbjct: 461 GVVCLTMMTDEAVVGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 99.0 bits (245), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 157/363 (43%), Gaps = 52/363 (14%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
           + ++ D ++L+++  GTP       +DTGSD TW QC  C   +C  ++   F+P  SS+
Sbjct: 121 DTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWIQCNSCSLGNCHNKK--TFNPSLSSS 178

Query: 87  YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           Y++ SC  S            D +Y+  Y   +Y   S G    + +T       P   P
Sbjct: 179 YSNRSCIPST-----------DTNYTMKYEDNSY---SKGVFVCDEVTLK-----PDVFP 219

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSKINF 205
              FGCG        + S   G++GL  G   SLISQ  +    KFSYC P +   +   
Sbjct: 220 KFQFGCGDSGGGEFGTAS---GVLGLAKGEQYSLISQTASKFKKKFSYCFPPK---EHTL 273

Query: 206 GGIVAGAGVVSTPLIIRDH----------YYLSLEAISVGNQRLEFVSS---STGNIFVD 252
           G ++ G   +S    ++            Y++ L  ISV  +RL   SS   S G I +D
Sbjct: 274 GSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLFASPGTI-ID 332

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVT 308
           +G + T LP   +  L++     +   P      +    D  CYN+        K PE+ 
Sbjct: 333 SGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDT-CYNLKGCGGRNIKLPEIV 391

Query: 309 IHFRG-ADVKLSPSN-LFRNISDEIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAM 363
           +HF G  DV L PS  L+ N      C AF R  N +   + G   Q++  + YDIE   
Sbjct: 392 LHFVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGR 451

Query: 364 VSF 366
           + F
Sbjct: 452 LGF 454


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 99/350 (28%), Positives = 152/350 (43%), Gaps = 49/350 (14%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSE- 106
           ++DT  D  W QC PC    C+ Q    FDP++SST   + C S  C  +    + CS+ 
Sbjct: 162 AIDTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKP 221

Query: 107 ---GDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMPNVIFGCGHKNLASPTS 162
              GDC Y   Y   +    + G   T+TLT + ST+ L     N  FGC H        
Sbjct: 222 NSTGDCLYRIEY---SDHRLTLGTYMTDTLTISPSTTFL-----NFRFGCSHAVRGK--F 271

Query: 163 DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK-INFGGIV-----AGAGVVS 216
            ++ +G + LG G  SL+SQ   +    FSYC+P   ++  ++ GG V      G+G  +
Sbjct: 272 SAQASGTMSLGGGPQSLLSQTARAYGNAFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFA 331

Query: 217 TPLIIRDH-------YYLSLEAISVGNQRLEF--VSSSTGNIFVDTGVLRTLLPLEYHSN 267
           T  ++R         Y + L+ I V  +RL    V  S G +   + V+  L P  Y + 
Sbjct: 332 TTPLVRSANVINPTIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRA- 390

Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFR-GADVKLSPSNLF 324
           L+    N ++A   +   A  G  D  C++    S+   P V++ F  GA ++L    L 
Sbjct: 391 LRLAFRNAMRAYKTR---APTGNLDT-CFDFVGVSKVTVPTVSLVFDGGAVIEL---GLL 443

Query: 325 RNISDEIMCSAFRGGNANIVY---GRIMQINFLIGYDIEQAMVSFKPSRC 371
             + D   C AF    A+      G + Q    + YD+    V F+   C
Sbjct: 444 SVLLDS--CLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 167/364 (45%), Gaps = 47/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTPP  +F  +DT +D  W  C  C            F+   SSTY+++SCS+
Sbjct: 105 YVVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 161

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           +QC         +S      CS++  YG    +SFS+ NL  +TLT +     P  +PN 
Sbjct: 162 TQCTQARGLTCPSSTPQPSICSFNQSYG--GDSSFSA-NLVQDTLTLS-----PDVIPNF 213

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
            FGC   N AS  S   Q G++GLG G  SL+SQ  +  +G FSYCLP       S  + 
Sbjct: 214 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 270

Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
            G +     +  TPL+        YY++L  +SVG+ +       L F S+S     +D+
Sbjct: 271 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDS 330

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
           G + T     +   +   + +  + Q V G  +  G  D  C++  ++   P++T+H   
Sbjct: 331 GTVIT----RFAQPVYEAIRDEFRKQ-VNGSFSTLGAFDT-CFSADNENVTPKITLHMTS 384

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
            D+KL   N L  + +  + C +  G   NAN V      + Q N  I +D+  + +   
Sbjct: 385 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 444

Query: 368 PSRC 371
           P  C
Sbjct: 445 PEPC 448


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 79/249 (31%), Positives = 117/249 (46%), Gaps = 32/249 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-PL--FDPKKSSTYNSI 90
           +Y   +S+GTPP   +  VDTGS+  W +C PC   +     P P+  FDP+KS+T  SI
Sbjct: 40  LYYTRISLGTPPQQFYVDVDTGSNVAWVKCAPCTGCEHSGDVPVPMSTFDPRKSTTKISI 99

Query: 91  SCSSSQCAVVTS--NCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNST----SGLP 142
           SC+ ++C V+     CS     C YS LYG G   S ++G    +  TFN      S   
Sbjct: 100 SCTDAECGVLNKKLQCSPERLSCPYSLLYGDG---SSTAGYYLNDVFTFNQVPSDNSTAK 156

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGS 200
                ++FGCG     S + D    G++G GP   SL +Q+         F++CL    S
Sbjct: 157 SGTARLVFGCGGTQTGSWSVD----GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVS 212

Query: 201 SKINFGGIVAGA----GVVSTPLII-RDHYYLSLEAISVGNQRL----EFVSSSTGNIFV 251
            +   G +V G      +V TP++   DHY + L  I +  + +     F    TG + +
Sbjct: 213 GR---GSLVIGTIREPDLVYTPMVFGEDHYNVQLLNIGISGRNVTTPASFDLEYTGGVII 269

Query: 252 DTGVLRTLL 260
           D+G   T L
Sbjct: 270 DSGTTLTYL 278


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 98.6 bits (244), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 111/420 (26%), Positives = 168/420 (40%), Gaps = 85/420 (20%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW------TQCEPCP 67
           ++PK+  S+I           Y + L+ GTPP      +DTGS   W        C  C 
Sbjct: 62  KSPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECN 121

Query: 68  ELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-----TSNCSEGDCS----------YS 112
             +  K   P F PK SS+   I C + +C+++      S C E D +          Y 
Sbjct: 122 FPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYV 181

Query: 113 FLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGL 172
             YG G+ A    G L +ETL F +       +P+ + GC   ++  P       GI G 
Sbjct: 182 IQYGSGSTA----GLLLSETLDFPNKK----TIPDFLVGCSIFSIKQPE------GIAGF 227

Query: 173 GPGNSSLISQMGTSIAGKFSYCL--------PDQGSSKINFG---GIVAGAGVVSTPLI- 220
           G    SL SQ+G     KFSYCL        P      ++ G   G+   AG+  TP + 
Sbjct: 228 GRSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLK 284

Query: 221 -----IRDHYYLSLEAISVGNQRLE-----FVSSSTGN--IFVDTGVLRTLLPLEYH--- 265
                 RD+YY+ L  I +G+  ++      V  + GN    VD+G   T +    +   
Sbjct: 285 NPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELV 344

Query: 266 -SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQPKF--PEVTIHFRG-ADVKLSP 320
               +  M++   A  ++ + G  P      CYNIS +     P++   F+G A + L  
Sbjct: 345 AKEFEKQMAHYTVATEIQNLTGLRP------CYNISGEKSLSVPDLIFQFKGGAKMALPL 398

Query: 321 SNLFRNISDEIMCSAFRGGNAN---------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           SN F  +   ++C      N           I+ G   Q NF + +D+E     FK   C
Sbjct: 399 SNYFSIVDSGVICLTIVSDNVAGPGLGGGPAIILGNYQQRNFYVEFDLENEKFGFKQQSC 458


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 167/407 (41%), Gaps = 99/407 (24%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
           Y   LS GTP   +    DTGS   W  C     C E    K +P   P F PK SS+  
Sbjct: 81  YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140

Query: 89  SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            + C + +C+ +               T NC++   +Y   YG G+ A    G L +ETL
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA----GLLLSETL 196

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
            F        ++PN + GC   ++  P+      GI G G G+ SL SQMG     KF+Y
Sbjct: 197 DFPDK-----KIPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGLK---KFAY 242

Query: 194 CL--------PDQGSSKINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAISVGN 236
           CL        P  G   ++  G V  +G+  TP            +++YYL++  I VGN
Sbjct: 243 CLASRKFDDSPHSGQLILDSTG-VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGN 301

Query: 237 QRLE----------------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           Q ++                 + S +   F+D  VL  +         +  ++N  +A  
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA-----REFEKQLANWTRATD 356

Query: 281 VKGV-GAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDE-IMC-- 333
           V+ + G  P      C++IS +   KFPE+   F+G A   L  +N F  +S   + C  
Sbjct: 357 VETLTGLRP------CFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 410

Query: 334 --------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
                       GG  +++ G   Q NF + YD+    + F+   C+
Sbjct: 411 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 167/373 (44%), Gaps = 45/373 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+P  D +  +DTGSD  W     C  CP       E   FD   SST   +
Sbjct: 82  LYFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141

Query: 91  SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST----SG 140
           SC+   C+      TS CS     CSY+F YG G   S ++G   ++T+ F++     S 
Sbjct: 142 SCADPICSYAVQTATSGCSSQANQCSYTFQYGDG---SGTTGYYVSDTMYFDTVLLGQSM 198

Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
           +      ++FGC        T +D    GI G GPG  S+ISQ+ +       FS+CL  
Sbjct: 199 VANSSSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            + G   +  G I+  + +V +PL+    HY L+L++I+V  Q L      F +++    
Sbjct: 259 GENGGGVLVLGEILEPS-IVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGT 317

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
            VD+G     L  E ++     ++  +   ++P+   G +       CY +S+     FP
Sbjct: 318 IVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ-------CYLVSNSVGDIFP 370

Query: 306 EVTIHFR-GADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
           +V+++F  GA + L+P +   +     S  + C  F+       + G ++  + +  YD+
Sbjct: 371 QVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDL 430

Query: 360 EQAMVSFKPSRCT 372
               + +    C+
Sbjct: 431 ANQRIGWADYNCS 443


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 102/374 (27%), Positives = 162/374 (43%), Gaps = 46/374 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +    +DTGSD  W  C     CP+      E   FD   SST   +
Sbjct: 83  LYTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALV 142

Query: 91  SCSSSQCAV----VTSNCSE--GDCSYSFLY--GRGAYASFSSGNLATETLTFNSTSGLP 142
            CS   CA       + CS     CSY+F Y  G G    + S  +  + +   ST    
Sbjct: 143 PCSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANV 202

Query: 143 VEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQG 199
                ++FGC        T +D    GI+G GPG  S++SQ+ +  I  K FS+CL   G
Sbjct: 203 ASSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDG 262

Query: 200 SSKINFGGI-----VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGN 248
               N GGI     +    +V +PL+  + HY L+L++I+V  Q L      F +S    
Sbjct: 263 ----NGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKRG 318

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIK--AQPVKGVGAEPGFSDVLCYNI--SSQPKF 304
             +D+G   + L  E +  L + +   +   A      G++       CY +  S    F
Sbjct: 319 TIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ-------CYLVLTSIDDSF 371

Query: 305 PEVTIHFR-GADVKLSPSN--LFRNISD--EIMCSAFRGGNANI-VYGRIMQINFLIGYD 358
           P V+ +F  GA + L PS   L R   D  ++ C  F+     + + G ++  + ++ YD
Sbjct: 372 PTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYD 431

Query: 359 IEQAMVSFKPSRCT 372
           + +  + +    C+
Sbjct: 432 LARQQIGWTNYDCS 445


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 150/365 (41%), Gaps = 53/365 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTPP  +  +VDT +D  W  C  C    C    P  F+P  S +Y ++ C S
Sbjct: 108 YVVRARLGTPPQQLLLAVDTSNDAAWIPCSGC--AGCPTTTP--FNPAASKSYRAVPCGS 163

Query: 95  SQCAVVTS-NCS--EGDCSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             C+   + +CS     C +S  Y   +  A+ S  +LA       S +          F
Sbjct: 164 PACSRAPNPSCSLNTKSCGFSLTYADSSLEAALSQDSLAVANDVVKSYT----------F 213

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
           GC  K   + T      G+        S +SQ      G FSYCLP   S  +NF G + 
Sbjct: 214 GCLQKATGTATPPQGLLGLGRG---PLSFLSQTKDMYEGTFSYCLPSFKS--LNFSGTLR 268

Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDT 253
                    + +TPL++  H    YY+S+  I VG +        L F  ++     +D+
Sbjct: 269 LGRKGQPLRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDS 328

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
           G + T L    +  ++  +   I+  P+  +G   GF    CYN +   K+P VT  F G
Sbjct: 329 GTMFTRLVAPAYVAVRDEVRRRIRGAPLSSLG---GFDT--CYNTTV--KWPPVTFMFTG 381

Query: 314 ADVKLSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
             V L   NL     +   S   M +A  G N  + V   + Q N  I +D+    V F 
Sbjct: 382 MQVTLPADNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFA 441

Query: 368 PSRCT 372
             +CT
Sbjct: 442 REQCT 446


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score = 98.2 bits (243), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 106/375 (28%), Positives = 170/375 (45%), Gaps = 61/375 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP      VD   +  WTQC  C    CFKQ+ P+F P  SST+    C +
Sbjct: 62  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRR--CFKQDLPVFVPNASSTFKPEPCGT 119

Query: 95  SQC-AVVTSNCSEGDCSY----SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C ++ T +CS   CSY    + L G       +SG  AT+T    + +        + 
Sbjct: 120 AVCESIPTRSCSGDVCSYKGPPTQLRGN------TSGFAATDTFAIGTAT------VRLA 167

Query: 150 FGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKIN 204
           FGC    + +   D+    +G IGLG    SL++QM  +   +FSYCL  +    SS++ 
Sbjct: 168 FGC----VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLF 220

Query: 205 FG--GIVAGAGVVSTPLIIR-------DHYY-LSLEAISVGNQRLEFVSSSTGNIFVDTG 254
            G    +AG    ST   I+        HYY LSL+AI  GN  +   ++ +G I V   
Sbjct: 221 LGSSAKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--ATAQSGGILVMHT 278

Query: 255 V--LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTI 309
           V     L+   Y +  K+V   +  A           F   LC+  +   S+   P++  
Sbjct: 279 VSPFSLLVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFD--LCFKKAAGFSRATAPDLVF 336

Query: 310 HFRG-ADVKLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYD 358
            F+G A + + P+    ++ +E    C+A        R G   + V G + Q +    YD
Sbjct: 337 TFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYD 396

Query: 359 IEQAMVSFKPSRCTN 373
           +++  +SF+P+ C++
Sbjct: 397 LKKETLSFEPADCSS 411


>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 410

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 95/343 (27%), Positives = 165/343 (48%), Gaps = 41/343 (11%)

Query: 51  SVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDC 109
           ++DTG+  +W  CEPC P L    Q   LF P  S T+  +      C V   +  +G C
Sbjct: 86  ALDTGASTSWLMCEPCQPPL---PQVGHLFSPAASPTFQGVRGDGPVCTVPYRHTDKG-C 141

Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNS-TSGLPVE-MPNVIFGCGHKNLASPTSDSKQT 167
           S+ F         F++G L+ +T    S  SG  +E +P ++FGC H ++    +D   +
Sbjct: 142 SFRF--------PFAAGYLSRDTFHLRSGRSGTVMESVPGIMFGCAH-SVTGFHNDGTLS 192

Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGS----SKINFGGIVAGA--GVVSTPLII 221
           G++ L     S ++ +G   +G+FSYCLP   +    S + FG  V        +T L+ 
Sbjct: 193 GVLSLSHSPLSFLTLLGGRSSGRFSYCLPKPTTHNPDSFLRFGADVPSLPPHAHTTTLVH 252

Query: 222 RD--HYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRT-LLPLEYHSNLKSVMSNM 275
                Y+L++  IS+GN+RL     V ++ G   ++  V  T ++ L Y +   +++++M
Sbjct: 253 AGVPGYHLNIVGISLGNKRLHIDRHVFAAGGGCSINPAVTITRIMELAYLAVEHALVAHM 312

Query: 276 --IKAQPVKGVGAEPGFSDVLCY---NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISD 329
             + +  VKG+   PG S  LC+   + S + + P ++ HF  GA+++ +   LF ++  
Sbjct: 313 KELGSGRVKGM---PGRS--LCFDHMDRSVRVQLPGMSFHFEDGAELRFAAEQLF-DVRV 366

Query: 330 EIMCSAFRG-GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              C    G G+   V G   Q++    +DI    ++F P  C
Sbjct: 367 MAACFLVVGRGHHQTVIGAAQQVDTRFTFDIAAGRLAFVPETC 409


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 103/365 (28%), Positives = 156/365 (42%), Gaps = 51/365 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTPP  +  +VDT +D  W  C  C    C     P FDP  S++Y S+ C S
Sbjct: 110 YVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAASTSYRSVPCGS 167

Query: 95  SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             CA    + C  G   C +S  Y   +  A+ S  +LA         +G  V+     F
Sbjct: 168 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGDAVK--TYTF 217

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
           GC  K  A+ T+   Q  +     G  S +SQ      G FSYCLP   S  +NF G + 
Sbjct: 218 GCLQK--ATGTAAPPQGLLGLGR-GPLSFLSQTRDMYQGTFSYCLPSFKS--LNFSGTLR 272

Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQ-------RLEFVSSSTGNIFVDT 253
                    + +TPL+   H    YY+++  I VG +        L F  ++     +D+
Sbjct: 273 LGRNGQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDS 332

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
           G + T L    +  ++  +   + A PV  +G   GF    C+N ++   +P VT+ F G
Sbjct: 333 GTMFTRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPVTLLFDG 385

Query: 314 ADVKLSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFK 367
             V L   N+     +  IS   M +A  G N  + V   + Q N  + +D+    V F 
Sbjct: 386 MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFA 445

Query: 368 PSRCT 372
             RCT
Sbjct: 446 RERCT 450


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 50/123 (40%), Positives = 71/123 (57%), Gaps = 10/123 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG+PP  ++  VDTGSD  W QC PC   DC++Q  P+F+P  SS+Y  ++C +
Sbjct: 53  YFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPCA--DCYQQADPIFEPSFSSSYAPLTCET 110

Query: 95  SQC-AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
            QC ++  S C    C Y   YG G+Y   + G+ ATET+T + ++ L     NV  GCG
Sbjct: 111 HQCKSLDVSECRNDSCLYEVSYGDGSY---TVGDFATETITLDGSASL----NNVAIGCG 163

Query: 154 HKN 156
           H N
Sbjct: 164 HDN 166


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 109/398 (27%), Positives = 164/398 (41%), Gaps = 82/398 (20%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPP--LFDPKKSSTYNS 89
           Y + LS GTPP  +   +DTGSD  W  C     C         P   +F PK SS+   
Sbjct: 90  YSIPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKV 149

Query: 90  ISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLT 134
           + C + +C  +               + NC++    Y   YG G     + G + +ETL 
Sbjct: 150 LGCVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSG----ITGGIMLSETLD 205

Query: 135 FNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYC 194
                     +PN I GC      S  S S+  GI G G G  SL SQ+G     KFSYC
Sbjct: 206 LPGKG-----VPNFIVGC------SVLSTSQPAGISGFGRGPPSLPSQLGLK---KFSYC 251

Query: 195 L-------PDQGSSKINFGGIVAG---AGVVSTPLI----------IRDHYYLSLEAISV 234
           L         + SS +  G   +G   AG+  TP +             +YYL L  I+V
Sbjct: 252 LLSRRYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITV 311

Query: 235 GNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
           G + ++         +   G   +D+G   T +  E    + +     ++++    V   
Sbjct: 312 GGKHVKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATEVEGI 371

Query: 288 PGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMC---------- 333
            G     C+NIS  + P FPE+T+ FR GA+++L  +N    +  D+++C          
Sbjct: 372 TGLRP--CFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLGGDDVVCLTIVTDGAAG 429

Query: 334 SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             F GG A I+ G   Q NF + YD+    + F+   C
Sbjct: 430 KEFSGGPA-IILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 164/371 (44%), Gaps = 54/371 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSIS 91
           + M +S+GTP V    ++DTGS  +W QC+ C  + C+ Q+    P F+   SSTY  + 
Sbjct: 23  FFMGISLGTPAVFNLVTIDTGSTISWVQCQYCI-VHCYTQDQRAGPTFNTSSSSTYRRVG 81

Query: 92  CSSSQC------AVVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           CS+  C        + S C E +  C YS  Y  G Y   S+G L+ + LT  ++     
Sbjct: 82  CSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEY---SAGYLSQDRLTLANS----Y 134

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT-SIAGKFSYCLPDQGSSK 202
            +   IFGCG  N  +  S     GIIG G  + S  +Q+   +    FSYC P   S++
Sbjct: 135 SIQKFIFGCGSDNRYNGHS----AGIIGFGNKSYSFFNQIAQLTNYSAFSYCFP---SNQ 187

Query: 203 INFGGIVAGAGVVSTPLIIRDH----------YYLSLEAISVGNQRLEFV--SSSTGNIF 250
            N G +  G  V  +  +I             Y L    + V   RL+      +T    
Sbjct: 188 ENEGFLSIGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMTV 247

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ----PKFP 305
           VD+G + T +L   + +  +++   M+    V+G  ++      +C++ +       K P
Sbjct: 248 VDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE-----ICFHSNGDSVDWSKLP 302

Query: 306 EVTIHFRGADVKLSPSNLF-RNISDEIMCSAFRGGNANI----VYGRIMQINFLIGYDIE 360
            V I F  + +KL   N+F    SD  +CS F+  +A +    + G     +F + +DI+
Sbjct: 303 VVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQ 362

Query: 361 QAMVSFKPSRC 371
           Q    F+   C
Sbjct: 363 QRNFGFEAGAC 373


>gi|297794789|ref|XP_002865279.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297311114|gb|EFH41538.1| hypothetical protein ARALYDRAFT_494467 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 419

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 118/413 (28%), Positives = 173/413 (41%), Gaps = 85/413 (20%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL--DCFK---------QEPPLF 79
           V D YL+ L+IGTPP  +   +DTGSD TW    PC  L  DC           +   +F
Sbjct: 7   VRDGYLITLNIGTPPQAVQVYMDTGSDLTWV---PCGNLSFDCIDCNDLKSNNLKSSSIF 63

Query: 80  DPKKSSTYNSISCSSSQCAVVTSN------CSEGDCSYSFLYGRG--------AYASFSS 125
            P  SS+    SC+SS CA + S+      C+   CS S L            AY ++  
Sbjct: 64  SPLHSSSSFRASCASSFCAEIHSSDNPFDPCAIAGCSVSMLLKSTCIRPCPSFAY-TYGE 122

Query: 126 GNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT 185
           G L +  LT +       ++P   FGC       P       GI G G G  SL SQ+G 
Sbjct: 123 GGLVSGILTRDILKARTRDVPRFSFGCVTSTYHEP------IGIAGFGRGLLSLPSQLGF 176

Query: 186 SIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS---------TPL----IIRDHYYLSLE 230
              G FS+C LP +  +  N    ++ GA  +S         TP+    +  + YY+ LE
Sbjct: 177 LEKG-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPVYPNSYYIGLE 235

Query: 231 AISVGNQ---------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPV 281
           +I++G             +F S   G + VD+G   T LP  ++S L +++ + I     
Sbjct: 236 SITIGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPNPFYSQLLTILQSTITYPRA 295

Query: 282 KGVGAEPGFSDVLCY-------NISSQPK-----FPEVTIHF-RGADVKLSPSNLFRNIS 328
               +  GF   LCY       N++S        FP +T +F   A + L   N F  +S
Sbjct: 296 TETESRTGFD--LCYKVPCPNNNLTSLENDVMMVFPSITFNFLNNATLLLPQGNSFYAMS 353

Query: 329 -----DEIMCSAFRG---GNANI--VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                  + C  F+    GN     V+G   Q N  + YD+E+  + F+   C
Sbjct: 354 APSDGSVVQCLLFQNMEDGNYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 406


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 176/371 (47%), Gaps = 55/371 (14%)

Query: 36  LMHLSIGTPPVD-IFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
           ++++++GTP    + G VD  S   W QC PC         P   F P  S+T++ + CS
Sbjct: 89  VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148

Query: 94  SSQCA-VVTSNCSEGDC-----------SYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           S  C  V+   C                SYS  YG    A+ +SG LAT+T TF +T+  
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYG--GSAANTSGYLATDTFTFGATA-- 204

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
              +P V+FGC     AS    +  +G+IG+G GN SLISQ+     GKFSY L      
Sbjct: 205 ---VPGVVFGCSD---ASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255

Query: 197 DQGS--SKINFG--GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGN 248
           D GS  S I FG   +       STPL    +  D YY++L  + V   RL+ + + T +
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315

Query: 249 IFVD-TG--VLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
           +  + TG  +L +  P+ Y     +  +++ +++ I    V G  A       LCYN SS
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAA---LELDLCYNASS 372

Query: 301 --QPKFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
             + K P++T+ F  GAD+ LS +N F   +D  + C          V G ++Q    + 
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432

Query: 357 YDIEQAMVSFK 367
           YD++   ++F+
Sbjct: 433 YDVDAGRLTFE 443


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 176/371 (47%), Gaps = 55/371 (14%)

Query: 36  LMHLSIGTPPVD-IFGSVDTGSDCTWTQCEPCPELDCFKQEPP-LFDPKKSSTYNSISCS 93
           ++++++GTP    + G VD  S   W QC PC         P   F P  S+T++ + CS
Sbjct: 89  VINITVGTPVAQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATAFRPNGSATFSPLPCS 148

Query: 94  SSQCA-VVTSNCSEGDC-----------SYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
           S  C  V+   C                SYS  YG    A+ +SG LAT+T TF +T+  
Sbjct: 149 SDMCLPVLRETCGRAGAAANATAGARCDSYSLTYG--GSAANTSGYLATDTFTFGATA-- 204

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----P 196
              +P V+FGC     AS    +  +G+IG+G GN SLISQ+     GKFSY L      
Sbjct: 205 ---VPGVVFGCSD---ASYGDFAGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEAT 255

Query: 197 DQGS--SKINFG--GIVAGAGVVSTPL----IIRDHYYLSLEAISVGNQRLEFVSSSTGN 248
           D GS  S I FG   +       STPL    +  D YY++L  + V   RL+ + + T +
Sbjct: 256 DDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFD 315

Query: 249 IFVD-TG--VLRTLLPLEY-----HSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS 300
           +  + TG  +L +  P+ Y     +  +++ +++ I    V G  A       LCYN SS
Sbjct: 316 LRANGTGGVILSSTTPVTYLEQAAYDVVRAAVASRIGLPAVNGSAA---LELDLCYNASS 372

Query: 301 --QPKFPEVTIHFR-GADVKLSPSNLFRNISDE-IMCSAFRGGNANIVYGRIMQINFLIG 356
             + K P++T+ F  GAD+ LS +N F   +D  + C          V G ++Q    + 
Sbjct: 373 MAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMI 432

Query: 357 YDIEQAMVSFK 367
           YD++   ++F+
Sbjct: 433 YDVDAGRLTFE 443


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 163/366 (44%), Gaps = 54/366 (14%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP-----PLFDPKKSSTYN 88
           +Y++  S+GTPP  + G +D  SD  W QC  C    C    P     P F    SST  
Sbjct: 96  MYVLSFSVGTPPQVVTGVLDITSDFVWMQCSACAT--CGADAPAATSAPPFYAFLSSTIR 153

Query: 89  SISCSSSQCA-VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
            + C++  C  +V   CS  D  C YS++YG GA A+ ++G LA +   F +     V  
Sbjct: 154 EVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGA-ANTTAGLLAVDAFAFAT-----VRA 207

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN 204
             VIFGC      +  ++    G+IGLG G  S +SQ+     G+FSY L PD      +
Sbjct: 208 DGVIFGC------AVATEGDIGGVIGLGRGELSPVSQLQI---GRFSYYLAPDDAVDVGS 258

Query: 205 FGGIVAGAG-----VVSTPLII----RDHYYLSLEAISVGNQRL-------EFVSSSTGN 248
           F   +  A       VSTPL+     R  YY+ L  I V  + L       +  +  +G 
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318

Query: 249 IFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPE 306
           + +   +  T L    +  ++  M++ I+ +     G+E G    LCY   S    K P 
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIELRAAD--GSELGLD--LCYTSESLATAKVPS 374

Query: 307 VTIHFRGADV-KLSPSNLF-RNISDEIMCSAFR---GGNANIVYGRIMQINFLIGYDIEQ 361
           + + F G  V +L   N F  + +  + C        G+ +++ G ++Q+   + YDI  
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLL-GSLIQVGTHMIYDISG 433

Query: 362 AMVSFK 367
           + + F+
Sbjct: 434 SRLVFE 439


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 160/373 (42%), Gaps = 47/373 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   L +G+PP D +  +DTGSD  W  C     CP           FDP  S T + I
Sbjct: 89  LYYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLI 148

Query: 91  SCSSSQCAV----VTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           SCS  +C++      S C+  +  C Y+F YG G   S +SG   ++ L F++  G  V 
Sbjct: 149 SCSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDG---SGTSGYYVSDLLHFDTILGGSV- 204

Query: 145 MPN----VIFGCGHKNLASPTS-DSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
           M N    ++FGC        T  D    GI G G  + S+ISQ+ +       FS+CL  
Sbjct: 205 MKNSSAPIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKG 264

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            D G   +  G IV    +V TPL+  + HY L+L++I V  Q L      F +SS    
Sbjct: 265 DDSGGGILVLGEIVE-PNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGT 323

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
            +D+G     L    +    S +++ +     P    G +       CY  SS     FP
Sbjct: 324 IIDSGTTLAYLTEAAYDPFISAITSTVSPSVSPYLSKGNQ-------CYLTSSSINDVFP 376

Query: 306 EVTIHFRGA-DVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRIMQINFLIGYD 358
           +V+++F G   + L P +     S      + C  F+   G    + G ++  + +  YD
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436

Query: 359 IEQAMVSFKPSRC 371
           I    + +    C
Sbjct: 437 IAGQRIGWANYDC 449


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 171/373 (45%), Gaps = 57/373 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+ + +IGTPP      VD   +  WTQC  C    CFKQ+ P+F P  SST+    C +
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCSACRR--CFKQDLPVFVPNASSTFKPEPCGT 102

Query: 95  SQC-AVVTSNCSEGDCSY----SFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVI 149
           + C ++ T +CS   CSY    + L G       +SG  AT+T    + +        + 
Sbjct: 103 AVCESIPTRSCSGDVCSYKGPPTQLRGN------TSGFAATDTFAIGTAT------VRLA 150

Query: 150 FGCGHKNLASPTSDSKQ--TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---SSKIN 204
           FGC    + +   D+    +G IGLG    SL++QM  +   +FSYCL  +    SS++ 
Sbjct: 151 FGC----VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLF 203

Query: 205 FG--GIVAGAGVVSTPLIIR--------DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTG 254
            G    +AG+   ST   I+        ++Y LSL+AI  GN  +   + S G + + T 
Sbjct: 204 LGSSAKLAGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIA-TAQSGGILVMHTV 262

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEVTIHF 311
              +LL    +   K  ++  +       +   P   D LC+  +   S+   P++   F
Sbjct: 263 SPFSLLVDSAYKAFKKAVTEAVGGAAAPPMATPPQPFD-LCFKKAAGFSRATAPDLVFTF 321

Query: 312 RG-ADVKLSPSNLFRNISDE--IMCSAF-------RGGNANI-VYGRIMQINFLIGYDIE 360
           +G A + + P+    ++ +E    C+A        R G   + V G + Q +    YD++
Sbjct: 322 QGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLK 381

Query: 361 QAMVSFKPSRCTN 373
           +  +SF+P+ C++
Sbjct: 382 KETLSFEPADCSS 394


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 166/407 (40%), Gaps = 99/407 (24%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
           Y   LS GTP   +    DTGS   W  C     C E    K +P   P F PK SS+  
Sbjct: 81  YSTPLSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSK 140

Query: 89  SISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            + C + +C+ +               T NC++   +Y   YG G+ A    G L +ETL
Sbjct: 141 LVGCQNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGSTA----GLLLSETL 196

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSY 193
            F         +PN + GC   ++  P+      GI G G G+ SL SQMG     KF+Y
Sbjct: 197 DFPDKX-----IPNFVVGCSFLSIHQPS------GIAGFGRGSESLPSQMGLK---KFAY 242

Query: 194 CL--------PDQGSSKINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAISVGN 236
           CL        P  G   ++  G V  +G+  TP            +++YYL++  I VGN
Sbjct: 243 CLASRKFDDSPHSGQLILDSTG-VKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGN 301

Query: 237 QRLE----------------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           Q ++                 + S +   F+D  VL  +         +  ++N  +A  
Sbjct: 302 QAVKVPYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVA-----REFEKQLANWTRATD 356

Query: 281 VKGV-GAEPGFSDVLCYNISSQP--KFPEVTIHFRG-ADVKLSPSNLFRNISDE-IMC-- 333
           V+ + G  P      C++IS +   KFPE+   F+G A   L  +N F  +S   + C  
Sbjct: 357 VETLTGLRP------CFDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLT 410

Query: 334 --------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
                       GG  +++ G   Q NF + YD+    + F+   C+
Sbjct: 411 VVTHQMEDGGGGGGGPSVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 48/123 (39%), Positives = 69/123 (56%), Gaps = 11/123 (8%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IG PP   +  +DTGSD +W QC PC   DC++Q  P+F+P  S++Y  +SC +
Sbjct: 132 YFSRIGIGEPPSQAYMVLDTGSDISWVQCAPCA--DCYRQADPIFEPTASASYAPLSCEA 189

Query: 95  SQCAVV-TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
           +QC  +  S C  G+C Y   YG G+Y   + G+  TET+T         ++ NV  GCG
Sbjct: 190 AQCRYLDQSQCRNGNCLYQVSYGDGSY---TVGDFVTETVTIGVN-----KVKNVALGCG 241

Query: 154 HKN 156
           H N
Sbjct: 242 HNN 244


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 159/360 (44%), Gaps = 36/360 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VD+GS  T+  C  C +  C   + P F P  SSTY+ + C+ 
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 145

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +  +  C+Y   Y   A  S SSG L  + ++F + S L  +    +FGC +
Sbjct: 146 D----CTCDSDKNQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 196

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
                  S     GI+GLG G  S++ Q+     I   FS C    D G   +  G + A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255

Query: 211 GAGVVSTPL-IIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
             G++ T    +R  YY + L+ + V  + L     +        +D+G     LP +  
Sbjct: 256 PPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAF 315

Query: 266 SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVK 317
              K  +S+ +   P+K + G +P + D+ C+     N+S   + FP+V + F  G  + 
Sbjct: 316 VAFKDAVSSQV--HPLKKIRGPDPNYKDI-CFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372

Query: 318 LSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           LSP N LFR+  +        F+ G +   + G I+  N L+ YD     + F  + C+ 
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 172/367 (46%), Gaps = 36/367 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +    +DTGSD  W   T C  CP+    + +   FDP  SS+ + +
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142

Query: 91  SCSSSQCA---VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVE 144
           SCS  +C       S CS  + CSYSF YG G   S +SG   ++ ++F++  TS L + 
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG---SGTSGFYISDFMSFDTVITSTLAIN 199

Query: 145 MPN-VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-DQG 199
                +FGC +          +   GI GLG G+ S+ISQ+    +A + FS+CL  D+ 
Sbjct: 200 SSAPFVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKS 259

Query: 200 SSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDT 253
              I   G +     V TPL+  + HY ++L++I+V  Q L      F  ++     +DT
Sbjct: 260 GGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDT 319

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF 311
           G     LP E +S     ++N +        G    +    C+ I++     FPEV++ F
Sbjct: 320 GTTLAYLPDEAYSPFIQAIANAVSQ-----YGRPITYESYQCFEITAGDVDVFPEVSLSF 374

Query: 312 R-GADVKLSPS---NLFRNISDEIMCSAF-RGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
             GA + L P     +F +    I C  F R  +  I + G ++  + ++ YD+ +  + 
Sbjct: 375 AGGASMVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIG 434

Query: 366 FKPSRCT 372
           +    C+
Sbjct: 435 WAEYDCS 441


>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
          Length = 304

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 154/355 (43%), Gaps = 74/355 (20%)

Query: 37  MHLSIGTPPVDI---FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           M L++GTPPV +   FG     SD  W +C PC    C     P   P  +  Y+  + S
Sbjct: 1   MELAVGTPPVTVQALFGI----SDLCWVECTPCS--GCNNNAAP---PAGARLYDRANSS 51

Query: 94  SSQCAVVTSNCSEGDCSYSFLYG-RGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
           S       S  ++ +C Y ++YG      ++  G L TET+ F S     V+  +  FGC
Sbjct: 52  S------FSPLADTECGYRYVYGATDTDRNYVKGILGTETIKFGSNDAATVQ--SFTFGC 103

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--SSKINFGGIVA 210
            +    +   D   TG++GLG    SL+ Q+G     +FSYCL      +S + FG   +
Sbjct: 104 TNTVYRNDLFDG-NTGVVGLGRSKLSLVGQLGLD---RFSYCLASNPNVASPVLFGSTAS 159

Query: 211 --GAGVVSTPLIIRD-HYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSN 267
             G GV STPL+  D +YY++L  ISV   RL                           N
Sbjct: 160 MDGNGVSSTPLLPDDANYYVNLLGISVDGTRLAI------------------------PN 195

Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK----FPEVTIHFRGADVKLSPSNL 323
             + MS   +A  V G G       +LC+ +    K     P +T+HF G D++L   N 
Sbjct: 196 DTARMSRTYEA--VNGSG-------LLCFLVDDASKNVVTVPTMTMHFDGMDMELLFGNY 246

Query: 324 FR-------NISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F            +++C      +     G  +Q++F + Y+++ +++S +P+ C
Sbjct: 247 FAYTGKQSGGGGGDVLCLMIGKSSTGSRIGNYLQMDFHVLYELKNSVLSVQPADC 301


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 164/383 (42%), Gaps = 48/383 (12%)

Query: 19  PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP- 77
           PI +I   +I +   ++LM + +GTPPV    +VDTG+  ++ QCEPC  L C KQ    
Sbjct: 192 PIDLIQNGDINNF--LFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAG 248

Query: 78  -LFDPKKSSTYNSISCSSSQCAVV-------TSNCSEGD--CSYSFLYGRGAYASFSSGN 127
            +FDP KS +++ + CS ++C  V       +  C E +  C YS  +  G  +S+S G 
Sbjct: 249 EIFDPSKSESFSRVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTF--GGTSSYSVGK 306

Query: 128 LATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSI 187
           L  + L     +      P+ +FGC             + G++G      S   Q+   +
Sbjct: 307 LVRDRLAIGKYAK-GYSFPDFLFGCS----LDTEYHQYEAGLVGFADEPFSFFEQVAPLV 361

Query: 188 AGK-FSYCLP-DQGSSKINFGGIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFV 242
             K FSYC P D+  +     G         TPL +   +  Y L L+ + V    L   
Sbjct: 362 NYKAFSYCFPSDRRKTGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMAL--- 418

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF---SDVLCY--- 296
            ++   + VD+G   T+L  +  + L + ++     + ++ +G    +   SD +C+   
Sbjct: 419 VTTPSEMIVDSGSRWTILLSDTFTQLDAAIT-----EAMRPLGYNRNYYRGSDYICFEDA 473

Query: 297 ---NISSQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRG----GNANIVYGRI 348
                S     P V + F  G  + L P + F   +D  +C+ F      G+   + G  
Sbjct: 474 HFQQFSDWAALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQLLGNT 533

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
           M  +  I +DI+     F+   C
Sbjct: 534 MTRSVGITFDIQGGQFGFRKGDC 556


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 164/364 (45%), Gaps = 46/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTPP  +F  +DT +D  W  C  C            F+   SSTY+++SCS+
Sbjct: 30  YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 86

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           +QC         +S+     CS++  YG    +SFS+ +L  +TLT       P  +PN 
Sbjct: 87  AQCTQARGLTCPSSSPQPSVCSFNQSYG--GDSSFSA-SLVQDTLTL-----APDVIPNF 138

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
            FGC   N AS  S   Q G++GLG G  SL+SQ  +  +G FSYCLP       S  + 
Sbjct: 139 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 195

Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
            G +     +  TPL+        YY++L  +SVG+ +       L F ++S     +D+
Sbjct: 196 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 255

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
           G + T      +  ++      +       +GA   F    C++  ++   P++T+H   
Sbjct: 256 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGA---FDT--CFSADNENVAPKITLHMTS 310

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
            D+KL   N L  + +  + C +  G   NAN V      + Q N  I +D+  + +   
Sbjct: 311 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 370

Query: 368 PSRC 371
           P  C
Sbjct: 371 PEPC 374


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 116/426 (27%), Positives = 167/426 (39%), Gaps = 114/426 (26%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEP---------- 76
           V D YL+ L++GTPP  I   +DTGSD TW  C      C + + ++             
Sbjct: 8   VRDGYLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYS 67

Query: 77  ----------PLFDPKKSSTYNSISCSSSQCA---VVTSNCSEGDCSYSFLYGRGAYASF 123
                     PL     SS  +   C+ + C+   +V   C     S+++ YG G     
Sbjct: 68  SSSLRDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVV-- 125

Query: 124 SSGNLATETLTFNSTS-GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
             G L  +TLT + +S     E+PN  FGC       P       GI G G G  SL SQ
Sbjct: 126 -IGTLTRDTLTTHGSSPSFTREVPNFCFGCVGSTYREP------IGIAGFGRGVLSLPSQ 178

Query: 183 MGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRD------------------- 223
           +G    G FS+C           G   A    +S+PL+I D                   
Sbjct: 179 LGFLQKG-FSHCF---------LGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPM 228

Query: 224 ---HYYLSLEAISVGNQRL--------EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVM 272
              +YY+ LEAI+VGN           EF S   G + +D+G   T LP  +++ L S++
Sbjct: 229 YPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSML 288

Query: 273 SNMIKAQPVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHFRGADVKL------ 318
            ++I     +   A  GF   LCY I              P ++ HF   +V L      
Sbjct: 289 QSIITYPRAQEQEARTGFD--LCYRIPCPNNVVTDHDHLLPSISFHFSN-NVSLVLPQGN 345

Query: 319 ------SPSN-------LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVS 365
                 +PSN       L +N+ D         G A  V+G   Q N  + YD+E+  + 
Sbjct: 346 HFYAMGAPSNSTVVKCLLLQNMDDS------DSGPAG-VFGSFQQQNVKVVYDLEKERIG 398

Query: 366 FKPSRC 371
           F+P  C
Sbjct: 399 FQPMDC 404


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/373 (26%), Positives = 171/373 (45%), Gaps = 45/373 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+P  + +  +DTGSD  W     C  CP       E   FD   SST   +
Sbjct: 82  LYFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALV 141

Query: 91  SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNST----SG 140
           SC    C+      TS CS     CSY+F YG G   S ++G   ++T+ F++     S 
Sbjct: 142 SCGDPICSYAVQTATSECSSQANQCSYTFQYGDG---SGTTGYYVSDTMYFDTVLLGQSV 198

Query: 141 LPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP- 196
           +      +IFGC        T +D    GI G GPG  S+ISQ+ +       FS+CL  
Sbjct: 199 VANSSSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCLKG 258

Query: 197 -DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNI 249
            + G   +  G I+  + +V +PL+  + HY L+L++I+V  Q L      F +++    
Sbjct: 259 GENGGGVLVLGEILEPS-IVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNNQGT 317

Query: 250 FVDTG-VLRTLLPLEYHSNLKSVMSNMIK-AQPVKGVGAEPGFSDVLCYNISSQPK--FP 305
            VD+G  L  L+   Y+  +K++ + + + ++P+   G +       CY +S+     FP
Sbjct: 318 IVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ-------CYLVSNSVGDIFP 370

Query: 306 EVTIHFR-GADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
           +V+++F  GA + L+P +   +        + C  F+       + G ++  + +  YD+
Sbjct: 371 QVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDL 430

Query: 360 EQAMVSFKPSRCT 372
               + +    C+
Sbjct: 431 ANQRIGWADYDCS 443


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/386 (28%), Positives = 176/386 (45%), Gaps = 48/386 (12%)

Query: 13  NETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF 72
           +E+ + P + +   + + ++  Y   L IGTPP      VDTGS  T+  C  C +  C 
Sbjct: 90  SESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CG 147

Query: 73  KQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
           + + P F P+ SSTY  + C+      +  NC +GD   C Y   Y   A  S SSG L 
Sbjct: 148 RHQDPKFQPESSSTYQPVKCT------IDCNC-DGDRMQCVYERQY---AEMSTSSGVLG 197

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTS 186
            + ++F + S L  +    +FGC  +N+ +    S+   GI+GLG G+ S++ Q+     
Sbjct: 198 EDVISFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKV 253

Query: 187 IAGKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLE 240
           I+  FS C    D G   +  GGI   + +    S P   R  YY + L+ + V  +RL 
Sbjct: 254 ISDSFSLCYGGMDVGGGAMVLGGISPPSDMTFAYSDP--DRSPYYNIDLKEMHVAGKRLP 311

Query: 241 F---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY 296
               V        +D+G     LP       K  +  + + Q +K + G +P ++D+ C+
Sbjct: 312 LNANVFDGKHGTVLDSGTTYAYLPEAAFLAFKDAI--VKELQSLKQISGPDPNYNDI-CF 368

Query: 297 -----NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVY 345
                ++S   K FP V + F  G    LSP N +FR+  +        F+ GN    + 
Sbjct: 369 SGAGNDVSQLSKSFPVVDMVFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLL 428

Query: 346 GRIMQINFLIGYDIEQAMVSFKPSRC 371
           G I+  N L+ YD EQ  + F  + C
Sbjct: 429 GGIIVRNTLVMYDREQTKIGFWKTNC 454


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 164/364 (45%), Gaps = 46/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTPP  +F  +DT +D  W  C  C            F+   SSTY+++SCS+
Sbjct: 104 YVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGC---SNASTSFNTNSSSTYSTVSCST 160

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           +QC         +S+     CS++  YG    +SFS+ +L  +TLT       P  +PN 
Sbjct: 161 AQCTQARGLTCPSSSPQPSVCSFNQSYG--GDSSFSA-SLVQDTLTL-----APDVIPNF 212

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKIN 204
            FGC   N AS  S   Q G++GLG G  SL+SQ  +  +G FSYCLP       S  + 
Sbjct: 213 SFGC--INSASGNSLPPQ-GLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLK 269

Query: 205 FGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNIFVDT 253
            G +     +  TPL+        YY++L  +SVG+ +       L F ++S     +D+
Sbjct: 270 LGLLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDS 329

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG 313
           G + T      +  ++      +       +GA   F    C++  ++   P++T+H   
Sbjct: 330 GTVITRFAQPVYEAIRDEFRKQVNVSSFSTLGA---FDT--CFSADNENVAPKITLHMTS 384

Query: 314 ADVKLSPSN-LFRNISDEIMCSAFRG--GNANIVY---GRIMQINFLIGYDIEQAMVSFK 367
            D+KL   N L  + +  + C +  G   NAN V      + Q N  I +D+  + +   
Sbjct: 385 LDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIA 444

Query: 368 PSRC 371
           P  C
Sbjct: 445 PEPC 448


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 160/352 (45%), Gaps = 42/352 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           +L+ + +G PP   +   D  +D TW QC+PC  + C+ Q   +FDP +SS+Y  +SC +
Sbjct: 187 FLVQIGVGGPPQKFYMIFDLQTDFTWLQCQPC--IKCYDQPDSIFDPSQSSSYTLLSCET 244

Query: 95  SQCAVV-TSNCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
             C ++  S+CS +G C Y+  Y  G     + G L  ET++F S+      +  V  GC
Sbjct: 245 KHCNLLPNSSCSDDGYCRYNITYKDGTN---TEGVLINETVSFESSGW----VDRVSLGC 297

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ----GSSKINFGGI 208
            +KN   P   S   G  GLG G+ S  S++    A   SYCL +      SS + F   
Sbjct: 298 SNKN-QGPFVGSD--GTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSP 351

Query: 209 VAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSS-------TGNIFVDTGVLR 257
              +G V   L+      + YY+ L+ I VG ++++  +S+        G + V +  L 
Sbjct: 352 PC-SGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLI 410

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           T+L  + ++ ++     + K Q ++ + A   F    CYN+SS        + F   D K
Sbjct: 411 TMLENDTYNVVRDAF--VAKTQHLERLKAFLQFDT--CYNLSSNNTVELPILEFEVNDGK 466

Query: 318 --LSPSNLFRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMV 364
             L P   +    D+    C AF     +  + G + Q    + +D+  + V
Sbjct: 467 SWLLPKESYLYAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDLVNSFV 518


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 113/395 (28%), Positives = 169/395 (42%), Gaps = 83/395 (21%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSIS 91
           Y + LS GTPP  +   +DTGS   W  C     C       +  P F PK SS+   I 
Sbjct: 77  YSISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHSSSSKIIG 135

Query: 92  CSSSQCAVV-------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNST 138
           C + +C+ +             + NCS+    Y  LYG G     + G   +ETL  +  
Sbjct: 136 CKNPKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGSGT----TGGVALSETLHLH-- 189

Query: 139 SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL--- 195
            GL V  PN + GC      S  S  +  GI G G G SSL SQ+G +   KFSYCL   
Sbjct: 190 -GLIV--PNFLVGC------SVFSSRQPAGIAGFGRGPSSLPSQLGLT---KFSYCLLSH 237

Query: 196 ---PDQGSSKINFGGI----VAGAGVVSTPLI----IRD------HYYLSLEAISVGNQR 238
                Q SS +            A ++ TPL+    ++D      +YY+SL  IS+G + 
Sbjct: 238 KFDDTQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRS 297

Query: 239 LEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLK----SVMSNMIKAQPVKGV-GA 286
           ++             G   +D+G   T +  E    L     S + N  +A  V+ + G 
Sbjct: 298 VKIPYKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGL 357

Query: 287 EPGFSDVLCYNIS--SQPKFPEVTIHFR-GADVKLSPSNLFRNI-SDEIMC------SAF 336
           +P      C+N+S   + + P++ +HF+ GADV+L   N F  + S E+ C       A 
Sbjct: 358 KP------CFNVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAE 411

Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +     ++ G     NF + YD++   + FK   C
Sbjct: 412 KASGPGMILGNFQMQNFYVEYDLQNERLGFKKESC 446


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 163/362 (45%), Gaps = 40/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VD+GS  T+  C  C +  C   + P F P  SS+Y+ + C+ 
Sbjct: 88  YTTRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQ--CGNHQDPRFQPDLSSSYSPVKCNV 145

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +  +  C+Y   Y   A  S SSG L  + ++F   S L  +  + IFGC +
Sbjct: 146 D----CTCDSDKKQCTYERQY---AEMSSSSGVLGEDIVSFGRESELKPQ--HAIFGCEN 196

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
                  S     GI+GLG G  S++ Q+     I+  FS C    D G   +  GG++A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLA 255

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVS---SSTGNIFVDTGVLRTLLPLE 263
              ++   S PL  R  YY + L+ I V  + L   S   +S     +D+G     LP +
Sbjct: 256 PPDMIFSNSDPL--RSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQ 313

Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISS-QPKFPEVTIHF-RGAD 315
                K  +++  K   +K + G +P + D+ C+     N+S     FP+V + F  G  
Sbjct: 314 AFVAFKEAVTS--KVHSLKKIRGPDPSYKDI-CFAGAGRNVSKLHEVFPDVDMVFGNGQK 370

Query: 316 VKLSPSN-LFRNIS-DEIMC-SAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L+P N LFR+   D   C   F+ G +   + G I+  N L+ YD     + F  + C
Sbjct: 371 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 430

Query: 372 TN 373
           + 
Sbjct: 431 SE 432


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 113 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 171

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 172 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS- 228

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+           FSYCL
Sbjct: 229 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 279

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 280 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 336

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 394

Query: 297 --NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  
Sbjct: 395 ITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 454

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 455 SFGTTFDIQGKQFGFKYAAC 474


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/306 (31%), Positives = 142/306 (46%), Gaps = 34/306 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G PP D +  +DTGSD  W  C     CP     +     FDP  S+T + +
Sbjct: 82  LYYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLV 141

Query: 91  SCSSSQCAVVT----SNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFN---STSGL 141
           SCS   CA+      S C      C+Y F YG G   S +SG    + +  +    +S  
Sbjct: 142 SCSDQICALGVQSSDSACFGQSNQCAYVFQYGDG---SGTSGYYVMDMIHLDVVIDSSVT 198

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-- 196
                +V+FGC        T SD    GI G G  + S+ISQ+ +  IA K FS+CL   
Sbjct: 199 SNSSASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGD 258

Query: 197 DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIF 250
           D G   +  G IV    VV TPL+  + HY L+L++ISV  Q L      F +SS+    
Sbjct: 259 DSGGGILVLGEIVE-PNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTI 317

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           +D+G     L  E ++     ++N++ +Q  + V  +       CY  SS     FP+V+
Sbjct: 318 IDSGTTLAYLAEEAYNAFVVAVTNIV-SQSTQSVVLKGN----RCYVTSSSVSDIFPQVS 372

Query: 309 IHFRGA 314
           ++F G 
Sbjct: 373 LNFAGG 378


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 162/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS- 226

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+           FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392

Query: 297 --NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 151/353 (42%), Gaps = 56/353 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
              CA                 G G YA+ +        +                FGCG
Sbjct: 200 GPVCA-----------------GLGIYAASACSAAQCGAVQ------------GFFFGCG 230

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIV 209
           H   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  GG  
Sbjct: 231 H---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 287

Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
             A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T LP  
Sbjct: 288 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 347

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
            ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA V L
Sbjct: 348 AYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTL 403

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 404 GADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 452


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 166/369 (44%), Gaps = 42/369 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  +    +DTGSD  W  C P   CP+      E  LFD  KSS+   +
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142

Query: 91  SCSSSQCAVVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
            C+   CA V++   +       CSYSF Y      S +SG   T+++ F+   G   + 
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRD---RSGTSGFYVTDSMHFDILLGESTIA 199

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP--D 197
                ++FGC        T  +K   GI G G G  S+ISQ+ +  I  K FS+CL   +
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGE 259

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL----EFVSSSTGNIFVD 252
            G   +  G I+  + +V +PLI  + HY L L++I++  Q       F  S+ G   +D
Sbjct: 260 NGGGILVLGEILEPS-IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGETIID 318

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           +G     L  E +  + SV+++ +   A P    G++       C+ +S      FP + 
Sbjct: 319 SGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-------CFRVSMSVADIFPVLR 371

Query: 309 IHFRG-ADVKLSPSNL--FRNISDE--IMCSAFRGGNANI-VYGRIMQINFLIGYDIEQA 362
            +F G A + ++P     F +I  E  + C  F+     + + G ++  + +I YD+ + 
Sbjct: 372 FNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLARQ 431

Query: 363 MVSFKPSRC 371
            + +    C
Sbjct: 432 RIGWANYDC 440


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 98/204 (48%), Gaps = 25/204 (12%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNCSEG- 107
           +D+GSD  W QC+PCP L C  Q  PLFDP  S+TY+++ CSS+ CA +      CS   
Sbjct: 165 IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLGPYRRGCSANV 224

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C + F Y  GA A   +G  +++ LT       P + +   +FGC H +  S T     
Sbjct: 225 QCQFGFTYTDGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADRGS-TFSFDV 275

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--INFGGIVAGAGV----VSTPLI 220
           +G + LG G  S + Q  T     FSYC+P   SS   I  G     A +    VSTPL+
Sbjct: 276 SGTLALGGGAQSFVQQTATQYGRVFSYCIPPSPSSLGFITLGVPPQRAALVPTFVSTPLL 335

Query: 221 IRDH-----YYLSLEAISVGNQRL 239
                    Y + L AI V  + L
Sbjct: 336 SSSSMPPTFYRVLLRAIIVAGRPL 359


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 109/383 (28%), Positives = 172/383 (44%), Gaps = 44/383 (11%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E+ + P + +   + + ++  Y   L IGTPP      VDTGS  T+  C  C +  C +
Sbjct: 60  ESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQ--CGR 117

Query: 74  QEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATE 131
            + P F P  SSTY  + C+      +  NC      C Y   Y   A  S SSG L  +
Sbjct: 118 HQDPKFQPDLSSTYQPVKCT------LDCNCDNDRMQCVYERQY---AEMSTSSGVLGED 168

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQM--GTSIA 188
            ++F + S L  +    +FGC  +N+ +    S+   GI+GLG G+ S++ Q+     ++
Sbjct: 169 VVSFGNQSELAPQ--RAVFGC--ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVVS 224

Query: 189 GKFSYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF- 241
             FS C    D G   +  GGI   + +V   S P  +R  YY + L+ I V  +RL   
Sbjct: 225 DSFSLCYGGMDVGGGAMVLGGISPPSDMVFAQSDP--VRSPYYNIDLKEIHVAGKRLPLN 282

Query: 242 --VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--- 296
             V        +D+G     LP E     K  +   +++   +  G +P ++D LC+   
Sbjct: 283 PSVFDGKHGSVLDSGTTYAYLPEEAFLAFKEAIVKELQSFS-QISGPDPNYND-LCFSGA 340

Query: 297 --NISSQPK-FPEVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRI 348
             ++S   K FP V + F  G    LSP N +FR+  +        F+ G +   + G I
Sbjct: 341 GIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGI 400

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
           +  N L+ YD EQ  + F  + C
Sbjct: 401 VVRNTLVLYDREQTKIGFWKTNC 423


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 158/364 (43%), Gaps = 44/364 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP      VDTGS  T+  C  C    C   + P F P+ S TY  + C +
Sbjct: 93  YTARLWIGTPPQRFALIVDTGSTVTYVPCSTCRH--CGSHQDPKFRPEDSETYQPVKC-T 149

Query: 95  SQCAVVTSNC--SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            QC     NC      C+Y   Y   A  S SSG L  + ++F + + L  +    IFGC
Sbjct: 150 WQC-----NCDNDRKQCTYERRY---AEMSTSSGALGEDVVSFGNQTELSPQ--RAIFGC 199

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGI 208
            +        + +  GI+GLG G+ S++ Q+     I+  FS C      G   +  GGI
Sbjct: 200 ENDETGD-IYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGI 258

Query: 209 VAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP 261
              A +V   S P  +R  YY + L+ I V  +RL     V        +D+G     LP
Sbjct: 259 SPPADMVFTRSDP--VRSPYYNIDLKEIHVAGKRLHLNPKVFDGKHGTVLDSGTTYAYLP 316

Query: 262 LEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RG 313
                  K  +  M +   +K + G +P ++D+ C+     ++S   K FP V + F  G
Sbjct: 317 ESAFLAFKHAI--MKETHSLKRISGPDPRYNDI-CFSGAEIDVSQISKSFPVVEMVFGNG 373

Query: 314 ADVKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPS 369
             + LSP N LFR+  +        F  GN    + G I+  N L+ YD E   + F  +
Sbjct: 374 HKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFWKT 433

Query: 370 RCTN 373
            C+ 
Sbjct: 434 NCSE 437


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 107/353 (30%), Positives = 151/353 (42%), Gaps = 56/353 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPLFDPKKSSTYNSISCS 93
           Y++  S+GTP V     VDTGSD +W QC+PC     C+ Q+ PLFDP +SS+Y ++ C 
Sbjct: 140 YVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCG 199

Query: 94  SSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCG 153
              CA                 G G YA+ +        +                FGCG
Sbjct: 200 GPVCA-----------------GLGIYAASACSAAQCGAVQ------------GFFFGCG 230

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK----INFGGIV 209
           H   A     +   G++GLG    SL+ Q   +  G FSYCLP + S+     +  GG  
Sbjct: 231 H---AQSGLFNGVDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 287

Query: 210 AGAGVVSTPLIIRD-----HYYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLE 263
             A   ST  ++       +Y + L  ISVG Q+L   +S+  G   VDTG + T LP  
Sbjct: 288 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 347

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEP--GFSDVLCYNIS--SQPKFPEVTIHF-RGADVKL 318
            ++ L+S   + + +    G    P  G  D  CYN +       P V + F  GA V L
Sbjct: 348 AYAALRSAFRSGMASY---GYPTAPSNGILDT-CYNFAGYGTVTLPNVALTFGSGATVTL 403

Query: 319 SPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
               +          S   GG A  + G + Q +F +   I+   V FKPS C
Sbjct: 404 GADGILSFGCLAFAPSGSDGGMA--ILGNVQQRSFEV--RIDGTSVGFKPSSC 452


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 99/383 (25%), Positives = 170/383 (44%), Gaps = 72/383 (18%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           L++GTPP  +   +DTGS+ +W  C+    ++       +F+P  SS+Y  I C S  C 
Sbjct: 74  LTVGTPPQSVTMVLDTGSELSWLHCKKQQNINS------VFNPHLSSSYTPIPCMSPICK 127

Query: 99  VVTSN------CSEGDCSYSFLYGRGAYASFSS--GNLATETLTFNSTSGLPVEMPNVIF 150
             T +      C   +  +  +    +YA F+S  GNLA++T   + +       P +IF
Sbjct: 128 TRTRDFLIPVSCDSNNLCHVTV----SYADFTSLEGNLASDTFAISGSG-----QPGIIF 178

Query: 151 GCGHKNLAS-PTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV 209
           G      +S    DSK TG++G+  G+ S ++QMG     KFSYC+  + +S +   G  
Sbjct: 179 GSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCISGKDASGVLLFGDA 235

Query: 210 AGAGV----------VSTPLIIRDH--YYLSLEAISVGNQRLE-----FVSSST--GNIF 250
               +          ++TPL   D   Y + L  I VG++ L+     F    T  G   
Sbjct: 236 TFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTM 295

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLCYNISSQ---PK 303
           VD+G   T L    ++ L++    + + + V  +  +P F    +  LC+ +      P 
Sbjct: 296 VDSGTRFTFLLGSVYTALRNEF--VAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVVPA 353

Query: 304 FPEVTIHFRGADVKLSPSNLFRNI---------SDEIMCSAFRGGNANI------VYGRI 348
            P VT+ F GA++ +S   L   +         + ++ C  F  GN+++      V G  
Sbjct: 354 VPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTF--GNSDLLGIEAYVIGHH 411

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
            Q N  + +D+  + V F  ++C
Sbjct: 412 HQQNVWMEFDLVNSRVGFADTKC 434


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 162/362 (44%), Gaps = 40/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VD+GS  T+  C  C +  C   + P F P  SS+Y+ + C+ 
Sbjct: 89  YTTRLYIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSSYSPVKCNV 146

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +  +  C+Y   Y   A  S SSG L  + ++F   S L  +    +FGC +
Sbjct: 147 D----CTCDSDKKQCTYERQY---AEMSSSSGVLGEDIVSFGRESELKPQ--RAVFGCEN 197

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
                  S     GI+GLG G  S++ Q+     I+  FS C    D G   +  GG+ A
Sbjct: 198 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPA 256

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
            + +V   S PL  R  YY + L+ I V  + L     V +S     +D+G     LP +
Sbjct: 257 PSDMVFSHSDPL--RSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYLPEQ 314

Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISS-QPKFPEVTIHF-RGAD 315
                K  +++  K   +K + G +P + D+ C+     N+S     FP+V + F  G  
Sbjct: 315 AFVAFKDAVTS--KVHSLKKIRGPDPNYKDI-CFAGAGRNVSKLHEVFPDVDMVFGNGQK 371

Query: 316 VKLSPSN-LFRNIS-DEIMC-SAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + L+P N LFR+   D   C   F+ G +   + G I+  N L+ YD     + F  + C
Sbjct: 372 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431

Query: 372 TN 373
           + 
Sbjct: 432 SE 433


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 114/413 (27%), Positives = 165/413 (39%), Gaps = 99/413 (23%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQC--EPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           +++G PP ++   +DTGS+ +W  C     P      Q P  F+   SSTY +  CSSS 
Sbjct: 63  VAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSSSP 122

Query: 97  --------------CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
                         CA   SN     C  S  Y   A AS + G LA +T         P
Sbjct: 123 ECQWRGRDLPVPPFCAGPPSN----SCRVSLSY---ADASSADGVLAADTFLLGGAP--P 173

Query: 143 VEMPNVIFGC-------------GHKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIA 188
           V     +FGC             G+ N AS T+ S+  TG++G+  G+ S ++Q GT   
Sbjct: 174 VR---ALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTL-- 228

Query: 189 GKFSYCL-PDQGSSKINFGGIVAGAGVVS------TPLII---------RDHYYLSLEAI 232
            +F+YC+ P  G   +  GG   GA + +      TPLI          R  Y + LE I
Sbjct: 229 -RFAYCIAPGDGPGLLVLGGDGDGAALSAAPQLNYTPLIEMSQPLPYFDRVAYSVQLEGI 287

Query: 233 SVGNQRLEFVSS-------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
            VG   L    S         G   VD+G   T L  + ++ LK    N   A  +    
Sbjct: 288 RVGAALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA--LLAPL 345

Query: 286 AEPGF------------SDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNI------ 327
            EP F            S+      ++    PEV +  RGA+V +    L   +      
Sbjct: 346 GEPDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRG 405

Query: 328 ---SDEIMCSAFRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
              S+ + C  F  GN+++      V G   Q N  + YD++ + V F P+RC
Sbjct: 406 EGGSEAVWCLTF--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 111/416 (26%), Positives = 164/416 (39%), Gaps = 96/416 (23%)

Query: 35  YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
           Y + LS+G P     V +F  +DTGSD  W  C P   + C  +           PP  D
Sbjct: 88  YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145

Query: 81  PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAY-ASF 123
            ++            SS   S  C++++C   A+ T +C+   C    + YG G+  A+ 
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVANL 205

Query: 124 SSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
             G +          + + VE  N  F C H  LA P       G+ G G G  SL +Q+
Sbjct: 206 RRGRVGL-------AASMAVE--NFTFACAHTALAEPV------GVAGFGRGPLSLPAQL 250

Query: 184 GTSIAGKFSYCLPDQG--------SSKINFGGIVAGAGV-------VSTPLIIRDH---- 224
             S++G+FSYCL            SS +  G     A +       V TPL+        
Sbjct: 251 APSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYF 310

Query: 225 YYLSLEAISVGNQRLE-------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSN 274
           Y ++LEA+SVG +R++             G + VD+G   T+LP +  + +         
Sbjct: 311 YSVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMA 370

Query: 275 MIKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE-- 330
             +    +G  A+ G +   CY+ S S    P V +HFRG A V L   N F     E  
Sbjct: 371 AARFTRAEGAEAQTGLAP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEG 428

Query: 331 --IMCSAFR-----------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             + C               GG      G   Q  F + YD++   V F   RCT+
Sbjct: 429 RSVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 172/367 (46%), Gaps = 36/367 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTW---TQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +    +DTGSD  W   T C  CP+    + +   FDP  SS+ + +
Sbjct: 83  LYYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLV 142

Query: 91  SCSSSQCA---VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNS--TSGLPVE 144
           SCS  +C       S CS  + CSYSF YG G   S +SG   ++ ++F++  TS L + 
Sbjct: 143 SCSDRRCYSNFQTESGCSPNNLCSYSFKYGDG---SGTSGYYISDFMSFDTVITSTLAIN 199

Query: 145 MPN-VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP-DQG 199
                +FGC +          +   GI GLG G+ S+ISQ+    +A + FS+CL  D+ 
Sbjct: 200 SSAPFVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKS 259

Query: 200 SSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFVDT 253
              I   G +     V TPL+  + HY ++L++I+V  Q L      F  ++     +DT
Sbjct: 260 GGGIMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDT 319

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHF 311
           G     LP E +S     ++N +        G    +    C+ I++     FP+V++ F
Sbjct: 320 GTTLAYLPDEAYSPFIQAVANAVSQ-----YGRPITYESYQCFEITAGDVDVFPQVSLSF 374

Query: 312 R-GADVKLSPS---NLFRNISDEIMCSAF-RGGNANI-VYGRIMQINFLIGYDIEQAMVS 365
             GA + L P     +F +    I C  F R  +  I + G ++  + ++ YD+ +  + 
Sbjct: 375 AGGASMVLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIG 434

Query: 366 FKPSRCT 372
           +    C+
Sbjct: 435 WAEYDCS 441


>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
          Length = 335

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/212 (32%), Positives = 102/212 (48%), Gaps = 26/212 (12%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV---TSNC-SEG 107
           +D+GSD  W QC+PCP L C  Q  PLFDP  S+TY ++ CSS+ CA +      C +  
Sbjct: 85  IDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLGPYRRGCLANS 144

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE-MPNVIFGCGHKNLASPTSDSKQ 166
            C +   Y  GA A   +G  +++ LT       P + +   +FGC H +  S T     
Sbjct: 145 QCQFGITYANGATA---TGTYSSDDLTLG-----PYDVVRGFLFGCAHADQGS-TFSYDV 195

Query: 167 TGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA---------GVVST 217
            G + LG G+ S + Q  +  +  FSYC+P   SS   FG I+ G            VST
Sbjct: 196 AGTLALGGGSQSFVQQTASQYSRVFSYCVPPSTSS---FGFIMFGVPPQRAALVPTFVST 252

Query: 218 PLIIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           PL+       +  +I++ +  L F   +T N+
Sbjct: 253 PLLSSSTMSPTFYSITLPSIALVFDGGATVNL 284


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 156/367 (42%), Gaps = 39/367 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP V  +  +DTGS   W     C+ CP      ++   +DP+ S +   +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 91  SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
            C  + C           C Y   Y  G     + G L T+ L ++   G     P   +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
           V FGCG +   S  + +    GIIG G  N + +SQ+  + AGK    FS+CL       
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256

Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
           I   G V    V +TP++  +  Y  ++L++I+V    L+     F ++ T   F+D+G 
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
               LP   +S L   +  +    P   +GA   F    C++   S   KFP++T HF  
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 370

Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
            D+ L   P +          C  F+     G    I+ G ++  N ++ YD+E+  + +
Sbjct: 371 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 429

Query: 367 KPSRCTN 373
               C++
Sbjct: 430 TEHNCSS 436


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score = 95.5 bits (236), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 109/415 (26%), Positives = 164/415 (39%), Gaps = 94/415 (22%)

Query: 35  YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
           Y + LS+G P     V +F  +DTGSD  W  C P   + C  +           PP  D
Sbjct: 88  YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145

Query: 81  PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAYASFS 124
            ++            SS   S  C++++C   A+ T +C+   C    + YG G+  +  
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA-- 203

Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
             NL    +   ++    + + N  F C H  LA P       G+ G G G  SL +Q+ 
Sbjct: 204 --NLRRGRVGLAAS----MAVENFTFACAHTALAEPV------GVAGFGRGPLSLPAQLA 251

Query: 185 TSIAGKFSYCLPDQG--------SSKINFGGIVAGAGV-------VSTPLIIRDH----Y 225
            S++G+FSYCL            SS +  G     A +       V TPL+        Y
Sbjct: 252 PSLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFY 311

Query: 226 YLSLEAISVGNQRLE-------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSNM 275
            ++LEA+SVG +R++             G + VD+G   T+LP +  + +          
Sbjct: 312 SVALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAA 371

Query: 276 IKAQPVKGVGAEPGFSDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE--- 330
            +    +G  A+ G +   CY+ S S    P V +HFRG A V L   N F     E   
Sbjct: 372 ARFTRAEGAEAQTGLAP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGR 429

Query: 331 -IMCSAFR-----------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            + C               GG      G   Q  F + YD++   V F   RCT+
Sbjct: 430 SVGCLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 158/360 (43%), Gaps = 38/360 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTP  +    VD+GS  T+  C  C +  C   + P F P  SSTY+ + C+ 
Sbjct: 91  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 148

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +     C+Y   Y   A  S SSG L  + ++F   S L  +    +FGC +
Sbjct: 149 D----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ--RAVFGCEN 199

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
                  S     GI+GLG G  S++ Q+     I+  FS C    D G   +  GG+ A
Sbjct: 200 TETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVLGGMPA 258

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
              +V   S P  +R  YY + L+ I V  + L     + +S     +D+G     LP +
Sbjct: 259 PPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYLPEQ 316

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADV 316
                K  ++N + +   K  G +P + D+ C+     N+S   + FP+V + F  G  +
Sbjct: 317 AFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDMVFGNGQKL 374

Query: 317 KLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            LSP N LFR+  +        F+ G +   + G I+  N L+ YD     + F  + C+
Sbjct: 375 SLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCS 434


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 60/154 (38%), Positives = 80/154 (51%), Gaps = 17/154 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP  D+    DTGSD TWTQCEPC    C+ Q+ P+F+P KS++Y +ISCSS
Sbjct: 138 YVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARY-CYHQQEPIFNPSKSTSYTNISCSS 196

Query: 95  SQCAVVTS------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C  + S      +CS   C Y   YG     S+S G  A + L   ST        N 
Sbjct: 197 PTCDELKSGTGNSPSCSASTCVYGIQYGD---QSYSVGFFAQDKLALTSTD----VFNNF 249

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQ 182
           +FGCG  N           G+IGLG    SL+S+
Sbjct: 250 LFGCGQNNRGLFVG---VAGLIGLGRNALSLMSK 280



 Score = 40.8 bits (94), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 30/111 (27%), Positives = 49/111 (44%), Gaps = 14/111 (12%)

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSPSNL 323
           N  S+MS   KA P   +          CY+ S       P++ ++F  GA++ L PS +
Sbjct: 273 NALSLMSKYPKAAPASILDT--------CYDFSQYDTVDVPKINLYFSDGAEMDLDPSGI 324

Query: 324 FRNISDEIMCSAFRGGNANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F  ++   +C AF G +      + G + Q  F + YD+    + F P  C
Sbjct: 325 FYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 105/384 (27%), Positives = 160/384 (41%), Gaps = 62/384 (16%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP D +  +DTGSD  W  C     CP     +     FDP  S+T   +
Sbjct: 83  LYFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALV 142

Query: 91  SCSSSQCAV----VTSNCSE--GDCSYSFLYGRGAYAS--------------FSSGNLAT 130
           SCS  +C        S CS     C Y+F YG G+  S               SSG L+ 
Sbjct: 143 SCSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQ 202

Query: 131 ETLTFNSTSGLPVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--I 187
              T++S+         V F C        T SD    GI G G    S+ISQ+ +    
Sbjct: 203 ICQTYDSS---------VSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGIT 253

Query: 188 AGKFSYCLP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE---- 240
              FS+CL   D G   +  G IV    +V TPL+  + HY L L++ISV  Q L     
Sbjct: 254 PRVFSHCLKGDDSGGGVLVLGEIVE-PNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPS 312

Query: 241 -FVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYN 297
            F +SS     VD+G     L    +    S +++++   A+     G +       CY 
Sbjct: 313 VFGASSNQGTIVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQ-------CYL 365

Query: 298 ISSQPK--FPEVTIHFR-GADVKLSPSNLFRNISD----EIMCSAFRG--GNANIVYGRI 348
           ++S     FP+V+++F  GA + L+P +     +      + C  F+   G    + G +
Sbjct: 366 VTSSVNDVFPQVSLNFAGGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDL 425

Query: 349 MQINFLIGYDIEQAMVSFKPSRCT 372
           +  + +  YDI    V +    C+
Sbjct: 426 VLKDKIFVYDIANQRVGWTNYDCS 449


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 117/426 (27%), Positives = 170/426 (39%), Gaps = 94/426 (22%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWT-------QCEPCPELDCFKQEPPL--FDP 81
           + D YLM LSIGTPP  +   +DTGSD TW         C+ C E       P L  F P
Sbjct: 17  IRDGYLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLP 76

Query: 82  KKSSTYNSISCSSSQCAVVTSN------CSEGDCSY-------------SFLYGRGAYAS 122
             SST    +C SS C  + S+      C+   CS              SF Y  GA + 
Sbjct: 77  THSSTSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGA-SG 135

Query: 123 FSSGNLATETLTFNSTSGLPV----EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSS 178
             +G+L  + L  +           ++P   FGC       P       GI G G G  S
Sbjct: 136 VVTGSLTRDVLFTHGNYNNNNNNNKQIPRFCFGCVGATYREP------IGIAGFGRGLLS 189

Query: 179 LISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS--------TPLI----IRDH 224
           L  Q+G S  G FS+C LP + S+  NF   ++ G   +S        TPL+      ++
Sbjct: 190 LPFQLGFSHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNY 248

Query: 225 YYLSLEAISVGN-----------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMS 273
           YY+ LE+I++GN           +  E  +   G + +D+G   T LP   +S L S + 
Sbjct: 249 YYIGLESITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLE 308

Query: 274 NMIKAQPVKGVGAEPGFSDVLCYNISSQ---------PKFPEVTIHF-RGADVKLSPSNL 323
            +I     K V    GF   LCY +  +          + P +T HF     V L   N 
Sbjct: 309 LVIGYPRAKQVELNTGFD--LCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNN 366

Query: 324 FRNI-----SDEIMCSAFRGGNANI------------VYGRIMQINFLIGYDIEQAMVSF 366
           F  +     S  + C  ++  +               ++G   Q N  + YD+E+  + F
Sbjct: 367 FYAMAAPINSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGF 426

Query: 367 KPSRCT 372
           +P  C 
Sbjct: 427 QPMDCV 432


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 164/372 (44%), Gaps = 45/372 (12%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G P  +    +DTGSD  W  C P   CP+      E  LFD  KSS+   +
Sbjct: 83  LYFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVL 142

Query: 91  SCSSSQCAVVTSNCSE-----GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---LP 142
            C+   CA V++   +       CSYSF Y      S +SG   T+++ F+   G   + 
Sbjct: 143 PCTDPICAAVSTTTDQCLTQTDHCSYSFHYRD---RSGTSGFYVTDSMHFDILLGESTIA 199

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYCLP--D 197
                ++FGC        T  +K   GI G G G  S+ISQ+ +  I  K FS+CL   +
Sbjct: 200 NSSATIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCLKGGE 259

Query: 198 QGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRL----EFVSSSTGNIFVD 252
            G   +  G I+  + +V +PLI  + HY L L++I++  Q       F  S+ G   +D
Sbjct: 260 NGGGILVLGEILEPS-IVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGETIID 318

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVT 308
           +G     L  E +  + SV+++ +   A P    G++       C+ +S      FP + 
Sbjct: 319 SGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQ-------CFRVSMSVADIFPVLR 371

Query: 309 IHFRG-ADVKLSPSNLFRNIS-------DEIMCSAFRGGNANI-VYGRIMQINFLIGYDI 359
            +F G A + ++P    +  S         + C  F+     + + G ++  + +I YD+
Sbjct: 372 FNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVYDL 431

Query: 360 EQAMVSFKPSRC 371
            Q  + +    C
Sbjct: 432 AQQRIGWANYDC 443


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 160/376 (42%), Gaps = 50/376 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDP------KKS 84
           +Y   + +GTPP++    +DTGSD  W  C     CP       +   FD          
Sbjct: 78  LYFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLV 137

Query: 85  STYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSG---L 141
           S  + I  S+ Q            CSY+F YG G   S +SG   +E++ F+   G   +
Sbjct: 138 SCSDPICNSAFQTTATQCLTQSNQCSYTFQYGDG---SGTSGYYVSESMYFDMVMGQSMI 194

Query: 142 PVEMPNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQM-GTSIAGK-FSYCLPDQ 198
                +V+FGC        T SD    GI G GPG+ S+ISQ+    I  K FS+CL  +
Sbjct: 195 ANSSASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCLKGE 254

Query: 199 GSSKINFGGI-----VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTG 247
           G    N GGI     V   G+V +PL+  + HY L L++ISV  Q L      F +S   
Sbjct: 255 G----NGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPIDPSVFATSINR 310

Query: 248 NIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PK 303
              +D+G     L  E ++   S ++  +     P    G +       CY +S+     
Sbjct: 311 GTIIDSGTTLAYLVEEAYTPFVSAITAAVSQSVTPTISKGNQ-------CYLVSTSVGEI 363

Query: 304 FPEVTIHFRG-ADVKLSPSNLFRNI----SDEIMCSAFRGGNANI-VYGRIMQINFLIGY 357
           FP V+++F G A + L P     ++       + C  F+     + + G ++  + +  Y
Sbjct: 364 FPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVY 423

Query: 358 DIEQAMVSFKPSRCTN 373
           D+ +  + +    C+ 
Sbjct: 424 DLARQRIGWASYDCSQ 439


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/310 (28%), Positives = 134/310 (43%), Gaps = 47/310 (15%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP  D +  VDTGSD  W    QC  CP       E   +D ++S+T   +
Sbjct: 86  LYYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLV 145

Query: 91  SCSSSQCAVVT----SNCSEG-DCSYSFLYGRGAYASFSSGNLATETLTFNSTSG-LPVE 144
           SC    C  V     S C+    C Y  +YG G   S ++G    + + +N  SG L   
Sbjct: 146 SCDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDG---SSTAGYFVKDYVQYNRVSGDLETT 202

Query: 145 MPN--VIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
             N  + FGCG +      S  ++   GI+G G  NSS+ISQ+ ++  +   F++CL   
Sbjct: 203 AANGSIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL--- 259

Query: 199 GSSKINFGGIVAGAGVVS-----TPLIIRD-HYYLSLEAISVGNQRLE-----FVSSSTG 247
                N GGI A   VV      TPL+    HY +++  + VG+  L      F +    
Sbjct: 260 --DGTNGGGIFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRK 317

Query: 248 NIFVDTGVLRTLLP-LEYHSNLKSVMSNM--IKAQPVKGVGAEPGFSDVLCYNISSQ--P 302
              +D+G     LP L Y   +  ++S    ++ Q + G        +  C+  S +   
Sbjct: 318 GTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHG--------EYKCFQYSERVDD 369

Query: 303 KFPEVTIHFR 312
            FP V  HF 
Sbjct: 370 GFPPVIFHFE 379


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score = 94.7 bits (234), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 90/337 (26%), Positives = 155/337 (45%), Gaps = 48/337 (14%)

Query: 63  CEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGAYA 121
           C  C  + CFKQ+ P+F P  SST+    C +  C ++ T  C+   C+Y  + G G + 
Sbjct: 55  CSQC--IHCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGH- 111

Query: 122 SFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLIS 181
             + G +AT+T    + +  P   P      G    A+ T  +  +G IGLG    SL++
Sbjct: 112 --TVGIVATDTFAIGTAA--PARPP----ASGASWRATSTPWAGPSGFIGLGRTPWSLVA 163

Query: 182 QMGTSIAGKFSYCLP--DQGSSKINFGGIVA--GAGVVSTPLI-------IRDHYYLSLE 230
           QM  +   +FSYCL   D G +   F G  A    G   TP +       +  +Y + LE
Sbjct: 164 QMKLT---RFSYCLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELE 220

Query: 231 AISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLE--YHSNLKSVMSNMIKAQPVKGVGAEP 288
            I  G+  +  +      + V T V+R  L ++  Y    K+VM+++  A     VGA  
Sbjct: 221 EIKAGDATIT-MPRGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMASVGAAPTATPVGAP- 278

Query: 289 GFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMC-----------SAF 336
            F   +C+  +     P++   F+ GA + + P+N   ++ ++ +C           +A 
Sbjct: 279 -FE--VCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITAL 335

Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            G N   + G   Q N  + +D+++ M+SF+P+ C++
Sbjct: 336 DGLN---ILGSFQQENVHLLFDLDKDMLSFEPADCSS 369


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 108/367 (29%), Positives = 166/367 (45%), Gaps = 51/367 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VDTGS  T+  C  C +  C K + P F P+ SS+Y ++ C+ 
Sbjct: 80  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSSSYKALKCNP 137

Query: 95  SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                   NC  EG  C Y   Y   A  S SSG L+ + ++F + S L  +    +FGC
Sbjct: 138 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLTPQ--RAVFGC 186

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
            +       S  +  GI+GLG G  S++ Q+     I   FS C    G  ++  G +V 
Sbjct: 187 ENVETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 242

Query: 211 G-----AGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRT 258
           G     AG+V   S P   R  YY + L+ + V  + L+    V +      +D+G    
Sbjct: 243 GKISPPAGMVFSHSDPF--RSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYA 300

Query: 259 LLPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIH 310
             P E    +K     +IK  P +K + G +P + DV C++ + +        FPE+ + 
Sbjct: 301 YFPKEAFIAIKDA---IIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIDME 356

Query: 311 F-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           F  G  + LSP N LFR+  +        F   ++  + G I+  N L+ YD E   + F
Sbjct: 357 FGNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGF 416

Query: 367 KPSRCTN 373
             + C++
Sbjct: 417 LKTNCSD 423


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 103/376 (27%), Positives = 164/376 (43%), Gaps = 50/376 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE---PCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTP  D +  VDTGSD  W  C     CP+      E  L+ P  SST N +
Sbjct: 73  LYFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRV 132

Query: 91  SCSSSQCAVVTS----NCS-EGDCSYSFLYGRGAYAS--FSSGNLATETLTFN----STS 139
           +C+   C          C+ E  C Y   YG G+  +  F   ++  + +T N    ST+
Sbjct: 133 TCNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTN 192

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTS--IAGKFSYCLP 196
           G      +++FGCG +      + S    GI+G G  NSS+ISQ+ +S  +   F++CL 
Sbjct: 193 G------SIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLD 246

Query: 197 DQGSSKINFGGIVAGAGVV-----STPLIIRD-HYYLSLEAISVGNQRLE-----FVSSS 245
           +     IN GGI A   VV     +TPL+ +  HY + ++AI V N+ L      F +  
Sbjct: 247 N-----INGGGIFAIGEVVQPKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL 301

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFP 305
                +D+G      P   +  L S +    +   +K    E  F+    Y+ +    FP
Sbjct: 302 RKGTIIDSGTTLAYFPDVIYEPLISKI--FARQSTLKLHTVEEQFT-CFEYDGNVDDGFP 358

Query: 306 EVTIHFRGA-DVKLSPSNLFRNISDEIMCSAF-------RGGNANIVYGRIMQINFLIGY 357
            VT HF  +  + + P     +I     C  +       R G   I+ G ++  N L+ Y
Sbjct: 359 TVTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMY 418

Query: 358 DIEQAMVSFKPSRCTN 373
           D+E   + +    C++
Sbjct: 419 DLENQTIGWTEYNCSS 434


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 158/360 (43%), Gaps = 36/360 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VD+GS  T+  C  C +  C   + P F P  SSTY+ + C+ 
Sbjct: 88  YTTRLHIGTPPQEFALIVDSGSTVTYVPCASCEQ--CGNHQDPRFQPDLSSTYSPVKCNV 145

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +  +  C+Y   Y   A  S SSG L  + ++F + S L  +    +FGC +
Sbjct: 146 D----CTCDSDKNQCTYERQY---AEMSSSSGVLGEDIVSFGTESELKPQ--RAVFGCEN 196

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSKINFGGIVA 210
                  S     GI+GLG G  S++ Q+     I   FS C    D G   +  G + A
Sbjct: 197 SETGDLFSQHAD-GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPA 255

Query: 211 GAGVVSTPL-IIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYH 265
             G++ T    +R  YY + L+ + V  + L     +        +D+G     LP +  
Sbjct: 256 PPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDSGTTYAYLPEQAF 315

Query: 266 SNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGADVK 317
              K  +S+ +   P+K + G +  + D+ C+     N+S   + FP+V + F  G  + 
Sbjct: 316 VAFKDAVSSQV--HPLKKIRGPDSNYKDI-CFAGAGRNVSQLSEVFPKVDMVFGNGQKLS 372

Query: 318 LSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           LSP N LFR+  +        F+ G +   + G I+  N L+ YD     + F  + C+ 
Sbjct: 373 LSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNCSE 432


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 92/352 (26%), Positives = 157/352 (44%), Gaps = 48/352 (13%)

Query: 43  TPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV-- 100
           +PPV +   +DT  D  W +C PC    C       +DP +SSTY++  C+SS C  +  
Sbjct: 160 SPPVTVV--LDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGR 212

Query: 101 -TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLA 158
             + C + G C Y  +    ++ +  SG  +++ LT NS  G  VE     FGC      
Sbjct: 213 YANGCDANGQCQYMVVTAGDSFTT--SGTYSSDVLTINS--GDRVE--GFRFGCSQNEQG 266

Query: 159 SPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFG-GIVAGAG--VV 215
           S   +++  GI+ LG G  SL++Q  ++    FSYCLP   ++K  F  G+  GA    V
Sbjct: 267 S--FENQADGIMALGRGVQSLMAQTSSTYGDAFSYCLPPTETTKGFFQIGVPIGASYRFV 324

Query: 216 STPLIIRDH---------YYLSLEAISVGNQRLEFVSSS-TGNIFVDTGVLRTLLPLEYH 265
           +TP++             Y   L AI+V  + L   +        +D+  + T LP+  +
Sbjct: 325 TTPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVFAAGTVMDSRTIITRLPVTAY 384

Query: 266 SNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTIHFRG-ADVKLSPSN 322
             L++   N ++ +      A P      CY+++    P+ P + + F G A V++  S 
Sbjct: 385 GALRAAFRNRMRYRV-----APPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSG 439

Query: 323 LFRNISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +  N      C AF   + +    + G + Q    + +D+    + F+ + C
Sbjct: 440 ILLN-----GCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 161/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+           FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392

Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 116/412 (28%), Positives = 168/412 (40%), Gaps = 78/412 (18%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----PCPELDCFKQEPPLFDPK 82
           ++  V D YL+ L+IGTPP  I   +DTGSD TW  C      C + D ++    +    
Sbjct: 4   QLREVRDGYLISLNIGTPPQVIQVYMDTGSDLTWVPCGNLSFDCMDCDDYRNSKLMSAFS 63

Query: 83  KSSTYNSI--SCSSSQCAVVTSN------CSEGDCSY-------------SFLYGRGAYA 121
            S + +S   SC+S  C  + S+      C+   CS              SF Y  GA  
Sbjct: 64  PSHSSSSYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGA-G 122

Query: 122 SFSSGNLATETLTFNS-TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLI 180
              +G L  +TL  +   + +  ++P   FGC       P       GI G   G  S  
Sbjct: 123 GVVTGTLTRDTLRVHEGPARVTKDIPKFCFGCVGSTYHEP------IGIAGFVRGTLSFP 176

Query: 181 SQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAGAGVVS-------TPLI----IRDHYYL 227
           SQ+G    G FS+C L  + ++  N    +V G   +S       TP++      ++YY+
Sbjct: 177 SQLGLLKKG-FSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYI 235

Query: 228 SLEAISVGN--------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ 279
            LEAI+VGN           EF S   G + +D+G   T LP  ++S L S+   +I   
Sbjct: 236 GLEAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIITYP 295

Query: 280 PVKGVGAEPGFSDVLCYNI--------SSQPKFPEVTIHF-RGADVKLSPSNLFRNISDE 330
               V    GF   LCY +             FP +T HF       L   N F  +S  
Sbjct: 296 RATEVEMRAGFD--LCYKVPCPNNRLTDDDNLFPSITFHFLNNVSFVLPQGNHFYAMSAP 353

Query: 331 -----IMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
                + C  F+       G A  V+G   Q N  I YD+E+  + F+P  C
Sbjct: 354 SNSTVVKCLLFQSMADSDYGPAG-VFGSFQQQNVQIVYDLEKERIGFQPMDC 404


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 161/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+           FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392

Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAAC 472


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 96/327 (29%), Positives = 145/327 (44%), Gaps = 48/327 (14%)

Query: 77  PLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLA 129
           P FD   SST    SC S+ C  ++ ++C          C Y++ Y      S ++G L 
Sbjct: 175 PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYND---KSVTTGLLE 231

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
            +  TF    G    +P V FGCG  N  +    S +TGI G G G  SL SQ+     G
Sbjct: 232 VDKFTF----GAGASVPGVAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---G 282

Query: 190 KFSYCLP-----DQGSSKINFGGIVAGAG---VVSTPLIIRDH----YYLSLEAISVGNQ 237
            FS+C        Q +  ++    +   G   V STPLI        YYLSL+ I+VG+ 
Sbjct: 283 NFSHCFTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVGST 342

Query: 238 RLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
           RL        +++ TG   +D+G   T LP + +  ++   +  IK   V G    P   
Sbjct: 343 RLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGP--- 399

Query: 292 DVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRG-GNANIV 344
              C++  SQ  P  P++ +HF GA + L   N    + D+    ++C A    G+    
Sbjct: 400 -YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERAT 458

Query: 345 YGRIMQINFLIGYDIEQAMVSFKPSRC 371
            G   Q N  + YD++  M+SF  ++C
Sbjct: 459 IGNFQQQNMHVLYDLQNNMLSFVAAQC 485



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 36/134 (26%), Positives = 59/134 (44%), Gaps = 16/134 (11%)

Query: 232 ISVGNQRLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
           I+VG+ RL        +++ TG   +D+G   T LP + +  ++   +  IK   V G  
Sbjct: 42  ITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA 101

Query: 286 AEPGFSDVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGG 339
             P      C++  SQ  P  P++ +HF GA + L   N    + D+    I+C A   G
Sbjct: 102 TGP----YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG 157

Query: 340 NANIVYGRIMQINF 353
           +   + G   Q N 
Sbjct: 158 DETTIIGNFQQQNM 171


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 163/383 (42%), Gaps = 67/383 (17%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSIS 91
           I L+ L IGTPP      +DTGS  +W QC         ++ PP  +FDP  SS+++ + 
Sbjct: 81  ILLVSLPIGTPPQTQQMILDTGSQLSWIQCHK----KVPRKPPPSSVFDPSLSSSFSVLP 136

Query: 92  CSSSQCA------VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           C+   C        + ++C +   C YS+ Y  G  A    GNL  E +TF+ +      
Sbjct: 137 CNHPLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAE---GNLVREKITFSRSQ----S 189

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
            P +I GC     A  +SD+K  GI+G+  G  S  SQ   +   KFSYC+P +      
Sbjct: 190 TPPLILGC-----AEESSDAK--GILGMNLGRLSFASQAKLT---KFSYCVPTRQVRPGF 239

Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS- 245
                        S    +  ++  +     P +    Y ++++ I +GNQ+L    S+ 
Sbjct: 240 TPTGSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAF 299

Query: 246 ------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-- 297
                  G   +D+G   T L  E ++ ++  +  ++ A+  KG     G SD +C+N  
Sbjct: 300 RPDPSGAGQTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGY-VYGGVSD-MCFNGN 357

Query: 298 -ISSQPKFPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
            I        +   F +G ++ +    +  ++   + C     S   G  +NI+ G   Q
Sbjct: 358 AIEIGRLIGNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASNII-GNFHQ 416

Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
            N  + +D+    V F  + C+ 
Sbjct: 417 QNIWVEFDLANRRVGFGKADCSR 439


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 110/422 (26%), Positives = 162/422 (38%), Gaps = 85/422 (20%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELD 70
           ++PK+  S++           Y + L+ GTPP      +DTGS   W  C     C   D
Sbjct: 71  KSPKTKFSLLKTPLFPRSYGGYSISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCD 130

Query: 71  CFKQEP---PLFDPKKSSTYNSISCSSSQCAVV---------------TSNCSEGDCSYS 112
               E    P F PK+SS+ N I C + +C+ +               T NC++    Y 
Sbjct: 131 FPNIEVTGIPTFIPKQSSSSNLIGCKNHKCSWLFGPKVQSKCQECDPTTQNCTQSCPPYV 190

Query: 113 FLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGL 172
             YG G+ A    G L +ETL F         +P  + GC   ++  P       GI G 
Sbjct: 191 IQYGLGSTA----GLLLSETLDFPHKK----TIPGFLVGCSLFSIRQP------EGIAGF 236

Query: 173 GPGNSSLISQMGTSIAGKFSYCL--------PDQGSSKINFGG---IVAGAGVVSTPL-- 219
           G    SL SQ+G     KFSYCL        P      ++ G         G+  TP   
Sbjct: 237 GRSPESLPSQLGLK---KFSYCLVSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQK 293

Query: 220 ----IIRDHYYLSLEAISVGNQRLE-----FVSSSTGN--IFVDTGVLRTLLPLEYHSNL 268
                 RD+YY+ L  I +G+  ++      V  S GN    VD+G   T +    +  +
Sbjct: 294 NPTAAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELV 353

Query: 269 -----KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLSP 320
                K V    +  +     G  P      C+NIS +     PE   HF+ GA + L  
Sbjct: 354 AKEFEKQVAHYTVATEVQNQTGLRP------CFNISGEKSVSVPEFIFHFKGGAKMALPL 407

Query: 321 SNLFRNISDEIMCSAFR---------GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +N F  +   ++C             GG   I+ G   Q NF + +D++     FK   C
Sbjct: 408 ANYFSFVDSGVICLTIVSDNMSGSGIGGGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467

Query: 372 TN 373
            +
Sbjct: 468 VS 469


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 161/372 (43%), Gaps = 43/372 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +    +DTGSD  W  C     CP+      E   FD   SST   I
Sbjct: 77  LYYTKVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALI 136

Query: 91  SCSSSQCAV----VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
            CS   C        + CS     CSY+F YG G   S +SG   ++ + F+   G P  
Sbjct: 137 PCSDPICTSRVQGAAAECSPRVNQCSYTFQYGDG---SGTSGYYVSDAMYFSLIMGQPPA 193

Query: 145 M---PNVIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQ 198
           +     ++FGC        T +D    GI G GPG  S++SQ+ +  I  K FS+CL   
Sbjct: 194 VNSSATIVFGCSISQSGDLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGD 253

Query: 199 GSSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGNIF 250
           G          +    +V +PL+  + HY L+L++I+V  Q L        +S++ G   
Sbjct: 254 GDGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPINPAVFSISNNRGGTI 313

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PKFPE 306
           VD G     L  E +  L + ++  +   A+     G +       CY +S+     FP 
Sbjct: 314 VDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ-------CYLVSTSIGDIFPS 366

Query: 307 VTIHFR-GADVKLSPSN-LFRN---ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIE 360
           V+++F  GA + L P   L  N      E+ C  F+       + G ++  + ++ YDI 
Sbjct: 367 VSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIA 426

Query: 361 QAMVSFKPSRCT 372
           Q  + +    C+
Sbjct: 427 QQRIGWANYDCS 438


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 166/395 (42%), Gaps = 49/395 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  +L F +     KS + I     +I     Y++   +GTPP  +  ++D   D  W
Sbjct: 2   AKDQARLQFLSSLVAKKSVVPIASGRGVIQSPS-YIVKAKVGTPPQTLLMALDNSYDAAW 60

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSN-CSEGDCSYSFLYGRGA 119
             C+ C  + C      +F+  KS+T+ ++ C + QC  V +  C    C+++  YG   
Sbjct: 61  IPCKGC--VGCSST---VFNTVKSTTFKTLGCGAPQCKQVPNPICGGSTCTWNTTYGSST 115

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
             S    NL  +T+  +        +P   FGC  K  A+ +S   Q G++G G G  S 
Sbjct: 116 ILS----NLTRDTIALSMD-----PVPYYAFGCIQK--ATGSSVPPQ-GLLGFGRGPLSF 163

Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
           +SQ        FSYCLP       S  +  G +     + +TPL+        YY+ L  
Sbjct: 164 LSQTQNLYKSTFSYCLPSFRTLNFSGSLRLGPVGQPPRIKTTPLLKNPRRSSLYYVKLNG 223

Query: 232 ISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           I VG + ++   S+         G IF D+G + T L    +  +++     +    V  
Sbjct: 224 IRVGRKIVDIPRSALAFNPTTGAGTIF-DSGTVFTRLVAPAYIAVRNEFRKRVGNATVSS 282

Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNLFRNISDEIM-CSAFRGGNAN 342
           +G   GF    CY++   P  P +T  F G +V + P NL  + +  +  C A      N
Sbjct: 283 LG---GFDT--CYSVPIVP--PTITFMFSGMNVTMPPENLLIHSTAGVTSCLAMAAAPDN 335

Query: 343 I-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           +     V   + Q N  I +D+  + +     +C+
Sbjct: 336 VNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/343 (28%), Positives = 143/343 (41%), Gaps = 87/343 (25%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELD-CFKQEPPLFDPKKSSTYNSISCS 93
           Y++ + +G+P V     +DTGSD +W QCEPCP    C      LFDP  SSTY + +CS
Sbjct: 106 YVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCS 165

Query: 94  SSQCAVV-----TSNC-SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           ++ CA +      + C ++  C Y   YG G                 ++T+G   +   
Sbjct: 166 AAACAQLGDSGEANGCDAKSRCQYIVKYGDG-----------------SNTTGTGFQ--- 205

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
             FGC H  L +   D K  G+IGLG    SL+SQ                 S K+    
Sbjct: 206 --FGCSHAELGAGM-DDKTDGLIGLGGDAQSLVSQTAAR-------------SKKVP--- 246

Query: 208 IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS--STGNIFVDTGVLRTLLPLEYH 265
                           +Y+ +LE I+VG ++L    S  + G++ VD+G + T LP   +
Sbjct: 247 ---------------TYYFAALEDIAVGGKKLGLSPSVFAAGSL-VDSGTVITRLPPAAY 290

Query: 266 SNLKSV----MSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--FPEVTIHFR-GADVKL 318
           + L S     M+   +A+P+       G  D  C+N +   K   P V + F  GA V L
Sbjct: 291 AALSSAFRAGMTRYARAEPL-------GILDT-CFNFTGLDKVSIPTVALVFAGGAVVDL 342

Query: 319 SPSNLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYD 358
               +         C AF   R   A    G + Q  F + YD
Sbjct: 343 DAHGIVSG-----GCLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 164/369 (44%), Gaps = 48/369 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
            Y + L+IG PP   F  VDTGS+ TW QC+ PC +  C +   PL+ P      + I C
Sbjct: 73  FYNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPCSQ--CSETPHPLYKPSN----DFIPC 126

Query: 93  SSSQCAVVTS----NCSE-GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
               CA +       C +   C Y   Y    Y++   G L  +    N T+G+ +++  
Sbjct: 127 KDPLCASLQPTDDYTCEDPNQCDYEIKYAD-QYSTL--GVLLNDVYLLNFTNGVQLKV-R 182

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
           +  GCG+  + SP++     GI+GLG G +SLISQ+ +   +     +CL  +G   I F
Sbjct: 183 MALGCGYDQIFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFF 242

Query: 206 GGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLP 261
           G +   + +  TP+   D    S +  S G   L F    TG    NI  DTG   T   
Sbjct: 243 GNVYDSSRMSWTPISSID----SGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFN 298

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN-------ISSQPK-FPEVTIHF-R 312
            + +  + S+++  +  +P+K    +      +C++       I+   K F  +T+ F  
Sbjct: 299 SQAYQAMISLLNKELHRKPIKAAPDDQTLP--MCWHGKRPFRSINEVKKYFKPLTLSFTN 356

Query: 313 GADVKLS---PSNLFRNISD--EIMCSAFRG-----GNANIVYGRIMQINFLIGYDIEQA 362
           G  VK     P   +  IS+   +      G     G  N++ G I  ++ ++ +D E+ 
Sbjct: 357 GGRVKPQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLI-GDISMLDKVMVFDNEKQ 415

Query: 363 MVSFKPSRC 371
           ++ + P+ C
Sbjct: 416 LIGWGPADC 424


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 49/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VDTGS  T+  C  C +  C K + P F P+ S++Y ++ C+ 
Sbjct: 76  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSTSYQALKCNP 133

Query: 95  SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                   NC  EG  C Y   Y   A  S SSG L+ + ++F + S L  +    +FGC
Sbjct: 134 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLSPQ--RAVFGC 182

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
            ++      S  +  GI+GLG G  S++ Q+     I   FS C    G  ++  G +V 
Sbjct: 183 ENEETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 238

Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
           G  +   P ++  H        Y + L+ + V  + L+    V +      +D+G     
Sbjct: 239 GK-ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAY 297

Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIHF 311
            P E    +K     +IK  P +K + G +P + DV C++ + +        FPE+ + F
Sbjct: 298 FPKEAFIAIKDA---VIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIAMEF 353

Query: 312 -RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
             G  + LSP N LFR+  +        F   ++  + G I+  N L+ YD E   + F 
Sbjct: 354 GNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFL 413

Query: 368 PSRCTN 373
            + C++
Sbjct: 414 KTNCSD 419


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 56/174 (32%), Positives = 85/174 (48%), Gaps = 15/174 (8%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E   +  +++ +  I+     YL+ L IGTPP     ++DT SD  WTQC+PC    C+ 
Sbjct: 68  EAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKFTAAIDTASDLIWTQCQPCT--GCYH 125

Query: 74  QEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD---CSYSFLYGRGAYASFSSGNLA 129
           Q  P+F+P+ SSTY ++ CSS  C  +    C   D   C Y++ Y   A    + G LA
Sbjct: 126 QVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHDDDESCQYTYTYSGNAT---TEGTLA 182

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM 183
            + L     +        V FGC   +        + +G++GLG G  SL+SQ+
Sbjct: 183 VDKLVIGEDA-----FRGVAFGCSTSSTGG-APPPQASGVVGLGRGPLSLVSQL 230


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/368 (28%), Positives = 166/368 (45%), Gaps = 46/368 (12%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           ++  Y   L IGTPP      VDTGS  T+  C  C +  C + + P F P  SSTY S+
Sbjct: 9   INGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQ--CGRHQDPKFQPDLSSTYQSV 66

Query: 91  SCSSSQCAVVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            C+      +  NC +    C Y   Y   A  S SSG L  + ++F + S L  +    
Sbjct: 67  KCN------IDCNCDDEKQQCVYERQY---AEMSTSSGVLGEDIISFGNLSALAPQ--RA 115

Query: 149 IFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKI 203
           +FGC  +N+ +    S+   GI+G+G G+ S++  +     I   FS C      G   +
Sbjct: 116 VFGC--ENMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAM 173

Query: 204 NFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVL 256
             GGI   + +V   S P  +R  YY + L+ I V  + L     V        +D+G  
Sbjct: 174 VLGGISPPSNMVFSQSDP--VRSPYYNIDLKEIHVAGKPLPLNPTVFDGKHGTILDSGTT 231

Query: 257 RTLLPLEYHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISS-QPKFPEVTI 309
              LP     + K ++M  +   +P++  G +P ++D+ C+     +IS     FP V +
Sbjct: 232 YAYLPEAAFVSFKDAIMKELHSLKPIR--GPDPNYNDI-CFSGAGSDISQLSSSFPAVEM 288

Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
            F  G  + LSP N LFR+  +        F+ G +   + G I+  N L+ YD E + +
Sbjct: 289 VFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKI 348

Query: 365 SFKPSRCT 372
            F  + C+
Sbjct: 349 GFWKTNCS 356


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 93.6 bits (231), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 164/366 (44%), Gaps = 49/366 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   L IGTPP +    VDTGS  T+  C  C +  C K + P F P+ S++Y ++ C+ 
Sbjct: 76  YTTRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQ--CGKHQDPKFQPELSTSYQALKCNP 133

Query: 95  SQCAVVTSNC-SEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                   NC  EG  C Y   Y   A  S SSG L+ + ++F + S L  +    +FGC
Sbjct: 134 D------CNCDDEGKLCVYERRY---AEMSSSSGVLSEDLISFGNESQLSPQ--RAVFGC 182

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVA 210
            ++      S  +  GI+GLG G  S++ Q+     I   FS C    G  ++  G +V 
Sbjct: 183 ENEETGDLFS-QRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLCY---GGMEVGGGAMVL 238

Query: 211 GAGVVSTPLIIRDH--------YYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTL 259
           G  +   P ++  H        Y + L+ + V  + L+    V +      +D+G     
Sbjct: 239 GK-ISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFNGKHGTVLDSGTTYAY 297

Query: 260 LPLEYHSNLKSVMSNMIKAQP-VKGV-GAEPGFSDVLCYNISSQPK------FPEVTIHF 311
            P E    +K     +IK  P +K + G +P + DV C++ + +        FPE+ + F
Sbjct: 298 FPKEAFIAIKDA---VIKEIPSLKRIHGPDPNYDDV-CFSGAGRDVAEIHNFFPEIAMEF 353

Query: 312 -RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFK 367
             G  + LSP N LFR+  +        F   ++  + G I+  N L+ YD E   + F 
Sbjct: 354 GNGQKLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRENDKLGFL 413

Query: 368 PSRCTN 373
            + C++
Sbjct: 414 KTNCSD 419


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 50/376 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP +++  +DTGSD  W     C  CP+    + +   FDP  SST + I
Sbjct: 76  LYYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLI 135

Query: 91  SCSSSQC--AVVTSNCS----EGDCSYSFLYGRGAYAS-------FSSGNLATETLTFNS 137
           SC   +C   V TS+ S       C+Y+F YG G+  S           ++   TLT NS
Sbjct: 136 SCLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNS 195

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYC 194
           ++       +V+FGC        T   +   GI G G    S+ISQ+ +  IA + FS+C
Sbjct: 196 SA-------SVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHC 248

Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSST 246
           L   + G   +  G IV    +V +PL+  + HY L+L++ISV  Q +      F +S+ 
Sbjct: 249 LKGDNSGGGVLVLGEIVE-PNIVYSPLVPSQPHYNLNLQSISVNGQIVRIAPSVFATSNN 307

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
               VD+G     L  E ++     ++ +I  Q V+ V +        CY I++      
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVIAIAAVIP-QSVRSVLSRGN----QCYLITTSSNVDI 362

Query: 304 FPEVTIHFR-GADVKLSPSNLF--RNISDE--IMCSAFR--GGNANIVYGRIMQINFLIG 356
           FP+V+++F  GA + L P +    +N   E  + C  F+   G +  + G ++  + +  
Sbjct: 363 FPQVSLNFAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFV 422

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+    + +    C+
Sbjct: 423 YDLAGQRIGWANYDCS 438


>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 336

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/358 (27%), Positives = 154/358 (43%), Gaps = 54/358 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   LSIG PP+     +DT SD  W  C              LFDP KSST++ +    
Sbjct: 9   YWSILSIGQPPIPQLVIMDTSSDILWIMC---------NHVGLLFDPSKSSTFSPL--CK 57

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
           + C      C     + S++       S +SG   ++T+ F +T     ++ +V+  CGH
Sbjct: 58  TPCGFKGCKCDPIPFNISYV-----DKSSTSGTFGSDTVVFETTDEGHSQIFDVLVRCGH 112

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGV 214
            N+   T D    GI GL  G +SL +++G     KFSYC+ +      N+  ++   G 
Sbjct: 113 -NIGFNT-DPGYNGIRGLNNGPNSLATKIGQ----KFSYCVGNLADPYYNYNQLILCEGA 166

Query: 215 ----VSTPLIIRD-HYYLSLEAISVGNQRL-------EFVSSSTGNIFVDTGVLRTLLPL 262
                STP  +    YY++L+ I VG +RL       E   ++TG +  D+G   T L  
Sbjct: 167 DLEGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYLVD 226

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFR-GADVKLS 319
             H  L + + N++             F  +  Y I S+    FP VT HF  GAD+ L 
Sbjct: 227 SVHKLLYNEVRNLLSWS----------FRQLCHYGIISRDLVGFPVVTFHFADGADLALD 276

Query: 320 PSNLFRNISDEIMCSAFRGG---NANI---VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             + F  + + I+C         N  I   V   + Q ++ +GYD+    V F+   C
Sbjct: 277 TGSFFNQL-NSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 155/375 (41%), Gaps = 46/375 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQC---EPCPELDCFKQEPPLFDPKKSSTYNSIS 91
           Y + L +GTP        DTGSD TW +C                 +F P  S +++ + 
Sbjct: 104 YFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLP 163

Query: 92  CSSSQCAVVT----SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL-PVE 144
           C S  C        +NCS     CSY + Y   + A    G L + T++ +   G    +
Sbjct: 164 CDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVG-LDSATVSLSGNDGTRKAK 222

Query: 145 MPNVIFGC--GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PD 197
           +  V+ GC   +   +  +SD    G++ LG  N S  S+  +   G+FSYCL     P 
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSD----GVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPR 278

Query: 198 QGSSKINFGGIVAGAGVV----STPLII------RDHYYLSLEAISVGNQRLEFVS---- 243
             +S + FG   +  G       TPL++      R  Y++S++A++V  +RLE +     
Sbjct: 279 NATSFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEILPDVWD 338

Query: 244 -SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS-Q 301
               G   +D+G   T+L    +  +   +S      P   V  +P F    CYN +   
Sbjct: 339 FRKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVP--RVNMDP-FE--YCYNWTGVS 393

Query: 302 PKFPEVTIHFRGADVKLSPSNLFR-NISDEIMC-SAFRGGNANI-VYGRIMQINFLIGYD 358
            + P + + F GA     P   +  + +  + C     G    + V G I+Q   L  +D
Sbjct: 394 AEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFD 453

Query: 359 IEQAMVSFKPSRCTN 373
           +    + FK SRC +
Sbjct: 454 LANRWLRFKQSRCAH 468


>gi|357510893|ref|XP_003625735.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355500750|gb|AES81953.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 535

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 120/252 (47%), Gaps = 25/252 (9%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+P  + +  +DTGSD  W  C     CP+      +   FD   SST   +
Sbjct: 70  LYFTKVKMGSPAKEFYVQIDTGSDILWLNCNTCNNCPKSSGLGIDLNYFDTASSSTAALV 129

Query: 91  SCSSSQCA----VVTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV- 143
           SCS   C+      TS CS     CSY+F YG G   S +SG    + + F+   G  V 
Sbjct: 130 SCSDPVCSYAVQTATSQCSSQANQCSYTFQYGDG---SGTSGYYVYDAMYFDVIMGQSVF 186

Query: 144 --EMPNVIFGCG-HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQ 198
                 V+FGC  +++     ++    GI G GPG  S++SQ+ +  +A K FS+CL  Q
Sbjct: 187 SNSSSTVVFGCSTYQSGDLARTEKAVDGIFGFGPGALSVVSQVSSQGMAPKVFSHCLKGQ 246

Query: 199 GS-SKINFGGIVAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
           GS   I   G +    +V TPL+ ++ HY L+L++I+V  Q L      F + +     V
Sbjct: 247 GSGGGILVLGEILEPNIVYTPLVPLQPHYNLNLQSIAVNGQILPIDQDVFATGNNRGTIV 306

Query: 252 DTGVLRTLLPLE 263
           D+G     L  E
Sbjct: 307 DSGTTLAYLVQE 318


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 167/371 (45%), Gaps = 49/371 (13%)

Query: 27  EIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSST 86
           ++++V + Y++ + +GTP   ++  +DT +D  W  C  C  + C       F  + SST
Sbjct: 88  QVLNVGN-YVVRVQLGTPGQTMYMVLDTSNDAAWAPCSGC--IGCSSTT--TFSAQNSST 142

Query: 87  YNSISCSSSQC----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           + ++ CS  +C     +        DC ++  YG    ++FS+      TL  +S    P
Sbjct: 143 FATLDCSKPECTQARGLSCPTTGNVDCLFNQTYG--GDSTFSA------TLVQDSLHLGP 194

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG--- 199
             +PN  FGC   + AS +S   Q G++GLG G  SLISQ G+  +G FSYCLP      
Sbjct: 195 NVIPNFSFGC--ISSASGSSIPPQ-GLMGLGRGPLSLISQSGSLYSGLFSYCLPSFKSYY 251

Query: 200 -SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTG 247
            S  +  G +     + +TPL+   H    YY++L  ISVG        + L F  ++  
Sbjct: 252 FSGSLKLGPVGQPKAIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGA 311

Query: 248 NIFVDTG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPE 306
              +D+G V+   +P  Y     + + +  + Q V G  +  G  D  C+  +++   P 
Sbjct: 312 GTIIDSGTVITRFVPAIY-----TAVRDEFRKQ-VGGSFSPLGAFDT-CFATNNEVSAPA 364

Query: 307 VTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIE 360
           +T+H  G D+KL   N L  + +  + C A         +   V   + Q N  I +DI 
Sbjct: 365 ITLHLSGLDLKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDIN 424

Query: 361 QAMVSFKPSRC 371
            + +      C
Sbjct: 425 NSKLGIARELC 435


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 108/399 (27%), Positives = 154/399 (38%), Gaps = 82/399 (20%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP--------CPELDCFKQEPPLFDPKKSST 86
           Y + L++GTPP      +DTGS   W  C           P +D  K   P F PK SST
Sbjct: 92  YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKI--PTFIPKNSST 149

Query: 87  YNSISCSSSQCAVV---------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
              + C + +C  +               + NCS    +Y   YG G+ A F    L  +
Sbjct: 150 AKLLGCRNPKCGYIFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGSTAGF----LLLD 205

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
            L F   +     +P  + GC   ++  P      +GI G G G  SL SQM      +F
Sbjct: 206 NLNFPGKT-----VPQFLVGCSILSIRQP------SGIAGFGRGQESLPSQMNLK---RF 251

Query: 192 SYCL-------PDQGSS---KINFGGIVAGAGVVSTPL---------IIRDHYYLSLEAI 232
           SYCL         Q S    +I+  G     G+  TP            +++YYL+L  +
Sbjct: 252 SYCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKV 311

Query: 233 SVGNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
            VG +        LE  S   G   VD+G   T +    ++ +       ++    +   
Sbjct: 312 IVGGKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAED 371

Query: 286 AEPGFSDVLCYNIS--SQPKFPEVTIHFRGADVKLSP-SNLFRNISD-EIMC-SAFRGGN 340
           AE       C+NIS      FPE+T  F+G      P  N F  + D E++C +    G 
Sbjct: 372 AETQSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGG 431

Query: 341 AN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A         I+ G   Q NF I YD+E     F P  C
Sbjct: 432 AGPPKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/355 (26%), Positives = 150/355 (42%), Gaps = 30/355 (8%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS--SQ 96
           L +GTP       +DTGS  T+  C+ C    C K     FDP KS+T   ++C      
Sbjct: 17  LKLGTPERTFSVIIDTGSTITYIPCKDCSH--CGKHTAEWFDPDKSTTAKKLACGDPLCN 74

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKN 156
           C   +  C+   C YS  Y   A  S S G +  +T  F  +   PV +   +FGC +  
Sbjct: 75  CGTPSCTCNNDRCYYSRTY---AERSSSEGWMIEDTFGFPDSDS-PVRL---VFGCENGE 127

Query: 157 LASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCL--PDQGSSKINFGGIVAGA 212
                      GI+G+G  +++  SQ+     I   FS C   P  G   +    +  GA
Sbjct: 128 TGE-IYRQMADGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYPKDGILLLGDVTLPEGA 186

Query: 213 GVVSTPLI--IRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---VDTGVLRTLLPLEYHS 266
             V TPL+  +  HYY + ++ I+V  Q L F +S     +   +D+G   T LP +   
Sbjct: 187 NTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGYGTVLDSGTTFTYLPTDAFK 246

Query: 267 NLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQPK-----FPEVTIHF-RGADVKLS 319
            +   + + ++ + ++   GA+P ++D+       Q K     FP     F  GA + L 
Sbjct: 247 AMAKAVGDYVEKKGLQSTPGADPQYNDICWKGAPDQFKDLDKYFPPAEFVFGGGAKLTLP 306

Query: 320 PSN-LFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           P   LF +   E     F  GN+  + G +   + ++ YD   + V F    C +
Sbjct: 307 PLRYLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGFTTMACAD 361


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/311 (30%), Positives = 143/311 (45%), Gaps = 37/311 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTPP      VDTGS  T+  C  C +  C + + P F+P+ SSTY  +SC+ 
Sbjct: 90  YTTRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQ--CGRHQDPKFEPELSSTYQPVSCNI 147

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
                 T +     C Y   Y   A  S SSG L  + ++F + S L  +    IFGC +
Sbjct: 148 D----CTCDNERKQCVYERQY---AEMSSSSGVLGEDIISFGNQSELVPQ--RAIFGCEN 198

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
           +      S  +  GI+GLG G+ S++ Q+     I+  FS C    D G   +  GGI  
Sbjct: 199 QETGDLYS-QRADGIMGLGRGDLSIVDQLVEKGVISDSFSLCYGGMDIGGGAMILGGISP 257

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLE 263
            +G+V   S P  +R  YY + L+AI V  ++L     +        +D+G     LP  
Sbjct: 258 PSGMVFAESDP--VRSQYYNIDLKAIHVAGKQLHLDPSIFDGKHGTVLDSGTTYAYLPEA 315

Query: 264 YHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCYNISSQP------KFPEVTIHF-RGAD 315
             +  K  M  M +   +K + G +P ++D+ C++ +          FP V + F  G  
Sbjct: 316 AFTAFKDAM--MKELTSLKQIHGPDPNYNDI-CFSGAESDVSQLSNTFPAVEMVFSNGQK 372

Query: 316 VKLSPSN-LFR 325
           + LSP N LF+
Sbjct: 373 LSLSPENYLFQ 383


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 111/399 (27%), Positives = 157/399 (39%), Gaps = 82/399 (20%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
           Y + L++GTPP      +DTGS   W  C     C   +    +P   P F PK SST  
Sbjct: 88  YSIDLNLGTPPQTSPFVLDTGSSLVWFPCTSHYLCSHCNFPNIDPTKIPTFIPKNSSTAK 147

Query: 89  SISCSSSQCAVV----------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
            + C + +C  +                + NCS    SY   YG GA A F    L  + 
Sbjct: 148 LLGCRNPKCGYLFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGATAGF----LLLDN 203

Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           L F   +     +P  + GC   ++  P      +GI G G G  SL SQM      +FS
Sbjct: 204 LNFPGKT-----VPQFLVGCSILSIRQP------SGIAGFGRGQESLPSQMNLK---RFS 249

Query: 193 YCL-------PDQGSS---KINFGGIVAGAGVVSTPL--------IIRDHYYLSLEAISV 234
           YCL         Q S    +I+  G     G+  TP         + R++YY++L  + V
Sbjct: 250 YCLVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIV 309

Query: 235 GN-------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVG 285
           G        + LE  S   G   VD+G   T +    ++ +       +  K    + V 
Sbjct: 310 GGVDVKIPYKFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVE 369

Query: 286 AEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSP-SNLFRNISD-EIMC-SAFRGGN 340
           A+ G S   C+NIS      FPE T  F+G      P  N F  + D E++C +    G 
Sbjct: 370 AQSGLSP--CFNISGVKTISFPEFTFQFKGGAKMSQPLLNYFSFVGDAEVLCFTVVSDGG 427

Query: 341 AN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           A         I+ G   Q NF + YD+E     F P  C
Sbjct: 428 AGQPKTAGPAIILGNYQQQNFYVEYDLENERFGFGPRNC 466


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 105/382 (27%), Positives = 167/382 (43%), Gaps = 40/382 (10%)

Query: 14  ETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK 73
           E+ + P + +   + + ++  Y   L IGTPP      VDTGS  T+  C  C    C +
Sbjct: 68  ESKRHPNARMRLYDDLLINGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTCEH--CGR 125

Query: 74  QEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            + P F P  S TY  + C+         NC +GD +      + A  S SSG L  + +
Sbjct: 126 HQDPKFQPDLSETYQPVKCTPD------CNC-DGDTNQCMYDRQYAEMSSSSGVLGEDVV 178

Query: 134 TFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKF 191
           +F + S L  +    +FGC +       S  +  GI+GLG G+ S++ Q+     I+  F
Sbjct: 179 SFGNLSELAPQ--RAVFGCENDETGDLYS-QRADGIMGLGRGDLSIMDQLVDKKVISDSF 235

Query: 192 SYCL--PDQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---V 242
           S C    D G   +  GGI     +V   S P   R  YY ++L+ + V  ++L+    V
Sbjct: 236 SLCYGGMDVGGGAMILGGISPPEDMVFTHSDP--DRSPYYNINLKEMHVAGKKLQLNPKV 293

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY----- 296
                   +D+G     LP       K  +  M +   +K + G +P + D+ C+     
Sbjct: 294 FDGKHGTVLDSGTTYAYLPETAFLAFKRAI--MKERNSLKQINGPDPNYKDI-CFTGAGI 350

Query: 297 NISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQ 350
           ++S   K FP V + F  G  + LSP N LFR+  +        F  G +   + G I  
Sbjct: 351 DVSQLAKSFPVVDMVFENGHKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFV 410

Query: 351 INFLIGYDIEQAMVSFKPSRCT 372
            N L+ YD E + + F  + C+
Sbjct: 411 RNTLVMYDRENSKIGFWKTNCS 432


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP V  +  +DTGS   W     C+ CP      ++   +DP+ S +   +
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117

Query: 91  SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
            C  + C           C Y   Y  G     + G L T+ L ++   G     P   +
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 174

Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
           V FGCG +   S  + +    GIIG G  N + +SQ+  + AGK    FS+CL       
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 232

Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
           I   G V    V +TP++  +  Y  ++L++I+V    L+     F ++ T   F+D+G 
Sbjct: 233 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
               LP   +S L   +  +    P   +GA   F    C++   S   KFP++T HF  
Sbjct: 293 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 346

Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
            D+ L   P +          C  F+     G    I+ G ++  N ++ YD+E+  + +
Sbjct: 347 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 405


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP V  +  +DTGS   W     C+ CP      ++   +DP+ S +   +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 91  SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
            C  + C           C Y   Y  G     + G L T+ L ++   G     P   +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
           V FGCG +   S  + +    GIIG G  N + +SQ+  + AGK    FS+CL       
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256

Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
           I   G V    V +TP++  +  Y  ++L++I+V    L+     F ++ T   F+D+G 
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
               LP   +S L   +  +    P   +GA   F    C++   S   KFP++T HF  
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 370

Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
            D+ L   P +          C  F+     G    I+ G ++  N ++ YD+E+  + +
Sbjct: 371 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 429


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 96/382 (25%), Positives = 159/382 (41%), Gaps = 64/382 (16%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSISCSS 94
           L++GTPP ++   +DTGS+ +W  C    +              F P+ S+T+ ++ C S
Sbjct: 67  LAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPCGS 126

Query: 95  SQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           +QC      A  + + +   C  S  Y  G   S S G LAT+         L       
Sbjct: 127 TQCSSRDLPAPPSCDGASRQCHVSLSYADG---SASDGALATDVFAVGEAPPL-----RS 178

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GS 200
            FGC      S        G++G+  G  S ++Q  T    +FSYC+ D+        G 
Sbjct: 179 AFGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTR---RFSYCISDRDDAGVLLLGH 235

Query: 201 SKINFGGI---VAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-------STGNIF 250
           S + F  +         +  P   R  Y + L  I VG + L   +S         G   
Sbjct: 236 SDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGAGQTM 295

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS--DVL--CYNI-SSQP--- 302
           VD+G   T L  + +S LK+    + + +P+     +P F+  + L  C+ + + +P   
Sbjct: 296 VDSGTQFTFLLGDAYSALKAEF--LKQTKPLLRALDDPSFAFQEALDTCFRVPAGRPPPS 353

Query: 303 -KFPEVTIHFRGADVKLSPSNLFRNI------SDEIMCSAFRGGNANI------VYGRIM 349
            + P VT+ F GA++ ++   L   +      +D + C  F  GNA++      V G   
Sbjct: 354 ARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF--GNADMVPLTAYVIGHHH 411

Query: 350 QINFLIGYDIEQAMVSFKPSRC 371
           Q+N  + YD+E+  V   P +C
Sbjct: 412 QMNLWVEYDLERGRVGLAPVKC 433


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/173 (34%), Positives = 86/173 (49%), Gaps = 19/173 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y + +  G+P       VDTGS  +W QC+PC  + C  Q  PLFDP  S TY S+SC+S
Sbjct: 118 YYVKVGFGSPARYYSMIVDTGSSLSWLQCKPC-VVYCHVQADPLFDPSASKTYKSLSCTS 176

Query: 95  SQCAVVTS--------NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           SQC+ +            S   C Y+  YG    +S+S G L+ + LT   +      +P
Sbjct: 177 SQCSSLVDATLNNPLCETSSNVCVYTASYGD---SSYSMGYLSQDLLTLAPSQ----TLP 229

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
             ++GCG     S     +  GI+GLG    S++ Q+ +     FSYCLP +G
Sbjct: 230 GFVYGCGQD---SDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRG 279


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 145/326 (44%), Gaps = 36/326 (11%)

Query: 12  DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
           D +T   PI+   Q   I+    Y++ + +GTP   +F  +DT +D  W  C  C    C
Sbjct: 25  DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 79

Query: 72  FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
                  F P  S+T  S+ CS +QC+ V         S + L+ +    S+   +    
Sbjct: 80  SSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLAA 132

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
           TL  ++ +     +P   FGC   N  S  S   Q G++GLG G  SLISQ G   +G F
Sbjct: 133 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 189

Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
           SYCLP       S  +  G +     + +TPL+   H    YY++L  +SVG       +
Sbjct: 190 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           ++L F  ++     +D+G + T      +  ++      +   P+  +GA   F    C+
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 303

Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN 322
             +++ + P VT+HF G ++ L   N
Sbjct: 304 AATNEAEAPAVTLHFEGLNLVLPMEN 329


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 155/371 (41%), Gaps = 46/371 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQ---------EPPLFDPKKSS 85
           Y   + IGTPP +    VDTGS  T+  C  C      +            P F P+ SS
Sbjct: 40  YTSRVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSS 99

Query: 86  TYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           +Y  I C SS C     + +   C Y  +Y   A  S S G L  + L F   S L  ++
Sbjct: 100 SYQKIGCRSSDCITGLCDSNSHQCKYERMY---AEMSTSKGVLGKDLLDFGPASRLQSQL 156

Query: 146 PNVIFGCGHK---NLASPTSDSKQTGIIGLGPGNSSLISQM--GTSIAGKFSYCLP--DQ 198
             + FGC      +L    +D    GI+GLG G  S++ Q+    +I   FS C    D+
Sbjct: 157 --LSFGCETAESGDLYLQVAD----GIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDE 210

Query: 199 GSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEFVSSSTGNIF---V 251
           G   +  G I A +G+V   S P   R +YY L L  I V    L+  S+     F   +
Sbjct: 211 GGGSMVLGAIPAPSGMVFAKSDPR--RSNYYNLELTEIQVQGASLKLDSNVFNGKFGTIL 268

Query: 252 DTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK-----FP 305
           D+G     LP   + +   +V++ +   Q V   G +P + D+      +  K     FP
Sbjct: 269 DSGTTYAYLPDRAFEAFTDAVVAQLGSLQAVD--GPDPNYPDICYAGAGTDTKELGKHFP 326

Query: 306 EVTIHF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
            V   F     V L+P N LF++  +        F+  +A  + G I+  N L+ YD   
Sbjct: 327 LVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIIVRNMLVTYDRYN 386

Query: 362 AMVSFKPSRCT 372
             + F  + CT
Sbjct: 387 HQIGFLKTNCT 397


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 153/360 (42%), Gaps = 39/360 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP V  +  +DTGS   W     C+ CP      ++   +DP+ S +   +
Sbjct: 58  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 117

Query: 91  SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
            C  + C           C Y   Y  G     + G L T+ L ++   G     P   +
Sbjct: 118 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 174

Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
           V FGCG +   S  + +    GIIG G  N + +SQ+  + AGK    FS+CL       
Sbjct: 175 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 232

Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
           I   G V    V +TP++  +  Y  ++L++I+V    L+     F ++ T   F+D+G 
Sbjct: 233 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFRG 313
               LP   +S L   +  +    P   +GA   F    C++   S   KFP++T HF  
Sbjct: 293 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFEN 346

Query: 314 ADVKLS--PSNLFRNISDEIMCSAFR-----GGNANIVYGRIMQINFLIGYDIEQAMVSF 366
            D+ L   P +          C  F+     G    I+ G ++  N ++ YD+E+  + +
Sbjct: 347 -DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIGW 405


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 89/326 (27%), Positives = 145/326 (44%), Gaps = 36/326 (11%)

Query: 12  DNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDC 71
           D +T   PI+   Q   I+    Y++ + +GTP   +F  +DT +D  W  C  C    C
Sbjct: 25  DQKTTAVPIAPGQQVLKIAN---YVVRVKLGTPGQQMFMVLDTSNDAAWVPCSGC--TGC 79

Query: 72  FKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE 131
                  F P  S+T  S+ CS +QC+ V         S + L+ +    S+   +    
Sbjct: 80  SSTT---FLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ----SYGGDSSLAA 132

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKF 191
           TL  ++ +     +P   FGC   N  S  S   Q G++GLG G  SLISQ G   +G F
Sbjct: 133 TLVQDAITLANDVIPGFTFGC--INAVSGGSIPPQ-GLLGLGRGPISLISQAGAMYSGVF 189

Query: 192 SYCLPDQG----SSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVG-------N 236
           SYCLP       S  +  G +     + +TPL+   H    YY++L  +SVG       +
Sbjct: 190 SYCLPSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPS 249

Query: 237 QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY 296
           ++L F  ++     +D+G + T      +  ++      +   P+  +GA   F    C+
Sbjct: 250 EQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PISSLGA---FDT--CF 303

Query: 297 NISSQPKFPEVTIHFRGADVKLSPSN 322
             +++ + P VT+HF G ++ L   N
Sbjct: 304 AETNEAEAPAVTLHFEGLNLVLPMEN 329


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 92/315 (29%), Positives = 139/315 (44%), Gaps = 47/315 (14%)

Query: 77  PLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGD------CSYSFLYGRGAYASFSSGNLA 129
           P FD   SST    SC S+ C  ++ ++C          C Y++ Y      S ++G + 
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYND---KSVTTGLIE 79

Query: 130 TETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG 189
            +  TF    G    +P V FGCG  N  +    S +TGI G G G  SL SQ+     G
Sbjct: 80  VDKFTF----GAGASVPGVAFGCGLFN--NGVFKSNETGIAGFGRGPLSLPSQLKV---G 130

Query: 190 KFSYCLP-----DQGSSKINFGGIVAGAG---VVSTPLIIRD----HYYLSLEAISVGNQ 237
            FS+C        Q +  ++    +   G   V STPLI        YYLSL+ I+VG+ 
Sbjct: 131 NFSHCFTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGST 190

Query: 238 RLEF------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
           RL        +++ TG   +D+G   T LP + +  ++   +  IK   V G    P   
Sbjct: 191 RLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGP--- 247

Query: 292 DVLCYNISSQ--PKFPEVTIHFRGADVKLSPSNLFRNISDE----IMCSAFRGGNANIVY 345
              C++  SQ  P  P++ +HF GA + L   N    + D+    I+C A   G+   + 
Sbjct: 248 -YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDETTII 306

Query: 346 GRIMQINFLIGYDIE 360
           G   Q N  + YD++
Sbjct: 307 GNFQQQNMHVLYDLQ 321


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 160/392 (40%), Gaps = 64/392 (16%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTY 87
           Y +   +GTP        DTGSD TW +C             D        F P+ S T+
Sbjct: 97  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTW 156

Query: 88  NSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATE--TLTFNSTS 139
             ISC+S  C      ++ T       C+Y + Y  G+ A    G + TE  T+  +   
Sbjct: 157 APISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAA---RGTVGTESATIALSGRE 213

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL---- 195
               ++  ++ GC   +   P+ ++   G++ LG    S  S   +   G+FSYCL    
Sbjct: 214 ERKAKLKGLVLGC-SSSYTGPSFEASD-GVLSLGYSGISFASHAASRFGGRFSYCLVDHL 271

Query: 196 -PDQGSSKINFG--------------GIVAGAGVVSTPLII----RDHYYLSLEAISVGN 236
            P   +S + FG                 A      TPL++    R  Y +SL+AISV  
Sbjct: 272 SPRNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAG 331

Query: 237 QRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS 291
           + L+   +     + G + +D+G   T+L    +  + + +S  +   P   V  +P F 
Sbjct: 332 EFLKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPR--VTMDP-FE 388

Query: 292 DVLCYNISSQP------KFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGG--NA 341
              CYN +S          P++ +HF GA  +L P   +   + +  + C   + G    
Sbjct: 389 --YCYNWTSPSGKDADVAVPKMAVHFAGA-ARLEPPGKSYVIDAAPGVKCIGLQEGPWPG 445

Query: 342 NIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
             V G I+Q   L  +DI+   + F+ SRCT+
Sbjct: 446 ISVIGNILQQEHLWEFDIKNRRLKFQRSRCTH 477


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 95/402 (23%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTYNSIS 91
           +++G PP ++   +DTGS+ +W +C        P P      Q P  F+   SSTY +  
Sbjct: 66  VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPP------QAPAAFNGSASSTYAAAH 119

Query: 92  CSSSQCAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           CSS +C        V   C+      C  S  Y   A AS + G LA +T         P
Sbjct: 120 CSSPECQWRGRDLPVPPFCAGPPSNSCRVSLSY---ADASSADGILAADTFLLGGAP--P 174

Query: 143 VEMPNVIFGCG---HKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
           V     +FGC        A+ +SDS+  TG++G+  G+ S ++Q  T    +F+YC+ P 
Sbjct: 175 VR---ALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPG 228

Query: 198 QGSSKINFGGIVAGAGVVS----TPLII---------RDHYYLSLEAISVGNQRLEFVSS 244
            G   +  GG   GA +      TPLI          R  Y + LE I VG   L    S
Sbjct: 229 DGPGLLVLGG--DGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKS 286

Query: 245 -------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--- 294
                    G   VD+G   T L  + ++ LK    N   A     + A  G SD +   
Sbjct: 287 VLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA-----LLAPLGESDFVFQG 341

Query: 295 ----CYNIS------SQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSA 335
               C+  S      +    PEV +  RGA+V +    L   +  E         + C  
Sbjct: 342 AFDACFRASEARVAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLT 401

Query: 336 FRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F  GN+++      V G   Q N  + YD++   V F P+RC
Sbjct: 402 F--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 162/358 (45%), Gaps = 62/358 (17%)

Query: 46  VDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCS 105
           +D+    DT SD  WTQC+PC  L C  Q   ++DP K+ TY +++ S            
Sbjct: 1   MDVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSS------------ 46

Query: 106 EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSK 165
               +Y++ Y +    SF+SG  ATET    +     V + N+ FGCG +N     + + 
Sbjct: 47  ----NYNYTYSK---QSFTSGYFATETFALGN-----VTVANITFGCGTRNQGYYDNVAG 94

Query: 166 QTGIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGG-----------IVAG 211
             G+        SL++Q+G     +FSYC       GSS +  GG             A 
Sbjct: 95  VFGVGRG---GVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAAS 148

Query: 212 AGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTGN-----IFVDTGVLRTLLPLEYHS 266
             +V+ P +++  Y++ L  ++VG  R++   +S+       + +D+    T+L    + 
Sbjct: 149 TPMVADP-VLKSGYFVKLVGVTVGATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYG 207

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS---SQPKFPEV--TIHFRG--ADVKLS 319
            ++  +   +         A  G    LC+ ++   + P  P V  T+HF G  AD+ L 
Sbjct: 208 PVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLP 267

Query: 320 PSN-LFRNISDEIMCSAFRGGNAN--IVYGRIMQINFLIGYDIEQAMVSFKPSRCTNY 374
           P+N L ++ +  ++C      ++N   V G    ++ L+ YD+ + +VSF+P  C  +
Sbjct: 268 PANYLAKDSAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDCAAF 325


>gi|413919745|gb|AFW59677.1| hypothetical protein ZEAMMB73_406599 [Zea mays]
          Length = 246

 Score = 91.7 bits (226), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 102/203 (50%), Gaps = 27/203 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP---LFDPKKSSTYNSI 90
           ++LM ++IGTPPV     +DTGS  +W QC PC E  C KQ      +FDP +S+T+   
Sbjct: 41  LFLMPINIGTPPVMNLVGIDTGSTLSWVQCRPC-EPHCHKQAAKAGQIFDPSRSTTFRRA 99

Query: 91  SCSSSQCAVVT-------SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGL 141
            C+S +C VV        +NC E    C YS +Y  G +A +++  +  + L   +   L
Sbjct: 100 GCNSRECFVVKDALKLEFANCMEKVNTCLYSMIY-EGGWA-YTASKVVWDNLIIGTNISL 157

Query: 142 PVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLPDQGS 200
                + +FGC   +L     + K+ G +G G  + S   Q+ + I  K FSYCLP   +
Sbjct: 158 -----SFMFGC---SLDVEYGNYKEAGTVGFGTTSISFFEQVSSQINYKAFSYCLPSNET 209

Query: 201 SK--INFGGIVA-GAGVVSTPLI 220
           +   +N G     GA V+ TPL 
Sbjct: 210 TTGYMNLGDYSGQGAHVLYTPLF 232


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 160/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+            SYCL
Sbjct: 227 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL 277

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 278 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 334

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 335 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 392

Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  
Sbjct: 393 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 452

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 453 SFGTTFDIQGKQFGFKYAVC 472


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 106/380 (27%), Positives = 160/380 (42%), Gaps = 58/380 (15%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 113 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 171

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 172 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 228

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+            SYCL
Sbjct: 229 -----FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCL 279

Query: 196 P-DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNI 249
           P D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  +
Sbjct: 280 PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEM 336

Query: 250 FVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------ 296
            VD+G  RT L P  +    K++   M      +   A       +CY            
Sbjct: 337 IVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGT 394

Query: 297 --NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQI 351
               S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  
Sbjct: 395 ITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTR 454

Query: 352 NFLIGYDIEQAMVSFKPSRC 371
           +F   +DI+     FK + C
Sbjct: 455 SFGTTFDIQGKQFGFKYAVC 474


>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
 gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
 gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
 gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
 gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
 gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
 gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
 gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
 gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
          Length = 357

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 160/379 (42%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q+    AG         FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 156/361 (43%), Gaps = 47/361 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++  S+GTPP  +  +VDT +D +W  C  C         P  FDP  S++Y ++ C S
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAP--FDPASSASYRTVPCGS 169

Query: 95  SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             CA    + C  G   C +S  Y   +  A+ S  +LA         +G  V+     F
Sbjct: 170 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGNAVKA--YTF 219

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
           GC  +  A+ T+   Q  +     G  S +SQ        FSYCLP   S  +NF G + 
Sbjct: 220 GCLQR--ATGTAAPPQGLLGLGR-GPLSFLSQTKDMYEATFSYCLPSFKS--LNFSGTLR 274

Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLR 257
                    + +TPL+   H    YY+++  I VG + +    F  ++     +D+G + 
Sbjct: 275 LGRNGQPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPAFDPATGAGTVLDSGTMF 334

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           T L    +  ++  +   + A PV  +G   GF    C+N ++   +P VT+ F G  V 
Sbjct: 335 TRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPVTLLFDGMQVT 387

Query: 318 LSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L   N+     +  IS   M +A  G N  + V   + Q N  + +D+    V F   RC
Sbjct: 388 LPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447

Query: 372 T 372
           T
Sbjct: 448 T 448


>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 416

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 101/378 (26%), Positives = 163/378 (43%), Gaps = 56/378 (14%)

Query: 28  IISVDDIYLMHLSIGTP---PVDIFGSVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKK 83
           + S   +Y + +SIGT     + + G +DT +  +W  CEPC P L    Q   LF P  
Sbjct: 60  LTSARFVYGVFVSIGTGQGFKLQVLG-LDTSTSMSWVMCEPCQPSL---PQAGHLFSPAA 115

Query: 84  SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNS---TSG 140
           S T++ +  +   C       + G CS+ F         F+SG L+ +T    +   + G
Sbjct: 116 SPTFHGVHSNDPVCTAPYRPTANG-CSFRF--------PFASGYLSRDTFHLRNGGLSGG 166

Query: 141 LPVE-MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD-- 197
            P+E +P ++FGC H ++A   +D    G++ L     SL++Q+     G+FSYCLP   
Sbjct: 167 APIESVPGIMFGCAH-SVAGFHNDGTLGGVLSLSHLRLSLLTQLSARAGGRFSYCLPKPT 225

Query: 198 QGSSKINFGGIVAGAGVVS-------TPLIIRD----HYYLSLEAISVGNQRLE-----F 241
           QG+     G +  GA V+        T L +R      YYLSL  I++  +RL      F
Sbjct: 226 QGNPH---GFLRLGADVLPPLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDPRVF 282

Query: 242 VSSSTGNIFVDTGVLRTLLPLEYH-------SNLKSVMSNMIKAQPVKGVGAEPGFSDVL 294
            +   G        +  ++   Y        + +K + S+ +K  P  G GA   F D +
Sbjct: 283 AAGRGGCSINPAATITAIMEPAYLVVERALVAYMKELGSDRVKKGPPGG-GAL--FFDRM 339

Query: 295 CYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINF 353
             ++  Q + P +  HF+ GA++  +P  LF              G    V G   Q+N 
Sbjct: 340 YKSV--QARLPSMAFHFKDGAELWFTPEQLFEVHGMVAWFMMVGKGYRRTVIGAPQQVNT 397

Query: 354 LIGYDIEQAMVSFKPSRC 371
              +D+    +SF    C
Sbjct: 398 RFTFDVAAGRLSFASELC 415


>gi|356537015|ref|XP_003537027.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 476

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 156/358 (43%), Gaps = 51/358 (14%)

Query: 52  VDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAV----VTSNC 104
           +DTGSD  W  C     CP+      E   FD   SST   I CS   C        + C
Sbjct: 85  IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDLICTSGVQGAAAEC 144

Query: 105 SE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM---PNVIFGCGHKNLAS 159
           S     CSY+F YG G   S +SG   ++ + FN   G P  +     ++FGC       
Sbjct: 145 SPRVNQCSYTFQYGDG---SGTSGYYVSDAMYFNLIMGQPPAVNSTATIVFGCSISQSGD 201

Query: 160 PT-SDSKQTGIIGLGPGNSSLISQMGTS-IAGK-FSYCLPDQGSSKINFGGI-----VAG 211
            T +D    GI G GPG  S++SQ+ +  I  K FS+CL   G    N GGI     +  
Sbjct: 202 LTKTDKAVDGIFGFGPGPLSVVSQLSSQGITPKVFSHCLKGDG----NGGGILVLGEILE 257

Query: 212 AGVVSTPLI-IRDHYYLSLEAISVGNQRLEF------VSSSTGNIFVDTGVLRTLLPLEY 264
             +V +PL+  + HY L+L++I+V  Q L        +S++ G   VD G     L  E 
Sbjct: 258 PSIVYSPLVPSQPHYNLNLQSIAVNGQPLPINPAVFSISNNRGGTIVDCGTTLAYLIQEA 317

Query: 265 HSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISSQ--PKFPEVTIHFR-GADVKLS 319
           +  L + ++  +   A+     G +       CY +S+     FP V+++F  GA + L 
Sbjct: 318 YDPLVTAINTAVSQSARQTNSKGNQ-------CYLVSTSIGDIFPLVSLNFEGGASMVLK 370

Query: 320 PSN-LFRN---ISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           P   L  N      E+ C  F+       + G ++  + ++ YDI Q  + +    C+
Sbjct: 371 PEQYLMHNGYLDGAEMWCVGFQKLQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 428


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 163/382 (42%), Gaps = 48/382 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A+++ +L F +     KS + I    +II     Y++   IGTPP  +  ++DT +D  W
Sbjct: 60  AKDTTRLQFLDSLVARKSIVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAMDTSNDAAW 118

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C    C      LF P+KS+T+ ++SC++ +C  V +  C     +++  YG  +
Sbjct: 119 IPCTACD--GCAST---LFAPEKSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSS 173

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    NL  +T+T  +       +P+  FGC  K     TS   Q  +     G  SL
Sbjct: 174 IAA----NLVQDTITLATD-----PVPSYTFGCVSKTTG--TSAPPQGLLGLGR-GPLSL 221

Query: 180 ISQMGTSIAGKFSYCLPD----QGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEA 231
           +SQ        FSYCLP       S  +  G +     +  TPL+        YY++LEA
Sbjct: 222 LSQTQNLYQSTFSYCLPSFKSLNFSGSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEA 281

Query: 232 ISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG 283
           I VG + ++          ++  G IF D+G + T L    +  ++       +  P   
Sbjct: 282 IRVGRKVVDIPPAALAFNPTTGAGTIF-DSGTVFTRLVAPVYVAVRDEFRR--RVGPKLT 338

Query: 284 VGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRGGNAN 342
           V +  GF    CYN+      P +T  F G +V L   N L  + +    C A  G   N
Sbjct: 339 VTSLGGFDT--CYNVPI--VVPTITFIFTGMNVTLPQDNILIHSTAGSTTCLAMAGAPDN 394

Query: 343 I-----VYGRIMQINFLIGYDI 359
           +     V   + Q N  + YD+
Sbjct: 395 VNSVLNVIANMQQQNHRVLYDV 416


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 99/394 (25%), Positives = 162/394 (41%), Gaps = 79/394 (20%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
           +++GTPP ++   +DTGS+ +W  C      +  + + P FD   SS+Y  + CSS  C 
Sbjct: 67  VAVGTPPQNVTMVLDTGSELSWLLC------NGSRHDAP-FDASASSSYAPVPCSSPACT 119

Query: 99  VVTSN------CSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
            +  +      C    C  S  Y   A AS + G LA +T    S+      MP  +FGC
Sbjct: 120 WLGRDLPVRPFCDSSACRVSLSY---ADASSADGLLAADTFLLGSS-----PMP-ALFGC 170

Query: 153 --GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKINFGGIV 209
              + +   P S++  TG++G+  G  S ++Q  T    +F+YC+   QG   +  GG  
Sbjct: 171 ITSYSSSTDP-SETPPTGLLGMNRGGLSFVTQTATR---RFAYCIAAGQGPGILLLGGND 226

Query: 210 AGAGVVS--------TPLII---------RDHYYLSLEAISVGNQRLEFVS-------SS 245
               + S        TPL+          R  Y + LE I VG+  L           + 
Sbjct: 227 TETPLTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTG 286

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG--AEPGF------------S 291
            G   VD+G   T L  + ++ LK+  +N +      G+    EPGF            +
Sbjct: 287 AGQTMVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGT 346

Query: 292 DVLCYNISSQPKFPEVTIHFRGADVKLSPSNLF--------RNISDEIMCSAF----RGG 339
           +      ++    PEV +  RGA+V ++ +           R   + + C  F      G
Sbjct: 347 EARVSAAAAGGLLPEVGLVLRGAEVVVAGAEKLLYRVPGERRGEGEGVWCLTFGSSDMAG 406

Query: 340 NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            +  V G   Q +  + YD+  A + F  +RC +
Sbjct: 407 VSAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 105/383 (27%), Positives = 161/383 (42%), Gaps = 64/383 (16%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D ++LM +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T  
Sbjct: 111 DFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPCA-VHCHTQSAKAGPIFDPGRSYTSR 169

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            + CSS +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    + 
Sbjct: 170 RVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS- 226

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG----TSIAGKFSYCL 195
                  +++FGC      S      + GI G G  + S   Q+           FSYCL
Sbjct: 227 -----FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQLAGYPDILSYKAFSYCL 277

Query: 196 PDQGSSKINFGGIVAG--------AGVVSTPLII-RDHYYLSLEAISVGNQRLEFVSSST 246
           P   + +   G ++ G         G  S    I R  Y L++E +    QRL    +S+
Sbjct: 278 P---TDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYSLTMEMLIANGQRL---VTSS 331

Query: 247 GNIFVDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--------- 296
             + VD+G  RT L P  +    K++   M      +   A       +CY         
Sbjct: 332 SEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGW 389

Query: 297 -----NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRI 348
                  S+    P + I F  GA + L P N+F N     +C  F    A  + + G  
Sbjct: 390 NGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNR 449

Query: 349 MQINFLIGYDIEQAMVSFKPSRC 371
           +  +F   +DI+     FK + C
Sbjct: 450 VTRSFGTTFDIQGKQFGFKYAAC 472


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 110/397 (27%), Positives = 167/397 (42%), Gaps = 83/397 (20%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEP--CPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           +++GTPP ++   +DTGS+ +W  C     P L       P F+   SS+Y ++ C S+ 
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT------PAFNASGSSSYGAVPCPSTA 112

Query: 97  CAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           C        V   C       C  S  Y   A AS + G LAT+  TF  T G P     
Sbjct: 113 CEWRGRDLPVPPFCDTPPSNACRVSLSY---ADASSADGVLATD--TFLLTGGAPPVAVG 167

Query: 148 VIFGCGHKNLASPTSDS---------KQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
             FGC     ++  ++S           TG++G+  G  S ++Q GT    +F+YC+ P 
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAYCIAPG 224

Query: 198 QGSSKINFG--GIVAGAGVVSTPLII---------RDHYYLSLEAISVGNQRLEFVSS-- 244
           +G   +  G  G VA   +  TPLI          R  Y + LE I VG   L    S  
Sbjct: 225 EGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVL 283

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLC 295
                  G   VD+G   T L  + ++ LK+  ++  +A+ +     EPGF    +   C
Sbjct: 284 TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTS--QARLLLAPLGEPGFVFQGAFDAC 341

Query: 296 YN------ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSAFRGGN 340
           +        ++    PEV +  RGA+V +S   L   +  E         + C  F  GN
Sbjct: 342 FRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTF--GN 399

Query: 341 ANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +++      V G   Q N  + YD++   V F P+RC
Sbjct: 400 SDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 62/171 (36%), Positives = 92/171 (53%), Gaps = 24/171 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +G+  + +   +DT SD TW QCEPC  + C+ Q+ P+F P  SS+Y S+SC+S
Sbjct: 65  YIVTMGLGSKNMTVI--IDTRSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNS 120

Query: 95  SQCAVV------TSNCSEGD---CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
           S C  +      T  C   +   C+Y   YG G+Y   ++G+L  E L+F       V +
Sbjct: 121 STCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSY---TNGDLGVEALSFGG-----VSV 172

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
            + +FGCG  N       S   G++GLG    SL+SQ   +  G FSYCLP
Sbjct: 173 SDFVFGCGRNNKGLFGGVS---GLMGLGRSYLSLVSQTNATFGGVFSYCLP 220


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 155/379 (40%), Gaps = 65/379 (17%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC- 97
           L++GTPP ++   +DTGS+ +W  C               F P+ S+T+ ++ C S++C 
Sbjct: 65  LAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADS---FRPRASATFAAVPCGSARCS 121

Query: 98  -----AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
                A  + + +   C  S  Y  G   S S G LAT+         L        FGC
Sbjct: 122 SRDLPAPPSCDAASRRCRVSLSYADG---SASDGALATDVFAVGDAPPL-----RSAFGC 173

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--------GSSKIN 204
                 S        G++G+  G  S ++Q  T    +FSYC+ D+        G S + 
Sbjct: 174 MSAAYDSSPDAVATAGLLGMNRGALSFVTQASTR---RFSYCISDRDDAGVLLLGHSDLP 230

Query: 205 FGGIVAGAGVVSTPLII---RDHYYLSLEAISVGNQRLEFVSS-------STGNIFVDTG 254
           F  +        TP +    R  Y + L  I VG + L    S         G   VD+G
Sbjct: 231 FLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMVDSG 290

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFS-----DVLCYNI-SSQP----KF 304
              T L  + +S +K+    + + +P+     +P F+     D  C+ +   +P    + 
Sbjct: 291 TQFTFLLGDAYSAVKAEF--LKQTKPLLPALEDPSFAFQEAFDT-CFRVPKGRPPPSARL 347

Query: 305 PEVTIHFRGADVKLSPSNLF------RNISDEIMCSAFRGGNANI------VYGRIMQIN 352
           P VT+ F GA + ++   L       R  +D + C  F  GNA++      V G   Q+N
Sbjct: 348 PPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF--GNADMVPLTAYVIGHHHQMN 405

Query: 353 FLIGYDIEQAMVSFKPSRC 371
             + YD+E+  V   P +C
Sbjct: 406 LWVEYDLERGRVGLAPVKC 424


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 156/380 (41%), Gaps = 62/380 (16%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           ++ L IGTP       +DTGS  +W QC P             FDP  SS+++ + CS  
Sbjct: 82  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 96  QC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            C        + TS  S   C YS+ Y  G   +F+ GNL  E  TF+++       P +
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKFTFSNSQ----TTPPL 194

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------- 198
           I GC     A  ++D K  GI+G+  G  S ISQ   S   KFSYC+P +          
Sbjct: 195 ILGC-----AKESTDVK--GILGMNLGRLSFISQAKIS---KFSYCIPTRSNRPGLASTG 244

Query: 199 --------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS------ 244
                    S    +  ++        P +    Y + L  I +G +RL   SS      
Sbjct: 245 SFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDA 304

Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-- 301
             +G   VD+G   T L    +  +K  +  ++ ++  KG     G +  +C++ + Q  
Sbjct: 305 GGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVY--GSTADMCFDGNHQMV 362

Query: 302 --PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQINF 353
                 ++   F RG ++ +    L  N+   I C     S+  G  +NI+ G + Q N 
Sbjct: 363 IGRLIGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQQNL 421

Query: 354 LIGYDIEQAMVSFKPSRCTN 373
            + +D+    V F  + C+ 
Sbjct: 422 WVEFDVANRRVGFSKAECSR 441


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 112/402 (27%), Positives = 163/402 (40%), Gaps = 95/402 (23%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCE-------PCPELDCFKQEPPLFDPKKSSTYNSIS 91
           +++G PP ++   +DTGS+ +W +C        P P      Q P  F+   SSTY +  
Sbjct: 64  VAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPP------QAPAAFNGSASSTYAAAH 117

Query: 92  CSSSQCAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
           CSS +C        V   C+      C  S  Y   A AS + G LA +T         P
Sbjct: 118 CSSPECQWRGRDLPVPPFCAGPPSXSCRVSLSY---ADASSADGILAADTFLLGGAP--P 172

Query: 143 VEMPNVIFGCG---HKNLASPTSDSK-QTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
           V     +FGC        A+ +SDS+  TG++G+  G+ S ++Q  T    +F+YC+ P 
Sbjct: 173 V---XALFGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPG 226

Query: 198 QGSSKINFGGIVAGAGVVS----TPLII---------RDHYYLSLEAISVGNQRLEFVSS 244
            G   +  GG   GA +      TPLI          R  Y + LE I VG   L    S
Sbjct: 227 DGPGLLVLGG--DGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKS 284

Query: 245 -------STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--- 294
                    G   VD+G   T L  + ++ LK    N   A     + A  G SD +   
Sbjct: 285 VLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSA-----LLAPLGESDFVFQG 339

Query: 295 ----CYNIS------SQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSA 335
               C+  S      +    PEV +  RGA+V +    L   +  E         + C  
Sbjct: 340 AFDACFRASEARVAAASXMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLT 399

Query: 336 FRGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           F  GN+++      V G   Q N  + YD++   V F P+RC
Sbjct: 400 F--GNSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 439


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 166/397 (41%), Gaps = 53/397 (13%)

Query: 12  DNETPKSPISIIYQAEIISVDDI-----YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC 66
           D    +    ++  A ++  DD+     Y   + IGTP  +    VDTGS  T+  C  C
Sbjct: 71  DRRFERRGRGLVEDARMVLHDDLLTKGYYTSRVFIGTPAQEFALIVDTGSTVTYVPCSSC 130

Query: 67  PELD----CFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYAS 122
                   CF    P F P  SS+Y ++SC+S  C     +     C Y  +Y   A  S
Sbjct: 131 THCGHHQACFD---PRFKPDNSSSYQTVSCNSPDCITKMCDARVHQCKYERVY---AEMS 184

Query: 123 FSSGNLATETLTFNSTSGLPVEMPN-VIFGCGHKNLASPTSD---SKQTGIIGLGPGNSS 178
            S G L  + L F + S L    P+ ++FGC      + T D       GI+GLG G  S
Sbjct: 185 SSKGVLGKDLLGFGNGSRL---QPHPLLFGCE----TAETGDLYLQHADGIMGLGRGPLS 237

Query: 179 LISQM-GT-SIAGKFSYCLP--DQGSSKINFGGIVAGAGVV---STPLIIRDHYY-LSLE 230
           ++ Q+ GT ++   FS C    D+G   +  G I     +V   S P   R +YY L L 
Sbjct: 238 IVDQLVGTGAMEDSFSLCYGGMDEGGGSMVLGAIPPPPAMVFAKSDP--NRSNYYNLELS 295

Query: 231 AISVGNQRL----EFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA-QPVKGVG 285
            I V    L    E  +   G + +D+G     LP +     K  ++  + + Q V   G
Sbjct: 296 EIQVQGVSLNVPSEVFNGRLGTV-LDSGTTYAYLPDKAFDAFKDAITQQLGSLQAVP--G 352

Query: 286 AEPGFSDVLCYNISSQPK-----FPEVTIHFRG-ADVKLSPSN-LFRN--ISDEIMCSAF 336
            +P + DV      S  K     FP V   F G   V L+P N LF++  +        F
Sbjct: 353 PDPSYPDVCFAGAGSDSKALGKHFPPVDFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFF 412

Query: 337 RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           +  +A  + G I+  N L+ YD     + F  + CTN
Sbjct: 413 KNQDATTLLGGIVVRNTLVTYDRANHQIGFFKTNCTN 449


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKK------SSTY 87
           Y +   +GTP        DTGSD TW  C+  C   +C  ++      K+      SS++
Sbjct: 83  YFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142

Query: 88  NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            +I C +  C +        +NC      C Y + Y  G+ A    G  A ET+T     
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 199

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
           G  +++ NV+ GC         S     G++GLG    S   +      GKFSYCL D  
Sbjct: 200 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257

Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
           S K     + FG   +   +++    T L+   +   Y +++  IS+G   L+  S    
Sbjct: 258 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 317

Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
               G   +D+G   T L    Y   + ++  +++K + V+  +G         C+N + 
Sbjct: 318 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 372

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
             +   P +  HF  GA+ +    +   + +D + C  F         V G IMQ N L 
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 432

Query: 356 GYDIEQAMVSFKPSRCT 372
            +D+    + F PS CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449


>gi|218201673|gb|EEC84100.1| hypothetical protein OsI_30414 [Oryza sativa Indica Group]
          Length = 366

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/296 (27%), Positives = 123/296 (41%), Gaps = 45/296 (15%)

Query: 29  ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           I  D  YL  + IG      +  +DTGS   WTQC+ CP   C   + P +   +S T+ 
Sbjct: 76  IYEDVAYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECPH--CHIGDVPPYGRSQSRTFQ 133

Query: 89  SISCS-----------SSQCAV----VTSNCSEGDCSYSFLYGRGAYASFSSGNLATETL 133
            +SC            +S C        + C  G C +  LY          G ++ +T 
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193

Query: 134 TFNSTSGLPVEMP-NVIFGCGHKN------LASPTSDSKQ-TGIIGLGPGNSSLISQMGT 185
            F        +    ++FGC H+       +   T+  K+ TGI+GLG G++S + Q G 
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTTAVKECTGILGLGMGDASFLRQTGI 253

Query: 186 SIAGKFSYCLPD-------QGSSKINFGGIVAGAGVVSTPLIIR-DHYYLSLEAISVG-N 236
           +   KFSYC P        +  S + FG     +G    PL++R   YYL L AI+   N
Sbjct: 254 T---KFSYCAPPRMPGYSYRRDSWLRFGSHAQISG-KKVPLVMRWGKYYLPLTAITYTYN 309

Query: 237 QRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVG 285
           + +  V            ++ VDTG     LP   H +L   M  +IK++   G G
Sbjct: 310 ELMSPVPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSKKYDGRG 365


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 91/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  TW  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTTWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAF 311


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 162/381 (42%), Gaps = 47/381 (12%)

Query: 19  PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
           P S +   + + ++  Y   L IGTPP      VD+GS  T+  C  C +  C K + P 
Sbjct: 78  PHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQ--CGKHQDPK 135

Query: 79  FDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN 136
           F P+ SSTY  + C+      +  NC +    C Y   Y   A  S S G L  + ++F 
Sbjct: 136 FQPELSSTYQPVKCN------MDCNCDDDKEQCVYEREY---AEHSSSKGVLGEDLISFG 186

Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYC 194
           + S L  +    +FGC         S  +  GIIGLG G+ SL+ Q+     I+  F  C
Sbjct: 187 NESQLTPQ--RAVFGCETVETGDLYS-QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLC 243

Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGNQRLEF---VSS 244
               G   +  G ++ G     + +I  D       +Y + L  I V  ++L     V  
Sbjct: 244 Y---GGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNSRVFD 300

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV-GAEPGFSDVLCY------N 297
                 +D+G     LP    +  +  +  M +  P+K + G +P F D  C+      +
Sbjct: 301 GEHGAVLDSGTTYAYLPDAAFAAFEEAV--MREVSPLKQIDGPDPNFKDT-CFLVAASND 357

Query: 298 ISSQPK-FPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAF-RGGNANIVYGRIMQI 351
           +S   K FP V + F+ G    LSP N +FR+  +        F  G +   + G I+  
Sbjct: 358 VSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVR 417

Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
           N L+ YD E + V F  + C+
Sbjct: 418 NTLVVYDRENSKVGFWRTNCS 438


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 159/391 (40%), Gaps = 60/391 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE--PCPELDCFKQEPPLFDPKKSSTYNSISC 92
           Y +   +GTP        DTGSD TW +C        +        F P+ S T+  ISC
Sbjct: 94  YFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISC 153

Query: 93  SSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNL-ATETLTFNSTSGLPVEM 145
           +S  C      ++ T       C+Y + Y  G+ A  + G   AT  L+         ++
Sbjct: 154 ASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKL 213

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-----PDQGS 200
             ++ GC   +   P+ +    G++ LG  + S  S   +  AG+FSYCL     P   +
Sbjct: 214 KGLVLGC-TSSYTGPSFEVSD-GVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNAT 271

Query: 201 SKINFG----------------------GIVAGAGVVSTPLII----RDHYYLSLEAISV 234
           S + FG                                TPL++    R  Y ++++A+SV
Sbjct: 272 SYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSV 331

Query: 235 GNQRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPG 289
             Q L+   +     + G + +D+G   T+L    +  + + +S  +   P   V  +P 
Sbjct: 332 AGQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPR--VTMDP- 388

Query: 290 FSDVLCYNISS---QPKFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGG--NAN 342
           F    CYN +S       P++ +HF GA  +L P   +   + +  + C   + G     
Sbjct: 389 FE--YCYNWTSPSGDVTLPKMAVHFAGA-ARLEPPGKSYVIDAAPGVKCIGLQEGPWPGI 445

Query: 343 IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            V G I+Q   L  +DI+   + F+ SRCT+
Sbjct: 446 SVIGNILQQEHLWEFDIKNRRLKFQRSRCTH 476


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/317 (27%), Positives = 141/317 (44%), Gaps = 48/317 (15%)

Query: 90  ISCSSSQCA-VVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           + C+ + C+ ++  +C   D C+Y + YG G   + + G  ATE  TF S+ G  +    
Sbjct: 1   MRCAGTLCSDILHHSCERPDTCTYRYNYGDG---TMTVGVYATERFTFASSGGGGLTTTT 57

Query: 148 VI--FGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK--- 202
           V   FGCG  N+ S  + S   GI+G G    SL+SQ+      +FSYCL    S +   
Sbjct: 58  VPLGFGCGSVNVGSLNNGS---GIVGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQST 111

Query: 203 INFGGIVAG------AGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVSSS------- 245
           + FG +  G        V +TPL+        YY+    ++VG +RL    S+       
Sbjct: 112 LLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 171

Query: 246 TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI------- 298
           +G + VD+G   TLLP    + +       ++     G   E G    +C+ +       
Sbjct: 172 SGGVIVDSGTALTLLPAAVLAEVVRAFRQQLRLPFANGGNPEDG----VCFLVPAAWRRS 227

Query: 299 --SSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFR-GGNANIVYGRIMQINFL 354
             +SQ   P + +HF+GAD+ L   N +  +     +C      G+     G ++Q +  
Sbjct: 228 SSTSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMR 287

Query: 355 IGYDIEQAMVSFKPSRC 371
           + YD+E   +S  P+RC
Sbjct: 288 VLYDLEAETLSIAPARC 304


>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
 gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
 gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
 gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
 gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q+    AG         FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L+ E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 156/361 (43%), Gaps = 47/361 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++  S+GTPP  +  +VDT +D +W  C  C         P  FDP  S++Y ++ C S
Sbjct: 112 YVVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAP--FDPAASASYRTVPCGS 169

Query: 95  SQCAVV-TSNCSEGD--CSYSFLYGRGAY-ASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
             CA    + C  G   C +S  Y   +  A+ S  +LA         +G  V+     F
Sbjct: 170 PLCAQAPNAACPPGGKACGFSLTYADSSLQAALSQDSLA--------VAGNAVKA--YTF 219

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIV- 209
           GC  +  A+ T+   Q  +     G  S +SQ        FSYCLP   S  +NF G + 
Sbjct: 220 GCLQR--ATGTAAPPQGLLGLGR-GPLSFLSQTKDMYEATFSYCLPSFKS--LNFSGTLR 274

Query: 210 -----AGAGVVSTPLIIRDH----YYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLR 257
                    + +TPL+   H    YY+++  + VG + +    F  ++     +D+G + 
Sbjct: 275 LGRNGQPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIPAFDPATGAGTVLDSGTMF 334

Query: 258 TLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVK 317
           T L    +  ++  +   + A PV  +G   GF    C+N ++   +P +T+ F G  V 
Sbjct: 335 TRLVAPAYVAVRDEVRRRVGA-PVSSLG---GFDT--CFNTTAV-AWPPMTLLFDGMQVT 387

Query: 318 LSPSNL-----FRNISDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           L   N+     +  IS   M +A  G N  + V   + Q N  + +D+    V F   RC
Sbjct: 388 LPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERC 447

Query: 372 T 372
           T
Sbjct: 448 T 448


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 151/368 (41%), Gaps = 57/368 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +G+P    +  VDTGS+ TW  C                    S ++ +++C+S
Sbjct: 113 YFAEVKVGSPGQRFWLVVDTGSEFTWLNC--------------------SKSFEAVTCAS 152

Query: 95  SQCAVVTSN------CSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
            +C V  S       C +    C Y   Y  G+ A    G   T+++T   T+G   ++ 
Sbjct: 153 RKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAK---GFFGTDSITVGLTNGKQGKLN 209

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ-----GSS 201
           N+  GC    L     + +  GI+GLG    S I +       KFSYCL D       SS
Sbjct: 210 NLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSS 269

Query: 202 KINFGGIVAG---AGVVSTPLIIRDHYY-LSLEAISVGNQRLE-----FVSSSTGNIFVD 252
            +  GG         +  T LI+   +Y +++  IS+G Q L+     +  ++ G   +D
Sbjct: 270 NLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISIGGQMLKIPPQVWDFNAEGGTLID 329

Query: 253 TG-VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQPKFPEVTI 309
           +G  L +LL   Y +  +++  ++ K + V G   E   +   C++         P +  
Sbjct: 330 SGTTLTSLLLPAYEAVFEALTKSLTKVKRVTG---EDFDALEFCFDAEGFDDSVVPRLVF 386

Query: 310 HFRGADVKLSP--SNLFRNISDEIMCSA---FRGGNANIVYGRIMQINFLIGYDIEQAMV 364
           HF G   +  P   +   +++  + C       G     V G IMQ N L  +D+    V
Sbjct: 387 HFAGG-ARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTV 445

Query: 365 SFKPSRCT 372
            F PS CT
Sbjct: 446 GFAPSTCT 453


>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
          Length = 357

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q+    AG         FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQL----AGYPDILSYKAFSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L+ E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTTEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 157/373 (42%), Gaps = 48/373 (12%)

Query: 35  YLMHLSIGTP-PVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSI 90
           Y + + IGTP P       DTGSD TW  CE   +  C K  P    +F    SS++ +I
Sbjct: 119 YFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCK-SCPKPNPHPGRVFRANDSSSFRTI 177

Query: 91  SCSSSQCAVVTSN------CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
            CSS  C +   +      C   +  C + + Y  G  A    G  A ET+T        
Sbjct: 178 PCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAI---GVFANETVTVGLNDHKK 234

Query: 143 VEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK 202
           + + +V+ GC     +   ++    G++GLG    SL  ++      KFSYCL D  SS 
Sbjct: 235 IRLFDVLIGCTE---SFNETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSS 291

Query: 203 -----INFGGI--VAGAGVVSTPLI---IRDHYYLSLEAISVGNQRLEFVS-----SSTG 247
                ++FG I  +    +  T L+   I   Y +++  ISVG   L   S     +  G
Sbjct: 292 NHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVG 351

Query: 248 NIFVDTGVLRTLLPLEYHSN----LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS--SQ 301
            + VD+G   T+L  E +      LK +     K  P++     P  ++  C+      +
Sbjct: 352 GMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIE----LPELNN-FCFEDKGFDR 406

Query: 302 PKFPEVTIHF-RGADVKLSPSNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINFLIGYD 358
              P + IHF  GA  K    +   ++++ I C      +   + + G +MQ N L  YD
Sbjct: 407 AAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYD 466

Query: 359 IEQAMVSFKPSRC 371
           + +  + F PS C
Sbjct: 467 LGRGKLGFGPSSC 479


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 156/378 (41%), Gaps = 62/378 (16%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS 95
           ++ L IGTP       +DTGS  +W QC P             FDP  SS+++ + CS  
Sbjct: 81  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140

Query: 96  QC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            C        + TS  S   C YS+ Y  G   +F+ GNL  E  TF+++       P +
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKFTFSNSQ----TTPPL 193

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ---------- 198
           I GC     A  ++D K  GI+G+  G  S ISQ   S   KFSYC+P +          
Sbjct: 194 ILGC-----AKESTDEK--GILGMNLGRLSFISQAKIS---KFSYCIPTRSNRPGLASTG 243

Query: 199 --------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS------ 244
                    S    +  ++        P +    Y + L+ I +G +RL    S      
Sbjct: 244 SFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDA 303

Query: 245 -STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK 303
             +G   VD+G   T L    +  +K  +  ++ ++  KG     G +  +C++ +   +
Sbjct: 304 GGSGQTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVY--GSTADMCFDGNHSME 361

Query: 304 ----FPEVTIHF-RGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQINF 353
                 ++   F RG ++ +   +L  N+   I C     S+  G  +NI+ G + Q N 
Sbjct: 362 IGRLIGDLVFEFGRGVEILVEKQSLLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQQNL 420

Query: 354 LIGYDIEQAMVSFKPSRC 371
            + +D+    V F  + C
Sbjct: 421 WVEFDVTNRRVGFSKAEC 438


>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/106 (45%), Positives = 62/106 (58%), Gaps = 15/106 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M L IGTPP +I   +DTGS+  WTQC PC  L C+ Q+ P+FDP KSST+    C    
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
                 N  +  CSY  +Y   +Y   + G LATET+T +STSG+P
Sbjct: 55  ------NTPDHSCSYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 158/368 (42%), Gaps = 44/368 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--------PPLFDPKKSST 86
           Y   L IGTP  +    VD+GS  T+  C  C +    + E         P F P  SST
Sbjct: 92  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 151

Query: 87  YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           Y+ + C+       T +     C+Y   Y   A  S SSG L  + ++F   S L  +  
Sbjct: 152 YSPVKCNVD----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ-- 202

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSK 202
             +FGC +       S     GI+GLG G  S++ Q+     I+  FS C    D G   
Sbjct: 203 RAVFGCENTETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 261

Query: 203 INFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGV 255
           +  GG+ A   +V   S P  +R  YY + L+ I V  + L     + +S     +D+G 
Sbjct: 262 MVLGGMPAPPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 319

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTI 309
               LP +     K  ++N + +   K  G +P + D+ C+     N+S   + FP+V +
Sbjct: 320 TYAYLPEQAFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDM 377

Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
            F  G  + LSP N LFR+  +        F+ G +   + G I+  N L+ YD     +
Sbjct: 378 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 437

Query: 365 SFKPSRCT 372
            F  + C+
Sbjct: 438 GFWKTNCS 445


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 72/383 (18%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSIS 91
           ++ L IGTPP      +DTGS  +W QC         K+ PP     FDP  SS+++++ 
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCH-------RKKLPPKPKTSFDPSLSSSFSTLP 125

Query: 92  CSSSQC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           CS   C        + TS  S   C YS+ Y  G   +F+ GNL  E +TF++T      
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKITFSNTE----I 178

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
            P +I GC        T  S   GI+G+  G  S +SQ   S   KFSYC+P +      
Sbjct: 179 TPPLILGCA-------TESSDDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGF 228

Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-- 244
                        S    +  ++        P +    Y + +  I  G ++L    S  
Sbjct: 229 TPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVF 288

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
                 +G   VD+G   T L    +  +++ +   +  +  KG     G +D +C+  N
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGY-VYGGTAD-MCFDGN 346

Query: 298 ISSQPKF--PEVTIHFRGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
           ++  P+     V +  RG ++ +    +  N+   I C     S+  G  +NI+ G + Q
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQ 405

Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
            N  + +D+    V F  + C+ 
Sbjct: 406 QNLWVEFDVTNRRVGFAKADCSR 428


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 158/368 (42%), Gaps = 44/368 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE--------PPLFDPKKSST 86
           Y   L IGTP  +    VD+GS  T+  C  C +    + E         P F P  SST
Sbjct: 91  YTTRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSST 150

Query: 87  YNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP 146
           Y+ + C+       T +     C+Y   Y   A  S SSG L  + ++F   S L  +  
Sbjct: 151 YSPVKCNVD----CTCDNERSQCTYERQY---AEMSSSSGVLGEDIMSFGKESELKPQ-- 201

Query: 147 NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLP--DQGSSK 202
             +FGC +       S     GI+GLG G  S++ Q+     I+  FS C    D G   
Sbjct: 202 RAVFGCENTETGDLFSQHAD-GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGT 260

Query: 203 INFGGIVAGAGVV---STPLIIRDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGV 255
           +  GG+ A   +V   S P  +R  YY + L+ I V  + L     + +S     +D+G 
Sbjct: 261 MVLGGMPAPPDMVFSHSNP--VRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGT 318

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTI 309
               LP +     K  ++N + +   K  G +P + D+ C+     N+S   + FP+V +
Sbjct: 319 TYAYLPEQAFVAFKDAVTNKVNSLK-KIRGPDPNYKDI-CFAGAGRNVSQLSEVFPDVDM 376

Query: 310 HF-RGADVKLSPSN-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMV 364
            F  G  + LSP N LFR+  +        F+ G +   + G I+  N L+ YD     +
Sbjct: 377 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 436

Query: 365 SFKPSRCT 372
            F  + C+
Sbjct: 437 GFWKTNCS 444


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q    +AG         FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQ----LAGYPDILSYKAFSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 153/378 (40%), Gaps = 88/378 (23%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y M + +G+PP      +DTGSD  W QC PC   DCF+Q                    
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQN------------------- 208

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFN-STSGLPVEMPNV---IF 150
                         C Y + YG    +S ++G+ A ET T N +T+G   E+ NV   +F
Sbjct: 209 ----------DNQSCPYYYWYGD---SSNTTGDFAVETFTVNLTTNGGSSELYNVENMMF 255

Query: 151 GCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSS--------- 201
           GCGH N       +   G+        S  SQ+ +     FSYCL D+ S          
Sbjct: 256 GCGHWNRGLFHGAAGLLGLGRG---PLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIF 312

Query: 202 ----------KINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRL-------EFVSS 244
                      +NF   VAG        ++   YY+ +++I V  + L          S 
Sbjct: 313 GEDKDLLSHPNLNFTSFVAGK-----ENLVDTFYYVQIKSILVAGEVLNIPEETWNISSD 367

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSD--VL--CYNIS- 299
             G   +D+G   +      +  +K+ ++        K  G  P + D  +L  C+N+S 
Sbjct: 368 GAGGTIIDSGTTLSYFAEPAYEFIKNKIAE-------KAKGKYPVYRDFPILDPCFNVSG 420

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRG--GNANIVYGRIMQINFLI 355
               + PE+ I F  GA       N F  ++++++C A  G   +A  + G   Q NF I
Sbjct: 421 IHNVQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHI 480

Query: 356 GYDIEQAMVSFKPSRCTN 373
            YD +++ + + P++C +
Sbjct: 481 LYDTKRSRLGYAPTKCAD 498


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 84/289 (29%), Positives = 131/289 (45%), Gaps = 32/289 (11%)

Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
           T  CS G C Y   YG G+Y   + G  A +TLT +S       +    FGCG +N    
Sbjct: 13  TRGCSGGHCLYGVQYGDGSY---TIGFFAMDTLTLSSHD----AIKGFRFGCGERN---E 62

Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ--GSSKINFG---GIVAGAGVV 215
               +  G++GLG G +SL  Q      G F++C P +  G+  + FG        A + 
Sbjct: 63  GLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLS 122

Query: 216 STPLIIR---DHYYLSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLEYHSNLK 269
           +TP++I      YY+ +  I VG + L   + V ++ G I VD+G + T LP   +S+L+
Sbjct: 123 TTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTI-VDSGTVITRLPPAAYSSLR 181

Query: 270 SVMSNMIKAQPVKGVGAEPGFSDV-LCYNI--SSQPKFPEVTIHFRGA-DVKLSPSNLFR 325
           S  +  + A+  K     P  S +  CY++  +S+   P V++ F+G   + +  S +  
Sbjct: 182 SAFAASMAARGYK---RAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIY 238

Query: 326 NISDEIMCSAFRGGNAN---IVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
             S    C  F G  A     + G      F + YDI   +V F P  C
Sbjct: 239 AASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|88174577|gb|ABD39363.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSISWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L  S +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGSSGVFVERSVQEQDVWCLAF 311


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/332 (27%), Positives = 145/332 (43%), Gaps = 53/332 (15%)

Query: 77  PLFDPKKSSTYNSISCSSSQCA---------VVTSNCSEGDCSYSFLYGRGA-YASFSSG 126
           PL  P  SS+   ++C    C          V       G+CSY + YG       ++ G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 127 NLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS 186
            L TET TF   +      P + FGC    L S       +G++GLG G  SL++Q+   
Sbjct: 73  ILMTETFTFGDDA---AAFPGIAFGC---TLRSEGGFGTGSGLVGLGRGKLSLVTQLNVE 126

Query: 187 IAGKFSYCLPDQGS--SKINFGGIVA-----GAGVVSTPL----IIRD--HYYLSLEAIS 233
               F Y L    S  S I+FG +       G   +STPL    +++D   YY+ L  IS
Sbjct: 127 ---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGIS 183

Query: 234 VGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLK-SVMSNMIKAQPVKGV 284
           VG + ++          S+  G +  D+G   T+LP   ++ ++  ++S M   +P    
Sbjct: 184 VGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAA 243

Query: 285 GAEPGFSDVLCYN-ISSQPKFPEVTIHFR-GADVKLSPSNLF-----RNISDEIMCSAFR 337
             +    D++C+   SS   FP + +HF  GAD+ LS  N       +N       S  +
Sbjct: 244 NDD----DLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVK 299

Query: 338 GGNANIVYGRIMQINFLIGYDIE-QAMVSFKP 368
              A  + G IMQ++F + +D+   A + F+P
Sbjct: 300 SSQALTIIGNIMQMDFHVVFDLSGNARMLFQP 331


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q    +AG         FSYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQ----LAGYPDILSYKAFSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAAC 357


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPP------LFDPKKSSTY 87
           Y +   +GTP        DTGSD TW  C+  C   +C  ++        +F    SS++
Sbjct: 12  YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 71

Query: 88  NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            +I C +  C +        +NC      C Y + Y  G+ A    G  A ET+T     
Sbjct: 72  KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 128

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
           G  +++ NV+ GC         S     G++GLG    S   +      GKFSYCL D  
Sbjct: 129 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 186

Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
           S K     + FG   +   +++    T L+   +   Y +++  IS+G   L+  S    
Sbjct: 187 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 246

Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
               G   +D+G   T L    Y   + ++  +++K + V+  +G         C+N + 
Sbjct: 247 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 301

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
             +   P +  HF  GA+ +    +   + +D + C  F         V G IMQ N L 
Sbjct: 302 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 361

Query: 356 GYDIEQAMVSFKPSRCT 372
            +D+    + F PS CT
Sbjct: 362 EFDLGLKKLGFAPSSCT 378


>gi|356542694|ref|XP_003539801.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 174/376 (46%), Gaps = 50/376 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +GTPP + +  +DTGSD  W     C  CP+    + +   FDP+ SST + I
Sbjct: 76  LYYTKVKLGTPPREFYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPRSSSTSSLI 135

Query: 91  SCSSSQC--AVVTSNCS----EGDCSYSFLYGRGAYAS-------FSSGNLATETLTFNS 137
           SCS  +C   V TS+ S       C+Y+F YG G+  S            +   TLT NS
Sbjct: 136 SCSDRRCRSGVQTSDASCSSQNNQCTYTFQYGDGSGTSGYYVSDLMHFAGIFEGTLTTNS 195

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTS-IAGK-FSYC 194
           ++       +V+FGC        T   +   GI G G    S+ISQ+    IA + FS+C
Sbjct: 196 SA-------SVVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSLQGIAPRVFSHC 248

Query: 195 LP--DQGSSKINFGGIVAGAGVVSTPLII-RDHYYLSLEAISVGNQRLE-----FVSSST 246
           L   + G   +  G IV    +V +PL+  + HY L+L++ISV  Q +      F +S+ 
Sbjct: 249 LKGDNSGGGVLVLGEIVE-PNIVYSPLVQSQPHYNLNLQSISVNGQIVPIAPAVFATSNN 307

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--- 303
               VD+G     L  E ++   + ++ ++  Q V+ V +        CY I++      
Sbjct: 308 RGTIVDSGTTLAYLAEEAYNPFVNAITALVP-QSVRSVLSRGN----QCYLITTSSNVDI 362

Query: 304 FPEVTIHFR-GADVKLSPSNLF--RNISDE--IMCSAFRG--GNANIVYGRIMQINFLIG 356
           FP+V+++F  GA + L P +    +N   E  + C  F+   G +  + G ++  + +  
Sbjct: 363 FPQVSLNFAGGASLVLRPQDYLMQQNYIGEGSVWCIGFQRIPGQSITILGDLVLKDKIFV 422

Query: 357 YDIEQAMVSFKPSRCT 372
           YD+    + +    C+
Sbjct: 423 YDLAGQRIGWANYDCS 438


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 158/383 (41%), Gaps = 72/383 (18%)

Query: 36  LMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL----FDPKKSSTYNSIS 91
           ++ L IGTPP      +DTGS  +W QC         K+ PP     FDP  SS+++++ 
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCH-------RKKLPPKPKTSFDPSLSSSFSTLP 125

Query: 92  CSSSQC-------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           CS   C        + TS  S   C YS+ Y  G   +F+ GNL  E +TF++T      
Sbjct: 126 CSHPLCKPRIPDFTLPTSCDSNRLCHYSYFYADG---TFAEGNLVKEKITFSNTE----I 178

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQ------ 198
            P +I GC        T  S   GI+G+  G  S +SQ   S   KFSYC+P +      
Sbjct: 179 TPPLILGCA-------TESSDDRGILGMNRGRLSFVSQAKIS---KFSYCIPPKSNRPGF 228

Query: 199 ------------GSSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS-- 244
                        S    +  ++        P +    Y + +  I  G ++L    S  
Sbjct: 229 TPTGSFYLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVF 288

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--N 297
                 +G   VD+G   T L    +  +++ +   +  +  KG     G +D +C+  N
Sbjct: 289 RPDAGGSGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGY-VYGGTAD-MCFDGN 346

Query: 298 ISSQPKF--PEVTIHFRGADVKLSPSNLFRNISDEIMC-----SAFRGGNANIVYGRIMQ 350
           ++  P+     V +  RG ++ +    +  N+   I C     S+  G  +NI+ G + Q
Sbjct: 347 VAMIPRLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASNII-GNVHQ 405

Query: 351 INFLIGYDIEQAMVSFKPSRCTN 373
            N  + +D+    V F  + C+ 
Sbjct: 406 QNLWVEFDVTNRRVGFAKADCSR 428


>gi|88174589|gb|ABD39369.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 151/331 (45%), Gaps = 49/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSASWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q   +  G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
                 + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/377 (25%), Positives = 154/377 (40%), Gaps = 49/377 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKK------SSTY 87
           Y +   +GTP        DTGSD TW  C+  C   +C  ++      K+      SS++
Sbjct: 83  YSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSF 142

Query: 88  NSISCSSSQCAV------VTSNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTFNSTS 139
            +I C +  C +        +NC      C Y + Y  G+ A    G  A ET+T     
Sbjct: 143 KTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTAL---GFFANETVTVELKE 199

Query: 140 GLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG 199
           G  +++ NV+ GC         S     G++GLG    S   +      GKFSYCL D  
Sbjct: 200 GRKMKLHNVLIGCSES--FQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257

Query: 200 SSK-----INFGGIVAGAGVVS----TPLI---IRDHYYLSLEAISVGNQRLEFVS---- 243
           S K     + FG   +   +++    T L+   +   Y +++  IS+G   L+  S    
Sbjct: 258 SHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD 317

Query: 244 -SSTGNIFVDTGVLRTLLPL-EYHSNLKSVMSNMIKAQPVK-GVGAEPGFSDVLCYNIS- 299
               G   +D+G   T L    Y   + ++  +++K + V+  +G         C+N + 
Sbjct: 318 VKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLE-----YCFNSTG 372

Query: 300 -SQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAF--RGGNANIVYGRIMQINFLI 355
             +   P +  HF  GA+ +    +   + +D + C  F         V G IMQ N L 
Sbjct: 373 FEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLW 432

Query: 356 GYDIEQAMVSFKPSRCT 372
            +D+    + F PS CT
Sbjct: 433 EFDLGLKKLGFAPSSCT 449


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 144/355 (40%), Gaps = 38/355 (10%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPEL-DCFKQEPPL-FDPKKSSTYNSISC 92
           Y   + +GTP       +DTGSD  W      P L    +Q       P  +  +N ++ 
Sbjct: 122 YFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWNCVAP 181

Query: 93  SSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGC 152
              +      +     C Y   YG G   S ++G+ A+ETLTF   +     +  V  GC
Sbjct: 182 ICRRLDSAGCDRRRNSCLYQVAYGDG---SVTAGDFASETLTFARGA----RVQRVAIGC 234

Query: 153 GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGA 212
           GH N     + S   G+        S  SQ+  S    FSYCL D+ SS+        G 
Sbjct: 235 GHDNEGLFIAASGLLGLGRG---RLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWGG 291

Query: 213 GVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSS---------TGNIFVDTGVLRTLLPLE 263
               TP +    YY+ L   SVG  R++ VS S          G + +D+G   T L   
Sbjct: 292 ----TPRMA-TFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARP 346

Query: 264 YHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQP--KFPEVTIHFR-GADVKL 318
            +  ++    +  +A  V G+   PG   +   CYN+S +   K P V++H   GA V L
Sbjct: 347 VYEAVR----DAFRAAAV-GLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVAL 401

Query: 319 SPSNLFRNI-SDEIMCSAFRGGNANI-VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            P N    + +    C A  G +  + + G I Q  F + +D +   V F P  C
Sbjct: 402 PPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
 gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
          Length = 453

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 92/180 (51%), Gaps = 22/180 (12%)

Query: 32  DDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYN 88
           D  +L+ + +GTP V    ++DTGS  +W QC PC  + C  Q     P+FDP  SST+ 
Sbjct: 50  DFAFLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPC-TIKCHVQPAKVGPIFDPSNSSTFR 108

Query: 89  SISCSSSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTF--NS 137
            + CS+S C+       + +  C E +  C Y+  YG G   ++S G   T+ L      
Sbjct: 109 HVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGG--WAYSVGKAVTDRLVLGGGE 166

Query: 138 TSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGK-FSYCLP 196
           T+   + + N +FGC   ++ +  S  K+ GI GLG  N S   Q+   ++ K FSYCLP
Sbjct: 167 TTRTTLSLANFVFGC---SMDTQYSTHKEAGIFGLGTSNYSF-EQIAPLLSYKAFSYCLP 222


>gi|220702733|gb|ACL81165.1| aspartyl protease [Mirabilis jalapa]
          Length = 499

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 101/410 (24%), Positives = 155/410 (37%), Gaps = 83/410 (20%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCF-KQEPPLFDPKKSSTYNSISCS 93
           Y +  SI +  + ++  +DTGSD  W  C P   + C  K EP    P   S  + ISC 
Sbjct: 94  YTLTFSINSQTLSVY--MDTGSDIVWFPCSPFECILCEGKFEPGTLTPLNVSKSSLISCK 151

Query: 94  SSQCA---------------------VVTSNCSEGDC-SYSFLYGRGAYASFSSGNLATE 131
           S  C+                     + TS+CS   C S+ + YG G+  +     L   
Sbjct: 152 SRACSTAHNSPSTSDLCAIAKCPLDEIETSDCSNYHCPSFYYAYGDGSLIA----KLHKH 207

Query: 132 TLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT---SIA 188
            L   STS  P  + +  FGC H  L  P       G+ G G G+ SL +Q+      + 
Sbjct: 208 NLIMPSTSNKPFSLKDFTFGCAHSALGEPI------GVAGFGFGSLSLPAQLANLSPDLG 261

Query: 189 GKFSYCLPDQG--SSKINFGGIVAGAGV-----------VSTPLIIRDH----YYLSLEA 231
            +FSYCL      S+K++    +    V           V TP++        Y +S+EA
Sbjct: 262 NQFSYCLVSHSFDSTKLHHPSPLILGKVKERDFDEITQFVYTPMLDNPKHPYFYSVSMEA 321

Query: 232 ISVGNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGV 284
           ISVG+ R       +       G + VD+G   T+LP  +++++ + +   +     +  
Sbjct: 322 ISVGSSRVRAPNALIRIDRDGNGGVVVDSGTTYTMLPTGFYNSVATELDRRVGRVFKRAS 381

Query: 285 GAEPGFSDVLCYNISSQPK------FPEVTIHFRG-ADVKLSPSNLFRNISD-------- 329
             E       CY +            P +  HF G   V L   N F    D        
Sbjct: 382 ETESKTGLSPCYYLEGNGVERLGLVVPRLAFHFGGNYSVVLPRRNYFYEFLDGEDEKKGR 441

Query: 330 EIMCSAFRGGNAN------IVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           ++ C     G            G   Q  F + YD+E+  V F P +C +
Sbjct: 442 KVGCLMLMDGGDESEGGPGATLGNYQQQGFQVVYDLEERRVGFAPRKCAS 491


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 98/414 (23%), Positives = 161/414 (38%), Gaps = 91/414 (21%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE----------------PCPELDCFKQEPPL 78
           Y +   +GTP        DTGSD TW +C                 P P     ++    
Sbjct: 87  YFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRT--- 143

Query: 79  FDPKKSSTYNSISCSSSQC------AVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATET 132
           F P KS T+  I CSS+ C      ++         C+Y + Y  G+ A  + G + + T
Sbjct: 144 FRPDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVG-VDSAT 202

Query: 133 LTFNSTSGLPVEMPNVIFGC-----GHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSI 187
           +  +  +    ++  V+ GC     G   LAS        G++ LG  N S  S+  +  
Sbjct: 203 IALSGRAARKAKLRGVVLGCTTSYNGQSFLAS-------DGVLSLGYSNISFASRAASRF 255

Query: 188 AGKFSYCL-----PDQGSSKINFGGIVA------GAGVVS-------------------T 217
            G+FSYCL     P   +S + FG   A        G+ S                   T
Sbjct: 256 GGRFSYCLVDHLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQT 315

Query: 218 PLII----RDHYYLSLEAISVGNQRLEFVSS-----STGNIFVDTGVLRTLLPLEYHSNL 268
           PL++    R  Y ++++ +SV  + L+   +       G   +D+G   T+L    +  +
Sbjct: 316 PLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAV 375

Query: 269 KSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS------QPKFPEVTIHFRGADVKLSPSN 322
            + +S  +   P   V  +P F    CYN +S          P + +HF G+     P+ 
Sbjct: 376 VAALSKRLAGLPR--VTMDP-FD--YCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAK 430

Query: 323 LFR-NISDEIMCSAFRGG--NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
            +  + +  + C   + G      V G I+Q   L  YD++   + FK SRC +
Sbjct: 431 SYVIDAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRCMH 484


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 151/331 (45%), Gaps = 49/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSASWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q   +  G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
                 + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|88174597|gb|ABD39373.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174601|gb|ABD39375.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174603|gb|ABD39376.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 94/332 (28%), Positives = 153/332 (46%), Gaps = 51/332 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q   +  G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
                 + G +     V  T ++ R    + +++ L AISV  +RL     + S  G +F
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
            D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P ++
Sbjct: 226 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 279

Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           +HF  GA   L    +F  R++ ++ + C AF
Sbjct: 280 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 49/332 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P  
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNMDSFGA-NEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167

Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
               + GG +A     V  T ++ R    + +++ L AISV  +RL     + S  G +F
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
            D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P ++
Sbjct: 228 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 281

Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           +HF  GA   L    +F  R++ ++ + C AF
Sbjct: 282 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 99/370 (26%), Positives = 158/370 (42%), Gaps = 54/370 (14%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCA 98
            +IGTPP      +D G    WTQC  C    CF QE P FDP KSSTY    C ++ C 
Sbjct: 28  FTIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCE 87

Query: 99  VVTS---NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHK 155
              +   NCS   C+Y            +SG + T+ +   + +       +V FGC   
Sbjct: 88  FFPASIRNCSGDVCAYE---ASTQLFEHTSGKIGTDAVAIGTATAA-----SVAFGC--- 136

Query: 156 NLASPTS--DSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PDQGSSKIN-------- 204
            +AS     D   +G +GL     SL++QM  +    FS+CL P  G    N        
Sbjct: 137 VMASDIKLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAA 193

Query: 205 -FGGIVAGAGVVSTPLI------IRDHYYL-SLEAISVGNQRLEFVSSSTGNIFVDTGVL 256
                   +  ++TP +      I+  YYL +LE I  G++ +  V  S   + + T   
Sbjct: 194 AKLAGGGKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAIITVPQSGRTVLLQTFSP 253

Query: 257 RTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEP--GFSDV--LCYNISSQPKFPEVTIHFR 312
            + L    + +LK  ++       V G  A P   F  +  LC+        P+V + F+
Sbjct: 254 VSFLVDGVYQDLKKAVTAA-----VGGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQ 308

Query: 313 G-ADVKLSPSNLFRNISDEIMCSAF----RGGNANI----VYGRIMQINFLIGYDIEQAM 363
           G A + + P+N   ++ D+ +C A     R  +  +    + G + Q N    YD+E+  
Sbjct: 309 GAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKET 368

Query: 364 VSFKPSRCTN 373
           +SF+ + C++
Sbjct: 369 LSFEAADCSS 378


>gi|88174571|gb|ABD39360.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q      G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGSKGVFVERSVQEQDVWCLAF 311


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 85/317 (26%), Positives = 141/317 (44%), Gaps = 43/317 (13%)

Query: 82  KKSSTYNSISCSSSQCAVVTSN---CSEGD--CSYSFLYGRGAYASFSSGNLATETLTFN 136
           K++   N+   S++Q  V + N   C      C+Y+  YG G   SF+ G L  E L F 
Sbjct: 101 KRTVPSNTEDVSNAQIPVTSGNSGVCGSAAPICNYAINYGDG---SFTRGELGHEKLKFG 157

Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP 196
           +     + + + IFGCG  N       S   G++GLG  + SLISQ      G FSYCLP
Sbjct: 158 T-----ILVKDFIFGCGRNNKGLFGGVS---GLMGLGRSDLSLISQTSGIFGGVFSYCLP 209

Query: 197 D---QGSSKINFGG---------IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS 244
               +GS  +  GG          ++ A ++  P +  + Y+++L  IS+G   L+  S 
Sbjct: 210 STERKGSGSLILGGNSSVYRNSSPISYAKMIENPQLY-NFYFINLTGISIGGVALQAPSV 268

Query: 245 STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDV-LCYNISS--Q 301
               I VD+G + T LP   +  LK+         P       P FS +  C+N+S+  +
Sbjct: 269 GPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGFP-----PAPAFSILDTCFNLSAYQE 323

Query: 302 PKFPEVTIHFRG-ADVKLSPSNLFRNISDE-----IMCSAFRGGNANIVYGRIMQINFLI 355
              P + +HF G A++ +  + +F  +  +     +  ++    +   + G   Q N  +
Sbjct: 324 VDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRV 383

Query: 356 GYDIEQAMVSFKPSRCT 372
            YD ++  V F    C+
Sbjct: 384 IYDTKETKVGFALETCS 400


>gi|357464807|ref|XP_003602685.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355491733|gb|AES72936.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 440

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 78/259 (30%), Positives = 113/259 (43%), Gaps = 27/259 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
            Y + ++IG PP   F  +DTGSD TW QC+ PC    C +   PL+ P      + + C
Sbjct: 84  FYNVTINIGYPPRPYFLDIDTGSDLTWLQCDAPCSR--CSQTPHPLYRPSN----DLVPC 137

Query: 93  SSSQCAVV--TSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
               CA V  T N     E  C Y   Y    Y+S   G L  +    N T+G+ +++  
Sbjct: 138 RHPLCASVHQTDNYECEVEHQCDYEVEYA-DHYSSL--GVLVNDVYVLNFTNGVQLKV-R 193

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
           +  GCG+  +   +S     G++GLG G SSLISQ+     +     +CL  QG   I F
Sbjct: 194 MALGCGYDQIFPDSSYHPVDGMLGLGRGKSSLISQLNGQGLVRNVVGHCLSAQGGGYIFF 253

Query: 206 GGIVAGAGVVSTPLIIRD--HYYLSLEAISVGNQRLEFVSSSTGNIFV--DTGVLRTLLP 261
           G +   + +  TP+  RD  HY      + +G +R  F     GN+    D G   T   
Sbjct: 254 GDVYDSSRLAWTPMSSRDYKHYSAGAAELVLGGKRTGF-----GNLLAVFDAGSSYTYFN 308

Query: 262 LEYHSNLKSVMSNMIKAQP 280
              +   K +    IK  P
Sbjct: 309 SNAYQLTKELAGKPIKEAP 327


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 91/361 (25%), Positives = 147/361 (40%), Gaps = 56/361 (15%)

Query: 51  SVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSS-QCAVVTSNCSEGDC 109
           ++D G   +W QC PC    C  Q  P+FDP KS T+++I   ++  C       + G C
Sbjct: 114 ALDMGGGLSWMQCLPC--RHCLLQMSPVFDPTKSPTFSNIPAHNTVWCRPPYQPLANGAC 171

Query: 110 SYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGI 169
            +   Y    +A   SG LA +T +F + +   V +  ++FGC H+      +     GI
Sbjct: 172 GFDIAYRDNTHA---SGYLARDTFSFPAGNDDFVPLSAIVFGCAHQT-EHFKNQRAVAGI 227

Query: 170 IGLGPGNS-----SLISQMGTSIAGKFSYCLPDQGSSK---INFGGIV-----AGAGVVS 216
           +GLG G +     +   Q+  +  G+FSYC    G S    + FG  +           S
Sbjct: 228 LGLGMGPAGKPPTAFTKQVLPAHGGRFSYCPFVPGMSMYSYLRFGSDIPSHPPPNVHRQS 287

Query: 217 TPLIIRDH----YYLSLEAISVGNQRLEFVSSS--------TGNIFVDTGVLRTLLPLEY 264
           TP++   H    Y++ L  +SVG  RL  V+ +         G   VD G   T      
Sbjct: 288 TPVLAPAHNSEAYFVKLAGVSVGANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSA 347

Query: 265 HSNLKSVMSNMIKAQ-----PVKG---VGAEPGFSDVLCYNISSQPKFPEVTIHFR-GAD 315
           + ++   +   ++ +      V+G   V       DVL          P +T+HF  GA 
Sbjct: 348 YVHIDHAVRQHLQRRGAHIVVVRGNTCVQQPAPHHDVL----------PSMTLHFENGAW 397

Query: 316 VKLSPSNLFRNI---SDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQA--MVSFKPSR 370
           +++ P ++F           C  F       V G   Q+N    +D+     ++SF P  
Sbjct: 398 LRVMPEHVFMPFVVGGHHYQCFGFVSSTDLTVIGARQQVNHRFIFDLHDTIPIMSFNPED 457

Query: 371 C 371
           C
Sbjct: 458 C 458


>gi|88174581|gb|ABD39365.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q      G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGRRGVFVERSVQEQDVWCLAF 311


>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 412

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 87/339 (25%), Positives = 138/339 (40%), Gaps = 29/339 (8%)

Query: 51  SVDTGSDCTWTQCEPC-PELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNC-SEGD 108
           ++DT +  +W  CEPC P L    Q   LF P +S T+  +      C        S   
Sbjct: 84  ALDTAASTSWVMCEPCRPPL---HQLGRLFSPAESPTFRGVRRDDPVCVPPYHRLHSTNG 140

Query: 109 CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTG 168
           CS++F    G  A  +     +E     S SG       V FGC H        D    G
Sbjct: 141 CSFAFPSAIGYLARDTFHLRHSERSVVKSISG-------VAFGCAHTTTGFYNED-ILGG 192

Query: 169 IIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP--------LI 220
           ++ L P   S ++Q G+   G+FSYCLPD  +S    G I  G  V S P         +
Sbjct: 193 VLSLSPSPLSFLTQFGSRAGGRFSYCLPDPTTSHNPSGFIQFGIEVPSLPRHAHTTTLTV 252

Query: 221 IRDHYYLSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNM-- 275
               Y+LSL  IS+GN+RL+    + +S G        +  +    Y    + +M+ M  
Sbjct: 253 SASGYHLSLIGISLGNKRLDIDRHILTSHGCSINPAETITKIAEPAYIIVARELMAQMNE 312

Query: 276 IKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCS 334
           + ++ VKG  + P   + +   + +  + P +  HF  G D+  +   LF+ I       
Sbjct: 313 LGSKQVKGPPSSPLVFNKISRRVRA--RLPNMVFHFADGGDMWFTAGKLFQVIGTTARFL 370

Query: 335 AFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
               G+   V G   Q+N    +++    ++F    C+ 
Sbjct: 371 VEGHGSHRTVIGAAQQVNARFIFNVAAGRLTFAEELCSR 409


>gi|88174605|gb|ABD39377.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q      G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
                 + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 107/379 (28%), Positives = 159/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGW--AYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q+    AG          SYCLP
Sbjct: 112 FMDLMFGCSMDVKYSEF----EAGIFGFGSSSFSFFEQL----AGYPDILSYKALSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFRG-ADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F G A + LSP N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAVC 357


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 49/332 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P  
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNMDSFGA-NEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167

Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
               + GG +A     V  T ++ R    + +++ L AISV  +RL     + S  G +F
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
            D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P ++
Sbjct: 228 -DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAIS 281

Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           +HF  GA   L    +F  R++ ++ + C AF
Sbjct: 282 LHFDDGARFDLGRHGVFVERSVQEQDVWCLAF 313


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 149/366 (40%), Gaps = 42/366 (11%)

Query: 29  ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           I+    Y++   IGTP   +  ++DT +D +W  C  C  + C    P  F P KS+T+ 
Sbjct: 92  ITQSPTYIVKAKIGTPAQTLLLAMDTSNDASWVPCTAC--VGCSTTTP--FAPAKSTTFK 147

Query: 89  SISCSSSQCAVVTS-NCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
            + C +SQC  V +  C    C+++F YG  + A+    +L  +T+T  +       +P 
Sbjct: 148 KVGCGASQCKQVRNPTCDGSACAFNFTYGTSSVAA----SLVQDTVTLATD-----PVPA 198

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPD----QGSSKI 203
             FGC  K   S        G+        +   ++  S    FSYCLP       S  +
Sbjct: 199 YAFGCIQKVTGSSVPPQGLLGLGRGPLSLLAQTQKLYQST---FSYCLPSFKTLNFSGSL 255

Query: 204 NFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGN-------QRLEFVSSSTGNIFVD 252
             G +     +  TPL+        YY++L AI VG        + L F +++      D
Sbjct: 256 RLGPVAQPKRIKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFD 315

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFR 312
           +G + T L    ++ +++     I       V +  GF    CY  ++    P +T  F 
Sbjct: 316 SGTVFTRLVEPAYNAVRNEFRRRIAVHKKLTVTSLGGFDT--CY--TAPIVAPTITFMFS 371

Query: 313 GADVKLSPSN-LFRNISDEIMCSAFRGGNANI-----VYGRIMQINFLIGYDIEQAMVSF 366
           G +V L P N L  + +  + C A      N+     V   + Q N  + +D+  + +  
Sbjct: 372 GMNVTLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGV 431

Query: 367 KPSRCT 372
               CT
Sbjct: 432 ARELCT 437


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 162/371 (43%), Gaps = 41/371 (11%)

Query: 31  VDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE--PCPELDC------FKQEPPLFDPK 82
           +D ++   + IGTP V    ++D GSD  W  C+   C  L          ++   + P 
Sbjct: 103 LDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCIQCAPLSASYYNISLDRDLSEYSPS 162

Query: 83  KSSTYNSISCSSSQCAVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNST-- 138
            SST   +SC    C    SNC   +  C Y F Y      + S+G L  + L   S   
Sbjct: 163 LSSTSRHLSCDHQLCE-WGSNCKNPKDPCPYIFNYDDFENTT-SAGFLVEDKLHLASVGD 220

Query: 139 -SGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL 195
            +   +   +V+ GCG K   S    +   G++GLGPG+ S+ S +  +  I   FS C 
Sbjct: 221 HTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPGDISVPSLLAKAGLIQNCFSLCF 280

Query: 196 PDQGSSKINFGGIVAGAGVVSTPLI----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFV 251
            +  S +I FG     A   STP +        Y++ +E+  VGN  L+    S     V
Sbjct: 281 DENDSGRILFGD-RGHASQQSTPFLPIQGTYVAYFVGVESYCVGNSCLK---RSGFKALV 336

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTI 309
           D+G   T LP E ++ L S     + A   K +  + G  D  CYN SSQ     P + +
Sbjct: 337 DSGSSFTYLPSEVYNELVSEFDKQVNA---KRISFQDGLWDY-CYNASSQELHDIPAIQL 392

Query: 310 HF-RGAD-VKLSPS-NLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGY----DIEQA 362
            F R  + V  +P+ ++  +    + C + +  + +  YG I Q NF+IGY    DIE  
Sbjct: 393 KFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS--YGIIGQ-NFMIGYRMVFDIENL 449

Query: 363 MVSFKPSRCTN 373
            + +  S C +
Sbjct: 450 KLGWSNSSCQD 460


>gi|88174558|gb|ABD39354.1| chloroplast nucleoid DNA-binding protein [Oryza meridionalis]
          Length = 321

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 151/332 (45%), Gaps = 51/332 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P  
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q   +  G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIF 250
                 + G +     V  T ++ R    + +++ L AISV  +RL     V S  G +F
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF 225

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVT 308
            D+G   + +P    S L+  +  ++  +     GA    S+  CY++ S  +   P ++
Sbjct: 226 -DSGSELSYIPDRALSVLRQRIRELLLKR-----GAAEEESERNCYDMRSVDEGDMPAIS 279

Query: 309 IHF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           +HF  GA   L    +F  R++ ++ + C AF
Sbjct: 280 LHFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 154/355 (43%), Gaps = 40/355 (11%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
           IGTPP +    VDTGS  T+  C  C +  C   + P F P  S TY+ + C +  C   
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQ--CGNHQDPKFQPDLSDTYHPVKC-NPDCTCD 58

Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
           T N     C+Y   Y   A  S SSG L  + ++F + S L  +    +FGC +      
Sbjct: 59  TEN---DQCTYERQY---AEMSSSSGILGEDLVSFGNMSELKPQ--RAVFGCENAETGDL 110

Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP 218
            S     GI+GLG G+ S++ Q+     I   FS C    G  ++  G +V G     + 
Sbjct: 111 FSQHAD-GIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSD 166

Query: 219 LII------RDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSN 267
           ++       R  YY + L  + V  ++L+    V        +D+G     LP   +   
Sbjct: 167 MVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPF 226

Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFPEVTIHF-RGADVKLSPS 321
           ++++ S +   + ++  G +P ++DV      S+       FP V + F  G    LSP 
Sbjct: 227 IQAITSELHGLKQIR--GPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284

Query: 322 N-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N LF++  +        F+ G +   + G I+  N L+ YD E + V F  + C+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 61/106 (57%), Gaps = 15/106 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M L IGTPP +I   +DTGS+  WTQC PC  L C+ Q+ P+FDP KSST+    C    
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
                 N  +  C Y  +Y   +Y   + G LATET+T +STSG+P
Sbjct: 55  ------NTPDHSCXYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 105/400 (26%), Positives = 170/400 (42%), Gaps = 52/400 (13%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  +L F       +S + I    +II     Y++   IG+PP  +  ++DT +D  W
Sbjct: 65  AKDQARLQFLASMVAGRSVVPIASGRQIIQ-SPTYIVRAKIGSPPQTLLLAMDTSNDAAW 123

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQC-AVVTSNCSEGDCSYSFLYGRGA 119
             C  C    C      LF P+KS+T+ ++SC S QC  V   +C    C+++  YG  +
Sbjct: 124 IPCTACD--GCTST---LFAPEKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSS 178

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    N+  +T+T  +       +P+  FGC  K      + +   G++GLG G  SL
Sbjct: 179 IAA----NVVQDTVTLATD-----PIPDYTFGCVAKTTG---ASAPPQGLLGLGRGPLSL 226

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
           +SQ        FSYCLP   S  +NF G +   G V+ P+ I+             YY++
Sbjct: 227 LSQTQNLYQSTFSYCLPSFKS--LNFSGSLR-LGPVAQPIRIKYTPLLKNPRRSSLYYVN 283

Query: 229 LEAISVGN-------QRLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSN--MIKAQ 279
           L AI VG        + L F +++      D+G + T L    ++ ++        I A+
Sbjct: 284 LVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLVAPAYTAVRDEFQRRVAIAAK 343

Query: 280 PVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRG 338
               V +  GF    CY +      P +T  F G +V L   N L  + +    C A   
Sbjct: 344 ANLTVTSLGGFDT--CYTVPI--VAPTITFMFSGMNVTLPEDNILIHSTAGSTTCLAMAS 399

Query: 339 GNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
              N+     V   + Q N  + YD+  + +      CT 
Sbjct: 400 APDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCTK 439


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/355 (27%), Positives = 154/355 (43%), Gaps = 40/355 (11%)

Query: 41  IGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV 100
           IGTPP +    VDTGS  T+  C  C +  C   + P F P  S TY+ + C +  C   
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSCDQ--CGNHQDPKFQPDLSDTYHPVKC-NPDCTCD 58

Query: 101 TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASP 160
           T N     C+Y   Y   A  S SSG L  + ++F + S L  +    +FGC +      
Sbjct: 59  TEN---DQCTYERQY---AEMSSSSGILGEDLVSFGNMSELKPQ--RAVFGCENAETGDL 110

Query: 161 TSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAGAGVVSTP 218
            S     GI+GLG G+ S++ Q+     I   FS C    G  ++  G +V G     + 
Sbjct: 111 FSQHAD-GIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSD 166

Query: 219 LII------RDHYY-LSLEAISVGNQRLEF---VSSSTGNIFVDTGVLRTLLP-LEYHSN 267
           ++       R  YY + L  + V  ++L+    V        +D+G     LP   +   
Sbjct: 167 MVFSHSDPDRSPYYNIELRGLHVAGKKLDINPQVFDGKHGTILDSGTTYAYLPEAAFLPF 226

Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQ-----PKFPEVTIHF-RGADVKLSPS 321
           ++++ S +   + ++  G +P ++DV      S+       FP V + F  G    LSP 
Sbjct: 227 IQAITSELHGLKQIR--GPDPNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKYSLSPE 284

Query: 322 N-LFRN--ISDEIMCSAFRGG-NANIVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
           N LF++  +        F+ G +   + G I+  N L+ YD E + V F  + C+
Sbjct: 285 NYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|88174575|gb|ABD39362.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q   +  G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
                 + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGRHGVFVERSVQEQDVWCLAF 311


>gi|242091057|ref|XP_002441361.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
 gi|241946646|gb|EES19791.1| hypothetical protein SORBIDRAFT_09g025220 [Sorghum bicolor]
          Length = 439

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 112/434 (25%), Positives = 172/434 (39%), Gaps = 104/434 (23%)

Query: 28  IISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFK-----QEPPLFDPK 82
           + +  D YL+ L++GTPP      +DTGSD TW  C       C       +  P F P 
Sbjct: 18  VTAYTDGYLLSLNLGTPPQVFQVYLDTGSDLTWVPCGSSSSYQCLDCGSSVKPTPTFLPS 77

Query: 83  KSSTYNSISCSSSQCAVVTSN------CSEGDCS---------------YSFLYGRGAYA 121
           +S++     C S  C  V S+      C+   C+               +S+ YG GA  
Sbjct: 78  ESTSNTRDLCGSRFCVDVHSSDNRFDPCAAAGCAIPAFTGGQCPRPCPPFSYTYGGGALV 137

Query: 122 SFSSGNLATETLTFN-STSG-------LPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLG 173
               G+L+ +++T + ST G       LPV  P   FGC   ++  P       GI G G
Sbjct: 138 ---LGSLSRDSVTLHGSTHGSGAGAGPLPVAFPGFGFGCVGSSIREP------LGIAGFG 188

Query: 174 PGNSSLISQMGTSIAGKFSYC-LPDQGSSKINFGG-IVAG----------AGVVSTPLII 221
            G  SL SQ+G  +   FS+C L  + +   NF   +V G           G V TP++ 
Sbjct: 189 RGALSLPSQLGF-LGKGFSHCFLGFRFARNPNFTSPLVMGDLALSSASTDGGFVFTPMLT 247

Query: 222 R----DHYYLSLEAISVGNQ----------RLEFV-SSSTGNIFVDTGVLRTLLPLEYHS 266
                + YY+ LE + +G+            L  + +   G + VDTG   T LP  +++
Sbjct: 248 SATYPNFYYVGLEGVVLGDDDGGSAMAAPPSLSGIDAQGNGGVLVDTGTTYTQLPDPFYA 307

Query: 267 NLKSVMSNMIKAQP----VKGVGAEPGFSDVLCYNIS------SQPKFPEVTIHFRGA-- 314
              SV++++I A P     + + A  GF   LC+ +       +  + P +T+H  G   
Sbjct: 308 ---SVLASLISAAPPYERSRDLEARTGFD--LCFKVPCARAPCADDELPPITLHLAGGAR 362

Query: 315 ----------------DVKLSPSNLFRNISDEIMCSAFRGGNANIVYGRIMQINFLIGYD 358
                           D  +    LF+ +  E       GG    V G     N  + YD
Sbjct: 363 LALPKLSSYYPVTAIRDSVVVKCLLFQRMEMEDDGDGTSGGGPAAVLGSFQMQNVEVVYD 422

Query: 359 IEQAMVSFKPSRCT 372
           +    V F+P  C 
Sbjct: 423 LAAGRVGFRPRDCA 436


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 94/366 (25%), Positives = 156/366 (42%), Gaps = 46/366 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISCS 93
           Y + L+IG PP      +DTGSD TW QC+ PC    C K    L+ PK     N + C+
Sbjct: 68  YSVILNIGNPPKAFDLDIDTGSDLTWVQCDAPCK--GCTKPLDKLYKPKN----NRVPCA 121

Query: 94  SSQC-AVVTSNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
           SS C A+  +NC      C Y   Y   A    S G L ++       +G  ++ P + F
Sbjct: 122 SSLCQAIQNNNCDIPTEQCDYEVEY---ADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAF 177

Query: 151 GCGH-KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGG 207
           GCG+ +    P S     GI+GLG G +S++SQ+ T         +C        + FG 
Sbjct: 178 GCGYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGD 237

Query: 208 -IVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLPL 262
            ++  +G+  TP++      L     S G   L F    TG     +  D+G   T    
Sbjct: 238 HLLPPSGITWTPMLRSSSDTL----YSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNA 293

Query: 263 EYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPK--------FPEVTIHF--- 311
           + + ++ +++   +   P+K    E   +  +C+  +   K        F  +TI+F   
Sbjct: 294 QVYQSILNLVRKDLSGMPLKDAPEEKALA--VCWKTAKPIKSILDIKSFFKPLTINFIKA 351

Query: 312 RGADVKLSPSNLFRNISDEIMCSAFRG------GNANIVYGRIMQINFLIGYDIEQAMVS 365
           +   ++L+P +      D  +C           GN N++ G I   + ++ YD E+  + 
Sbjct: 352 KNVQLQLAPEDYLIITKDGNVCLGILNGGEQGLGNLNVI-GDIFMQDRVVVYDNERQQIG 410

Query: 366 FKPSRC 371
           + P+ C
Sbjct: 411 WFPTNC 416


>gi|88174583|gb|ABD39366.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 93/331 (28%), Positives = 150/331 (45%), Gaps = 49/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDS--KQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK---- 202
            FGC   NL S  ++      G++G+G G  S++ Q      G FSYCLP Q S +    
Sbjct: 110 TFGC---NLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCLPLQKSERGFFS 165

Query: 203 -----INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
                 + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 166 KTTGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVF 225

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 226 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 280

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 103/400 (25%), Positives = 165/400 (41%), Gaps = 52/400 (13%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           A++  +L F       +S + I    +II     Y++   IGTPP  +  ++DT +D  W
Sbjct: 64  AKDQARLQFLASMVAGRSIVPIASGRQIIQ-SPTYIVRAKIGTPPQTLLLAIDTSNDAAW 122

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C    C      LF P+KS+T+ ++SC S +C  V S +C    C+++  YG  +
Sbjct: 123 IPCTACD--GCTST---LFAPEKSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSS 177

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    N+  +T+T  +       +P   FGC  K     T      G+     G  SL
Sbjct: 178 IAA----NVVQDTVTLATD-----PIPGYTFGCVAKTTGPSTPPQGLLGLGR---GPLSL 225

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
           +SQ        FSYCLP   S  +NF G +   G V+ P+ I+             YY++
Sbjct: 226 LSQTQNLYQSTFSYCLPSFKS--LNFSGSLR-LGPVAQPIRIKYTPLLKNPRRSSLYYVN 282

Query: 229 LEAISVGNQ-------RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMI--KAQ 279
           L AI VG +        L F +++      D+G + T L    ++ ++      +   A+
Sbjct: 283 LFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAK 342

Query: 280 PVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSN-LFRNISDEIMCSAFRG 338
               V +  GF    CY +      P +T  F G +V L   N L  + +    C A   
Sbjct: 343 ANLTVTSLGGFDT--CYTVPIVA--PTITFMFSGMNVTLPQDNILIHSTAGSTSCLAMAS 398

Query: 339 GNANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
              N+     V   + Q N  + YD+  + +      CT 
Sbjct: 399 APDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCTK 438


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score = 88.2 bits (217), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 109/397 (27%), Positives = 166/397 (41%), Gaps = 83/397 (20%)

Query: 39  LSIGTPPVDIFGSVDTGSDCTWTQCEP--CPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           +++GTPP ++   +DTGS+ +W  C     P L       P F+   SS+Y ++ C S+ 
Sbjct: 59  VAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLT------PAFNASGSSSYGAVPCPSTA 112

Query: 97  CAV------VTSNCS---EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
           C        V   C       C  S  Y   A AS + G LAT+  TF  T G P     
Sbjct: 113 CEWRGRDLPVPPFCDTPPSNACRVSLSY---ADASSADGVLATD--TFLLTGGAPPVAVG 167

Query: 148 VIFGCGHKNLASPTSDS---------KQTGIIGLGPGNSSLISQMGTSIAGKFSYCL-PD 197
             FGC     ++  ++S           TG++G+  G  S ++Q GT    +F+YC+ P 
Sbjct: 168 AYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAYCIAPG 224

Query: 198 QGSSKINFG--GIVAGAGVVSTPLII---------RDHYYLSLEAISVGNQRLEFVSS-- 244
           +G   +  G  G VA   +  TPLI          R  Y + LE I VG   L    S  
Sbjct: 225 EGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVL 283

Query: 245 -----STGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGF----SDVLC 295
                  G   VD+G   T L  + ++ LK+  ++  +A+ +     EPGF    +   C
Sbjct: 284 TPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTS--QARLLLAPLGEPGFVFQGAFDAC 341

Query: 296 YN------ISSQPKFPEVTIHFRGADVKLSPSNLFRNISDE---------IMCSAFRGGN 340
           +        ++    P V +  RGA+V +S   L   +  E         + C  F  GN
Sbjct: 342 FRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTF--GN 399

Query: 341 ANI------VYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           +++      V G   Q N  + YD++   V F P+RC
Sbjct: 400 SDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|88174573|gb|ABD39361.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
          Length = 321

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVTSVGLGTPSKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGSRGVFVERSVQEQDVWCLAF 311


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 100/381 (26%), Positives = 161/381 (42%), Gaps = 47/381 (12%)

Query: 19  PISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPL 78
           P S +   + + ++  Y   L IGTPP      VD+GS  T+  C  C +  C K + P 
Sbjct: 77  PHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQ--CGKHQDPK 134

Query: 79  FDPKKSSTYNSISCSSSQCAVVTSNCSEG--DCSYSFLYGRGAYASFSSGNLATETLTFN 136
           F P+ SSTY  + C+      +  NC +    C Y   Y   A  S S G L  + ++F 
Sbjct: 135 FQPEMSSTYQPVKCN------MDCNCDDDREQCVYEREY---AEHSSSKGVLGEDLISFG 185

Query: 137 STSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYC 194
           + S L  +    +FGC         S  +  GIIGLG G+ SL+ Q+     I+  F  C
Sbjct: 186 NESQLTPQ--RAVFGCETVETGDLYS-QRADGIIGLGQGDLSLVDQLVDKGLISNSFGLC 242

Query: 195 LPDQGSSKINFGGIVAGAGVVSTPLIIRD-------HYYLSLEAISVGNQRLEF---VSS 244
               G   +  G ++ G     + ++  D       +Y + L  I V  ++L     V  
Sbjct: 243 Y---GGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHSRVFD 299

Query: 245 STGNIFVDTGVLRTLLP-LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--- 300
                 +D+G     LP   + +  ++VM  +   + +   G +P F D  C+ +++   
Sbjct: 300 GEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQID--GPDPNFKDT-CFQVAASNY 356

Query: 301 ----QPKFPEVTIHFR-GADVKLSPSN-LFRN--ISDEIMCSAF-RGGNANIVYGRIMQI 351
                  FP V + F+ G    LSP N +FR+  +        F  G +   + G I+  
Sbjct: 357 VSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVR 416

Query: 352 NFLIGYDIEQAMVSFKPSRCT 372
           N L+ YD E + V F  + C+
Sbjct: 417 NTLVVYDRENSKVGFWRTNCS 437


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 99/379 (26%), Positives = 162/379 (42%), Gaps = 57/379 (15%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           YL+  S+GTPP  +  +VDT +D  W  C  C    C     P F+P  S+T+  + C +
Sbjct: 94  YLVRASLGTPPQRLLLAVDTSNDAAWVPCAGC--HGC-PTTAPSFNPASSATFRPVPCGA 150

Query: 95  SQCAVVTS-NC-----SEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
             C+   + +C     S+  C +S  YG  +        L+ + L   +  G+   +   
Sbjct: 151 PPCSQAPNPSCTSLAKSKNSCGFSLSYGDSSL----DATLSQDNLAVTANGGV---IKGY 203

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGI 208
            FGC  K   S  S +   G++GLG G    ++Q      G FSYCLP    S  NF G 
Sbjct: 204 TFGCLTK---SNGSAAPAQGLLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGS 260

Query: 209 V--------AGAGVVSTPLIIRDH----YYLSLEAISVGNQR-------LEFVSSSTGNI 249
           +        A   + +TPL+   H    YY+++  + +G +        L F +++    
Sbjct: 261 LTLGRKGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGT 320

Query: 250 FVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVK--------GVGAEPGFSDVLCYNISSQ 301
            +D+G +   L    ++ ++  +   +     +         V +  GF    CYN+S+ 
Sbjct: 321 VLDSGTMFARLAQPAYAAVRDEVRRRVAGSLRRRGGGGASVSVSSLGGFDT--CYNVSTV 378

Query: 302 PKFPEVTIHFRGA-DVKLSPSN-LFRNISDEIMCSAFR-----GGNANI-VYGRIMQINF 353
             +P VT+ F G  +V+L   N + R+      C A       G NA + V G + Q N 
Sbjct: 379 -AWPAVTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNH 437

Query: 354 LIGYDIEQAMVSFKPSRCT 372
            + +D+  A V F   RCT
Sbjct: 438 RVLFDVPNARVGFARERCT 456


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 157/362 (43%), Gaps = 40/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTPP      VDTGS  T+  C  C +  C K + P F P  SSTY  + C S
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQ--CGKHQDPNFQPDWSSTYQPLKC-S 148

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            +C   T +     C Y   Y   A  S SSG L  + ++F   S L  +    +FGC +
Sbjct: 149 MEC---TCDSEMMHCVYDRQY---AEMSSSSGVLGEDIVSFGKQSELKPQ--RTVFGCEN 200

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
                  S  +  GI+GLG G+ S++ Q+     I   FS C    D G   +  GGI  
Sbjct: 201 VETGDIYS-QRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP 259

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLE 263
            AG+V   S P   R  YY + L+ I +  ++L     V        +D+G     LP  
Sbjct: 260 PAGMVFTHSDP--ARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEP 317

Query: 264 YHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
                K ++M  +   + ++  G +  ++D+ C+     ++S   K FP V + F  G  
Sbjct: 318 AFKAFKDAIMKELNSLKLIQ--GPDRNYNDI-CFSGVGSDVSQLSKTFPAVDLVFSNGNR 374

Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + LSP N LF++           F+  N    + G I+  N L+ YD E   + F  + C
Sbjct: 375 LSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434

Query: 372 TN 373
           + 
Sbjct: 435 SE 436


>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 61/106 (57%), Gaps = 15/106 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M L IGTPP +I   +DTGS+  WTQC PC  L C+ Q+ P+FDP KSST+    C    
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
                 N  +  C Y  +Y   +Y   + G LATET+T +STSG+P
Sbjct: 55  ------NTPDHSCPYKIVYDDKSY---TQGTLATETVTIHSTSGVP 91


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 95/379 (25%), Positives = 159/379 (41%), Gaps = 55/379 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPP--LFDPKKSSTYNSISC 92
           Y +   +GTP        DTGSD TW +C             P  +F    S ++  I+C
Sbjct: 101 YFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIAC 160

Query: 93  SSSQCAVVT----SNCSE--GDCSYSFLYGRGAYASFSSGNLATETLTF----------- 135
           SS  C        +NCS     C+Y + Y  G   S + G + T++ T            
Sbjct: 161 SSDTCTSYVPFSLANCSSPASPCAYDYRYRDG---SAARGVVGTDSATIALSSGSGRGGG 217

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT--GIIGLGPGNSSLISQMGTSIAGKFSY 193
           +S+ G   ++  V+ GC     A+    S Q+  G++ LG  N S  S+      G+FSY
Sbjct: 218 DSSGGRRAKLQGVVLGCA----ATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSY 273

Query: 194 CL-----PDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLEFVS- 243
           CL     P   +S + F G  A A    TPL++       Y ++++A+ V  + L+  + 
Sbjct: 274 CLVDHLAPRNATSYLTF-GPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPAD 332

Query: 244 ----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS 299
                  G   +D+G   T+L    +  + + +S  +   P   V  +P F    CYN +
Sbjct: 333 VWDVDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPR--VTMDP-FE--YCYNWT 387

Query: 300 SQP--KFPEVTIHFRGADVKLSP--SNLFRNISDEIMCSAFRGGN--ANIVYGRIMQINF 353
                + P++ +HF G+  +L P   +   + +  + C   + G+     V G I+Q   
Sbjct: 388 DAGALEIPKMEVHFAGS-ARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEH 446

Query: 354 LIGYDIEQAMVSFKPSRCT 372
           L  +D+    + FK +RC 
Sbjct: 447 LWEFDLRDRWLRFKHTRCA 465


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 101/419 (24%), Positives = 154/419 (36%), Gaps = 98/419 (23%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPC--------PELDCFKQEPPLFDPKKSST 86
           Y+    IG PP      VDTGSD  WTQC  C            CF Q  P ++   S T
Sbjct: 78  YIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLSRT 137

Query: 87  YNSISCSSSQ---CAVV--TSNCSEGD------CSYSFLYGRGAYASFSSGNLATETLTF 135
             ++ C       C V   T+ C+ G       C  +  YG G     + G L T+  TF
Sbjct: 138 ARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG----VALGVLGTDAFTF 193

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
            S+S +      + FGC  +   SP + +  +GIIGLG G  SL+SQ+    A +FSYCL
Sbjct: 194 PSSSSV-----TLAFGCVSQTRISPGALNGASGIIGLGRGALSLVSQLN---ATEFSYCL 245

Query: 196 PDQGSSKINFGGIVAGAG------------------VVSTPLI-------IRDHYYLSLE 230
                  ++   +  G G                  V + P             YYL L 
Sbjct: 246 TPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFYYLPLV 305

Query: 231 AISVGNQRLEFVSSS-----------TGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQ 279
            ++ GN  +   + +            G   +D+G   T L    H  L   ++  ++  
Sbjct: 306 GLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRGS 365

Query: 280 ------PVKGVGAEPGFSDVLCYNIS------SQPKFPEVTIHFR-----GADVKLSPSN 322
                 P K  GA       LC          +    P + + F      G ++ +    
Sbjct: 366 GSLVPPPAKLGGALE-----LCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420

Query: 323 LFRNISDEIMCSAF---RGGNANI------VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
            +  +     C A      GNA +      + G  MQ +  + YD+   ++SF+P+ C+
Sbjct: 421 YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 87.8 bits (216), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 151/368 (41%), Gaps = 47/368 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTP   +  ++DT +D TW+ C PC       +    F P  SS+Y S+ C+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134

Query: 95  SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
             C +        +         C++S  +   A  SF + +L ++TL     +     +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
               FGC    +A PT++  + G++GLG G  SL+SQ G++  G FSYCLP       S 
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSG 244

Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
            +  G       V  TPL+   H    YY+++  +SVG   ++       F  ++     
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
           +D+G + T      ++ L+      + A    G  +   F      +  +    P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362

Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
             G  D+ L   N L  + +  + C A             V   + Q N  +  D+  + 
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422

Query: 364 VSFKPSRC 371
           V F    C
Sbjct: 423 VGFAREPC 430


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 157/362 (43%), Gaps = 40/362 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + IGTPP      VDTGS  T+  C  C +  C K + P F P  SSTY  + C S
Sbjct: 92  YTTRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQ--CGKHQDPNFQPDWSSTYQPLKC-S 148

Query: 95  SQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGH 154
            +C   T +     C Y   Y   A  S SSG L  + ++F   S L  +    +FGC +
Sbjct: 149 MEC---TCDSEMMHCVYDRQY---AEMSSSSGVLGEDIVSFGKQSELKPQ--RTVFGCEN 200

Query: 155 KNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCL--PDQGSSKINFGGIVA 210
                  S  +  GI+GLG G+ S++ Q+     I   FS C    D G   +  GGI  
Sbjct: 201 VETGDIYSQ-RADGIMGLGRGDLSIVDQLVEKGVIGNSFSLCYGGMDVGGGAMVLGGISP 259

Query: 211 GAGVV---STPLIIRDHYY-LSLEAISVGNQRL---EFVSSSTGNIFVDTGVLRTLLPLE 263
            AG+V   S P   R  YY + L+ I +  ++L     V        +D+G     LP  
Sbjct: 260 PAGMVFTHSDP--ARSAYYNIDLKEIHIAGKQLPINPMVFDGKYGTILDSGTTYAYLPEP 317

Query: 264 YHSNLK-SVMSNMIKAQPVKGVGAEPGFSDVLCY-----NISSQPK-FPEVTIHF-RGAD 315
                K ++M  +   + ++  G +  ++D+ C+     ++S   K FP V + F  G  
Sbjct: 318 AFKAFKDAIMKELNSLKLIQ--GPDRNYNDI-CFSGVGSDVSQLSKTFPAVDLVFSNGNR 374

Query: 316 VKLSPSN-LFRN--ISDEIMCSAFRGGN-ANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
           + LSP N LF++           F+  N    + G I+  N L+ YD E   + F  + C
Sbjct: 375 LSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFWKTNC 434

Query: 372 TN 373
           + 
Sbjct: 435 SE 436


>gi|88174591|gb|ABD39370.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 147/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y+  + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVTSVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 TFGCNLDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGIHGVFVERSVQEQDVWCLAF 311


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 148/329 (44%), Gaps = 45/329 (13%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P+ 
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPSF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFVDT 253
               + G +     V  T ++ R    + +++ L AISV  +RL    S  S   +  D+
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDS 227

Query: 254 GVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIHF 311
           G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++HF
Sbjct: 228 GSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISLHF 282

Query: 312 -RGADVKLSPSNLF--RNISDE-IMCSAF 336
             GA   L    +F  R++ ++ + C AF
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 151/361 (41%), Gaps = 54/361 (14%)

Query: 49  FGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTSNCSEGD 108
           F  +DT S   W +C  C  L   +Q  P+FDP  SS+Y  +  +S  C         GD
Sbjct: 90  FLVLDTASSLPWMRCAHC--LPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGD 147

Query: 109 -CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
            CS+            + G + T+T+   + + LP+   +V FGC     ++   D+K T
Sbjct: 148 KCSFHL-------PGEAHGYVGTDTIILGNPT-LPIH--SVAFGCAQ---STEGFDTKGT 194

Query: 168 --GIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH- 224
             G +G+G   +SLI Q+   +  +FSYCL   G S    G I  GA +    L++    
Sbjct: 195 FAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIRFGADIPDPTLLVHHRI 254

Query: 225 -----------------YYLSLEAIS--------VGNQRLEFVSSSTGNIFVDTGVLRTL 259
                            YY+ L  IS        +     E  S  +G  FVD G   T 
Sbjct: 255 KILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTH 314

Query: 260 LPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRG------ 313
           L    ++ ++  +++M++    K V  +P FS     +       P++T+ F G      
Sbjct: 315 LVPAAYAVVEEAVAHMVQQWGYKRV-RDPNFSLCFREHPGIWSHIPKLTLDFEGPASRTV 373

Query: 314 ADVKLSPSNLFRNISDE-IMC-SAFRGGNAN-IVYGRIMQINFLIGYDIEQAMVSFKPSR 370
           A +++   NLF  + ++ ++C   +R    +  V G + Q++    +D+    ++F    
Sbjct: 374 AHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRES 433

Query: 371 C 371
           C
Sbjct: 434 C 434


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 169/398 (42%), Gaps = 50/398 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           AQ+  +L + +     +S + I    +++     Y++ + IGTP   +  ++DT SD  W
Sbjct: 66  AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKVLIGTPAQPLLLAMDTSSDVAW 124

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C  + C       F P KS+++ ++SCS+ QC  V +  C    CS++  YG  +
Sbjct: 125 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSS 180

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    NL+ +T+   +    P++     FGC +K     T    Q  +     G  SL
Sbjct: 181 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 230

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
           +SQ  +     FSYCLP   S  + F G +   G  S P  ++             YY++
Sbjct: 231 MSQAQSVYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 287

Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           L AI VG + ++          S+  G IF D+G + T L    +  +++     +K  P
Sbjct: 288 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVKP-P 345

Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
              V +  GF    CY  S Q K P +T  F+G ++ +   NL   + +    C A    
Sbjct: 346 TAVVTSLGGFDT--CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMASA 401

Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             N+     V   + Q N  +  D+    +     RC+
Sbjct: 402 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 150/368 (40%), Gaps = 47/368 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTP   +  ++DT +D TW+ C PC       +    F P  SS+Y S+ C+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134

Query: 95  SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
             C +        +         C++S  +   A  SF + +L ++TL     +     +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
               FGC    +A PT++  + G++GLG G  SL+SQ G+   G FSYCLP       S 
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSG 244

Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
            +  G       V  TPL+   H    YY+++  +SVG   ++       F  ++     
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
           +D+G + T      ++ L+      + A    G  +   F      +  +    P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362

Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
             G  D+ L   N L  + +  + C A             V   + Q N  +  D+  + 
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422

Query: 364 VSFKPSRC 371
           V F    C
Sbjct: 423 VGFAREPC 430


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 90/368 (24%), Positives = 150/368 (40%), Gaps = 47/368 (12%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++   +GTP   +  ++DT +D TW+ C PC       +    F P  SS+Y S+ C+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSR----FIPASSSSYASLPCAS 134

Query: 95  SQCAVVTSNCSEGD---------CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEM 145
             C +        +         C++S  +   A  SF + +L ++TL     +     +
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPF---ADTSFQA-SLGSDTLRLGKDA-----I 185

Query: 146 PNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG----SS 201
               FGC    +A PT++  + G++GLG G  SL+SQ G+   G FSYCLP       S 
Sbjct: 186 AGYAFGC-VGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSRYNGVFSYCLPSYRSYYFSG 244

Query: 202 KINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE-------FVSSSTGNIF 250
            +  G       V  TPL+   H    YY+++  +SVG   ++       F  ++     
Sbjct: 245 SLRLGAAGQPRNVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTV 304

Query: 251 VDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQPKFPEVTIH 310
           +D+G + T      ++ L+      + A    G  +   F      +  +    P VT+H
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAP--SGYTSLGAFDTCFNTDEVAAGGAPPVTLH 362

Query: 311 FRGA-DVKLSPSN-LFRNISDEIMCSAFRGG-----NANIVYGRIMQINFLIGYDIEQAM 363
             G  D+ L   N L  + +  + C A             V   + Q N  +  D+  + 
Sbjct: 363 MDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSR 422

Query: 364 VSFKPSRC 371
           V F    C
Sbjct: 423 VGFAREPC 430


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 161/386 (41%), Gaps = 55/386 (14%)

Query: 17  KSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP 76
           +S + I    +++S+   Y+    +GTP   +  ++D  +D  W  C             
Sbjct: 90  RSFVPIAPGRQLLSIPS-YVARARLGTPAQALLVAIDPSNDAAWVPCA----ACAGCARA 144

Query: 77  PLFDPKKSSTYNSISCSSSQCAVVTS-NCSEG---DCSYSFLYGRGAYASFSSGNLATET 132
           P FDP +SSTY  + C + QC+   + +C  G    C+++  Y    + +     L  + 
Sbjct: 145 PSFDPTRSSTYRPVRCGAPQCSQAPAPSCPGGLGSSCAFNLSYAASTFQAL----LGQDA 200

Query: 133 LTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFS 192
           L  +        +    FGC H  + +  S   Q G++G G G  S  SQ        FS
Sbjct: 201 LALHDDVD---AVAAYTFGCLH--VVTGGSVPPQ-GLVGFGRGPLSFPSQTKDVYGSVFS 254

Query: 193 YCLPDQGSSKINFGGI--VAGAG----VVSTPLIIRDH----YYLSLEAISVGNQ----- 237
           YCLP   SS  NF G   +  AG    + +TPL+   H    YY+++  I VG +     
Sbjct: 255 YCLPSYKSS--NFSGTLRLGPAGQPKRIKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVP 312

Query: 238 --RLEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKG-VGAEPGFSDVL 294
              L F  +S     VD G + T L    ++ ++ V  + ++A PV G +G   GF    
Sbjct: 313 ASALAFDPTSGRGTIVDAGTMFTRLSAPVYAAVRDVFRSRVRA-PVAGPLG---GFDT-- 366

Query: 295 CYNISSQPKFPEVTIHFRG-ADVKLSPSN-LFRNISDEIMCSAFRGG------NANIVYG 346
           CYN++     P VT  F G   V L   N + R+ S  I C A   G       A  V  
Sbjct: 367 CYNVTI--SVPTVTFSFDGRVSVTLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLA 424

Query: 347 RIMQINFLIGYDIEQAMVSFKPSRCT 372
            + Q N  + +D+    V F    CT
Sbjct: 425 SMQQQNHRVLFDVANGRVGFSRELCT 450


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/407 (24%), Positives = 156/407 (38%), Gaps = 90/407 (22%)

Query: 44  PPVDIFGSVDTGSDCTWTQCEPCPELDCFKQE-------PPLFDPKKSSTYNSISCSSSQ 96
           PP  +   +DTGSD  W  C+P   + C  +        PP   P+ SST  S+ C SS 
Sbjct: 92  PPQHVSLYLDTGSDLVWFPCKPFECILCEGKAENTTASTPP---PRLSSTARSVHCKSSA 148

Query: 97  CAVVTSN------CSEGDCSYSFLYGRGA--------YASFSSGNLATETLTFNSTSGLP 142
           C+   SN      C+  DC    +             Y ++  G+L      ++ +  LP
Sbjct: 149 CSAAHSNLPTSDLCAIADCPLESIETSDCHSFSCPSFYYAYGDGSLVAR--LYHDSIKLP 206

Query: 143 VEMP-----NVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGT---SIAGKFSYC 194
           +  P     N  FGC H  LA P       G+ G G G  SL +Q+ +    +  +FSYC
Sbjct: 207 LATPSLSLHNFTFGCAHTALAEP------VGVAGFGRGVLSLPAQLASFAPQLGNRFSYC 260

Query: 195 LPDQ--GSSKINF-GGIVAGAGVVSTPLIIRDH-----------------YYLSLEAISV 234
           L      S ++     ++ G        + +D                  Y + LE IS+
Sbjct: 261 LVSHSFNSDRLRLPSPLILGHSDDKEKRVNKDDVQFVYTSMLDNPKHPYFYCVGLEGISI 320

Query: 235 GNQRL---EFVS----SSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKA--QPVKGVG 285
           G +++   EF+       +G + VD+G   T+LP   ++++ +   N +    +  K V 
Sbjct: 321 GKKKIPAPEFLKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVE 380

Query: 286 AEPGFSDVLCYNISSQPKFPEVTIHFRG--ADVKLSPSNLFRNISD---------EIMC- 333
            + G     CY   +    P + +HF G  + V L   N F +  D          + C 
Sbjct: 381 DKTGLGP--CYYYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCL 438

Query: 334 -------SAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
                   A   G      G   Q  F + YD+EQ  V F   +C +
Sbjct: 439 MLMNGGEEAELTGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCAS 485


>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 148/371 (39%), Gaps = 50/371 (13%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLP-DQGSSKI 203
             +++FGC      S                   L            SYCLP D+     
Sbjct: 112 FMDLMFGCSMDVKYSEFEAGIFGFGSSSFSFFEQLAGYPDILSYKALSYCLPTDETKPGY 171

Query: 204 NFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRT 258
              G    A +    TPL   I R  Y L++E +    QRL    +S+  + VD+G  RT
Sbjct: 172 MILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMIVDSGAQRT 228

Query: 259 -LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY--------------NISSQPK 303
            L P  +    K++   M      +   A       +CY                S+   
Sbjct: 229 SLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTITPFSNWSA 286

Query: 304 FPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQINFLIGYDIE 360
            P + I F  GA + L P N+F N     +C  F    A  + + G  +  +F   +DI+
Sbjct: 287 LPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRSFGTTFDIQ 346

Query: 361 QAMVSFKPSRC 371
                FK + C
Sbjct: 347 GKQFGFKYAVC 357


>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
 gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 158/379 (41%), Gaps = 66/379 (17%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEP---PLFDPKKSSTYNSISCS 93
           M +S+G PPV    ++DTGS  +W QC+PC  + C  Q     P+FDP +S T   + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 94  SSQCA-------VVTSNCSEGD--CSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
           S +C        +  +NC E +  C+YS  YG G   ++S G + T+TL    +      
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNG--WAYSVGKMVTDTLRIGDS------ 111

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAG--------KFSYCLP 196
             +++FGC      S      + GI G G  + S   Q+    AG          SYCLP
Sbjct: 112 FMDLMFGCSMDVKYS----EFEAGIFGFGSSSFSFFEQL----AGYPDILSYKALSYCLP 163

Query: 197 -DQGSSKINFGGIVAGAGVVS--TPL---IIRDHYYLSLEAISVGNQRLEFVSSSTGNIF 250
            D+        G    A +    TPL   I R  Y L++E +    QRL    +S+  + 
Sbjct: 164 TDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTMEMLIANGQRL---VTSSSEMI 220

Query: 251 VDTGVLRT-LLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCY------------- 296
           VD+G  RT L P  +    K++   M      +   A       +CY             
Sbjct: 221 VDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ--ESYICYLSEHDYSGWNGTI 278

Query: 297 -NISSQPKFPEVTIHFR-GADVKLSPSNLFRNISDEIMCSAFRGGNA--NIVYGRIMQIN 352
              S+    P + I F  GA + L P N+F N     +C  F    A  + + G  +  +
Sbjct: 279 TPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQNPALRSQILGNRVTRS 338

Query: 353 FLIGYDIEQAMVSFKPSRC 371
           F   +DI+     FK + C
Sbjct: 339 FGTTFDIQGKQFGFKYAVC 357


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 167/398 (41%), Gaps = 50/398 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           AQ+  +L + +     +S + I    +++     Y++   IGTP   +  ++DT SD  W
Sbjct: 82  AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKALIGTPAQPLLLAMDTSSDVAW 140

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C  + C       F P KS+++ ++SCS+ QC  V +  C    CS++  YG  +
Sbjct: 141 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSS 196

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    NL+ +T+   +    P++     FGC +K     T    Q  +     G  SL
Sbjct: 197 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 246

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
           +SQ  +     FSYCLP   S  + F G +   G  S P  ++             YY++
Sbjct: 247 MSQAQSIYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 303

Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           L AI VG + ++          S+  G IF D+G + T L    +  +++     +K  P
Sbjct: 304 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVK--P 360

Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
              V    G  D  CY  S Q K P +T  F+G ++ +   NL   + +    C A    
Sbjct: 361 TTAVVTSLGGFDT-CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAA 417

Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             N+     V   + Q N  +  D+    +     RC+
Sbjct: 418 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/366 (26%), Positives = 161/366 (43%), Gaps = 43/366 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT--QCEPCPELDCFKQEPPL-----FDPKKSST 86
           ++  ++ IGTP V     +DTGSD  W   +CE C  L    ++P       + P  SST
Sbjct: 110 LHYSYIDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSST 169

Query: 87  YNSISCSSSQCAVVTSNCSEGD-CSYSFLYGRGAYASFSSGNLATETLTFNSTS-GLPVE 144
              + CS   C + ++  +  D C Y   Y     +  +SG L  + + F   S G PV+
Sbjct: 170 AKPVLCSDPLCEMSSTCMAPTDQCPYEINYVSANTS--TSGALYEDYMYFMRESGGNPVK 227

Query: 145 MPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSK 202
           +P V  GCG     S    +   G++GLG  + S+ +++ ++  +A  FS C+   GS  
Sbjct: 228 LP-VYLGCGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGT 286

Query: 203 INFGGIVAGAGVVSTPLI-----IRDHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLR 257
           + FG     A   +TP+I     + D Y + +++I+VGN  L   S +      DTG   
Sbjct: 287 LTFGD-EGPAAQRTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMASHA----LFDTGTSF 341

Query: 258 TLLPLEYHSNLKSVMSNMIK---AQPVKGVGAEPGFSDV-LCYNIS-SQPKFPEVTIHFR 312
           T L        K+V    ++   AQ       +P FS   LCY  S +  + P V++   
Sbjct: 342 TYLS-------KTVYPQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTNFQVPVVSLALS 394

Query: 313 GADVKLSPSNLFRNISDE------IMCSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSF 366
           G +  L   +  ++I D+      +  +    G    + G+    N+ I Y+  +  + +
Sbjct: 395 GGN-SLDVVSGLKSIVDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGW 453

Query: 367 KPSRCT 372
            PS C+
Sbjct: 454 TPSDCS 459


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/299 (27%), Positives = 129/299 (43%), Gaps = 31/299 (10%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQ---CEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + IGTP V  +  +DTGS   W     C+ CP      ++   +DP+ S +   +
Sbjct: 82  LYYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEV 141

Query: 91  SCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMP---N 147
            C  + C           C Y   Y  G     + G L T+ L ++   G     P   +
Sbjct: 142 KCDDTICTSRPPCNMTLRCPYITGYADGG---LTMGILFTDLLHYHQLYGNGQTQPTSTS 198

Query: 148 VIFGCGHKNLASPTSDSKQT-GIIGLGPGNSSLISQMGTSIAGK----FSYCLPDQGSSK 202
           V FGCG +   S  + +    GIIG G  N + +SQ+  + AGK    FS+CL       
Sbjct: 199 VTFGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQL--AAAGKTKKIFSHCLDSTNGGG 256

Query: 203 INFGGIVAGAGVVSTPLIIRDHYY--LSLEAISVGNQRLE-----FVSSSTGNIFVDTGV 255
           I   G V    V +TP++  +  Y  ++L++I+V    L+     F ++ T   F+D+G 
Sbjct: 257 IFAIGEVVEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316

Query: 256 LRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNI--SSQPKFPEVTIHFR 312
               LP   +S L   +  +    P   +GA   F    C++   S   KFP++T HF 
Sbjct: 317 TLVYLPEIIYSEL---ILAVFAKHPDITMGAMYNFQ---CFHFLGSVDDKFPKITFHFE 369


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 160/371 (43%), Gaps = 42/371 (11%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWT---QCEPCPELDCFKQEPPLFDPKKSSTYNSI 90
           +Y   + +G+PP +    +DTGSD  W     C  CP       +   FD   SST   +
Sbjct: 65  LYFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLV 124

Query: 91  SCSSSQC--AVVT--SNCS--EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVE 144
            CS   C  AV T  + CS     CSY+F Y  G   S +SG   ++TL F++  G  + 
Sbjct: 125 HCSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDG---SGTSGYYVSDTLYFDAILGESLV 181

Query: 145 MPN---VIFGCGHKNLASPT-SDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQ 198
           + +   ++FGC        T +D    GI G G G  S+ISQ+ T       FS+CL  +
Sbjct: 182 VNSSALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCLKGE 241

Query: 199 GSSKINFGGI-VAGAGVVSTPLI-IRDHYYLSLEAISVGNQRLE-----FVSSSTGNIFV 251
           G          +   G+V +PL+  + HY L+L++I+V  + L      F +S++    V
Sbjct: 242 GIGGGILVLGEILEPGMVYSPLVPSQPHYNLNLQSIAVNGKLLPIDPSVFATSNSQGTIV 301

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMI--KAQPVKGVGAEPGFSDVLCYNISS--QPKFPEV 307
           D+G     L  E +    S ++ ++     P+   G +       CY +S+     FP  
Sbjct: 302 DSGTTLAYLVAEAYDPFVSAVNVIVSPSVTPIISKGNQ-------CYLVSTSVSQMFPLA 354

Query: 308 TIHFR-GADVKLSPSNLF-----RNISDEIMCSAFRGGNANIVYGRIMQINFLIGYDIEQ 361
           + +F  GA + L P +             + C  F+      + G ++  + +  YD+ +
Sbjct: 355 SFNFAGGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVLKDKIFVYDLVR 414

Query: 362 AMVSFKPSRCT 372
             + +    C+
Sbjct: 415 QRIGWANYDCS 425


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/399 (25%), Positives = 156/399 (39%), Gaps = 84/399 (21%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEPPLFDPKKSSTYNSIS 91
           Y + L  GTP       +DTGS   W  C     C + + F   P  F PK SS+   + 
Sbjct: 86  YSIDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPK-FIPKNSSSSKFVG 144

Query: 92  CSSSQCAVVT----------------SNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF 135
           C++ +CA V                 +NCS+   +Y+  YG G+ A F    L +E L F
Sbjct: 145 CTNPKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGSTAGF----LLSENLNF 200

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
            +      +  + + GC   ++  P       GI G G G  SL SQM  +   +FSYCL
Sbjct: 201 PTK-----KYSDFLLGCSVVSVYQPA------GIAGFGRGEESLPSQMNLT---RFSYCL 246

Query: 196 PDQ---GSSKINFGGIVAGA--------GVVSTPLI----------IRDHYYLSLEAISV 234
                  S+ I    ++  A        GV  TP +             +YY++L+ I V
Sbjct: 247 LSHQFDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVV 306

Query: 235 GNQR-------LEFVSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAE 287
           G +R       LE      G   VD+G   T +       +    +  +     +   AE
Sbjct: 307 GEKRVRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSY--TRAREAE 364

Query: 288 PGFSDVLCYNIS---SQPKFPEVTIHFRG-ADVKLSPSNLFRNI-----------SDEIM 332
             F    C+ ++       FPE+   FRG A ++L  +N F  +           SD++ 
Sbjct: 365 KQFGLSPCFVLAGGAETASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVA 424

Query: 333 CSAFRGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            S    G A ++ G   Q NF + YD+E     F+   C
Sbjct: 425 GSGGTVGPA-VILGNYQQQNFYVEYDLENERFGFRSQSC 462


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 153/361 (42%), Gaps = 50/361 (13%)

Query: 29  ISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYN 88
           ++ D  +L+++  G P  ++   +DTGSD TW +C  C   +C  ++ P F+P  SS+Y+
Sbjct: 123 LNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWIRCNSCSLGNCHNKKIPTFNPSLSSSYS 182

Query: 89  SISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           + SC  S     T N  +               S+S G    + +T       P   P  
Sbjct: 183 NRSCIPSTKTNYTMNYEDN--------------SYSKGVFVCDEVTLK-----PDVFPKF 223

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNS-SLISQMGTSIAGKFSYCLPDQGSSKINFGG 207
            F       +        +G++GL  G   SLISQ  +    KFSYC P   +++   G 
Sbjct: 224 QF---GCGDSGGGDFGSASGVLGLAQGEQYSLISQTASKFKKKFSYCFPHNENTR---GS 277

Query: 208 IVAGAGVVSTPLIIR----------DHYYLSLEAISVGNQRLEFVSS---STGNIFVDTG 254
           ++ G   +S    ++            Y++ L  ISV  +RL   SS   S G I +D+G
Sbjct: 278 LLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLFASPGTI-IDSG 336

Query: 255 VLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNIS----SQPKFPEVTIH 310
            + T LP   +  L++     +   P      +    D  CYN+        K PE+ +H
Sbjct: 337 TVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDT-CYNLKGCGGRNIKLPEIVLH 395

Query: 311 FRG-ADVKLSPSN-LFRNISDEIMCSAF-RGGNAN--IVYGRIMQINFLIGYDIEQAMVS 365
           F G  DV L PS  L+ N      C AF R  + +   + G   Q++  + YDIE   + 
Sbjct: 396 FVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLG 455

Query: 366 F 366
           F
Sbjct: 456 F 456


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 101/398 (25%), Positives = 167/398 (41%), Gaps = 50/398 (12%)

Query: 1   AQNSQKLPFYNDNETPKSPISIIYQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTW 60
           AQ+  +L + +     +S + I    +++     Y++   IGTP   +  ++DT SD  W
Sbjct: 66  AQDQARLQYLSSLVAGRSVVPIASGRQMLQ-STTYIVKALIGTPAQPLLLAMDTSSDVAW 124

Query: 61  TQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVVTS-NCSEGDCSYSFLYGRGA 119
             C  C  + C       F P KS+++ ++SCS+ QC  V +  C    CS++  YG  +
Sbjct: 125 IPCSGC--VGCPSNT--AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSS 180

Query: 120 YASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSL 179
            A+    NL+ +T+   +    P++     FGC +K     T    Q  +     G  SL
Sbjct: 181 IAA----NLSQDTIRLAAD---PIKA--FTFGCVNKVAGGGTIPPPQGLLGLGR-GPLSL 230

Query: 180 ISQMGTSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIR-----------DHYYLS 228
           +SQ  +     FSYCLP   S  + F G +   G  S P  ++             YY++
Sbjct: 231 MSQAQSIYKSTFSYCLPSFRS--LTFSGSLR-LGPTSQPQRVKYTQLLRNPRRSSLYYVN 287

Query: 229 LEAISVGNQRLEF--------VSSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQP 280
           L AI VG + ++          S+  G IF D+G + T L    +  +++     +K  P
Sbjct: 288 LVAIRVGRKVVDLPPAAIAFNPSTGAGTIF-DSGTVYTRLAKPVYEAVRNEFRKRVK--P 344

Query: 281 VKGVGAEPGFSDVLCYNISSQPKFPEVTIHFRGADVKLSPSNL-FRNISDEIMCSAFRGG 339
              V    G  D  CY  S Q K P +T  F+G ++ +   NL   + +    C A    
Sbjct: 345 TTAVVTSLGGFDT-CY--SGQVKVPTITFMFKGVNMTMPADNLMLHSTAGSTSCLAMAAA 401

Query: 340 NANI-----VYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
             N+     V   + Q N  +  D+    +     RC+
Sbjct: 402 PENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/330 (27%), Positives = 148/330 (44%), Gaps = 47/330 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P  
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +    FSYCLP Q S +      
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDC-FSYCLPLQKSERGFFSKT 167

Query: 203 ---INFGGIVAGAGVVSTPLIIR----DHYYLSLEAISVGNQRLEF---VSSSTGNIFVD 252
               + G +     V  T ++ R    + +++ L AISV  +RL     V S  G +F D
Sbjct: 168 TGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFSRKGVVF-D 226

Query: 253 TGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTIH 310
           +G   + +P        SV+S  I+   +K   AE   S+  CY++ S  +   P +++H
Sbjct: 227 SGSELSYIP----DRALSVLSQRIRELLLKRGAAEEE-SERNCYDMRSVDEGDMPAISLH 281

Query: 311 F-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           F  GA   L    +F  R++ ++ + C AF
Sbjct: 282 FDDGARFDLGSHGVFVERSVQEQDVWCLAF 311


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/331 (27%), Positives = 149/331 (45%), Gaps = 47/331 (14%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y++ + +GTP       +DTGS  +W  C    E D     P  F   +S+T   +SC +
Sbjct: 1   YVISVGLGTPSKTQILEIDTGSSTSWVFC----ECDGCHTNPRTFLQSRSTTCAKVSCGT 56

Query: 95  SQCAVVTSN--CSEG----DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
           S C +  S+  C +     DC +   Y  G   S S G L  +TLTF+       ++P  
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDG---SASYGILYQDTLTFSDVQ----KIPGF 109

Query: 149 IFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQGSSK------ 202
            FGC   +  +        G++G+G G  S++ Q   +  G FSYCLP Q S +      
Sbjct: 110 SFGCNMDSFGA-NEFGNVDGLLGMGAGPMSVLKQSSPTFDG-FSYCLPLQMSERGFFSKT 167

Query: 203 ---INFGGIVAG--AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSS--STGNIFV 251
               + GG +A     V  T ++ R    + +++ L AISV  +RL    S  S   +  
Sbjct: 168 TGYFSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVF 227

Query: 252 DTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISS--QPKFPEVTI 309
           D+G   + +P        SV+S  I+   ++   AE   S+  CY++ S  +   P +++
Sbjct: 228 DSGSELSYIP----DRALSVLSQRIRELLLRRGAAEEE-SERNCYDMRSVDEGDMPAISL 282

Query: 310 HF-RGADVKLSPSNLF--RNISDE-IMCSAF 336
           HF  GA   L    +F  R++ ++ + C AF
Sbjct: 283 HFDDGARFDLGSHGVFVERSVQEQDVWCLAF 313


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 158/370 (42%), Gaps = 50/370 (13%)

Query: 34  IYLMHLSIGTPPVDIFGSVDTGSDCTWTQCE-PCPELDCFKQEPPLFDPKKSSTYNSISC 92
            Y + L+IG PP   F  +DTGSD TW QC+ PC    C +   PL+ P      + + C
Sbjct: 76  FYNVTLNIGQPPRPYFLDIDTGSDLTWLQCDAPCSR--CSQTPHPLYRPSN----DFVPC 129

Query: 93  SSSQCAVVTS----NCS-EGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPN 147
             S CA +      +C     C Y   Y    Y+S   G L  +  T N T+G+ +++  
Sbjct: 130 RHSLCASLHHSDNYDCEVPHQCDYEVQYA-DHYSSL--GVLLHDVYTLNFTNGVQLKV-R 185

Query: 148 VIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINF 205
           +  GCG+  +    S     G++GLG G +SL SQ+ +   +     +CL  QG   I F
Sbjct: 186 MALGCGYDQIFPDPSHHPLDGMLGLGRGKTSLTSQLNSQGLVRNVIGHCLSAQGGGYIFF 245

Query: 206 GGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSSSTG----NIFVDTGVLRTLLP 261
           G +   + +  TP+  RD+ + S    + G   L F    +G    +   DTG   T   
Sbjct: 246 GDVYDSSRLTWTPMSSRDYKHYS----AAGAAELLFGGKKSGIGSLHAVFDTGSSYTYFN 301

Query: 262 LEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYN--------ISSQPKFPEVTIHFRG 313
              +  L S +      +P+K    +      LC+            +  F  + + F  
Sbjct: 302 PYAYQALISWLGKESGGKPLKEAHDDQTLP--LCWRGRRPFRSIYEVRKYFKPIVLSFTS 359

Query: 314 -----ADVKLSP------SNLFRNISDEIMCSAFRG-GNANIVYGRIMQINFLIGYDIEQ 361
                A  ++ P      SN+  N+   I+  +  G G+ N++ G I  +N ++ +D ++
Sbjct: 360 NGRSKAQFEMPPEAYLIISNM-GNVCLGILNGSEVGMGDLNLI-GDISMLNKVMVFDNDK 417

Query: 362 AMVSFKPSRC 371
            ++ + P+ C
Sbjct: 418 QLIGWTPADC 427


>gi|222629275|gb|EEE61407.1| hypothetical protein OsJ_15596 [Oryza sativa Japonica Group]
          Length = 466

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 105/400 (26%), Positives = 159/400 (39%), Gaps = 90/400 (22%)

Query: 35  YLMHLSIGTPP----VDIFGSVDTGSDCTWTQCEPCPELDCFKQE----------PPLFD 80
           Y + LS+G P     V +F  +DTGSD  W  C P   + C  +           PP  D
Sbjct: 88  YTLSLSVGPPSTASSVSLF--LDTGSDLVWFPCAPFTCMLCEGKATPGGNHSSPLPPPID 145

Query: 81  PKK------------SSTYNSISCSSSQC---AVVTSNCSEGDCS-YSFLYGRGAYASFS 124
            ++            SS   S  C++++C   A+ T +C+   C    + YG G+  +  
Sbjct: 146 SRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSLVA-- 203

Query: 125 SGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMG 184
             NL    +   ++  + VE  N  F C H  LA P       G+ G G G  SL +Q+ 
Sbjct: 204 --NLRRGRVGLAAS--MAVE--NFTFACAHTALAEPV------GVAGFGRGPLSLPAQLA 251

Query: 185 TSIAGKFSYCLPDQGSSKINFGGIVAGAGVVSTPLIIRDH----YYLSLEAISVGNQRLE 240
            S++G         G+S+ +F         V TPL+        Y ++LEA+SVG +R++
Sbjct: 252 PSLSGSTDAAA--IGASETDF---------VYTPLLHNPKHPYFYSVALEAVSVGGKRIQ 300

Query: 241 -------FVSSSTGNIFVDTGVLRTLLPLEYHSNLKS---VMSNMIKAQPVKGVGAEPGF 290
                        G + VD+G   T+LP +  + +           +    +G  A+ G 
Sbjct: 301 AQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAARFTRAEGAEAQTGL 360

Query: 291 SDVLCYNIS-SQPKFPEVTIHFRG-ADVKLSPSNLFRNISDE----IMCSAFR------- 337
           +   CY+ S S    P V +HFRG A V L   N F     E    + C           
Sbjct: 361 AP--CYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVGCLMLMNVGGNND 418

Query: 338 ----GGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
               GG      G   Q  F + YD++   V F   RCT+
Sbjct: 419 DGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 458


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/128 (37%), Positives = 65/128 (50%), Gaps = 15/128 (11%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSS 94
           Y   + +GTP       +DTGSD  W QC PC    C+ Q   +FDP++SSTY  + CSS
Sbjct: 86  YFALVGVGTPSTKAMLVIDTGSDLVWLQCSPCRR--CYAQRGQVFDPRRSSTYRRVPCSS 143

Query: 95  SQCAVV------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNV 148
            QC  +      +   + G C Y   YG G   S S+G+LAT+ L F + +     + NV
Sbjct: 144 PQCRALRFPGCDSGGAAGGGCRYMVAYGDG---SSSTGDLATDKLAFANDT----YVNNV 196

Query: 149 IFGCGHKN 156
             GCG  N
Sbjct: 197 TLGCGRDN 204



 Score = 45.8 bits (107), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 42/90 (46%), Gaps = 11/90 (12%)

Query: 295 CYNISSQPKF--PEVTIHFRG-ADVKLSPSNLF-------RNISDEIMCSAFRGGNANI- 343
           CY++  +P    P + +HF G AD+ L P N F       R  +    C  F   +  + 
Sbjct: 358 CYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLS 417

Query: 344 VYGRIMQINFLIGYDIEQAMVSFKPSRCTN 373
           V G + Q  F + +D+E+  + F P  CT+
Sbjct: 418 VIGNVQQQGFRVVFDVEKERIGFAPKGCTS 447


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 94/305 (30%), Positives = 137/305 (44%), Gaps = 48/305 (15%)

Query: 24  YQAEIISVDDIYLMHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKK 83
           +   +   D  +L+ ++ GTPP +    +DTGS  TWTQC+ C  ++C +     F+   
Sbjct: 117 HNNNLFDEDGNFLVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSA 174

Query: 84  SSTYNSISCSSSQCAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPV 143
           SSTY+S SC       V +N       Y+  YG     S S GN   +T+T   +     
Sbjct: 175 SSTYSSGSCIP---GTVENN-------YNMTYGD---DSTSVGNYGCDTMTLEPSD---- 217

Query: 144 EMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCLPDQG---- 199
                 FGCG  N       S   G++GLG G  S +SQ  +     FSYCLP++     
Sbjct: 218 VFQKFQFGCGRNNKGD--FGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGS 275

Query: 200 ----------SSKINFGGIVAGAGVVSTPLIIRDHYYLSLEAISVGNQRLEFVSS---ST 246
                     SS + F  +V G G +        +Y+++L  ISVGN+RL   SS   S 
Sbjct: 276 LLFGEKATSQSSSLKFTSLVNGPGTLQE----SGYYFVNLSDISVGNERLNIPSSVFASP 331

Query: 247 GNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVL--CYNISSQPKF 304
           G I +D+  + T LP   +S LK+     +   P+     + G  D+L  CYN       
Sbjct: 332 GTI-IDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKG--DILDTCYNXXXX-XX 387

Query: 305 PEVTI 309
           PE+TI
Sbjct: 388 PELTI 392


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 91/357 (25%), Positives = 149/357 (41%), Gaps = 78/357 (21%)

Query: 52  VDTGSDCTWTQCEPCPELDCFKQE--PPL--FDPKKSSTYNSISCSSSQCAVVTSNCSEG 107
           VDTGSD  WTQC+         +   PPL    P ++  +   +C++S  AV        
Sbjct: 57  VDTGSDLIWTQCKLSSSTAAAARHGSPPLSRTAPARTGAFTR-TCTASAAAV-------- 107

Query: 108 DCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIFGCGHKNLASPTSDSKQT 167
                             G LA+ET TF +   + + +    FGCG  +  S       T
Sbjct: 108 ------------------GVLASETFTFGARRAVSLRLG---FGCGALSAGSLIG---AT 143

Query: 168 GIIGLGPGNSSLISQMGTSIAGKFSYCL---PDQGSSKINFGGI-----------VAGAG 213
           GI+GL P + SLI+Q+      +FSYCL    D+ +S + FG +           +    
Sbjct: 144 GILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTA 200

Query: 214 VVSTPLIIRDHYYLSLEAISVGNQRLEFVSSST-------GNIFVDTGVLRTLLPLEYHS 266
           +VS P +   +YY+ L  IS+G++RL   ++S        G   VD+G     L      
Sbjct: 201 IVSNP-VETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFE 259

Query: 267 NLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--------KFPEVTIHFRGADVKL 318
            +K  + ++++  PV     E      LC+ +  +         + P + +HF G    +
Sbjct: 260 AVKEAVMDVVRL-PVANRTVE---DYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMV 315

Query: 319 SPS-NLFRNISDEIMCSAF---RGGNANIVYGRIMQINFLIGYDIEQAMVSFKPSRC 371
            P  N F+     +MC A      G+   + G + Q N  + +D++    SF P++C
Sbjct: 316 LPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 99/384 (25%), Positives = 153/384 (39%), Gaps = 76/384 (19%)

Query: 52  VDTGSDCTWTQCEP------CPELDCFKQEPPLFDPKKSSTYNSISCSSSQCAVV----- 100
           +DTGSD  W  C        CPE         +F P+ SS+ + ++C+ S C  +     
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPED---SASNGVFLPRMSSSLHLVTCADSNCKTLYGNNT 57

Query: 101 ----------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLPVEMPNVIF 150
                       NCSE    Y   YGRG+ A    G L TETL       LP+E      
Sbjct: 58  ELLCQSCAGSLKNCSETCPPYGIQYGRGSTA----GLLLTETLN------LPLENGEGAR 107

Query: 151 GCGHKNLASPTSDSKQ-TGIIGLGPGNSSLISQMGTSIA-GKFSYCLPDQGSSKINFGGI 208
              H  +      S+Q +GI G G G  S+ SQ+G  I   +F+YCL      + N   +
Sbjct: 108 AITHFAVGCSIVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSL 167

Query: 209 -VAGAGVVS-------TPLIIRD----------HYYLSLEAISVGNQRLE--------FV 242
            V G   +        TP +             +YY+ L  +S+G +RL+        F 
Sbjct: 168 MVLGDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFD 227

Query: 243 SSSTGNIFVDTGVLRTLLPLEYHSNLKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP 302
           +   G   +D+G   T+   E   ++ +  ++ I  +    V  + G    LCY+++   
Sbjct: 228 TKGNGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVEDKTGMG--LCYDVTGLE 285

Query: 303 K--FPEVTIHFR-GADVKLSPSNLFRNIS--DEIMCSAF--RG-----GNANIVYGRIMQ 350
               PE   HF+ G+D+ L  +N F   S  D I  +    RG         ++ G   Q
Sbjct: 286 NIVLPEFAFHFKGGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGNDQQ 345

Query: 351 INFLIGYDIEQAMVSFKPSRCTNY 374
            +F + YD E+  + F    C  +
Sbjct: 346 QDFYLLYDREKNRLGFTQQTCKTF 369


>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 47/106 (44%), Positives = 60/106 (56%), Gaps = 15/106 (14%)

Query: 37  MHLSIGTPPVDIFGSVDTGSDCTWTQCEPCPELDCFKQEPPLFDPKKSSTYNSISCSSSQ 96
           M L IGTPP +I   +DTGS+  WTQC PC  L C+ Q+ P+FDP KSST+    C    
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRC---- 54

Query: 97  CAVVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTFNSTSGLP 142
                 N     C Y  +Y   +Y   + G LATET+T +STSG+P
Sbjct: 55  ------NTPNHSCPYKIVYDDKSY---TLGTLATETVTIHSTSGVP 91


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 157/403 (38%), Gaps = 89/403 (22%)

Query: 35  YLMHLSIGTPPVDIFGSVDTGSDCTWTQCEP---CPELDCFKQEP---PLFDPKKSSTYN 88
           Y + LS GTP   I    DTGS   W  C     C   D    +P   P F PK SS+  
Sbjct: 90  YSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSK 149

Query: 89  SISCSSSQCAVV-------------TSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF 135
            I C S +C  +             T NC+ G   Y   YG G+ A    G L TE L F
Sbjct: 150 IIGCQSPKCQFLYGPNVQCRGCDPNTRNCTVGCPPYILQYGLGSTA----GVLITEKLDF 205

Query: 136 NSTSGLPVEMPNVIFGCGHKNLASPTSDSKQTGIIGLGPGNSSLISQMGTSIAGKFSYCL 195
                  + +P+ + GC      S  S  +  GI G G G  SL SQM      +FS+CL
Sbjct: 206 PD-----LTVPDFVVGC------SIISTRQPAGIAGFGRGPVSLPSQMNLK---RFSHCL 251

Query: 196 PDQGSSKINF---------GGIVAGA---GVVSTPL---------IIRDHYYLSLEAISV 234
             +     N           G  +G+   G+  TP             ++YYL+L  I V
Sbjct: 252 VSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYV 311

Query: 235 GNQRLEFV-------SSSTGNIFVDTGVLRTLLPLEYHS----NLKSVMSNMIKAQPV-K 282
           G + ++         ++  G   VD+G   T +             S MSN  + + + K
Sbjct: 312 GRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEK 371

Query: 283 GVGAEPGFSDVLCYNISSQPKF--PEVTIHFRG-ADVKLSPSNLFRNI--SDEIMCSAFR 337
             G  P      C+NIS +     PE+   F+G A ++L  SN F  +  +D +  +   
Sbjct: 372 ETGLGP------CFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVS 425

Query: 338 GGNAN--------IVYGRIMQINFLIGYDIEQAMVSFKPSRCT 372
               N        I+ G   Q N+L+ YD+E     F   +C+
Sbjct: 426 DKTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 103/415 (24%), Positives = 167/415 (40%), Gaps = 64/415 (15%)

Query: 5   QKLPFYNDNETPKSPISIIYQAEII-------------SVDDIYLMHLSIGTPPVDIFGS 51
           Q L    DN+  +  + +  Q +++              +D ++   + IGTP V    +
Sbjct: 59  QYLQLLLDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVA 118

Query: 52  VDTGSDCTWTQCEPCPELDCFKQEP-------PL------FDPKKSSTYNSISCSSSQCA 98
           +D GSD +W  C      DC +  P       PL      + P  S+T   +SC+   C 
Sbjct: 119 LDAGSDLSWVPC------DCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCE 172

Query: 99  VVTSNCSEGDCSYSFLYGRGAYASFSSGNLATETLTF-----NSTSGLPVEMPNVIFGCG 153
            + S+C        ++       + SSG L  + L       +S S       +VI GCG
Sbjct: 173 -LGSHCKNLKDPCPYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCG 231

Query: 154 HKNLASPTSDSKQTGIIGLGPGNSSLISQMGTS--IAGKFSYCLPDQGSSKINFGGIVAG 211
            K        +   G++GLGPG+ S+ S +  +  I   FS C    GS  I FG     
Sbjct: 232 RKQTGGYLDGAAPDGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGD-QGH 290

Query: 212 AGVVSTPLIIR----DHYYLSLEAISVGNQRLEFVSSSTGNIFVDTGVLRTLLPLEYHSN 267
               STPL+      D Y + +E+  VGN  L+    S     VD+G   T LP++ ++ 
Sbjct: 291 TSQKSTPLLPTQGNYDAYLIEVESYCVGNSCLK---QSGFKALVDSGASFTYLPIDVYNK 347

Query: 268 LKSVMSNMIKAQPVKGVGAEPGFSDVLCYNISSQP--KFPEVTIHFRGADVKLSPSNLF- 324
           +       + AQ +   G    +    CYN SS+     P + + F      L  ++ + 
Sbjct: 348 IVLEFDKQVNAQRISSQGGPWNY----CYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYY 403

Query: 325 --RNISDEIMCSAFRGGNANIVYGRIMQINFLIGY----DIEQAMVSFKPSRCTN 373
             +N    + C   +  + N  YG I Q N++ GY    D+E   + +  S C +
Sbjct: 404 VPQNQEFAVFCLTLQPTDLN--YGIIGQ-NYMTGYRVVFDMENLKLGWSSSNCKD 455


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.134    0.401 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,059,136,327
Number of Sequences: 23463169
Number of extensions: 265984114
Number of successful extensions: 581998
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 822
Number of HSP's successfully gapped in prelim test: 2254
Number of HSP's that attempted gapping in prelim test: 575237
Number of HSP's gapped (non-prelim): 3282
length of query: 374
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 230
effective length of database: 8,980,499,031
effective search space: 2065514777130
effective search space used: 2065514777130
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)