BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 014537
         (423 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 225/442 (50%), Positives = 304/442 (68%), Gaps = 21/442 (4%)

Query: 1   MATF---LSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA F   LS    +  LC      I A+  GF+V+LIHRDSP SPFYNS ET  QR+ +A
Sbjct: 1   MAAFRSPLSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNA 60

Query: 58  LTRSLNRLNHFNQNSSIS-SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
           L RS++R++HF+  ++ S S KA+++D+  N   YL+ +S+GTPP + + +ADTGSDLIW
Sbjct: 61  LRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIW 120

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
           TQC+PC   +CY Q  PLFDPK S TY+   C + QC+ L+Q +CSG  CQY  SYGD S
Sbjct: 121 TQCKPC--ERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRS 178

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           ++ GN+A++T+TL STTG  V+ P    GCG  N G F+ K +GIVGLG G +SLISQM 
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238

Query: 237 TTIAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAIS 288
           +++ GKFSYCLVP+S     S+K+NFG+N +VSGPGV STPL  ++T   FY LT++A+S
Sbjct: 239 SSVGGKFSYCLVPLSSRAGNSSKLNFGSNAVVSGPGVQSTPLLSSETMSSFYFLTLEAMS 298

Query: 289 VGNQR-------LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           VGN+R       LG    +I+IDSGTTLT +P  + SNL + + + +E +   DP+G L 
Sbjct: 299 VGNERIKFGDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLS 358

Query: 342 LCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTN 401
           +CYS  S  +VP +T HF GADVKL   N FV+VS+D+VC  F   T+ + IYGN+ Q N
Sbjct: 359 VCYSATSDLKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMN 418

Query: 402 FLVGYDIEQQTVSFKPTDCTKQ 423
           FLV Y+I+ +++SFKPTDCTK+
Sbjct: 419 FLVEYNIQGKSLSFKPTDCTKK 440


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 238/443 (53%), Positives = 310/443 (69%), Gaps = 24/443 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA  +S + I+  +    + PI+A   GF+VELI+RDSPKSPFYN  ETP QR+  A+ R
Sbjct: 1   MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVRR 60

Query: 61  SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           S++R++HF+  +NS I +  A Q+++I N   YL++ S+GTP  + LA+ADTGSDLIWTQ
Sbjct: 61  SMSRVHHFSPTKNSDIFTDTA-QSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG---VNCQYSVSYGD 174
           C+PC   QCY QD+PLFDPK SSTY+ + CS+ QC  L +  SCSG     C YS SYGD
Sbjct: 120 CKPC--DQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGD 177

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            SF++GN+A +T+TLGST+G+ V LP    GCG NNGG F  K +GIVGLGGG ISLISQ
Sbjct: 178 RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQ 237

Query: 235 MRTTIAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAI 287
           + +TI GKFSYCLVP+S     S+K+NFG+NGIVSG GV STPL      TFY LT++A+
Sbjct: 238 LGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAV 297

Query: 288 SVGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
           SVG++R+       G S  +I+IDSGTTLT  P+ + S L S +   +   PV DP+G L
Sbjct: 298 SVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGIL 357

Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
            LCYS ++  + P +T HF GADVKL+  N FV+VS+ ++C  F  I NS  I+GN+ Q 
Sbjct: 358 SLCYSIDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPI-NSGAIFGNLAQM 416

Query: 401 NFLVGYDIEQQTVSFKPTDCTKQ 423
           NFLVGYD+E +TVSFKPTDCT+ 
Sbjct: 417 NFLVGYDLEGKTVSFKPTDCTQD 439


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 227/413 (54%), Positives = 285/413 (69%), Gaps = 21/413 (5%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-ASQADIIP 86
           GF+ +LIHRDSPKSPFYN +ET  QRLR+A+ RS++R+ HF   S   +S  A Q D+  
Sbjct: 30  GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N+  YL+ IS+GTPP   +A+ADTGSDL+WTQC+PC    CY Q  PLFDPK SSTYK +
Sbjct: 90  NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPC--DDCYTQVDPLFDPKASSTYKDV 147

Query: 147 PCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            CSSSQC +L NQ SCS  +  C YS SYGD S++ GN+A +T+TLGST  + V L  I 
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFG 258
            GCG NN G FN K +GIVGLGGG +SLI+Q+  +I GKFSYCLVP++S     +KINFG
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267

Query: 259 TNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
           TN +VSG GVVSTPL     +TFY LT+ +ISVG++ +       G    +I+IDSGTTL
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTL 327

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
           T LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF GADV L  S
Sbjct: 328 TLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPS 387

Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N FV++SED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTDC K
Sbjct: 388 NCFVQISEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/413 (55%), Positives = 286/413 (69%), Gaps = 24/413 (5%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+ +LIHRDSPKSPFYN  ET  QRLR+A+ RS+NR+ HF +  +   +   Q D+  N
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ +SIGTPP   +A+ADTGSDL+WTQC PC    CY Q  PLFDPK SSTYK + 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144

Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSSSQC +L NQ SCS  +  C YS+SYGD S++ GN+A +T+TLGS+  + + L  I  
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGT 259
           GCG NN G FN K +GIVGLGGG +SLI Q+  +I GKFSYCLVP++S     +KINFGT
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264

Query: 260 NGIVSGPGVVSTPL-TKA--KTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
           N IVSG GVVSTPL  KA  +TFY LT+ +ISVG++++         S  +I+IDSGTTL
Sbjct: 265 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTL 324

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
           T LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF GADVKL  S
Sbjct: 325 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSS 384

Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N FV+VSED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTDC K
Sbjct: 385 NAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  404 bits (1039), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/413 (55%), Positives = 286/413 (69%), Gaps = 24/413 (5%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+ +LIHRDSPKSPFYN  ET  QRLR+A+ RS+NR+ HF +  +   +   Q D+  N
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ +SIGTPP   +A+ADTGSDL+WTQC PC    CY Q  PLFDPK SSTYK + 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144

Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSSSQC +L NQ SCS  +  C YS+SYGD S++ GN+A +T+TLGS+  + + L  I  
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGT 259
           GCG NN G FN K +GIVGLGGG +SLI Q+  +I GKFSYCLVP++S     +KINFGT
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLVPLTSKKDQTSKINFGT 264

Query: 260 NGIVSGPGVVSTPL-TKA--KTFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTL 309
           N IVSG GVVSTPL  KA  +TFY LT+ +ISVG++++         S  +I+IDSGTTL
Sbjct: 265 NAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEGNIIIDSGTTL 324

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
           T LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF GADVKL  S
Sbjct: 325 TLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHFDGADVKLDSS 384

Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N FV+VSED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTDC K
Sbjct: 385 NAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 436


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  403 bits (1035), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 211/429 (49%), Positives = 284/429 (66%), Gaps = 23/429 (5%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           LF LCF + S   A + GFSVELIHRDSPKSP+Y  +E  YQ   DA  RS+NR NHF +
Sbjct: 11  LFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +S  S+ +++   +IP+   YL+  S+GTPPT+   +ADTGSD++W QCEPC   QCY Q
Sbjct: 70  DSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC--EQCYNQ 124

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
            +P+F+P  SS+YK++PCSS  C S+   SCS  N CQY +SYGD S S G+L+ +T++L
Sbjct: 125 TTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSL 184

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
            ST+G  V+ P I  GCGT+N G F   ++GIVGLGGG +SLI+Q+ ++I GKFSYCLVP
Sbjct: 185 ESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVP 244

Query: 250 V------SSTKINFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQRL-------- 294
           +      +S+ ++FG   +VSG GVVSTPL K    FY LT+ A SVGN+R+        
Sbjct: 245 LLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG 304

Query: 295 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVP 353
           G    +I+IDSGTTLT +P    +NL S +  +++   V DP     LCYS  S     P
Sbjct: 305 GDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFP 364

Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            +T+HF+GADV+L   + FV +++ IVC  F+       I+GN+ Q N LVGYD++Q+TV
Sbjct: 365 IITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424

Query: 414 SFKPTDCTK 422
           SFKPTDCTK
Sbjct: 425 SFKPTDCTK 433


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 210/439 (47%), Positives = 283/439 (64%), Gaps = 23/439 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M T       LF LCF + S   A + GFSVELIHRDSPKSP+Y  +E  YQ   DA  R
Sbjct: 1   MNTLCFLTLSLFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARR 59

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR NHF ++S  S+ +++   +IP+   YL+  S+GTPPT+   +ADTGSD++W QCE
Sbjct: 60  SINRANHFFKDSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
           PC   QCY Q +P+F+P  SS+YK++PC S  C S+   SCS  N CQY +SYGD S S 
Sbjct: 117 PC--EQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQ 174

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G+L+ +T++L ST+G  V+ P    GCGT+N G F   ++GIVGLGGG +SLI+Q+ ++I
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234

Query: 240 AGKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQ 292
            GKFSYCLVP+      +S+ ++FG   +VSG GVVSTPL K    FY LT+ A SVGN+
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNK 294

Query: 293 RL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
           R+        G    +I+IDSGTTLT +P    +NL S +  +++   V DP     LCY
Sbjct: 295 RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354

Query: 345 SFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 403
           S  S     P +T HF+GAD++L   + FV +++ IVC  F+       I+GN+ Q N L
Sbjct: 355 SLKSNEYDFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLL 414

Query: 404 VGYDIEQQTVSFKPTDCTK 422
           VGYD++Q+TVSFKPTDCTK
Sbjct: 415 VGYDLQQKTVSFKPTDCTK 433


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 216/445 (48%), Positives = 286/445 (64%), Gaps = 29/445 (6%)

Query: 1   MATFLSCVFILFF-----LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR 55
           MATF S   +L F     LC      I A   GF+ EL+HRDSPKSP YNS +T  QR  
Sbjct: 1   MATFQS---VLSFASAIALCVASFGCIYAHNAGFTTELVHRDSPKSPLYNSQQTHLQRWN 57

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
            A+ RS++R++HF + ++  S K  +++II N   YL+ +S+GTPP E LA+ADTGSDLI
Sbjct: 58  KAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLI 117

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN-CQYSVSYG 173
           WTQC PC   +CY Q +PLFDPK S TY+ L C + QC +L +  SCS    CQYS  YG
Sbjct: 118 WTQCTPC--DKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYG 175

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           D SF+NGNLA +TVTL ST G  V  P    GCG  N G F+ K +GI+GLGGG +SLIS
Sbjct: 176 DRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLIS 235

Query: 234 QMRTTIAGKFSYCLVPVS------STKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTID 285
           QM +++ GKFSYCLVP S      S+K++FG N +VSG GV STPL      TFY LT++
Sbjct: 236 QMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNAVVSGSGVQSTPLISKNPDTFYYLTLE 295

Query: 286 AISVGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLL-SVMSSMIEAQPVADPT 337
           A+SVG++++       G S  +I+IDSGT+LT  P  + +    +V +++I  +   D +
Sbjct: 296 AMSVGDKKIEFGGSSFGGSEGNIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDAS 355

Query: 338 GSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
           G L  CY      +VP +T HF GADV L   N F+ +S+D++C  F   T S  I+GN+
Sbjct: 356 GLLSHCYRPTPDLKVPVITAHFNGADVVLQTLNTFILISDDVLCLAFNS-TQSGAIFGNV 414

Query: 398 MQTNFLVGYDIEQQTVSFKPTDCTK 422
            Q NFL+GYDI+ ++VSFKPTDCT+
Sbjct: 415 AQMNFLIGYDIQGKSVSFKPTDCTQ 439


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 208/438 (47%), Positives = 287/438 (65%), Gaps = 37/438 (8%)

Query: 9   FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            +LF+LC  FY    +EA  GGFSVE+IHRDS +SPF+  +ET +QR+ +A+ RS+NR N
Sbjct: 11  LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRAN 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           HF++     + KA++A I  N+  YLI  S+G PP +   + DTGSD+IW QC+PC   +
Sbjct: 67  HFHK-----AHKAAKATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPC--EK 119

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLA 183
           CY Q + +FDP  S+TYK LP SS+ C S+   SCS  N   C+Y++ YGDGS+S G+L+
Sbjct: 120 CYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLS 179

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR---TTIA 240
            ET+TLGST G +V       GCG NN   F  K++GIVGLG G +SLI+Q+R   ++I 
Sbjct: 180 VETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIG 239

Query: 241 GKFSYCLVPVS--STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGV 296
            KFSYCL  +S  S+K+NFG   +VSG G VSTP+     K FY LT++A SVGN R+  
Sbjct: 240 RKFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEF 299

Query: 297 STP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SF 346
           ++         +I+IDSGTTLT LP    S L S ++ ++E   V DP   L LCY  +F
Sbjct: 300 TSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF 359

Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVG 405
           + L+  P +  HF GADVKL+  N F++V + + C  F  I++ + PI+GN+ Q NFLVG
Sbjct: 360 DELN-APVIMAHFSGADVKLNAVNTFIEVEQGVTCLAF--ISSKIGPIFGNMAQQNFLVG 416

Query: 406 YDIEQQTVSFKPTDCTKQ 423
           YD++++ VSFKPTDC+KQ
Sbjct: 417 YDLQKKIVSFKPTDCSKQ 434


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  387 bits (993), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 214/411 (52%), Positives = 278/411 (67%), Gaps = 21/411 (5%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+++LIHRDSPKSPFYNS+ET  QR+R+A+ RS      F+ + +  S  + Q+ I  N
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDA--SPNSPQSFITSN 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+ ISIGTPP   LA+ADTGSDLIWTQC PC    CY Q SPLFDPK SSTY+ + 
Sbjct: 83  RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC--EDCYQQTSPLFDPKESSTYRKVS 140

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSSSQC +L   SCS     C Y+++YGD S++ G++A +TVT+GS+  + V+L  +  G
Sbjct: 141 CSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIG 200

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-----TKINFGTN 260
           CG  N G F+   +GI+GLGGG  SL+SQ+R +I GKFSYCLVP +S     +KINFGTN
Sbjct: 201 CGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTN 260

Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-------GVSTPDIVIDSGTTLTF 311
           GIVSG GVVST + K    T+Y L ++AISVG++++       G    +IVIDSGTTLT 
Sbjct: 261 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTL 320

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
           LP  +   L SV++S I+A+ V DP G L LCY  +S  +VP++T+HF+G DVKL   N 
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNT 380

Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           FV VSED+ C  F      + I+GN+ Q NFLVGYD    TVSFK TDC++
Sbjct: 381 FVAVSEDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQ 430


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  386 bits (992), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 223/431 (51%), Positives = 286/431 (66%), Gaps = 22/431 (5%)

Query: 10  ILFFLCFY---VVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           +L  LC +   ++S + A+   GF+ +LIHRDSPKSPFYN +ETP QR+R+A+ RS NR+
Sbjct: 8   VLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRV 67

Query: 66  NHFNQNSSISSSKAS-QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
           +HF   S + +S  S Q DI P    YL+ +S+GTPP+  +AVADTGS+LIWTQC+PC  
Sbjct: 68  SHFTDLSEMDASLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC-- 125

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGN 181
             CY Q  PLFDPK SSTYK + CSSSQC +L NQ SCS  +  C Y VSY DGS++ G 
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGK 185

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            A +T+TLGST  + V L  I  GCG NN   F +K++G+VGLGGG +SLI Q+  +I G
Sbjct: 186 FAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG 245

Query: 242 KFSYCLVPVS--STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS 297
           KFSYCLVP +  ++KINFGTN +VSGPG VSTPL      TFY LT+ +ISVG++ +   
Sbjct: 246 KFSYCLVPENDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNM--Q 303

Query: 298 TPD------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
           TPD      +VIDSGTTLT LP  Y   + + ++S+I A    D      LCY+  +   
Sbjct: 304 TPDSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLN 363

Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           +P +T+HF GADVKL   N F KV+ED+VC  F        IYGN+ Q NFLVGYD   +
Sbjct: 364 IPVITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASK 423

Query: 412 TVSFKPTDCTK 422
           T+SFKPTDC K
Sbjct: 424 TMSFKPTDCAK 434


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 203/434 (46%), Positives = 285/434 (65%), Gaps = 24/434 (5%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           V ++ FL F ++    A+ GGFSV+LIHRDSP SPF++ S+T  +RL DA  RS++R+  
Sbjct: 12  VVVVGFL-FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVSRVGR 70

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           F   +   +S   Q+ I+P+   YL+ + IGTPP   +A+ DTGSDL WTQC PC  + C
Sbjct: 71  FRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC--THC 126

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGNLATE 185
           Y Q  PLFDPK SSTY+   C +S C +L + +SCS    C +  SY DGSF+ GNLA+E
Sbjct: 127 YKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGNLASE 186

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+T+ ST G+ V+ PG  FGCG ++GG+F+  ++GIVGLGGG++SLISQ+++TI G FSY
Sbjct: 187 TLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTINGLFSY 246

Query: 246 CLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL---- 294
           CL+PVS     S++INFG +G VSG G VSTPL +    TFY LT++ ISVG +RL    
Sbjct: 247 CLLPVSTDSSISSRINFGASGRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKG 306

Query: 295 -----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
                 V   +I++DSGTT TFLPQ + S L   +++ I+ + V DP G   LCY+  + 
Sbjct: 307 YSKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAE 366

Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
              P +T HF+ A+V+L   N F+++ ED+VC      T+ + + GN+ Q NFLVG+D+ 
Sbjct: 367 INAPIITAHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLR 425

Query: 410 QQTVSFKPTDCTKQ 423
           ++ VSFK  DCT+ 
Sbjct: 426 KKRVSFKAADCTQH 439


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 216/436 (49%), Positives = 289/436 (66%), Gaps = 28/436 (6%)

Query: 10  ILFFLCFYVVSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           ++FF+ F  +S  EA   GGFS +LI RDSP SPFYN SET + RL+ A  RS++R NHF
Sbjct: 15  VIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHF 74

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             N    S+ + Q+ +I NN  YL+ IS+GTPP     +ADTGSDL+W QC+PC    CY
Sbjct: 75  RANGV--STNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPC--DSCY 130

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLN-QKSCSGVN-CQYSVSYGDGSFSNGNLATET 186
            Q  P+FDP  S TY+ L C    C++L  Q  CS  N C YS SYGDGS ++G+LA +T
Sbjct: 131 EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDT 190

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
           +T+GSTTG+ V++P + FGCG NNGG F    +G+VGLGGG +S+ISQ+R  I G+FSYC
Sbjct: 191 LTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGGRFSYC 250

Query: 247 LVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLG---- 295
           LVP+      S+K++FG+ GIVSG G VSTPL   +  TFY LT++++SVG+++L     
Sbjct: 251 LVPLGNDPSVSSKMHFGSRGIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGF 310

Query: 296 --VSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
             V +P       +I+IDSGTTLT LPQ +   L S + S I  +PV DP     LCYS 
Sbjct: 311 SKVGSPLADADEGNIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSN 370

Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
            S  ++P +T HF GAD++L   N FV+V ED+ C     +++ + I+GN+ Q NFLVGY
Sbjct: 371 LSGLRIPTITAHFVGADLELKPLNTFVQVQEDLFCFAMIPVSD-LAIFGNLAQMNFLVGY 429

Query: 407 DIEQQTVSFKPTDCTK 422
           D++ +TVSFKPTDCTK
Sbjct: 430 DLKSRTVSFKPTDCTK 445


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  374 bits (960), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 208/439 (47%), Positives = 282/439 (64%), Gaps = 31/439 (7%)

Query: 5   LSCVFILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           ++ VF L FL     V S + A+  GF+VELIHRDSPKSP YNSSET + R+ +AL RS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R      N+ +  S  ++A I  N   YL+ IS+GTPP   +AVADTGSD+IWTQC+PC
Sbjct: 61  HR------NTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-SLNQKSCSG-VNCQYSVSYGDGSFSNG 180
             S CY Q++P+FDP  S+TYK++ CSS  C+ S +  SCS    C YS++YGD S S G
Sbjct: 115 --SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQG 172

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           NLA +TVT+ ST+G+ VA P    GCG +N G FN+  +GIVGLG G  SL++Q+     
Sbjct: 173 NLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATG 232

Query: 241 GKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGN 291
           GKFSYCL+P+       STK+NFG+N  VSG G VSTP+    + KTFY L ++A+SVG+
Sbjct: 233 GKFSYCLIPIGTGSTNDSTKLNFGSNANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGD 292

Query: 292 QRL----GVST----PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
            +     G S      +I+IDSGTTLT+LP    ++  S +S  +      DP+  L+ C
Sbjct: 293 TKFNFPEGASKLGGESNIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYC 352

Query: 344 YSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTN 401
           ++  +   ++P VT+HF GADV L R N FV++S+D +C  F     +++ IYGNI Q+N
Sbjct: 353 FATTTDDYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSN 412

Query: 402 FLVGYDIEQQTVSFKPTDC 420
           FLVGYDI+   VSF+P  C
Sbjct: 413 FLVGYDIKNLAVSFQPAHC 431


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 215/443 (48%), Positives = 284/443 (64%), Gaps = 30/443 (6%)

Query: 4   FLSCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+ C   I+  + F   S  EA+  GF+ + I RDSP SPFYN SET YQRL+ A  RS+
Sbjct: 8   FVFCTLAIIILIHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            R NHF   +  +S    Q+D+I     YL+ IS+GTPP   L +ADTGSDLIW QC PC
Sbjct: 68  LRGNHFR--AMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
           P   CY Q  PLFDPK S TYK+L C +  C  L Q+ SC   N C YS SYGD S++ G
Sbjct: 126 P--NCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRG 183

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           +L+++T+T+GST G   + PGI FGCG +NGG FN K  G++GLGGG +SL+ Q+ + + 
Sbjct: 184 DLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG 243

Query: 241 GKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQR 293
           G+FSYCLVP+S     S+KINFG +G+VSG G VSTPL K    TFY LT++ +SVG++ 
Sbjct: 244 GQFSYCLVPLSSDSTVSSKINFGKSGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSET 303

Query: 294 L-------------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
           +              V   +I+IDSGTTLT LPQ + +++ S +++ I  Q   DP G  
Sbjct: 304 VAFKGFSENKSSPAAVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIF 363

Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQ 399
            LCYS  +  ++P +T HF GADV+L   N FV+V ED+VC  F  I +S + I+GN+ Q
Sbjct: 364 SLCYSSVNNLEIPTITAHFTGADVQLPPLNTFVQVQEDLVC--FSMIPSSNLAIFGNLAQ 421

Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
            NFLVGYD++   VSFK TDCT+
Sbjct: 422 INFLVGYDLKNNKVSFKQTDCTE 444


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  374 bits (959), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 212/442 (47%), Positives = 293/442 (66%), Gaps = 26/442 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M T    + ++   C Y +S ++A  GGFSVE+IHRDS +SP Y  +ETP+QR+ +A+ R
Sbjct: 3   MITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR NHF +  +  S+ ++++ ++ +   YL+R S+G+PP + L + DTGSD++W QCE
Sbjct: 63  SINRGNHFKK--AFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
           PC    CY Q +P+FDP  S TYK+LPCSS+ C SL   +CS  N C+YS+ YGDGS S+
Sbjct: 121 PC--EDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSD 178

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G+L+ ET+TLGST G +V  P    GCG NNGG F  + +GIVGLGGG +SLISQ+ ++I
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEEGSGIVGLGGGPVSLISQLSSSI 238

Query: 240 AGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQ 292
            GKFSYCL P+     SS+K+NFG   +VSG G VSTPL     + FY LT++A SVG+ 
Sbjct: 239 GGKFSYCLAPIFSESNSSSKLNFGDAAVVSGRGTVSTPLDPLNGQVFYFLTLEAFSVGDN 298

Query: 293 RLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           R+  S            +I+IDSGTTLT LPQ    NL S +S +I+ +   DP+  L L
Sbjct: 299 RIEFSGSSSSGSGSGDGNIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSL 358

Query: 343 CYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQT 400
           CY   S    +P +T HF+GADV+L+  + FV V + +VC  F  I++ +  I+GN+ Q 
Sbjct: 359 CYKTTSDELDLPVITAHFKGADVELNPISTFVPVEKGVVCFAF--ISSKIGAIFGNLAQQ 416

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           N LVGYD+ ++TVSFKPTDCTK
Sbjct: 417 NLLVGYDLVKKTVSFKPTDCTK 438


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 208/442 (47%), Positives = 287/442 (64%), Gaps = 33/442 (7%)

Query: 9   FILFFLCFYVVSPI------EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+   +CF  +SP        +   GFS+ LIHRDSP SP YN + T + RLR+A +RS+
Sbjct: 8   FVTIVICFISLSPFPLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R+N F   +   +S   Q D++PN   Y +++SIGTP  E + +ADTGSDL W QC PC
Sbjct: 68  SRVNVFKTKAVDINS--FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFS 178
            P  CY Q SPLFDP  SS+Y+ + C S  C +L+  +++C+     C+Y  SYGD S++
Sbjct: 126 DP--CYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYT 183

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           NGNLATE  T+GST+ + V L  I FGCGT NGG F+   +GIVGLGGG +SL+SQ+ + 
Sbjct: 184 NGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243

Query: 239 IAGKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGN 291
           I GKFSYCLVP+S     ++KI FGT+ ++SGP VVSTPL   +  T+Y +T++AISVGN
Sbjct: 244 IKGKFSYCLVPLSEQSNVTSKIKFGTDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGN 303

Query: 292 QRL---------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           +RL          V   +++IDSGTTLTFL   + + L  V+   ++A+ V+DP G   +
Sbjct: 304 KRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGLFSV 363

Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTN 401
           C+       +P + +HF  ADVKL   N FVK  ED++C  F  I +N + I+GN+ Q +
Sbjct: 364 CFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLC--FTMISSNQIGIFGNLAQMD 421

Query: 402 FLVGYDIEQQTVSFKPTDCTKQ 423
           FLVGYD+E++TVSFKPTDCTK 
Sbjct: 422 FLVGYDLEKRTVSFKPTDCTKH 443


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  371 bits (952), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 211/444 (47%), Positives = 289/444 (65%), Gaps = 30/444 (6%)

Query: 4   FLSCVFILFFLCFYVV-SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+ C+  + FL ++   S  EA+  GF+ + I RDSP+SPFYN SET YQRL+ A  RS+
Sbjct: 8   FVFCLLAIIFLIYFAKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            R NHF   +  +S    Q+++I    +YL+ IS+GTPP   L +ADTGSDLIW QC PC
Sbjct: 68  LRGNHFR--AIRASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
               CY Q  PLFDPK S TYK+L C++  C  L Q+ SC   N C  S SYGD S++  
Sbjct: 126 --DDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRR 183

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           +L++ET T+GST G   + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + 
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243

Query: 241 GKFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQR 293
           G+FSYCLVP+S     S+KINFG + +VSG G VSTPL K    TFY LT++ +S+G+++
Sbjct: 244 GQFSYCLVPLSSDSTASSKINFGKSAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEK 303

Query: 294 LGV-------STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
           +         S+P      +I+IDSGTTLT LP+ + +++ S ++ +I  Q   DP G+ 
Sbjct: 304 VAFKGFSKNKSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTF 363

Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQ 399
            LCYS     ++P +T HF GADV+L   N FV+  ED+VC  F  I +S + I+GN+ Q
Sbjct: 364 SLCYSGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQ 421

Query: 400 TNFLVGYDIEQQTVSFKPTDCTKQ 423
            NFLVGYD++   VSFKPTDCTKQ
Sbjct: 422 MNFLVGYDLKNNKVSFKPTDCTKQ 445


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  370 bits (950), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 206/432 (47%), Positives = 272/432 (62%), Gaps = 26/432 (6%)

Query: 9   FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            +LF+LC  FY    +EA  GGFSVE+IHRDS +SPF++ +ET +QR+ +A+ RS+NR N
Sbjct: 11  LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRAN 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           H NQ  S  S  + +  +I     YLI  S+GTP  +   + DTGSD+IW QC+PC   +
Sbjct: 67  HLNQ--SFVSPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPC--KK 122

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATE 185
           CY Q +P+FD   S TYK+LPC S+ C S+    CS   +C YS+ Y DGS S G+L+ E
Sbjct: 123 CYEQTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVE 182

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+TLGST G  V  PG   GCG  N      K +GIVGLG G +SLI+Q+  +  GKFSY
Sbjct: 183 TLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSY 242

Query: 246 CLVP---VSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVSTP- 299
           CLVP    +S+K+NFG   +VSG G VSTPL       FY LT++A SVG  R+   +P 
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302

Query: 300 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSL-SQ 351
                +I+IDSGTTLT LP G  S L + ++  +  Q V DP   L LCY    + L + 
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDAS 362

Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           VP +T HF GADV L+  N FV+V++D+VC  F+  T +  ++GN+ Q N LVGYD++  
Sbjct: 363 VPVITAHFSGADVTLNAINTFVQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMN 421

Query: 412 TVSFKPTDCTKQ 423
           TVSFK TDCTKQ
Sbjct: 422 TVSFKHTDCTKQ 433


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  367 bits (942), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 190/428 (44%), Positives = 273/428 (63%), Gaps = 21/428 (4%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +LFF   +++S   +    FS ELIHRDS KSP Y  ++  +Q + +A  RS+NR N   
Sbjct: 9   LLFFSLCFIISFSHSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLF 68

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           ++S    S   ++ +  N   YL+  S+GTPP     V DTGSD++W QC+PC   QCY 
Sbjct: 69  KDSL---SNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPC--EQCYK 123

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q +P+F+P  SS+YK++PCSS+ C S+   SC+  N C+Y++++ D S+S G L+ ET+T
Sbjct: 124 QTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLT 183

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           L STTG +V+ P    GCG NN G+F  +T+GIVGLG G +SL +Q++++I GKFSYCL+
Sbjct: 184 LDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLL 243

Query: 249 PV-----SSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPD- 300
           P+      ++K+NFG   +VSG GVVSTP  K   + FY LT++A SVGN+R+     D 
Sbjct: 244 PLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDD 303

Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPE 354
                I++DSGTTLT LP    +NL S ++ +++   V DP   L LCYS  S     P 
Sbjct: 304 SEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPI 363

Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           +T HF+GAD+KL+  + F  V++ +VC  F   + + PI+GN+ Q N LVGYD++Q  VS
Sbjct: 364 ITAHFKGADIKLNPISTFAHVADGVVCLAFTS-SQTGPIFGNLAQLNLLVGYDLQQNIVS 422

Query: 415 FKPTDCTK 422
           FKP+DC K
Sbjct: 423 FKPSDCIK 430


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  365 bits (936), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 209/436 (47%), Positives = 289/436 (66%), Gaps = 26/436 (5%)

Query: 11  LFFLCFYV-VSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           +  LC Y+ +S + A   GGFSVE+IHRDS +SP+Y  +ET +QR+ +AL RS+NR NHF
Sbjct: 12  IVLLCLYINISFLNALDGGGFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHF 71

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           N+ + ++S+  +++ +I +   YL+  S+GTPP + L + DTGSD+IW QC+PC    CY
Sbjct: 72  NKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPC--EDCY 129

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATE 185
            Q +P+FDP  S TYK+LPCSS+ C S+    SCS  N  C+Y+++YGD S S G+L+ E
Sbjct: 130 NQTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVE 189

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+TLGST G +V  P    GCG NN G F  + +GIVGLGGG +SLISQ+ ++I GKFSY
Sbjct: 190 TLTLGSTDGSSVQFPKTVIGCGHNNKGTFQREGSGIVGLGGGPVSLISQLSSSIGGKFSY 249

Query: 246 CLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL---- 294
           CL P+     SS+K+NFG   +VSG G VSTP+       FY LT++A SVG+ R+    
Sbjct: 250 CLAPLFSQSNSSSKLNFGDEAVVSGRGTVSTPIVPKNGLGFYFLTLEAFSVGDNRIEFGS 309

Query: 295 -----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
                     +I+IDSGTTLT LP+    NL S ++  IE + V DP+  L LCY   S 
Sbjct: 310 SSFESSGGEGNIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSS 369

Query: 350 SQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            +  VP +T HF+GADV+L+  + F++V E +VC  F+  +   PI+GN+ Q N LVGYD
Sbjct: 370 DELNVPVITAHFKGADVELNPISTFIEVDEGVVCFAFRS-SKIGPIFGNLAQQNLLVGYD 428

Query: 408 IEQQTVSFKPTDCTKQ 423
           + +QTVSFKPTDCT++
Sbjct: 429 LVKQTVSFKPTDCTQE 444


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  363 bits (933), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 195/433 (45%), Positives = 273/433 (63%), Gaps = 26/433 (6%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +LFF   ++VS   AQ  GFSVELIHRDS KSP Y  ++  YQ   DA  RS+NR NHF 
Sbjct: 9   LLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY 68

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           +    S +   Q+ +IP+   YL+  S+GTPP +   + DTGSD++W QCEPC   +CY 
Sbjct: 69  K---YSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPC--QECYN 123

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q +P+F+P  SS+YK++PC S  C S+   SC+  N C+YS  YGD S S G+L+ +T+T
Sbjct: 124 QTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLT 183

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           L ST G  V+ P I  GCGTNN   +   ++GIVG G G  S I+Q+ ++  GKFSYCL 
Sbjct: 184 LESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT 243

Query: 249 PV---------SSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL--- 294
           P+         +++K+NFG    VSG GVV+TP+ K   +TFY LT++A SVGN+R+   
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIG 303

Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
               G +  +I+IDSGTTLT L +   S L S +  +++ + V DPT +L LCYS  +  
Sbjct: 304 GVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAEG 363

Query: 351 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
              P +T+HF+GADV L   + FV V++ + C  F+   +   I+GN+ Q N +VGYD++
Sbjct: 364 YDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHA-IFGNLAQQNLMVGYDLQ 422

Query: 410 QQTVSFKPTDCTK 422
           Q+ VSFKP+DCTK
Sbjct: 423 QKIVSFKPSDCTK 435


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  363 bits (932), Expect = 8e-98,   Method: Compositional matrix adjust.
 Identities = 211/444 (47%), Positives = 283/444 (63%), Gaps = 33/444 (7%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           +FI F       S +EA+  GFS  LIHRDS  SP YN  +T + RLR++  RS++R N 
Sbjct: 11  LFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANR 70

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           F  NS IS+    Q+DI+P    YL+RISIG P  E LA+ADTGSDLIW QC+PC    C
Sbjct: 71  FKPNS-ISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPC--EMC 127

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSG----VNCQYSVSYGDGSFSNGN 181
           Y Q+SP+FDP+ SS+Y+++ C +  C  L+   +SC        C Y+ SYGD SFS+G+
Sbjct: 128 YKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGH 187

Query: 182 LATETVTLGST---TGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
           LA E   +GST   T  A+A    + FGCGT NGG F+   +GI+GLGGG +SL+SQ+  
Sbjct: 188 LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247

Query: 238 TIAGKFSYCLVPVS-----STKINFGTNGIVSGPG--VVSTPL--TKAKTFYVLTIDAIS 288
            ++GKFSYCLVP S     ++KINFG +  +SG    VVSTPL   K +T+Y LT++AIS
Sbjct: 248 KLSGKFSYCLVPTSEQSNYTSKINFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAIS 307

Query: 289 VGNQRL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
           V N+RL         V   +I+IDSGTTLTFL   + +NL S +   ++ + V+DP G  
Sbjct: 308 VENKRLPYTNLWNGEVEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLF 367

Query: 341 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQ 399
            +C+      ++P +T HF GADV+L   N F KV ED++C  F  I +N + I+GN+ Q
Sbjct: 368 NICFKDEKAIELPIITAHFTGADVELQPVNTFAKVEEDLLC--FTMIPSNDIAIFGNLAQ 425

Query: 400 TNFLVGYDIEQQTVSFKPTDCTKQ 423
            NFLVGYD+E++ VSF PTDCTKQ
Sbjct: 426 MNFLVGYDLEKKAVSFLPTDCTKQ 449


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 201/443 (45%), Positives = 271/443 (61%), Gaps = 34/443 (7%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA   S V ++ FL    V  + A TG   GF+VELIHRDSPKSP YN  E  Y R+ D 
Sbjct: 1   MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L RS++       N+ + ++   +A I  N   YL+++S+GTPP   +AVADTGSD+IWT
Sbjct: 59  LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
           QCEPC  + CY QD P+F+P  S+TY+ + CSS  C+   +  SCS   +C YS+SYGD 
Sbjct: 112 QCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S S G+ A +T+T+GST+G+ VA P    GCG +N G F++  +GIVGLG G  SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229

Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAI 287
            + + GKFSYCL P+      S K+NFG+N  VSG G VSTP+    K K+FY L + A+
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 288 SVGNQRLGVST--------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
           SVG      ST         +I+IDSGTTLT LP     N    +S+ I  Q   DP   
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349

Query: 340 LELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 397
           LE C+   +   +VP + +HF GA+++L R N  ++VS++++C  F G   N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q NFLVGYD+   ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  363 bits (931), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 206/420 (49%), Positives = 281/420 (66%), Gaps = 27/420 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GFSVE+IHRDS +SP Y  +ETP+QR+ +A+ RS+NR NHFN+ S ++S+  +++ +  +
Sbjct: 34  GFSVEMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTVKAS 93

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+  S+GTPP E L V DTGS + W QC+ C    CY Q +P+FDP  S TYK+LP
Sbjct: 94  QGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRC--EDCYEQTTPIFDPSKSKTYKTLP 151

Query: 148 CSSSQCAS-LNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSS+ C S ++  SCS   + C+Y++ YGDGS S G+L+ ET+TLGST G +V  P    
Sbjct: 152 CSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVI 211

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGT 259
           GCG NN G F  + +G+VGLGGG +SLISQ+ ++I GKFSYCL P+     SS+K+NFG 
Sbjct: 212 GCGHNNKGTFQGEGSGVVGLGGGPVSLISQLSSSIGGKFSYCLAPMFSQSNSSSKLNFGD 271

Query: 260 NGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-----------GVSTPDIVIDS 305
             +VSG G VSTPL   T ++ FY LT++A SVG++R+                +I+IDS
Sbjct: 272 AAVVSGLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDS 331

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGAD 363
           GTTLT LPQ   SNL S ++  I+A  V+DP+  L LCY      Q  VP +T HF+GAD
Sbjct: 332 GTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHFKGAD 391

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
           V+L+  + FV+V+E +VC  F   +  V I+GN+ Q N LVGYD+ +QTVSFKPTDCT++
Sbjct: 392 VELNPISTFVQVAEGVVCFAFHS-SEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTDCTQE 450


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  362 bits (930), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 196/436 (44%), Positives = 281/436 (64%), Gaps = 30/436 (6%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           V ++ FL F+++    A  GGFSV+LIHRDSP SPF++ S+T  +RL DA  RS +R+  
Sbjct: 12  VVVVGFL-FHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSASRVGR 70

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           F Q  S  +S   Q+ ++P+   Y++ +SIGTPP   +A+ DTGSDL WTQC PC  + C
Sbjct: 71  FRQ--SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC--THC 126

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSC-SGVNCQYSVSYGDGSFSNGNLATE 185
           Y Q  P FDPK SSTY+   C +S C +L N +SC +G  C +  SY DGSF+ GNLA E
Sbjct: 127 YKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVE 186

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+T+ ST G+ V+ PG  FGC   +GG+F+  ++GIVGLG  ++S+ISQ+++TI G+FSY
Sbjct: 187 TLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTINGRFSY 246

Query: 246 CLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLG-- 295
           CL+PV      S++INFG +GIVSG G VSTPL        +Y++T++  SVG +RL   
Sbjct: 247 CLLPVFTDSSMSSRINFGRSGIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYK 306

Query: 296 -------VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
                  V   +I++DSGTT T+LP  +   L   ++  I+ + V DP G   LCY+  +
Sbjct: 307 GFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TT 365

Query: 349 LSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVG 405
           + Q+  P +T HF+ A+V+L   N F+++ ED+VC +V    T+ + I GN+ Q NFLVG
Sbjct: 366 VDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQVNFLVG 423

Query: 406 YDIEQQTVSFKPTDCT 421
           +D+ ++ VSFK  DCT
Sbjct: 424 FDLRKKRVSFKAADCT 439


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  359 bits (922), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 200/443 (45%), Positives = 270/443 (60%), Gaps = 34/443 (7%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA   S V ++ FL    V  + A TG   GF+VELIHRDSPKSP YN  E  Y R+ D 
Sbjct: 1   MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L RS++       N+ + ++   +A I  N   YL+++S+GTPP   +AVADTGSD+IWT
Sbjct: 59  LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
           QC PC  + CY QD P+F+P  S+TY+ + CSS  C+   +  SCS   +C YS+SYGD 
Sbjct: 112 QCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S S G+ A +T+T+GST+G+ VA P    GCG +N G F++  +GIVGLG G  SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229

Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAI 287
            + + GKFSYCL P+      S K+NFG+N  VSG G VSTP+    K K+FY L + A+
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 288 SVGNQRLGVST--------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
           SVG      ST         +I+IDSGTTLT LP     N    +S+ I  Q   DP   
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349

Query: 340 LELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 397
           LE C+   +   +VP + +HF GA+++L R N  ++VS++++C  F G   N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q NFLVGYD+   ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  358 bits (918), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 198/429 (46%), Positives = 264/429 (61%), Gaps = 22/429 (5%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           L  LC Y +   EA   GFSVE+IHRDS +SPFY ++ET +QR+ +A+ RS+NR NHFNQ
Sbjct: 9   LVLLCLYNICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQ 68

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
            S  S++  S   ++ ++ +YL+  S+GTPP     + DT SD+IW QC+ C    CY  
Sbjct: 69  ISVYSNAVESPVTLL-DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLC--ETCYND 125

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETV 187
            SP+FDP  S TYK+LPCSS+ C S+   SCS      C+++V+Y DGS S G+L  ETV
Sbjct: 126 TSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETV 185

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TLGS     V  P    GC  N    F+S   GIVGLGGG +SL+ Q+ ++I+ KFSYCL
Sbjct: 186 TLGSYNDPFVHFPRTVIGCIRNTNVSFDS--IGIVGLGGGPVSLVPQLSSSISKKFSYCL 243

Query: 248 VPVS--STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTP---- 299
            P+S  S+K+ FG   +VSG G VST +     K FY LT++A SVGN R+   +     
Sbjct: 244 APISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRS 303

Query: 300 ----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQVPE 354
               +I+IDSGTT T LP    S L S ++ +++ +   DP     LCY S      VP 
Sbjct: 304 SGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPV 363

Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           +T HF GADVKL+  N F+  S  +VC  F   + S  I+GN+ Q NFLVGYD++++ VS
Sbjct: 364 ITAHFSGADVKLNALNTFIVASHRVVCLAFLS-SQSGAIFGNLAQQNFLVGYDLQRKIVS 422

Query: 415 FKPTDCTKQ 423
           FKPTDCTKQ
Sbjct: 423 FKPTDCTKQ 431


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  351 bits (900), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 200/442 (45%), Positives = 286/442 (64%), Gaps = 30/442 (6%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
            +FL+  F  FFLCF  +S  +A + GFS+ELIHRDS KSPFY  ++  YQ + DA+ RS
Sbjct: 4   VSFLTLSF--FFLCF-SISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRS 60

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
           +NR+NH N+NS  S+ +++   +I    +Y++  S+GTPP +   + DTGSD++W QCEP
Sbjct: 61  INRVNHSNKNSLASTPEST---VISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEP 117

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQYSVSYGDGSFSNG 180
           C   QCY Q +P F+P  SS+YK++ CSS  C S+   SC+   NC+YS++YG+ S S G
Sbjct: 118 C--EQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQG 175

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           +L+ ET+TL STTG+ V+ P    GCGTNN G F   ++G+VGLGGG  SLI+Q+  +I 
Sbjct: 176 DLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIG 235

Query: 241 GKFSYCLVPVS---------STKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISV 289
           GKFSYCLV +S         S+K+NFG   IVSG  V+STP+ K     FY LTI+A SV
Sbjct: 236 GKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSV 295

Query: 290 GNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           G++R+       GV   +I+IDS T +TF+P    + L S +  ++  + V DP     L
Sbjct: 296 GDKRVEFAGSSKGVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFSL 355

Query: 343 CYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
           CY+ +S  +   P +T HF+GAD+ L  +N FV+V+ D++C  F   +N   I+G+  Q 
Sbjct: 356 CYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAF-APSNGGAIFGSFSQQ 414

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           +F+VGYD++Q+TVSFK  DCT+
Sbjct: 415 DFMVGYDLQQKTVSFKSVDCTE 436


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 186/415 (44%), Positives = 266/415 (64%), Gaps = 24/415 (5%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F + V + F   F ++    A+ GGFSV+LIHRDSP SPF++ S+T  +RL DA  RS++
Sbjct: 9   FFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVS 66

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           R+  F   +   +S   Q+ I+P+   YL+ + IGTPP   +A+ DTGSDL WTQC PC 
Sbjct: 67  RVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC- 123

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGN 181
            + CY Q  PLFDPK SSTY+   C +S C +L + +SCS    C +  SY DGSF+ GN
Sbjct: 124 -THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGN 182

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           LA+ET+T+ ST G+ V+ PG  FGCG ++GG+F+  ++GIVGLGGG++SLISQ+++TI G
Sbjct: 183 LASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING 242

Query: 242 KFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
            FSYCL+PVS     S++INFG +G VSG G VSTPL      Y          +++  V
Sbjct: 243 LFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGY----------SKKTEV 292

Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVT 356
              +I++DSGTT TFLPQ + S L   +++ I+ + V DP G   LCY+  +    P +T
Sbjct: 293 EEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINAPIIT 352

Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
            HF+ A+V+L   N F+++ ED+VC      T+ + + GN+ Q NFLVG+D+ ++
Sbjct: 353 AHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRKK 406



 Score = 97.1 bits (240), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 52/134 (38%), Positives = 84/134 (62%), Gaps = 6/134 (4%)

Query: 291 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
           +++  V   +I++DSGTT T+LP  +   L   ++  I+ + V DP G   LCY+  ++ 
Sbjct: 410 SKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSLCYN-TTVD 468

Query: 351 QV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
           Q+  P +T HF+ A+V+L   N F+++ ED+VC +V    T+ + I GN+ Q NFLVG+D
Sbjct: 469 QIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQVNFLVGFD 526

Query: 408 IEQQTVSFKPTDCT 421
           + ++ VSFK  DCT
Sbjct: 527 LRKKRVSFKAADCT 540


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  348 bits (894), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 194/437 (44%), Positives = 266/437 (60%), Gaps = 49/437 (11%)

Query: 8   VFILFF--LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           + ILF+  LCF ++S   A   GFSVELIHRDS KSP Y  ++  YQ + +A  RS+NR 
Sbjct: 6   LLILFYFSLCF-IISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRA 64

Query: 66  NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           NHF + +    +   Q+ +IP++  YL+  S+GTPP +   +ADTGSD++W QCEPC   
Sbjct: 65  NHFYKTAL---TNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPC--K 119

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
           +CY Q +P F P  SSTYK++PCSS  C S  Q                     GNL+ +
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQQ---------------------GNLSVD 158

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+TL S+TG  ++ P    GCGT+N   F   ++GIVGLGGG  SLI+Q+ ++I  KFSY
Sbjct: 159 TLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSY 218

Query: 246 CLVP-----VSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL---- 294
           CL+P      +++K+NFG   +VSG GVVSTP+ K     FY LT++A SVGN+R+    
Sbjct: 219 CLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEG 278

Query: 295 ---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
              G    +I+IDSGTTLT +P    +NL S +  +++ + V DPT    LCYS  S   
Sbjct: 279 SSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSVTSDGY 338

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-----IYGNIMQTNFLVG 405
             P +T HF+GADVKL   + FV V++ IVC  F   +  +P     I+GN+ Q N LVG
Sbjct: 339 DFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVG 398

Query: 406 YDIEQQTVSFKPTDCTK 422
           YD++Q+ VSFKPTDC+K
Sbjct: 399 YDLQQKIVSFKPTDCSK 415


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  344 bits (883), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 190/433 (43%), Positives = 265/433 (61%), Gaps = 37/433 (8%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +FL+ +F   F CF ++S   A   GF++ELIHRDS KSPFY  ++  Y+R+ +A+ RS+
Sbjct: 5   SFLTLLFFTIF-CF-IISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSI 62

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           NR+NHF + S  S+    Q+ +  +   YL+  SIGTPP +     DTGSDL+W QCEPC
Sbjct: 63  NRVNHFYKYSLTSTP---QSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPC 119

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QCY Q +P+FDP +SS+Y+++PC S  C S+   SC                  G L
Sbjct: 120 --KQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCD---------------VRGYL 162

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           + ET+TL STTG +V+ P    GCG  N G F+  ++GIVGLG G +SL SQ+ T+I GK
Sbjct: 163 SVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGK 222

Query: 243 FSYCL---VPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVS 297
           FSYCL   +P S++K+NFG   IV G G ++TP+ K  A++ Y LT++A SVGN+ +   
Sbjct: 223 FSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFG 282

Query: 298 TP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
            P       +I+IDSGTT TFLP        S ++  I  + V DP G+ +LCY+     
Sbjct: 283 GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHG 342

Query: 351 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            + P +T HF+GAD+KL   + F+KVS+ I C  F  I +   I+GN+ Q N LVGY++ 
Sbjct: 343 FEAPLITAHFKGADIKLYYISTFIKVSDGIACLAF--IPSQTAIFGNVAQQNLLVGYNLV 400

Query: 410 QQTVSFKPTDCTK 422
           Q TV+FKP DCTK
Sbjct: 401 QNTVTFKPVDCTK 413


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  341 bits (875), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 200/420 (47%), Positives = 263/420 (62%), Gaps = 33/420 (7%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G F+  LIHRDSP SP YN   T + RL+ +  RS++R N F  NS +S++K  + DIIP
Sbjct: 31  GSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNS-VSAAKTLEYDIIP 89

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y +RISIGTPP E L +ADTGSDLIW QC+PC   +CY Q SP+F+PK SSTY+ +
Sbjct: 90  GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC--QECYKQKSPIFNPKQSSTYRRV 147

Query: 147 PCSSSQCASLN--QKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            C +  C +LN   ++CS       C YS SYGD SF+ G LATE   +GST     ++ 
Sbjct: 148 LCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN---SIQ 204

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------SSTK 254
            + FGCG +NGG F+   +GIVGLGGG +SLISQ+ T I  KFSYCLVP+      S  K
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGK 264

Query: 255 INFGTNGIVSGPGV-VSTPLT--KAKTFYVLTIDAISVGNQRLG---------VSTPDIV 302
           I FG N  +SG    VSTPL   + +TFY LT++AISVGN+RL          V   +I+
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           IDSGTTLTFL     + L  V+   +E + V+DP G   +C+      ++P +T+HF  A
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIELPIITVHFTDA 384

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           DV+L   N F K  ED++C  F  I +N + I+GN+ Q NFLVGYD+++  VSF PTDC+
Sbjct: 385 DVELKPINTFAKAEEDLLC--FTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  336 bits (862), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 207/446 (46%), Positives = 271/446 (60%), Gaps = 38/446 (8%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           + + FFL F V          FSVELIHRDSP SP YN   T   RL  A  RS++R   
Sbjct: 5   ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           FN   S +     Q+ +I  +  + + I+IGTPP +  A+ADTGSDL W QC+PC   QC
Sbjct: 65  FNHQLSQTDL---QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC--QQC 119

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLA 183
           Y ++ P+FD K SSTYKS PC S  C +L+  ++ C   N  C+Y  SYGD SFS G++A
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 179

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           TETV++ S +G  V+ PG  FGCG NNGG F+   +GI+GLGGG +SLISQ+ ++I+ KF
Sbjct: 180 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKF 239

Query: 244 SYCLVPVSSTK-----INFGTNGIVSG----PGVVSTPLTKAK--TFYVLTIDAISVGNQ 292
           SYCL   S+T      IN GTN I S      GVVSTPL   +  T+Y LT++AISVG +
Sbjct: 240 SYCLSHKSATTNGTSVINLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKK 299

Query: 293 R---------------LGVSTPDIVIDSGTTLTFLPQGYNSNLLS-VMSSMIEAQPVADP 336
           +               L  ++ +I+IDSGTTLT L  G+     S V  S+  A+ V+DP
Sbjct: 300 KIPYTGSSYNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDP 359

Query: 337 TGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYG 395
            G L  C+   S    +PE+T+HF GADV+LS  N FVK+SED+VC +    T  V IYG
Sbjct: 360 QGLLSHCFKSGSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVC-LSMVPTTEVAIYG 418

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
           N  Q +FLVGYD+E +TVSF+  DC+
Sbjct: 419 NFAQMDFLVGYDLETRTVSFQHMDCS 444


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 26/441 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +    S V  L F+    +S  E + G FS++LIHRDSPKSP YN SETP +RL     R
Sbjct: 7   LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R   F++ S   S    +  +  NN  YL++ISIGTPP +   + DTGSDL+WTQC 
Sbjct: 63  FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
           PC    CY Q +P+FDP  S+++K + C S QC  L+  SCS     C +S  YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+TL S +GQ  ++  I FGCG NN G FN    G+ G GG  +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238

Query: 239 IAG--KFSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
           +    KFS CLVP  +     +KI FG    VSG  VVSTPL      T+Y +T+D ISV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 290 GNQRLGVSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           G++    S+        ++ ID+GT  T LP+ + + L+  +   I  +PV DP    +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358

Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
           CY   +L   P +T HF GADV+L   N F+   E + C   + I     I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418

Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
           L+G+D++ + VSFK  DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  328 bits (842), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 186/441 (42%), Positives = 252/441 (57%), Gaps = 26/441 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +    S V  L F+    +S  E + G FS++LIHRDSPKSP YN SETP +RL     R
Sbjct: 7   LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R   F++ S   S    +  +  NN  YL++ISIGTPP +   + DTGSDL+WTQC 
Sbjct: 63  FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
           PC    CY Q +P+FDP  S+++K + C S QC  L+  SCS     C +S  YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+TL S +GQ  ++  I FGCG NN G FN    G+ G GG  +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238

Query: 239 IAG--KFSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
           +    KFS CLVP  +     +KI FG    VSG  VVSTPL      T+Y +T+D ISV
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 290 GNQRLGVSTP-------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           G++    S+        ++ ID+GT  T LP+ + + L+  +   I  +PV DP    +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358

Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
           CY   +L   P +T HF GADV+L   N F+   E + C   + I     I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418

Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
           L+G+D++ + VSFK  DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  327 bits (837), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 183/440 (41%), Positives = 260/440 (59%), Gaps = 27/440 (6%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F S + +LF  CF  VS  + Q  GFSVELIH  S KSPFYN++E+ +QR+ + +  S N
Sbjct: 3   FYSSLLLLF--CFCRVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNMKHSTN 60

Query: 64  RLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           R+++ N   S   +K     + P   + Y+I   IGTPP +   V DT +D IW QC PC
Sbjct: 61  RVHYLNHVFSFPPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPC 120

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSN 179
            P  C+   SP+FDP  SSTYK++PCSS +C ++    CS  +   C+YS +YG  ++S 
Sbjct: 121 KP--CFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQ 178

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G+L+ +T+TL S     ++   I  GCG  N G      +G +GLG G +S ISQ+ ++I
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238

Query: 240 AGKFSYCLVPVSST-----KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL 294
            GKFSYCLVP+ S      K++FG   +VSG G VSTP+T  +  Y  T++A+SVG+  +
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHII 298

Query: 295 GVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-- 344
                        + +IDSGTTLT LP+   S L S+++SM++ +    P    +LCY  
Sbjct: 299 KFENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKA 358

Query: 345 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNF 402
           +  +L  VP +T HF GADV L+  N F  +  ++VC  F  + N  P  I GNI Q NF
Sbjct: 359 TLKNL-DVPIITAHFNGADVHLNSLNTFYPIDHEVVCFAFVSVGN-FPGTIIGNIAQQNF 416

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
           LVG+D+++  +SFKPTDCTK
Sbjct: 417 LVGFDLQKNIISFKPTDCTK 436


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  326 bits (835), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 185/427 (43%), Positives = 257/427 (60%), Gaps = 26/427 (6%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
           VV+PIE+Q  GFSVELIH DS +SPFYN  ET  QR+ + +T S+ R ++ N   S+S +
Sbjct: 16  VVTPIESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHN 75

Query: 78  KASQADIIPNNANY-LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
              +  IIP   +Y ++  SIGTPP +   V DTGSD IW QC+PC P  C  Q SP+F+
Sbjct: 76  DLPKPTIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFN 133

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SSTYK++ CSS  C    +  CS      C+Y ++Y D S S G+++ +T+TL S  
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS- 252
           G  ++ P I  GCG  N        +GI+G G G+ S++SQ+ ++I GKFSYCL  + S 
Sbjct: 194 GSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSK 253

Query: 253 ----TKINFGTNGIVSGPGVVSTPLTKAKTFYV----LTIDAISVGNQRLGVS----TPD 300
               +K+ FG   +VSG GVVSTPL ++  FYV      ++A SVG+  + +      PD
Sbjct: 254 ANISSKLYFGDMAVVSGHGVVSTPLIQS--FYVGNYFTNLEAFSVGDHIIKLKDSSLIPD 311

Query: 301 ----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEV 355
                VIDSG+T+T LP    S L + + SM++ + V DPT  L LCY       +VP +
Sbjct: 312 NEGNAVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPII 371

Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           T HFRGADVKL+  N F++++ +++C  F        +YGNI Q NFLVGYD  +  +SF
Sbjct: 372 TAHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISF 431

Query: 416 KPTDCTK 422
           KPT+CTK
Sbjct: 432 KPTNCTK 438


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  325 bits (833), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 196/425 (46%), Positives = 265/425 (62%), Gaps = 38/425 (8%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
            SVELIHRDSP SP YN   T   RL  A  RS++R    N   +I S    Q+ +I  +
Sbjct: 26  LSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLN---NILSQTDLQSGLIGAD 82

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             + + I+IGTPP +  A+ADTGSDL W QC+PC   QCY ++ P+FD K SSTYKS PC
Sbjct: 83  GEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPC--QQCYKENGPIFDKKKSSTYKSEPC 140

Query: 149 SSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            S  C +L+  ++ C      C+Y  SYGD SFS G++ATET+++ S +G  V+ PG  F
Sbjct: 141 DSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGT 259
           GCG NNGG F+   +GI+GLGGG +SLISQ+ ++I+ KFSYCL   S+T      IN GT
Sbjct: 201 GCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSISKKFSYCLSHKSATTNGTSVINLGT 260

Query: 260 NGIVSG----PGVVSTPLT--KAKTFYVLTIDAISVGNQRL------------GV---ST 298
           N I S      GV+STPL   + +T+Y LT++AISVG +++            G+   ++
Sbjct: 261 NSIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGIFSETS 320

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSFNSLS-QVPEVT 356
            +I+IDSGTTLT L  G+     + +  ++  A+ V+DP G L  C+   S    +PE+T
Sbjct: 321 GNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKSGSAEIGLPEIT 380

Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           +HF GADV+LS  N FVKVSED+VC +    T  V IYGN  Q +FLVGYD+E +TVSF+
Sbjct: 381 VHFTGADVRLSPINAFVKVSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQ 439

Query: 417 PTDCT 421
             DC+
Sbjct: 440 RMDCS 444


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  319 bits (818), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 179/439 (40%), Positives = 270/439 (61%), Gaps = 25/439 (5%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M+ F     I F+LC ++     A   G S+E+IHRD  KSP Y+ + T +QR  + + R
Sbjct: 1   MSRFSVLTLIFFYLCCFIYFS-HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHR 59

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR+N+F +  S++ ++   + + P    YLI  S+GTPP +     DTGS+++W QC+
Sbjct: 60  SINRVNYFTKEFSLNKNQPV-STLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ 118

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCS--GVNCQYSVSYGDGS 176
           PC  + C+ Q SP+F+P  SS+YK++PC+SS C   N    SCS  G  C+YS++YG  +
Sbjct: 119 PC--NTCFNQTSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDA 176

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM- 235
            S G+L+ +++TL ST+G +V  P I  GCG  N    NS+++G+VG+G G +SLI Q+ 
Sbjct: 177 KSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVG 236

Query: 236 RTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAI 287
            +++  KFSYCL+P      SS+K+ FG + +VSG  VVSTP+ K    + +Y LT++A 
Sbjct: 237 SSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAF 296

Query: 288 SVGNQRL------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           SVGN R+        ST +I+IDSGT LT LP  + S L+S ++  ++   +  P   L 
Sbjct: 297 SVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLS 356

Query: 342 LCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
           LCY+       VP++T HF GADVKL+ +  F    + I+C  F   +N + I+GNI Q 
Sbjct: 357 LCYNTTGKQLNVPDITAHFNGADVKLNSNGTFFPFEDGIMCFGFIS-SNGLEIFGNIAQN 415

Query: 401 NFLVGYDIEQQTVSFKPTD 419
           N L+ YD+E++ +SFKPTD
Sbjct: 416 NLLIDYDLEKEIISFKPTD 434


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 202/449 (44%), Positives = 273/449 (60%), Gaps = 41/449 (9%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           T L C   L  +  +  S   A     SVELIHRDSP SP YN   T   RL  A     
Sbjct: 5   TLLYCS--LLAITIFFTSTSSAHRKNLSVELIHRDSPHSPLYNPQHTVSDRLNAAF---- 58

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
             L   +++   S+    Q+ +I N   Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 59  --LRSISRSRRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPC 116

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDGSFS 178
              QCY Q++PLFD K SSTYK+  C S  C +L  +++ C  S   C+Y  SYGD SF+
Sbjct: 117 --QQCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFT 174

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+++ S++G  V+ PG  FGCG NNGG F    +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234

Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGP----GVVSTPLTKA--KTFYVLTIDAI 287
           I  KFSYCL   S+T      IN GTN + S P     +++TPL +   +T+Y LT++AI
Sbjct: 235 IGKKFSYCLSHTSATTNGTSVINLGTNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAI 294

Query: 288 SVGNQRL------GVS-------TPDIVIDSGTTLTFLPQGYNSNLLSVM-SSMIEAQPV 333
           +VG  +L      G S       T +I+IDSGTTLT L  G+  +  +V+  S+  A+ V
Sbjct: 295 TVGKTKLPYTGGGGYSLNRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKRV 354

Query: 334 ADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
           +DP G L  C+ S +    +P +T+HF GADVKLS  N FVK+SEDIVC +    T  V 
Sbjct: 355 SDPQGILTHCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVC-LSMIPTTEVA 413

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           IYGN++Q +FLVGYD+E +TVSF+  DC+
Sbjct: 414 IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  316 bits (810), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 198/449 (44%), Positives = 272/449 (60%), Gaps = 41/449 (9%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           TFL C   L  + F+  S   A     +VELIHRDSP SP YN   T   RL  A  RS+
Sbjct: 5   TFLYCS--LLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSI 62

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   F   + +      Q+ +I N   Y + ISIGTPP++  A+ADTGSDL W QC+PC
Sbjct: 63  SRSRRFTTKTDL------QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC 116

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
              QCY Q+SPLFD K SSTYK+  C S  C +L  +++ C      C+Y  SYGD SF+
Sbjct: 117 --QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFT 174

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G++ATET+++ S++G +V+ PG  FGCG NNGG F    +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234

Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGP----GVVSTPLTKA--KTFYVLTIDAI 287
           I  KFSYCL   ++T      IN GTN I S P      ++TPL +   +T+Y LT++A+
Sbjct: 235 IGKKFSYCLSHTAATTNGTSVINLGTNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAV 294

Query: 288 SVGNQRLGVS-------------TPDIVIDSGTTLTFLPQGYNSNL-LSVMSSMIEAQPV 333
           +VG  +L  +             T +I+IDSGTTLT L  G+  +   +V  S+  A+ V
Sbjct: 295 TVGKTKLPYTGGGYGLNGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRV 354

Query: 334 ADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
           +DP G L  C+ S +    +P +T+HF  ADVKLS  N FVK++ED VC +    T  V 
Sbjct: 355 SDPQGLLTHCFKSGDKEIGLPAITMHFTNADVKLSPINAFVKLNEDTVC-LSMIPTTEVA 413

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           IYGN++Q +FLVGYD+E +TVSF+  DC+
Sbjct: 414 IYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  307 bits (787), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 182/440 (41%), Positives = 265/440 (60%), Gaps = 31/440 (7%)

Query: 1   MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           MA  +S  F  ILF + F   + I    G F+  L HRDS  SP   SS + Y RL +A 
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
            RSL+R       ++ S +   Q+ I P +  YL+ +SIGTPP + L +ADTGSDL W Q
Sbjct: 60  RRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
           C PC   +CY Q  P+F+P  S+++  +PC++  C +++   C GV   C YS +YGD +
Sbjct: 120 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 176

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +S G+L  E +T+GS++ ++V       GCG  + G F    +G++GLGGG +SL+SQM 
Sbjct: 177 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 229

Query: 237 TT--IAGKFSYC---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
            T  I+ +FSYC   L+  ++ KINFG N +VSGPGVVSTPL      T+Y +T++AIS+
Sbjct: 230 QTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI 289

Query: 290 GNQR--LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-- 345
           GN+R        +++IDSGTTLT LP+     ++S +  +++A+ V DP GSL+LC+   
Sbjct: 290 GNERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349

Query: 346 FNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQT 400
            N+ +   +P +T HF  GA+V L   N F KV++++ C   K    T    I GN+ Q 
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409

Query: 401 NFLVGYDIEQQTVSFKPTDC 420
           NFL+GYD+E + +SFKPT C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  305 bits (782), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 191/437 (43%), Positives = 257/437 (58%), Gaps = 27/437 (6%)

Query: 9   FILFFLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F+ F L FY VS +   EA     GF+V+LIHRDSP SPFYN S TP QR+ +A  RS++
Sbjct: 4   FVFFCLAFYSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSIS 63

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           RLN  + N    ++K  Q+ +I +N  YL+R  IGTPP ERLA ADTGSDLIW QC PC 
Sbjct: 64  RLNRVS-NLLDQNNKLPQSVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPC- 121

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDG-SFS 178
            + C+ Q +PLF P  SST+    C S  C  L   QK C  SG  C Y+  YGD  SFS
Sbjct: 122 -ASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFS 179

Query: 179 NGNLATETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQM 235
            G L+TET+   S  G Q VA P   FGCG  NN  +F S K TGI+GLG G +SL+SQ+
Sbjct: 180 EGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQI 239

Query: 236 RTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISV 289
              I  KFSYCL+P+ ST   K+ FG   I++G GVVSTP+       T+Y L ++A++V
Sbjct: 240 GDQIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTV 299

Query: 290 GNQRLGVSTPD--IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
             + +   + D  ++IDSGT LT+L + +  N  + +   +  + V D    L  C+ + 
Sbjct: 300 AQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYR 359

Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVG 405
                PE+   F GA V L  +N FV   + + VC +    + S + I+G+  Q +F V 
Sbjct: 360 DNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVE 419

Query: 406 YDIEQQTVSFKPTDCTK 422
           YD+E + VSF+PTDC+K
Sbjct: 420 YDLEGKKVSFQPTDCSK 436


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  305 bits (780), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 178/432 (41%), Positives = 254/432 (58%), Gaps = 22/432 (5%)

Query: 9   FILFFLCFYVVSPIEAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            IL       +S  EA+ G  GFSV+LIHRDSP SPFYN S TP +R+ +A  RS++RL 
Sbjct: 7   MILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQ 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
             +    +  +K  ++ +IP+   YL+R  IG+PP ERLA+ DTGS LIW QC PC    
Sbjct: 67  RVSH--FLDENKLPESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPC--HN 122

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLA 183
           C+ Q++PLF+P  SSTYK   C S  C  L  +Q+ C  +  C Y + YGD SFS G L 
Sbjct: 123 CFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILG 182

Query: 184 TETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQMRTTIA 240
           TET++ GST G Q V+ P   FGCG  NN  ++ S K  GI GLG G +SL+SQ+   I 
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242

Query: 241 GKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL 294
            KFSYCL+P  ST   K+ FG+  I++  GVVSTPL       T+Y L ++A+++G + +
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302

Query: 295 GVSTPD--IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
                D  IVIDSGT LT+L   + +N ++ +   +  + + D    L+ C+   +   +
Sbjct: 303 STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRANLAI 362

Query: 353 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           P++   F GA V L   N  + +++ +I+C +V       + ++G+I Q +F V YD+E 
Sbjct: 363 PDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEG 422

Query: 411 QTVSFKPTDCTK 422
           + VSF PTDC K
Sbjct: 423 KKVSFAPTDCAK 434


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  303 bits (775), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 175/411 (42%), Positives = 250/411 (60%), Gaps = 20/411 (4%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD----I 84
           F+++LIH DSP SPFYNSS T  Q +R+A  RS++R N  + + S S ++  ++     I
Sbjct: 30  FTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPII 89

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           IPNN NYL+RI IGTP  ERLA+ADTGSDL W QC PC  ++C+ Q++PL+DP  SST+ 
Sbjct: 90  IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149

Query: 145 SLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            LPC S  C  L  +Q  CS   +C Y+ +YGD S+S G L+++++ L     Q      
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL--MLLQLHYNSK 207

Query: 202 ITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKIN 256
           I FGCG  N    +   KTTGIVGLG G +SL+SQ+   I  KFSYCL+P SS   +K+ 
Sbjct: 208 ICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLK 267

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQ--RLGVSTPDIVIDSGTTLTFL 312
           FG   IV G GVVSTPL       FY L ++ I+VG +  + G +  +I+IDSG+TLT+L
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKLSRSNF 371
            + + +  +S++   +  +         + C+++   +S  P+V  HF G DV L   N 
Sbjct: 328 EESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGDVVLKPMNT 387

Query: 372 FVKVSEDIVCS-VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            V + ++++CS V     + + I+GN+ Q +F VGYDI+   VSF PTDC+
Sbjct: 388 LVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  300 bits (767), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 171/430 (39%), Positives = 248/430 (57%), Gaps = 39/430 (9%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           F+L   CF  +S  + Q  GF+VELIH  S +SPFYN  ET  QR+   L  S+NR+ + 
Sbjct: 7   FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66

Query: 69  NQNSSISSSKASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           N   S S +K     +     A Y++  SIGTPP +  ++ DTG+D IW QC+PC P  C
Sbjct: 67  NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKP--C 124

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
             Q SP+F P  SSTYK++PC+S  C +                  DG +    L  +T+
Sbjct: 125 LNQTSPMFHPSKSSTYKTIPCTSPICKN-----------------ADGHY----LGVDTL 163

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL S  G  ++   I  GCG  N G      +G +GL  G +S ISQ+ ++I GKFSYCL
Sbjct: 164 TLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCL 223

Query: 248 VPV-----SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-- 300
           VP+      S+K++FG    VSG G VSTP+ K +  Y ++++A SVG+  + +   D  
Sbjct: 224 VPLFSKENVSSKLHFGDKSTVSGLGTVSTPI-KEENGYFVSLEAFSVGDHIIKLENSDNR 282

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQVPEV 355
              +IDSGTT+T LP+   S L SV+  M++ + V DP+    LCY   S   L++V  +
Sbjct: 283 GNSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLII 342

Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           T HF G++V L+  N F  ++++++C  F   G  +S+ I+GN++Q NFLVG+D+ ++T+
Sbjct: 343 TAHFSGSEVHLNALNTFYPITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTI 402

Query: 414 SFKPTDCTKQ 423
           SFKPTDCTK 
Sbjct: 403 SFKPTDCTKH 412


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 189/440 (42%), Positives = 271/440 (61%), Gaps = 33/440 (7%)

Query: 8   VFILF-FLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           VF++F  L  Y  S I   EA  G  GFS++LIHRDSP SPFY+ S TP +R+ +A  RS
Sbjct: 5   VFMVFMLLALYSPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRS 64

Query: 62  ---LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              LNR++HF   +++  S      +IP N  YL+ + IGTPP ERLA+ADTGSDLIW Q
Sbjct: 65  SSRLNRVSHFLDENNLPESL-----LIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDG 175
           C PC    C+ QD+PLF+P  SST+K+  C S  C S+  +Q+ C  V  C YS SYGD 
Sbjct: 120 CSPC--QNCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDK 177

Query: 176 SFSNGNLATETVTLGST-TGQAVALPGITFGCGTNNGGLFNS--KTTGIVGLGGGDISLI 232
           SF+ G + TET++ GST   Q V+ P   FGCG  N   F++  K TG+VGLGGG +SL+
Sbjct: 178 SFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHTSDKVTGLVGLGGGPLSLV 237

Query: 233 SQMRTTIAGKFSYCLVPVSS---TKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDA 286
           SQ+   I  KFSYCL+P SS   +K+ FG+  IV+  GVVSTPL       +FY L ++A
Sbjct: 238 SQLGPQIGYKFSYCLLPFSSNSTSKLKFGSEAIVTTNGVVSTPLIIKPLFPSFYFLNLEA 297

Query: 287 ISVGNQRL--GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
           +++G + +  G +  +I+IDSGT LT+L Q + +N ++ +  ++  +   D     + C+
Sbjct: 298 VTIGQKVVPTGRTDGNIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCF 357

Query: 345 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNF 402
            +  ++ +P +   F GA V L   N  +K+ + +++C +V     + + I+GN+ Q +F
Sbjct: 358 PYRDMT-IPVIAFQFTGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDF 416

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            V YD+E + VSF PTDCTK
Sbjct: 417 QVVYDLEGKKVSFAPTDCTK 436


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  292 bits (747), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 172/433 (39%), Positives = 245/433 (56%), Gaps = 30/433 (6%)

Query: 14  LCFYVVSPIEAQT-----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           L  Y++S + ++       GFS++LIHRDSP SPFY  S TP  R+ +   RS+ +LN  
Sbjct: 9   LALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALRSIYQLNR- 67

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             +S ++  K  +   IPN+  YL+R  IGTPP ERLA+ADT SDLIW QC PC    C+
Sbjct: 68  ASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPC--ETCF 125

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
            QD+PLF+P  SST+ +L C S  C S N   C  V   C Y+ +YGDGS + G L TE+
Sbjct: 126 PQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTES 185

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           +  GS   Q V  P   FGCG+NN  +   ++K TGIVGLG G +SL+SQ+   I  KFS
Sbjct: 186 IHFGS---QTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFS 242

Query: 245 YCLVPVSST---KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST 298
           YCL+P +ST   K+ FG +  ++G GVVSTPL       ++Y L +  I++G + L V T
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302

Query: 299 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQV 352
            D     I+ID GT LT+L   +  N ++++   +      D      + C+   +    
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITF 362

Query: 353 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGI-TNSVPIYGNIMQTNFLVGYDIE 409
           P++   F GA V LS  N F +  + +++C +V          ++GN+ Q +F V YD +
Sbjct: 363 PKIVFQFTGAKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRK 422

Query: 410 QQTVSFKPTDCTK 422
            + VSF P DC+K
Sbjct: 423 GKKVSFAPADCSK 435


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  288 bits (736), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 184/416 (44%), Positives = 249/416 (59%), Gaps = 32/416 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+  L  RDSP SP +N S + Y  L DA  RS +R      + +  S+   ++ IIP+
Sbjct: 27  GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPD 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  +L+ I IGTPP   +A+ADTGSDL WTQC PC   +C+ Q  P+F+P+ SS+Y+ + 
Sbjct: 87  SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC--RECFNQSQPIFNPRRSSSYRKVS 144

Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+S  C SL    C     +C Y  SYGD SF+ G+LA++ +T+GS       LP    G
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 199

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---KFSYCLVPVSSTK-----INF 257
           CG  NGG F   T+GI+GLGGG +SL+SQMR TIAG   +FSYCL    S       I+F
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMR-TIAGVKPRFSYCLPTFFSNANITGTISF 258

Query: 258 GTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL----GVST----PDIVIDSGT 307
           G   +VSG  VVSTPL      TFY LT++AISVG +R     G+S      +I+IDSGT
Sbjct: 259 GRKAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKRFKAANGISAMTNHGNIIIDSGT 318

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADV 364
           TLT LP+     + S ++ +I+A+ V DP+G LELCYS   +    +P +T HF  GADV
Sbjct: 319 TLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAGGADV 378

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           KL   N F  V++++ C  F   T  V I+GN+ Q NF VGYD+  + +SF+P  C
Sbjct: 379 KLLPVNTFAPVADNVTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFEPKLC 433


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  286 bits (732), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 169/430 (39%), Positives = 254/430 (59%), Gaps = 27/430 (6%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            IL  + F   + I    G F+  L HRDS  SP   SS + Y RL +A  RSL+R    
Sbjct: 11  LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
              ++ + +   QA + P +  YL+ +SIGTPP + + +ADTGSDL+W QC PC   +CY
Sbjct: 70  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCY 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
            Q  P+FDP  S+++  +PC+S  C +++   C     C YS +YGD +++ G+L  E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSY 245
           T+GS++ ++V       GCG +  G      +G++GLGGG +SL+SQM  T  I+ +FSY
Sbjct: 188 TIGSSSVKSV------IGCG-HESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 246 C---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP- 299
           C   L+  ++ KINFG N +VSGPGVVSTPL      T+Y +T++AIS+GN+R   S   
Sbjct: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300

Query: 300 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY----SFNSLSQVPE 354
            +++IDSGTTL+FLP+     ++S +  +++A+ V DP    +LC+    +  + S +P 
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           +T  F  GA+V L   N F KV+ ++ C        T+   I GN+   NFL+GYD+E +
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420

Query: 412 TVSFKPTDCT 421
            +SFKPT CT
Sbjct: 421 RLSFKPTVCT 430


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  285 bits (729), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 170/440 (38%), Positives = 245/440 (55%), Gaps = 38/440 (8%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M  + +   +   +C  ++    + T GFSV LI ++S      ++   P +RL +    
Sbjct: 1   MVVYPTSFHLATIICLMLLPLHISATEGFSVNLIRKNSS-----HAHVLPLRRLMEL--- 52

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
                      S++  +   Q+ I     +YL+ +SIGTPP +   +ADTGSDL WT C 
Sbjct: 53  -----------SAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSCV 101

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSN 179
           PC  + CY Q +P+FDP+ S+TY+++ C S  C  L+   CS    C Y+ +Y   + + 
Sbjct: 102 PC--NNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR 159

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G LA ET+TL ST G++V L GI FGCG NN G FN    GI+GLGGG +SLISQM ++ 
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219

Query: 240 AGK-FSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGN 291
            GK FS CLVP       S+K++FG    VSG GVVSTPL   + KT Y +T+  ISV N
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN 279

Query: 292 QRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELC 343
             L        V   ++ +DSGT  T LP      +++ + S +  +PV  DP    +LC
Sbjct: 280 TYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLC 339

Query: 344 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 403
           Y   +  + P +T HF GADVKLS +  F+   + + C  F   ++   +YGN  Q+N+L
Sbjct: 340 YRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNFAQSNYL 399

Query: 404 VGYDIEQQTVSFKPTDCTKQ 423
           +G+D+++Q VSFKP DCTK 
Sbjct: 400 IGFDLDRQVVSFKPKDCTKH 419


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  281 bits (718), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 177/440 (40%), Positives = 253/440 (57%), Gaps = 27/440 (6%)

Query: 3   TFLSCVFILFFLCFYV--VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           T LS    + FL   +   S ++A+   F+ ELIHRDSP SP +N+SET   RL +A+ R
Sbjct: 9   TLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVER 68

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC- 119
           S +R+N FN   S S + A    I+ +N ++L++ISIG PPTE L    TGSDL+W  C 
Sbjct: 69  SADRVNRFNDLISNSITAAEFPSIL-DNGDFLMKISIGIPPTELLVNVATGSDLVWIPCL 127

Query: 120 --EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS-YGDGS 176
             +PC    C   D   FDP  SSTYK++PC S +C   N  +C   +C YS       S
Sbjct: 128 SFKPC-THNC---DLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDS 183

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
             +G+LA +T+TL STTG++  LP   F CG   GG  +    GI+GLG G +SL++++ 
Sbjct: 184 CPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGG--DYPGVGILGLGHGSLSLLNRIS 241

Query: 237 TTIAGKFSYCLVPVSS---TKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGN 291
             I GKFS+C+VP SS   +K++FG   +VSG  + ST L  T     Y L+   ISVGN
Sbjct: 242 HLIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGN 301

Query: 292 QRL---GVSTPDIV----IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELC 343
           + +   G+ +   +    +DSGT  T+ P+ + S L   +   I+ +P+  DPT  L LC
Sbjct: 302 KSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLC 361

Query: 344 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNF 402
           Y ++     P +T+HF G  V+LS SN F++++EDIVC  F    +    ++G   QTN 
Sbjct: 362 YRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNL 421

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
           L+GYD++   +SF  TDCTK
Sbjct: 422 LIGYDLDAGFLSFLKTDCTK 441


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  280 bits (716), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 172/435 (39%), Positives = 240/435 (55%), Gaps = 56/435 (12%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            +   L  ++   IEA  G F+V+LI R        NSS+  + R+              
Sbjct: 9   LLAILLLVFIFPSIEAHNGRFTVKLIPR--------NSSQVLFNRI-------------- 46

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
                      +Q  +  ++ +YL+ +SIGTPP +  A  DTGSDLIW QC PC  + CY
Sbjct: 47  ----------TAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPC--TNCY 94

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATET 186
            Q +P+FDP+ SSTY ++   S  C+ L   SCS    NC Y+ SY D S + G LA ET
Sbjct: 95  KQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQET 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSY 245
           +TL STTG+ VAL G+ FGCG NN G+FN K  GI+GLG G +SL+SQ+ ++  GK FS 
Sbjct: 155 LTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQ 214

Query: 246 CLVPVS-----STKINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAISVGNQRL--- 294
           CLVP       ++ ++FG    V G GVVSTPL    T   FY +T+  ISV +  L   
Sbjct: 215 CLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN 274

Query: 295 ------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFN 347
                  ++  ++VIDSGT  T LP+ +   L+  + + +   P+  DPT   +LCY   
Sbjct: 275 DGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP 334

Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG-ITNSVPIYGNIMQTNFLVGY 406
           +  +   +T HF GADV L+ +  F+ V + I C  F    +N   IYGN  Q+N+L+G+
Sbjct: 335 TNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGF 394

Query: 407 DIEQQTVSFKPTDCT 421
           D+E+Q VSFK TDCT
Sbjct: 395 DLEKQLVSFKATDCT 409


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  278 bits (710), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 178/436 (40%), Positives = 262/436 (60%), Gaps = 58/436 (13%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ++LIHRDSP SP +  + T   RL+ +  R+++R     Q+  +      Q D++P+   
Sbjct: 29  LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISR-----QSRHVDF----QTDLLPSGGE 79

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++ +SIGTPP   LA+ADTGSDL W Q +PC   QCY Q  P+FDP  S+T+  LPC++
Sbjct: 80  YMMNLSIGTPPFPILAIADTGSDLTWLQSKPC--DQCYPQKGPIFDPSNSTTFHKLPCTT 137

Query: 151 SQCASLNQ--KSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           + C +L++  +SC+    C Y+ SYGD S++ G LA++TVT+G+    +V +  + FGCG
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNA---SVQIRNVAFGCG 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------------SSTKI 255
           T NGG F+ + +GIVGLGGG++S +SQ+  TI  KFSYCL+P+            ++++I
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIGKKFSYCLLPLENEISSQPSDSPATSRI 254

Query: 256 NFGTNGIVSGP---GVV--STPLTKAK--TFYVLTIDAISVGNQRL-------------- 294
            FG N + S     GVV  +TPL   +  T+Y LTI+AI+VG ++L              
Sbjct: 255 VFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTASYDS 314

Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNS 348
                V   +I+IDSGTTLTFL + +   L + +   I+ + V D   S+  LC+ S   
Sbjct: 315 GSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGKE 374

Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
             ++P + +HFR GADV+L   N FV+  E +VC      TN V IYGN+ Q NF+VGYD
Sbjct: 375 EVELPLMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP-TNDVGIYGNLAQMNFVVGYD 433

Query: 408 IEQQTVSFKPTDCTKQ 423
           + ++TVSF P DC+KQ
Sbjct: 434 LGKRTVSFLPADCSKQ 449


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 171/422 (40%), Positives = 239/422 (56%), Gaps = 33/422 (7%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSK 78
           +P EA   GFS +LIH++SP SPFY S+   + +         N+L  F Q    S   K
Sbjct: 21  TPTEAYNKGFSFKLIHKNSPNSPFYKSNN--FHK---------NKLRSFYQVPKKSFVQK 69

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           +    +  NN +YL+++++G+PP +   + DTGSDL+W QC PC    CY Q SP+F+P 
Sbjct: 70  SPYTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC--GGCYRQKSPMFEPL 127

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S TY  +PC S QC+           C YS SY D S + G LA E +T  ST G  V 
Sbjct: 128 RSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVV 187

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPV-----SS 252
           +  I FGCG +N G FN    GI+G+GGG +SL+SQ+ T    K FS CLVP      +S
Sbjct: 188 VGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTS 247

Query: 253 TKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVID 304
             INFG    VSG GVV+TPL   + +T Y++T++ ISVG      N    +S  +I+ID
Sbjct: 248 GTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMID 307

Query: 305 SGTTLTFLPQGYNSNL---LSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
           SGT  T++PQ +   L   L V SS++  +   DP    +LCY   +  + P +T HF G
Sbjct: 308 SGTPATYIPQEFYERLVEELKVQSSLLPIE--DDPDLGTQLCYRSETNLEGPILTAHFEG 365

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           ADV+L     F+   + + C    G T+   I+GN  Q+N L+G+D++++T+SFKPTDCT
Sbjct: 366 ADVQLLPIQTFIPPKDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCT 425

Query: 422 KQ 423
            Q
Sbjct: 426 NQ 427


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  276 bits (705), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 173/441 (39%), Positives = 254/441 (57%), Gaps = 43/441 (9%)

Query: 1   MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           MA  +S  F  ILF + F   + I    G F+  L HRDS  SP   SS + Y RL +A 
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
            RSL+R       ++ S +   Q+ II            GTPP + L +ADTGSDL W Q
Sbjct: 60  RRSLSRSAALLNRAATSGAVGLQSSII------------GTPPVDYLGIADTGSDLTWAQ 107

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
           C PC   +CY Q  P+F+P  S+++  +PC++  C +++   C GV   C YS +YGD +
Sbjct: 108 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 164

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +S G+L  E +T+GS++ ++V       GCG  + G F    +G++GLGGG +SL+SQM 
Sbjct: 165 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 217

Query: 237 TT--IAGKFSYC---LVPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISV 289
            T  I+ +FSYC   L+  ++ KINFG N +VSGPGVVSTPL      T+Y +T++AIS+
Sbjct: 218 QTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISI 277

Query: 290 GNQR--LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--- 344
           GN+R        +++IDSGTTL+FLP+     ++S +  +++A+ V DP    +LC+   
Sbjct: 278 GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337

Query: 345 -SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQT 400
            +  + S +P +T  F  GA+V L   N F KV+ ++ C        T+   I GN+   
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALA 397

Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
           NFL+GYD+E + +SFKPT CT
Sbjct: 398 NFLIGYDLEAKRLSFKPTVCT 418


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  271 bits (694), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 169/439 (38%), Positives = 245/439 (55%), Gaps = 28/439 (6%)

Query: 8   VFILFFLCFY-VVSPIEAQT--GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
           VF    LC + + S  EA     GFS+ LIHR+SP SPFYN S TP +R+++ + RS  R
Sbjct: 5   VFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFAR 64

Query: 65  LNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
                   S +  ++     IP+     YL+R  IGTPP ER A+ADTGSDLIW QC PC
Sbjct: 65  SKR-RLRLSQNDDRSPGTITIPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQCAPC 123

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
              +C  Q++PLFDP+ SST+K++PC S  C  L  +Q++C G +  C Y   YGD +  
Sbjct: 124 --EKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDHTLV 181

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGLFNSK-TTGIVGLGGGDISLISQMR 236
           +G L  E++  GS    A+  P +TFGC  +NN  +  SK   G+VGLG G +SLISQ+ 
Sbjct: 182 SGILGFESINFGSKN-NAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLISQLG 240

Query: 237 TTIAGKFSYCLVPVSS---TKINFGTNGIVSG-PGVVSTPL---TKAKTFYVLTIDAISV 289
             I  KFSYC  P+SS   +K+ FG + IV    GVVSTPL   +   ++Y L ++ +S+
Sbjct: 241 YQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEGVSI 300

Query: 290 GNQRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
           GN+++  S      +I+IDSGT+ T L Q + +  ++++  +   + V  P      C+ 
Sbjct: 301 GNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNFCFE 360

Query: 346 FN-SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-IYGNIMQTNFL 403
                 + P+V   F GA V++  SN F     +++C V    ++    I+GN  Q  + 
Sbjct: 361 NKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQ 420

Query: 404 VGYDIEQQTVSFKPTDCTK 422
           V YD++   VSF P DC K
Sbjct: 421 VEYDLQGGMVSFAPADCAK 439


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  270 bits (690), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 153/370 (41%), Positives = 219/370 (59%), Gaps = 20/370 (5%)

Query: 72  SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           S++  + + Q+ I     +YL+ +SIGTPP +   +ADTGSDL WT C PC  ++CY Q 
Sbjct: 6   SAMEKTVSPQSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPC--NKCYKQR 63

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLG 190
           +P+FDP+ S++Y+++ C S  C  L+   CS   +C Y+ +Y   + + G LA ET+TL 
Sbjct: 64  NPIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLS 123

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP 249
           ST G++V L GI FGCG NN G FN +  GI+GLGGG +S ISQ+ ++  GK FS CLVP
Sbjct: 124 STKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKRFSQCLVP 183

Query: 250 VS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-------- 294
                  S+K++ G    VSG GVVSTPL   + KT Y +T+  ISVGN  L        
Sbjct: 184 FHTDVSVSSKMSLGKGSEVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQ 243

Query: 295 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVP 353
            V   ++ +DSGT  T LP      L++ + S +  +PV  D     +LCY   +  + P
Sbjct: 244 SVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKNNLRGP 303

Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            +T HF G DVKL  +  FV   + + C  F   ++   +YGN  Q+N+L+G+D+++Q V
Sbjct: 304 VLTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVV 363

Query: 414 SFKPTDCTKQ 423
           SFKP DCTK 
Sbjct: 364 SFKPMDCTKH 373


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  266 bits (679), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 171/433 (39%), Positives = 230/433 (53%), Gaps = 28/433 (6%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           V  LFFL   ++        GFS++LI R SP SP YNS  T  + ++ A  RS+ R   
Sbjct: 5   VLTLFFLVSTMLVDASKSLMGFSIDLIPRHSPISPLYNSQMTQTELVKSAALRSITRSKR 64

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
            N    IS   +     IP++  YL+R S+GTP  ERLA+ DTGSDL W QC PC    C
Sbjct: 65  VNFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPC--KTC 122

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC-SGVNCQYSVSYGDGSFSNGNLAT 184
           Y Q++PLFDP  SSTY  +PC S  C     NQ+ C S   C Y   YG  SF+ G L  
Sbjct: 123 YPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGY 182

Query: 185 ETVTLGST-TGQAVA-LPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTTIA 240
           +T++  ST  GQ  A  P   FGC   +   F   +K  G VGLG G +SL SQ+   I 
Sbjct: 183 DTISFSSTGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG 242

Query: 241 GKFSYCLVPVSST---KINFG----TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQR 293
            KFSYC+VP SST   K+ FG    TN +VS P +++       ++YVL ++ I+VG ++
Sbjct: 243 HKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMIN---PSYPSYYVLNLEGITVGQKK 299

Query: 294 L--GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
           +  G    +I+IDS   LT L QG  ++ +S +   I  +   D     E C    +   
Sbjct: 300 VLTGQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN 359

Query: 352 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDI 408
            PE   HF GADV L   N F+ +  ++VC      KGI+    I+GN  Q NF V YD+
Sbjct: 360 FPEFVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGIS----IFGNWAQVNFQVEYDL 415

Query: 409 EQQTVSFKPTDCT 421
            ++ VSF PT+C+
Sbjct: 416 GEKKVSFAPTNCS 428


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  264 bits (674), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 172/449 (38%), Positives = 238/449 (53%), Gaps = 35/449 (7%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +F S + IL  +     + I+A    F+ ELIH DSP SPF+N+SET   RL  AL RS 
Sbjct: 12  SFTSLIIILSTVFLSSFAIIQADKFSFTAELIHIDSPNSPFFNASETTTHRLAKALQRSA 71

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           NR+   N  S  +S +   A I   + NYL+++ IGTPPTE  A  DTGS++IW  C  C
Sbjct: 72  NRVARLNPLS--NSDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINC 129

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG-SFSNGN 181
               C+ Q S +F+P  SSTY+  PC S QC + +    S   C YS       +  NG 
Sbjct: 130 --KDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGR 187

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           +A +T+TL S+ G+   LP   F CG +    F     G++GLG G +SL S++     G
Sbjct: 188 IAVDTMTLTSSDGRPFPLPYSDFVCGNSIYKTF--AGVGVIGLGRGALSLTSKLYHLSDG 245

Query: 242 KFSYCLVPVSS---TKINFGTNGIVSGPG--VVSTPLTKAKTF--YVLTIDAISVGNQRL 294
           KFSYCL    S   +KINFG    +S     VVST L   +    Y +T++ ISVG +R 
Sbjct: 246 KFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQ 305

Query: 295 GVSTPD---------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS------ 339
            +   D         ++IDSGT  T LP+ +   L S +S  I   P   P  S      
Sbjct: 306 DLYYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSM 365

Query: 340 -----LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPI 393
                L  C+ +    + P++TIHF  ADV+LS  N F++V+ED+VC  F         +
Sbjct: 366 DNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV 425

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           YG+  Q NF++GYD+++ TVSFK TDC+K
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDCSK 454


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  263 bits (673), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 156/375 (41%), Positives = 219/375 (58%), Gaps = 27/375 (7%)

Query: 70  QNSSISSSKAS--QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           +NSS  S K S  Q+ +   +  YL+ +SIGTPP +  A ADTGSDL+W QC PC  ++C
Sbjct: 37  RNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPC--TKC 94

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATE 185
           Y Q +P+FDP+ SS+Y ++ C +  C  L+   CS     C Y+ SY D S + G LA E
Sbjct: 95  YKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQE 154

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---K 242
           T+TL STTG+ VA  GI FGCG NN G FN +  G++GLG G +SLISQ+ +++      
Sbjct: 155 TLTLTSTTGEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213

Query: 243 FSYCLVPVSS-----TKINFGTNGIVSGPGVVSTPL-TKAKTFYVLTIDAISVGNQRL-- 294
           FS CLVP ++     +++NFG    V G G VSTPL +K  T Y  T+  ISV +  L  
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273

Query: 295 -------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
                   ++  +I+IDSGTT+T+LP+ +   L+  + + +  +P        ELCY   
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR--IDGYELCYQTP 331

Query: 348 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
           +    P +TIHF G DV L+ +  F+ V +D  C            YGN  Q+N+L+G+D
Sbjct: 332 TNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTYGNYAQSNYLIGFD 391

Query: 408 IEQQTVSFKPTDCTK 422
           +E+Q VSFK TDCTK
Sbjct: 392 LERQVVSFKATDCTK 406


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  259 bits (661), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 167/432 (38%), Positives = 238/432 (55%), Gaps = 40/432 (9%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLR---------DALTRSLNRLNHFNQNSSI 74
           A  GGFSV+ IHRDS +SP+ + + +P+ R           + L RS +  +      S 
Sbjct: 28  AGEGGFSVDFIHRDSARSPYRHPALSPHARALAAARRSLRGEVLGRSYSGASPAAAPVSA 87

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP- 133
           +     ++ II  +  YL+ +++GTPPT+ LA+ADTGSDL+W  C     S   + D+  
Sbjct: 88  ADGGV-ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCS---SSGGGLADADA 143

Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVT 188
               +F P  SSTY  L C S+ C +L+Q SC     CQY  SYGDGS + G L+TET +
Sbjct: 144 GGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFS 203

Query: 189 L--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFS 244
              G   GQ V +P + FGC T + G F S   G+VGLG G  SL+SQ+  T  I  K S
Sbjct: 204 FVDGGGKGQ-VRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSLVSQLGATTHIDRKLS 260

Query: 245 YCLVPV----SSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVST 298
           YCL+P     SS+ +NFG+  +VS PG  STPL  +   ++Y + +++++VG Q +    
Sbjct: 261 YCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHD 320

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VP 353
             I++DSGTTLTFL       L++ +   I+ Q V  P   L+LCY     S+     +P
Sbjct: 321 SRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIP 380

Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQ 410
           +VT+ F  GA V L   N F  + E  +C V   ++ S P  I GNI Q NF VGYD++ 
Sbjct: 381 DVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 440

Query: 411 QTVSFKPTDCTK 422
           +TV+F   DC +
Sbjct: 441 RTVTFAAADCAR 452


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  258 bits (660), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 164/417 (39%), Positives = 226/417 (54%), Gaps = 40/417 (9%)

Query: 24  AQTGGFSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           A   GF+++LI  +SP  SPFY S E    RL                      S     
Sbjct: 3   ADNSGFTIQLIRHNSPNYSPFYKSDELHMHRL---------------------GSNGVFT 41

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +  NN +YL+++++GTPP +   + DTGSDL+W QC PC    CY Q SP+F+P  S+T
Sbjct: 42  RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC--QGCYRQKSPMFEPLRSNT 99

Query: 143 YKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y  +PC S +C SL   SCS    C YS +Y D S + G LA ETVT  ST G+ V +  
Sbjct: 100 YTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGD 159

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSST-----KI 255
           I FGCG +N G FN    GI+GLGGG +SL+SQ       K FS CLVP  +       I
Sbjct: 160 IVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTI 219

Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVIDSGT 307
           +FG    VSG GV +TPL   + +T Y++T++ ISVG      N    +S  +I+IDSGT
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGT 279

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 366
             T+LPQ +   L+  +       P+  DP    +LCY   +  + P +  HF GADV+L
Sbjct: 280 PATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIAHFEGADVQL 339

Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
                F+   + + C    G T+   I+GN  Q+N L+G+D++++TVSFK TDC+ Q
Sbjct: 340 MPIQTFIPPKDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSNQ 396


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  258 bits (659), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 153/430 (35%), Positives = 245/430 (56%), Gaps = 44/430 (10%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISS 76
           P +  + GF V L H D  K+       T ++RLR  + R  NRL+  N      ++ + 
Sbjct: 43  PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNAMVLAAANATV 96

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
               +A ++  N  +L++++IG+PP    A+ DTGSDLIWTQC+PC   QC+ Q +P+FD
Sbjct: 97  GDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIFD 154

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           PK SS++  + CSS  C +L   +CS   C+Y  +YGD S + G LA ET T G +T   
Sbjct: 155 PKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQ 214

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-- 254
           +++PG+ FGCG +N G   S+  G+VGLG G +SL+SQ++     KF+YCL  +  +K  
Sbjct: 215 ISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKPS 271

Query: 255 -INFGTNGIV----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV---------- 296
            +  G+   +    S   + +TPL K     +FY L++  ISVG  +L +          
Sbjct: 272 SLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDD 331

Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSFNSLS--- 350
            +  ++IDSGTT+T++    NS   S+ +  I     PV D  TG L+LC++  + +   
Sbjct: 332 GSGGVIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQV 388

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           +VP++T HF+GAD++L   N+ +  S+  +  +  G +  + I+GN+ Q NF+V +D+++
Sbjct: 389 EVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQE 448

Query: 411 QTVSFKPTDC 420
           +T+SF PT C
Sbjct: 449 ETLSFLPTQC 458


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  257 bits (657), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 154/431 (35%), Positives = 246/431 (57%), Gaps = 46/431 (10%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           P +  + GF V L H D  K+       T ++RLR  + R  NRL+  N    ++++ A+
Sbjct: 298 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNA-MVLAAANAT 350

Query: 81  QAD-----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             D     ++  N  +L++++IG+PP    A+ DTGSDLIWTQC+PC   QC+ Q +P+F
Sbjct: 351 VGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIF 408

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           DPK SS++  + CSS  C +L   +CS   C+Y  +YGD S + G LA ET T G +T  
Sbjct: 409 DPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 468

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
            +++PG+ FGCG +N G   S+  G+VGLG G +SL+SQ++     KF+YCL  +  +K 
Sbjct: 469 QISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLKEQ---KFAYCLTAIDDSKP 525

Query: 255 --INFGTNGIV----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV--------- 296
             +  G+   +    S   + +TPL K     +FY L++  ISVG  +L +         
Sbjct: 526 SSLLLGSLANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHD 585

Query: 297 -STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSF---NSL 349
             +  ++IDSGTT+T++    NS   S+ +  I     PV D  TG L+LC++     + 
Sbjct: 586 DGSGGVIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQ 642

Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            +VP++T HF+GAD++L   N+ +  S+  +  +  G +  + I+GN+ Q NF+V +D++
Sbjct: 643 VEVPKLTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQ 702

Query: 410 QQTVSFKPTDC 420
           ++T+SF PT C
Sbjct: 703 EETLSFLPTQC 713


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  257 bits (656), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 166/415 (40%), Positives = 236/415 (56%), Gaps = 33/415 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  ++L+  DSP SPF   + +  +R + A+ RS +RL       S+   KA +A +   
Sbjct: 54  GLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQM--SVDEVKAVEAPVYAG 111

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  +L++++IGTP     A+ DTGSDL WTQC+PC  + CY Q +P++DP  SSTY  +P
Sbjct: 112 NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQSSTYSKVP 169

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CSSS C +L   SCSG NC+Y  SYGD S + G L+ E+ TL S      +LP I FGCG
Sbjct: 170 CSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ-----SLPHIAFGCG 224

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGI 262
             N G   S+  G+VG G G +SLISQ+  ++  KFSYCLV     P  ++ +  G    
Sbjct: 225 QENEGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLVSITDSPSKTSPLFIGKTAS 284

Query: 263 VSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTL 309
           ++   V STPL +++   TFY L+++ ISVG Q L ++          T  ++IDSGTT+
Sbjct: 285 LNAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTV 344

Query: 310 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVK 365
           T+L Q GY+    +V+SS+   Q      G L+LC+   S +S S  P +T HF GAD  
Sbjct: 345 TYLEQSGYDVVKKAVISSINLPQVDGSNIG-LDLCFEPQSGSSTSHFPTITFHFEGADFN 403

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L + N+    S  I C      +N + I+GNI Q N+ + YD E+  +SF PT C
Sbjct: 404 LPKENYIYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 150/353 (42%), Positives = 205/353 (58%), Gaps = 29/353 (8%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +N+ YL+++ +GTPP E  AV DTGS++ WTQC PC    CY Q++P+FDP  SS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C   +C Y V Y D +++ G LAT+TVT+ ST+G+   +  
Sbjct: 429 TFK-------------EKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAE 475

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
              GCG NN   F     G VGL  G +SLI+QM     G  SYC     ++KINFGTN 
Sbjct: 476 TIIGCGRNNS-WFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNA 534

Query: 262 IVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-VSTP------DIVIDSGTTLTF 311
           IV G GVVST +   T    FY L +DA+SVG+ R+  + TP      +IVIDSGTTLT+
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTY 594

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
            P+ Y + +   +  ++ A P ADPTG+  LCY  N+    P +T+HF  GAD+ L + N
Sbjct: 595 FPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVITMHFSGGADLVLDKYN 654

Query: 371 FFVK-VSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            F++  S  + C ++         I+GN  Q NFLVGYD     VSFKPT+C+
Sbjct: 655 MFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707



 Score =  207 bits (528), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 146/423 (34%), Positives = 214/423 (50%), Gaps = 83/423 (19%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F L  + +++ +   +   GF+++LIHR S  S                   
Sbjct: 3   LATTMIAIF-LQIITYFLFTTTASSPHGFTIDLIHRRSNAS------------------- 42

Query: 61  SLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
                     +S +S+++A    AD + +   YL+++ IGTPP E  AV DTGS+LIWTQ
Sbjct: 43  ----------SSRVSNTQAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQ 92

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           C PC    CY Q +P+FDP  SST+K   C++   +           C Y + Y D S++
Sbjct: 93  CLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPDHS-----------CPYKLVYDDKSYT 139

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRT 237
            G LATETVT+ ST+G    +P    GC  NN G  F   ++GIVGL  G +SLISQM  
Sbjct: 140 QGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQM-- 197

Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
                                  G   G GVVST +   T  +  Y L +DA+SVG+ R+
Sbjct: 198 ----------------------GGAYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRI 235

Query: 295 G-VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
             V TP      +IVIDSGT LT+ P  Y + +   +  ++ A  V DP+ +  LCY  N
Sbjct: 236 ETVGTPFHALNGNIVIDSGTPLTYFPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSN 295

Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLV 404
           ++   P +T+HF  GAD+ L + N +++++   + C ++       V I+GN  Q NFLV
Sbjct: 296 TIEIFPVITVHFSGGADLVLDKYNMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLV 355

Query: 405 GYD 407
           GYD
Sbjct: 356 GYD 358


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  254 bits (649), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 164/428 (38%), Positives = 235/428 (54%), Gaps = 32/428 (7%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN--RLNHFNQNSSISSSKASQA 82
           + GGFSV+ IHRDS +SPF   S  P+ R   A  RSL    L  +   +S +     +A
Sbjct: 26  EAGGFSVDFIHRDSARSPFAQPSLPPHARALAAARRSLRGAALGRYVGGASPAPGPVPEA 85

Query: 83  D------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
           D      II  +  YL+ +++GTPP + LA+ADTGSDL+W  C            + +F 
Sbjct: 86  DGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFH 145

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  S+TY  L C S+ C +L+Q SC     CQY  +YGDGS + G L+TET +  +  G 
Sbjct: 146 PSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGG 205

Query: 196 A---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPV 250
               V +P ++FGC T + G F S   G+VGLG G +SL+SQ+     IA +FSYCLVP 
Sbjct: 206 GEGQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPP 263

Query: 251 -----SSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG-VSTPDIV 302
                SS+ ++FG   +VS PG  STPL  ++  ++Y + +++++V  Q +   ++  I+
Sbjct: 264 YAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRII 323

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VPEVTI 357
           +DSGTTLTFL       L++ +   I       P   L+LCY     SQ     +P+VT+
Sbjct: 324 VDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTL 383

Query: 358 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 414
            F  GA V L   N F  + E  +C V   ++ S P  I GNI Q NF VGYD++ +TV+
Sbjct: 384 RFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVT 443

Query: 415 FKPTDCTK 422
           F   DCT+
Sbjct: 444 FAAVDCTR 451


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  254 bits (648), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 161/434 (37%), Positives = 235/434 (54%), Gaps = 57/434 (13%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F+   LCF + +   +   GF+++LIHR                        
Sbjct: 3   LATTIIVLFLQISLCF-LFTTTASPPHGFTMDLIHR------------------------ 37

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R N  ++ S+  S  +  A+ + +N+ YL+++ +GTPP E  A+ DTGS++ WTQC 
Sbjct: 38  ---RSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCL 94

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC    CY Q++P+FDP  SST+K             +K C G +C Y V Y D +++ G
Sbjct: 95  PC--VHCYEQNAPIFDPSKSSTFK-------------EKRCDGHSCPYEVDYFDHTYTMG 139

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            LATET+TL ST+G+   +P    GCG NN   F    +G+VGL  G  SLI+QM     
Sbjct: 140 TLATETITLHSTSGEPFVMPETIIGCGHNN-SWFKPSFSGMVGLNWGPSSLITQMGGEYP 198

Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTP--LTKAKT-FYVLTIDAISVGNQRLGVS 297
           G  SYC     ++KINFG N IV+G GVVST   +T AK  FY L +DA+SVGN R+   
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM 258

Query: 298 -------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
                    +IVIDSGTTLT+ P  Y + +   +  ++ A   ADPTG+  LCY+ +++ 
Sbjct: 259 GTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID 318

Query: 351 QVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
             P +T+HF G  D+ L + N +++ +   + C ++         I+GN  Q NFLVGYD
Sbjct: 319 IFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYD 378

Query: 408 IEQQTVSFKPTDCT 421
                VSF PT+C+
Sbjct: 379 SSSLLVSFSPTNCS 392


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  254 bits (648), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 161/422 (38%), Positives = 238/422 (56%), Gaps = 38/422 (9%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISSSKAS 80
           T GF V L H DS K+       T  +R++  + R  +RL   N      +S+  S    
Sbjct: 44  TNGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQL 97

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +A I   N  YLI ++IGTPP    AV DTGSDLIWTQC+PC  ++CY Q +P+FDPK S
Sbjct: 98  EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TRCYKQPTPIFDPKKS 155

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S++  + C SS C++L   +CS   C+Y  SYGD S + G LATET T G +  + V++ 
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVH 213

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INF 257
            I FGCG +N G    + +G+VGLG G +SL+SQ++     +FSYCL P+  TK   +  
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEQ---RFSYCLTPIDDTKESVLLL 270

Query: 258 GTNGIVS-GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVI 303
           G+ G V     VV+TPL K     +FY L+++AISVG+ RL +              ++I
Sbjct: 271 GSLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVII 330

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFR 360
           DSGTT+T++ Q     L     S  +       +  L+LC+S  S S   ++P++  HF+
Sbjct: 331 DSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFK 390

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           G D++L   N+ +  S   V  +  G ++ + I+GN+ Q N LV +D+E++T+SF PT C
Sbjct: 391 GGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSC 450

Query: 421 TK 422
            +
Sbjct: 451 DQ 452


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  253 bits (646), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 176/432 (40%), Positives = 232/432 (53%), Gaps = 47/432 (10%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISS 76
           ++A   GF+ ELI RDSP SPFYN+       L  A TRS N   H++      N    S
Sbjct: 30  VKADNFGFTAELIRRDSPNSPFYNA-------LEAAATRSTNASQHYDAQIGRFNLMSDS 82

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             ASQ+++  +  NYLI+IS+GTPP E LA+AD   DL W  C+ C   Q   +D   F 
Sbjct: 83  YYASQSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTC---QDCTKDGFTFF 139

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY---SVSYGDGSFSN-GNLATETVTLGST 192
           P  SSTY S  C S QC   N   C    C Y    +     S +N G +A +T++  S+
Sbjct: 140 PSESSTYTSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSS 199

Query: 193 TGQAVALPGITFGCGT--NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           +GQA++ P   F CGT  +N     +   GIVGLG G  S+ SQM+  I G FS CLVP 
Sbjct: 200 SGQALSYPNTNFICGTFIDNWHYIGA---GIVGLGRGLFSMTSQMKHLINGTFSQCLVPY 256

Query: 251 S---STKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLG---VSTP--D 300
           S   S+KINFG  G+VSG GVVSTP+        Y L ++A+SVG  R+     S P  +
Sbjct: 257 SSKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSN 316

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLS--QVPEVTI 357
           I ID  TT T LP  +  N+ + +   I   P+  +    L LCY   S      P +T+
Sbjct: 317 IYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITM 376

Query: 358 HFRGADVKLSRSNFFVKVSEDIVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIE 409
           HF  ADV+LS  N FV++  ++VC  F        K IT++V  YG+  Q NF+VGYD++
Sbjct: 377 HFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAV--YGSWQQMNFIVGYDLK 434

Query: 410 QQTVSFKPTDCT 421
             TVSFK  DCT
Sbjct: 435 SSTVSFKQADCT 446


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 159/421 (37%), Positives = 237/421 (56%), Gaps = 37/421 (8%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISSSKASQ 81
           T GF V L H DS K+       T  +R++  + R  +RL   N      S++ S    +
Sbjct: 45  TKGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLE 98

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I   N  YL+ ++IGTPP    AV DTGSDLIWTQC+PC  +QCY Q +P+FDPK SS
Sbjct: 99  APIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TQCYKQPTPIFDPKKSS 156

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           ++  + C SS C+++   +CS   C+Y  SYGD S + G LATET T G +  + V++  
Sbjct: 157 SFSKVSCGSSLCSAVPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHN 214

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFG 258
           I FGCG +N G    + +G+VGLG G +SL+SQ++     +FSYCL P+  TK   +  G
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLKEP---RFSYCLTPMDDTKESILLLG 271

Query: 259 TNGIVS-GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
           + G V     VV+TPL K     +FY L+++ ISVG+ RL +              ++ID
Sbjct: 272 SLGKVKDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIID 331

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRG 361
           SGTT+T++ Q     L     S  +       +  L+LC+S  S S   ++P++  HF+G
Sbjct: 332 SGTTITYIEQKAFEALKKEFISQTKLPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKG 391

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            D++L   N+ +  S   V  +  G ++ + I+GN+ Q N LV +D+E++T+SF PT C 
Sbjct: 392 GDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCD 451

Query: 422 K 422
           +
Sbjct: 452 Q 452


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  251 bits (641), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 153/361 (42%), Positives = 209/361 (57%), Gaps = 65/361 (18%)

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           N  + ++S    Q+++I    +YL+ IS+GTPP   L +ADTGSDLIW QC PC    CY
Sbjct: 7   NTGNQLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC--DDCY 64

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  PLFDPK S TYK+L                                 G L++ET T
Sbjct: 65  KQVEPLFDPKKSKTYKTL---------------------------------GYLSSETFT 91

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           +GST G   + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + G+FSYCLV
Sbjct: 92  IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 151

Query: 249 PVS-----STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI 303
           P+S     S+KINFG + +VSG G  S+P    ++                     +I+I
Sbjct: 152 PLSSDSTASSKINFGKSAVVSGSG-TSSPAAAEES---------------------NIII 189

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
           DSGTTLT LP+ + +++ S ++ +I  Q   DP G+  LCYS     ++P +T HF GAD
Sbjct: 190 DSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHFIGAD 249

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           V+L   N FV+  ED+VC  F  I +S + I+GN+ Q NFLVGYD++   VSFKPTDCTK
Sbjct: 250 VQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTK 307

Query: 423 Q 423
           Q
Sbjct: 308 Q 308


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 152/418 (36%), Positives = 223/418 (53%), Gaps = 38/418 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           EA+  GF + L H DS K+       T +Q L  A+ R   RL      + ++     + 
Sbjct: 35  EAKVTGFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L+  +CS   CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGT 259
           TFGCG NN G       G+VG+G G +SL SQ+  T   KFSYC+ P+ S   + +  G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTPSNLLLGS 256

Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
             N + +G P       ++  TFY +T++ +SVG+ RL +            T  I+IDS
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 316

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
           GTTLT+       ++     S I    V   +   +LC+   S  S  Q+P   +HF G 
Sbjct: 317 GTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGG 376

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           D++L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VSF    C
Sbjct: 377 DLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  249 bits (635), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 162/438 (36%), Positives = 229/438 (52%), Gaps = 57/438 (13%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           +L+ +F+LF +    +S IEAQ  GF+++L  + S                        N
Sbjct: 18  YLAIIFLLFHVLH--LSSIEAQNDGFTIKLFRKTS------------------------N 51

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
            + +             QA I      +L+ I IGTPP +   + DTGSDLIW QC PC 
Sbjct: 52  NIQNI-----------VQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC- 99

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNL 182
              CY Q  P+FDP  SSTY ++ C S  C  L+   CS    C Y+  YGD S + G L
Sbjct: 100 -LGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVL 158

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG- 241
           A +T T  S TG+ V+L    FGCG NN G FN    G++GLGGG  SLISQ+     G 
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218

Query: 242 KFSYCLVPVS-----STKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL 294
           KFS CLVP       S++++FG    V G GVV+TPL   +  T Y +T+  ISV +   
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYF 278

Query: 295 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSL 349
                +   ++++DSGT    LPQ     + + + + +  +P+  DP+   +LCY   + 
Sbjct: 279 PMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338

Query: 350 SQVPEVTIHFRGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVP-IYGNIMQTNFLVG 405
            + P +T HF GA+V L+    F+     ++ I C      TNS P +YGN  Q+N+L+G
Sbjct: 339 LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIG 398

Query: 406 YDIEQQTVSFKPTDCTKQ 423
           +D+++Q VSFKPTDCTKQ
Sbjct: 399 FDLDRQVVSFKPTDCTKQ 416


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 151/355 (42%), Positives = 208/355 (58%), Gaps = 33/355 (9%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC PC  + CY Q +P+FDP  SS
Sbjct: 52  ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C+G +C Y + Y D ++S G LATETVT+ ST+G+   +P 
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
            T GCG +N   F    +G+VGL  G  SLI+QM     G  SYC     ++KINFGTN 
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215

Query: 262 IVSGPGVVSTP--LTKAKT-FYVLTIDAISVGN---QRLGVS----TPDIVIDSGTTLTF 311
           IV+G GVVST   LT AK   Y L +DA+SVG+   + +G +      +I+IDSGTTLT+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
            P  Y + +   +   + A   ADPTG+  LCY  +++   P +T+HF  GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 371 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +++ ++    C     I N+ P   I+GN  Q NFLVGYD     VSF PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  246 bits (629), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 161/421 (38%), Positives = 223/421 (52%), Gaps = 52/421 (12%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           IEAQ  GF+V+LI + S                            H + N+        Q
Sbjct: 26  IEAQNDGFTVKLIRKSS----------------------------HLSSNNI---QDIVQ 54

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I      YL+ + IGTPP +     DTGSDLIW QC PC    CY Q +P+FDP  SS
Sbjct: 55  APINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMFDPLKSS 112

Query: 142 TYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           TY ++ C S  C       CS    C Y+  Y D S + G LA ETVTL S TG+ ++L 
Sbjct: 113 TYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQ 172

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCLVP-----VSSTK 254
           GI FGCG NN G FN    G++GLGGG  SL+SQ+     G KFS CLVP       S++
Sbjct: 173 GILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQ 232

Query: 255 INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP----DIVIDSGT 307
           ++FG    V G GVV+TPL + +   T Y +T+  ISV +  L +++     ++++DSGT
Sbjct: 233 MSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 292

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 366
               LPQ     +   + + +  +P+  DP+   +LCY   +  + P +T HF GA++ L
Sbjct: 293 PPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLL 352

Query: 367 SRSNFFVKVSED---IVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +    F+  + +   + C       NS P IYGN  QTN+L+G+D+++Q VSFKPTDCTK
Sbjct: 353 TPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDCTK 412

Query: 423 Q 423
           Q
Sbjct: 413 Q 413


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 150/355 (42%), Positives = 207/355 (58%), Gaps = 33/355 (9%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC PC  + CY Q +P+FDP  SS
Sbjct: 52  ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C+G +C Y + Y D ++S G LATETVT+ ST+G+   +P 
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
            T GCG +N   F    +G+VGL  G  SLI+QM     G  SYC     ++KINFGTN 
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215

Query: 262 IVSGPGVVSTP--LTKAKT-FYVLTIDAISVGN---QRLGVS----TPDIVIDSGTTLTF 311
           IV+G GVVST   LT AK   Y L +DA+SVG+   + +G +      +I+IDSGTTLT+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
            P  Y + +   +   + A   ADPTG+  LCY  +++   P +T+HF  GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 371 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +++ ++    C     I N+ P   I+GN  Q NFLVGYD     V F PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 150/357 (42%), Positives = 212/357 (59%), Gaps = 32/357 (8%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + + + YL+R+ +GTPP E +A  DTGSDLIWTQC PCP   CY Q +P+FDP  SS
Sbjct: 52  ADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C G +C Y + Y D S+S G LATETVT+ ST+G+   +  
Sbjct: 110 TFK-------------EKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAE 156

Query: 202 ITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
            + GCG NN  L    + + ++GIVGL  G  SLISQM   I G  SYC     ++KINF
Sbjct: 157 TSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINF 216

Query: 258 GTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG-VSTP------DIVIDSGTT 308
           GTN +V+G G V+  +   K + FY L +DA+SVG++R+  + TP      +I IDSGTT
Sbjct: 217 GTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTT 276

Query: 309 LTFLPQGY-NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKL 366
            T+LP  Y N    +V +S++ A  V DP+    LCY+++++   P +T+HF  GAD+ L
Sbjct: 277 YTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVL 336

Query: 367 SRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + N +V+ ++    C     +  S+P I+GN    N LVGYD     +SF PT+C+
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  244 bits (622), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 159/417 (38%), Positives = 243/417 (58%), Gaps = 43/417 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF V L H DS K+       T  +R+R  + R  NRL      + ++SS +  +A ++P
Sbjct: 39  GFRVRLKHVDSGKN------LTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLP 92

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L++++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q +P+FDPK SS++  L
Sbjct: 93  GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPC--TQCFHQSTPIFDPKKSSSFSKL 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q SC+   C+Y  SYGD S + G LA+ET+T G       ++P + FGC
Sbjct: 151 SCSSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFGK-----ASVPNVAFGC 204

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV--- 263
           G +N G   S+  G+VGLG G +SL+SQ++     KFSYCL  V  TK +    G +   
Sbjct: 205 GADNEGSGFSQGAGLVGLGRGPLSLVSQLKEP---KFSYCLTTVDDTKTSTLLMGSLASV 261

Query: 264 --SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
             S   + +TPL  +    +FY L+++ ISVG+ RL +           +  ++IDSGTT
Sbjct: 262 NASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGTT 321

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
           +T+L +   + +    ++ I   PV D +GS  L++C++  S S   +VP++  HF GAD
Sbjct: 322 ITYLEESAFNLVAKEFTAKINL-PV-DSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDGAD 379

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           ++L   N+ +  S   V  +  G ++ + I+GN+ Q N LV +D+E++T+SF PT C
Sbjct: 380 LELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 148/418 (35%), Positives = 219/418 (52%), Gaps = 38/418 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           E +  GF + L H DS K+       T ++ L  A+ R   RL      + ++     + 
Sbjct: 35  EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L   +CS  +CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGT 259
           TFGCG NN G       G+VG+G G +SL SQ+  T   KFSYC+ P+ S+    +  G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSNSSTLLLGS 256

Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
             N + +G P       ++  TFY +T++ +SVG+  L +            T  I+IDS
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
           GTTLT+        +     S +    V   +   +LC+   S  S  Q+P   +HF G 
Sbjct: 317 GTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG 376

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           D+ L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VSF    C
Sbjct: 377 DLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 148/418 (35%), Positives = 220/418 (52%), Gaps = 38/418 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           E +  GF + L H DS K+       T ++ L  A+ R   RL      + ++     + 
Sbjct: 35  EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L   +CS  +CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGT 259
           TFGCG NN G       G+VG+G G +SL SQ+  T   KFSYC+ P+   +S+ +  G+
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT---KFSYCMTPIGSSTSSTLLLGS 256

Query: 260 --NGIVSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS-----------TPDIVIDS 305
             N + +G P       ++  TFY +T++ +SVG+  L +            T  I+IDS
Sbjct: 257 LANSVTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDS 316

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGA 362
           GTTLT+        +     S +    V   +   +LC+   S  S  Q+P   +HF G 
Sbjct: 317 GTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGG 376

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           D+ L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VSF    C
Sbjct: 377 DLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  241 bits (614), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 159/419 (37%), Positives = 244/419 (58%), Gaps = 43/419 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF  +L H DS K+       T ++R++  + R  +RL  F   + ++SS +   A ++P
Sbjct: 39  GFRAKLKHVDSGKNL------TKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLP 92

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L++++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q +P+FDPK SS++  L
Sbjct: 93  GNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPTPIFDPKKSSSFSKL 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q +CS   C+Y   YGD S + G LA+ET+T G      V++P + FGC
Sbjct: 151 SCSSKLCEALPQSTCSD-GCEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGC 204

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV--- 263
           G +N G   S+ +G+VGLG G +SL+SQ++     KFSYCL  V  TK +    G +   
Sbjct: 205 GEDNEGSGFSQGSGLVGLGRGPLSLVSQLKEP---KFSYCLTSVDDTKASTLLMGSLASV 261

Query: 264 --SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
             S   + +TPL +     +FY L+++ ISVG+  L +           +  ++IDSGTT
Sbjct: 262 KASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGTT 321

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
           +T+L Q     +    +S I   PV D +GS  LE+C++  S S   +VP++  HF GAD
Sbjct: 322 ITYLEQSAFDLVAKEFTSQINL-PV-DNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDGAD 379

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           ++L   N+ +  +   V  +  G ++ + I+GNI Q N LV +D+E++T+SF PT C +
Sbjct: 380 LELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSFLPTQCDE 438


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  241 bits (614), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 164/436 (37%), Positives = 229/436 (52%), Gaps = 62/436 (14%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F L  + +++++   +   GF+++LIHR S  S                 +R
Sbjct: 3   LATTMIAIF-LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SR 45

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
             N           +   +  AD + +   YL+++ IGTPP E  AV DTGS+ IWTQC 
Sbjct: 46  VFN-----------TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCL 94

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC    CY Q +P+FDP  SST+K + C +   +           C Y + YG  S++ G
Sbjct: 95  PC--VHCYNQTAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKG 141

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            L TETVT+ ST+GQ   +P    GCG NN G F     G+VGL  G  SLI+QM     
Sbjct: 142 TLVTETVTIHSTSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYP 200

Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-V 296
           G  SYC     ++KINFG N IV+G GVVST +   T    FY L +DA+SVGN R+  V
Sbjct: 201 GLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260

Query: 297 STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
            TP      +IVIDSG+TLT+ P+ Y + +   +  ++ A  V  P   + LCY   ++ 
Sbjct: 261 GTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTID 317

Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVG 405
             P +T+HF  GAD+ L + N +V  +   + C     I NS     I+GN  Q NFLVG
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVG 375

Query: 406 YDIEQQTVSFKPTDCT 421
           YD     VSFKPT+C+
Sbjct: 376 YDSSSLLVSFKPTNCS 391


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 159/417 (38%), Positives = 243/417 (58%), Gaps = 43/417 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF + L H DS K+       T +QR++  + R+ +RL   N     +SS A   + ++ 
Sbjct: 42  GFRITLKHVDSDKN------LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLS 95

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L+ ++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q SP+FDPK SS++  L
Sbjct: 96  GNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPSPIFDPKKSSSFSKL 153

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q SCS  +C+Y  +YGD S + G +ATET T G      V++P + FGC
Sbjct: 154 SCSSQLCKALPQSSCSD-SCEYLYTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGC 207

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIV 263
           G +N G   ++ +G+VGLG G +SL+SQ++     KFSYCL  +  TK +    G+   V
Sbjct: 208 GEDNEGDGFTQGSGLVGLGRGPLSLVSQLK---EAKFSYCLTSIDDTKTSTLLMGSLASV 264

Query: 264 SG--PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTT 308
           +G    + +TPL +     +FY L+++ ISVG  RL +           T  ++IDSGTT
Sbjct: 265 NGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGTT 324

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEVTIHFRGAD 363
           +T+L +     +    +S +   PV D +G+  LELCY+  S +   +VP++ +HF GAD
Sbjct: 325 ITYLEESAFDLVKKEFTSQM-GLPV-DNSGATGLELCYNLPSDTSELEVPKLVLHFTGAD 382

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           ++L   N+ +  S   V  +  G +  + I+GN+ Q N  V +D+E++T+SF PT+C
Sbjct: 383 LELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKETLSFLPTNC 439


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 144/375 (38%), Positives = 212/375 (56%), Gaps = 31/375 (8%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           N L  ++ +S +    +  AD + + + YL+++ +GTPP E +A  DTGSD+IWTQC PC
Sbjct: 393 NFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPC 452

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
           P   CY Q +P+FDP  SST++             ++ C+G +C Y + Y D ++S G L
Sbjct: 453 P--NCYSQFAPIFDPSKSSTFR-------------EQRCNGNSCHYEIIYADKTYSKGIL 497

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTT 238
           ATETVT+ ST+G+   +     GCG +N  L    F S ++GIVGL  G +SLISQM   
Sbjct: 498 ATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP 557

Query: 239 IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLG- 295
             G  SYC     ++KINFGTN IV+G G V+  +   K   FY L +DA+SV +  +  
Sbjct: 558 YPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIAT 617

Query: 296 VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
           + TP      +I IDSGTTLT+ P  Y + +   +  ++ A  V D      LCY  +++
Sbjct: 618 LGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTI 677

Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGY 406
              P +T+HF  GAD+ L + N +++ ++  I C        S+P ++GN  Q NFLVGY
Sbjct: 678 DIFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGY 737

Query: 407 DIEQQTVSFKPTDCT 421
           D     +SF PT+C+
Sbjct: 738 DPSSNVISFSPTNCS 752



 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 158/423 (37%), Positives = 225/423 (53%), Gaps = 57/423 (13%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F+    CF   + + +  G F+++LI R S  S F         RL      
Sbjct: 18  LATTMIVLFLQIITCFLFTTTVSSPHG-FTIDLIQRRSNSSSF---------RL------ 61

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S N+L             +  AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC 
Sbjct: 62  SKNQLQ----------GASPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCM 111

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PCP   CY Q  P+FDP  SST+             N++ C G +C Y + Y D ++S G
Sbjct: 112 PCP--DCYSQFDPIFDPSKSSTF-------------NEQRCHGKSCHYEIIYEDNTYSKG 156

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMR 236
            LATETVT+ ST+G+   +   T GCG +N  L    F S ++GIVGL  G  SLISQM 
Sbjct: 157 ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMD 216

Query: 237 TTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRL 294
               G  SYC     ++KINFGTN IV+G G V+  +   K   FY L +DA+SV + R+
Sbjct: 217 LPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRI 276

Query: 295 G-VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
             + TP      +IVIDSG+T+T+ P  Y + +   +  ++ A  V DP+G+  LCY   
Sbjct: 277 ETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSE 336

Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 404
           ++   P +T+HF  GAD+ L + N +++  S  + C ++         I+GN  Q NFLV
Sbjct: 337 TIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLV 396

Query: 405 GYD 407
           GYD
Sbjct: 397 GYD 399


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 161/426 (37%), Positives = 223/426 (52%), Gaps = 61/426 (14%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           L  + +++++   +   GF+++LIHR S  S                 +R  N       
Sbjct: 6   LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SRVFN------- 42

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
               +   +  AD + +   YL+++ IGTPP E  AV DTGS+ IWTQC PC    CY Q
Sbjct: 43  ----TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQ 96

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
            +P+FDP  SST+K + C +   +           C Y + YG  S++ G L TETVT+ 
Sbjct: 97  TAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKGTLVTETVTIH 145

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           ST+GQ   +P    GCG NN G F     G+VGL  G  SLI+QM     G  SYC    
Sbjct: 146 STSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK 204

Query: 251 SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLG-VSTP------D 300
            ++KINFG N IV+G GVVST +   T    FY L +DA+SVGN R+  V TP      +
Sbjct: 205 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGN 264

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 360
           IVIDSG+TLT+ P+ Y + +   +  ++ A  V  P   + LCY   ++   P +T+HF 
Sbjct: 265 IVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTIDIFPVITMHFS 321

Query: 361 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVGYDIEQQTVSF 415
            GAD+ L + N +V  +   + C     I NS     I+GN  Q NFLVGYD     VSF
Sbjct: 322 GGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSF 379

Query: 416 KPTDCT 421
           KPT+C+
Sbjct: 380 KPTNCS 385


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  238 bits (607), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 149/418 (35%), Positives = 226/418 (54%), Gaps = 40/418 (9%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPCSS  C +L   SCS   C+Y  SYGD S + G LATET T G  +     +  I F
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDAS-----VSKIGF 202

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGTN 260
           GCG +N G   S+  G+VGLG G +SLISQ+      KFSYCL  +  +K    +  G+ 
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSE 259

Query: 261 GIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGT 307
             V     + TPL +     +FY L+++ ISVG+  L +           +  ++IDSGT
Sbjct: 260 ATVKS--AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGT 317

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADV 364
           T+T+L     + L     S ++    A  +  LELC++     S  +VP++  HF G D+
Sbjct: 318 TITYLKDNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDL 377

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           KL + N+ ++ S   V  +  G ++ + I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 378 KLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  237 bits (604), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 149/418 (35%), Positives = 225/418 (53%), Gaps = 40/418 (9%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPCSS  C +L   SCS   C+Y  SYGD S + G LATET T G  +     +  I F
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDAS-----VSKIGF 202

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGTN 260
           GCG +N G   S+  G+VGLG G +SLISQ+      KFSYCL  +  +K    +  G+ 
Sbjct: 203 GCGEDNRGRAYSQGAGLVGLGRGPLSLISQLGVP---KFSYCLTSIDDSKGISTLLVGSE 259

Query: 261 GIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGT 307
             V     + TPL +     +FY L+++ ISVG+  L +           +  ++IDSGT
Sbjct: 260 ATVKS--AIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGT 317

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADV 364
           T+T+L     + L     S ++    A  +  LELC++     S   VP++  HF G D+
Sbjct: 318 TITYLKDSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDL 377

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           KL + N+ ++ S   V  +  G ++ + I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 378 KLPKENYIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 161/436 (36%), Positives = 235/436 (53%), Gaps = 54/436 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN---SSISSSKASQADI 84
           GFSVE IHRDS +SPF++ S T   R+ +A  RS  R    +++       S+    +++
Sbjct: 34  GFSVEFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSEL 93

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP----------- 133
                 YL+ ++IGTPPT  +A+ADTGSDLIW  C        Y  D P           
Sbjct: 94  TSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCS-------YGGDGPGLAAARDADAQ 146

Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  S+T++ + C S  C+ L + SC +   C+YS SYGDGS ++G L+TET T
Sbjct: 147 PPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFT 206

Query: 189 LGSTTGQ-----AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAG 241
                G         +  + FGC T   G  +S   G+VGLGGGD+SL+SQ+   T++  
Sbjct: 207 FADAPGARGDGTTTRVANVNFGCSTTFVG--SSVGDGLVGLGGGDLSLVSQLGADTSLGR 264

Query: 242 KFSYCLVPVS---STKINFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV 296
           +FSYCLVP S   S+ +NFG    V+ PG V+TPL  ++ K +Y++ + ++ VGN+    
Sbjct: 265 RFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTF-- 322

Query: 297 STPD---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-- 351
             PD   +++DSGTTLTFLP+     L+  ++  I+  P   P   L LC+  + + +  
Sbjct: 323 EAPDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQ 382

Query: 352 ----VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLV 404
               +P+VT+    GA V L   N FV+V E  +C     ++   P  I GNI Q N  V
Sbjct: 383 VAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMHV 442

Query: 405 GYDIEQQTVSFKPTDC 420
           GYD+++ TV+F P  C
Sbjct: 443 GYDLDKGTVTFAPAAC 458


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  233 bits (593), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 161/458 (35%), Positives = 242/458 (52%), Gaps = 59/458 (12%)

Query: 1   MATFLSCVFILFFLCFYV----VSPIEAQTGG---------FSVELIHRDSPKSPFYNSS 47
           MA+  S + I+  L   V    VSP  + + G         F V L H DS        +
Sbjct: 1   MASSGSHMIIVILLALAVSSALVSPAASTSRGLDRRPEKTWFRVSLRHVDS------GGN 54

Query: 48  ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
            T ++RL+ A+ R   RL   +  ++ S   + +A +   N  +L++++IGTP     A+
Sbjct: 55  YTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGNGEFLMKLAIGTPAETYSAI 113

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
            DTGSDLIWTQC+PC    C+ Q +P+FDPK SS++  LPCSS  CA+L   SCS   C+
Sbjct: 114 MDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSD-GCE 170

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y  SYGD S + G LATET   G  +     +  I FGCG +N G   S+  G+VGLG G
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDAS-----VSKIGFGCGEDNDGSGFSQGAGLVGLGRG 225

Query: 228 DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVVSTPLTK---AKTF 279
            +SLISQ+      KFSYCL  +  +K   G + ++ G        ++TPL +     +F
Sbjct: 226 PLSLISQLGEP---KFSYCLTSMDDSK---GISSLLVGSEATMKNAITTPLIQNPSQPSF 279

Query: 280 YVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 329
           Y L+++ ISVG+  L +           +  ++IDSGTT+T+L     + L     S ++
Sbjct: 280 YYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTITYLEDSAFAALKKEFISQLK 339

Query: 330 AQPVADPTGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF 384
                D +GS  L+LC++     S   VP++  HF GAD+KL   N+ +  S   V  + 
Sbjct: 340 LD--VDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLT 397

Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            G ++ + I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 398 MGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  231 bits (589), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 141/413 (34%), Positives = 215/413 (52%), Gaps = 38/413 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  V+L   DS K+       T Y+ ++ A+ R   R+   N  + + SS   +  +   
Sbjct: 41  GLRVDLEQVDSGKN------LTKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAG 92

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ ++IGTP +   A+ DTGSDLIWTQCEPC  +QC+ Q +P+F+P+ SS++ +LP
Sbjct: 93  DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLP 150

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S  C  L  ++C+   CQY+  YGDGS + G +ATET T      +  ++P I FGCG
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF-----ETSSVPNIAFGCG 205

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
            +N G       G++G+G G +SL SQ+     G+FSYC+    S+    +  G+     
Sbjct: 206 EDNQGFGQGNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSYGSSSPSTLALGSAASGV 262

Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
             G  ST L  +    T+Y +T+  I+VG   LG+           T  ++IDSGTTLT+
Sbjct: 263 PEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTY 322

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVKLSR 368
           LPQ   + +    +  I    V + +  L  C+   S  S  QVPE+++ F G  + L  
Sbjct: 323 LPQDAYNAVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGE 382

Query: 369 SNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N  +  +E ++C      +   + I+GNI Q    V YD++   VSF PT C
Sbjct: 383 QNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 156/434 (35%), Positives = 238/434 (54%), Gaps = 56/434 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSKASQADIIP 86
           GF + L H DS K+       T  Q+++  + R  +RLN     + ++ +SK    + I 
Sbjct: 44  GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 97

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  +L+ +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ SS
Sbjct: 98  APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSS 155

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           +Y  + CSS  C +L + +C+     C+Y  +YGD S + G LATET T         ++
Sbjct: 156 SYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN----SI 211

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
            GI FGCG  N G   S+ +G+VGLG G +SLISQ++ T   KFSYCL  +  ++     
Sbjct: 212 SGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSL 268

Query: 255 -INFGTNGIVSGPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS------ 297
            I    +GIV+  G  +   +TK  +         FY L +  I+VG +RL V       
Sbjct: 269 FIGSLASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFEL 328

Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ 351
               T  ++IDSGTT+T+L +     L    +S + + PV D +GS  L+LC+     ++
Sbjct: 329 AEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAK 386

Query: 352 ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
              VP++  HF+GAD++L   N+ V  S   V  +  G +N + I+GN+ Q NF V +D+
Sbjct: 387 NIAVPKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDL 446

Query: 409 EQQTVSFKPTDCTK 422
           E++TVSF PT+C K
Sbjct: 447 EKETVSFVPTECGK 460


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 137/353 (38%), Positives = 185/353 (52%), Gaps = 55/353 (15%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           NN  YL++ISIGTPP +   + DTGSDL+WTQC PC    CY Q +P+FDP  S+++K +
Sbjct: 20  NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEV 77

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C S QC  L+                          T T  L            I FGC
Sbjct: 78  SCESQQCRLLD--------------------------TPTSILN-----------IVFGC 100

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG--KFSYCLVPVSS-----TKINFGT 259
           G NN G FN    G+ G GG  +SL SQ+ +T+    KFS CLVP  +     +KI FG 
Sbjct: 101 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 160

Query: 260 NGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP-------DIVIDSGTTLT 310
              VSG  VVSTPL      T+Y +T+D ISVG++    S+        ++ ID+GT  T
Sbjct: 161 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPT 220

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 370
            LP+ + + L+  +   I  +PV DP    +LCY   +L   P +T HF GADV+L   N
Sbjct: 221 LLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLN 280

Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
            F+   E + C   + I     I+GN +Q NFL+G+D++ + VSFK  DCTKQ
Sbjct: 281 TFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTKQ 333


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 136/392 (34%), Positives = 208/392 (53%), Gaps = 33/392 (8%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
           T Y+ ++ A+ R   R+   N  + + SS   +  +   +  YL+ ++IGTP +   A+ 
Sbjct: 56  TKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAGSGEYLMNVAIGTPASSLSAIM 113

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDLIWTQCEPC  +QC+ Q +P+F+P+ SS++ +LPC S  C  L  +SC   +CQY
Sbjct: 114 DTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DCQY 170

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
           +  YGDGS + G +ATET T      +  ++P I FGCG +N G       G++G+G G 
Sbjct: 171 TYGYGDGSSTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 225

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSGPGVVSTPLTKAK---TFYVL 282
           +SL SQ+     G+FSYC+    S+    +  G+       G  ST L  +    T+Y +
Sbjct: 226 LSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPEGSPSTTLIHSSLNPTYYYI 282

Query: 283 TIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
           T+  I+VG   LG+           T  ++IDSGTTLT+LPQ   + +    +  I   P
Sbjct: 283 TLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSP 342

Query: 333 VADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGIT 388
           V + +  L  C+      S  QVPE+++ F G  + L   N  +  +E ++C ++     
Sbjct: 343 VDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNLGEENVLISPAEGVICLAMGSSSQ 402

Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             + I+GNI Q    V YD++   VSF PT C
Sbjct: 403 QGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  227 bits (579), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 156/436 (35%), Positives = 239/436 (54%), Gaps = 60/436 (13%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF + L H DS K+       T  Q+++  + R  +RLN     + ++   AS  D   N
Sbjct: 45  GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAV--ASNPDDTNN 96

Query: 88  --------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
                   +  +L+ +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ 
Sbjct: 97  IKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEK 154

Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           SS+Y  + CSS  C +L + +C+    +C+Y  +YGD S + G LATET T         
Sbjct: 155 SSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDEN---- 210

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--- 254
           ++ GI FGCG  N G   S+ +G+VGLG G +SLISQ++ T   KFSYCL  +  ++   
Sbjct: 211 SISGIGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASS 267

Query: 255 ---INFGTNGIVSGPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS---- 297
              I    +GIV+  G  +   +TK  +         FY L +  I+VG +RL V     
Sbjct: 268 SLFIGSLASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTF 327

Query: 298 ------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSL 349
                 T  ++IDSGTT+T+L +     L    +S + + PV D +GS  L+LC+   + 
Sbjct: 328 ELSEDGTGGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPNA 385

Query: 350 SQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
           ++   VP++  HF+GAD++L   N+ V  S   V  +  G +N + I+GN+ Q NF V +
Sbjct: 386 AKNIAVPKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLH 445

Query: 407 DIEQQTVSFKPTDCTK 422
           D+E++TV+F PT+C K
Sbjct: 446 DLEKETVTFVPTECGK 461


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  226 bits (575), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 135/369 (36%), Positives = 195/369 (52%), Gaps = 33/369 (8%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +     +Y+  IS+GTP      +ADTGSDLIW QC+PC    C+ Q  P+FDP+ S
Sbjct: 30  ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y ++ C  + C SL +KSCS  NC YS  YGDGS + G L++ETVTL ST G+ +A  
Sbjct: 88  SSYTTMSCGDTLCDSLPRKSCS-PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
            I FGCG  N G FN   +G+VGLG G++S +SQ+      KFSYCLVP       ++ +
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST------PD-- 300
            FG        G       TP+      ++FY + +  IS+  + L +        PD  
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQVP 353
             ++ DSGTTLT LP      +L  + S +    +   +  L+LCY  +        ++P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPEIDGSSAGLDLCYDVSGSKASYKKKIP 325

Query: 354 EVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
            +  HF GAD +L   N+F+  ++   IVC         + IYGN+MQ NF V YDI   
Sbjct: 326 AMVFHFEGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSS 385

Query: 412 TVSFKPTDC 420
            + + P+ C
Sbjct: 386 KIGWAPSQC 394


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  225 bits (573), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 135/369 (36%), Positives = 195/369 (52%), Gaps = 33/369 (8%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +     +Y+  IS+GTP      +ADTGSDLIW QC+PC    C+ Q  P+FDP+ S
Sbjct: 30  ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y ++ C  + C SL +KSCS  +C YS  YGDGS + G L++ETVTL ST G+ +A  
Sbjct: 88  SSYTTMSCGDTLCDSLPRKSCS-PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
            I FGCG  N G FN   +G+VGLG G++S +SQ+      KFSYCLVP       ++ +
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQLGDLFGHKFSYCLVPWRDAPSKTSPM 205

Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST------PD-- 300
            FG        G       TP+      ++FY + +  IS+  + L +        PD  
Sbjct: 206 FFGDESSSHSSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGS 265

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVP 353
             ++ DSGTTLT LP      +L  + S I    +   +  L+LCY  +        ++P
Sbjct: 266 GGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPKIDGSSAGLDLCYDVSGSKASYKMKIP 325

Query: 354 EVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
            +  HF GAD +L   N+F+  ++   IVC         + IYGN+MQ NF V YDI   
Sbjct: 326 AMVFHFEGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSS 385

Query: 412 TVSFKPTDC 420
            + + P+ C
Sbjct: 386 KIGWAPSQC 394


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  223 bits (569), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 159/432 (36%), Positives = 231/432 (53%), Gaps = 45/432 (10%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A  GGFSVE IHRDSP+SPF++ + T + R   A  RS+ R      ++S S+S    AD
Sbjct: 29  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
                ++  +  YL+ +++G+PP   LA+ADTGSDL+W +C+           P +Q   
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  SSTY  + C +  C +L + +C  G NC Y  +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGK 242
                   + + V + G+ FGC T   G F +     +G   G +SL++Q+   T++  +
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSFPADGLVGLGG--GAVSLVTQLGGATSLGRR 258

Query: 243 FSYCLVPVS---STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLG-V 296
           FSYCLVP S   S+ +NFG    V+ PG  STPL      T+Y + +D++ VGN+ +   
Sbjct: 259 FSYCLVPHSVNASSALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKTVASA 318

Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQ 351
           ++  I++DSGTTLTFL       ++  +S  I   PV  P G L+LCY+       +   
Sbjct: 319 ASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGES 378

Query: 352 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDI 408
           +P++T+ F  GA V L   N FV V E  +C      T   P  I GN+ Q N  VGYD+
Sbjct: 379 IPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDL 438

Query: 409 EQQTVSFKPTDC 420
           +  TV+F   DC
Sbjct: 439 DAGTVTFAGADC 450


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  220 bits (561), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 154/457 (33%), Positives = 244/457 (53%), Gaps = 69/457 (15%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +   +SC+ +L  L         + + G+ + L H DS              ++    T 
Sbjct: 8   LQALMSCLVLLTSLAV-------SASSGYRLALTHVDS--------------KIGLTKTE 46

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
            + R  H ++  ++S   A+   +      YL+ ++IGTPP   +A+ADTGSDL WTQC+
Sbjct: 47  LMRRAAHRSRLRALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQ 106

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS-LNQKSCSGVN--CQYSVSYGDGSF 177
           PC    C+ QD+P++DP  SST+  +PCSS+ C   L  ++CS  +  C+Y  SY DG++
Sbjct: 107 PC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY 164

Query: 178 SNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           S G L TET+TLGS+  GQAV++  + FGCGT+NGG  +  +TG VGLG G +SL++Q+ 
Sbjct: 165 SAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQLG 223

Query: 237 TTIAGKFSYCLVPVSSTKIN----FGTNG-IVSGPGVV-STPLTKA---KTFYVLTIDAI 287
               GKFSYCL    ++ ++     GT   +  GPG V STPL ++    + YV+++  I
Sbjct: 224 ---VGKFSYCLTDFFNSTLDSPFLLGTLAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGI 280

Query: 288 SVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV---- 333
           ++G+ RL +          ST  +V+DSGTT + LP+     ++  ++ ++   PV    
Sbjct: 281 TLGDVRLPIPNKTFDLHANSTGGMVVDSGTTFSILPESGFRVVVDHVAQVLGQPPVNASS 340

Query: 334 ------ADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFK 385
                   P G  +L +       +P++ +HF  GAD++L R N+     ED   C    
Sbjct: 341 LDSPCFPAPAGERQLPF-------MPDLVLHFAGGADMRLHRDNYMSYNQEDSSFCLNIV 393

Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           G T++  + GN  Q N  + +D+    +SF PTDC+K
Sbjct: 394 GTTSTWSMLGNFQQQNIQMLFDMTVGQLSFLPTDCSK 430


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  219 bits (557), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 161/460 (35%), Positives = 233/460 (50%), Gaps = 62/460 (13%)

Query: 16  FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
            +V   + A+  GFSVE IHRDS KSPF++ + TP+ R   A  RS  R    +   +  
Sbjct: 27  LFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARR 86

Query: 76  SSKASQ--------ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------- 120
           SS A          A+++     YL+ I +GTPP   LA+ADTGSDL+W +C+       
Sbjct: 87  SSGAPSPGTGAGVVAEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNN 146

Query: 121 -PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDGSF 177
              PPS         F P  SSTY  + C +  C +L+   SCS   +C+Y  SYGDGS 
Sbjct: 147 STAPPSV-------YFVPSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSR 199

Query: 178 SNGNLATETVTLGSTTGQA-----------------VALPGITFGCGTNNGGLFNSKTTG 220
           ++G L+TET T  +    +                 V +  + FGC T   G F +    
Sbjct: 200 ASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRADGLV 259

Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLT 274
            +G   G +SL SQ+   T++  KFSYCL P ++T     +NFG+  +VS PG  STPL 
Sbjct: 260 GLGG--GPVSLASQLGATTSLGRKFSYCLAPYANTNASSALNFGSRAVVSEPGAASTPLI 317

Query: 275 --KAKTFYVLTIDAISV-GNQR-LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
             + +T+Y + +D+I+V G +R    +   I++DSGTTLT+L     + L+  ++  I+ 
Sbjct: 318 TGEVETYYTIALDSINVAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKL 377

Query: 331 QPVADPTGSLELCYSFNSLS-----QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF 384
                P   L+LCY  + +       +P+VT+    G +V L   N FV V E ++C   
Sbjct: 378 PRAESPEKILDLCYDISGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLAL 437

Query: 385 KGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
              +   SV I GNI Q N  VGYD+E+ TV+F   DC K
Sbjct: 438 VATSERQSVSILGNIAQQNLHVGYDLEKGTVTFAAADCAK 477


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/410 (34%), Positives = 221/410 (53%), Gaps = 54/410 (13%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQAD----------IIPNNANYLIRISIGTPPTE 103
           +RDAL R ++R     Q+ S+   + +++D           +PN   YL+ +SIGTPP  
Sbjct: 49  VRDALRRDMHR----QQSRSLFGRELAESDGTTVSARTRKDLPNGGEYLMTLSIGTPPLS 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASL--NQK 159
             A+ADTGSDLIWTQC PC   QC+ Q +PL++P  S+T+  LPC+S  S CA +   + 
Sbjct: 105 YPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKA 164

Query: 160 SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
              G  C Y+ +YG G ++ G   +ET T GS       +PGI FGC   +   +N  + 
Sbjct: 165 PPPGCACMYNQTYGTG-WTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG-SA 222

Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTK 275
           G+VGLG G +SL+SQ+    AG+FSYCL P     S++ +  G +  ++G GV STP   
Sbjct: 223 GLVGLGRGSLSLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVA 279

Query: 276 A------KTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNS 318
           +       T+Y L +  IS+G + L +S PD           ++IDSGTT+T L      
Sbjct: 280 SPAKAPMSTYYYLNLTGISLGAKALSIS-PDAFSLKADGTGGLIIDSGTTITSLVNAAYQ 338

Query: 319 NLLSVMSSMIEAQPV--ADPTGSLELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFF 372
            + + + S++    +  +D TG L+LCY+     ++   +P +T+HF GAD+ L   ++ 
Sbjct: 339 QVRAAVQSLVTLPAIDGSDSTG-LDLCYALPTPTSAPPAMPSMTLHFDGADMVLPADSYM 397

Query: 373 VKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +  S  + C   +  T+ ++  +GN  Q N  + YD+  + +SF P  C+
Sbjct: 398 ISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRNEMLSFAPAKCS 446


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 142/418 (33%), Positives = 209/418 (50%), Gaps = 40/418 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-----ASQADI 84
           S  L+ RD+     Y S       + D + R   R  +     S ++ +      S++ +
Sbjct: 60  SFALVRRDAVTGSTYPSRR---HAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKV 116

Query: 85  I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +R+ IG+PPTE+  V D+GSD+IW QC+PC   +CY Q  PLFDP  S
Sbjct: 117 VSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATS 174

Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           +T+ ++PC S+ C +L    C  SG  C Y VSYGDGS++ G LA ET+TLG T     A
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSG-GCDYEVSYGDGSYTKGALALETLTLGGT-----A 228

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           + G+  GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL    +  +  G
Sbjct: 229 VEGVAIGCGHRNRGLFVG-AAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGAGSLVLG 287

Query: 259 TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVIDS 305
            +  V   G V  PL +   A +FY + +  I VG++RL +       T D    +V+D+
Sbjct: 288 RSEAVP-EGAVWVPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDT 346

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-A 362
           GT +T LPQ   + L     + + A P A     L+ CY  +  +  +VP V+ +F G A
Sbjct: 347 GTAVTRLPQEAYAALRDAFVAAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAA 406

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            + L   N  ++V   I C  F   ++   I GNI Q    +  D     + F PT C
Sbjct: 407 TLTLPARNLLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 155/438 (35%), Positives = 229/438 (52%), Gaps = 55/438 (12%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           P      G  V L H D+      + + T  Q LR A  RS +R++     ++  S KA+
Sbjct: 49  PAAGLLDGLRVPLTHVDA------HGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAA 102

Query: 81  -----QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
                Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC   +C+ Q +P+F
Sbjct: 103 AAPDLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPC--VECFNQSTPVF 160

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           DP  SSTY +LPCSSS C+ L   +C+    +C Y+ +YGD S + G LA ET TL  T 
Sbjct: 161 DPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK 220

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
                LPG+ FGCG  N G   ++  G+VGLG G +SL+SQ+     GKFSYCL  +  T
Sbjct: 221 -----LPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGL---GKFSYCLTSLDDT 272

Query: 254 K---INFGTNGIV-----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV------ 296
               +  G+   +     S   + +TPL K     +FY +T+ A++VG+ R+ +      
Sbjct: 273 SKSPLLLGSLAAISTDTASAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFA 332

Query: 297 ----STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS 350
                T  +++DSGT++T+L  QGY     +  + M    PVAD +   L+LC+   +  
Sbjct: 333 VQDDGTGGVIVDSGTSITYLELQGYRPLKKAFAAQM--KLPVADGSAVGLDLCFKAPASG 390

Query: 351 ----QVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLV 404
               +VP++ +HF  GAD+ L   N+ V  S    +C    G +  + I GN  Q N   
Sbjct: 391 VDDVEVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMG-SRGLSIIGNFQQQNIQF 449

Query: 405 GYDIEQQTVSFKPTDCTK 422
            YD+++ T+SF P  C K
Sbjct: 450 VYDVDKDTLSFAPVQCAK 467


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 142/401 (35%), Positives = 217/401 (54%), Gaps = 41/401 (10%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVADT 110
           +RDAL R ++R   F +  + S  +   A     +PN   Y++ ++IGTPP    A+ADT
Sbjct: 48  VRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPAIADT 107

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQ 167
           GSDLIWTQC PC  SQC+ Q    ++P  S+T+  LPC+S  S CA+L   S   G +C 
Sbjct: 108 GSDLIWTQCAPC-GSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGCSCM 166

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y+ +YG G ++ G  + ET T GST      +PGI FGC   +   +N  + G+VGLG G
Sbjct: 167 YNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNG-SAGLVGLGRG 224

Query: 228 DISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------K 277
            +SL+SQ+    AG FSYCL P     S++ +  G +  ++G GV++TP   +       
Sbjct: 225 SMSLVSQLG---AGMFSYCLTPFQDANSTSTLLLGPSAALNGTGVLTTPFVASPSKAPMS 281

Query: 278 TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
           T+Y L +  IS+G   L +           T  ++IDSGTT+T L       + + + S+
Sbjct: 282 TYYYLNLTGISIGTTALSIPPNAFALRTDGTGGLIIDSGTTITSLVDAAYQQVRAAIESL 341

Query: 328 IEAQPVADPTGS--LELCYSFNSLS----QVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
           +   PVAD + S  L+LC++  S +     +P +T HF GAD+ L   N+ + +   + C
Sbjct: 342 VTL-PVADGSDSTGLDLCFALTSETSTPPSMPSMTFHFDGADMVLPVDNYMI-LGSGVWC 399

Query: 382 SVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
              +  T  ++  +GN  Q N  + YDI ++T+SF P  C+
Sbjct: 400 LAMRNQTVGAMSTFGNYQQQNVHLLYDIHEETLSFAPAKCS 440


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  215 bits (547), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 140/352 (39%), Positives = 196/352 (55%), Gaps = 31/352 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           NN +YL+++++GTPP +   + DT SDL+W QC PC    CY Q +P+FDP         
Sbjct: 27  NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPL-------- 76

Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
                +C S    SCS    C Y  +Y D S + G LA E  T  ST G+ + +  I FG
Sbjct: 77  ----KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFG 131

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPV-----SSTKINFGT 259
           CG NN G+FN    G++GLGGG +SL+SQM      K FS CLVP      +S  I+ G 
Sbjct: 132 CGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191

Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVG------NQRLGVSTPDIVIDSGTTLTF 311
              VSG GVV+TPL   + +T Y++T++ ISVG      N    +S  +I+IDSGT  T+
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251

Query: 312 LPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 370
           LPQ +   L+  +   I   P+  DP    +LCY   +  + P +T HF GADVKL    
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKLLPLQ 311

Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            F+   + + C    G T+ + I+GN  Q+N L+G+D++++ V FKPTD TK
Sbjct: 312 TFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  213 bits (543), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 207/425 (48%), Gaps = 49/425 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+            C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           A+ G+  GCG  N GLF     G++GLG G +SLI Q+     G FSYCL    +++   
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCL----ASRGAG 288

Query: 258 GTNGIVSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
           G   +V G       G V  PL +   A +FY + +  I VG +RL +       T D  
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGA 348

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
             +V+D+GT +T LP+   + L       + A P +     L+ CY  +  +  +VP V+
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVS 408

Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            +F +GA + L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F
Sbjct: 409 FYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGF 468

Query: 416 KPTDC 420
            P  C
Sbjct: 469 GPNTC 473


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 138/425 (32%), Positives = 207/425 (48%), Gaps = 49/425 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+            C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           A+ G+  GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL    +++   
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL----ASRGAG 288

Query: 258 GTNGIVSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
           G   +V G       G V  PL +   A +FY + +  I VG +RL +       T D  
Sbjct: 289 GAGSLVLGRTEAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGA 348

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
             +V+D+GT +T LP+   + L       + A P +     L+ CY  +  +  +VP V+
Sbjct: 349 GGVVMDTGTAVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVS 408

Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            +F +GA + L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F
Sbjct: 409 FYFDQGAVLTLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGF 468

Query: 416 KPTDC 420
            P  C
Sbjct: 469 GPNTC 473


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 139/363 (38%), Positives = 207/363 (57%), Gaps = 44/363 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ SS+Y  + CSS  
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSSSYSKVGCSSGL 58

Query: 153 CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C +L + +C+     C+Y  +YGD S + G LATET T         ++ GI FGCG  N
Sbjct: 59  CNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDEN----SISGIGFGCGVEN 114

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------INFGTNGIVS 264
            G   S+ +G+VGLG G +SLISQ++ T   KFSYCL  +  ++      I    +GIV+
Sbjct: 115 EGDGFSQGSGLVGLGRGPLSLISQLKET---KFSYCLTSIEDSEASSSLFIGSLASGIVN 171

Query: 265 GPGV-VSTPLTKAKT---------FYVLTIDAISVGNQRLGVS----------TPDIVID 304
             G  +   +TK  +         FY L +  I+VG +RL V           T  ++ID
Sbjct: 172 KTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIID 231

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEVTIHF 359
           SGTT+T+L +     L    +S + + PV D +GS  L+LC+     ++   VP++  HF
Sbjct: 232 SGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKMIFHF 289

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           +GAD++L   N+ V  S   V  +  G +N + I+GN+ Q NF V +D+E++TVSF PT+
Sbjct: 290 KGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTE 349

Query: 420 CTK 422
           C K
Sbjct: 350 CGK 352


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 151/409 (36%), Positives = 217/409 (53%), Gaps = 33/409 (8%)

Query: 32  ELIHRDSPKSPFY-NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ELIHR+ P SP   N+S+T  +    A+ R   R    +++  ++  +     +   N  
Sbjct: 21  ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHI-LAEGRLFSTPVASGNGE 79

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           YLI IS G+PP +   + DTGSDLIWTQC PC    C    S +FDP  SSTY ++ C+S
Sbjct: 80  YLIDISFGSPPQKASVIVDTGSDLIWTQCLPC--ETCNAAASVIFDPVKSSTYDTVSCAS 137

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           + C+SL  +SC+  +C+Y   YGDGS ++G L+TETVT+         +P + FGCG  N
Sbjct: 138 NFCSSLPFQSCT-TSCKYDYMYGDGSSTSGALSTETVTV-----GTGTIPNVAFGCGHTN 191

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-IVSGPGVV 269
            G F +   GIVGLG G +SLISQ  +  + KFSYCLVP+ STK +    G   +  GV 
Sbjct: 192 LGSF-AGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVA 250

Query: 270 STPL---TKAKTFYVLTIDAISVGNQR----LGVSTPD------IVIDSGTTLTFLPQGY 316
            T L   T   TFY   +  ISV  +     +G  + D       ++DSGTTLT+L  G 
Sbjct: 251 YTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLETGA 310

Query: 317 NSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNF 371
            + L++ + + +   P  +  GS   L+ C+S   ++    P +T HF+GAD +L   N 
Sbjct: 311 FNALVAALKAEV---PFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENV 367

Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           FV +       +    +    I GNI Q N L+ +D+  Q V FK  +C
Sbjct: 368 FVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 154/459 (33%), Positives = 235/459 (51%), Gaps = 60/459 (13%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F              S  + +  ++A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIVSGPGVVSTPLT------- 274
           G G +SL+SQ+  T   +FSYC  P ++T  +    G++  +S     +TP         
Sbjct: 224 GRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSS-AAKTTPFVPSPSGGA 279

Query: 275 -KAKTFYVLTIDAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSV 323
            +  ++Y L+++ I+VG+  L +       TP     ++IDSGTT T L +     L   
Sbjct: 280 RRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARA 339

Query: 324 MSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
           ++S +     +     L LC++  S    +VP + +HF GAD++L R ++ V+     V 
Sbjct: 340 LASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVA 399

Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +       + + G++ Q N  + YD+E+  +SF+P  C
Sbjct: 400 CLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 151/457 (33%), Positives = 234/457 (51%), Gaps = 62/457 (13%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
           L  L F VV    A +G  SV +    IH D           T  Q +RDAL R ++R  
Sbjct: 27  LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77

Query: 67  ----------HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
                        ++   ++  A     +PN   YL+ ++IGTPP    AVADTGSDLIW
Sbjct: 78  SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIW 137

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYSVSY 172
           TQC PC  +QC+ Q +PL++P  S+T+  LPC+S  S CA     +       C Y+ +Y
Sbjct: 138 TQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYNQTY 196

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           G G ++ G   +ET T GS+      +PG+ FGC   +   +N  + G+VGLG G +SL+
Sbjct: 197 GTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSLSLV 254

Query: 233 SQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------KTFYVL 282
           SQ+    AG+FSYCL P     S++ +  G +  ++G GV STP   +       T+Y L
Sbjct: 255 SQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTYYYL 311

Query: 283 TIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
            +  IS+G + L +S       PD    ++IDSGTT+T L       + + + S++   P
Sbjct: 312 NLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSLVTTLP 371

Query: 333 VADPTGS--LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK 385
             D + S  L+LC++  + +      +P +T+HF GAD+ L   ++ +  S  + C   +
Sbjct: 372 TVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMISGS-GVWCLAMR 430

Query: 386 GITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             T+ ++  +GN  Q N  + YD+ ++T+SF P  C+
Sbjct: 431 NQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 467


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 156/459 (33%), Positives = 234/459 (50%), Gaps = 60/459 (13%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHFNQNSSISSSKAS-----------QADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F       SS A            +A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGIDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FGTNGIVSGPGVVSTPLT------- 274
           G G +SL+SQ+  T   +FSYC  P ++T  +    G++  +S     +TP         
Sbjct: 224 GRGPLSLVSQLGVT---RFSYCFTPFNATAASPLFLGSSARLSS-AAKTTPFVPSPSGGA 279

Query: 275 -KAKTFYVLTIDAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSV 323
            +  ++Y L+++ I+VG+  L +       TP     ++IDSGTT T L +     L   
Sbjct: 280 RRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEESAFVALARA 339

Query: 324 MSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
           ++S +     +     L LC++  S    +VP + +HF GAD++L R ++ V+     V 
Sbjct: 340 LASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVA 399

Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +       + + G++ Q N  + YD+E+  +SF+P  C
Sbjct: 400 CLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPAKC 438


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 159/461 (34%), Positives = 226/461 (49%), Gaps = 56/461 (12%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVEL--IHRDSPKSPFYNSSETPYQRLRDALT 59
           A   S   ++  L F  ++       G  VEL  +H D         S T  Q +R AL 
Sbjct: 7   AQMASLAVLIISLVFAALASDSDAAAGVRVELTRVHADP--------SVTASQFVRGALR 58

Query: 60  RSLNRLNHFNQNSSISSSKASQADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           R ++R N      + SS     A     P    YL+ ++IGTPP    A+ADTGSDLIWT
Sbjct: 59  RDMHRHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWT 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCS----GVNCQYSVS 171
           QC PC  SQC+ Q +PL++P  S+T+  LPC+SS   CA+    + +    G  C Y+V+
Sbjct: 119 QCAPC-TSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVT 177

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           YG G +++    +ET T GST      +PGI FGC T + G   S  +G+VGLG G +SL
Sbjct: 178 YGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSL 236

Query: 232 ISQMRTTIAGKFSYCLVPVSSTK------------INFGTNGIVSGPGVVSTPLTKAKTF 279
           +SQ+      KFSYCL P   T             +N GT G+ S P V S       TF
Sbjct: 237 VSQLGVP---KFSYCLTPYQDTNSTSTLLLGPSASLN-GTAGVSSTPFVASPSTAPMNTF 292

Query: 280 YVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
           Y L +  IS+G   L +  PD           ++IDSGTT+T L       + + + S++
Sbjct: 293 YYLNLTGISLGTTALSIP-PDAFSLNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLV 351

Query: 329 EAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
              P  D +    L+LC+   S +     +P +T+HF GAD+ L   ++ +     + C 
Sbjct: 352 TL-PTTDGSADTGLDLCFMLPSSTSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCL 410

Query: 383 VFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             +  T+  V I GN  Q N  + YDI Q+T+SF P  C+ 
Sbjct: 411 AMQNQTDGEVNILGNYQQQNMHILYDIGQETLSFAPAKCSA 451


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  211 bits (538), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 134/416 (32%), Positives = 203/416 (48%), Gaps = 40/416 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+            C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           A+ G+  GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL    +++   
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCL----ASRGAG 288

Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGT 307
           G   +V G         +A +FY + +  I VG +RL +       T D    +V+D+GT
Sbjct: 289 GAGSLVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 348

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
            +T LP+   + L       + A P +     L+ CY  +  +  +VP V+ +F +GA +
Sbjct: 349 AVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVL 408

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 409 TLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 156/461 (33%), Positives = 237/461 (51%), Gaps = 67/461 (14%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNR-- 64
           L  L F VV    A +G  SV +    IH D           T  Q +RDAL R ++R  
Sbjct: 27  LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77

Query: 65  -----------LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
                      L   +  +S + S  ++ D+ PN   YL+ ++IGTPP    AVADTGSD
Sbjct: 78  SRSFGRDRDRELAESDGRTSTTVSARTRKDL-PNGGEYLMTLAIGTPPLPYAAVADTGSD 136

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYS 169
           LIWTQC PC  +QC+ Q +PL++P  S+T+  LPC+S  S CA     +       C Y 
Sbjct: 137 LIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYY 195

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
            +YG G ++ G   +ET T GS+      +PG+ FGC   +   +N  + G+VGLG G +
Sbjct: 196 QTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSL 253

Query: 230 SLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVVSTPLTKA------KTF 279
           SL+SQ+    AG+FSYCL P     S++ +  G +  ++G GV STP   +       T+
Sbjct: 254 SLVSQLG---AGRFSYCLTPFQDTNSTSTLLLGPSAALNGTGVRSTPFVASPARAPMSTY 310

Query: 280 YVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMI 328
           Y L +  IS+G + L +S       PD    ++IDSGTT+T L    Y     +V S ++
Sbjct: 311 YYLNLTGISLGAKALPISPGAFSLKPDGTGGLIIDSGTTITSLANAAYQQVRAAVKSQLV 370

Query: 329 EAQPVADPTGS--LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSEDIVC 381
              P  D + S  L+LC++  + +      +P +T+HF GAD+ L   ++ +  S  + C
Sbjct: 371 TTLPTVDGSDSTGLDLCFALPAPTSAPPAVLPSMTLHFDGADMVLPADSYMISGS-GVWC 429

Query: 382 SVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
              +  T+ ++  +GN  Q N  + YD+ ++T+SF P  C+
Sbjct: 430 LAMRNQTDGAMSTFGNYQQQNMHILYDVREETLSFAPAKCS 470


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  211 bits (537), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 147/433 (33%), Positives = 225/433 (51%), Gaps = 56/433 (12%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-I 85
           GG  V L H D+      + + +  Q L+ A  RS +R++     ++   + A   D+ +
Sbjct: 38  GGLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQV 91

Query: 86  P---NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           P    N  +L+ ++IGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SST
Sbjct: 92  PVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSST 149

Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y ++PCSS+ C+ L   +C S   C Y+ +YGD S + G LA+ET TLG    +   LPG
Sbjct: 150 YATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGK---EKKKLPG 206

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           + FGCG  N G   ++  G+VGLG G +SL+SQ+      KFSYCL   +S     G + 
Sbjct: 207 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLD---KFSYCL---TSLDDGDGKSP 260

Query: 262 IVSGPG------------VVSTPLTK---AKTFYVLTIDAISVGNQRLGV---------- 296
           ++ G              V +TPL K     +FY +++  ++VG+ R+ +          
Sbjct: 261 LLLGGSAAAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDD 320

Query: 297 STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS---- 350
            T  +++DSGT++T+L  QGY +   + ++ M  A P  D +   L+LC+   +      
Sbjct: 321 GTGGVIVDSGTSITYLELQGYRALKKAFVAQM--ALPTVDGSEIGLDLCFQGPAKGVDEV 378

Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           QVP++ +HF  GAD+ L   N+ V  S      +    +  + I GN  Q NF   YD+ 
Sbjct: 379 QVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVA 438

Query: 410 QQTVSFKPTDCTK 422
             T+SF P  C K
Sbjct: 439 GDTLSFAPVQCNK 451


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  211 bits (536), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 152/430 (35%), Positives = 225/430 (52%), Gaps = 55/430 (12%)

Query: 31  VEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN--HFNQNSSISSSKASQADIIP 86
           VEL  IH D         S T  Q +RDAL R ++R N      +SS  ++ ++   I P
Sbjct: 30  VELTRIHADP--------SVTASQFVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISP 81

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+  L
Sbjct: 82  TAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPC-SSQCFQQPTPLYNPSSSTTFAVL 140

Query: 147 PCSS--SQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPG 201
           PC+S  S CA+    +    G  C Y+++YG G +++    +ET T GS+T      +PG
Sbjct: 141 PCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVPG 199

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINF 257
           I FGC   +GG   S  +G+VGLG G +SL+SQ+      KFSYCL P     S++ +  
Sbjct: 200 IAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQLGVP---KFSYCLTPYQDTNSTSTLLL 256

Query: 258 G-------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------- 301
           G       T G+ S P V S       T+Y L +  IS+G   L + T  +         
Sbjct: 257 GPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSLKADGTGG 316

Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQ----V 352
            +IDSGTT+T L       + + + S++   P  D     TG L+LC+   S +     +
Sbjct: 317 FIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATG-LDLCFELPSSTSAPPTM 374

Query: 353 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQ 411
           P +T+HF GAD+ L   ++ + +  ++ C   +  T+  V I GN  Q N  + YD+ Q+
Sbjct: 375 PSMTLHFDGADMVLPADSYMM-LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVGQE 433

Query: 412 TVSFKPTDCT 421
           T++F P  C+
Sbjct: 434 TLTFAPAKCS 443


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  211 bits (536), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 156/433 (36%), Positives = 225/433 (51%), Gaps = 57/433 (13%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
           G  V L H D+      + + + +Q LR A  RS   ++RL        ++SSKA+    
Sbjct: 40  GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 93

Query: 81  -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  
Sbjct: 94  LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 151

Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SSTY ++PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      
Sbjct: 152 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 206

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           LPG+ FGCG  N G   S+  G+VGLG G +SL+SQ+      KFSYCL  +  T  +  
Sbjct: 207 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPL 263

Query: 259 TNGIVSG--------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------S 297
             G ++G          V +TPL K     +FY +++ AI+VG+ R+ +           
Sbjct: 264 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 323

Query: 298 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 351
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 324 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 381

Query: 352 VPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           VP +  HF  GAD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 440

Query: 410 QQTVSFKPTDCTK 422
             T+SF P  C K
Sbjct: 441 HDTLSFAPVQCNK 453


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 156/433 (36%), Positives = 225/433 (51%), Gaps = 57/433 (13%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
           G  V L H D+      + + + +Q LR A  RS   ++RL        ++SSKA+    
Sbjct: 30  GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 83

Query: 81  -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  
Sbjct: 84  LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 141

Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SSTY ++PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      
Sbjct: 142 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           LPG+ FGCG  N G   S+  G+VGLG G +SL+SQ+      KFSYCL  +  T  +  
Sbjct: 197 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPL 253

Query: 259 TNGIVSG--------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------S 297
             G ++G          V +TPL K     +FY +++ AI+VG+ R+ +           
Sbjct: 254 LLGSLAGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDG 313

Query: 298 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 351
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 314 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 371

Query: 352 VPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           VP +  HF  GAD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 430

Query: 410 QQTVSFKPTDCTK 422
             T+SF P  C K
Sbjct: 431 HDTLSFAPVQCNK 443


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 136/380 (35%), Positives = 215/380 (56%), Gaps = 42/380 (11%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +SS A  A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC    C+ QD+P+
Sbjct: 77  TSSDAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPI 134

Query: 135 FDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS 191
           +D  +SS++  +PC+S+ C  + + ++C+  +  C+Y  +YGDG++S G L TET+T   
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPG 194

Query: 192 TTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
             G  V++ GI FGCG +NGGL +NS  TG VGLG G +SL++Q+     GKFSYCL   
Sbjct: 195 APG--VSVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLGV---GKFSYCLTDF 247

Query: 251 SSTKIN----FGTNGIVSGP----GVVSTPLTKA---KTFYVLTIDAISVGNQRLGV--- 296
            +T +     FG    ++ P     V STPL ++    T+Y ++++ IS+G+ RL +   
Sbjct: 248 FNTSLGSPVLFGALAELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNG 307

Query: 297 -------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--- 346
                   +  +++DSGTT TFL +     ++  ++ ++  QPV + +     C+     
Sbjct: 308 TFDLRDDGSGGMIVDSGTTFTFLVESAFRVVVDHVAGVLR-QPVVNASSLDSPCFPAATG 366

Query: 347 -NSLSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
              L  +P++ +HF  GAD++L R N+  F +       ++    +  V I GN  Q N 
Sbjct: 367 EQQLPAMPDMVLHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNI 426

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            + +DI    +SF PTDC K
Sbjct: 427 QMLFDITVGQLSFMPTDCGK 446


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 155/435 (35%), Positives = 219/435 (50%), Gaps = 56/435 (12%)

Query: 28  GFSVEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI- 84
           G  VEL  +H D         S T  Q +R AL R ++R N      + SS     A   
Sbjct: 31  GVRVELTRVHADP--------SVTASQFVRGALRRDMHRHNARKLALAASSGATVSAPTQ 82

Query: 85  -IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             P    YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+
Sbjct: 83  NSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTF 141

Query: 144 KSLPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             LPC+SS   CA+    + +    G  C Y+V+YG G +++    +ET T GST     
Sbjct: 142 AVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQS 200

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--- 254
            +PGI FGC T + G   S  +G+VGLG G +SL+SQ+      KFSYCL P   T    
Sbjct: 201 RVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTS 257

Query: 255 ---------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----- 300
                    +N GT G+ S P V S       TFY L +  IS+G   L +  PD     
Sbjct: 258 TLLLGPSASLN-GTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIP-PDAFLLN 315

Query: 301 ------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ- 351
                 ++IDSGTT+T L       + + + S++   P  D + +  L+LC+   S +  
Sbjct: 316 ADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSSTSA 374

Query: 352 ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYD 407
              +P +T+HF GAD+ L   ++ +     + C   +  T+  V I GN  Q N  + YD
Sbjct: 375 PPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYD 434

Query: 408 IEQQTVSFKPTDCTK 422
           I Q+T+SF P  C+ 
Sbjct: 435 IGQETLSFAPAKCSA 449


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 154/427 (36%), Positives = 217/427 (50%), Gaps = 39/427 (9%)

Query: 22  IEAQTGGFSVELIHRDSPKSPF-YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           +++ TG  +V L HR  P SP       T  +RL     R+      F+      S   +
Sbjct: 51  VKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGA 110

Query: 81  QADIIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             D+  ++A              YLI + +G+P   +  + DTGSD+ W QC+PC  SQC
Sbjct: 111 -GDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPC--SQC 167

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATE 185
           + Q  PLFDP  SSTY    CSS+ CA L Q+   CS   CQY+V+YGDGS + G  +++
Sbjct: 168 HSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSD 227

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+ LGS      A+    FGC     G FN +T G++GLGGG  SL+SQ   T    FSY
Sbjct: 228 TLALGSN-----AVRKFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281

Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVST---- 298
           CL P +S+   F T G  +  G V TP+ ++    TFY + I AI VG ++L + T    
Sbjct: 282 CL-PATSSSSGFLTLGAGT-SGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFS 339

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVT 356
              ++DSGT LT LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V 
Sbjct: 340 AGTIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVA 399

Query: 357 IHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
           + F  GA V ++     ++ S  I+C  F   ++  S+ I GN+ Q  F V YD+    V
Sbjct: 400 LVFSGGAVVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAV 459

Query: 414 SFKPTDC 420
            FK   C
Sbjct: 460 GFKAGAC 466


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 134/353 (37%), Positives = 186/353 (52%), Gaps = 31/353 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +  NY++ + +GTP ++   V DTGSD  W QC PC   +CY Q  PLFDP  SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKEPLFDPAKSSTYANV 217

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+ S CA L+   C+G +C Y+V YGDGS++ G  A +T+T+        A+ G  FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  KT G++GLG G  SL  Q      G F+YCL  +++     GT  +  GP
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-----GTGYLDFGP 326

Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
           G        TP+   K +TFY + +  I VG Q++ V     ST   ++DSGT +T LP 
Sbjct: 327 GSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPA 386

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRS 369
              + L S    ++ A+      G   L+ CY F  LS V  P V++ F+ GA + +  S
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVS 446

Query: 370 NFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                +SE  VC  F   G   SV I GN  Q  + V YD+ ++TV F P  C
Sbjct: 447 GIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 146/426 (34%), Positives = 222/426 (52%), Gaps = 61/426 (14%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
           GFSVE IHRDS KS F++ + TP  RLR A  RS+ R  H  + ++ +++  +       
Sbjct: 3   GFSVEFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSD 62

Query: 82  ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + ++P N  YL+ + + TPP   LA+ADTGS L+W +C+            P    
Sbjct: 63  ADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-----------LPAAHT 111

Query: 138 KMSSTYKSLPCSSSQCASL-NQKSC----SGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
             SS+Y  LPC +  C +L +  SC    SG N C Y  ++ DGS + G +  +  T  +
Sbjct: 112 PASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFST 171

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
                     + FGC T   GL +    G+VGL  G ISL+SQ+  +T  A KFSYCLVP
Sbjct: 172 R---------LDFGCATRTEGL-SVPDDGLVGLANGPISLVSQLSAKTPFAHKFSYCLVP 221

Query: 250 -----VSSTKINFGTNGIV-SGPGVVSTPLT--KAKTFYVLTIDAISVGNQ--RLGVSTP 299
                  S+ +NFG++ IV S PG  +TPL   + K+FY + +D+I V  +   L  +T 
Sbjct: 222 YSSSETVSSSLNFGSHAIVSSSPGAATTPLVAGRNKSFYTIALDSIKVAGKPVPLQTTTT 281

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS------QVP 353
            +++DSGT LT+LP+     L++ +++ I+   V  P     +CY     +       +P
Sbjct: 282 KLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPEDVGKSIP 341

Query: 354 EVTIHF-RGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIE 409
           +VT+    G +V+L   N F V+     VC     + + +P  I GN+ Q N  VG+D+E
Sbjct: 342 DVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL--VESHLPEFILGNVAQQNLHVGFDLE 399

Query: 410 QQTVSF 415
           ++TVSF
Sbjct: 400 RRTVSF 405


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  209 bits (531), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 134/353 (37%), Positives = 186/353 (52%), Gaps = 31/353 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +  NY++ + +GTP ++   V DTGSD  W QC PC   +CY Q  PLFDP  SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKGPLFDPAKSSTYANV 217

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+ S CA L+   C+G +C Y+V YGDGS++ G  A +T+T+        A+ G  FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  KT G++GLG G  SL  Q      G F+YCL  +++     GT  +  GP
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTT-----GTGYLDFGP 326

Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
           G        TP+   K +TFY + +  I VG Q++ V     ST   ++DSGT +T LP 
Sbjct: 327 GSAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPA 386

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRS 369
              + L S    ++ A+      G   L+ CY F  LS V  P V++ F+ GA + +  S
Sbjct: 387 TAYTALSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVS 446

Query: 370 NFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                +SE  VC  F   G   SV I GN  Q  + V YD+ ++TV F P  C
Sbjct: 447 GIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 142/425 (33%), Positives = 205/425 (48%), Gaps = 46/425 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--------ISSSKASQ 81
           S  L+ RD+     Y S   P   + D ++R   R  +     S          S     
Sbjct: 59  SFALVRRDAVTGATYPS---PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVV 115

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           + +   +  Y +R+ IG+PPTE+  V D+GSD+IW QC+PC   +CY Q  PLFDP  S+
Sbjct: 116 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSA 173

Query: 142 TYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           T+ ++ C S+ C +L    C  SG  C+Y VSYGDGS++ G LA ET+TLG T     A+
Sbjct: 174 TFSAVSCGSAICRTLRTSGCGDSG-GCEYEVSYGDGSYTKGTLALETLTLGGT-----AV 227

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV--SSTKINF 257
            G+  GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL     S +    
Sbjct: 228 EGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAAD 286

Query: 258 GTNGIVSG------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD-- 300
               +V G       G V  PL +   A +FY + +  I VG++RL +       T D  
Sbjct: 287 AAGSLVLGRSEAVPEGAVWVPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGG 346

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
             +V+D+GT +T LPQ   + L       + A P A     L+ CY  +  +  +VP V+
Sbjct: 347 GGVVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVS 406

Query: 357 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            +F G A + L   N  ++V   I C  F   ++ + I GNI Q    +  D     + F
Sbjct: 407 FYFDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGF 466

Query: 416 KPTDC 420
            P  C
Sbjct: 467 GPATC 471


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  208 bits (530), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 143/390 (36%), Positives = 205/390 (52%), Gaps = 30/390 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
           + +R  + +S  R+       NSS  SS A   D+     P+   Y++ IS+GTP     
Sbjct: 10  EAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
           A+ADTGSDL+W Q EPC  + C      +FDP+ SST++ + CSS  C  L      G +
Sbjct: 70  AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCTELPGSCEPGSS 125

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  YG G  + G  A +T++LG+T+G +   P    GCG  N G       G+VGL
Sbjct: 126 ACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAK--- 277
           G G +SL SQ+   I  KFSYCLV ++    S+ + FG +  + G G+ ST +T      
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242

Query: 278 -TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
            T+Y+LT++ I+V  Q +G S    +IDSGTTLT++P G    +LS M SM+    V   
Sbjct: 243 PTYYLLTVNGIAVAGQTMG-SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS 301

Query: 337 TGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVP 392
           +  L+LCY  S N   + P +TI   GA +    SN+F+ V +  D VC +  G    +P
Sbjct: 302 SMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVC-LAMGSAGGLP 360

Query: 393 --IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             I GN+MQ  + + YD     +SF    C
Sbjct: 361 VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 143/390 (36%), Positives = 206/390 (52%), Gaps = 30/390 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
           + +R  + +S  R+       NSS  SS A   D+     P+   Y++ IS+GTP     
Sbjct: 10  EAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
           A+ADTGSDL+W Q EPC  + C      +FDP+ SST++ + CSS  CA L      G +
Sbjct: 70  AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCAELPGSCEPGSS 125

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  YG G  + G  A +T++LG+T+  +   P    GCG  N G       G+VGL
Sbjct: 126 TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAK--- 277
           G G +SL SQ+   I  KFSYCLV ++    S+ + FG +  + G G+ ST +T      
Sbjct: 183 GQGPVSLTSQLSAAIDSKFSYCLVDINSQSESSPLLFGPSAALHGTGIQSTKITPPSDTY 242

Query: 278 -TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
            T+Y+LT++ I+V  Q +G S    +IDSGTTLT++P G    +LS M SM+    V   
Sbjct: 243 PTYYLLTVNGIAVAGQTMG-SPGTTIIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGS 301

Query: 337 TGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVP 392
           +  L+LCY  S N   + P +TI   GA +    SN+F+ V +  D VC +  G  + +P
Sbjct: 302 SMGLDLCYDRSSNRNYKFPALTIRLAGATMTPPSSNYFLVVDDSGDTVC-LAMGSASGLP 360

Query: 393 --IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             I GN+MQ  + + YD     +SF    C
Sbjct: 361 VSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  207 bits (527), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 151/421 (35%), Positives = 213/421 (50%), Gaps = 38/421 (9%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQA 82
           +GG +V L HR  P SP   S++ P   L + L R   R  +  +  S +     + S A
Sbjct: 58  SGGITVPLHHRHGPCSPV-PSNKMP-ASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDA 115

Query: 83  DIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             +P       +   Y+I + IG+P   +    DTGSD+ W QC+PC  SQC+ +   LF
Sbjct: 116 ATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLF 173

Query: 136 DPKMSSTYKSLPCSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
           DP  SSTY    CSS+ C  L+Q      CS   CQY VSY DGS + G  +++T+TLGS
Sbjct: 174 DPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGS 233

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
                 A+ G  FGC  +  G F+ +T G++GLGG   SL+SQ   T    FSYCL P  
Sbjct: 234 N-----AIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTP 288

Query: 252 STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVST----PDIVID 304
            +   F T G  S  G V TP+   T+  T+Y + ++AI VG Q+L + T       V+D
Sbjct: 289 GSS-GFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMD 347

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
           SGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F  G
Sbjct: 348 SGTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGG 407

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           A V L  +   +++  D  C  F   ++  S+   GN+ Q  F V YD+    V F+   
Sbjct: 408 AVVNLDFNGIMLEL--DNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGA 465

Query: 420 C 420
           C
Sbjct: 466 C 466


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/420 (34%), Positives = 226/420 (53%), Gaps = 47/420 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G+ + L H DS          T  + +R A+ RS  R        ++S   A+   +   
Sbjct: 22  GYRLVLTHVDS------KGGYTKTELMRRAVHRSRLR--------ALSGYDATSPRLHSV 67

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+ ++IG PP   +A+ADTGSDL WTQC+PC    C+ QD+P++DP  SST+  LP
Sbjct: 68  QVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPVYDPSASSTFSPLP 125

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           CSS+ C  +  ++C+  + C+Y  +YGDG++S G L TET+TLG ++   V++ G+ FGC
Sbjct: 126 CSSATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSA-PVSVGGVAFGC 184

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN----FGTNG- 261
           GT+NGG  +  +TG VGLG G +SL++Q+     GKFSYCL    ++ ++     GT   
Sbjct: 185 GTDNGG-DSLNSTGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSALDSPFLLGTLAE 240

Query: 262 IVSGPGVV-STPLTKA---KTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGT 307
           +  GP  V STPL ++    + Y +++  IS+G+ RL          G  T  +++DSGT
Sbjct: 241 LAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGT 300

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADV 364
           T T L +     ++  ++ ++  QP  + +     C+         +P++ +HF  GAD+
Sbjct: 301 TFTILAESGFREVVGRVARVL-GQPPVNASSLDAPCFPAPAGEPPYMPDLVLHFAGGADM 359

Query: 365 KLSRSNFFVKVSED-IVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +L R N+     ED   C    G T  S  + GN  Q N  + +D     +SF PTDC+K
Sbjct: 360 RLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPTDCSK 419


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  207 bits (526), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 138/366 (37%), Positives = 196/366 (53%), Gaps = 43/366 (11%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SSTY ++
Sbjct: 70  GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATV 127

Query: 147 PCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      LPG+ FG
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFG 182

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG  N G   S+  G+VGLG G +SL+SQ+      KFSYCL  +  T  +    G ++G
Sbjct: 183 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAG 239

Query: 266 --------PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
                     V +TPL K     +FY +++ AI+VG+ R+ +           T  +++D
Sbjct: 240 ISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVD 299

Query: 305 SGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVPEVTIH 358
           SGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +VP +  H
Sbjct: 300 SGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFH 357

Query: 359 FR-GADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F  GAD+ L   N+ V       +C    G +  + I GN  Q NF   YD+   T+SF 
Sbjct: 358 FDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVGHDTLSFA 416

Query: 417 PTDCTK 422
           P  C K
Sbjct: 417 PVQCNK 422


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 143/425 (33%), Positives = 215/425 (50%), Gaps = 45/425 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK------ASQ 81
           GF ++L H D+       +S T  Q L  A+ RS  R+    Q++++S +       A++
Sbjct: 27  GFQLKLTHVDA------GTSYTKPQLLSRAIARSKARVAAL-QSAAVSPAPVADPITAAR 79

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
             +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+
Sbjct: 80  VLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSA 137

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY++LPC SS+CA+L+  SC    C Y   YGD + + G LA ET T G+ +   V    
Sbjct: 138 TYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN 197

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFG 258
           I+FGCG+ N G   + ++G+VG G G +SL+SQ+  +   +FSYCL    S   +++ FG
Sbjct: 198 ISFGCGSLNAGEL-ANSSGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSPTPSRLYFG 253

Query: 259 ------TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TP 299
                 +    SG  V STP          Y L++  IS+G +RL +           T 
Sbjct: 254 VFANLNSTNTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTG 313

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEV 355
            ++IDSGT++T+L Q     +   ++S I    + D    L+ C+ +    N    VP+ 
Sbjct: 314 GVIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDF 373

Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
             HF GA++ L   N+ +  S      +    T+   I GN  Q N  + YDI    +SF
Sbjct: 374 VFHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFLSF 433

Query: 416 KPTDC 420
            P  C
Sbjct: 434 VPAPC 438


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  206 bits (524), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 143/420 (34%), Positives = 208/420 (49%), Gaps = 37/420 (8%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-------NSSISSSKASQADII 85
           ++HR  P SP       P     + L R  +R++  ++       +++   S AS+   +
Sbjct: 68  VVHRHGPCSPLQARGGEPSHA--EILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSL 125

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P         ANY++ + +GTP  + L V DTGSDL W QC+PC    CY Q  PLFDP 
Sbjct: 126 PARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPC--DGCYQQHDPLFDPS 183

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+TY ++PC + +C  L+  SCS   C+Y V YGD S ++GNLA +T+TLG ++  + +
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSS 243

Query: 199 --LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
             L    FGCG ++ GLF  K  G+ GLG   +SL SQ        FSYCL P SST   
Sbjct: 244 DQLQEFVFGCGDDDTGLFG-KADGLFGLGRDRVSLASQAAAKYGAGFSYCL-PSSSTAEG 301

Query: 257 FGTNGIVSGPGVVSTPL-TKAKT--FYVLTIDAISVGNQRLGVS-----TPDIVIDSGTT 308
           + + G  + P    T + T++ T  FY L +  I V  + + VS     TP  VIDSGT 
Sbjct: 302 YLSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTV 361

Query: 309 LTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GAD 363
           +T LP    + L S  + ++   +   A     L+ CY F   +  Q+P V + F  GA 
Sbjct: 362 ITRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGAT 421

Query: 364 VKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           + L         ++   C  F   G   S+ I GN+ Q  F V YD+  Q + F    C+
Sbjct: 422 LNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 134/416 (32%), Positives = 199/416 (47%), Gaps = 53/416 (12%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+            C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           A+ G+  GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL        + 
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLA-------SR 285

Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGT 307
           G  G  S           A +FY + +  I VG +RL +       T D    +V+D+GT
Sbjct: 286 GAGGAGS----------LASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGT 335

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
            +T LP+   + L       + A P +     L+ CY  +  +  +VP V+ +F +GA +
Sbjct: 336 AVTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVL 395

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 396 TLPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 140/414 (33%), Positives = 203/414 (49%), Gaps = 36/414 (8%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQADIIPNN 88
           ++HR  P SP       P     + L R  +R++  ++ ++       S AS+   +P +
Sbjct: 121 VVHRHGPCSPLLARGGEPSHA--EILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAH 178

Query: 89  -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                  ANY++ + +GTP  + L V DTGSDL W QC+PC  + CY Q  PLFDP  S+
Sbjct: 179 RGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPC--NNCYKQHDPLFDPSQST 236

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY ++PC + +C  L+  +CS   C+Y V YGD S ++GNLA +T+TLG ++ Q   L G
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ---LQG 291

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
             FGCG ++ GLF  +  G+ GLG   +SL SQ        FSYCL P S     + + G
Sbjct: 292 FVFGCGDDDTGLFG-RADGLFGLGRDRVSLASQAAARYGAGFSYCL-PSSWRAEGYLSLG 349

Query: 262 IVSGP--GVVSTPLTKAKT--FYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
             + P     +  +T++ T  FY L +  I V  + + V+      P  VIDSGT +T L
Sbjct: 350 SAAAPPHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAPGTVIDSGTVITRL 409

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRS 369
           P    S L S  +  +     A     L+ CY F   +  Q+P V + F  GA + L   
Sbjct: 410 PSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVALLFDGGATLNLGFG 469

Query: 370 NFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                 +    C  F   G   SV I GN+ Q  F V YD+  Q + F    C+
Sbjct: 470 GVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  204 bits (519), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 135/391 (34%), Positives = 206/391 (52%), Gaps = 34/391 (8%)

Query: 56  DALTRSLNRLNHFNQNSSISS--SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
           +A+ RS  R+  +    S  +  S+  Q+ +   N  YL+ +++G+PP     + DTGSD
Sbjct: 2   EAVQRSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSD 61

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVS 171
           L W QC PC    CY Q  P FDP  S +++   C+ + C  ++L  K+C+   CQY  +
Sbjct: 62  LNWVQCLPC--RVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYT 119

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           YGD S +NG+LA ET++L +  G   ++P   FGCGT N G F +   G+VGLG G +SL
Sbjct: 120 YGDQSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSL 177

Query: 232 ISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTID 285
            SQ+  T A KFSYCLV    +S++ + FG+  I +   +  T +    +  T+Y + ++
Sbjct: 178 NSQLSHTFANKFSYCLVSLNSLSASPLTFGS--IAAAANIQYTSIVVNARHPTYYYVQLN 235

Query: 286 AISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
           +I VG Q L ++ P +            +IDSGTT+T L     S +L    S +    +
Sbjct: 236 SIEVGGQPLNLA-PSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRL 294

Query: 334 ADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV--SEDIVCSVFKGITN 389
                 L+LC++   +S   VP++   F+GAD ++   N FV V  S   +C    G + 
Sbjct: 295 DGSAYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGG-SQ 353

Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              I GNI Q N LV YD+E + + F   DC
Sbjct: 354 GFSIIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 155/425 (36%), Positives = 218/425 (51%), Gaps = 56/425 (13%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A  GGFSVE IHRDSP+SPF++ + T + R   A  RS+ R      ++S S+S    AD
Sbjct: 29  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
                ++  +  YL+ +++G+PP   LA+ADTGSDL+W +C+           P +Q   
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  SSTY  + C +  C +L + +C  G NC Y  +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGK 242
                   + + V + G+ FGC T   G F +     +G   G +SL++Q+   T++  +
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGG--GAVSLVTQLGGATSLGRR 258

Query: 243 FSYCLVPVS---STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP 299
           FSYCLVP S   S+ +NFG    V+ PG  STPL   KT         S  + R      
Sbjct: 259 FSYCLVPHSVNASSALNFGALADVTEPGAASTPLVGNKTV-------ASAASSR------ 305

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPE 354
            I++DSGTTLTFL       ++  +S  I   PV  P G L+LCY+       +   +P+
Sbjct: 306 -IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPD 364

Query: 355 VTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQ 411
           +T+ F  GA V L   N FV V E  +C      T   P  I GN+ Q N  VGYD++  
Sbjct: 365 LTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAG 424

Query: 412 TVSFK 416
           TV  K
Sbjct: 425 TVGNK 429



 Score = 87.0 bits (214), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/146 (36%), Positives = 77/146 (52%), Gaps = 9/146 (6%)

Query: 284 IDAISVGNQRLG-VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           +DA +VGN+ +   ++  I++DSGTTLTFL       ++  +S  I   PV  P G L+L
Sbjct: 421 LDAGTVGNKTVASAASSRIIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQL 480

Query: 343 CYS-----FNSLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IY 394
           CY+       +   +P++T+ F  GA V L   N FV V E  +C      T   P  I 
Sbjct: 481 CYNVAGREVEAGESIPDLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSIL 540

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
           GN+ Q N  VGYD++  TV+F   DC
Sbjct: 541 GNLAQQNIHVGYDLDAGTVTFAVADC 566


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/430 (33%), Positives = 216/430 (50%), Gaps = 54/430 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDA-------LTRSLNRLNHFNQNSSISSSKAS 80
           G  V L H D+      + + T  Q LR A       ++R + R       SS + + A 
Sbjct: 38  GLRVALTHVDA------HGNYTKLQLLRRAARRSRHRMSRLVARTTGVPVMSSKAVAPAL 91

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC   +C+ Q +P+FDP  S
Sbjct: 92  QVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPC--VECFNQSTPVFDPSSS 149

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           STY +LPCSS+ C+ L    C+   C Y+ +YGD S + G LA ET TL  T      LP
Sbjct: 150 STYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTK-----LP 204

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INF 257
            + FGCG  N G   ++  G+VGLG G +SL+SQ+      KFSYCL  +  T    +  
Sbjct: 205 DVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGLN---KFSYCLTSLDDTSKSPLLL 261

Query: 258 GTNGIV-----SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STP 299
           G+   +     +   V +TPL +     +FY + +  ++VG+  + +           T 
Sbjct: 262 GSLATISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTG 321

Query: 300 DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVP 353
            +++DSGT++T+L  QGY +   +  + M    P AD +G  L+ C+   +      +VP
Sbjct: 322 GVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAADGSGIGLDTCFEAPASGVDQVEVP 379

Query: 354 EVTIHFRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           ++  H  GAD+ L   N+ V  S    +C    G +  + I GN  Q N    YD+ + T
Sbjct: 380 KLVFHLDGADLDLPAENYMVLDSGSGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVGENT 438

Query: 413 VSFKPTDCTK 422
           +SF P  C K
Sbjct: 439 LSFAPVQCAK 448


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 128/363 (35%), Positives = 185/363 (50%), Gaps = 37/363 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+R+++GTP        DTGSDL+WTQC PC    C+ QD P+ DP  SSTY +LPC 
Sbjct: 83  EYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCG 140

Query: 150 SSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALP 200
           +++C +L   SC GV        C Y+  YGD S + G +AT+  T G +  +G+++   
Sbjct: 141 AARCRALPFTSC-GVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
            +TFGCG  N G+F S  TGI G G G  SL SQ+  T    FSYC   +  +K +  T 
Sbjct: 200 RLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFESKSSLVTL 256

Query: 261 G---------IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---VIDS 305
           G           SG  V +TP+ K     + Y L++  ISVG  RL V        +IDS
Sbjct: 257 GGSPAALYSHAHSGE-VRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDS 315

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFR 360
           G ++T LP+     + +  ++ +   P      +L+LC++    +      VP +T+H  
Sbjct: 316 GASITTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLE 375

Query: 361 GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           GAD +L RSN+ F  +   ++C V         + GN  Q N  V YD+E   +SF P  
Sbjct: 376 GADWELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPAR 435

Query: 420 CTK 422
           C +
Sbjct: 436 CDR 438


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 127/304 (41%), Positives = 176/304 (57%), Gaps = 46/304 (15%)

Query: 3   TFLSCVFILFFLCFYVVSP-IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           T+   + ++  L F  + P IEA  GGF+ +LI R+S K                     
Sbjct: 2   TYPRKIHLISILLFVFIFPHIEAHNGGFTGKLIPRNSSK--------------------- 40

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               + FN+N+        Q+ +  N+ +YL+ +SIGTPP +  A ADTGSDLIW QC P
Sbjct: 41  ----DFFNRNTI-------QSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIP 89

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSN 179
           C  + CY Q +P+FD + SST+ ++ C S  C+ L   SCS   +NC+Y+ SY DGS + 
Sbjct: 90  C--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQ 147

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G LA ET+TL STTG+ VA  G+ FGCG NN G FN K  GI+GLG G +SL+SQ+ +++
Sbjct: 148 GVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSL 207

Query: 240 AGK-FSYCLVPVS-----STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
            G  FS CLVP +     S+ ++FG    V G GVVSTPL   T  ++FY +T+  ISV 
Sbjct: 208 GGNMFSQCLVPFNTNPSISSPMSFGKGSEVLGNGVVSTPLVSKTTYQSFYFVTLLGISVE 267

Query: 291 NQRL 294
           +  L
Sbjct: 268 DINL 271


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 128/362 (35%), Positives = 187/362 (51%), Gaps = 36/362 (9%)

Query: 86  PNNANY---LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           P +A Y   L+ I +GTPP + + + DTGSDL W Q EPC    C+ Q  P+FDP  SST
Sbjct: 17  PESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPC--RACFEQADPIFDPSKSST 74

Query: 143 YKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           Y  + CSSS CA L   Q   +  NC Y+  YGDGS + G  + ET+T   T G+ V   
Sbjct: 75  YNKIACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK-- 132

Query: 201 GITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTK 254
              FG    N G F ++   GI+GLG G +S+ SQ+ + +  KFSYCLV        ++ 
Sbjct: 133 ---FGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETST 189

Query: 255 INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS----------TPDI 301
           + FG   + SG  V  TP+       T+Y + +  ISVG   L +           +   
Sbjct: 190 MYFGDAAVPSGE-VQYTPIVPNADHPTYYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGT 248

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 359
           +IDSGTT+T+L Q   + L++  +S +        TG L+LC++         P +TIH 
Sbjct: 249 IIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG-LDLCFNTRGTGSPVFPAMTIHL 307

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            G  ++L  +N F+ +  +I+C  F    +  + I+GNI Q NF + YD++   + F P 
Sbjct: 308 DGVHLELPTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367

Query: 419 DC 420
           DC
Sbjct: 368 DC 369


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  202 bits (513), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 138/390 (35%), Positives = 220/390 (56%), Gaps = 33/390 (8%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           T  + R  H ++  ++S   A+   +      YL+ ++IGTPP   +A+ADTGSDL WTQ
Sbjct: 34  TELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQ 93

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQKSCSGVN--CQYSVSYGDG 175
           C+PC    C+ QD+P++DP  SST+  +PCSS+ C  +   ++CS  +  C+Y  SY DG
Sbjct: 94  CQPC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDG 151

Query: 176 SFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           ++S G L TET+T+GS+  GQ V++  + FGCGT+NGG  +  +TG VGLG G +SL++Q
Sbjct: 152 AYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQ 210

Query: 235 MRTTIAGKFSYCLVPVSSTKIN----FGTNG-IVSGPGVV-STPLTKA---KTFYVLTID 285
           +     GKFSYCL    ++ ++     GT   +  GPG V STPL ++    + Y + + 
Sbjct: 211 LG---VGKFSYCLTDFFNSTMDSPFFLGTLAELAPGPGTVQSTPLLQSPLNPSRYFVNLQ 267

Query: 286 AISVGNQRLGV--STPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
            IS+G+ RL +   T D+        ++DSGTT T L +     ++  ++ ++  QP  +
Sbjct: 268 GISLGDVRLPIPNGTFDLRADGNGGMMVDSGTTFTILAKSGFREVVDRVAQLL-GQPPVN 326

Query: 336 PTGSLELCY-SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVP 392
            +     C+ S +    +P++ +HF  GAD++L R N+     +D   C    G  ++  
Sbjct: 327 ASSLDSPCFPSPDGEPFMPDLVLHFAGGADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWS 386

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             GN  Q N  + +D+    +SF PTDC+K
Sbjct: 387 RLGNFQQQNIQMLFDMTVGQLSFLPTDCSK 416


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  201 bits (511), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 141/434 (32%), Positives = 211/434 (48%), Gaps = 58/434 (13%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-------- 81
           S+ L+ RD      Y S       LR A+   + R N   +  +   S A Q        
Sbjct: 105 SLALVRRDEVTGSTYPS-------LRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSE 157

Query: 82  ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + +   +  YL+R+S+G+PPTE+  V D+GSD++W QC+PC   +CY+Q  PLFDP
Sbjct: 158 SKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDP 215

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
             S+T+  + C S+ C  L   +C       C+Y VSY DGS++ G LA ET+TLG T  
Sbjct: 216 ATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGT-- 273

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VS 251
              A+ G+  GCG  N GLF     G++GLG G +SL+ Q+   + G FSYCL       
Sbjct: 274 ---AVEGVVIGCGHRNRGLFVG-AAGLMGLGWGPMSLVGQLGGEVGGAFSYCLASRGGYG 329

Query: 252 STKINFGTNGIVSG------PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV------ 296
           S   +     +V G       G V  PL    +A +FY + +  I VG++RL +      
Sbjct: 330 SGAADDDAGWLVLGRSEAVPEGAVWVPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQ 389

Query: 297 ----STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSL 349
                  D+V+D+GTT+T LPQ  Y +   + + ++  A P A    S  L+ CY  +  
Sbjct: 390 LTEDGAGDVVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGY 449

Query: 350 S--QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
           +  +VP V+  F G A + L+  N  ++V   I C  F   ++ + I GN  Q    +  
Sbjct: 450 ASVRVPTVSFCFDGDARLILAARNVLLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITV 509

Query: 407 DIEQQTVSFKPTDC 420
           D     + F P +C
Sbjct: 510 DSANGYIGFGPANC 523


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  201 bits (511), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 137/373 (36%), Positives = 195/373 (52%), Gaps = 44/373 (11%)

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           P    YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+  
Sbjct: 27  PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTFAV 85

Query: 146 LPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           LPC+SS   CA+    + +    G  C Y+V+YG G +++    +ET T GST      +
Sbjct: 86  LPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
           PGI FGC T + G   S  +G+VGLG G +SL+SQ+      KFSYCL P   T      
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQLGVP---KFSYCLTPYQDTNSTSTL 201

Query: 255 -------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD------- 300
                  +N GT G+ S P V S       TFY L +  IS+G   L +  PD       
Sbjct: 202 LLGPSASLN-GTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIP-PDAFSLNAD 259

Query: 301 ----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--- 351
               ++IDSGTT+T L       + + + S++   P  D +    L+LC+   S +    
Sbjct: 260 GTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPP 318

Query: 352 -VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIE 409
            +P +T+HF GAD+ L   ++ +     + C   +  T+  V I GN  Q N  + YDI 
Sbjct: 319 AMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIG 378

Query: 410 QQTVSFKPTDCTK 422
           Q+T+SF P  C+ 
Sbjct: 379 QETLSFAPAKCSA 391


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 125/342 (36%), Positives = 177/342 (51%), Gaps = 16/342 (4%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           ANY+I +  GTP   +  + DTGS++ W QC+PC  S CY Q  PLFDP +SSTY+++ C
Sbjct: 14  ANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVS-CYPQQEPLFDPTLSSTYRNISC 72

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           +S+ C  L+ + CSG  C Y V+YGDGS + G LATET TL +            FGCG 
Sbjct: 73  TSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN----VFNNFIFGCGQ 128

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGV 268
           NN GLF +   G++GLG    SL SQ+ T++   FSYCL   SS          +  PG 
Sbjct: 129 NNQGLF-TGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRTPGY 187

Query: 269 VSTPL-TKAKTFYVLTIDAISVGNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLS 322
            +    ++A T Y + +  ISVG  RL +S+        +IDSGT +T LP      L +
Sbjct: 188 TAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRT 247

Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIV 380
              + +     A     L+ CY F+  + V  P + +H+ G DV +  +  F  +S   V
Sbjct: 248 AFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVISSSQV 307

Query: 381 CSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           C  F G ++S  + I GN+ Q    V YD   + + F    C
Sbjct: 308 CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 148/461 (32%), Positives = 235/461 (50%), Gaps = 64/461 (13%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLN 66
           V I  +LC   V+   A  G   V+L H D+ K       E P + L R A+ RS  R  
Sbjct: 9   VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61

Query: 67  HFN--QNSSI---SSSKASQADIIPNNA-------NYLIRISIGTPPTERLAVADTGSDL 114
             +  +N      S ++A + +  P  A        Y++ +++GTPP    A+ DTGSDL
Sbjct: 62  ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
           IWTQC+ C  + C  Q  PLF P+MSS+Y+ + C+   C  +   SC   + C Y  SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DG+ + G  ATE  T  S++G+  ++P + FGCGT N G  N+  +GIVG G   +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237

Query: 234 QMRTTIAGKFSYCLVPVSSTK---INFGTNGIV------SGPGVVSTPLTKAK---TFYV 281
           Q+      +FSYCL P +S++   + FG+   V      +GP V +TP+ ++    TFY 
Sbjct: 238 QLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP-VQTTPILQSAQNPTFYY 293

Query: 282 LTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           +    ++VG +RL +        PD    ++IDSGT LT  P    + ++    S +   
Sbjct: 294 VAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPAAVLAEVVRAFRSQLRL- 352

Query: 332 PVADPTGSLE-LCYSFNSLS----------QVPEVTIHFRGADVKLSRSNFFVK-VSEDI 379
           P A+ +   + +C++  +++           VP +  HF+GAD+ L R N+ ++      
Sbjct: 353 PFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH 412

Query: 380 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +C +     +     GN +Q +  V YD+E++T+SF P +C
Sbjct: 413 LCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 143/445 (32%), Positives = 219/445 (49%), Gaps = 61/445 (13%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------- 73
           + A +    + L+HRD      + ++ TP Q L   L R + R       ++        
Sbjct: 61  VAASSSTLHIRLLHRDR-----FAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPV 115

Query: 74  --ISSSKASQADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             +SS++   A ++   P +  Y+ +I++GTP  E L   DT SDL W QC+PC   +CY
Sbjct: 116 AGLSSARGFVAPVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPC--RRCY 173

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATE 185
            Q  P+FDP+ S++Y+ +  +++ C +L +          C Y+V YGDGS + G+   E
Sbjct: 174 PQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEE 233

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T+T        V LP I+ GCG +N GLF +   GI+GLG G +S  +Q+     G FSY
Sbjct: 234 TLTFAG----GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQIDHN--GTFSY 287

Query: 246 CLV-----PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-G 295
           CLV     P S S+ + FG   + + P V  TP        TFY + +  ISVG  R+ G
Sbjct: 288 CLVDFLSGPGSLSSTLTFGAGAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPG 347

Query: 296 VSTPD-----------IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL 340
           V+  D           +++DSGT +T L +     +     +V   + +   +  P+G  
Sbjct: 348 VTERDLQLDPYTGRGGVIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVS-IGGPSGFF 406

Query: 341 ELCYSF--NSLSQVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGI-TNSVPIYG 395
           + CY+     + +VP V++HF G+ +VKL   N+ + V S   VC  F     +SV I G
Sbjct: 407 DTCYTVGGRGMKKVPTVSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIG 466

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           NI Q  F + YDI  + V F P  C
Sbjct: 467 NIQQQGFRIVYDIGGR-VGFAPNSC 490


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 132/370 (35%), Positives = 184/370 (49%), Gaps = 45/370 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ +++GTPP       DTGSDL+WTQC PC    C+ Q  PL DP  SSTY +LPC 
Sbjct: 91  EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAALPCG 148

Query: 150 SSQCASLNQKSCSG----------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA- 198
           + +C +L   SC G           +C Y   YGD S + G +AT+  T G   G   + 
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 199 LP--GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
           LP   +TFGCG  N G+F S  TGI G G G  SL SQ+  T    FSYC   +  +K +
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVT---TFSYCFTSMFESKSS 265

Query: 257 FGTNG-------------IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD 300
             T G              +SG  V +TPL K     + Y L++  ISVG  RL V    
Sbjct: 266 LVTLGGAPAAALLYSHAAHISGE-VRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAK 324

Query: 301 I---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLSQ--- 351
           +   +IDSG ++T LP+     + +  ++ +   P     GS L+LC++    +L +   
Sbjct: 325 LRSTIIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWRRPP 384

Query: 352 VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           VP +T+H  GAD +L R N+ F  ++  ++C V         + GN  Q N  V YD+E 
Sbjct: 385 VPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLEN 444

Query: 411 QTVSFKPTDC 420
             +SF P  C
Sbjct: 445 DWLSFAPARC 454


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 146/414 (35%), Positives = 220/414 (53%), Gaps = 37/414 (8%)

Query: 29  FSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           F  ELI+R+   SP  + + +TP +    A+ R   R     ++  ++  +  +  +   
Sbjct: 28  FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHV-LAGDQLFETPVASG 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  YLI IS G PP +  A+ DTGSDL W QC PC    CY   S  FDP  S++YK+L 
Sbjct: 87  NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPC--KSCYETLSAKFDPSKSASYKTLG 144

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S+ C  L  +SC+  +CQY   YGDGS ++G L+T+ VT+G  TG+   +P + FGCG
Sbjct: 145 CGSNFCQDLPFQSCA-ASCQYDYMYGDGSSTSGALSTDDVTIG--TGK---IPNVAFGCG 198

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN--FGTNGIVSG 265
            +N G F      +VGLG G +SL+SQ+  T   KFSYCLVP+ STK +  +  +  ++G
Sbjct: 199 NSNLGTFAGAGG-LVGLGKGPLSLVSQLGGTATKKFSYCLVPLGSTKTSPLYIGDSTLAG 257

Query: 266 PGVVSTPL---TKAKTFYVLTIDAISVGNQRLG--VSTPDI--------VIDSGTTLTFL 312
            GV  TP+       TFY   +  ISV  + +    +T DI        ++DSGTTLT+L
Sbjct: 258 -GVAYTPMLTNNNYPTFYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYL 316

Query: 313 P-QGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPEVTIHFRGADVKL 366
               +N     +++++  A P  +  GS   LE C+S   ++    P V  HF GADV L
Sbjct: 317 DVDAFN----PMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFNGADVAL 372

Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  N F+ +  +    +    +    I+GNI Q N ++ +D+  + + FK  +C
Sbjct: 373 APDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIGFKSANC 426


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 142/429 (33%), Positives = 208/429 (48%), Gaps = 40/429 (9%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
           I +   G +V L HR  P SP  +S + P +   + L R   R  H              
Sbjct: 45  ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102

Query: 70  ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
              Q S +SSS  ++     +   Y+I + +GTP   +    DTGSD+ W QC PCP   
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
           C+ Q   LFDP  SSTY+++ C++++CA L Q+   C   N  CQY V YGDGS +NG  
Sbjct: 163 CHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           + +T+TL   +G + A+ G  FGC     G F+ +T G++GLGGG  SL+SQ        
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP 299
           FSYCL P S +       G     G V+T + ++K   TFY   +  I+VG ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLS-P 337

Query: 300 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 352
            +     V+DSGT +T LP    S L S   + ++    A     L+ C+ F   +Q  +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           P V + F  GA + L  +        + +     G   +  I GN+ Q  F V YD+   
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454

Query: 412 TVSFKPTDC 420
           T+ F+   C
Sbjct: 455 TLGFRSGAC 463


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 148/461 (32%), Positives = 235/461 (50%), Gaps = 64/461 (13%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLN 66
           V I  +LC   V+   A  G   V+L H D+ K       E P + L R A+ RS  R  
Sbjct: 9   VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61

Query: 67  HFN--QNSSI---SSSKASQADIIPNNA-------NYLIRISIGTPPTERLAVADTGSDL 114
             +  +N      S ++A + +  P  A        Y++ +++GTPP    A+ DTGSDL
Sbjct: 62  ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
           IWTQC+ C  + C  Q  PLF P+MSS+Y+ + C+   C  +   SC   + C Y  SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DG+ + G  ATE  T  S++G+  ++P + FGCGT N G  N+  +GIVG G   +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237

Query: 234 QMRTTIAGKFSYCLVPVSSTK---INFGTNGIV------SGPGVVSTPLTKAK---TFYV 281
           Q+      +FSYCL P +S++   + FG+   V      +GP V +TP+ ++    TFY 
Sbjct: 238 QLSIR---RFSYCLTPYASSRKSTLQFGSLADVGLYDDATGP-VQTTPILQSAQNPTFYY 293

Query: 282 LTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           +    ++VG +RL +        PD    ++IDSGT LT  P    + ++    S +   
Sbjct: 294 VAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSGTALTLFPVAVLAEVVRAFRSQLRL- 352

Query: 332 PVADPTGSLE-LCYSFNSLS----------QVPEVTIHFRGADVKLSRSNFFVK-VSEDI 379
           P A+ +   + +C++  +++           VP +  HF+GAD+ L R N+ ++      
Sbjct: 353 PFANGSSPDDGVCFAAPAVAAGGGRMARQVAVPRMVFHFQGADLDLPRENYVLEDHRRGH 412

Query: 380 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +C +     +     GN +Q +  V YD+E++T+SF P +C
Sbjct: 413 LCVLLGDSGDDGATIGNFVQQDMRVVYDLERETLSFAPVEC 453


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 135/356 (37%), Positives = 190/356 (53%), Gaps = 43/356 (12%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           IGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SSTY ++PCSS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230

Query: 157 NQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
               C S   C Y+ +YGD S + G LATET TL  +      LPG+ FGCG  N G   
Sbjct: 231 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFGCGDTNEGDGF 285

Query: 216 SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG--------PG 267
           S+  G+VGLG G +SL+SQ+      KFSYCL  +  T  +    G ++G          
Sbjct: 286 SQGAGLVGLGRGPLSLVSQLGLD---KFSYCLTSLDDTNNSPLLLGSLAGISEASAAASS 342

Query: 268 VVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLP- 313
           V +TPL K     +FY +++ AI+VG+ R+ +           T  +++DSGT++T+L  
Sbjct: 343 VQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGTSITYLEV 402

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----QVPEVTIHFR-GADVKLS 367
           QGY +   +  + M  A P AD +G  L+LC+   +      +VP +  HF  GAD+ L 
Sbjct: 403 QGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFDGGADLDLP 460

Query: 368 RSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             N+ V       +C    G +  + I GN  Q NF   YD+   T+SF P  C K
Sbjct: 461 AENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVGHDTLSFAPVQCNK 515


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 135/421 (32%), Positives = 206/421 (48%), Gaps = 42/421 (9%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL----NHFNQNSSISSSKASQADI 84
           + ++L+HRD  K P +N+S     R    + R   R+     H        + +A  +D+
Sbjct: 66  YKLKLVHRD--KVPTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDV 123

Query: 85  I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +RI +G+PP  +  V D+GSD+IW QCEPC  +QCY Q  P+F+P  S
Sbjct: 124 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 181

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y  + C+S+ C+ ++   C    C+Y VSYGDGS++ G LA ET+T G T  + VA+ 
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAI- 240

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINF 257
               GCG +N G+F     G++GLG G +S + Q+     G FSYCLV     SS  + F
Sbjct: 241 ----GCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGGTFSYCLVSRGIQSSGLLQF 295

Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
           G   +  G   V  PL    +A++FY + +  + VG  R+ +S             +V+D
Sbjct: 296 GREAVPVGAAWV--PLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMD 353

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGA 362
           +GT +T LP            +     P A      + CY  F  +S +VP V+ +F G 
Sbjct: 354 TGTAVTRLPTAAYEAFRDAFIAQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFSGG 413

Query: 363 DV-KLSRSNFFVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            +  L   NF + V +D+   C  F   ++ + I GNI Q    +  D     V F P  
Sbjct: 414 PILTLPARNFLIPV-DDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPNV 472

Query: 420 C 420
           C
Sbjct: 473 C 473


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 140/423 (33%), Positives = 207/423 (48%), Gaps = 42/423 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
           GF ++L H D+       +S T  Q L  A+ RS  R+      +     +    A++  
Sbjct: 28  GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+TY
Sbjct: 82  VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           ++LPC SS+CASL+  SC    C Y   YGD + + G LA ET T G+     V    I 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFG-- 258
           FGCG+ N G   + ++G+VG G G +SL+SQ+  +   +FSYCL   +  + +++ FG  
Sbjct: 200 FGCGSLNAGDL-ANSSGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVY 255

Query: 259 ----TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPDI 301
               +    SG  V STP          Y L++ AIS+G + L +           T  +
Sbjct: 256 ANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGV 315

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVTI 357
           +IDSGT++T+L Q     +   + S I    + D    L+ C+ +    N    VP++  
Sbjct: 316 IIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVF 375

Query: 358 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           HF  A++ L   N+ +  S      +    T    I GN  Q N  + YDI    +SF P
Sbjct: 376 HFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVP 435

Query: 418 TDC 420
             C
Sbjct: 436 APC 438


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  198 bits (504), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 145/455 (31%), Positives = 207/455 (45%), Gaps = 61/455 (13%)

Query: 20  SPIEAQTG----GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNS 72
           +P E + G    G  + ++HR  P SP  ++   P     D L    NR   + H    +
Sbjct: 72  APREHKHGATSSGTRMTIVHRHGPCSPLADAHGKPPSH-EDILAADQNRAESIQHRVSTT 130

Query: 73  SISSSKASQADIIPNN-------------------------------ANYLIRISIGTPP 101
           +       ++   P+                                 NY++ + +GTP 
Sbjct: 131 ATGRGNPKRSRRAPSRRQQPSSAPAPAASLSSSTASLPASSGRALGTGNYVVTVGLGTPA 190

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
           +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++ C++  C+ L+ + C
Sbjct: 191 SRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANISCAAPACSDLDTRGC 249

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
           SG NC Y V YGDGS+S G  A +T+TL S      A+ G  FGCG  N GLF  +  G+
Sbjct: 250 SGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFG-EAAGL 304

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL--TKAK 277
           +GLG G  SL  Q      G F++CL   SS    ++FG     +    ++TP+      
Sbjct: 305 LGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAAGARLTTPMLTDNGP 364

Query: 278 TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ- 331
           TFY + +  I VG Q L +     +T   ++DSGT +T LP    S+L S  +S + A+ 
Sbjct: 365 TFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARG 424

Query: 332 -PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI 387
              A     L+ CY F  +SQV  P V++ F+ GA + +  S      S   VC  F   
Sbjct: 425 YKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASVSQVCLGFAAN 484

Query: 388 TNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +   V I GN     F V YDI ++ V F P  C
Sbjct: 485 EDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 184/365 (50%), Gaps = 27/365 (7%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A +      YL  + +GTP      + DTGSDL W QC PC   +CY Q+  LF P  S+
Sbjct: 4   APVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GKCYSQNDALFLPNTST 61

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           ++  L C S+ C  L    C+   C Y  SYGDGS + G+   +T+T+    GQ   +P 
Sbjct: 62  SFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPN 121

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKIN 256
             FGCG +N G F +   GI+GLG G +S  SQ+++   GKFSYCLV     P  ++ + 
Sbjct: 122 FAFGCGHDNEGSF-AGADGILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLL 180

Query: 257 FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP--DI--------VI 303
           FG   +   P V   P+    K  T+Y + ++ ISVG+  L +S+   DI        + 
Sbjct: 181 FGDAAVPILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIF 240

Query: 304 DSGTTLTFLPQGYNSNLLSVM--SSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIH 358
           DSGTT+T L +     +L+ M  S+M  ++ + D    L+LC S    + L  VP +T H
Sbjct: 241 DSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLDLCLSGFPKDQLPTVPAMTFH 299

Query: 359 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           F G D+ L  SN+F+ +            +  V I G++ Q NF V YD   + + F P 
Sbjct: 300 FEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFVPK 359

Query: 419 DCTKQ 423
           DC  +
Sbjct: 360 DCVGR 364


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  197 bits (502), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 127/358 (35%), Positives = 185/358 (51%), Gaps = 27/358 (7%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             YL  + +GTP      + DTGSDL W QC PC    CY Q+  LF P  S+++  L C
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GTCYSQNDSLFIPNTSTSFTKLAC 58

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
            +  C  L    C+   C Y  SYGDGS S G+   +T+T+    GQ   +P   FGCG 
Sbjct: 59  GTELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGIV 263
           +N G F +   GI+GLG G +S  SQ++T   GKFSYCLV     P  ++ + FG   + 
Sbjct: 119 DNEGSF-AGADGILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLLFGDAAVP 177

Query: 264 SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP--DI--------VIDSGTTLT 310
           + PGV    L    K  T+Y + ++ ISVG + L +S+   DI        + DSGTT+T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237

Query: 311 FLPQGYNSNLLSVM-SSMIEAQPVADPTGSLELC---YSFNSLSQVPEVTIHFRGADVKL 366
            L    +  +L+ M +S ++    +D +  L+LC   ++   L  VP +T HF G D++L
Sbjct: 238 QLAGEVHQEVLAAMNASTMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDMEL 297

Query: 367 SRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
             SN+F+ + E      F  +++  V I G+I Q NF V YD   + + F P  C  +
Sbjct: 298 PPSNYFIFL-ESSQSYCFSMVSSPDVTIIGSIQQQNFQVYYDTVGRKIGFVPKSCVGR 354


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 204/422 (48%), Gaps = 45/422 (10%)

Query: 31  VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
           + L HR  P +P   +S   +P   L D L     R  +  +  S +++ A       S+
Sbjct: 67  LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 125

Query: 82  ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           A  +P N         Y++ +S+GTP   +    DTGSD+ W QC+PCP   CY Q  PL
Sbjct: 126 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 185

Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           FDP  SS+Y ++PC+++ C+  +L    CSG  C Y VSYGDGS + G  +++T+TL  +
Sbjct: 186 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 245

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
                AL G  FGCG    GLF +   G++GLG    SL+SQ  +T  G FSYCL P  +
Sbjct: 246 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN 300

Query: 253 TKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
           +       G  S  G  +TPL  A    T+Y++ +  ISVG Q L +         V+D+
Sbjct: 301 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDT 360

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-R 360
           GT +T LP    S L S   + +     P A  TG L+ CY F     V  P ++I F  
Sbjct: 361 GTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGG 420

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           GA + L  S           C  F   G  +   I GN+ Q +F V +D    TV F P 
Sbjct: 421 GAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPA 473

Query: 419 DC 420
            C
Sbjct: 474 SC 475


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 145/412 (35%), Positives = 221/412 (53%), Gaps = 47/412 (11%)

Query: 45  NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPT 102
           + S T  Q +R AL R ++R N     +S SS     A + P      +L+ ++IGTPP 
Sbjct: 38  DPSVTASQFVRAALHRDMHRHNARKLAAS-SSDGTVSAPVSPTTVPGEFLMTLAIGTPPL 96

Query: 103 ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS 162
             LA+ADTGSDLIWTQC PC   QC+ Q +PL++P  S+T+ +LPC+SS    L   +C+
Sbjct: 97  PFLAIADTGSDLIWTQCAPC-SRQCFQQPTPLYNPSSSTTFSALPCNSS--LGLCAPACA 153

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGI 221
              C Y+++YG G ++     TET T GS+T    V +PGI FGC   + G   S  +G+
Sbjct: 154 ---CMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGL 209

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFGTNGIVSGPGVV-STPLTKA 276
           VGLG G +SL+SQ+    A KFSYCL P     S++ +  G +  ++  GVV STP   +
Sbjct: 210 VGLGRGSLSLVSQLG---APKFSYCLTPYQDTNSTSTLLLGPSASLNDTGVVSSTPFVAS 266

Query: 277 KT--FYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVM 324
            +  +Y L +  IS+G   L +           T  ++IDSGTT+T L       + + +
Sbjct: 267 PSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSGTTITMLGNTAYQQVRAAV 326

Query: 325 SSMIEAQPVADPTGS--LELCYSFNSLS----QVPEVTIHFRGADVKLSRSNFFVKVSED 378
            S++   P  D + +  L+LC+   S +     +P +T+HF GAD+ L   N+ + +S+ 
Sbjct: 327 LSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPSMTLHFDGADMVLPADNYMMSLSDP 385

Query: 379 IV-----CSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                  C   +  T++    V I GN  Q N  + YD+ ++T+SF P  C+
Sbjct: 386 DSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGKETLSFAPAKCS 437


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  197 bits (501), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 142/429 (33%), Positives = 208/429 (48%), Gaps = 40/429 (9%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
           I +   G +V L HR  P SP  +S + P +   + L R   R  H              
Sbjct: 45  ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102

Query: 70  ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
              Q S +SSS  ++     +   Y+I + +GTP   +    DTGSD+ W QC PCP   
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
           CY Q   LFDP  SSTY+++ C++++CA L Q+   C   N  CQY V YGDGS +NG  
Sbjct: 163 CYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           + +T+TL   +G + A+ G  FGC     G F+ +T G++GLGGG  SL+SQ        
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP 299
           FSYCL P S +       G     G V+T + +++   TFY   +  I+VG ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLS-P 337

Query: 300 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 352
            +     V+DSGT +T LP    S L S   + ++    A     L+ C+ F   +Q  +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           P V + F  GA + L  +        + +     G   +  I GN+ Q  F V YD+   
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454

Query: 412 TVSFKPTDC 420
           T+ F+   C
Sbjct: 455 TLGFRSGAC 463


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  197 bits (501), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 145/422 (34%), Positives = 205/422 (48%), Gaps = 33/422 (7%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           EA   G  + L H     SP    + + +   +  +  R  +RLN     ++ + S  S 
Sbjct: 65  EALKPGVKIRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSN 124

Query: 82  ADIIPNN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
             + P +     NY++    GTP    L + DTGSD+ W QC+PC  S CY Q  P+F+P
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPC--SDCYSQVDPIFEP 182

Query: 138 KMSSTYKSLPCSSSQCASL-NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           + SS+YK L C SS C  L     C    C Y ++YGDGS S G+ + ET+TLGS +   
Sbjct: 183 QQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--- 239

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-VSSTKI 255
              P   FGCG  N GLF   + G++GLG   +S  SQ ++   G+FSYCL   VSST  
Sbjct: 240 --FPSFAFGCGHTNTGLFKG-SAGLLGLGRTALSFPSQTKSKYGGQFSYCLPDFVSSTST 296

Query: 256 NFGTNGIVSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSG 306
              + G  S P   +  PL   +   +FY + ++ ISVG +RL +    +     ++DSG
Sbjct: 297 GSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVDSG 356

Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GA 362
           T +T L PQ Y++ L +   S     P A P   L+ CY  +S SQV  P +T HF+  A
Sbjct: 357 TVITRLVPQAYDA-LKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNNA 415

Query: 363 DVKLSRSNFFVKVSED--IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPT 418
           DV +S       +  D   VC  F   + S+   I GN  Q    V +D     + F P 
Sbjct: 416 DVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPG 475

Query: 419 DC 420
            C
Sbjct: 476 SC 477


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 146/422 (34%), Positives = 204/422 (48%), Gaps = 45/422 (10%)

Query: 31  VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
           + L HR  P +P   +S   +P   L D L     R  +  +  S +++ A       S+
Sbjct: 56  LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 114

Query: 82  ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           A  +P N         Y++ +S+GTP   +    DTGSD+ W QC+PCP   CY Q  PL
Sbjct: 115 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 174

Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           FDP  SS+Y ++PC+++ C+  +L    CSG  C Y VSYGDGS + G  +++T+TL  +
Sbjct: 175 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 234

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
                AL G  FGCG    GLF +   G++GLG    SL+SQ  +T  G FSYCL P  +
Sbjct: 235 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGGVFSYCLPPTQN 289

Query: 253 TKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
           +       G  S  G  +TPL  A    T+Y++ +  ISVG Q L +         V+D+
Sbjct: 290 SVGYISLGGPSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVFASGAVVDT 349

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-R 360
           GT +T LP    S L S   + +     P A  TG L+ CY F     V  P ++I F  
Sbjct: 350 GTVVTRLPPTAYSALRSAFRAAMAPYGYPSAPATGILDTCYDFTRYGTVTLPTISIAFGG 409

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           GA + L  S           C  F   G  +   I GN+ Q +F V +D    TV F P 
Sbjct: 410 GAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFD--GSTVGFMPA 462

Query: 419 DC 420
            C
Sbjct: 463 SC 464


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 135/404 (33%), Positives = 188/404 (46%), Gaps = 99/404 (24%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GFS++LIHRDSP SPFYN S TP +R+ DA   S       N+N      K  ++ +IPN
Sbjct: 28  GFSIDLIHRDSPLSPFYNPSLTPSERITDAALSS-------NEN------KLPESILIPN 74

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  YL+R+ IGTPP ERL +ADTGSD IW QC                           P
Sbjct: 75  NGEYLMRLYIGTPPVERLVIADTGSDFIWVQCS--------------------------P 108

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG-QAVALPGITFGC 206
           C + QC  LN              Y + SF+   + TET++  ST G Q V+ P   FGC
Sbjct: 109 CQNCQCVYLN-------------IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGC 155

Query: 207 GTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
           G NN   F S  K TG+VGL  G +SL+SQ+   I  KFSY         + FG+  I++
Sbjct: 156 GANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSY---------LKFGSEAIIT 206

Query: 265 GPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
             GVVSTPL    +   Y L ++ +++G + +   T                        
Sbjct: 207 TNGVVSTPLIIKPSLPLYFLNLEVVTIGQKVVPTET------------------------ 242

Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED--IV 380
                +  + V D     + C+ +     VP +   F GA V L   N  +K+ +   + 
Sbjct: 243 -----LGVESVQDLPFPFKFCFPYRDNMTVPAIAFQFTGASVALRPKNLLIKLQDRNMLX 297

Query: 381 CSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            +V    +  + + I+G I Q +F V YD++ + VS  PTDCTK
Sbjct: 298 LAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDCTK 341


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  197 bits (500), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 144/417 (34%), Positives = 200/417 (47%), Gaps = 32/417 (7%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
            S+E++HR  P     N  +        + L +  +R++  +   S       +   +P 
Sbjct: 63  LSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPV 122

Query: 87  ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
                  + +Y + + +GTP  E   + DTGSDL WTQCEPC  + CY Q  P  DP  S
Sbjct: 123 QSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKT-CYKQKEPRLDPTKS 181

Query: 141 STYKSLPCSSSQCASLNQ---KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++YK++ CSS+ C  L+    +SCS   C Y V YGDGS+S G  ATET+TL S+     
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN---- 237

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
                 FGCG  N GLF     G++GLG   +SL SQ        FSYCL   SS+K   
Sbjct: 238 VFKNFLFGCGQQNSGLFRG-AAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYL 296

Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTL 309
              G VS   V  TPL+   K+  FY L I  +SVG  +L +     ST   VIDSGT +
Sbjct: 297 SFGGQVS-KTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVI 355

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGA-DVKL 366
           T LP    S L S    ++   P  D     + CY F  N   ++P+V + F+G  ++ +
Sbjct: 356 TRLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDI 415

Query: 367 SRSNFFVKVSE-DIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             S     V+    VC  F G  + V   I+GN  Q  + V YD  +  V F P+ C
Sbjct: 416 DVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 147/419 (35%), Positives = 198/419 (47%), Gaps = 43/419 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  H  +  S      +   KA+ A +
Sbjct: 66  LRLTHRHGPCAPLRASSLAA-PSVADTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATV 124

Query: 85  IPN------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
             N       +NY++  S+GTP   +    DTGSDL W QC+PC    CY Q  PLFDP 
Sbjct: 125 PANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPA 184

Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            SS+Y ++PC  S CA L     +CS   C Y VSYGDGS + G  +++T+TL +     
Sbjct: 185 QSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAAN---- 240

Query: 197 VALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
             + G  FGCG   +GGLF +   G++G G    SL+ Q      G FSYCL P  S+  
Sbjct: 241 ATVQGFLFGCGHAQSGGLF-TGIDGLLGFGREQPSLVQQTAGAYGGVFSYCL-PTKSSTT 298

Query: 256 NFGTNGIVSG--PGVVST---PLTKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
            + T G  SG  PG  +T   P   A T+YV+ +  ISVG Q L V         V+D+G
Sbjct: 299 GYLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTG 358

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGAD 363
           T +T LP    + L S   S + + P A P G L+ CYSF     V    V + F  GA 
Sbjct: 359 TVITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGAT 418

Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + L              C  F   G   S+ I GN+ Q +F V   I+  +V F+P+ C
Sbjct: 419 MTLGADGIM-----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F        + 
Sbjct: 37  KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 76

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 77  EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 134

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 135 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 194

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++      +FSYC+  +     
Sbjct: 195 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 248

Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
           ++  N +V G GV     STP      FY +T++ ISVG  RL ++ P+           
Sbjct: 249 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 307

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEV 355
           +V+DSGTT TFL +     L + +  ++    Q V   T    LCY       L   PE+
Sbjct: 308 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 367

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
             HF  GAD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + 
Sbjct: 368 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 427

Query: 413 VSFKPTDC 420
           V F+ TDC
Sbjct: 428 VYFQRTDC 435


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 152/437 (34%), Positives = 210/437 (48%), Gaps = 45/437 (10%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLNR-------L 65
           V+SP  A T   S+ + HR    S   N   T        RL  A   S++         
Sbjct: 51  VLSP-RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLTT 109

Query: 66  NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           NH +Q+ S        + +   + NY++ + +GTP  +   + DTGSDL WTQC+PC  +
Sbjct: 110 NHVSQSQSTDLPAKDGSTL--GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT 167

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNG 180
            CY Q  P+F+P  S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G
Sbjct: 168 -CYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 226

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            LA +  TL S+        G+ FGCG NN GLF +   G++GLG   +S  SQ  T   
Sbjct: 227 FLAKDKFTLTSSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYN 281

Query: 241 GKFSYCLVPVSST---KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL 294
             FSYCL P S++    + FG+ GI     V  TP   +T   +FY L I AI+VG Q+L
Sbjct: 282 KIFSYCL-PSSASYTGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKL 338

Query: 295 GV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
            +     STP  +IDSGT +T LP    + L S   + +   P       L+ C+  +  
Sbjct: 339 PIPSTVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGF 398

Query: 350 SQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLV 404
             V  P+V   F  GA V+L     F       VC  F G ++  +  I+GN+ Q    V
Sbjct: 399 KTVTIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEV 458

Query: 405 GYDIEQQTVSFKPTDCT 421
            YD     V F P  C+
Sbjct: 459 VYDGAGGRVGFAPNGCS 475


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 136/360 (37%), Positives = 196/360 (54%), Gaps = 41/360 (11%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y+++IS+GTPP +  A+ DTGSDL W QC PC  ++C+ Q  PLF P  SS+Y +  C
Sbjct: 6   GEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPC--ARCFEQPDPLFIPLASSSYSNASC 63

Query: 149 SSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           + S C +L + +CS  N C YS SYGDGS + G+ A ETVTL  +T     L  I FGCG
Sbjct: 64  TDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST-----LARIGFGCG 118

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST----KINFGTNGIV 263
            N  G F +   G++GLG G +SL SQ+ ++    FSYCLV  S+T     I FG     
Sbjct: 119 HNQEGTF-AGADGLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTFSPITFGNAAEN 177

Query: 264 SGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTP------------DIVIDSGTT 308
           S      TPL + +   ++Y + +++ISVGN+R  V TP             +++DSGTT
Sbjct: 178 SRASF--TPLLQNEDNPSYYYVGVESISVGNRR--VPTPPSAFRIDANGVGGVILDSGTT 233

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQ----VPEVTIHFRGAD 363
           +T+        +L+ +   I + P ADPT   L LCY  +S+S     +P +T+H    D
Sbjct: 234 ITYWRLAAFIPILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSMTVHLTNVD 292

Query: 364 VKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            ++  SN +V V    + VC+     ++   I GN+ Q N L+  D+    V F  TDC+
Sbjct: 293 FEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRVGFLATDCS 351


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 130/388 (33%), Positives = 214/388 (55%), Gaps = 48/388 (12%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +SS A  A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC    C+ QD+P+
Sbjct: 79  TSSNAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPC--KLCFPQDTPI 136

Query: 135 FDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTL 189
           +D   S+++  +PC+S+ C  +  + ++C+      C+Y  +Y DG++S G L TET+T 
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTF 196

Query: 190 GSTT----GQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
             ++    G  V++ G+ FGCG +NGGL +NS  TG VGLG G +SL++Q+     GKFS
Sbjct: 197 AGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLGV---GKFS 251

Query: 245 YCLVPVSSTKIN----FGTNGIVSGP------GVVSTPLTKA---KTFYVLTIDAISVGN 291
           YCL    +T +     FG+   ++ P       V STPL +     + Y ++++ IS+G+
Sbjct: 252 YCLTDFFNTSLGSPVLFGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGD 311

Query: 292 QRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
            RL +           +  +++DSGT  T L +     +++ ++ ++  QPV + +    
Sbjct: 312 ARLPIPNGTFDLRDDGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLN-QPVVNASSLDS 370

Query: 342 LCYSFNS----LSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIY 394
            C+   +    L  +P++ +HF  GAD++L R N+  F + S     ++    +    I 
Sbjct: 371 PCFPATAGEQQLPDMPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSIL 430

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           GN  Q N  + +DI    +SF PTDC+K
Sbjct: 431 GNFQQQNIQMLFDITVGQLSFVPTDCSK 458


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  196 bits (497), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 143/432 (33%), Positives = 205/432 (47%), Gaps = 59/432 (13%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA 89
           S+E+IH+  P S           +L     RS +R    +Q+ S  +S  S+    P + 
Sbjct: 67  SLEVIHKHGPCS-----------KLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADG 115

Query: 90  ---------------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
                                NY++ + +GTP  +   + DTGSDL WTQCEPC    CY
Sbjct: 116 GKLKGSKVTLPSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCY 174

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLA 183
            Q  P+F+P  S++Y ++ CSS  C  L     N  SCS   C Y + YGD S+S G  A
Sbjct: 175 HQQEPIFNPSKSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFA 234

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
            + + L ST           FGCG NN GLF     G++GLG   +SL+SQ        F
Sbjct: 235 QDKLALTSTD----VFNNFLFGCGQNNRGLFVG-VAGLIGLGRNALSLVSQTAQKYGKLF 289

Query: 244 SYCLVPVSSTK--INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-- 296
           SYCL   SS+   + FG+ G  S   V  TP    ++  +FY L + AISVG ++L    
Sbjct: 290 SYCLPSTSSSTGYLTFGSGGGTS-KAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSA 348

Query: 297 ---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 351
              ST   +IDSGT ++ LP    S+L +     +   P A P   L+ CY F+      
Sbjct: 349 SVFSTAGTIIDSGTVISRLPPTAYSDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVD 408

Query: 352 VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDI 408
           VP++ ++F  GA++ L  S  F  ++   VC  F G +++  + I GN+ Q  F V YD+
Sbjct: 409 VPKINLYFSDGAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDV 468

Query: 409 EQQTVSFKPTDC 420
               + F P  C
Sbjct: 469 AGGRIGFAPGGC 480


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 144/446 (32%), Positives = 207/446 (46%), Gaps = 56/446 (12%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A + G  + ++HR  P SP  ++   P     + L    NR+   +   S +++   +  
Sbjct: 83  ASSSGTRMTIVHRHGPCSPLADAHGKPPSH-DEILAADQNRVESIHHRVSTTATVRGKPK 141

Query: 84  IIPN---------------------------------NANYLIRISIGTPPTERLAVADT 110
             P+                                   NY++ I +GTP +    V DT
Sbjct: 142 RRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTGNYVVTIGLGTPASRYTVVFDT 201

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSV 170
           GSD  W QC+PC    CY Q   LFDP  SSTY ++ C++  C+ L  + CSG +C YSV
Sbjct: 202 GSDTTWVQCQPCV-VVCYKQQEKLFDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSV 260

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
            YGDGS+S G  A +T+TL S      A+ G  FGCG  N GLF  +  G++GLG G  S
Sbjct: 261 QYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFG-EAAGLLGLGRGKTS 315

Query: 231 LISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDA 286
           L  Q      G F++CL   SS    ++FG     +     +TP+      TFY + +  
Sbjct: 316 LPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTG 375

Query: 287 ISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGS 339
           I VG Q L +     ST   ++DSGT +T LP    S+L S  +S + A+    A     
Sbjct: 376 IRVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSL 435

Query: 340 LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG--ITNSVPIY 394
           L+ CY F  +S+V  P+V++ F+ GA + ++ S      S   VC  F      + V I 
Sbjct: 436 LDTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIV 495

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
           GN     F V YDI ++TV F P  C
Sbjct: 496 GNTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  195 bits (496), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F        + 
Sbjct: 5   KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 44

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 45  EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++      +FSYC+  +     
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 216

Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
           ++  N +V G GV     STP      FY +T++ ISVG  RL ++ P+           
Sbjct: 217 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 275

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEV 355
           +V+DSGTT TFL +     L + +  ++    Q V   T    LCY       L   PE+
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 335

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
             HF  GAD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + 
Sbjct: 336 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 395

Query: 413 VSFKPTDC 420
           V F+ TDC
Sbjct: 396 VYFQRTDC 403


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  195 bits (495), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 144/428 (33%), Positives = 219/428 (51%), Gaps = 56/428 (13%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F  +       
Sbjct: 5   KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAFIXDEI----- 46

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 47  --QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++      +FSYC+  +     
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRL----GSRFSYCIGDLFDP-- 216

Query: 256 NFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
           ++  N +V G GV     STP      FY +T++ ISVG  RL ++ P+           
Sbjct: 217 HYTHNQLVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDIN-PEVFQRTESGQGG 275

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEV 355
           +V+DSGTT TFL +     L + +  ++    Q V   T    LCY       L   PE+
Sbjct: 276 VVMDSGTTATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPEL 335

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQT 412
             HF  GAD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + 
Sbjct: 336 AFHFAEGADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKR 395

Query: 413 VSFKPTDC 420
           V F+ TDC
Sbjct: 396 VYFQRTDC 403


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  195 bits (495), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 153/414 (36%), Positives = 213/414 (51%), Gaps = 30/414 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G +V L HR  P SP  +    T  +RLR    R+      F+    I  S A+      
Sbjct: 54  GVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPTTL 113

Query: 87  NNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
             +     Y+I + IG+P   +    DTGSD+ W QC+PC  SQC+ +   LFDP  SST
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLFDPSSSST 171

Query: 143 YKSLPCSSSQCASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           Y    CSS+ CA L+Q      C    CQY V+YGD S + G  +++T+TLGS+     A
Sbjct: 172 YSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSS-----A 226

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           +    FGC  +  G FN +T G++GLGGG  SL SQ   T    FSYCL P S +   F 
Sbjct: 227 MTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSS-GFL 285

Query: 259 TNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVST----PDIVIDSGTTLTF 311
           T G  S  G V TP+   T+  T+YV+ +++I VG+Q+L + T       ++DSGT +T 
Sbjct: 286 TLGTGSS-GFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITR 344

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSR 368
           LP    S L S   + ++  P A P+G L+ C+ F+  S   +P VT+ F  GA V L+ 
Sbjct: 345 LPPTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAF 404

Query: 369 SNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               +++S  I C  F   G  +S+ I GN+ Q  F V YD+    V FK   C
Sbjct: 405 DGIMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST   +   G  S P
Sbjct: 290 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PARSTGTGYLDFGAGSPP 347

Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
              +TP+      TFY + +  I VG + L +     +    ++DSGT +T LP    S+
Sbjct: 348 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 407

Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
           L S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     
Sbjct: 408 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 467

Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           VS   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 468 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  194 bits (494), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 147/423 (34%), Positives = 206/423 (48%), Gaps = 40/423 (9%)

Query: 30  SVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISSSKA---- 79
           S+ + HR    S   N   T        RL  A   S++ +L+       +S SK+    
Sbjct: 33  SLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLP 92

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           ++      + NY++ + +GTP  +   + DTGSDL WTQC+PC  + CY Q  P+F+P  
Sbjct: 93  AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSK 151

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G LA E  TL ++  
Sbjct: 152 STSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD- 210

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST- 253
                 G+ FGCG NN GLF +   G++GLG   +S  SQ  T     FSYCL P S++ 
Sbjct: 211 ---VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL-PSSASY 265

Query: 254 --KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV-----STPDIVI 303
              + FG+ GI     V  TP   +T   +FY L I AI+VG Q+L +     STP  +I
Sbjct: 266 TGHLTFGSAGI--SRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALI 323

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR- 360
           DSGT +T LP    + L S   + +   P       L+ C+  +    V  P+V   F  
Sbjct: 324 DSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSG 383

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           GA V+L     F       VC  F G ++  +  I+GN+ Q    V YD     V F P 
Sbjct: 384 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 443

Query: 419 DCT 421
            C+
Sbjct: 444 GCS 446


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 179 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 237

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 238 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 293

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST   +   G  S P
Sbjct: 294 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PARSTGTGYLDFGAGSPP 351

Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
              +TP+      TFY + +  I VG + L +     +    ++DSGT +T LP    S+
Sbjct: 352 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 411

Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
           L S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     
Sbjct: 412 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 471

Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           VS   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 472 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 141/417 (33%), Positives = 199/417 (47%), Gaps = 36/417 (8%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN----SSISSSK---------A 79
           ++HR  P SP  ++ +       + L    NR     +     +++S  K         A
Sbjct: 91  IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           S    +    NY++ I +GTP      V DTGSD  W QCEPC    CY Q   LFDP  
Sbjct: 151 SSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYKQQEKLFDPAR 208

Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           SSTY ++ C++  C+ L  K CSG +C Y V YGDGS+S G  A +T+TL S      A+
Sbjct: 209 SSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AI 264

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INF 257
            G  FGCG  N GL+  +  G++GLG G  SL  Q      G F++C    SS    ++F
Sbjct: 265 KGFRFGCGERNEGLYG-EAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDF 323

Query: 258 GTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLT 310
           G   + +    ++TP+      TFY + +  I VG + L +     +T   ++DSGT +T
Sbjct: 324 GPGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVIT 383

Query: 311 FLPQGYNSNLLSVM-SSMIEAQPVADPTGS-LELCYSFNSLSQV--PEVTIHFR-GADVK 365
            LP    S+L S   S+M E      P  S L+ CY F  +S+V  P V++ F+ GA + 
Sbjct: 384 RLPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLD 443

Query: 366 LSRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  S      S    C  F G    + V I GN     F V YDI ++ V F P  C
Sbjct: 444 VHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  194 bits (493), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 142/424 (33%), Positives = 212/424 (50%), Gaps = 44/424 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNRLNHFNQNSSISSSKA 79
           G    L H  SP SP   SS+ P+         R+    +R   +   +   SS+  +  
Sbjct: 41  GLHQTLHHPQSPCSPAPLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASG 100

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +   +     NY+ R+ +GTP T  + V D+GS L W QC PC  S C+ Q  PL+DP+ 
Sbjct: 101 ASVGV----GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVS-CHPQAGPLYDPRA 155

Query: 140 SSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
           SSTY ++PCS+ QC     A+LN  SCSG   CQY  SYGDGSFS G L+ +TV+L S+ 
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPV 250
               + PG  +GCG +N GLF  +  G++GL    +SL+SQ+  ++   F+YCL      
Sbjct: 216 ----SFPGFYYGCGQDNVGLFG-RAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPTSAAA 270

Query: 251 SSTKINFGTNGIVSGPG------VVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---- 300
           S+  ++FG+N     PG      +VS+ L    + Y +++  +SV    L V + +    
Sbjct: 271 SAGYLSFGSNSDNKNPGKYSYTSMVSSSLD--ASLYFVSLAGMSVAGSPLAVPSSEYGSL 328

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIH 358
             +IDSGT +T LP    + L   + + + A      +  L+ C+        VP V + 
Sbjct: 329 PTIIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYS-ILQTCFKGQVAKLPVPAVNMA 387

Query: 359 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           F  GA ++L+  N  V V+E   C  F   T+S  I GN  Q  F V YD++   + F  
Sbjct: 388 FAGGATLRLTPGNVLVDVNETTTCLAFA-PTDSTAIIGNTQQQTFSVVYDVKGSRIGFAA 446

Query: 418 TDCT 421
             C+
Sbjct: 447 GGCS 450


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  194 bits (493), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 149/430 (34%), Positives = 208/430 (48%), Gaps = 40/430 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISS 76
            A T   S+ + HR    S   N   T        RL  A   S++ +L+       +S 
Sbjct: 54  RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSE 113

Query: 77  SKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           SK+    ++      + NY++ + +GTP  +   + DTGSDL WTQC+PC  + CY Q  
Sbjct: 114 SKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKE 172

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           P+F+P  S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G LA E  
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL ++        G+ FGCG NN GLF +   G++GLG   +S  SQ  T     FSYCL
Sbjct: 233 TLTNSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL 287

Query: 248 VPVSST---KINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV----- 296
            P S++    + FG+ GI     V  TP   +T   +FY L I AI+VG Q+L +     
Sbjct: 288 -PSSASYTGHLTFGSAGISR--SVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVF 344

Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
           STP  +IDSGT +T LP    + L S   + +   P       L+ C+  +    V  P+
Sbjct: 345 STPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPK 404

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 411
           V   F  GA V+L     F       VC  F G ++  +  I+GN+ Q    V YD    
Sbjct: 405 VAFSFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGG 464

Query: 412 TVSFKPTDCT 421
            V F P  C+
Sbjct: 465 RVGFAPNGCS 474


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  194 bits (492), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 148/427 (34%), Positives = 221/427 (51%), Gaps = 42/427 (9%)

Query: 21  PIEAQTGGFSVELIHRDSPKSP-----FYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
           P  A++ GFS  +I R    +      F  ++   ++RL    +RS ++++   Q+SS S
Sbjct: 22  PAHAESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASRS-SQVDK-PQSSSAS 79

Query: 76  SSKASQADIIP-----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
               +  D +P         Y +  SIGTPP +  A+ADTGSDLIWT+C+          
Sbjct: 80  QLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG--GGAAWG 137

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-----GVNCQYSVSYG---DGSFSNGNL 182
            S  + P  SST+  LPCS   CA+L   S +     G  C Y  +YG   D  F+ G L
Sbjct: 138 GSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFL 197

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
            +ET TLG       A+PG+ FGC T   G +  +  G+VGLG G +SL+SQ+    AG 
Sbjct: 198 GSETFTLGGD-----AVPGVGFGCTTALEGDYG-EGAGLVGLGRGPLSLVSQLD---AGT 248

Query: 243 FSYCLVPVSS--TKINFGTNGIV--SGPGVVSTPLTKAKTFYVLTIDAISVGNQRLG--V 296
           F YCL   +S  + + FG    +  +G GV ST L  + TFY + + +I++G+       
Sbjct: 249 FMYCLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVG 308

Query: 297 STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQVPE 354
               +V DSGTTLT+L +  Y     + +S      PV    G  E CY   +S   +P 
Sbjct: 309 GPGGVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYG-FEACYEKPDSARLIPA 367

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           + +HF  GAD+ L  +N+ V+V + +VC V +  + S+ I GNIMQ N+LV +D+ +  +
Sbjct: 368 MVLHFDGGADMALPVANYVVEVDDGVVCWVVQ-RSPSLSIIGNIMQMNYLVLHDVRKSVL 426

Query: 414 SFKPTDC 420
           SF+P +C
Sbjct: 427 SFQPANC 433


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  194 bits (492), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 174/351 (49%), Gaps = 24/351 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST    ++FG   + 
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSLA 348

Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
           +    ++TP+      TFY + +  I VG Q L +     +T   ++DSGT +T LP   
Sbjct: 349 AARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408

Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
            S+L      +        A     L+ CY F  +SQV  P V++ F+ GA + +  S  
Sbjct: 409 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 468

Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               S   VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/351 (36%), Positives = 180/351 (51%), Gaps = 26/351 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+ + CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST    ++FG     
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSPA 347

Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
           +   + +TP+      TFY + +  I VG + L +     +T   ++DSGT +T LP   
Sbjct: 348 A--RLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAA 405

Query: 317 NSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
            S+L S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S  
Sbjct: 406 YSSLRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGI 465

Query: 372 FVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               S   VC  F    +   V I GN     F V YDI ++ VSF P  C
Sbjct: 466 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 138/417 (33%), Positives = 204/417 (48%), Gaps = 43/417 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPY--QRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
           SV L+HR  P +P   SS+ P   +RLR +  RS   ++  ++ N SI +      D + 
Sbjct: 60  SVPLVHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSL- 118

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y++ + +GTP   ++ + DTGSDL W QC PC  + CY Q  PLFDP  SSTY  +
Sbjct: 119 ---EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPI 175

Query: 147 PCSSSQCASLNQK---------SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           PC++  C  L +          S  G  C Y+++YGDGS + G  + ET+T+       V
Sbjct: 176 PCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAP----GV 231

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
            +    FGCG +  G  N K  G++GLGG   SL+ Q  +   G FSYCL P ++ +  F
Sbjct: 232 TVKDFHFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCL-PAANDQAGF 289

Query: 258 GTNG--IVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLT 310
              G  +    G V TP+ +  +TFYV+ +  I+VG + + V     +  ++IDSGT +T
Sbjct: 290 LALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVT 349

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSR 368
            L     + L +     + A P+  P G L+ CY+F   S   VP V + F G       
Sbjct: 350 ELQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTFSGG------ 402

Query: 369 SNFFVKVSEDIV---CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +   + V + I+   C  F+  G  N   I GN+ Q    V YD+    V F    C
Sbjct: 403 ATVDLDVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/367 (34%), Positives = 185/367 (50%), Gaps = 32/367 (8%)

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +Q+ +     NY++ + +GTP  +   + DTGSDL WTQC+PC  S CY Q  P+FDP  
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPSA 201

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S TY ++ C+S+ C+ L     N   CS  NC Y + YGD SF+ G  A +T+TL     
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----T 257

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---S 251
           Q     G  FGCG NN GLF  KT G++GLG   +S++ Q        FSYCL P    S
Sbjct: 258 QNDVFDGFMFGCGQNNRGLFG-KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL-PTSRGS 315

Query: 252 STKINFGT-NGIVSGP----GVVSTPL--TKAKTFYVLTIDAISVGNQRLGVS-----TP 299
           +  + FG  NG+ +      G+  TP   ++  TFY + +  ISVG + L +S       
Sbjct: 316 NGHLTFGNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQNA 375

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 357
             +IDSGT +T LP     +L S     +   P A     L+ CY  ++ +   +P+++ 
Sbjct: 376 GTIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISF 435

Query: 358 HFRG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           +F G A+V L  +   +      VC  F   G  +++ I+GNI Q    V YD+    + 
Sbjct: 436 NFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLG 495

Query: 415 FKPTDCT 421
           F    C+
Sbjct: 496 FGYKGCS 502


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 178/348 (51%), Gaps = 21/348 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST   +   G  S P
Sbjct: 291 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCL-PPRSTGTGYLDFGAGSPP 348

Query: 267 GVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSN 319
              +TP+      TFY + +  I VG + L +     +    ++DSGT +T LP    S+
Sbjct: 349 ATTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSS 408

Query: 320 LLSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 374
           L S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     
Sbjct: 409 LRSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYT 468

Query: 375 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           VS   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 VSASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 139/422 (32%), Positives = 205/422 (48%), Gaps = 43/422 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRL--RDALTRSL--NRLNHFNQNSSISSSKASQADII 85
           S+ L+HRD+     Y S+      L  RD         RL+     + + S   S   I 
Sbjct: 70  SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVS--GIS 127

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
             +  Y +R+ +G+PPTE+  V D+GSD+IW QC PC  ++CY Q  PLFDP  S+++ +
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPC--AECYQQADPLFDPAASASFTA 185

Query: 146 LPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           +PC S  C +L   S    +   C+Y VSYGDGS++ G LA ET+T G +T     + G+
Sbjct: 186 VPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDST----PVQGV 241

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
             GCG  N GLF     G++GLG G +SL+ Q+     G FSYCL   +S   + G   +
Sbjct: 242 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQLGGAAGGAFSYCL---ASRGADAGAGSL 297

Query: 263 VSGP------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVI 303
           V G       G V  PL +     +FY + +  + VG +RL +       T D    +V+
Sbjct: 298 VFGRDDAMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVM 357

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIHF- 359
           D+GT +T LP    + L    +S I    P A     L+ CY  +  +  +VP V ++F 
Sbjct: 358 DTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFG 417

Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             GA + L   N  V++   + C  F    + + I GNI Q    +  D     V F P+
Sbjct: 418 RDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPS 477

Query: 419 DC 420
            C
Sbjct: 478 TC 479


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/405 (33%), Positives = 194/405 (47%), Gaps = 56/405 (13%)

Query: 49  TPYQRLRDALTRSLNRLNHF-NQNSSISSSKASQADIIPNN-------ANYLIRISIGTP 100
           T ++ LR    RS  R  H  +        +++ A + P           YL+ ++ GTP
Sbjct: 38  THWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTP 97

Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
           P E     DTGSD+ WTQC+ CP S C+ Q  PLFDP  SS++ SLPCSS  C +     
Sbjct: 98  PQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACET--TPP 155

Query: 161 CSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGG 212
           C G N      C YS+SYGDGS S G +  E  T  S TG+  + A+PG+ FGCG  N G
Sbjct: 156 CGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRG 215

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-PGVV-- 269
           +F S  TGI G G G +SL SQ++    G FS+C   ++ +K    T+ ++ G PGV   
Sbjct: 216 VFTSNETGIAGFGRGSLSLPSQLKV---GNFSHCFTTITGSK----TSAVLLGLPGVAPP 268

Query: 270 -STPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
            ++PL + +  Y                STP    +SGT++T LP      +    ++ +
Sbjct: 269 SASPLGRRRGSYRCR-------------STPR-SSNSGTSITSLPPRTYRAVREEFAAQV 314

Query: 329 EAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED------- 378
           +   V         C+S         VP + +HF GA ++L + N+  +V +D       
Sbjct: 315 KLPVVPGNATDPFTCFSAPLRGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSS 374

Query: 379 -IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            I+C     I     I GNI Q N  V YD++   +SF P  C +
Sbjct: 375 RIICLAV--IEGGEIILGNIQQQNMHVLYDLQNSKLSFVPAQCDQ 417


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 182/348 (52%), Gaps = 25/348 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ ++ +GTP T    V DTGS L W QC PC  S C+ Q  PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYASVRCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G+L+T+TV+ GST       P   
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR-----YPSFY 246

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P +++          
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYN 304

Query: 264 SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
           +G     TP+  +    + Y +T+  +SVG   L VS  +      +IDSGT +T LP  
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 373
            ++ L   ++  +     A     L+ C+    S  +VP V + F  GA +KL+  N  +
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMKLTTRNVLI 424

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            V +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 425 DVDDSTTCLAF-APTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 133/431 (30%), Positives = 212/431 (49%), Gaps = 38/431 (8%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH-FNQNSSISS 76
           +V    AQ      +LIH  S  SP++N + +  +R    +  S  R+ + + Q      
Sbjct: 23  IVEAYNAQPKQLVTKLIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIH 82

Query: 77  SKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
               + +++P+     +L+  S+G P T +LA+ DTGS+++W +C PC   +C  Q+ PL
Sbjct: 83  MNDFELNLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPC--KRCTQQNGPL 140

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            DP  SSTY SLPC+++ C       C+ +N C Y++SY  G  S G LATE +   S+ 
Sbjct: 141 LDPSKSSTYASLPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSD 200

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
               A+P + FGC   NG   + + TG+ GLG G  S +++M      KFSYCL  ++  
Sbjct: 201 EGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRM----GSKFSYCLGNIADP 256

Query: 254 KINFGTNGIVSGPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVST---------PD 300
             ++G N +V G        STPL      Y +T++ ISVG +RL + +           
Sbjct: 257 --HYGYNQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKS 314

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVT 356
            +IDSGT LT+L +     L + +  +++   +    GS   CY   ++SQ     P VT
Sbjct: 315 ALIDSGTALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFA-CYK-GTVSQDLIGFPVVT 372

Query: 357 IHFR-GADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            HF  GAD+ L   + F + + DI+C      S +     S  + G + Q  + + YD+ 
Sbjct: 373 FHFSGGADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLN 432

Query: 410 QQTVSFKPTDC 420
              + F+  DC
Sbjct: 433 SNKLFFQRIDC 443


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  192 bits (488), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 182/348 (52%), Gaps = 25/348 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ ++ +GTP T    V DTGS L W QC PC  S C+ Q  PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYTSVRCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G L+T+TV+ GST+      P   
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS-----YPSFY 246

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P +++          
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCL-PTAASTGYLSIGPYN 304

Query: 264 SGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
           +G     TP+  +    + Y +T+  +SVG   L VS  +      +IDSGT +T LP  
Sbjct: 305 TGHYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTA 364

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 373
            ++ L   ++  +     A     L+ C+    S  +VP V + F  GA +KL+  N  +
Sbjct: 365 VHTALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMKLTTRNVLI 424

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            V +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 425 DVDDSTTCLAF-APTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 178/352 (50%), Gaps = 26/352 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPARSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C  L+ + CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
           G  N GLF  +  G++GLG G  SL  Q      G F++CL   SS    ++FG     +
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 348

Query: 265 GPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYN 317
               ++TP+      TFY + +  I VG Q L +     +T   ++DSGT +T LP    
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAY 408

Query: 318 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSN 370
           S+L S   S + A+    A     L+ CY F  +SQV  P V++ F+G    DV  S   
Sbjct: 409 SSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIM 468

Query: 371 FFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +   VS+  VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 YAASVSQ--VCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 152/463 (32%), Positives = 226/463 (48%), Gaps = 66/463 (14%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +L  L   +++   A      +  IH D    P   +SE     +R AL R ++R   F 
Sbjct: 6   VLLILACTILASDAAAAVRVGLTRIHAD----PEVTASEF----VRGALRRDMHRHARFA 57

Query: 70  QNSSISSSKAS---------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           +     SS A+         Q D+  N   Y++ +SIGTPP    A+ADTGSDLIWTQC 
Sbjct: 58  REQLAPSSAAAAGLTVGAPTQKDLR-NGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116

Query: 121 PCPPS------QCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQYSVS 171
           PC  +      QC+ Q   L++P  S+T+  LPC+S  S CA++   S   G  C Y+ +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176

Query: 172 YGDGSFSNGNLATETVTLG-STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
           YG G ++ G  + ET T G S+T  AV +P I FGC   +   +N  + G+VGLG G +S
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNG-SAGLVGLGRGSMS 234

Query: 231 LISQMRTTIAGKFSYCLVPV-------------SSTKINFGTNGIVSGPGVVSTPLTKAK 277
           L+SQ+    AG FSYCL P              S+     GT  + S P V         
Sbjct: 235 LVSQLG---AGAFSYCLTPFQDANSTSTLLLGPSAAAALKGTGPVRSTPFVAGPSKAPMS 291

Query: 278 TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFL-PQGYNSNLLSVMSS 326
           T+Y L +  ISVG   L +           T  ++IDSGTT+T L    Y     +V S 
Sbjct: 292 TYYYLNLTGISVGETALAIPPDAFSLRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSL 351

Query: 327 MIEAQPVA---DPTGSLELCYSFNSLS---QVPEVTIHFR-GADVKLSRSNFFVKVSEDI 379
           ++   P+A   D +  L+LC++  + +    +P +T+HF  GAD+ L   N+ + +   +
Sbjct: 352 LVTRLPLAHGPDHSTGLDLCFALKASTPPPAMPSMTLHFEGGADMVLPVENYMI-LGSGV 410

Query: 380 VCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            C   +  T  ++ + GN  Q N  V YD+ ++T+SF P  C+
Sbjct: 411 WCLAMRNQTVGAMSMVGNYQQQNIHVLYDVRKETLSFAPAVCS 453


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 151/424 (35%), Positives = 211/424 (49%), Gaps = 48/424 (11%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 126 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 180

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 181 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 238

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 239 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 298

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--V 248
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+   FSYCL   
Sbjct: 299 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 352

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
           P SS  +  G  G     G V TP+ ++    TFY + + AI VG ++L +     +   
Sbjct: 353 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 412

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 472

Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L  S   +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 473 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527

Query: 417 PTDC 420
              C
Sbjct: 528 AGAC 531


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 125/367 (34%), Positives = 185/367 (50%), Gaps = 32/367 (8%)

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +Q+ +     NY++ + +GTP  +   + DTGSDL WTQC+PC  S CY Q  P+FDP  
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPST 201

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S TY ++ C+S+ C+SL     N   CS  NC Y + YGD SF+ G  A + +TL     
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTL----T 257

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---S 251
           Q     G  FGCG NN GLF  KT G++GLG   +S++ Q        FSYCL P    S
Sbjct: 258 QNDVFDGFMFGCGQNNKGLFG-KTAGLIGLGRDPLSIVQQTAQKFGKYFSYCL-PTSRGS 315

Query: 252 STKINFGT-NGIVSGP----GVVSTPL--TKAKTFYVLTIDAISVGNQRLGVS-----TP 299
           +  + FG  NG+ +      G+  TP   ++   +Y + +  ISVG + L +S       
Sbjct: 316 NGHLTFGNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQNA 375

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 357
             +IDSGT +T LP     +L S     +   P A     L+ CY  ++ +   +P+++ 
Sbjct: 376 GTIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISF 435

Query: 358 HFRG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           +F G A+V+L  +   +      VC  F   G  +S+ I+GNI Q    V YD+    + 
Sbjct: 436 NFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLG 495

Query: 415 FKPTDCT 421
           F    C+
Sbjct: 496 FGYKGCS 502


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  191 bits (486), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 141/424 (33%), Positives = 211/424 (49%), Gaps = 46/424 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           GF   L H D+      N+  T  Q L  A+ RS  R+      ++ + +  +   ++  
Sbjct: 30  GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 83

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   YL+ + IG+PP    A+ DTGSDLIWTQC PC    C  Q +P F+P  S++Y SL
Sbjct: 84  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 141

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PCSS+ C +L    C    C Y   YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 200

Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG---- 258
           G  N G LFN   +G+VG G G +SL+SQ+ +    +FSYCL      +++++ FG    
Sbjct: 201 GNMNAGTLFNG--SGMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 255

Query: 259 ---TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
              TN   SGP V STP        T Y L +  ISV    L +            T  +
Sbjct: 256 LNSTNTSSSGP-VQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGV 314

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQVPEVT 356
           +IDSGTT+TFL Q   + +     + +   +  A P+ + + C+ +      +  +PE+ 
Sbjct: 315 IIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 374

Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           +HF GAD++L   N+ V         +    ++   I G+    NF + YD+E   +SF 
Sbjct: 375 LHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFV 434

Query: 417 PTDC 420
           P  C
Sbjct: 435 PAPC 438


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 150/424 (35%), Positives = 211/424 (49%), Gaps = 48/424 (11%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 56  GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+   FSYCL P 
Sbjct: 229 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 282

Query: 251 SSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
            S+   +  G  G     G V TP+ ++    TFY + + AI VG ++L +     +   
Sbjct: 283 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 342

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402

Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L  S   +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 417 PTDC 420
              C
Sbjct: 458 AGAC 461


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 137/409 (33%), Positives = 198/409 (48%), Gaps = 54/409 (13%)

Query: 60  RSLNRLNHFNQNSSI----SSSKASQADIIPN-------NANYLIRISIGTPPTERLAVA 108
           RSL R    ++ ++     +S +A+ A + P        +  YL+ ++IGTPP     + 
Sbjct: 373 RSLTRREVLHRMAARLLFSASGRAASARVDPGPYANGVPDTEYLVHLAIGTPPQPVQLIL 432

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
           DTGSDL+WTQC PCP   C+ +     DP  SST+  LPCSS  C +L   SC   N   
Sbjct: 433 DTGSDLVWTQCRPCP--VCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGN 490

Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTNNGGLFNSKTTGI 221
             C Y  +Y DGS + G+L  ET T  +   TGQA  +P + FGCG  N G+F S  TGI
Sbjct: 491 QTCVYVYAYADGSITTGHLDAETFTFAAADGTGQAT-VPDLAFGCGLFNNGIFTSNETGI 549

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVV-STPLTK 275
            G G G +SL SQ++      FS+C   +     SS  +    N      G V STPL +
Sbjct: 550 AGFGRGALSLPSQLKVD---NFSHCFTAITGSEPSSVLLGLPANLYSDADGAVQSTPLVQ 606

Query: 276 ---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGYNSNLLS 322
              +   Y L++  I+VG+ RL +           T   +IDSGT +T LPQ     +  
Sbjct: 607 NFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTIIDSGTGMTTLPQDAYKLVHD 666

Query: 323 VMSSMIEAQPVADPTGS--LELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFFVKVS 376
             ++ +   PV + T S    LC+SF     +   VP++ +HF GA + L R N+  +  
Sbjct: 667 AFTAQVRL-PVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLHFEGATLDLPRENYMFEFE 725

Query: 377 E---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +    + C       + + I GN  Q N  V YD+ +  +SF P  C +
Sbjct: 726 DAGGSVTCLAINA-GDDLTIIGNYQQQNLHVLYDLVRNMLSFVPAQCNR 773


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 141/395 (35%), Positives = 208/395 (52%), Gaps = 43/395 (10%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQAD--IIPN--NANYLIRISIGTPPTERLAVAD 109
           ++ A+ RS  RL      S++++ +    +  + P+  +  YLI+++IGTP     A+ D
Sbjct: 1   MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQY 168
           TGSDL+WT+C PC  + C            SSTY  + C SS C   +  SC+   +C+Y
Sbjct: 61  TGSDLVWTKCNPC--TDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEY 116

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
              YGD S ++G L+ ET ++ S      +LP ITFGCG +N G    K  G+VG G G 
Sbjct: 117 VYPYGDRSSTSGILSDETFSISSQ-----SLPNITFGCGHDNQGF--DKVGGLVGFGRGS 169

Query: 229 ISLISQMRTTIAGKFSYCLVPVS----STKINFGTNGIVSGPGVVSTPLTKAKT--FYVL 282
           +SL+SQ+  ++  KFSYCLV  +    ++ +  G    +    V STPL ++ +   Y L
Sbjct: 170 LSLVSQLGPSMGNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYYL 229

Query: 283 TIDAISVGNQRLGVST----------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
           +++ ISVG Q L + T            ++IDSGTTLTFL Q     +   M S I   P
Sbjct: 230 SLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINL-P 288

Query: 333 VADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITN 389
            AD  G L+LC++    S    P +T HF+GAD  + + N+ F   + DIVC      TN
Sbjct: 289 QAD--GQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMM-PTN 345

Query: 390 S----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S    + I+GN+ Q N+ + YD E   +SF PT C
Sbjct: 346 SNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 138/447 (30%), Positives = 216/447 (48%), Gaps = 62/447 (13%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSS 73
           +A  G   + L H D+ K        +  + +R A+ RS  R    +            S
Sbjct: 28  DAFAGDVRLHLTHVDAGKQ------MSRRELIRRAMQRSKARAAALSVARSGSGRVPGKS 81

Query: 74  ISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
               +  Q   +P     +  YLI ++IGTPP    A+ DTGSDLIWTQC PC  + C  
Sbjct: 82  AQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPC--ASCLA 139

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q  PLF P  SS+Y  + CS   C  +   SC   + C Y  +YGDG+ + G  ATE  T
Sbjct: 140 QPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFT 199

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
             S++G+ +++P + FGCGT N G  N+  +GIVG G   +SL+SQ+      +FSYCL 
Sbjct: 200 FASSSGEKLSVP-LGFGCGTMNVGSLNNG-SGIVGFGRDPLSLVSQLSIR---RFSYCLT 254

Query: 249 PVSSTK---INFG--TNGIVSGPG-----VVSTPLTKAK---TFYVLTIDAISVGNQRLG 295
           P +ST+   + FG  ++G+  G       V +T L +++   TFY +    ++VG +RL 
Sbjct: 255 PYTSTRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLR 314

Query: 296 VS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
           +        PD    +++DSGT LT  P    + +L    + +     +  +    +C++
Sbjct: 315 IPLSAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFA 374

Query: 346 -----------FNSLSQVPEVTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPI 393
                        ++  VP +  HF+GAD++L R N+ +       +C +     +S   
Sbjct: 375 TPMAAGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGAT 434

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            GN +Q +  V YD+E +T+SF P  C
Sbjct: 435 IGNFVQQDMRVLYDLEAETLSFAPAQC 461


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  191 bits (485), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 141/424 (33%), Positives = 211/424 (49%), Gaps = 46/424 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           GF   L H D+      N+  T  Q L  A+ RS  R+      ++ + +  +   ++  
Sbjct: 27  GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 80

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   YL+ + IG+PP    A+ DTGSDLIWTQC PC    C  Q +P F+P  S++Y SL
Sbjct: 81  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 138

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PCSS+ C +L    C    C Y   YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 197

Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG---- 258
           G  N G LFN   +G+VG G G +SL+SQ+ +    +FSYCL      +++++ FG    
Sbjct: 198 GNMNAGTLFNG--SGMVGFGRGALSLVSQLGSP---RFSYCLTSFMSPATSRLYFGAYAT 252

Query: 259 ---TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
              TN   SGP V STP        T Y L +  ISV    L +            T  +
Sbjct: 253 LNSTNTSSSGP-VQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGV 311

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQVPEVT 356
           +IDSGTT+TFL Q   + +     + +   +  A P+ + + C+ +      +  +PE+ 
Sbjct: 312 IIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTLPEMV 371

Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           +HF GAD++L   N+ V         +    ++   I G+    NF + YD+E   +SF 
Sbjct: 372 LHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSLLSFV 431

Query: 417 PTDC 420
           P  C
Sbjct: 432 PAPC 435


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  191 bits (484), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 137/383 (35%), Positives = 204/383 (53%), Gaps = 26/383 (6%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAV 107
           P   L  A  +S  RL+        ++S ++Q  +  ++    Y +  SIGTPP E  A+
Sbjct: 39  PAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSAL 98

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVN 165
           ADTGSDLIW +C  C  ++C  Q SP + P  SS++  LPCS S C+ L    CS  G  
Sbjct: 99  ADTGSDLIWAKCGAC--TRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAE 156

Query: 166 CQYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
           C Y  SYG  S    ++ G L +ET TLGS      A+PGI FGC T          +G+
Sbjct: 157 CDYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGC-TTMSEGGYGSGSGL 210

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGIVSGPGVVSTPLTKAKT- 278
           VGLG G +SL+SQ+     G FSYCL      ++ + FG+ G ++G GV STPL +  T 
Sbjct: 211 VGLGRGPLSLVSQLNV---GAFSYCLTSDAAKTSPLLFGS-GALTGAGVQSTPLLRTSTY 266

Query: 279 FYVLTIDAISVGNQ-RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT 337
           +Y + +++IS+G     G  +  I+ DSGTT+ FL +   +     + S      +A   
Sbjct: 267 YYTVNLESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGR 326

Query: 338 GSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
              E+C+   S +  P + +HF G D+ L   N+F  V + + C + +  + S+ I GNI
Sbjct: 327 DGYEVCFQ-TSGAVFPSMVLHFDGGDMDLPTENYFGAVDDSVSCWIVQK-SPSLSIVGNI 384

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
           MQ N+ + YD+E+  +SF+P +C
Sbjct: 385 MQMNYHIRYDVEKSMLSFQPANC 407


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 143/447 (31%), Positives = 199/447 (44%), Gaps = 55/447 (12%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS--- 77
           PI A +    V ++HR  P SP   +         + L    NR+   +   S +++   
Sbjct: 65  PITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILAADQNRVESLHHRVSSTTTGLG 124

Query: 78  -KASQADIIPNN------------------------ANYLIRISIGTPPTERLAVADTGS 112
            K       P +                        ANY++ I +GTPP+    V DTGS
Sbjct: 125 GKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGS 184

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
           D  W QC PC  S CY Q   LFDP  SSTY ++ C+   CA L+   C+  +C Y + Y
Sbjct: 185 DTTWVQCRPCVVS-CYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY 243

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           GDGS++ G  A +T+ +        A+ G  FGCG  N GLF  +T G++GLG G  S+ 
Sbjct: 244 GDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCGEKNRGLFG-QTAGLLGLGRGPTSIT 297

Query: 233 SQMRTTIAGKFSYCLVPVSSTKINF----GTNGIVSGPGVVSTPL--TKAKTFYVLTIDA 286
            Q      G FSYCL P SS    +      +   SG    +TP+   K  TFY + +  
Sbjct: 298 VQAYEKYGGSFSYCL-PASSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTG 356

Query: 287 ISVGNQRLGV------STPDIVIDSGTTLTFLPQ--GYNSNLLSVMSSMIEAQPVADPTG 338
           I VG ++LG       S    ++DSGT +T LP       +     +        A    
Sbjct: 357 IRVGGKQLGAIPESVFSNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYS 416

Query: 339 SLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPI 393
            L+ CY F  LSQV  P V++ F+ GA + L  S     +S+  VC  F   G   SV I
Sbjct: 417 ILDTCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDESVGI 476

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            GN  Q  + V YD+ ++ V F P  C
Sbjct: 477 VGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 146/429 (34%), Positives = 204/429 (47%), Gaps = 48/429 (11%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           + + +G  +V L HR  P SP   + + P   L D L R   R  +  +  S    K  Q
Sbjct: 50  VRSSSGATTVPLHHRHGPCSPL-PTKKMP--SLEDRLHRDQLRAAYIKRKFSGDVKKDGQ 106

Query: 82  AD--------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                      +P       N   YLI + +G+P   +  + D+GSD+ W QC+PC   Q
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQ 164

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLA 183
           C+ Q  PLFDP +SSTY    CSS+ CA L Q      S   CQY V Y DGS + G  +
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYS 224

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           ++T+ LGS T     +    FGC     G FN  T G++GLGGG  SL SQ   T    F
Sbjct: 225 SDTLALGSNT-----ISNFQFGCSHVESG-FNDLTDGLMGLGGGAPSLASQTAGTFGTAF 278

Query: 244 SYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST- 298
           SYCL P  S+     +  GT+G V  P + S+P+    TFY + ++AI VG  +L + T 
Sbjct: 279 SYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPV---PTFYGVRLEAIRVGGTQLSIPTS 335

Query: 299 ---PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--P 353
                +V+DSGT +T LP+   S L S   + ++    A P   ++ C+ F+  S V  P
Sbjct: 336 VFSAGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLP 395

Query: 354 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 411
            V + F G  V    +N  +  +    C  F   ++  S  I GN+ Q  F V YD+   
Sbjct: 396 SVALVFSGGAVVNLDANGIILGN----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGG 451

Query: 412 TVSFKPTDC 420
            V FK   C
Sbjct: 452 AVGFKAGAC 460


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 143/418 (34%), Positives = 203/418 (48%), Gaps = 34/418 (8%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADI- 84
           G ++ L+HR  P SP  +  +  ++    RD L R+ N     +   + S+ +  Q+ + 
Sbjct: 58  GATLPLVHRHGPCSPVMSKEKPSHEETLGRDQL-RAANIHAKLSSPRNSSAKELQQSGVT 116

Query: 85  IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP ++        Y+I +S+GTP   ++   DTGSD+ W QC PC    C  Q   LFDP
Sbjct: 117 IPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDP 176

Query: 138 KMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             S+TY +  CSS+QCA L  +   C   +CQY V Y D S + G   ++  TLG TT  
Sbjct: 177 AKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSD--TLGLTTSD 234

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
           AV      FGC     G F  +  G++GLGG   SL+SQ   T    FSYCL P SS+  
Sbjct: 235 AVK--NFQFGCSHRANG-FVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAG 291

Query: 256 NFGTNGIVSGPGVVS----TPLTK--AKTFYVLTIDAISVGNQRLGVSTPDI----VIDS 305
            F T G  +G    S    TPL +    TFY + + AI+V   +L V         V+DS
Sbjct: 292 GFLTLGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVDS 351

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGA 362
           GT +T LP      L +     ++A P A P G L+ C+ F+ +   +VP VT+ F RGA
Sbjct: 352 GTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRGA 411

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            + L  S  F         +   G T    I GN+ Q  F + +D+   T+ F+P  C
Sbjct: 412 VMDLDVSGIFYAGCLAFTATAQDGDTG---ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 135/416 (32%), Positives = 197/416 (47%), Gaps = 31/416 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADII 85
           G ++ L HR  P SP  +  +  ++    RD L  +  +    ++ ++++      A  I
Sbjct: 57  GSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTI 116

Query: 86  PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P ++        Y+I ++IGTP   ++   DTGSD+ W QC PC    C  Q   LFDP 
Sbjct: 117 PTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPA 176

Query: 139 MSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           MS+TY +  C S+QCA L  +   C    CQY V YGDGS + G   ++T++L S+    
Sbjct: 177 MSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD--- 233

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
            A+    FGC     G F  +  G++GLGG   SL+SQ   T    FSYCL P SS+   
Sbjct: 234 -AVKSFQFGCSHRAAG-FVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291

Query: 257 F---GTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGT 307
           F   G  G  S      TP+ +    TFY + +  I+V    L V         V+DSGT
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGT 351

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
            +T LP      L +     ++A P A P GSL+ C+ F+  +   VP VT+ F RGA +
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAM 411

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L  S            +   G T    I GN+ Q  F + +D+  +T+ F+   C
Sbjct: 412 DLDISGILYAGCLAFTATAHDGDTG---ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 136/384 (35%), Positives = 206/384 (53%), Gaps = 33/384 (8%)

Query: 57  ALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDL 114
           A  RS  RL+        +S+ ++Q+ +  ++    Y +  S+GTPP    A+ADTGSDL
Sbjct: 45  AAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSALADTGSDL 104

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVN-----C 166
           IW +C  C   +C  + S  + P  SS++  LPCSS+ C +L  +S   C G       C
Sbjct: 105 IWAKCGAC--KRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTRARGAVC 162

Query: 167 QYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
            Y  SYG  S    ++ G + +ET TLGS      A+ GI FGC T          +G+V
Sbjct: 163 SYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGC-TTMSEGGYGSGSGLV 216

Query: 223 GLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGIVSGPGVVSTPLT--KAKT 278
           GLG G +SL+ Q++    G FSYCL   P +S+ + FG  G ++GPGV STPL   K  T
Sbjct: 217 GLGRGKLSLVRQLK---VGAFSYCLTSDPSTSSPLLFGA-GALTGPGVQSTPLVNLKTST 272

Query: 279 FYVLTIDAISVGNQRL-GVSTPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADP 336
           FY + +D+IS+G  +  G     I+ DSGTTLTFL +  Y      ++S       V   
Sbjct: 273 FYTVNLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGT 332

Query: 337 TGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
            G  E+C+  +  +  P + +HF G D+ L   N+F  V++ + C + +   + + I GN
Sbjct: 333 DG-YEVCFQTSGGAVFPSMVLHFDGGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGN 391

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
           IMQ ++ + YD+++  +SF+PT+C
Sbjct: 392 IMQMDYHIRYDLDKSVLSFQPTNC 415


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 134/386 (34%), Positives = 191/386 (49%), Gaps = 28/386 (7%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVADTG 111
           RD L     R  H + NSS +         +P       Y + + +GTP  +   + DTG
Sbjct: 94  RDQLRVKSIRAKH-SMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTG 152

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----CQ 167
           SDL WTQCEPC    C+ Q+   FDP  S++YK+L CSS  C S+ ++S  G +    C 
Sbjct: 153 SDLTWTQCEPCS-GGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCL 211

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y V YG G ++ G LATET+T+  +            GCG  NGG F S T G++GLG  
Sbjct: 212 YGVKYGTG-YTVGFLATETLTITPSD----VFENFVIGCGERNGGRF-SGTAGLLGLGRS 265

Query: 228 DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI 287
            ++L SQ  +T    FSYCL   SS+  +    G VS     +   +K    Y L +  I
Sbjct: 266 PVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGI 325

Query: 288 SVGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           SVG ++L +      T   +IDSGTTLT+LP   +S L S    M+    +   T  L+ 
Sbjct: 326 SVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQP 385

Query: 343 CYSFNSLSQ----VPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK--GITNSVPIY 394
           CY F+  +     +P+++I F G  +V +  S  F+  +  + VC  FK  G    V I+
Sbjct: 386 CYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIF 445

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
           GN+ Q  + V YD+ +  V F P  C
Sbjct: 446 GNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 182/354 (51%), Gaps = 33/354 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTP  E   V DTGSD+ W QC PC  S+CY Q  P+FDP  SST+KSL 
Sbjct: 161 SGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPC--SECYQQSDPIFDPTSSSTFKSLT 218

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CS  +CASL+  +C    C Y VSYGDGSF+ GN AT+TVT     G++  +  +  GCG
Sbjct: 219 CSDPKCASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF----GESGKVNDVALGCG 274

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
            +N GLF      +   GG  +S+ +Q++   A  FSYCLV   S K   ++F  N +  
Sbjct: 275 HDNEGLFTGAAGLLGLGGGA-LSMTNQIK---AKSFSYCLVDRDSAKSSSLDF--NSVQI 328

Query: 265 GPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTF 311
           G G  + PL   +K  TFY + +   SVG Q++ + +            +++D GT +T 
Sbjct: 329 GAGDATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTR 388

Query: 312 LP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLS 367
           L  Q YNS   + +    + +    P    + CY F+SLS  +VP VT HF G   + L 
Sbjct: 389 LQTQAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLP 448

Query: 368 RSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N+ + + +    C  F   ++S+ I GN+ Q    + YD+    +      C
Sbjct: 449 AKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  189 bits (481), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 149/424 (35%), Positives = 210/424 (49%), Gaps = 48/424 (11%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 56  GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+   FSYCL P 
Sbjct: 229 SS-----AVKSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPT 282

Query: 251 SSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDI 301
            S+   +  G  G     G V TP+ ++    TFY + + AI VG ++L +     +   
Sbjct: 283 PSSSGFLTLGAAGGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGT 342

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402

Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L  S   +       C  F   ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 417 PTDC 420
              C
Sbjct: 458 AGAC 461


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 132/368 (35%), Positives = 190/368 (51%), Gaps = 42/368 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I +G+PP +  A+ DTGSDL+W QC+PC  SQCY Q  P++DP  SST+    CS+
Sbjct: 4   YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPC--SQCYSQSDPIYDPSASSTFAKTSCST 61

Query: 151 SQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S C SL    CS     C Y   YGD S + G+ A ET+TL S+ G + A P   FGCG 
Sbjct: 62  SSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGR 121

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIV 263
            N G F     GIVGLG G ISL +Q+ + I  KFSYCLV        ++ + FG++   
Sbjct: 122 LNSGSFGG-AAGIVGLGQGKISLSTQLGSAINNKFSYCLVDFDDDSSKTSPLIFGSSA-S 179

Query: 264 SGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI------------------- 301
           +G G +STP+   +   T+Y + ++ ISVG ++L ++T  I                   
Sbjct: 180 TGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVN 239

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
               + DSGTTLT L     S + S  +S +    V   +   +LCY  +     + P +
Sbjct: 240 SGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKNFKFPAL 299

Query: 356 TIHFRGADVKLSRSNFFVKV--SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           T+ F+G      + N+FV V  +E + C ++    +  + I GN+MQ N+ V YD    T
Sbjct: 300 TLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRGTST 359

Query: 413 VSFKPTDC 420
           +S  P  C
Sbjct: 360 ISMSPAQC 367


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 132/350 (37%), Positives = 189/350 (54%), Gaps = 30/350 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +GTP T    V DTGS L W QC PC  S C+ Q  PL+DP+ SSTY ++PCS
Sbjct: 133 NYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLYDPRASSTYATVPCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G L+ +TV+ GS +      P   
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS-----YPNFY 246

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-VPVSSTKINFG--TN 260
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL  P S+  ++ G  T+
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTS 305

Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
           G  S   + S+ L    + Y +T+  +SVG   L VS  +      +IDSGT +T LP  
Sbjct: 306 GHYSYTPMASSSLD--ASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTA 363

Query: 316 -YNSNLLSVMSSMIEAQPVADPTGS-LELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNF 371
            Y +   +V ++M+  Q  + P  S L+ C+    S  +VP V + F  GA +KL+  N 
Sbjct: 364 VYTALSKAVAAAMVGVQ--SAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLKLATQNV 421

Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + V +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 422 LIDVDDSTTCLAFA-PTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 173/344 (50%), Gaps = 17/344 (4%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY+I +  GTP   +  V DTGSD+ W QC+PC   +CY Q  PLFDP +SSTY+++ 
Sbjct: 13  SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCA-VRCYAQQEPLFDPSLSSTYRNVS 71

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C+   C  L+ + CS   C Y V YGDGS + G LA +T  L      A       FGCG
Sbjct: 72  CTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFML----TPAQKFKNFIFGCG 127

Query: 208 TNNGGLFNSKTTGIVGLG-GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
            NN GLF   T G+VGLG     SL SQ+  ++   FSYCL   SS           + P
Sbjct: 128 QNNTGLFQG-TAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQNTP 186

Query: 267 GVVSTPL-TKAKTFYVLTIDAISVGNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNL 320
           G  +    T+  T Y + +  ISVG  RL +S+        +IDSGT +T LP    S L
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRLPPTAYSAL 246

Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED 378
            + + + +    +A     L+ CY F+  + V  P + +HF G DV++  +  F   +  
Sbjct: 247 KTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFFVFNSS 306

Query: 379 IVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            VC  F G T+S  + I GN+ Q    V YD E + + F    C
Sbjct: 307 QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  189 bits (479), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 149/461 (32%), Positives = 217/461 (47%), Gaps = 55/461 (11%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M+  L+   I   L     +P    T     +L H D  +        T ++RL     R
Sbjct: 6   MSELLAYALIFTLLFTAAATPTAGLT--MRADLTHVDKGRG------FTRWERLSRMAVR 57

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQC 119
           S  R     Q       +   A  +P++  YLI  +IGTP  +R+A+  DTGSDL+WTQC
Sbjct: 58  SRARAASLYQRGG-HYGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQC 116

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---ASLNQKSCS--GVNCQYSVSYGD 174
            PCP   C+ Q  PLFDP +SST++++ C    C   + L+  +C+     C Y  SYGD
Sbjct: 117 TPCP--VCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGD 174

Query: 175 GSFSNGNLATETVTLGSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
            S + G +  +T T  S  G+    VA+ G+ FGCG  N G+F S  +GI G G G +SL
Sbjct: 175 KSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSL 234

Query: 232 ISQMRTTIAGKFSYCLVPVSSTKIN------FGT--NGIV---SGPGVVSTPLTKA---K 277
            SQ+R    G+FSYCL     T+ N       GT  NG+    SGP   STP+  +    
Sbjct: 235 PSQLRV---GRFSYCLTSHDETESNKTSAVFLGTPPNGLRAHSSGP-FRSTPIIHSPSFP 290

Query: 278 TFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
           TFY L+++ I+VG  RL V +             VIDSGT +T  P      L +   + 
Sbjct: 291 TFYYLSLEGITVGKTRLPVDSSVFALKKDGSGGTVIDSGTGVTTFPAAVFEQLKNEFVAQ 350

Query: 328 IEAQPVADPTGSLE--LCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED-IVC 381
           +   P  D T  +   LC+      +   VP++  H   AD+ L R N+  + ++  ++C
Sbjct: 351 LPL-PRYDNTSEVGNLLCFQRPKGGKQVPVPKLIFHLASADMDLPRENYIPEDTDSGVMC 409

Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            +  G    + + GN  Q N  + YD+E   + F    C K
Sbjct: 410 LMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLFASAQCDK 450


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 140/423 (33%), Positives = 208/423 (49%), Gaps = 38/423 (8%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA----- 82
           GF+  LIH DSP SPFYN + T   R+   + RS +RLN+    + +S +          
Sbjct: 7   GFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSP 66

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS- 141
            ++     YL+  +IG P ++ +   DT + LIW QC  C  SQC  +   L    +SS 
Sbjct: 67  TLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNC-NSQCEPEKRGLTTKFLSSK 125

Query: 142 --TYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
             TY+  PC S+ C SL   ++C+  +  C+Y + YGD   ++G L++++    ++ G  
Sbjct: 126 SFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGML 185

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SS 252
           V +  + FGC            TG VGL    +SLISQ+      KFSYCLVP     S+
Sbjct: 186 VDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQLGIK---KFSYCLVPFNNLGST 242

Query: 253 TKINFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS--------TPDIV 302
           +K+ FG+  + SG     TPL    +  +YV  +  IS+GN                  +
Sbjct: 243 SKMYFGSLPVTSGG---QTPLLYPNSDAYYVKVL-GISIGNDEPHFDGVFDVYEVRDGWI 298

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF---NSLSQVPEVTIH 358
           ID+G T + L      +LL+   ++ +  Q   DP    ELC+     N L   P+VT+H
Sbjct: 299 IDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTVH 358

Query: 359 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           F GAD+ L+  + FVK+ +D I C       + V I GN    N+ VGYD+E Q +SF P
Sbjct: 359 FDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFAP 418

Query: 418 TDC 420
            DC
Sbjct: 419 VDC 421


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 134/355 (37%), Positives = 175/355 (49%), Gaps = 32/355 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ I +GTP      V DTGSD  W QCEPC    CY Q   LFDP  SST  ++
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYEQQEKLFDPARSSTDANI 240

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L  K CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 241 SCAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AIKGFRFGC 296

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F++C    SS     GT  +  GP
Sbjct: 297 GERNEGLFG-EAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSS-----GTGYLDFGP 350

Query: 267 G---VVSTPLT------KAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
           G    VST LT         TFY + +  I VG + L +     +T   ++DSGT +T L
Sbjct: 351 GSSPAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRL 410

Query: 313 PQGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVK 365
           P    S+L S  +S I A+    A     L+ CY F  +SQV  P V++ F+G    DV 
Sbjct: 411 PPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVD 470

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            S   +   VS+  +        + V I GN     F V YDI ++ V F P  C
Sbjct: 471 ASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  188 bits (478), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 141/422 (33%), Positives = 202/422 (47%), Gaps = 41/422 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTR--SLNRLNHFNQNSSISSSKASQADII 85
           ++ ++HR  P SP       P   + L D   R  S++R      +  +  ++  +   +
Sbjct: 74  ALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTL 133

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P          NY++ + +GTP  +   V DTGSDL W QC PC  S CY Q  PLFDP 
Sbjct: 134 PAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPC--SDCYEQKDPLFDPA 191

Query: 139 MSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            SSTY ++PC+S +C  L+ +SCS    C+Y V YGD S ++G LA +T+TL     Q+ 
Sbjct: 192 RSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTL----TQSD 247

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
            LPG  FGCG  + GLF  +  G+VGLG   +SL SQ  +     FSYCL P S +   +
Sbjct: 248 VLPGFVFGCGEQDTGLFG-RADGLVGLGREKVSLSSQAASKYGAGFSYCL-PSSPSAAGY 305

Query: 258 GTNGIVSGPGVVSTPLTKAKT------FYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
            + G   GP   +   T  +T      FY + +  + V  + + V     S    VIDSG
Sbjct: 306 LSLG---GPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSG 362

Query: 307 TTLTFLPQGYNSNLLSVMS-SMIEAQPVADPTGS-LELCYSF--NSLSQVPEVTIHFR-G 361
           T +T LP    + L S  + SM        P  S L+ CY F  ++  ++P V + F  G
Sbjct: 363 TVITRLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGG 422

Query: 362 ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           A V L  S   +  KVS+  +     G      I GN  Q    V YD+ +Q + F    
Sbjct: 423 AAVGLDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANG 482

Query: 420 CT 421
           C+
Sbjct: 483 CS 484


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 180/355 (50%), Gaps = 28/355 (7%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y IR+S+GTPP     V DTGSD++W QC PC    CY Q   +FDP  SSTY +L C
Sbjct: 35  GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVFDPYKSSTYSTLGC 92

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
           +S QC +L+   C G  C Y V YGDGSFS G  AT+ V+L ST+G   V L  I  GCG
Sbjct: 93  NSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCG 152

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKINFGTNGI 262
            +N G F      ++GLG G +S  +Q+ +   G+FSYCL          + + FG +  
Sbjct: 153 HDNEGYFVGAAG-LLGLGKGPLSFPNQINSENGGRFSYCLTGRDTDSTERSSLIFG-DAA 210

Query: 263 VSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTL 309
           V   GV  TP     +  TFY L +  ISVG   L + T            ++IDSGT++
Sbjct: 211 VPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGTSV 270

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKL 366
           T L     ++L     +      +       + CY+ + LS   VP VT+HF+ GAD+KL
Sbjct: 271 TRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLHFQGGADLKL 330

Query: 367 SRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             SN+ V V +    C  F G T    I GNI Q  F V YD     V F P+ C
Sbjct: 331 PASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 384


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 135/351 (38%), Positives = 186/351 (52%), Gaps = 30/351 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  PLFDP  SSTY   
Sbjct: 48  NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADPLFDPSSSSTYSPF 105

Query: 147 PCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LGS+     A+    
Sbjct: 106 SCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS-----AVRSFQ 160

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNG 261
           FGC     G FN +T G++GLGGG  SL+SQ   T+   FSYCL   P SS  +  G  G
Sbjct: 161 FGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLGRAFSYCLPPTPSSSGFLTLGAAG 219

Query: 262 IVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ 314
                G V TP+ ++    TFY + + AI VG ++L +     +   V+DSGT +T LP 
Sbjct: 220 GSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVFSAGTVMDSGTVITRLPP 279

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
              S L S   + ++  P A P+G L+ C+ F+  S V  P V + F  GA V L  S  
Sbjct: 280 TAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGAVVSLDASGI 339

Query: 372 FVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+   C
Sbjct: 340 ILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  188 bits (477), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 141/417 (33%), Positives = 199/417 (47%), Gaps = 41/417 (9%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
            +++L H DS      + ++TP       L R   R++  N  ++  SS      +   +
Sbjct: 54  LTLDLHHLDS-----LSLNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSSVVSG-LSQGS 107

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  R+ +GTPP     V DTGSD++W QC PC   +CY Q  P+F+P  S ++  +PC
Sbjct: 108 GEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC--RKCYSQSDPIFNPYKSKSFAGIPC 165

Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           SS  C  L+   CS     C Y VSYGDGSF+ G+ ATET+T        VAL     GC
Sbjct: 166 SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GC 220

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G +N GLF      ++GLG G +S  SQ       KFSYCLV  S++      + +V G 
Sbjct: 221 GHHNEGLFVGAAG-LLGLGRGRLSFPSQTGIRFNHKFSYCLVDRSASS---KPSSMVFGD 276

Query: 267 GVVS-----TPLT---KAKTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGT 307
             +S     TPL    K  TFY + +  ISVG  R+   +P            ++IDSGT
Sbjct: 277 AAISRLARFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGT 336

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVK 365
           ++T L +   + L                    + CY  +  S  +VP V +HFRGAD+ 
Sbjct: 337 SVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRGADMA 396

Query: 366 LSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           L  +N+ + V E+   C  F G  + + I GNI Q  F V YD+    + F P  CT
Sbjct: 397 LPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGCT 453


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  188 bits (477), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 141/438 (32%), Positives = 219/438 (50%), Gaps = 53/438 (12%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD--- 83
           G  S+ELIHR+S          T  Q L + L R   R+      + ++  K  +A    
Sbjct: 54  GTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTD 113

Query: 84  --------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
                   ++  +  Y +R+ +GTP      V DTGSDL W QC+PC    CY Q  P+F
Sbjct: 114 LNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIF 171

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLG 190
           DP+ SS+++ +PC S  C +L   SCSG       C Y V+YGDGSFS G+ +++  TLG
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-----RTTIAGKFSY 245
            T  +A++   + FGCG +N GL  +   G++GLG G +S  SQ+      ++ A  FSY
Sbjct: 232 -TGSKAMS---VAFGCGFDNEGL-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSY 286

Query: 246 CLV----PV--SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV 296
           CLV    P+  SS+ + FG   I S   +  +PL    K  TFY   +  +SVG  +L +
Sbjct: 287 CLVDRSNPMTRSSSSLIFGAAAIPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPI 344

Query: 297 STPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
           S             ++IDSGT++T  P    + +     +     P A      + CY+F
Sbjct: 345 SLKSLQLSQSGSGGVIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNF 404

Query: 347 NSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNF 402
           +  +   VP + +HF  GAD++L  +N+ + + +    C  F   +  + I GNI Q +F
Sbjct: 405 SGKASVDVPALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSF 464

Query: 403 LVGYDIEQQTVSFKPTDC 420
            +G+D+++  ++F P  C
Sbjct: 465 RIGFDLQKSHLAFAPQQC 482


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 49/432 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
           SV L+HR  P +P   S   P   +RLR    R+   + +       ++  S  A     
Sbjct: 18  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       N+  Y++ + IGTP  ++  + DTGSDL W QC+PC   +CY Q  PLFDP
Sbjct: 78  IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137

Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
             SS+Y S+PC S  C  L   +    C+GV+      C+Y + YG+ + + G  +TET+
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 197

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G FSYCL
Sbjct: 198 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 252

Query: 248 VPVSSTKINFGTNGI-------VSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS 297
            P S     F T G         +  G+  TP+ +     TFY++T+  ISVG   L + 
Sbjct: 253 PPTSG-GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 311

Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQ 351
               +  +VIDSGT +T LP    + L S   S +    +  P+  G L+ CY F   + 
Sbjct: 312 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHAN 371

Query: 352 --VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
             VP +++ F  GA + L+     +   +  +     G  N++ I GN+ Q  F V YD 
Sbjct: 372 VTVPTISLTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 429

Query: 409 EQQTVSFKPTDC 420
            + TV F+   C
Sbjct: 430 GKGTVGFRAGAC 441


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  187 bits (476), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 144/415 (34%), Positives = 203/415 (48%), Gaps = 59/415 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F+VE + R   K P YN  +T YQ   + LT  +              S ASQ      +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  RI +GTP  E   V DTGSD+ W QCEPC  + CY Q  P+F+P  SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG 
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSG 265
           +N GLF      +   GG  +S+ +QM+ T    FSYCLV   S K   ++F  N +  G
Sbjct: 274 DNEGLFTGAAGLLGLGGGV-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQLG 327

Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
            G  + PL + K   TFY + +   SVG ++  V  PD            +++D GT +T
Sbjct: 328 GGDATAPLLRNKKIDTFYYVGLSGFSVGGEK--VVLPDAIFDVDASGSGGVILDCGTAVT 385

Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKL 366
            L  Q YNS   + +   +  +  +      + CY F+SLS  +VP V  HF G   + L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDL 445

Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              N+ + V +    C  F   ++S+ I GN+ Q    + YD+ +  +      C
Sbjct: 446 PAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  187 bits (475), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 129/423 (30%), Positives = 201/423 (47%), Gaps = 39/423 (9%)

Query: 27  GGFSVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-- 81
           G + ++L+HRD   +     Y+ S   + R++    R    +   +   + SS    +  
Sbjct: 69  GKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFG 128

Query: 82  ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           A+++      +  Y IRI +G+PP E+  V D+GSD++W QC+PC  +QCY Q  P+FDP
Sbjct: 129 AEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPC--TQCYHQTDPVFDP 186

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             S+++  +PCSSS C  +    C    C+Y V YGDGS++ G LA ET+T G T  + V
Sbjct: 187 ADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNV 246

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTK 254
           A+     GCG  N G+F      ++GLGGG +SL+ Q+     G FSYCLV     S+  
Sbjct: 247 AI-----GCGHRNRGMFVGAAG-LLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGS 300

Query: 255 INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DI 301
           + FG   +  G   +  PL    +A +FY + +  + VG  ++ +S             +
Sbjct: 301 LEFGRGAMPVGAAWI--PLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGV 358

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHF 359
           V+D+GT +T +P                  P A      + CY+ N     +VP V+ +F
Sbjct: 359 VMDTGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYF 418

Query: 360 RGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
            G  +  L   NF + V +    C  F    + + I GNI Q    + +D     V F P
Sbjct: 419 AGGPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGP 478

Query: 418 TDC 420
             C
Sbjct: 479 NVC 481


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 141/432 (32%), Positives = 207/432 (47%), Gaps = 49/432 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
           SV L+HR  P +P   S   P   +RLR    R+   + +       ++  S  A     
Sbjct: 98  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 157

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       N+  Y++ + IGTP  ++  + DTGSDL W QC+PC   +CY Q  PLFDP
Sbjct: 158 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 217

Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
             SS+Y S+PC S  C  L   +    C+GV+      C+Y + YG+ + + G  +TET+
Sbjct: 218 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 277

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G FSYCL
Sbjct: 278 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 332

Query: 248 VPVSSTKINFGTNGI-------VSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS 297
            P S     F T G         +  G+  TP+ +     TFY++T+  ISVG   L + 
Sbjct: 333 PPTSG-GAGFLTLGAPPNSSSSTAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIP 391

Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQ 351
               +  +VIDSGT +T LP    + L S   S +    +  P+  G L+ CY F   + 
Sbjct: 392 PSAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHAN 451

Query: 352 --VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
             VP +++ F  GA + L+     +   +  +     G  N++ I GN+ Q  F V YD 
Sbjct: 452 VTVPTISLTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDS 509

Query: 409 EQQTVSFKPTDC 420
            + TV F+   C
Sbjct: 510 GKGTVGFRAGAC 521


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  187 bits (475), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 183/354 (51%), Gaps = 32/354 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +G+PP     + DTGS L W QC+PC    C+ Q  PLF+P  S+TY+ L 
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPC-VVYCHSQVDPLFEPSASNTYRPLY 175

Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CSSS+C     A+LN   C  SGV C Y+ SYGD S+S G L+ + +TL  T  Q   LP
Sbjct: 176 CSSSECSLLKAATLNDPLCTASGV-CVYTASYGDASYSMGYLSRDLLTL--TPSQ--TLP 230

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
             T+GCG +N GLF  K  GIVGL    +S+++Q+       FSYCL   +S+   F + 
Sbjct: 231 SFTYGCGQDNEGLFG-KAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSI 289

Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
           G +S      TP+ +     + Y L + AI+V  + +GV+        +IDSGT +T LP
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTIIDSGTVVTRLP 349

Query: 314 ----QGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVKL 366
                      + +MS   E  P       L+ C+  S  S+S  PE+ + F+ GAD+ L
Sbjct: 350 ISIYAALREAFVKIMSRRYEQAPAYS---ILDTCFKGSLKSMSGAPEIRMIFQGGADLSL 406

Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              N  ++  + I C  F   +N + I GN  Q  + + YD+    + F P  C
Sbjct: 407 RAPNILIEADKGIACLAFAS-SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 130/363 (35%), Positives = 179/363 (49%), Gaps = 40/363 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
           FGCG  N G+F S  TGI G G G +SL SQ++    G FS+C   V+  K     ++  
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252

Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
            +   SG G V STPL +     TFY L++  I+VG+ RL V          T   +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 312

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
           GT +T LP      +    ++ ++   V+  T     C S    +   VP++ +HF GA 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372

Query: 364 VKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           + L R N+  +V +    I+C ++ +G    V   GN  Q N  V YD++   +SF P  
Sbjct: 373 MDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 420 CTK 422
           C K
Sbjct: 431 CDK 433


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 130/363 (35%), Positives = 179/363 (49%), Gaps = 40/363 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
           FGCG  N G+F S  TGI G G G +SL SQ++    G FS+C   V+  K     ++  
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252

Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
            +   SG G V STPL +     TFY L++  I+VG+ RL V          T   +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLKNGTGGTIIDS 312

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
           GT +T LP      +    ++ ++   V+  T     C S    +   VP++ +HF GA 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372

Query: 364 VKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           + L R N+  +V +    I+C ++ +G    V   GN  Q N  V YD++   +SF P  
Sbjct: 373 MDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQ 430

Query: 420 CTK 422
           C K
Sbjct: 431 CDK 433


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 146/448 (32%), Positives = 219/448 (48%), Gaps = 67/448 (14%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F              S  + +  ++A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTI 284
           G G +SL+SQ+  T   +   C    ++      T          ++PL           
Sbjct: 224 GRGPLSLVSQLGVTRPRR--SCRARAAARGGGAPTT---------TSPL----------- 261

Query: 285 DAISVGNQRLGVS------TP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
           + I+VG+  L +       TP     ++IDSGTT T L +     L   ++S +     +
Sbjct: 262 EGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGTTFTALEERAFVALARALASRVRLPLAS 321

Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
                L LC++  S    +VP + +HF GAD++L R ++ V+     V  +       + 
Sbjct: 322 GAHLGLSLCFAAASPEAVEVPRLVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMS 381

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + G++ Q N  + YD+E+  +SF+P  C
Sbjct: 382 VLGSMQQQNTHILYDLERGILSFEPAKC 409


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 136/370 (36%), Positives = 189/370 (51%), Gaps = 51/370 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PC    C+ Q  P FD   SST   LPC 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCE 91

Query: 150 SSQC---------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+QC           LNQ   +   C Y  SYGD S + G LA +  T  + T    +LP
Sbjct: 92  STQCKLDPTVTVCVKLNQTVQT---CAYYTSYGDNSVTIGLLAADKFTFVAGT----SLP 144

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
           G+TFGCG NN G+FNS  TGI G G G +SL SQ++    G FS+C   +     S+  +
Sbjct: 145 GVTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLL 201

Query: 256 NFGTNGIVSGPGVV-STPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STP 299
           +   +   +G G V +TPL + AK     T Y L++  I+VG+ RL V          T 
Sbjct: 202 DLPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTG 261

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 357
             +IDSGT++T LP      +    ++ I+   V         C+S  S ++  VP++ +
Sbjct: 262 GTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVL 321

Query: 358 HFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           HF GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N  V YD++   
Sbjct: 322 HFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNM 379

Query: 413 VSFKPTDCTK 422
           +SF    C K
Sbjct: 380 LSFVAAQCDK 389


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  187 bits (474), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 141/422 (33%), Positives = 197/422 (46%), Gaps = 41/422 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-----SSISSSKASQADI 84
           S+E+IHR  P     +++ T      + L +  +R++  +        S+   + S+A  
Sbjct: 62  SLEVIHRHGPCGDEVSNAPTA----AEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATK 117

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP        + NY++ + +GTP      + DTGSDL WTQC+PC    CY Q  P+F P
Sbjct: 118 IPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPC-ARYCYNQKDPVFVP 176

Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
             S+TY ++ CSS  C+ L     NQ  CS    C Y + YGD SFS G  A ET+TL S
Sbjct: 177 SQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTS 236

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
           T      +    FGCG NN GLF S   G++GLG   IS++ Q        FSYCL   S
Sbjct: 237 TD----VIENFLFGCGQNNRGLFGS-AAGLIGLGQDKISIVKQTAQKYGQVFSYCLPKTS 291

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----STPDIVI 303
           S+       G   G  +  TP+TKA     FY + I  + VG  ++ +     ST   +I
Sbjct: 292 SSTGYLTFGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTSGAII 351

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG 361
           DSGT +T LP    S L S     +   P A     L+ CY  +  S  Q+P+V   F+G
Sbjct: 352 DSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFVFKG 411

Query: 362 A-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             ++ L         S   VC  F G  +  +V I GN+ Q    V YD+    + F   
Sbjct: 412 GEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGFGYN 471

Query: 419 DC 420
            C
Sbjct: 472 GC 473


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 129/356 (36%), Positives = 179/356 (50%), Gaps = 37/356 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTP  E   V DTGSD+ W QCEPC  S CY Q  P+F+P  SSTYKSL 
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--SDCYQQSDPVFNPTSSSTYKSLT 216

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CS+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINDVALGCG 272

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
            +N GLF      +   GG  +S+ +QM+ T    FSYCLV   S K   ++F  N +  
Sbjct: 273 HDNEGLFTGAAGLLGLGGGA-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQL 326

Query: 265 GPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTL 309
           G G  + PL    K  TFY + +   SVG Q+  V  PD            +++D GT +
Sbjct: 327 GSGDATAPLLRNQKIDTFYYVGLSGFSVGGQK--VMMPDAIFDVDASGSGGVILDCGTAV 384

Query: 310 TFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VK 365
           T L  Q YNS   + +      +         + CY F+SLS  +VP V  HF G   + 
Sbjct: 385 TRLQTQAYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLD 444

Query: 366 LSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L   N+ + V ++   C  F   ++S+ I GN+ Q    + YD+  + +      C
Sbjct: 445 LPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 147/402 (36%), Positives = 200/402 (49%), Gaps = 37/402 (9%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+I RD  +       E+ Y +L      S N  N  ++  + S+   +++ I   + NY
Sbjct: 87  EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ I IGTP  +   V DTGSDL WTQCEPC  S CY Q  P F+P  SSTY+++ CSS 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
            C   + +SCS  NC YS+ YGD SF+ G LA E  TL ++      L  + FGCG NN 
Sbjct: 192 MCE--DAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGV 268
           GLF+     ++GLG G +SL +Q  TT    FSYCL   +S     + FG+ GI     V
Sbjct: 246 GLFDGVAG-LLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI--SESV 302

Query: 269 VSTPLTKAKTFYVLTID--AISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
             TP++   + +   ID   ISVG++ L +     ST   +IDSGT  T LP    + L 
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELR 362

Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSED 378
           SV    + +       G  + CY F  L  V   TI F    G  V+L  S   + +   
Sbjct: 363 SVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPIKIS 422

Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            VC  F G  +   I+GN+ QT   V YD+    V F P  C
Sbjct: 423 QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 140/450 (31%), Positives = 219/450 (48%), Gaps = 74/450 (16%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------------- 73
           G  V L H D+      + + +  Q L+ A  RS +R++     ++              
Sbjct: 44  GLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAG 97

Query: 74  -ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             S  K  Q  +   N  +L+ +S+GTP     A+ DTGSDL+WTQC+PC   +C+ Q +
Sbjct: 98  DGSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPC--VECFNQTT 155

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ--------YSVSYGDGSFSNGNLAT 184
           P+FDP  SSTY +LPCSS+ CA L   +C+  +          Y+ +YGD S + G LAT
Sbjct: 156 PVFDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLAT 215

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           ET TL         +PG+ FGCG  N G   ++  G+VGLG G +SL+SQ+      +FS
Sbjct: 216 ETFTLARQK-----VPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQLGID---RFS 267

Query: 245 YCLV---------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQ 292
           YCL          P+        +    + P   +TPL K     +FY +++  ++VG+ 
Sbjct: 268 YCLTSLDDAAGRSPLLLGSAAGISASAATAP-AQTTPLVKNPSQPSFYYVSLTGLTVGST 326

Query: 293 RLGV----------STPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SL 340
           RL +           T  +++DSGT++T+L  + Y +   + ++ M  + P  D +   L
Sbjct: 327 RLALPSSAFAIQDDGTGGVIVDSGTSITYLELRAYRALRKAFVAHM--SLPTVDASEIGL 384

Query: 341 ELCYSFNSLS-------QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVP 392
           +LC+   + +       QVP++ +HF  GAD+ L   N+ V  S      +    +  + 
Sbjct: 385 DLCFQGPAGAVDQDVQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLS 444

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           I GN  Q NF   YD+   T+SF P +C K
Sbjct: 445 IIGNFQQQNFQFVYDVAGDTLSFAPAECNK 474


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  186 bits (472), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 143/415 (34%), Positives = 203/415 (48%), Gaps = 59/415 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F+VE + R   K P YN  +T YQ   + LT  +              S ASQ      +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  RI +GTP  +   V DTGSD+ W QCEPC  + CY Q  P+F+P  SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG 
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSG 265
           +N GLF      +   GG  +S+ +QM+ T    FSYCLV   S K   ++F  N +  G
Sbjct: 274 DNEGLFTGAAGLLGLGGGV-LSITNQMKAT---SFSYCLVDRDSGKSSSLDF--NSVQLG 327

Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
            G  + PL + K   TFY + +   SVG ++  V  PD            +++D GT +T
Sbjct: 328 GGDATAPLLRNKKIDTFYYVGLSGFSVGGEK--VVLPDAIFDVDASGSGGVILDCGTAVT 385

Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKL 366
            L  Q YNS   + +   +  +  +      + CY F+SLS  +VP V  HF G   + L
Sbjct: 386 RLQTQAYNSLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDL 445

Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              N+ + V +    C  F   ++S+ I GN+ Q    + YD+ +  +      C
Sbjct: 446 PAKNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 145/427 (33%), Positives = 200/427 (46%), Gaps = 39/427 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS- 80
           EA   G  + L H     SP    + + +  L   +  R   RLN     +S   +  S 
Sbjct: 64  EALKPGVKIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN 123

Query: 81  ---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
              Q+       NY++    GTP    L + DTGSDL W QC+PC  + CY Q   +F+P
Sbjct: 124 LPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPC--ADCYSQVDAIFEP 181

Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           K SS+YK+LPC S+ C  L     N   C    C Y ++YGDGS S G+ + ET+TLGS 
Sbjct: 182 KQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSD 241

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
           + Q  A     FGCG  N GLF   ++G++GLG   +S  SQ ++   G+F+YCL P   
Sbjct: 242 SFQNFA-----FGCGHTNTGLFKG-SSGLLGLGQNSLSFPSQSKSKYGGQFAYCL-PDFG 294

Query: 253 TKINFGTNGIVSG---PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV-----STPDI 301
           +  + G+  +  G      V TPL       TFY + ++ ISVG  RL +          
Sbjct: 295 SSTSTGSFSVGKGSIPASAVFTPLVSNFMYPTFYFVGLNGISVGGDRLSIPPAVLGRGST 354

Query: 302 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 358
           ++DSGT +T  LPQ YN+ L +   S     P A P   L+ CY  +  SQV  P +T H
Sbjct: 355 IVDSGTVITRLLPQAYNA-LKTSFRSKTRDLPSAKPFSILDTCYDLSRHSQVRIPTITFH 413

Query: 359 FR-GADVKLSRSNFFVKVSE--DIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 413
           F+  ADV +S     V V      VC  F   +  +   I GN  Q    V +D     +
Sbjct: 414 FQNNADVAVSDVGILVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRI 473

Query: 414 SFKPTDC 420
            F    C
Sbjct: 474 GFASGSC 480


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 139/400 (34%), Positives = 199/400 (49%), Gaps = 41/400 (10%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           + +R    RS  R      +S+ +  S  +  D +P    YL+ ++IGTPP       DT
Sbjct: 52  ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
           GSDL+WTQC+PC  + C+ Q  P +D   SST+    C S+QC  L+      VN     
Sbjct: 111 GSDLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C +S SYGD S + G L  ETV+  +      ++PG+ FGCG NN G+F S  TGI G G
Sbjct: 168 CAFSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTK---A 276
            G +SL SQ++    G FS+C   VS  K      +   +   +G G V +TPL K    
Sbjct: 224 RGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAH 280

Query: 277 KTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
            TFY L++  I+VG+ RL V          T   +IDSGT  T LP      +    ++ 
Sbjct: 281 PTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAH 340

Query: 328 IEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSV 383
           ++   V ++ TG L LC+S   L +   VP++ +HF GA + L R N+  +  +   CS+
Sbjct: 341 VKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSI 399

Query: 384 -FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
               I   + I GN  Q N  V YD++   +SF    C K
Sbjct: 400 CLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDK 439


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  186 bits (471), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 144/427 (33%), Positives = 200/427 (46%), Gaps = 50/427 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
             S+ L H D+      +S++TP Q  +  L R   R+      ++++ S A ++     
Sbjct: 61  ALSLHLHHIDA-----LSSNKTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFS 115

Query: 84  ------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
                 +   +  Y  RI +GTP      V DTGSD++W QC PC   +CY Q  P+FDP
Sbjct: 116 SSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPC--RKCYTQADPVFDP 173

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             S TY  +PC +  C  L+   C+  N  CQY VSYGDGSF+ G+ +TET+T   T   
Sbjct: 174 TKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT 233

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            VAL     GCG +N GLF      ++GLG G +S   Q       KFSYCLV  S++  
Sbjct: 234 RVAL-----GCGHDNEGLFIGAAG-LLGLGRGRLSFPVQTGRRFNQKFSYCLVDRSASA- 286

Query: 256 NFGTNGIVSGPGVVS-----TPLT---KAKTFYVLTIDAISVGNQ----------RLGVS 297
               + +V G   VS     TPL    K  TFY L +  ISVG            RL  +
Sbjct: 287 --KPSSVVFGDSAVSRTARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAA 344

Query: 298 -TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
               ++IDSGT++T L +     L             A      + C+  + L++  VP 
Sbjct: 345 GNGGVIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPT 404

Query: 355 VTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           V +HFRGADV L  +N+ + V      C  F G  + + I GNI Q  F V +D+    V
Sbjct: 405 VVLHFRGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRV 464

Query: 414 SFKPTDC 420
            F P  C
Sbjct: 465 GFAPRGC 471


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  186 bits (471), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 174/351 (49%), Gaps = 24/351 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST    ++FG   + 
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSLA 348

Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
           +    ++TP+      TFY + +  I VG Q L +     +T   ++DSGT +T LP   
Sbjct: 349 AASARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAA 408

Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
            S+L      +        A     L+ CY F  +SQV  P V++ F+ GA + +  S  
Sbjct: 409 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 468

Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               S   VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 135/417 (32%), Positives = 202/417 (48%), Gaps = 44/417 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADIIPNN 88
           S++++H+  P S   N        L + L    +R++  +   S  S  K + A  +P  
Sbjct: 66  SLKVVHKHGPCSQL-NQQNGNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTK 124

Query: 89  A-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           +       NY++ I +G+P  + + + DTGSDL W +C            +  FDP  S+
Sbjct: 125 SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA----------AETFDPTKST 174

Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +Y ++ CS+  C+S+     N   C+   C Y + YGDGS+S G L  E +T+GST    
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD--- 231

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-I 255
                  FGCG +  GLF  K  G++GLG   +S++SQ        FSYCL   SST  +
Sbjct: 232 -IFNNFYFGCGQDVDGLFG-KAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFL 289

Query: 256 NFGTNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTL 309
           +FG++   S      TPL+    +FY L +  I+VG Q+L +     ST   +IDSGT +
Sbjct: 290 SFGSSQSKSAK---FTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVV 346

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKL 366
           T LP    S L S     + + P+  P   L+ CY F+     +VP++ I F G  DV +
Sbjct: 347 TRLPPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDV 406

Query: 367 SRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            ++  FV      VC  F G T +    I+GN  Q NF V YD+    V F P  C+
Sbjct: 407 DQAGIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 146/402 (36%), Positives = 201/402 (50%), Gaps = 37/402 (9%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+I RD  +       E+ Y +L      S N  N  ++  + S+   +++ I   + NY
Sbjct: 87  EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ I IGTP  +   V DTGSDL WTQCEPC  S CY Q  P F+P  SSTY+++ CSS 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
            C   + +SCS  NC YS+ YGD SF+ G LA E  TL ++      L  + FGCG NN 
Sbjct: 192 MCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGV 268
           GLF+     ++GLG G +SL +Q  TT    FSYCL   +S     + FG+ GI     V
Sbjct: 246 GLFDGVAG-LLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGI--SESV 302

Query: 269 VSTPLTKAKTFYVLTID--AISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
             TP++   + +   ID   ISVG++ L +     ST   +IDSGT  T LP    + L 
Sbjct: 303 KFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELR 362

Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD-VKLSRSNFFVKVSED 378
           SV    + +       G  + CY F  L  V  P +   F G+  V+L  S   + +   
Sbjct: 363 SVFKEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKIS 422

Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            VC  F G  +   I+GN+ QT   V YD+    V F P  C
Sbjct: 423 QVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  185 bits (469), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 134/433 (30%), Positives = 208/433 (48%), Gaps = 52/433 (12%)

Query: 23  EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           + + GG  + ++++HRD      + +S+    RL   L R   R+    +  S     + 
Sbjct: 64  DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 120

Query: 81  QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           + D         +   +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q 
Sbjct: 121 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 178

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
            P+FDP  S+++  + CSSS C  L    C    C+Y VSYGDGS++ G LA ET+T G 
Sbjct: 179 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 238

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV- 250
           T  ++VA+     GCG  N G+F      ++GLGGG +S + Q+     G FSYCLV   
Sbjct: 239 TMVRSVAI-----GCGHRNRGMFVGAAG-LLGLGGGSMSFVGQLGGQTGGAFSYCLVSRG 292

Query: 251 --SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP------ 299
             SS  + FG   + +G   V  PL    +A +FY + +  + VG  R+ +S        
Sbjct: 293 TDSSGSLVFGREALPAGAAWV--PLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTE 350

Query: 300 ----DIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-- 349
                +V+D+GT +T LP    Q +    L+  +++  A  VA      + CY       
Sbjct: 351 LGDGGVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVS 406

Query: 350 SQVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            +VP V+ +F G  +  L   NF + + +    C  F   T+ + I GNI Q    + +D
Sbjct: 407 VRVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 466

Query: 408 IEQQTVSFKPTDC 420
                V F P  C
Sbjct: 467 GANGYVGFGPNIC 479


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  184 bits (468), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 173/351 (49%), Gaps = 24/351 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 174 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPVRSSTYANV 232

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 233 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 288

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIV 263
           G  N GLF  +  G++GLG G  SL  Q      G F++CL P  ST    ++FG     
Sbjct: 289 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCL-PARSTGTGYLDFGAGSPA 346

Query: 264 SGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGY 316
           +    ++TP+      TFY + +  I VG Q L +     +T   ++DSGT +T LP   
Sbjct: 347 AASARLTTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPA 406

Query: 317 NSNL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNF 371
            S+L      +        A     L+ CY F  +SQV  P V++ F+ GA + +  S  
Sbjct: 407 YSSLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGI 466

Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               S   VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 467 MYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 148/424 (34%), Positives = 199/424 (46%), Gaps = 50/424 (11%)

Query: 31  VELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQAD 83
           + L H+  P +P   SS  TP   + D L     R  +  +  S      +  SKA  A 
Sbjct: 67  LRLTHKHGPCAPSRASSLATP--SVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEAAT 124

Query: 84  I-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             +P N        NY++ +S+GTP   +    DTGSDL W QC PC    CY Q  PLF
Sbjct: 125 ATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLF 184

Query: 136 DPKMSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           DP  SS+Y ++PC    C  L     SCS   C Y VSYGDGS + G  +++T+TL    
Sbjct: 185 DPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
               A+ G  FGCG    G   +   G++GLG  + SL+ Q   T  G FSYCL P   +
Sbjct: 245 ----AVRGFFFGCGHAQSGFTGND--GLLGLGREEASLVEQTAGTYGGVFSYCL-PTRPS 297

Query: 254 KINFGTNGIVSG---PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----DIVI 303
              + T G  SG   PG  +T L     A T+YV+ +  ISVG Q+L V +       V+
Sbjct: 298 TTGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVV 357

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           D+GT +T LP    + L S   S + +   P A  TG L+ CY+F+    V  P V + F
Sbjct: 358 DTGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTF 417

Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L              C  F   G    + I GN+ Q +F V   I+  +V FK
Sbjct: 418 SGGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 470

Query: 417 PTDC 420
           P+ C
Sbjct: 471 PSSC 474


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  184 bits (467), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 131/348 (37%), Positives = 188/348 (54%), Gaps = 29/348 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y +  S+GTPP +  A+ADTGSDLIW +C     + C  Q SP + P  SST+  LPCS 
Sbjct: 91  YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150

Query: 151 SQCASLNQKS-----CSGVNCQYSVSYG----DGSFSNGNLATETVTLGSTTGQAVALPG 201
             C+ L   S      +G  C Y  SYG    D  ++ G LA ET TLG     A A+P 
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-----ADAVPS 205

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS--TKINFGT 259
           + FGC T          +G+VGLG G +SL+SQ+    A  F YCL   +S  + + FG+
Sbjct: 206 VRFGC-TTASEGGYGSGSGLVGLGRGPLSLVSQLN---ASTFMYCLTSDASKASPLLFGS 261

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTPD-IVIDSGTTLTFLPQGYN 317
              ++G  V ST L  + TFY + + +IS+G+    GV  P+ +V DSGTTLT+L +   
Sbjct: 262 LASLTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTTLTYLAEPAY 321

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEVTIHFRGADVKLSRSNFF 372
           S   +   S      V D  G  E C+        S + VP + +HF GAD+ L  +N+ 
Sbjct: 322 SEAKAAFLSQTSLDQVEDTDG-FEACFQKPANGRLSNAAVPTMVLHFDGADMALPVANYV 380

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V+V + +VC + +  + S+ I GNIMQ N+LV +D+ +  +SF+P +C
Sbjct: 381 VEVEDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 130/357 (36%), Positives = 179/357 (50%), Gaps = 35/357 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTPP     V DTGSD++W QC+PC  ++CY Q   +FDP  S ++  +P
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPC--TKCYSQTDQIFDPSKSKSFAGIP 184

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C S  C  L+   CS  N  CQY VSYGDGSF+ G+ +TET+T      +  A+P +  G
Sbjct: 185 CYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIG 239

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG +N GLF      ++GLG G +S  +Q  T    KFSYCL   +++      + IV G
Sbjct: 240 CGHDNEGLFVGAAG-LLGLGRGGLSFPTQTGTRFNNKFSYCLTDRTASA---KPSSIVFG 295

Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
              VS     TPL    K  TFY + +  ISVG   + G+S             ++IDSG
Sbjct: 296 DSAVSRTARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSG 355

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
           T++T L +    +L             A      + CY  + LS+  VP V +HFRGADV
Sbjct: 356 TSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRGADV 415

Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L  +N+ V V      C  F G  + + I GNI Q  F V +D+    V F P  C
Sbjct: 416 SLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  184 bits (467), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 133/421 (31%), Positives = 197/421 (46%), Gaps = 40/421 (9%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
           ++HR  P SP           +  A  L R   R++  ++         S +  ++AS  
Sbjct: 73  VVHRHGPCSPVQARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132

Query: 81  ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
                 Q  I     NY++ + +GTP  +   + DTGSDL W QC+PC  + CY Q  PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           FDP +SSTY ++ C + +C  L+   CS    C+Y V YGD S ++GNL  +T+TL ++ 
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
                LPG  FGCG  N GLF  +  G+ GLG   +SL SQ   +    F+YCL P SS+
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-PSSSS 304

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGV------STPDIVIDS 305
              + + G         T L    T  FY + +  I VG + + +      +    VIDS
Sbjct: 305 GRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDS 364

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GA 362
           GT +T LP    + L +  +  +     A     L+ CY F  +  +Q+P V + F  GA
Sbjct: 365 GTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGA 424

Query: 363 DVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            V L  +   +  KVS+  +        +S+ I GN  Q  F V YD+  Q + F    C
Sbjct: 425 TVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGC 484

Query: 421 T 421
           +
Sbjct: 485 S 485


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  184 bits (466), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 178/360 (49%), Gaps = 39/360 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI IG+P  +   V DTGSD+ W QC PC  + CY Q  PLFDP +SS+Y ++P
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVP 250

Query: 148 CSSSQCASLNQKSCS------GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C S  C +L+  +C         +C Y V+YGDGS++ G+ ATET+TLG     AV    
Sbjct: 251 CDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--D 308

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG 258
           +  GCG +N GLF      ++ LGGG +S  SQ+  T   +FSYCLV     S++ + FG
Sbjct: 309 VAIGCGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---EFSYCLVDRDSPSASTLQFG 364

Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
                S    V+ PL    ++ TFY + ++ ISVG + L    P            +++D
Sbjct: 365 ----ASDSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVD 420

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 361
           SGT +T L     S L        +A P A      + CY     S  QVP V++ F  G
Sbjct: 421 SGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLRFEGG 480

Query: 362 ADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            ++KL   N+ + V      C  F     +V I GN+ Q    V +D  + TV F P  C
Sbjct: 481 GELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPNKC 540


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  184 bits (466), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 177/354 (50%), Gaps = 31/354 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +G+P      + DTGS L W QC+PC    C++Q  PLFDP  S TYKSL 
Sbjct: 10  SGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 68

Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+SSQC     A+LN   C  S   C Y+ SYGD S+S G L+ + +TL  +      LP
Sbjct: 69  CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 124

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
           G  +GCG ++ GLF  +  GI+GLG   +S++ Q+ +     FSYCL             
Sbjct: 125 GFVYGCGQDSEGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSIGK 183

Query: 261 GIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
             ++G     TP+T      + Y L + AI+VG + LGV+        +IDSGT +T LP
Sbjct: 184 ASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRLP 243

Query: 314 QG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR-GADVKL 366
                 +    + +MSS     P       L+ C+  N   +  VPEV + F+ GAD+ L
Sbjct: 244 MSVYTPFQQAFVKIMSSKYARAPGFS---ILDTCFKGNLKDMQSVPEVRLIFQGGADLNL 300

Query: 367 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              N  ++V E + C  F G  N V I GN  Q  F V +DI    + F    C
Sbjct: 301 RPVNVLLQVDEGLTCLAFAG-NNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 139/400 (34%), Positives = 198/400 (49%), Gaps = 41/400 (10%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           + +R    RS  R      +S+ +  S  +  D +P    YL+ ++IGTPP       DT
Sbjct: 52  ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
           GS L+WTQC+PC  + C+ Q  P +D   SST+    C S+QC  L+      VN     
Sbjct: 111 GSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C YS SYGD S + G L  ETV+  +      ++PG+ FGCG NN G+F S  TGI G G
Sbjct: 168 CAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTK---A 276
            G +SL SQ++    G FS+C   VS  K      +   +   +G G V +TPL K    
Sbjct: 224 RGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPADLYKNGRGTVQTTPLIKNPAH 280

Query: 277 KTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
            TFY L++  I+VG+ RL V          T   +IDSGT  T LP      +    ++ 
Sbjct: 281 PTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSGTAFTSLPPRVYRLVHDEFAAH 340

Query: 328 IEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSV 383
           ++   V ++ TG L LC+S   L +   VP++ +HF GA + L R N+  +  +   CS+
Sbjct: 341 VKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGATMHLPRENYVFEAKDGGNCSI 399

Query: 384 -FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
               I   + I GN  Q N  V YD++   +SF    C K
Sbjct: 400 CLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCDK 439


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 133/421 (31%), Positives = 197/421 (46%), Gaps = 40/421 (9%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
           ++HR  P SP           +  A  L R   R++  ++         S +  ++AS  
Sbjct: 73  VVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132

Query: 81  ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
                 Q  I     NY++ + +GTP  +   + DTGSDL W QC+PC  + CY Q  PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           FDP +SSTY ++ C + +C  L+   CS    C+Y V YGD S ++GNL  +T+TL ++ 
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
                LPG  FGCG  N GLF  +  G+ GLG   +SL SQ   +    F+YCL P SS+
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQGAPSYGPGFTYCL-PSSSS 304

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGV------STPDIVIDS 305
              + + G         T L    T  FY + +  I VG + + +      +    VIDS
Sbjct: 305 GRGYLSLGGAPPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGGTVIDS 364

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GA 362
           GT +T LP    + L +  +  +     A     L+ CY F  +  +Q+P V + F  GA
Sbjct: 365 GTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAFAGGA 424

Query: 363 DVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            V L  +   +  KVS+  +        +S+ I GN  Q  F V YD+  Q + F    C
Sbjct: 425 TVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGC 484

Query: 421 T 421
           +
Sbjct: 485 S 485


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 207/435 (47%), Gaps = 53/435 (12%)

Query: 29  FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQ--------NSSISSSKA 79
           +SV+++HRDS       N++ +  +RL + L R   R+    Q        N   + S  
Sbjct: 114 WSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHE 173

Query: 80  SQADIIPN------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           + A++               +  Y  RI +GTP  E+  V DTGSD++W QCEPC  S+C
Sbjct: 174 NVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPC--SKC 231

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P +S+++ +L C+S+ C+ L+  +C G  C Y VSYGDGS++ G+ ATE +
Sbjct: 232 YSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEML 291

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G+T+ + VA+     GCG +N GLF      ++GLG G +S  SQ+ T     FSYCL
Sbjct: 292 TFGTTSVRNVAI-----GCGHDNAGLFVGAAG-LLGLGAGLLSFPSQLGTQTGRAFSYCL 345

Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
           V     SS  + FG   +  G   + TPL       TFY + + +ISVG   L    PD+
Sbjct: 346 VDRFSESSGTLEFGPESVPLGS--ILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDV 403

Query: 302 ------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
                       ++DSGT +T L       +     +     P A+     + CY  + L
Sbjct: 404 FRIDETSGRGGFIVDSGTAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSIFDTCYDLSGL 463

Query: 350 S--QVPEVTIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
               VP V  HF  GA + L   N+ + +      C  F   T+ + I GNI Q    V 
Sbjct: 464 PLVNVPTVVFHFSNGASLILPAKNYMIPMDFMGTFCFAFAPATSDLSIMGNIQQQGIRVS 523

Query: 406 YDIEQQTVSFKPTDC 420
           +D     V F    C
Sbjct: 524 FDTANSLVGFALRQC 538


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 139/438 (31%), Positives = 215/438 (49%), Gaps = 53/438 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQA 82
             + L+HRDS  +    ++E   +RL+    R+   ++    N +      +S+ +   A
Sbjct: 64  LHIHLLHRDS-FAVNATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122

Query: 83  DII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            ++   P +  Y+ +I++GTP  + L   DT SDL W QC+PC   +CY Q  P+FDP+ 
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPRH 180

Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG----SFSNGNLATETVTLGST 192
           S++Y  +   +  C +L +          C Y+V YGDG    S S G+L  ET+T    
Sbjct: 181 STSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG 240

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGKFSYCLV--- 248
             QA     ++ GCG +N GLF +   GI+GLG G IS+  Q+        FSYCLV   
Sbjct: 241 VRQAY----LSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQIAFLGYNASFSYCLVDFI 296

Query: 249 --PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD- 300
             P S S+ + FG   + + P    TP        TFY + +  +SVG  R+ GV+  D 
Sbjct: 297 SGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTERDL 356

Query: 301 ----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFN 347
                     +++DSGTT+T L +  Y +   +  ++      V+   P+G  + CY+  
Sbjct: 357 QLDPYTGRGGVILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVG 416

Query: 348 SLS--QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNF 402
             +  +VP V++HF G  +V L   N+ + V S   VC  F G  + SV + GNI+Q  F
Sbjct: 417 GRAGVKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGF 476

Query: 403 LVGYDIEQQTVSFKPTDC 420
            V YD+  Q V F P +C
Sbjct: 477 RVVYDLAGQRVGFAPNNC 494


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  183 bits (465), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 129/357 (36%), Positives = 179/357 (50%), Gaps = 35/357 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG +N GLF      ++GLG G +S   Q       KFSYCLV  S++      + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307

Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
              VS     TPL    K  TFY + +  ISVG  R+ GV+             ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSG 367

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
           T++T L +     +        +A   A      + C+  +++++  VP V +HFRGADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 427

Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L  +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P  C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 140/428 (32%), Positives = 201/428 (46%), Gaps = 46/428 (10%)

Query: 30  SVELIHRDSP---------------KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI 74
           S+E++H+  P                +   N      + ++  L+++L R N   +  S 
Sbjct: 62  SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDST 121

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +    S + I   +ANY + + +GTP  +   V DTGSDL WTQCEPC  S CY Q   +
Sbjct: 122 TLPAKSGSLI--GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGS-CYKQQDAI 178

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK------SCSGVNCQYSVSYGDGSFSNGNLATETVT 188
           FDP  SS+Y ++ C+SS C  L         S S   C Y + YGD S S G L+ E +T
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLT 238

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           + +T      +    FGCG +N GLF S + G++GLG   IS + Q  +     FSYCL 
Sbjct: 239 ITATD----IVDDFLFGCGQDNEGLF-SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLP 293

Query: 249 PVSST--KINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRL-GVSTPDI- 301
             SS+   + FG +   +   +  TPL+      TFY L I  ISVG  +L  VS+    
Sbjct: 294 STSSSLGHLTFGASA-ATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFS 352

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
               +IDSGT +T L     + L S     +E  PVA+  G  + CY F+   +  VP++
Sbjct: 353 AGGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKI 412

Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQT 412
              F G   V+L      +  S   VC  F   G  N + I+GN+ Q    V YD+E   
Sbjct: 413 DFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGR 472

Query: 413 VSFKPTDC 420
           + F    C
Sbjct: 473 IGFGAAGC 480


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 45  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC   +   +N  
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 220

Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
             G  GL G     +S +    AG FSYCL P   TK               ++G GV S
Sbjct: 221 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 278

Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
           TP   +       T+Y L +  ISVG   L +           T  ++IDSGTT+T L  
Sbjct: 279 TPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 338

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
                + + + S+++  PV D + +  L+LC++  S S     +P +T+HF  GAD+ L 
Sbjct: 339 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 397

Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N+ + +   + C   +  T+  +   GN  Q N  + YD++++T+SF P  C+
Sbjct: 398 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 131/392 (33%), Positives = 196/392 (50%), Gaps = 32/392 (8%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTG 111
           + ++  L+++L R N      S  ++  +++  +  +ANY++ + +GTP  +   V DTG
Sbjct: 9   KYIQSRLSKNLGRENTVKDLDS--TTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTG 66

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN----QKSCSG---V 164
           SDL WTQCEPC  S CY Q   +FDP  SS+Y ++ C+SS C  L     +  CS     
Sbjct: 67  SDLTWTQCEPCAGS-CYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDA 125

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
           +C Y   YGD S S G L+ E +T+ +T      +    FGCG +N GLFN  + G++GL
Sbjct: 126 SCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNG-SAGLMGL 180

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSST--KINFGTNGIVSGPGVVSTPLTKA---KTF 279
           G   IS++ Q  +     FSYCL   SS+   + FG +   +   ++ TPL+      +F
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASA-ATNASLIYTPLSTISGDNSF 239

Query: 280 YVLTIDAISVGNQRL-GVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
           Y L I +ISVG  +L  VS+        +IDSGT +T L     + L S     +E  PV
Sbjct: 240 YGLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPV 299

Query: 334 ADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGIT 388
           A+  G L+ CY  +   +  VP +   F G   V+L         SE  VC  F   G  
Sbjct: 300 ANEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSD 359

Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           N + ++GN+ Q    V YD++   + F    C
Sbjct: 360 NDITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 122/355 (34%), Positives = 183/355 (51%), Gaps = 28/355 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +GTPP     + DTGS L W QC+PC    C+ Q  PL+DP +S TYK L 
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLS 180

Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+S +C     A+LN   C   +  C Y+ SYGD SFS G L+ + +TL S+      LP
Sbjct: 181 CASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLP 236

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFG 258
             T+GCG +N GLF  +  GI+GL    +S+++Q+ T     FSYCL      S+   F 
Sbjct: 237 QFTYGCGQDNQGLFG-RAAGIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFL 295

Query: 259 TNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP----DIVIDSGTTLTF 311
           + G +S      TP+   +K  + Y L + AI+V  + L ++        +IDSGT +T 
Sbjct: 296 SIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITR 355

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIHFR-GADVKLS 367
           LP    + L      ++  +    P  S L+ C+  S  S+S VPE+ + F+ GAD+ L 
Sbjct: 356 LPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLR 415

Query: 368 RSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +  ++  + I C  F G   TN + I GN  Q  + + YD+    + F P  C
Sbjct: 416 APSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 470


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 146/451 (32%), Positives = 213/451 (47%), Gaps = 58/451 (12%)

Query: 18  VVSPIEAQT---GGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSS 73
           VV P + +T     +S+ L+HRD+ K     ++E  Y +R++  L R   R+   N    
Sbjct: 45  VVQPAKEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLE 104

Query: 74  IS-------------------SSKASQADII----PNNANYLIRISIGTPPTERLAVADT 110
           ++                   +    Q+ ++      +  Y  RI +G P  ++L V DT
Sbjct: 105 LAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDT 164

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYS 169
           GSD+ W QCEPC  S CY Q  P+++P +SS+YK + C ++ C  L+   CS   +C Y 
Sbjct: 165 GSDVTWIQCEPC--SDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQ 222

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
           VSYGDGS++ GN ATET+TLG    Q VA+     GCG +N GLF      ++GLGGG +
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAI-----GCGHDNEGLFVGAAG-LLGLGGGSL 276

Query: 230 SLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLT 283
           S  SQ+       FSYCLV     SS+ + FG   + +  G V  P+ K     TFY ++
Sbjct: 277 SFPSQLTDENGKIFSYCLVDRDSESSSTLQFGRAAVPN--GAVLAPMLKNSRLDTFYYVS 334

Query: 284 IDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
           +  ISVG + L +S             +++DSGT +T L      +L     +  +  P 
Sbjct: 335 LSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPS 394

Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITN 389
            D     + CY  +S     VP V  HF  G  + L   N+ V V S    C  F   ++
Sbjct: 395 TDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSS 454

Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S+ I GNI Q    V +D     V F    C
Sbjct: 455 SLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 130/374 (34%), Positives = 194/374 (51%), Gaps = 40/374 (10%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  +  +   YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P F P 
Sbjct: 80  AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            S+TY+ +PC S  CA+L   +C   + C Y   YGD + + G LA+ET T G+     V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197

Query: 198 ALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---T 253
            +  + FGCG  N+G L NS  +G+VGLG G +SL+SQ+  +   +FSYCL    S   +
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPS 252

Query: 254 KINF-------GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS------ 297
           ++NF       GTN   SG  V STPL       + Y +++  IS+G +RL +       
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312

Query: 298 ----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NS 348
               T  + IDSGT+LT+L Q  Y++    ++S +    P  D    LE C+ +    + 
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRRELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372

Query: 349 LSQVPEVTIHFR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
              VP++ +HF  GA++ +   N+  +  +   +C       ++  I GN  Q N  + Y
Sbjct: 373 AVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILY 431

Query: 407 DIEQQTVSFKPTDC 420
           DI    +SF P  C
Sbjct: 432 DIANSLLSFVPAPC 445


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 130/374 (34%), Positives = 194/374 (51%), Gaps = 40/374 (10%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  +  +   YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P F P 
Sbjct: 80  AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            S+TY+ +PC S  CA+L   +C   + C Y   YGD + + G LA+ET T G+     V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197

Query: 198 ALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---T 253
            +  + FGCG  N+G L NS  +G+VGLG G +SL+SQ+  +   +FSYCL    S   +
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQLGPS---RFSYCLTSFLSPEPS 252

Query: 254 KINF-------GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVS------ 297
           ++NF       GTN   SG  V STPL       + Y +++  IS+G +RL +       
Sbjct: 253 RLNFGVFATLNGTNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAI 312

Query: 298 ----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NS 348
               T  + IDSGT+LT+L Q  Y++    ++S +    P  D    LE C+ +    + 
Sbjct: 313 NDDGTGGVFIDSGTSLTWLQQDAYDAVRHELVSVLRPLPPTNDTEIGLETCFPWPPPPSV 372

Query: 349 LSQVPEVTIHFR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
              VP++ +HF  GA++ +   N+  +  +   +C       ++  I GN  Q N  + Y
Sbjct: 373 AVTVPDMELHFDGGANMTVPPENYMLIDGATGFLCLAMIRSGDAT-IIGNYQQQNMHILY 431

Query: 407 DIEQQTVSFKPTDC 420
           DI    +SF P  C
Sbjct: 432 DIANSLLSFVPAPC 445


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  182 bits (463), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 138/424 (32%), Positives = 202/424 (47%), Gaps = 40/424 (9%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-----NQNSSISS--SKASQADII 85
           ++HR  P SP     + P     D L +   R++       N+ S++    S  ++  I 
Sbjct: 91  VMHRHGPCSPLQTPGDAPSDA--DLLDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
               NY++ + +GTP  +   V DTGSDL W QC PC    CY Q  PLF P  SST+ +
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208

Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVA-- 198
           + C + +C +  ++SC G      C Y V YGD S + G+L  +T+TLG+     A A  
Sbjct: 209 VRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEN 266

Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
              LPG  FGCG NN GLF  +  G+ GLG G +SL SQ        FSYCL   SS+  
Sbjct: 267 DNKLPGFVFGCGENNTGLFG-QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325

Query: 256 NFGTNGI-VSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
            + + G  V  P     TP+   T   +FY + +  I V  + + VS+P +    ++DSG
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPLIVDSG 385

Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHFR 360
           T +T L P+ Y +   + +S+M +      P  S L+ CY F    N+   +P V + F 
Sbjct: 386 TVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVFA 445

Query: 361 GA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           G     V  S   +  KV++  +     G   S  I GN  Q    V YD+ +Q + F  
Sbjct: 446 GGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFAA 505

Query: 418 TDCT 421
             C+
Sbjct: 506 KGCS 509


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 131/361 (36%), Positives = 183/361 (50%), Gaps = 39/361 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGS L+WTQC+PC  + C+ Q  P +D   SST+    C 
Sbjct: 34  EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCD 91

Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           S+QC  L+      VN     C YS SYGD S + G L  ETV+  +      ++PG+ F
Sbjct: 92  STQC-KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVF 146

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGT 259
           GCG NN G+F S  TGI G G G +SL SQ++    G FS+C   VS  K      +   
Sbjct: 147 GCGLNNTGIFRSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVSGRKPSTVLFDLPA 203

Query: 260 NGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDSG 306
           +   +G G V +TPL K     TFY L++  I+VG+ RL V          T   +IDSG
Sbjct: 204 DLYKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALKNGTGGTIIDSG 263

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTIHFRGA 362
           T  T LP      +    ++ ++   V ++ TG L LC+S   L +   VP++ +HF GA
Sbjct: 264 TAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVLHFEGA 322

Query: 363 DVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + L R N+  +  +   CS+    I   + I GN  Q N  V YD++   +SF    C 
Sbjct: 323 TMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFVRAKCD 382

Query: 422 K 422
           K
Sbjct: 383 K 383


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 45  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC   +   +N  
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 220

Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
             G  GL G     +S +    AG FSYCL P   TK               ++G GV S
Sbjct: 221 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 278

Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
           TP   +       T+Y L +  ISVG   L +           T  ++IDSGTT+T L  
Sbjct: 279 TPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 338

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
                + + + S+++  PV D + +  L+LC++  S S     +P +T+HF  GAD+ L 
Sbjct: 339 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 397

Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N+ + +   + C   +  T+  +   GN  Q N  + YD++++T+SF P  C+
Sbjct: 398 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 451


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 139/415 (33%), Positives = 214/415 (51%), Gaps = 53/415 (12%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 50  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 109

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 110 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 168

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC   +   +N  
Sbjct: 169 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGCSNASSDDWN-- 225

Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN-------FGTNGIVSGPGVVS 270
             G  GL G     +S +    AG FSYCL P   TK               ++G GV S
Sbjct: 226 --GSAGLVGLGRGGLSLVSQLAAGMFSYCLTPFQDTKSKSTLLLGPAAAAAALNGTGVRS 283

Query: 271 TPLTKA------KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
           TP   +       T+Y L +  ISVG   L +           T  ++IDSGTT+T L  
Sbjct: 284 TPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRADGTGGLIIDSGTTITSLVD 343

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEVTIHF-RGADVKLS 367
                + + + S+++  PV D + +  L+LC++  S S     +P +T+HF  GAD+ L 
Sbjct: 344 AAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPSMTLHFGGGADMVLP 402

Query: 368 RSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N+ + +   + C   +  T+  +   GN  Q N  + YD++++T+SF P  C+
Sbjct: 403 VENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKETLSFAPAKCS 456


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  182 bits (462), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 139/406 (34%), Positives = 191/406 (47%), Gaps = 41/406 (10%)

Query: 45  NSSETPYQRLRDALTRSLNRLN------HFNQNSSISSSKASQADIIPNNANYLIRISIG 98
           +S++TP Q     L R   R+       H  +++  S S +  + +   +  Y  RI +G
Sbjct: 66  SSNKTPEQLFHLRLQRDAKRVEALLNQIHARRSAGSSFSSSIISGLAQGSGEYFTRIGVG 125

Query: 99  TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           TP      V DTGSD++W QC PC   +CY Q   +FDP  S TY  +PC +  C  L+ 
Sbjct: 126 TPARYVYMVLDTGSDVVWLQCAPC--RKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183

Query: 159 KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
             CS  N  CQY VSYGDGSF+ G+ +TET+T        VAL     GCG +N GLF  
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVAL-----GCGHDNEGLFTG 238

Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-----T 271
               ++GLG G +S   Q       KFSYCLV  S++      + ++ G   VS     T
Sbjct: 239 AAG-LLGLGRGRLSFPVQTGRRFNHKFSYCLVDRSASA---KPSSVIFGDSAVSRTAHFT 294

Query: 272 PLT---KAKTFYVLTIDAISVGNQ----------RLGVS-TPDIVIDSGTTLTFLPQGYN 317
           PL    K  TFY L +  ISVG            RL  +    ++IDSGT++T L +   
Sbjct: 295 PLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTSVTRLTRPAY 354

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV 375
             L             A      + C+  + L++  VP V +HFRGADV L  +N+ + V
Sbjct: 355 IALRDAFRIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPV 414

Query: 376 SED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 C  F G  + + I GNI Q  F + YD+    V F P  C
Sbjct: 415 DNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  182 bits (461), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 126/350 (36%), Positives = 182/350 (52%), Gaps = 28/350 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+FDPK SS+Y ++ CS
Sbjct: 116 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCS 174

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S QC     A+LN   CS  N C Y  SYGD SFS G L+ +TV+ G     A ++P   
Sbjct: 175 SPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG-----ANSVPNFY 229

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  T+   FSYCL   SS+   + + G  
Sbjct: 230 YGCGQDNEGLFG-RSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSS--GYLSIGSY 286

Query: 264 SGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQG 315
           +  G   TP+   T   + Y +++  ++V  + L VS+ +      +IDSGT +T LP  
Sbjct: 287 NPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPTS 346

Query: 316 -YNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFR-GADVKLSRSNF 371
            Y +   +V ++M  +   A     L+ C+    + L  VP V++ F  GA +KLS  N 
Sbjct: 347 VYTALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNL 406

Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            V V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 407 LVDVDGATTCLAF-APARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  182 bits (461), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 131/384 (34%), Positives = 195/384 (50%), Gaps = 33/384 (8%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDL 114
           T ++  L   N ++++  S AS   + P  +    NY+ R+ +GTP    + V DTGS L
Sbjct: 102 TVTVASLYRANDDAAVDGSLAS-VPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSL 160

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQY 168
            W QC PC  S C+ Q  P+FDPK SS+Y ++ CS+ QC     A+LN  +CS  + C Y
Sbjct: 161 TWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
             SYGD SFS G L+ +TV+ GS +     +P   +GCG +N GLF  ++ G++GL    
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYYGCGQDNEGLFG-RSAGLMGLARNK 273

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPL---TKAKTFYVLTI 284
           +SL+ Q+  T+   FSYCL    S+  +   +     PG  S TP+   T   + Y + +
Sbjct: 274 LSLLYQLAPTLGYSFSYCL---PSSSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKL 330

Query: 285 DAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
             ++V  + L VS+ +      +IDSGT +T LP      L   ++  ++    AD    
Sbjct: 331 SGMTVAGKPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSI 390

Query: 340 LELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 397
           L+ C+   + S +VP V++ F  GA +KLS  N  V V     C  F     S  I GN 
Sbjct: 391 LDTCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTCLAF-APARSAAIIGNT 449

Query: 398 MQTNFLVGYDIEQQTVSFKPTDCT 421
            Q  F V YD++   + F    CT
Sbjct: 450 QQQTFSVVYDVKSNRIGFAAGGCT 473


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  181 bits (460), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 143/395 (36%), Positives = 192/395 (48%), Gaps = 43/395 (10%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
           + LTRS +R        +   S+  QA ++      +  Y IRIS+GTPP     V DTG
Sbjct: 24  NGLTRSRSR-----DRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTG 78

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
           SD++W QC PC    CY Q   +FDP  SSTY +L CS+ QC +L+  +C    C Y V 
Sbjct: 79  SDILWLQCAPC--VNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVD 136

Query: 172 YGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
           YGDGSF+ G   T+ V+L ST+G   V L  I  GCG +N G F      ++GLG G +S
Sbjct: 137 YGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYFVGAAG-LLGLGKGPLS 195

Query: 231 LISQMRTTIAGKFSYCLVP-----VSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVL 282
             +Q+     G+FSYCL          + + FG    V   G   TP     +  TFY L
Sbjct: 196 FPNQVDPQNGGRFSYCLTDRETDSTEGSSLVFG-EAAVPPAGARFTPQDSNMRVPTFYYL 254

Query: 283 TIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
            +  ISVG   L + T            ++IDSGT++T L    N+   S+  +      
Sbjct: 255 KMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGTSVTRLQ---NAAYASLRDAFRAGTS 311

Query: 333 VADPTGSLEL---CYSFNSLS--QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFK 385
              PT    L   CY  + L+   VP VT+HF+G  D+KL  SN+ + V + +  C  F 
Sbjct: 312 DLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQGGTDLKLPASNYLIPVDNSNTFCLAFA 371

Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           G T    I GNI Q  F V YD     V F P+ C
Sbjct: 372 GTTGP-SIIGNIQQQGFRVIYDNLHNQVGFVPSQC 405


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 145/426 (34%), Positives = 208/426 (48%), Gaps = 48/426 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
           GF   L H D+      ++  T  Q L  AL RS  R+      ++++   A  A    +
Sbjct: 30  GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP  S+TY+
Sbjct: 84  LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           SL C+S  C +L    C    C Y   YGD + + G LA ET T G T    V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
           GCG  N GL  +  +G+VG G G +SL+SQ+ +    +FSYCL     PV S ++ FG  
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPS-RLYFGVY 255

Query: 261 GIV-----SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
             +     S   V STP        T Y L +  ISVG   L +            T   
Sbjct: 256 ATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGT 315

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 355
           +IDSGTT+T+L +     + +  +S I   P+ + T +  L+ C+ +    +    +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374

Query: 356 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
            +HF GAD +L   N+  V  S      +    ++   I G+    NF V YD+E   +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434

Query: 415 FKPTDC 420
           F P  C
Sbjct: 435 FVPAPC 440


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  181 bits (460), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 148/439 (33%), Positives = 212/439 (48%), Gaps = 56/439 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNR---------LNHFNQ 70
           G  + L H  SP SP    S+ P+         R+    +R  N          L H ++
Sbjct: 42  GLHLTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHR 101

Query: 71  NSSISSSKASQAD-----IIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
                    SQA      + P  +    NY+ R+ +GTP T  + V DTGS L W QC P
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSP 161

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDG 175
           C  S C+ Q  P+FDP+ S TY ++ CSSS+C     A+LN  +CS  N C Y  SYGD 
Sbjct: 162 CSVS-CHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDS 220

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+S G L+ +TV+ GS +      PG  +GCG +N GLF  ++ G++GL    +SL+ Q+
Sbjct: 221 SYSVGYLSKDTVSFGSGS-----FPGFYYGCGQDNEGLFG-RSAGLIGLAKNKLSLLYQL 274

Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGN 291
             ++   FSYCL P SS    + + G  + PG  S TP+  +    + Y +T+  ISV  
Sbjct: 275 APSLGYAFSYCL-PTSSAAAGYLSIGSYN-PGQYSYTPMASSSLDASLYFVTLSGISVAG 332

Query: 292 QRLGV------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY 344
             L V      S P I IDSGT +T LP    + L   +++ + +     PT S L+ C+
Sbjct: 333 APLAVPPSEYRSLPTI-IDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCF 391

Query: 345 SFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
             ++   +VP V + F  GA + LS  N  + V +   C  F   T    I GN  Q  F
Sbjct: 392 RGSAAGLRVPRVDMAFAGGATLALSPGNVLIDVDDSTTCLAFA-PTGGTAIIGNTQQQTF 450

Query: 403 LVGYDIEQQTVSFKPTDCT 421
            V YD+ Q  + F    C+
Sbjct: 451 SVVYDVAQSRIGFAAGGCS 469


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 143/428 (33%), Positives = 207/428 (48%), Gaps = 45/428 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL---RDALTRSLN-RLNHFNQNSSISSSKASQAD 83
           G  +EL H  SP SP    ++ P+  +    DA   SL  RL       + S    + A 
Sbjct: 42  GLHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAG 101

Query: 84  IIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +  + A             NY+ R+ +GTP T+ + V DTGS L W QC PC  S C+ Q
Sbjct: 102 LAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQ 160

Query: 131 DSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
             P+F+PK SSTY S+ CS+ QC     A+LN  +CS  N C Y  SYGD SFS G L+ 
Sbjct: 161 SGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSK 220

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           +TV+ GST+     LP   +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   F+
Sbjct: 221 DTVSFGSTS-----LPNFYYGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFT 274

Query: 245 YCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD 300
           YCL    S+  +   +     PG  S TP+  +    + Y + +  ++V    L VS+  
Sbjct: 275 YCL---PSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSA 331

Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPE 354
                 +IDSGT +T LP    S L   +++ ++    A     L+ C+    S    P 
Sbjct: 332 YSSLPTIIDSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPA 391

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           VT+ F  GA +KLS  N  V V +   C  F     S  I GN  Q  F V YD++   +
Sbjct: 392 VTMSFAGGAALKLSAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRI 450

Query: 414 SFKPTDCT 421
            F    C+
Sbjct: 451 GFAAGGCS 458


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 137/391 (35%), Positives = 190/391 (48%), Gaps = 36/391 (9%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
           RL   L R  N   H  ++++   + A Q  ++   +     Y +R+ IG PP++   V 
Sbjct: 107 RLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSD+ W QC PC  S+CY Q  P+FDP  S++Y  + C + QC SL+   C    C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLY 224

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
            VSYGDGS++ G  ATETVTLG+   + VA+     GCG NN GLF     G++GLGGG 
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGTAAVENVAI-----GCGHNNEGLF-VGAAGLLGLGGGK 278

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTID 285
           +S  +Q+  T    FSYCLV   S  ++           VV+ PL +     TFY L + 
Sbjct: 279 LSFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRNVVTAPLRRNPELDTFYYLGLK 335

Query: 286 AISVGNQRLGVSTPDIVI------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
            ISVG + L +  P+ +             DSGT +T L       L        +  P 
Sbjct: 336 GISVGGEALPI--PESIFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPK 393

Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITN 389
           A+     + CY  +S    QVP V+ HF  G ++ L   N+ + V S    C  F   T+
Sbjct: 394 ANGVSLFDTCYDLSSRESVQVPTVSFHFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTS 453

Query: 390 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S+ I GN+ Q    VG+DI    V F    C
Sbjct: 454 SLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  181 bits (459), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 128/357 (35%), Positives = 178/357 (49%), Gaps = 35/357 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG +N GLF      ++GLG G +S   Q       KFSYCLV  S++      + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307

Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
              VS     TPL    K  TFY + +  ISVG  R+ GV+             ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 367

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
           T++T L +     +        +    A      + C+  +++++  VP V +HFRGADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADV 427

Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L  +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P  C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  181 bits (458), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 127/373 (34%), Positives = 182/373 (48%), Gaps = 34/373 (9%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q    SSS  S   +   +  Y  R+ +GTPP     V DTGSD++W QC PC   +CY 
Sbjct: 128 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 183

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q  P+FDPK S ++ S+ C S  C  L+   C S  +C Y V+YGDGSF+ G  +TET+T
Sbjct: 184 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 243

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
                 +   +P +  GCG +N GLF      ++GLG G +S  +Q       KFSYCLV
Sbjct: 244 F-----RGTRVPKVALGCGHDNEGLFVGAAG-LLGLGRGRLSFPTQTGLRFGRKFSYCLV 297

Query: 249 PVSS----TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD- 300
             S+    + + FG + +      V TPL    K  TFY L +  ISVG  R+   T   
Sbjct: 298 DRSASSKPSSVVFGQSAVSR--TAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASL 355

Query: 301 ----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
                     ++IDSGT++T L +    +L     +       A      + C+  +  +
Sbjct: 356 FKLDTAGNGGVIIDSGTSVTRLTRRAYVSLRDAFRAGAADLKRAPDYSLFDTCFDLSGKT 415

Query: 351 Q--VPEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
           +  VP V +HFRGADV L  +N+ + V  + + C  F G  + + I GNI Q  F V +D
Sbjct: 416 EVKVPTVVMHFRGADVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFD 475

Query: 408 IEQQTVSFKPTDC 420
           +    + F    C
Sbjct: 476 VAASRIGFAARGC 488


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  181 bits (458), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 145/436 (33%), Positives = 206/436 (47%), Gaps = 55/436 (12%)

Query: 29  FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQN---------------- 71
           +SV+L+HRDS       N++ +  +RL + L R   R+    Q                 
Sbjct: 71  WSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYE 130

Query: 72  --SSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             + +++   S+  + +   +  Y  RI IGTP  E+  V DTGSD++W QCEPC   +C
Sbjct: 131 NVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--REC 188

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S ++ ++ C S+ C+ L+   C G  C Y VSYGDGS++ G+ ATET+
Sbjct: 189 YSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL 248

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G+T+ Q VA+     GCG +N GLF      ++GLG G +S  +Q+ T     FSYCL
Sbjct: 249 TFGTTSIQNVAI-----GCGHDNVGLFVGAAG-LLGLGAGSLSFPAQLGTQTGRAFSYCL 302

Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD- 300
           V     SS  + FG   +  G   + TPL       TFY L++ AISVG   L  S P  
Sbjct: 303 VDRDSESSGTLEFGPESVPIGS--IFTPLVANPFLPTFYYLSMVAISVGGVILD-SVPSE 359

Query: 301 ------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
                       I+IDSGT +T L       L     +  +  P AD     + CY  ++
Sbjct: 360 AFRIDETTGRGGIIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSA 419

Query: 349 LSQV--PEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
           L  V  P V  HF  GA   L   N  + + S    C  F    +++ I GNI Q    V
Sbjct: 420 LQSVSIPAVGFHFSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRV 479

Query: 405 GYDIEQQTVSFKPTDC 420
            +D     V F    C
Sbjct: 480 SFDSANSLVGFAIDQC 495


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 133/421 (31%), Positives = 202/421 (47%), Gaps = 39/421 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETP---YQRLRDALTRS---LNRLNHFNQNSSISSSKASQAD 83
           SV L+HR  P +P   SS+ P     RLR    RS   ++R++          S  +   
Sbjct: 57  SVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLG 116

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
              ++  Y++ + +GTP   ++ + DTGSDL W QC+PC  + CY Q  PLFDP  SSTY
Sbjct: 117 GSVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTY 176

Query: 144 KSLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             +PC++  C  L      G          C ++++YGDGS + G  + ET+ L      
Sbjct: 177 APIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP---- 232

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            VA+    FGCG +  G  N K  G++GLGG   SL+ Q  +   G FSYCL P  + ++
Sbjct: 233 GVAVKDFRFGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGGAFSYCL-PALNNQV 290

Query: 256 --------NFGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGVS----TPDIV 302
                      + G+V+  G V TP+ +  +TFYV+ +  I+VG + + V     +  ++
Sbjct: 291 GFLALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAFSGGMI 350

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
           IDSGT +T L     + L +     + A P+    G L+ CY F+  S V  P+V + F 
Sbjct: 351 IDSGTVVTELQHTAYNALQAAFRKAMAAYPLVR-NGELDTCYDFSGYSNVTLPKVALTFS 409

Query: 361 -GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            GA + L   N  +   +D +     G  +   I GN+ Q    V YD  +  V F+   
Sbjct: 410 GGATIDLDVPNGILL--DDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRAAV 467

Query: 420 C 420
           C
Sbjct: 468 C 468


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  180 bits (457), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 197/412 (47%), Gaps = 39/412 (9%)

Query: 30  SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLN-----HFNQNSSISSSKA 79
           S+E++H+  P S   N      S+TP+  + +     +  +N     +  Q+SS+S   +
Sbjct: 70  SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129

Query: 80  ----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
               +++  +  + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q   +F
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQDAIF 188

Query: 136 DPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVT 188
           DP  S++Y ++ C+S+ C  L     N+  CS     C Y + YGD SFS G  + E ++
Sbjct: 189 DPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLS 248

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           + +T      +    FGCG NN GLF   + G++GLG   IS + Q        FSYCL 
Sbjct: 249 VTATD----IVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303

Query: 249 PVSST--KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDI 301
             SS+  +++FGT           + +++  +FY L I  ISVG  +L V     ST   
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 359
           +IDSGT +T LP    + L S     +   P A     L+ CY  +      +P++   F
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSF 423

Query: 360 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 408
            G   V+L         S   VC  F   G  + V IYGN+ Q    V YD+
Sbjct: 424 AGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 140/423 (33%), Positives = 209/423 (49%), Gaps = 41/423 (9%)

Query: 30  SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
           S+E++H+  P S   P   +S +  Q L    +R  +  +   +N +  S+ KAS+A + 
Sbjct: 76  SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 135

Query: 86  PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             +A      NY++ + +G+P  +   + DTGSDL WTQCEPC    CY Q   +FDP  
Sbjct: 136 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 194

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S +Y ++ C S  C  L     N   CS   C Y + YGDGS+S G  A E ++L ST  
Sbjct: 195 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 253

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSS 252
                    FGCG NN GLF   T G++GL    +SL+SQ        FSYCL     S+
Sbjct: 254 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309

Query: 253 TKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVID 304
             ++FG+ G      V  TP    +   +FY L +  ISVG ++L +     ST   +ID
Sbjct: 310 GYLSFGS-GDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIID 368

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 361
           SGT ++ LP    S++  V   ++   P       L+ CY  +     +VP++ ++F  G
Sbjct: 369 SGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGG 428

Query: 362 ADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           A++ L+     + +KVS+  VC  F G +  + V I GN+ Q    V YD  +  V F P
Sbjct: 429 AEMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAP 486

Query: 418 TDC 420
           + C
Sbjct: 487 SGC 489


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 176/366 (48%), Gaps = 38/366 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPC 148
            YL+ +S+GTPP       DTGSDL+WTQC PC    C+ Q  +P+ DP  SST+ +LPC
Sbjct: 89  EYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAALPC 146

Query: 149 SSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGI 202
            +  C +L   SC G      +C Y   YGD S + G LAT++ T  G      +A   +
Sbjct: 147 DAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRV 206

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFG 258
           TFGCG  N G+F +  TGI G G G  SL SQ+  T    FSYC   +  TK    +  G
Sbjct: 207 TFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVT---SFSYCFTSMFDTKSSSVVTLG 263

Query: 259 --------TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VI 303
                   T+       V +T L K     + Y + +  ISVG  R+ V    +    +I
Sbjct: 264 AAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSSTII 323

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIH 358
           DSG ++T LP+     + +   S +     A  + +L+LC++    +      VP +T+H
Sbjct: 324 DSGASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPALTLH 383

Query: 359 FR-GADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
              GAD +L R N+ F   +  ++C V         + GN  Q N  V YD+E   +SF 
Sbjct: 384 LDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDLENDVLSFA 443

Query: 417 PTDCTK 422
           P  C K
Sbjct: 444 PARCDK 449


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 147/430 (34%), Positives = 206/430 (47%), Gaps = 59/430 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISS---------SKA 79
           FSV+L H D+     +NS  TP       L R   R+   +  +  +          S +
Sbjct: 60  FSVQLHHVDALS---FNS--TPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSS 114

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             + +   +  Y  RI +GTPP     V DTGSD++W QC PC   +CY Q  P+FDP+ 
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC--KRCYAQSDPVFDPRK 172

Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           S ++ S+ C S  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T   T    V
Sbjct: 173 SRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARV 232

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           AL     GCG +N GLF      ++GLG G +S  SQ       KFSYCLV  S++    
Sbjct: 233 AL-----GCGHDNEGLFVGAAG-LLGLGRGRLSFPSQTGRRFNHKFSYCLVDRSASS--- 283

Query: 258 GTNGIVSGPGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP--------- 299
             + +V G   VS     TPL    K  TFY + +  ISVG  R+ G++           
Sbjct: 284 KPSSMVFGDSAVSRTARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGN 343

Query: 300 -DIVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQ-- 351
             ++IDSGT++T L +     +     +  S++  A     P  SL + C+  +  ++  
Sbjct: 344 GGVIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRA-----PQFSLFDTCFDLSGKTEVK 398

Query: 352 VPEVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           VP V +HFRGADV L  SN+ + V +    C  F G    + I GNI Q  F V YD+  
Sbjct: 399 VPTVVLHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAG 458

Query: 411 QTVSFKPTDC 420
             V F P  C
Sbjct: 459 SRVGFAPHGC 468


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 134/415 (32%), Positives = 201/415 (48%), Gaps = 37/415 (8%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           ++I +D  +  F +S  T  + +R++ T      +      S+ S+   ++ +   + NY
Sbjct: 59  DMITKDEERVRFLHSRLTNKESVRNSATT-----DKLRGGPSLVSTTPLKSGLSIGSGNY 113

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC--- 148
            ++I +GTP      + DTGS L W QC+PC    C++Q  P+F P  S TYK+LPC   
Sbjct: 114 YVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSTSKTYKALPCSSS 172

Query: 149 --SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             SS + ++LN   CS     C Y  SYGD SFS G L+ + +TL   T       G  +
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSEAPSSGFVY 229

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--------STKIN 256
           GCG +N GLF  +++GI+GL    IS++ Q+       FSYCL            S  ++
Sbjct: 230 GCGQDNQGLFG-RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLS 288

Query: 257 FGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTL 309
            G + + S P    TPL K +   + Y L +  I+V  + LGVS        +IDSGT +
Sbjct: 289 IGASSLTSSP-YKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTIIDSGTVI 347

Query: 310 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVK 365
           T LP   YN+   S +  M +    A     L+ C+  S   +S VPE+ I FR GA ++
Sbjct: 348 TRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLE 407

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L   N  V++ +   C      +N + I GN  Q  F V YD+    + F P  C
Sbjct: 408 LKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 130/367 (35%), Positives = 183/367 (49%), Gaps = 44/367 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 91

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 92  STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 148

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFG 258
           FGCG  N G+F S  TGI G G G +SL SQ++    G FS+C   +     S+  ++  
Sbjct: 149 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLP 205

Query: 259 TNGIVSGPGVV-STPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STPDIV 302
            +   +G G V +TPL + AK     T Y L++  I+VG+ RL V          T   +
Sbjct: 206 ADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTI 265

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
           IDSGT++T LP      +    ++ I+   V         C+S  S ++  VP++ +HF 
Sbjct: 266 IDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFE 325

Query: 361 GADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N  V YD++   +SF
Sbjct: 326 GATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLSF 383

Query: 416 KPTDCTK 422
               C K
Sbjct: 384 VAAQCDK 390


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  180 bits (456), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 182/359 (50%), Gaps = 38/359 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 137 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 192

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 193 SSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----K 247

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           L  + FGCG NN GLF    +G++GLG   +SL+SQ   T  G FSYCL  +   +S  +
Sbjct: 248 LENLVFGCGRNNKGLFGG-ASGLMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGTL 306

Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
           +FG +  V  +   V  TPL    + ++FY+L +   S+G   L   +    I+IDSGT 
Sbjct: 307 SFGNDFSVYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELKTLSFGRGILIDSGTV 366

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
           +T LP      + +         P A     L+ C++  S     +P + + F G    +
Sbjct: 367 ITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMIFEGNAELE 426

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +     +C
Sbjct: 427 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAGENC 485


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 139/447 (31%), Positives = 201/447 (44%), Gaps = 46/447 (10%)

Query: 6   SCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
           SC+   LFFL      P+ + T      L H D  +        T  + LR  + RS  R
Sbjct: 11  SCMLPYLFFLAILFAWPVTSAT--LRAHLSHVDDGRG------FTKRELLRRMVVRSRAR 62

Query: 65  LNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVA-DTGSDLIWTQCE 120
             +    S  ++  A+      N   N+ YLI +SIG P ++ + +  DTGSD++WTQCE
Sbjct: 63  AANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE 122

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC  ++C+ Q  P FD   S+T +S+ CS   C + ++  C    C Y   YGDGS S G
Sbjct: 123 PC--AECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFG 180

Query: 181 NLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           +   ++ T     G   V +P I FGCG  N G F    TGI G G G +SL SQ++   
Sbjct: 181 HFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVR- 239

Query: 240 AGKFSYCLV---PVSSTKINFGTNG---------IVSGPGVVSTPLTKAKTFYVLTIDAI 287
             +FSYC        S+ +  G  G         I+S P V S P     + YVL+   +
Sbjct: 240 --QFSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGV 297

Query: 288 SVGNQRLGVSTPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
           +VG  RL V  P+I         IDSGT +T  P      L S   +   A PV      
Sbjct: 298 TVGKTRLPV--PEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADE 354

Query: 340 LELCYSFN--SLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYG 395
            ++C+S++    + +P++  H  GAD  L R N+  +  E   +  +V         + G
Sbjct: 355 DDICFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIG 414

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N  Q N  + YD+    +   P  C K
Sbjct: 415 NFQQQNTHIVYDLAAGKLLLVPAQCDK 441


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  179 bits (455), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 143/444 (32%), Positives = 215/444 (48%), Gaps = 67/444 (15%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           GGFSVELIHRDS KSPF++   T + R   A  R           S +SS      D+  
Sbjct: 25  GGFSVELIHRDSIKSPFHDPKLTRHDRFL-AAARRSRARAAALLASDVSS------DLFY 77

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM----------------- 129
            +  YL  +++GTPP   LAVADTGSDL+W +C     +   +                 
Sbjct: 78  GDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPP 137

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
           +    F+P  SS+Y  + C    C +L    SC+G +  C +  SY DG+ + G LA +T
Sbjct: 138 EAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAADT 197

Query: 187 VTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
            T  G+      +   I FGC T   G    +  G+VGLG G +SL SQ+      KFS+
Sbjct: 198 FTFGGNINNDTTSTASIDFGCATGTAGR-EFQADGMVGLGAGPLSLASQL----GRKFSF 252

Query: 246 CL----VPVSSTKINFGTNGIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRL--G 295
           CL    +  +S+ +NFG   +VS PG  +TPL    + A  +Y ++ID++ V  Q +   
Sbjct: 253 CLTAYDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGT 312

Query: 296 VSTPDIVIDSGTTLTFLPQG-----YNSNLLSVM--SSMIEAQPVADPTGSLELCYSFNS 348
            S   +++D+GT LTFL +         +L  VM  + +  A P   P  +LELCY  + 
Sbjct: 313 TSVSKVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPP---PDETLELCYDVSR 369

Query: 349 LSQV----PEVTIHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGN 396
           +  V    P+VT+      G +V+L+    FV V E ++C     +T S     + + GN
Sbjct: 370 VKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLC--LAVVTTSPELQPLSVLGN 427

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
           +   +  VG D++ +T +F   +C
Sbjct: 428 VALQDLHVGIDLDARTATFATANC 451


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 202/431 (46%), Gaps = 49/431 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--------NQNSSISSSKASQ 81
           SV L+HR  P +P   S   P   L + L R   R N+            +++S +    
Sbjct: 44  SVPLVHRHGPCAPSAASGGKP--SLAERLRRDRARANYIVTKAAGGRTAATAVSDAVGGG 101

Query: 82  ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              IP       ++  Y++ + IGTP  +++ + DTGSDL W QC+PC   +CY Q  PL
Sbjct: 102 GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPL 161

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQ-------KSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           FDP  SS+Y S+PC S  C  L          S +   C+Y + YG+ + + G  +TET+
Sbjct: 162 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETL 221

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G FSYCL
Sbjct: 222 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGGPFSYCL 276

Query: 248 VPVSSTK--INFGT----NGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS- 297
            P S     +  G     +   +  G + TP+ +     TFYV+T+  ISVG   L V  
Sbjct: 277 PPTSGGAGFLALGAPNSSSSSTAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPP 336

Query: 298 ---TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ- 351
              +  +VIDSGT +T LP    + L S   S +    +  P+    L+ CY F   +  
Sbjct: 337 SAFSSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNV 396

Query: 352 -VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            VP + + F  GA + L+     +   +  +     G  +++ I GN+ Q  F V YD  
Sbjct: 397 TVPTIALTFSGGATIDLATPAGVLV--DGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSG 454

Query: 410 QQTVSFKPTDC 420
           + TV F+   C
Sbjct: 455 KGTVGFRAGAC 465


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 139/425 (32%), Positives = 200/425 (47%), Gaps = 50/425 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQAD------ 83
           V+L H D+      +S ETP       L R  +R+       +++ S+  ++A       
Sbjct: 80  VQLHHLDA-----LSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSS 134

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
                +   +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+F+P 
Sbjct: 135 SVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPC--KKCYSQTDPVFNPT 192

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            S ++ ++PC S  C  L+   CS     C Y VSYGDGSF+ G  +TET+T   T    
Sbjct: 193 KSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGR 252

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-- 254
           VAL     GCG +N GLF      ++GLG G +S  SQ+    + KFSYCLV  S++   
Sbjct: 253 VAL-----GCGHDNEGLFIGAAG-LLGLGRGRLSFPSQIGRRFSRKFSYCLVDRSASSKP 306

Query: 255 --INFGTNGIVSGPG---VVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTP--------- 299
             + FG + I        +VS P  K  TFY + +  +SVG  R+ G++           
Sbjct: 307 SYMVFGDSAISRTARFTPLVSNP--KLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGN 364

Query: 300 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVT 356
             ++IDSGT++T L +     L             A      + C+  +  ++  VP V 
Sbjct: 365 GGVIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVV 424

Query: 357 IHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           +HFRGADV L  SN+ + V      C  F G  + + I GNI Q  F V YD+    V F
Sbjct: 425 LHFRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGF 484

Query: 416 KPTDC 420
            P  C
Sbjct: 485 APRGC 489


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 132/424 (31%), Positives = 210/424 (49%), Gaps = 48/424 (11%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR----LNHFNQNSSISSSKASQADI 84
           + ++L+HRD  K P +N+      R    + R   R    L          +++A  +D+
Sbjct: 68  YKLKLVHRD--KVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSDV 125

Query: 85  I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +RI +G+PP  +  V D+GSD+IW QCEPC  +QCY Q  P+F+P  S
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 183

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S++  + C+S+ C+ ++  +C    C+Y VSYGDGS++ G LA ET+T G T  + VA+ 
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAI- 242

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINF 257
               GCG +N G+F      ++GLGGG +S + Q+     G FSYCLV     SS  + F
Sbjct: 243 ----GCGHHNQGMFVGAAG-LLGLGGGPMSFVGQLGGQTGGAFSYCLVSRGIESSGLLEF 297

Query: 258 GTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVID 304
           G   +  G   V  PL    +A++FY + +  + VG  R+ +S             +V+D
Sbjct: 298 GREAMPVGAAWV--PLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMD 355

Query: 305 SGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIH 358
           +GT +T LP    + +    ++  +++    P A      + CY  F  +S +VP V+ +
Sbjct: 356 TGTAVTRLPTVAYEAFRDGFIAQTTNL----PRASGVSIFDTCYDLFGFVSVRVPTVSFY 411

Query: 359 FRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F G  +  L   NF + V +    C  F   ++ + I GNI Q    +  D     V F 
Sbjct: 412 FSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFG 471

Query: 417 PTDC 420
           P  C
Sbjct: 472 PNVC 475


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 134/423 (31%), Positives = 198/423 (46%), Gaps = 39/423 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPY------------QRLRDALTRSLNRLNHFNQNSSISSS 77
           S+E++H+  P S   +S +               +R++   +R    L   N+   + S+
Sbjct: 66  SLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDST 125

Query: 78  KA-SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
              +++  +  +A+Y + + +GTP  +   + DTGS L WTQCEPC  S CY Q  P+FD
Sbjct: 126 TLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGS-CYKQQDPIFD 184

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++ C+SS C       CS     +C Y V YGD S S G L+ E +T+ +T 
Sbjct: 185 PSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
                +    FGCG +N GLF   T G++GL    IS + Q  +     FSYCL   P S
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRG-TAGLMGLSRHPISFVQQTSSIYNKIFSYCLPSTPSS 299

Query: 252 STKINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL-GVSTPDI-----V 302
              + FG +   +   +  TP   ++   +FY L I  ISVG  +L  VS+        +
Sbjct: 300 LGHLTFGASA-ATNANLKYTPFSTISGENSFYGLDIVGISVGGTKLPAVSSSTFSAGGSI 358

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
           IDSGT +T LP    + L S     +   PVA  T  L+ CY F+   +  VP +   F 
Sbjct: 359 IDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEFA 418

Query: 361 GA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           G   V+L         S   +C  F   G  N + I+GN+ Q    V YD+E   + F  
Sbjct: 419 GGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGA 478

Query: 418 TDC 420
             C
Sbjct: 479 AGC 481


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 134/431 (31%), Positives = 200/431 (46%), Gaps = 55/431 (12%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP----- 86
            ++HRD+     +  + T  + L+  L R   R    ++ +        +    P     
Sbjct: 68  RVVHRDT-----FAVNATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGL 122

Query: 87  --NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              +  Y  +I +GTP T+ L V DTGSD++W QC PC   +CY Q  P+FDP+ SS+Y 
Sbjct: 123 AQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYG 180

Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           ++ C ++ C  L+   C      C Y V+YGDGS + G+  TET+T     G  VA   +
Sbjct: 181 AVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG--GARVAR--V 236

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------- 254
             GCG +N GLF +    ++GLG G +S  +Q+       FSYCLV  +S+         
Sbjct: 237 ALGCGHDNEGLFVAAAG-LLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSH 295

Query: 255 ----INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD------ 300
               ++FG  G V       TP+    + +TFY + +  ISVG  R+ GV+  D      
Sbjct: 296 RSSTVSFGA-GSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPS 354

Query: 301 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQ 351
                +++DSGT++T L +   S L     +         P G    + CY      + +
Sbjct: 355 TGRGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVK 414

Query: 352 VPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           VP V++HF  GA+  L   N+ + V S    C  F G    V I GNI Q  F V +D +
Sbjct: 415 VPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGD 474

Query: 410 QQTVSFKPTDC 420
            Q V F P  C
Sbjct: 475 GQRVGFAPKGC 485


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  179 bits (453), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 138/430 (32%), Positives = 201/430 (46%), Gaps = 55/430 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNSSISSSKASQADI 84
           GF   L H D+       +  T  Q L  A+ RS  R   L      ++  +   ++  +
Sbjct: 29  GFQATLTHIDA------GAGYTEAQLLSRAVRRSKARVAALQSLATTTAADAITVARILV 82

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + +   YL+ + IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FDP  S +Y 
Sbjct: 83  LASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSYA 140

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPC+S  C +L    C    C Y   YGD + + G L+ ET T G T    V +P I F
Sbjct: 141 KLPCNSPMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFG-TNDTRVTVPRIAF 199

Query: 205 GCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGT 259
           GCG  N G LFN   +G+VG G G +SL+SQ+ +    +FSYCL     PV S ++ FG 
Sbjct: 200 GCGNLNAGSLFNG--SGMVGFGRGPLSLVSQLGSP---RFSYCLTSFMSPVPS-RLYFGA 253

Query: 260 NGIV------SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TP 299
              +      +G  V STP        T Y L +  ISVG + L +            T 
Sbjct: 254 YATLNSTSASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTG 313

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSF----NSLS 350
            ++IDSG+T+T+L +     +    +  +      A  +AD    L+ C+ +      + 
Sbjct: 314 GVIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLAD---VLDTCFVWPPPPRKIV 370

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
            +PE+  HF GA+++L   N+ +   +     +    ++   I G+    NF V YD E 
Sbjct: 371 TMPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNEN 430

Query: 411 QTVSFKPTDC 420
             +SF P  C
Sbjct: 431 SLLSFTPATC 440


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 139/436 (31%), Positives = 204/436 (46%), Gaps = 51/436 (11%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--NQNSSISSSKA 79
           +E  +   S+ L+HR  P +P    S  P   + + L RS  R N+     + S+    A
Sbjct: 48  LEPSSATVSMSLVHRYGPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMA 106

Query: 80  SQAD------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           S  D       IP       ++  Y++ +  GTP   ++ + DTGSD+ W QC PC  ++
Sbjct: 107 STPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTK 166

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGN 181
           CY Q  PLFDP  SSTY  + C++  C  L     N  +  G  C YSV Y DGS S G 
Sbjct: 167 CYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGV 226

Query: 182 LATETVTLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            + ET+TL          PGIT     FGCG +  G  + K  G++GLGG  +SL+ Q  
Sbjct: 227 YSNETLTLA---------PGITVEDFHFGCGRDQRGP-SDKYDGLLGLGGAPVSLVVQTS 276

Query: 237 TTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN 291
           +   G FSYCL  ++S    +  G+    +    V TP+       TFY++T+  ISVG 
Sbjct: 277 SVYGGAFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGG 336

Query: 292 QRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
           + L +        ++IDSGT  T LP+   + L + +   ++A P+  P+   + CY+F 
Sbjct: 337 KPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFT 395

Query: 348 SLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
             S   VP V   F  GA + L   N  +    D +     G  + + I GN+ Q    V
Sbjct: 396 GYSNITVPRVAFTFSGGATIDLDVPNGILV--NDCLAFQESGPDDGLGIIGNVNQRTLEV 453

Query: 405 GYDIEQQTVSFKPTDC 420
            YD  +  V F+   C
Sbjct: 454 LYDAGRGNVGFRAGAC 469


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 141/430 (32%), Positives = 209/430 (48%), Gaps = 47/430 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRL--RD-ALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           S++L+HRD+     + S       L  RD A    L R    + + S +SS  S   I+ 
Sbjct: 58  SLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117

Query: 87  N-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + +  YL+R+ IG+PP E+  VADTGSD+IW QC PC  S CY Q  PLFDP  S+++  
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC--SDCYAQGDPLFDPANSASFSP 175

Query: 146 LPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVAL 199
           +PC+S  C +  +         G  C+Y VSYGD S++NG LA ET+TL G T  Q VA+
Sbjct: 176 VPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAM 235

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGT 259
                GCG  N GLF ++  G++GLG G +SL+ Q+     G FSYCL    S + +   
Sbjct: 236 -----GCGHENRGLF-AEAAGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSG 289

Query: 260 NGIV----SGP-GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS----------TPDI 301
           + ++    + P G V  PL +   A +FY + ++ + V  +RL +              +
Sbjct: 290 SLVLGREDAAPTGAVWVPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGV 349

Query: 302 VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 358
           V+D+GT +T LP + Y +   +   +  E  P A      + CY  +  +  +VP V ++
Sbjct: 350 VMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVALY 409

Query: 359 F-------RGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           F         A + L   N  V V +    C  F  + +   I GNI Q    +  D   
Sbjct: 410 FGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSAS 469

Query: 411 QTVSFKPTDC 420
             V F P  C
Sbjct: 470 GYVGFGPATC 479


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 135/416 (32%), Positives = 195/416 (46%), Gaps = 46/416 (11%)

Query: 30  SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLNHFN--------QNSSI-- 74
           S+E++H+  P S   +      S TP+    D L +   R+ + N        Q+SS+  
Sbjct: 71  SLEVVHKHGPCSQLNDHDGKAKSTTPHS---DILNQDKERVKYINSRLSKNLGQDSSVEE 127

Query: 75  --SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             S++  +++  +  + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q  
Sbjct: 128 LDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQD 186

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
            +FDP  S++Y ++ C+S+ C  L     N   CS     C Y + YGD SFS G  + E
Sbjct: 187 VIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRE 246

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
            +T+ +T      +    FGCG NN GLF   + G++GLG   IS + Q        FSY
Sbjct: 247 RLTVTATD----VVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAKYRKIFSY 301

Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRLGV-----S 297
           CL   SS+  +       +G  +  TP   +++  +FY L I AI+VG  +L V     S
Sbjct: 302 CLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361

Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
           T   +IDSGT +T LP      L S     +   P A     L+ CY  +      +P +
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTI 421

Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 408
              F G   VKL         S   VC  F   G  + V IYGN+ Q    V YD+
Sbjct: 422 EFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  178 bits (452), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 144/426 (33%), Positives = 207/426 (48%), Gaps = 48/426 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
           GF   L H D+      ++  T  Q L  AL RS  R+      ++++   A  A    +
Sbjct: 30  GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP  S+TY+
Sbjct: 84  LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           SL C+S  C +L    C    C Y   YGD + + G LA ET T G T    V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
           GCG  N G   +  +G+VG G G +SL+SQ+ +    +FSYCL     PV S ++ FG  
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVPS-RLYFGVY 255

Query: 261 GIV-----SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS-----------TPDI 301
             +     S   V STP        T Y L +  ISVG   L +            T   
Sbjct: 256 ATLNSTNASSEPVQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGT 315

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 355
           +IDSGTT+T+L +     + +  +S I   P+ + T +  L+ C+ +    +    +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374

Query: 356 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
            +HF GAD +L   N+  V  S      +    ++   I G+    NF V YD+E   +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434

Query: 415 FKPTDC 420
           F P  C
Sbjct: 435 FVPAPC 440


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 127/429 (29%), Positives = 210/429 (48%), Gaps = 68/429 (15%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
           +VGLG G +SL+SQ+      +F+YCL P +S    K+  G +   +      ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272

Query: 276 A---KTFYVLTIDAISVGNQRL----------------------------GVSTPD---- 300
                ++Y L +D + +G++ +                             V+  D    
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVP 353
            ++ID  +T+TFL       L++ +   I        +  L+LC+      +F+ +  VP
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVP 391

Query: 354 EVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
            V + F G  ++L ++  F +  E  ++C  V +    SV I GN  Q N  V Y++ + 
Sbjct: 392 AVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 451

Query: 412 TVSFKPTDC 420
            V+F  + C
Sbjct: 452 RVTFVQSPC 460


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  178 bits (451), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 134/426 (31%), Positives = 195/426 (45%), Gaps = 41/426 (9%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP------ 86
           ++HR  P SP     + P     D L     R++  ++  +  ++   Q   +P      
Sbjct: 22  VMHRHGPCSPLQTPDDAPSDA--DLLEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79

Query: 87  -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
               NY++ + +GTP  +   V DTGSDL W QC PC    CY Q  PLF P  SST+ +
Sbjct: 80  VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139

Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
           + C   +C    Q SCS       C Y V YGD S + G+L  +T+TLG+T     +   
Sbjct: 140 VRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENN 198

Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
              LPG  FGCG NN GLF  K  G+ GLG G +SL SQ        FSYCL   SS   
Sbjct: 199 SNKLPGFVFGCGENNTGLFG-KADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAH 257

Query: 255 --INFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVST------PDIVID 304
             ++ GT          +  L ++ T  FY + +  I V  + + VS+        +++D
Sbjct: 258 GYLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVD 317

Query: 305 SGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIH 358
           SGT +T L P+ Y++   + +S+M +      P  S L+ CY F    N+   +P V + 
Sbjct: 318 SGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALV 377

Query: 359 FRGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           F G     V  S   +  KV++  +     G   S  I GN  Q    V YD+ +Q + F
Sbjct: 378 FAGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGF 437

Query: 416 KPTDCT 421
               C+
Sbjct: 438 AAKGCS 443


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 133/430 (30%), Positives = 200/430 (46%), Gaps = 37/430 (8%)

Query: 19  VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS--LNRLNHFNQNSSISS 76
           VS  ++    F + L+HRD       +      +  RDA+  +  + RL+H    +++  
Sbjct: 62  VSGYKSDNNTFKLNLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSH-GAPAAVKD 120

Query: 77  SKASQA----DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           S+   A    D+I      +  Y +RI +G+PP  +  V D+GSD++W QC+PC  S+CY
Sbjct: 121 SRYKVANFATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPC--SRCY 178

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDP  SS++  + C S  C  L    C+   C+Y VSYGDGS++ G LA ET+T
Sbjct: 179 QQSDPVFDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLT 238

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           +G    + VA+     GCG  N G+F      ++GLGGG +S I Q+     G FSYCLV
Sbjct: 239 VGQVMIRDVAI-----GCGHTNQGMFIGAAG-LLGLGGGSMSFIGQLGGQTGGAFSYCLV 292

Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLT-KAKTFYVLTIDAISVGNQRLGV-------- 296
                S+  + FG   +  G   +S     +A +FY + +  I VG  R+ V        
Sbjct: 293 SRGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLT 352

Query: 297 --STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 352
              T  +V+D+GT +T  P           ++     P A      + CY  N     +V
Sbjct: 353 EYGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRV 412

Query: 353 PEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           P V+ +F  G  + L   NF + V      C  F    + + I GNI Q    + +D   
Sbjct: 413 PTVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGAN 472

Query: 411 QTVSFKPTDC 420
             V F P  C
Sbjct: 473 GFVGFGPNIC 482


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  178 bits (451), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 144/432 (33%), Positives = 209/432 (48%), Gaps = 55/432 (12%)

Query: 30  SVELIHRDSPKSPFYNSS---ETP--YQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           SV L HR  P +P  +S+   + P   +RLR    R+ + L   +    +S    +    
Sbjct: 55  SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGAS--- 111

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       ++  Y++ + IGTP  ++  + DTGSDL W QC+PC  S CY Q  PLFDP
Sbjct: 112 IPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDP 171

Query: 138 KMSSTYKSLPCSSSQCASL----------NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
             SST+ ++PC+S  C  L          N  S     C Y++ YG+G+ + G  +TET+
Sbjct: 172 SKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETL 231

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
            LGS+      +    FGCG++  G ++ K  G++GLGG   SL+SQ  +   G FSYCL
Sbjct: 232 ALGSS----AVVKSFRFGCGSDQHGPYD-KFDGLLGLGGAPESLVSQTASVYGGAFSYCL 286

Query: 248 VPVSSTKINFGTNGIV-----SGPGVVSTPLT----KAKTFYVLTIDAISVGNQRLGVST 298
            P++S    F T G       S  G V TP+     K  TFYV+T+  ISVG + L +  
Sbjct: 287 PPLNS-GAGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIP- 344

Query: 299 PDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLS 350
           P +     ++DSGT +T +P      L +   S +   P+  P  S L+ CY+F  +   
Sbjct: 345 PAVFAKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPADSALDTCYNFTGHGTV 404

Query: 351 QVPEVTIHF-RGADVKLS-RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
            VP+V + F  GA V L   S   V   ED +     G   S  I GN+      V YD 
Sbjct: 405 TVPKVALTFVGGATVDLDVPSGVLV---EDCLAFADAG-DGSFGIIGNVNTRTIEVLYDS 460

Query: 409 EQQTVSFKPTDC 420
            +  + F+   C
Sbjct: 461 GKGHLGFRAGAC 472


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 127/429 (29%), Positives = 210/429 (48%), Gaps = 68/429 (15%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
           +VGLG G +SL+SQ+      +F+YCL P +S    K+  G +   +      ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272

Query: 276 A---KTFYVLTIDAISVGNQRL----------------------------GVSTPD---- 300
                ++Y L +D + +G++ +                             V+  D    
Sbjct: 273 DPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAPTPSPNATAVAVGDANRY 332

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVP 353
            ++ID  +T+TFL       L++ +   I        +  L+LC+      +F+ +  VP
Sbjct: 333 GMIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVP 391

Query: 354 EVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
            V + F G  ++L ++  F +  E  ++C  V +    SV I GN  Q N  V Y++ + 
Sbjct: 392 AVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRG 451

Query: 412 TVSFKPTDC 420
            V+F  + C
Sbjct: 452 RVTFVQSPC 460


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  177 bits (450), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 127/357 (35%), Positives = 177/357 (49%), Gaps = 35/357 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG +N GLF      ++GLG G +S   Q       KFSYCLV  S++      + +V G
Sbjct: 252 CGHDNEGLFVGAAG-LLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASS---KPSSVVFG 307

Query: 266 PGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDSG 306
              VS     TPL    K  TFY + +  ISVG  R+ GV+             ++IDSG
Sbjct: 308 NAAVSRIARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSG 367

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADV 364
           T++T L +     +        +    A      + C+  +++++  VP V +HFR ADV
Sbjct: 368 TSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRRADV 427

Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L  +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P  C
Sbjct: 428 SLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  177 bits (450), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 128/427 (29%), Positives = 201/427 (47%), Gaps = 59/427 (13%)

Query: 23  EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           + + GG  + ++++HRD      + +S+    RL   L R   R+    +  S     + 
Sbjct: 125 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 181

Query: 81  QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           + D         +   +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q 
Sbjct: 182 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 239

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
            P+FDP  S+++  + CSSS C  L    C    C+Y VSYGDGS++ G LA ET+T G 
Sbjct: 240 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 299

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
           T  ++VA+     GCG  N G+F      ++GLGGG +S + Q+     G FSYCLV  +
Sbjct: 300 TMVRSVAI-----GCGHRNRGMFVGAAG-LLGLGGGSMSFVGQLGGQTGGAFSYCLVSAA 353

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP----------DI 301
              +             V  P  +A +FY + +  + VG  R+ +S             +
Sbjct: 354 WVPL-------------VRNP--RAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGV 398

Query: 302 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEV 355
           V+D+GT +T LP    Q +    L+  +++  A  VA      + CY        +VP V
Sbjct: 399 VMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSVRVPTV 454

Query: 356 TIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           + +F G  +  L   NF + + +    C  F   T+ + I GNI Q    + +D     V
Sbjct: 455 SFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYV 514

Query: 414 SFKPTDC 420
            F P  C
Sbjct: 515 GFGPNIC 521


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 126/355 (35%), Positives = 171/355 (48%), Gaps = 33/355 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP      V DTGSD  W QC+PC  + CY Q  PLFDP  S+TY ++
Sbjct: 92  GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSSS C+ L    CSG +C Y + YGDGS++ G  A +T+TL   T     +    FGC
Sbjct: 151 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 205

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F+YCL P +S     GT  +  GP
Sbjct: 206 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSA----GTGFLDLGP 259

Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
           G  +     TP+   +  TFY + +  I VG   L +     ST   ++DSGT +T LP 
Sbjct: 260 GAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPP 319

Query: 315 GYNSNLLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVK 365
              + L S  S  ++     A P  S L+ CY           +P V++ F+G    DV 
Sbjct: 320 SAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVD 379

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            S   +   VS+  +          V I GN  Q    V YDI ++ V F P  C
Sbjct: 380 ASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  177 bits (449), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 144/452 (31%), Positives = 223/452 (49%), Gaps = 78/452 (17%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
           G  +EL H D+ +  F  S      R+R A  RS  R+N     +   ++   ++D    
Sbjct: 29  GIRLELTHVDA-RGDFTGS-----DRVRRAADRSHRRVNGLLAAAPPPAASTLRSDGGGG 82

Query: 84  ----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS 132
                     +  + A YL+  +IGTPP    AV DTGSDLIWTQC+ PC   +C+ Q +
Sbjct: 83  GACAATAAASVHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPC--RRCFPQPA 140

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-------------NQKSCSGVNCQYSVSYGDGSFSN 179
           PL+ P  S TY ++ C S  C +L             +  +     C Y  SYGDGS ++
Sbjct: 141 PLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTD 200

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT 238
           G LATET T G+ T     +  + FGCGT+N GG  NS  +G+VG+G G +SL+SQ+  T
Sbjct: 201 GVLATETFTFGAGT----TVHDLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVT 254

Query: 239 IAGKFSYCLVP----VSSTKINFGTNGIVSGPGVVSTPLT------KAKTFYVLTIDAIS 288
              KFSYC  P     +S+ +  G++  +S P   STP        +  ++Y L+++ I+
Sbjct: 255 ---KFSYCFTPFNDTTTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGIT 310

Query: 289 VGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
           VG+  L +              ++IDSGTT T L +     L   +++ +     +    
Sbjct: 311 VGDTLLPIDPAVFRLTASGRGGLIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHL 370

Query: 339 SLELCYSF-----NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF-KGITNS-- 390
            L +C++           VP + +HF GAD++L RS+    V ED V  V   GI ++  
Sbjct: 371 GLSVCFAAPQGRGPEAVDVPRLVLHFDGADMELPRSS---AVVEDRVAGVACLGIVSARG 427

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           + + G++ Q N  V YD+ +  +SF+P +C +
Sbjct: 428 MSVLGSMQQQNMHVRYDVGRDVLSFEPANCGE 459


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 126/355 (35%), Positives = 171/355 (48%), Gaps = 33/355 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP      V DTGSD  W QC+PC  + CY Q  PLFDP  S+TY ++
Sbjct: 157 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 215

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSSS C+ L    CSG +C Y + YGDGS++ G  A +T+TL   T     +    FGC
Sbjct: 216 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 270

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N GLF  +  G++GLG G  SL  Q      G F+YCL P +S     GT  +  GP
Sbjct: 271 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCL-PATSA----GTGFLDLGP 324

Query: 267 GVVS-----TPL--TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
           G  +     TP+   +  TFY + +  I VG   L +     ST   ++DSGT +T LP 
Sbjct: 325 GAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPP 384

Query: 315 GYNSNLLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVK 365
              + L S  S  ++     A P  S L+ CY           +P V++ F+G    DV 
Sbjct: 385 SAYAPLRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVD 444

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            S   +   VS+  +          V I GN  Q    V YDI ++ V F P  C
Sbjct: 445 ASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 110/324 (33%), Positives = 166/324 (51%), Gaps = 17/324 (5%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DTGSD+ W QC+PCP  QCY Q   LF P  S+TYK LPC+S+ C  L     SC   +C
Sbjct: 6   DTGSDITWIQCDPCP--QCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSSC 63

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
            Y VSYGD S + G+ A ET+TL S     V++P   FGCG  N GLFN    G++GLG 
Sbjct: 64  NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNG-AAGLMGLGK 122

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSST----KINFGTNGIVSGPGVVSTPLTKAK---TF 279
             I   +Q        FSYCL  VSST     ++FG   ++    V  TPL  +    + 
Sbjct: 123 SSIGFPAQTSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDY-DVRFTPLVDSSSGPSQ 181

Query: 280 YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 339
           Y +++  I+VG++ L +S   +++DSGT ++   Q     L    + ++     A     
Sbjct: 182 YFVSMTGINVGDELLPISA-TVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP 240

Query: 340 LELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
            + C+  +++    +P +T+HFR  A+++LS  +    V + ++C  F   ++   + GN
Sbjct: 241 FDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFAPSSSGRSVLGN 300

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
             Q N    YDI +  +     +C
Sbjct: 301 FQQQNLRFVYDIPKSRLGISAFEC 324


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  177 bits (449), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 129/434 (29%), Positives = 203/434 (46%), Gaps = 56/434 (12%)

Query: 29  FSVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
             V + HRD+  P  P         QRL     R  + ++   +  S   S       IP
Sbjct: 27  LHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IP 80

Query: 87  -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
             +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ 
Sbjct: 81  FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRR 138

Query: 146 LPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +PCSS QC +L    C     +G  C+Y V+YGDGS S G+LAT+ +   + T     + 
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVN 194

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--STKINFG 258
            +T GCG +N GLF+S   G++G+G G IS+ +Q+       F YCL   +  ST+ ++ 
Sbjct: 195 NVTLGCGRDNEGLFDS-AAGLLGVGRGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253

Query: 259 TNGIVSGP------GVVSTPLTKAKTFYVLTIDAISVGNQR--------LGVSTP----D 300
             G    P       ++S P  +  + Y + +   SVG +R        L + T      
Sbjct: 254 VFGRTPEPPSTAFTALLSNP--RRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 355
           +V+DSGT ++   +   + L     +   A  +    G     + CY       +  P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371

Query: 356 TIHFR-GADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            +HF  GAD+ L   N+F+       + +    C  F+   + + + GN+ Q  F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 408 IEQQTVSFKPTDCT 421
           +E++ + F P  CT
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 130/412 (31%), Positives = 201/412 (48%), Gaps = 49/412 (11%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------------------NYLIRISI 97
           D+  +   R++  ++ +++S S A++ D  P  A                   YL+ + +
Sbjct: 96  DSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYL 155

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---- 153
           GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S +Y+++ C   +C    
Sbjct: 156 GTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGDDRCRLVS 213

Query: 154 --ASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             A    + C       C Y   YGD S + G+LA E  T+  T      + G+ FGCG 
Sbjct: 214 PPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGH 273

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP---VSSTKINFG-TNGIV 263
            N GLF+     ++GLG G +S  SQ+R    G  FSYCLV     + +KI FG  + ++
Sbjct: 274 RNRGLFHGAAG-LLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALL 332

Query: 264 SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQ- 314
           + P +  T   P T A TFY L + +I VG + + +S+  +     +IDSGTTL++ P+ 
Sbjct: 333 AHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEP 392

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 371
            Y +   + +  M  + P+      L  CY+ +     +VPE+++ F  GA  +    N+
Sbjct: 393 AYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENY 452

Query: 372 FVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           F+++  E I+C    G   S + I GN  Q NF V YD+E   + F P  C 
Sbjct: 453 FIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  177 bits (448), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 147/448 (32%), Positives = 215/448 (47%), Gaps = 80/448 (17%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA- 89
           ++++HRDS  S   +++    + L++ L R   R++  N    +++   S+A++ P N  
Sbjct: 70  LQVVHRDSLSSS--SNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGS 127

Query: 90  ------------------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
                                    Y  R+ +GTPP     V DTGSD++W QC PC  +
Sbjct: 128 SIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPC--A 185

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
           +CY Q  PLF+P  SSTY+ +PC++  C  L+   C     C+Y VSYGDGSF+ G+ +T
Sbjct: 186 KCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFST 245

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           ET+T      + VAL     GCG +N GLF      ++GLG G +S  SQ     + +FS
Sbjct: 246 ETLTFRGQVIRRVAL-----GCGHDNEGLFIGAAG-LLGLGRGSLSFPSQTGAQFSKRFS 299

Query: 245 YCLVPVS----STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS 297
           YCLV  S    ++ + FG   I      + TPL    K  TFY + +  ISVG +RL  S
Sbjct: 300 YCLVDRSASGTASSLIFGKAAIPK--SAIFTPLLSNPKLDTFYYVELVGISVGGRRL-TS 356

Query: 298 TP------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL----- 340
            P             ++IDSGT++T L         S  S+M +A  V   TG+L     
Sbjct: 357 IPASVFRMDATGNGGVIIDSGTSVTRLVD-------SAYSTMRDAFRVG--TGNLKSAGG 407

Query: 341 ----ELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVP 392
               + CY  + L   +VP +  HF+ GA + L  +N+ + V S    C  F G T  + 
Sbjct: 408 FSLFDTCYDLSGLKTVKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLS 467

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           I GNI Q  + V +D     V FK   C
Sbjct: 468 IIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 140/445 (31%), Positives = 213/445 (47%), Gaps = 60/445 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------ISSSKASQ 81
             V L+HRDS  +     +E   +RL+    R+   ++    N +       +S+ +   
Sbjct: 70  MHVRLLHRDS-FAVNATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128

Query: 82  ADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A ++   P + +Y+ +I++GTP  E L   DT SDL W QC+PC   +CY Q  P+FDP+
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPR 186

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG------SFSNGNLATETVTL 189
            S++Y  +   +  C +L +          C Y+V YGDG      S S G+L  ET+T 
Sbjct: 187 HSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTF 246

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGKFSYCLV 248
                QA     ++ GCG +N GLF +   GI+GL  G IS+  Q+        FSYCLV
Sbjct: 247 AGGVRQAY----LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQIAFLGYNASFSYCLV 302

Query: 249 -----PVS-STKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVST 298
                P S S+ + FG   + + P    TP        TFY + +  +SVG  R+ GV+ 
Sbjct: 303 DFISGPGSPSSTLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTE 362

Query: 299 PD-----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCY 344
            D           +++DSGTT+T L +  Y +   +  ++      V+   P+G  + CY
Sbjct: 363 RDLQLDPYTGHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCY 422

Query: 345 SFNSLS------QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYG 395
           +    +      +VP V++HF G  ++ L   N+ + V S   VC  F G  + SV + G
Sbjct: 423 TVGGRAGLRHCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIG 482

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           NI+Q  F V YDI  Q V F P  C
Sbjct: 483 NILQQGFRVVYDIGGQRVGFAPNSC 507


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 137/437 (31%), Positives = 211/437 (48%), Gaps = 46/437 (10%)

Query: 28  GFSVELIHR------DSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           G +++++HR      D    P ++      +R R  + RS+ R     + ++ +++  ++
Sbjct: 54  GSTLQIVHRACLQTGDDIAVPDHHHYTGILRRDRHRV-RSIYRRLTAAETTTTTTTIPAR 112

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
             +   +  Y++ I IGTPP     + DTGSDL W QC PCP S CY Q  PLFDP  SS
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSS 172

Query: 142 TYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           TY  +PCS+ +C    + Q  C   +C+YSV YGD S ++G+LA ET TL   +  A A 
Sbjct: 173 TYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232

Query: 200 PGITFGCGTNNGGLFNSK---TTGIVGLGGGDISLISQMRTTI---AGKFSYCLVPVSST 253
            G+ FGC      +FN       G++GLG GD S++SQ R +I    G FSYCL P  S+
Sbjct: 233 TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSS 292

Query: 254 KINFGTNGIVSGP-----GVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTPDI--- 301
                  G  + P      +  TPL    ++ ++ YV+ +  +SV    + +        
Sbjct: 293 TGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLG 352

Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PEV 355
            VIDSGT +T +P      L       + +  +  P GS++L   CY       V  P V
Sbjct: 353 AVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKML-PEGSMKLLDTCYDVTGQDVVTAPRV 411

Query: 356 TIHF-RGADVKLSRSNFFVKV-SED-------IVCSVFKGITNS--VPIYGNIMQTNFLV 404
            + F  GA + +  S   + + +ED       + C  F   TNS  + I GN+ Q  + V
Sbjct: 412 ALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFL-PTNSAGLVIVGNMQQRAYNV 470

Query: 405 GYDIEQQTVSFKPTDCT 421
            +D++   + F P  C+
Sbjct: 471 VFDVDGGRIGFGPNGCS 487


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/465 (30%), Positives = 222/465 (47%), Gaps = 67/465 (14%)

Query: 8   VFILFFLCFYVVSPIEAQT----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           V +L     Y   P+ +          V L H D+ K    + SE     +R A+ RS  
Sbjct: 7   VLVLAIASLYYACPVASAAFVGDDDVRVALKHVDAGKQ--LSRSEL----IRRAMQRSKA 60

Query: 64  RLNHFN--QNSSISSSKASQAD-----------IIPN-NANYLIRISIGTPPTERLAVAD 109
           R    +  +N + S+  + + D           + P+ +  Y++ ++IGTPP    A+ D
Sbjct: 61  RAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLD 120

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQY 168
           TGSDLIWTQC PC  + C  Q  PLF P  S++Y+ + C+   C+ +    C   + C Y
Sbjct: 121 TGSDLIWTQCAPC--ASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTY 178

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
             +YGDG+ + G  ATE  T  S+ G  +    + FGCG+ N G  N+  +GIVG G   
Sbjct: 179 RYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNG-SGIVGFGRNP 237

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPLTKA---K 277
           +SL+SQ+      +FSYCL    S +        ++ G  G  +GP V +TPL ++    
Sbjct: 238 LSLVSQLSIR---RFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGP-VQTTPLLQSLQNP 293

Query: 278 TFYVLTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
           TFY + +  ++VG +RL +        PD    +++DSGT LT LP    + ++      
Sbjct: 294 TFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQ 353

Query: 328 IEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFV-KV 375
           +   P A+  G+ E  +C+         S  S   VP +  HF+ AD+ L R N+ +   
Sbjct: 354 LRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDH 411

Query: 376 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +  +C +     +     GN++Q +  V YD+E +T+SF P  C
Sbjct: 412 RKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  176 bits (446), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 146/443 (32%), Positives = 204/443 (46%), Gaps = 57/443 (12%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK--- 78
           +   T   SV L H D+  S F ++S     +LR  L R   R+      +++S+ +   
Sbjct: 57  VSESTTSLSVHLSHVDALSS-FSDASPVDLFKLR--LQRDSLRVKSITSLAAVSTGRNAT 113

Query: 79  ------------ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                       A  + +   +  Y +R+ +GTP T    V DTGSD++W QC PC    
Sbjct: 114 KRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KA 171

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNL 182
           CY Q   +FDPK S T+ ++PC S  C  L+  S C       C Y VSYGDGSF+ G+ 
Sbjct: 172 CYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDF 231

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           +TET+T          +  +  GCG +N GLF      ++GLG G +S  SQ ++   GK
Sbjct: 232 STETLTF-----HGARVDHVPLGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKSRYNGK 285

Query: 243 FSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGN 291
           FSYCLV  +S+         I FG + +      V TPL    K  TFY L +  ISVG 
Sbjct: 286 FSYCLVDRTSSGSSSKPPSTIVFGNDAVPKTS--VFTPLLTNPKLDTFYYLQLLGISVGG 343

Query: 292 QRL-GVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
            R+ GVS             ++IDSGT++T L Q     L             A      
Sbjct: 344 SRVPGVSESQFKLDATGNGGVIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLF 403

Query: 341 ELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNI 397
           + C+  + ++  +VP V  HF G +V L  SN+ + V +E   C  F G   S+ I GNI
Sbjct: 404 DTCFDLSGMTTVKVPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNI 463

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q  F V YD+    V F    C
Sbjct: 464 QQQGFRVAYDLVGSRVGFLSRAC 486


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 129/395 (32%), Positives = 203/395 (51%), Gaps = 61/395 (15%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
           L H++  S+  SS    A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC  
Sbjct: 59  LLHYSTLST--SSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPC-- 114

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNL 182
             C+ QD+P++D   SS++  LPCSS+ C  +    CS     C+Y  +Y DG++     
Sbjct: 115 KLCFGQDTPIYDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAY----- 169

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQMRTTIAG 241
                   S     +++ GI FGCG +NGGL +NS  TG VGLG G +SL++Q+     G
Sbjct: 170 --------SPECAGISVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQLG---VG 216

Query: 242 KFSYCLVPVSSTKIN----FGTNGIVSGPG-------VVSTPLTKA---KTFYVLTIDAI 287
           KFSYCL    +T ++    FG+   ++          V STPL ++    + Y ++++ I
Sbjct: 217 KFSYCLTDFFNTSLSSPVFFGSLAELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGI 276

Query: 288 SVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
           S+G+ RL +            +  +++DSGT  T L + G+   +  V    +  QPV +
Sbjct: 277 SLGDARLPIPNGTFDLNDDDGSGGMIVDSGTIFTILVETGFRVVVDHVAG--VLGQPVVN 334

Query: 336 PTGSLELCY-----SFNSLSQVPEVTIHFR-GADVKLSRSNF--FVKVSEDIVCSVFKGI 387
            +     C+         L  +P++ +HF  GAD++L R N+  F +       ++    
Sbjct: 335 ASSLDRPCFPAPAAGVQELPDMPDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTE 394

Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           + S  + GN  Q N  + +DI    +SF PTDC+K
Sbjct: 395 SASGSVLGNFQQQNIQMLFDITVGQLSFMPTDCSK 429


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 131/371 (35%), Positives = 188/371 (50%), Gaps = 38/371 (10%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  ++ ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP 
Sbjct: 80  AARILVLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            SSTY+SL CS+  C +L    C    C Y   YGD + + G LA ET T G T    V 
Sbjct: 138 NSSTYRSLGCSAPACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFG-TNDTRVT 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
           LP I+FGCG  N G   +  +G+VG G G +SL+SQ+ +    +FSYCL     PV S +
Sbjct: 197 LPRISFGCGNLNAGSL-ANGSGMVGFGRGSLSLVSQLGSP---RFSYCLTSFLSPVRS-R 251

Query: 255 INFGTNGIV---SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------- 297
           + FG    +   +   V STP        T Y L +  ISVG  RL +            
Sbjct: 252 LYFGAYATLNSTNASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDG 311

Query: 298 TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--- 351
           T   +IDSGTT+T+L +  Y +   + +  +    P+ D T +  L+ C+ +    +   
Sbjct: 312 TGGTIIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSV 371

Query: 352 -VPEVTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            +P++ +HF GAD +L   N+  V  S   +C +    ++   I G+    NF V YD+E
Sbjct: 372 TLPQLVLHFDGADWELPLQNYMLVDPSTGGLC-LAMATSSDGSIIGSYQHQNFNVLYDLE 430

Query: 410 QQTVSFKPTDC 420
              +SF P  C
Sbjct: 431 NSLLSFVPAPC 441


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 139/435 (31%), Positives = 199/435 (45%), Gaps = 53/435 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
           +SVE++HRD+       ++   Y+R       R+A     L R + R    N++      
Sbjct: 74  WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133

Query: 78  KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             ++ D          +   +  Y  RI +GTP  E+  V DTGSD+ W QCEPC   +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S+++ ++ C S+ C+ L+   C    C Y  SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G+T+   VA+     GCG  N GLF      ++GLG G +S  +Q+ T     FSYCL
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTGHTFSYCL 305

Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI 301
           V     SS  + FG   +  G   + TPL K     TFY L++ AISVG   L    P++
Sbjct: 306 VDRESDSSGPLQFGPKSVPVGS--IFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEV 363

Query: 302 ------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
                       +IDSGT +T L       +     +     P  D     + CY  + L
Sbjct: 364 FRIDETSGHGGFIIDSGTVVTRLVTSAYDAVRDAFVAGTGQLPRTDAVSIFDTCYDLSGL 423

Query: 350 S--QVPEVTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVG 405
               VP V  HF  GA + L   N+ + +      C  F    +SV I GN  Q +  V 
Sbjct: 424 QFVSVPTVGFHFSNGASLILPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVS 483

Query: 406 YDIEQQTVSFKPTDC 420
           +D     V F    C
Sbjct: 484 FDSANSLVGFAFDQC 498


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 123/367 (33%), Positives = 183/367 (49%), Gaps = 41/367 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +   +GTPP +   + D+GSDL+W QC PC   QCY QDSPL+ P  SST+  +P
Sbjct: 61  SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPC--RQCYAQDSPLYVPSNSSTFSPVP 118

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C SS C  +        +      C Y   Y D S S G  A E+ T+       V +  
Sbjct: 119 CLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-----DGVRIDK 173

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS-STKIN 256
           + FGCG++N G F +   G++GLG G +S  SQ+      KF+YCLV    P S S+ + 
Sbjct: 174 VAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSSLI 232

Query: 257 FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----DI------VI 303
           FG   I +   +  TP+    K+ T Y + I+ ++VG + L +S      D+      + 
Sbjct: 233 FGDELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIF 292

Query: 304 DSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF- 359
           DSGTTLT+  P  Y S++L+   S +   P A+    L+LC     + Q   P  TI F 
Sbjct: 293 DSGTTLTYWFPSAY-SHILAAFDSGVH-YPRAESVQGLDLCVELTGVDQPSFPSFTIEFD 350

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVSFK 416
            GA  +    N+FV V+ ++ C    G+ + +  +   GN++Q NF V YD E+  + F 
Sbjct: 351 DGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIGFA 410

Query: 417 PTDCTKQ 423
           P  C+  
Sbjct: 411 PAKCSSH 417


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  176 bits (445), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 136/417 (32%), Positives = 204/417 (48%), Gaps = 36/417 (8%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ---NSSISSSKASQAD 83
           G  S++L+HR  P +P + +S  P     + L R   R++   Q   + +++SS      
Sbjct: 59  GSSSLKLVHRFGPCNP-HRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKS 117

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
            +P         ++Y++ + IGTP  E   + DTGS LIWTQC+PC    CY +  P+FD
Sbjct: 118 SVPFYGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPC--KACYPK-VPVFD 174

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  S+++K LPCSS  C S+ Q  CS   C Y  +Y D S S G LATET++    +   
Sbjct: 175 PTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLK 230

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTK 254
                I  GC     G  +   +GI+GL    ISL SQ        FSYC+   P S+  
Sbjct: 231 YDFKNILIGCSDQVSGE-SLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGH 289

Query: 255 INFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDI----VIDSGTT 308
           + FG  G V    V  +P++K    + Y + +  ISVG ++L +          IDSG  
Sbjct: 290 LTFG--GKVPN-DVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAV 346

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---D 363
           LT LP    S L SV   M++  P+ D    L+ CY F++ S V  P +++ F G    D
Sbjct: 347 LTRLPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMD 406

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + +S   + V  S+ + C  F  + + V I+GN  Q  + V +D  ++ + F P  C
Sbjct: 407 IDVSGIMWQVPGSK-VYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 129/364 (35%), Positives = 183/364 (50%), Gaps = 37/364 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 138 GSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYDQSGQVFDPRRSRSYGAV 195

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            CS+  C  L+   C      C Y V+YGDGS + G+ ATET+T     G  VA   I  
Sbjct: 196 GCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR--IAL 251

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVS-STKIN 256
           GCG +N GLF +    ++GLG G +S  +Q+       FSYCLV       P S S+ + 
Sbjct: 252 GCGHDNEGLFVAAAG-LLGLGRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVT 310

Query: 257 FGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
           FG+  + S      TP+ K    +TFY + +  ISVG  R+ GV+  D           +
Sbjct: 311 FGSGAVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADSDLRLDPSSGRGGV 370

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTIH 358
           ++DSGT++T L +   S L     +      ++    SL + CY  S   + +VP V++H
Sbjct: 371 IVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSMH 430

Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F  GA+  L   N+ + V S+   C  F G    V I GNI Q  F V +D + Q V F 
Sbjct: 431 FAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFV 490

Query: 417 PTDC 420
           P  C
Sbjct: 491 PKGC 494


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 179/365 (49%), Gaps = 38/365 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C++  C  L+   C      C Y V+YGDGS + G+ ATET+T  S       +P +  
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---------VSSTKI 255
           GCG +N GLF +    ++GLG G +S  SQ+       FSYCLV            S+ +
Sbjct: 257 GCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTV 315

Query: 256 NFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD----------- 300
            FG+  +        TP+ K    +TFY + +  ISVG  R+ GV+  D           
Sbjct: 316 TFGSGAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPSTGRGG 375

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLS--QVPEVTI 357
           +++DSGT++T L +   + L     +      ++    SL + CY  + L   +VP V++
Sbjct: 376 VIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPTVSM 435

Query: 358 HFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           HF  GA+  L   N+ + V S    C  F G    V I GNI Q  F V +D + Q + F
Sbjct: 436 HFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRLGF 495

Query: 416 KPTDC 420
            P  C
Sbjct: 496 VPKGC 500


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 133/416 (31%), Positives = 209/416 (50%), Gaps = 53/416 (12%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-----------IIPNNANYLIRISI 97
           T  Q L + L R   R+      + ++  K  +A            ++  +  Y +R+ +
Sbjct: 1   THEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGL 60

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GTP      V DTGSDL W QC+PC    CY Q  P+FDP+ SS+++ +PC S  C +L 
Sbjct: 61  GTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIFDPRNSSSFQRIPCLSPLCKALE 118

Query: 158 QKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
             SCSG       C Y V+YGDGSFS G+ +++  TLG T  +A++   + FGCG +N G
Sbjct: 119 VHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSKAMS---VAFGCGFDNEG 174

Query: 213 LFNSKTTGIVGLGGGDISLISQM-----RTTIAGKFSYCLV----PV--SSTKINFGTNG 261
           L  +   G++GLG G +S  SQ+      ++ A  FSYCLV    P+  SS+ + FG   
Sbjct: 175 L-FAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSSLIFGVAA 233

Query: 262 IVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTT 308
           I S   +  +PL    K  TFY   +  +SVG  +L +S             ++IDSGT+
Sbjct: 234 IPSTAAL--SPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGGVIIDSGTS 291

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVK 365
           +T  P    + +     +     P A      + CY+F+  +   VP + +HF  GAD++
Sbjct: 292 VTRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLHFENGADLQ 351

Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L  +N+ + + +    C  F   +  + I GNI Q +F +G+D+++  ++F P  C
Sbjct: 352 LPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFAPQQC 407


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  175 bits (443), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 123/355 (34%), Positives = 178/355 (50%), Gaps = 38/355 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P +    V DTGSD+ W QC PC  + CY Q  P+F+P  S++Y  L 
Sbjct: 141 SGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPC--ADCYHQADPIFEPASSTSYSPLS 198

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TET+TLGS +   VA+     GCG
Sbjct: 199 CDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI-----GCG 253

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
            NN GLF      ++GLGGG +S  SQ+    A  FSYCLV     S++ + F +  +  
Sbjct: 254 HNNEGLFIGAAG-LLGLGGGKLSFPSQIN---ASSFSYCLVDRDSDSASTLEFNSALL-- 307

Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTL 309
            P  ++ PL + +   TFY + +  +SVG + L +  P+            I+IDSGT +
Sbjct: 308 -PHAITAPLLRNRELDTFYYVGMTGLSVGGELLSI--PESMFEMDESGNGGIIIDSGTAV 364

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-KL 366
           T L     + L        +  PV       + CY  +  +  +VP VT H  G  V  L
Sbjct: 365 TRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFHLAGGKVLPL 424

Query: 367 SRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +N+ + V  D   C  F   ++++ I GN+ Q    VG+D+    V F+P  C
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  175 bits (443), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 129/426 (30%), Positives = 201/426 (47%), Gaps = 40/426 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL--NHFNQNSSISSSKASQADII 85
           G   +L H DS +   +  +E   + +  +  R+  +L  +       +++  AS + ++
Sbjct: 30  GLRADLTHIDSGRG--FTRNELLRRMVLRSRARAAKQLCPSRSGTPVRVTAPVASGSHVV 87

Query: 86  PNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                YLI   IGTP  +++A+  DTGSD++WTQC PC    C+ Q  P FD   S T  
Sbjct: 88  -GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVH 144

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            + C+   C +L   +C    C Y V+YGD S + G LA ++ T     G  V +P + F
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--- 258
           GCG  N G F+S  TGI G G G +SL  Q+  +    FSYC   +    ST +  G   
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVS---SFSYCFTTIFESKSTPVFLGGAP 261

Query: 259 TNGI---VSGPGVVSTP-LTKAKTFYVLTIDAISVGNQRLGVSTPDIV----------ID 304
            +G+    +GP ++STP L     +Y L++  I+VG  RL V     V          ID
Sbjct: 262 ADGLRAHATGP-ILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIID 320

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLEL-CYSFNSLSQ-----VPEVTI 357
           SGT +T  P+    +L     + +     + + TG   L C+S  S+       VP++T+
Sbjct: 321 SGTAITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTL 380

Query: 358 HFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           H  GAD +L R N+  +  + D +C V     +   + GN  Q N  + +D+    +  +
Sbjct: 381 HLEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIE 440

Query: 417 PTDCTK 422
           P  C K
Sbjct: 441 PAQCDK 446


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 129/371 (34%), Positives = 177/371 (47%), Gaps = 49/371 (13%)

Query: 90  NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           NY+  IS+G    +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY +
Sbjct: 143 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 200

Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           + C++S CA           S          C Y+++YGDGSFS G LAT+TV LG  + 
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS- 259

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
               L G  FGCG +N GLF   T G++GLG  ++SL+SQ  +   G FSYCL P +++ 
Sbjct: 260 ----LGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCL-PAATSG 313

Query: 255 INFGTNGIVSGPGVVS-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVST 298
              G+  +  G    S     TP+   +         FY L +   +VG   L   G+  
Sbjct: 314 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 373

Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQ--VPE 354
            +++IDSGT +T L P  Y +     M     A   A P  S L+ CY      +  VP 
Sbjct: 374 SNVLIDSGTVITRLAPSVYRAVRAEFMRQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPL 433

Query: 355 VTIHFR-GADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIE 409
           +T+    GADV +  +     V +D   VC     ++  +  PI GN  Q N  V YD  
Sbjct: 434 LTLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTL 493

Query: 410 QQTVSFKPTDC 420
              + F   DC
Sbjct: 494 GSRLGFADEDC 504


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  174 bits (442), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 128/434 (29%), Positives = 201/434 (46%), Gaps = 56/434 (12%)

Query: 29  FSVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
             V + HRD+  P  P         QRL     R  + ++   +  S   S       IP
Sbjct: 27  LHVPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IP 80

Query: 87  -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
             +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ 
Sbjct: 81  FESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRR 138

Query: 146 LPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +PCSS QC +L    C     +G  C+Y V+YGDGS S G LAT+ +   + T     + 
Sbjct: 139 VPCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDT----YVN 194

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS--STKINFG 258
            +T GCG +N GLF+S   G++G+  G IS+ +Q+       F YCL   +  ST+ ++ 
Sbjct: 195 NVTLGCGRDNEGLFDS-AAGLLGVARGKISISTQVAPAYGSVFEYCLGDRTSRSTRSSYL 253

Query: 259 TNGIVSGP------GVVSTPLTKAKTFYVLTIDAISVGNQR--------LGVSTP----D 300
             G    P       ++S P  +  + Y + +   SVG +R        L + T      
Sbjct: 254 VFGRTPEPPSTAFTALLSNP--RRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGG 311

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 355
           +V+DSGT ++   +   + L     +   A  +    G     + CY       +  P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371

Query: 356 TIHFR-GADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            +HF  GAD+ L   N+F+       + +    C  F+   + + + GN+ Q  F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 408 IEQQTVSFKPTDCT 421
           +E++ + F P  CT
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 138/416 (33%), Positives = 197/416 (47%), Gaps = 61/416 (14%)

Query: 54  LRDALTRSLNRLNHF-----NQNSSISSSKASQADIIPNNA------NYLIRISIG---- 98
           LR  L    +R N F     N  ++ +S+++  A++   +       NY+  I++G    
Sbjct: 137 LRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSS 196

Query: 99  -TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASL 156
            +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY ++ C++S C ASL
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAAVRCNASACAASL 254

Query: 157 NQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
                   SC G N  C Y+++YGDGSFS G LAT+TV LG  +     L G  FGCG +
Sbjct: 255 KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LDGFVFGCGLS 309

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV 269
           N GLF   T G++GLG  ++SL+SQ      G FSYCL   +S       +G +S  G  
Sbjct: 310 NRGLFGG-TAGLMGLGRTELSLVSQTALRYGGVFSYCLPATTSGD----ASGSLSLGGDA 364

Query: 270 S-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVSTPDIVIDSGTTLTFLP 313
           S     TP+   +         FY L +   +VG   L   G+   +++IDSGT +T L 
Sbjct: 365 SSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGASNVLIDSGTVITRLA 424

Query: 314 QGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSR 368
                 + +  +    A   P A     L+ CY      +  VP +T+    GA+V +  
Sbjct: 425 PSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRLEGGAEVTVDA 484

Query: 369 SNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +     V +D   VC     ++  +  PI GN  Q N  V YD     + F   DC
Sbjct: 485 AGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDC 540


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  174 bits (442), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 136/426 (31%), Positives = 199/426 (46%), Gaps = 59/426 (13%)

Query: 39  PKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP----------- 86
           P+   Y      Y+ L    L R   R N       ++    S++D+ P           
Sbjct: 88  PRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLS 147

Query: 87  ---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
                     +  Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP
Sbjct: 148 TPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDP 205

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             SSTY  + C S QC+SL   SC    C Y V+YGDGS++ G+ ATE+V+ G++     
Sbjct: 206 TASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG---- 261

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTK 254
           ++  +  GCG +N GLF     G++GLGGG +SL +Q++ T    FSYCLV      S+ 
Sbjct: 262 SVKNVALGCGHDNEGLF-VGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSST 317

Query: 255 INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD----------- 300
           ++F  N    G   V+ PL K +   TFY + +  +SVG Q   VS P+           
Sbjct: 318 LDF--NSAQLGVDSVTAPLMKNRKIDTFYYVGLSGMSVGGQM--VSIPESTFRLDESGNG 373

Query: 301 -IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 356
            I++D GT +T L  Q YN  L      M +   +       + CY  +  +  +VP V+
Sbjct: 374 GIIVDCGTAITRLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVS 432

Query: 357 IHF-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
            HF  G    L  +N+ + V S    C  F   T+S+ I GN+ Q    V +D+    + 
Sbjct: 433 FHFADGKSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMG 492

Query: 415 FKPTDC 420
           F P  C
Sbjct: 493 FSPNKC 498


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 125/378 (33%), Positives = 187/378 (49%), Gaps = 44/378 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
            YL+ ++ GTPP E L +ADTGSDLIW QC     PP+ C  +     P F    S+T  
Sbjct: 53  QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112

Query: 145 SLPCSSSQCASL-----NQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +PCS++QC  +     +  SCS    V C Y+  Y DGS + G LA +T T+ + T   
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
            A+ G+ FGCGT N G   S T G++GLG G +S  +Q  +  A  FSYCL+ +   +  
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 232

Query: 257 FGTNGIVSG-----PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------- 301
             ++ +  G          TPL     A TFY + + AI VGN+ L V   +        
Sbjct: 233 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 292

Query: 302 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV--- 352
              VIDSG+TLT+L  G   +L+S  ++ +    +         LELCY+ +S S +   
Sbjct: 293 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLAPA 352

Query: 353 ----PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
               P +TI F +G  ++L   N+ V V++D+ C   +   +  +  + GN+MQ  + V 
Sbjct: 353 NGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVE 412

Query: 406 YDIEQQTVSFKPTDCTKQ 423
           +D     + F  T+C   
Sbjct: 413 FDRASARIGFARTECVAH 430


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 176/361 (48%), Gaps = 37/361 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q  P+F+P  S T+ ++P
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPC--KVCYNQSDPVFNPAKSKTFATVP 190

Query: 148 CSSSQCASLNQKS-CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T        VAL    
Sbjct: 191 CGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVAL---- 246

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------I 255
            GCG +N GLF      ++GLG G +S  SQ +    GKFSYCLV  +S+         I
Sbjct: 247 -GCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 304

Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-GVSTPD----------IV 302
            FG NG V    V +  LT  K  TFY L +  ISVG  R+ GVS             ++
Sbjct: 305 VFG-NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 363

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR 360
           IDSGT++T L Q     L             A      + C+  + ++  +VP V  HF 
Sbjct: 364 IDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFT 423

Query: 361 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           G +V L  SN+ + V ++   C  F G   S+ I GNI Q  F V YD+    V F    
Sbjct: 424 GGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 483

Query: 420 C 420
           C
Sbjct: 484 C 484


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 131/361 (36%), Positives = 174/361 (48%), Gaps = 37/361 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q   +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189

Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T          +  + 
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------I 255
            GCG +N GLF      ++GLG G +S  SQ +    GKFSYCLV  +S+         I
Sbjct: 245 LGCGHDNEGLFVGAAG-LLGLGRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTI 303

Query: 256 NFGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRL-GVSTPD----------IV 302
            FG N  V    V +  LT  K  TFY L +  ISVG  R+ GVS             ++
Sbjct: 304 VFG-NAAVPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVI 362

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR 360
           IDSGT++T L Q     L             A      + C+  + ++  +VP V  HF 
Sbjct: 363 IDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHFG 422

Query: 361 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           G +V L  SN+ + V +E   C  F G   S+ I GNI Q  F V YD+    V F    
Sbjct: 423 GGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSRA 482

Query: 420 C 420
           C
Sbjct: 483 C 483


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 140/435 (32%), Positives = 202/435 (46%), Gaps = 62/435 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           FS++L  RDS     +N+    Y+ L    L+R  +R+         + S+  ++D+ P 
Sbjct: 76  FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131

Query: 87  -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
                               +  Y  R+ +G P      V DTGSD+ W QC+PC  + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+FDP+ SS++ SLPC S QC +L    C    C Y VSYGDGSF+ G   TET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETL 249

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G++      +  +  GCG +N GLF      +   GG  +SL SQM+   A  FSYCL
Sbjct: 250 TFGNSG----MINDVAVGCGHDNEGLFVGSAGLLGLGGGP-LSLTSQMK---ASSFSYCL 301

Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---- 300
           V   S+  +       +    V+ PL K+    TFY + +  +SVG Q L +  P+    
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSI-PPNLFQM 360

Query: 301 -------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSL 349
                  I++DSGT +T L  Q YN    ++  + +   P    T    L   CY  +S 
Sbjct: 361 DDSGYGGIIVDSGTAITRLQTQAYN----TLRDAFVSRTPYLKKTNGFALFDTCYDLSSQ 416

Query: 350 SQV--PEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
           S+V  P V+  F G   ++L   N+ + V S    C  F   T+S+ I GN+ Q    V 
Sbjct: 417 SRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVH 476

Query: 406 YDIEQQTVSFKPTDC 420
           YD+    V F P  C
Sbjct: 477 YDLANSVVGFSPHKC 491


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 134/428 (31%), Positives = 201/428 (46%), Gaps = 51/428 (11%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-------- 83
            L+HRD      ++ + T  + L   L R   R    +  +  ++               
Sbjct: 77  RLVHRDD-----FSVNATAAELLAYRLERDAKRAARLSAAAGPANGTRRGGGGVVAPVVS 131

Query: 84  -IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +
Sbjct: 132 GLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYEQSGQVFDPRRSRS 189

Query: 143 YKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           Y ++ C++  C  L+   C      C Y V+YGDGS + G+ ATET+T     G  VA  
Sbjct: 190 YNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR- 246

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------ 254
            +  GCG +N GLF +    ++GLG G +S  +Q+       FSYCLV  +S+       
Sbjct: 247 -VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRS 304

Query: 255 --INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GVSTPD-------- 300
             + FG+  + S      TP+ K    +TFY + +  ISVG  R+ GV+  D        
Sbjct: 305 STVTFGSGAVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANSDLRLDPSSG 364

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPE 354
              +++DSGT++T L +   S L            ++    SL + CY  S   + +VP 
Sbjct: 365 RGGVIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPT 424

Query: 355 VTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           V++HF  GA+  L   N+ + V S+   C  F G    V I GNI Q  F V +D + Q 
Sbjct: 425 VSMHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 484

Query: 413 VSFKPTDC 420
           V+F P  C
Sbjct: 485 VAFTPKGC 492


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  174 bits (441), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 134/389 (34%), Positives = 182/389 (46%), Gaps = 32/389 (8%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
           RL   L R  N   H  ++ +   S A Q  ++   +     Y +R+ IG PP++   V 
Sbjct: 107 RLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSD+ W QC PC  S+CY Q  P+FDP  S++Y  + C   QC SL+   C    C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLY 224

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
            VSYGDGS++ G  ATETVTLGS   + VA+     GCG NN GLF     G++GLGGG 
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGSAAVENVAI-----GCGHNNEGLF-VGAAGLLGLGGGK 278

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTID 285
           +S  +Q+  T    FSYCLV   S  ++             + PL +     TFY L + 
Sbjct: 279 LSFPAQVNAT---SFSYCLVNRDSDAVSTLEFNSPLPRNAATAPLMRNPELDTFYYLGLK 335

Query: 286 AISVGNQRLGVSTPDIVI----------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
            ISVG + L +      +          DSGT +T L       L        +  P A+
Sbjct: 336 GISVGGEALPIPESSFEVDAIGGGGIIIDSGTAVTRLRSEVYDALRDAFVKGAKGIPKAN 395

Query: 336 PTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKV-SEDIVCSVFKGITNSV 391
                + CY  +S   V   T+ FR   G ++ L   N+ + V S    C  F   T+S+
Sbjct: 396 GVSLFDTCYDLSSRESVEIPTVSFRFPEGRELPLPARNYLIPVDSVGTFCFAFAPTTSSL 455

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            I GN+ Q    VG+DI    V F    C
Sbjct: 456 SIIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 125/362 (34%), Positives = 185/362 (51%), Gaps = 40/362 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + +S+GTPP    A+ DTGSDL WTQC PC  + C+ Q +PL+DP  SST+  LPC+S
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154

Query: 151 SQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---LPGITFG 205
             C +L    ++C+   C Y   Y  G F+ G LA +T+ +G   G   A     G+ FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFG 213

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFGTNGI 262
           C T NGG  +   +GIVGLG   +SL+SQ+     G+FSYCL       ++ I FG    
Sbjct: 214 CSTANGGDMDGA-SGIVGLGRSALSLLSQIGV---GRFSYCLRSDADAGASPILFGALAN 269

Query: 263 VSGPGVVSTPL-------TKAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDS 305
           V+G  V ST L        +   +Y + +  I+VG+  L V++            +++DS
Sbjct: 270 VTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDS 329

Query: 306 GTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFR 360
           GTT T+L + GY       LS  + ++    V+      +LC+   +  + VP +   F 
Sbjct: 330 GTTFTYLAEAGYTMLRQAFLSQTAGLLTR--VSGAQFDFDLCFEAGAADTPVPRLVFRFA 387

Query: 361 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            GA+  + R ++F  V E   V  +    T  V + GN+MQ +  V YD++  T SF P 
Sbjct: 388 GGAEYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAPA 447

Query: 419 DC 420
           DC
Sbjct: 448 DC 449


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 206/412 (50%), Gaps = 54/412 (13%)

Query: 54  LRDALTRSLNRLNHFN--QNSSISSSKASQ---ADIIP----NNANYLIRISIGTPPTER 104
           +R A+ RS  R    +  +N +  S K  Q   A ++P     +  Y++ ++IGTPP   
Sbjct: 50  IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A+ DTGSDLIWTQC PC  + C  Q  PLF P  S++Y+ + C+ + C+ +   SC   
Sbjct: 110 SALLDTGSDLIWTQCAPC--ASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERP 167

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT--FGCGTNNGGLFNSKTTGI 221
           + C Y  +YGDG+ + G  ATE  T  S+ G  +    +   FGCG+ N G  N+  +GI
Sbjct: 168 DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNG-SGI 226

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--------INFGTNGIVSGPGVVSTPL 273
           VG G   +SL+SQ+      +FSYCL   +S +        ++ G  G  +G  V +TPL
Sbjct: 227 VGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGSLSDGVYGDATG-RVQTTPL 282

Query: 274 TKA---KTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNL 320
            ++    TFY +    ++VG +RL +        PD    +++DSGT LT LP    + +
Sbjct: 283 LQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEV 342

Query: 321 LSVMSSMIEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRS 369
           +      +   P A+  G+ E  +C+         S  S   VP + +HF+GAD+ L R 
Sbjct: 343 VRAFRQQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRR 400

Query: 370 NFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           N+ +       +C +     +     GN++Q +  V YD+E +T+S  P  C
Sbjct: 401 NYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 132/419 (31%), Positives = 199/419 (47%), Gaps = 40/419 (9%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ---ADII 85
           + ++L HRD  K P     + P +R ++ ++R   R++   +  S  S +      +D++
Sbjct: 71  WKLKLFHRD--KLPLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVV 127

Query: 86  ----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +RI +G+PP  +  V D+GSD++W QC+PC  S+CY Q  P+FDP  S+
Sbjct: 128 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPC--SECYQQSDPVFDPAGSA 185

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY  + C SS C  L+   C+   C+Y VSYGDGS++ G LA ET+T G      V +  
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRN 240

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG 258
           I  GCG  N G+F      ++GLGGG +S + Q+     G FSYCLV     S+  + FG
Sbjct: 241 IAIGCGHMNRGMFIGAAG-LLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFG 299

Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP----------DIVIDS 305
              +  G   V  PL    +A +FY + +  + VG  R+ +              +V+D+
Sbjct: 300 RGAMPVGAAWV--PLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDT 357

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
           GT +T LP                  P +D     + CY+ N     +VP V+ +F G  
Sbjct: 358 GTAVTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGP 417

Query: 364 V-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  L   NF + V  E   C  F    + + I GNI Q    +  D     V F PT C
Sbjct: 418 ILTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 140/466 (30%), Positives = 210/466 (45%), Gaps = 76/466 (16%)

Query: 12  FFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN 71
            F C  +++   A++     +L H DS +        T ++ LR  + RS  RL      
Sbjct: 19  LFPCVLLLTFSLAESAALRADLTHVDSGRG------FTKHELLRRMVARSKARL------ 66

Query: 72  SSISSSKASQADIIP--------NNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPC 122
           +S+ SS    A   P         ++ YLI + IGTP  +R+ +  DTGSDL+WTQC  C
Sbjct: 67  ASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-C 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSF 177
             + C+ Q  P+F   +S T+  +PCS   C     L    C+  +  C Y+  Y D S 
Sbjct: 126 --TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSI 183

Query: 178 SNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + G +A +T T  +      A A+P I FGCG  N GLF    +GI G G G +SL SQ+
Sbjct: 184 TTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQL 243

Query: 236 RTTIAGKFSYCLVPVSSTKIN-------------FGTNGIVS---GPGVVSTPLTKAKTF 279
           +     +FSYC   +  ++++               T  I S    PG    P+  ++ F
Sbjct: 244 KVR---RFSYCFTAMEESRVSPVILGGEPENIEAHATGPIQSTPFAPGPAGAPV-GSQPF 299

Query: 280 YVLTIDAISVGNQRL----------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 329
           Y L++  ++VG  RL          G  +    IDSGT +TF PQ    +L     + + 
Sbjct: 300 YFLSLRGVTVGETRLPFNASTFALKGDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVP 359

Query: 330 ---AQPVADPTGSLELCYSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED----- 378
              A+   DP     LC+S  +  +   VP++ +H  GAD +L R N+ +   +D     
Sbjct: 360 LPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLILHLEGADWELPRENYVLDNDDDGSGAG 417

Query: 379 -IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             +C V     NS   I GN  Q N  + YD+E   + F P  C K
Sbjct: 418 RKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLESNKMVFAPARCDK 463


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  174 bits (440), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 124/360 (34%), Positives = 184/360 (51%), Gaps = 40/360 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ IG+P   +  V DTGSD+ W QC PC    CY Q+  +FDP+ SS+++ L 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQAVALPGIT 203
           CS+ QC  L+ K+C+  +  C Y VSYGDGSF+ G+LA++  +V+ G T+        + 
Sbjct: 69  CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-------PVV 121

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VSSTKINFG 258
           FGCG +N GLF      ++GLG G +S  SQ+ +    KFSYCLV       +S+ + FG
Sbjct: 122 FGCGHDNEGLFVGAAG-LLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177

Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
            + + +      T L    K  TFY   +  IS+G   L + +             ++ID
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
           SGT++T LP    + +     S  +  P A      + CY F++L+ V  P V+ HF  G
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297

Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A V+L  SN+ V V +    C  F   +  + I GNI Q    V  D++   V F P  C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  174 bits (440), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 124/360 (34%), Positives = 184/360 (51%), Gaps = 40/360 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ IG+P   +  V DTGSD+ W QC PC    CY Q+  +FDP+ SS+++ L 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET--VTLGSTTGQAVALPGIT 203
           CS+ QC  L+ K+C+  +  C Y VSYGDGSF+ G+LA+++  V+ G T+        + 
Sbjct: 69  CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-------PVV 121

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VSSTKINFG 258
           FGCG +N GLF      ++GLG G +S  SQ+ +    KFSYCLV       +S+ + FG
Sbjct: 122 FGCGHDNEGLFVGAAG-LLGLGAGKLSFPSQLSSR---KFSYCLVSRDNGVRASSALLFG 177

Query: 259 TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTP-----------DIVID 304
            + + +      T L    K  TFY   +  IS+G   L + +             ++ID
Sbjct: 178 DSALPTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIID 237

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 361
           SGT++T LP    + +     S  +  P A      + CY F++L+ V  P V+ HF  G
Sbjct: 238 SGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFHFEGG 297

Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A V+L  SN+ V V +    C  F   +  + I GNI Q    V  D++   V F P  C
Sbjct: 298 ASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 141/462 (30%), Positives = 225/462 (48%), Gaps = 57/462 (12%)

Query: 5   LSCVFILFFLCFYVV------SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           +S  + LFF     +      S ++ +     ++L H  S KSP  NS+   +  +    
Sbjct: 1   MSLFWFLFFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSP-PNSTSLLFAYM---F 56

Query: 59  TRSLNRLNHFNQN-SSISSSKASQADIIPNNA-------------NYLIRISIGTPPTER 104
            +   R+ +F+   +  S + AS   + P  A             NY +++ +G+P    
Sbjct: 57  AKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYY 116

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC-----SSSQCASLNQK 159
             + DTGS   W QC+PC    C++Q+ P+F+P  S TYK++PC     SS + A+LN+ 
Sbjct: 117 TMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEP 175

Query: 160 SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           +CS  +  C Y  SYGD SFS G L+ + +TL  T  Q   L    +GCG +N GLF  +
Sbjct: 176 TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQ--TLSSFVYGCGQDNQGLFG-R 230

Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------INFGTNGIVSGPGVVS 270
           T GI+GL   ++S++SQ+       FSYCL    ST        ++ GT+ +        
Sbjct: 231 TDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKF 290

Query: 271 TPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSV 323
           TPL K     + Y + +++I+V  + LGV+        +IDSGT +T LP    + L + 
Sbjct: 291 TPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNA 350

Query: 324 MSSMIEAQPVADPTGS-LELCY--SFNSLSQV-PEVTIHFR-GADVKLSRSNFFVKVSED 378
             +++  +    P  S L+ C+  S   +S+V P++ I F+ GAD++L   N  V++   
Sbjct: 351 YVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG 410

Query: 379 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           I C    G ++S+ I GN  Q    V YD+    V F P  C
Sbjct: 411 ITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 129/363 (35%), Positives = 175/363 (48%), Gaps = 39/363 (10%)

Query: 90  NYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           NY+  I++G    + L V  DTGSDL W QCEPCP S CY Q  PLFDP  S T+ ++PC
Sbjct: 179 NYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPC 238

Query: 149 SSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            S  CA+ + K  +G               C Y++SYGDGSFS G LA +T+ LG+TT  
Sbjct: 239 GSPACAA-SLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTT-- 295

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-- 253
              L G  FGCG +N GLF   T G++GLG  D+SL+SQ      G FSYCL P ++T  
Sbjct: 296 --KLDGFVFGCGLSNRGLFGG-TAGLMGLGRTDLSLVSQTAARFGGVFSYCL-PATTTST 351

Query: 254 -KINFGTNGIVSGPGVVSTPLTKAKT---FYVLTIDAISVGNQRL----GVSTPDIVIDS 305
             ++ G     S P +  T +    T   FY + I   +VG        G    ++++DS
Sbjct: 352 GSLSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGFGAGNVLVDS 411

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GA 362
           GT +T L       + +  +   E  P A     L+ CY      +  VP +T+    GA
Sbjct: 412 GTVITRLAPSVYKAVRAEFARRFE-YPAAPGFSILDACYDLTGRDEVNVPLLTLTLEGGA 470

Query: 363 DVKLSRSNFFVKVSED--IVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            V +  +     V +D   VC     +   +  PI GN  Q N  V YD     + F   
Sbjct: 471 QVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVGSRLGFADE 530

Query: 419 DCT 421
           DCT
Sbjct: 531 DCT 533


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score =  173 bits (439), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 181/355 (50%), Gaps = 50/355 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y   I++G+PP +   V DTGSDL W +C+PC P  C    S  FD   S+TYK+L C+ 
Sbjct: 3   YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCAD 57

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTN 209
                            YS  YGDGSF+ G+L+ +T+ + G+ + +    PG  FGCG+ 
Sbjct: 58  ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV------PVSSTKINFGTNGI- 262
             GL  S   GI+ L  G +S  SQ+      KFSYCL+       +  + + FG   + 
Sbjct: 102 LKGLI-SGEVGILALSPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVE 160

Query: 263 VSGPG------VVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGTT 308
           +  PG      +  TP+ ++  +Y + +D ISVGNQRL +S            + DSGTT
Sbjct: 161 LKEPGSGKLQELQYTPIGESSIYYTVRLDGISVGNQRLDLSPSAFLNGQDKPTIFDSGTT 220

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFR-GADVK 365
           LT LP G   ++   ++SM+         G L+ C+    +S   +P++T HF  GAD  
Sbjct: 221 LTMLPPGVCDSIKQSLASMVSGAEFVAIKG-LDACFRVPPSSGQGLPDITFHFNGGADFV 279

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              SN+ + +   + C +F   TN V I+GN+ Q +F V +D++ + + FK TDC
Sbjct: 280 TRPSNYVIDLGS-LQCLIFV-PTNEVSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 123/355 (34%), Positives = 176/355 (49%), Gaps = 24/355 (6%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I+P    Y++ + +GTP  +     DTGSDL WTQCEPC    C+ Q+ P FDP  S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPC-LGGCFPQNQPKFDPTTST 189

Query: 142 TYKSLPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +YK++ CSS  C  + +     + C    C Y + YG G ++ G LATET+ + S+    
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSD--- 245

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
                  FGC   + G FN  TTG++GLG   I+L SQ        FSYCL P S +   
Sbjct: 246 -VFKNFLFGCSEESRGTFNG-TTGLLGLGRSPIALPSQTTNKYKNLFSYCL-PASPSSTG 302

Query: 257 FGTNGIVSGPGVVSTPLT-KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLP 313
             + G+       STP++ K K  Y L    ISV  + L +  S    +IDSGTT TFLP
Sbjct: 303 HLSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLP 362

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGA-DVKLSR 368
               S L S    M+    + + T S + CY F+++      +P ++I F G  +V++  
Sbjct: 363 SPTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDV 422

Query: 369 SNFFVKVSE-DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S   + V+    VC  F   G  +   I+GN  Q  + V YD+ +  V F P  C
Sbjct: 423 SGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 167/351 (47%), Gaps = 25/351 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N  NY++ I +GTP      V DTGSD  W QC+PC  + CY Q  PLF P  S+TY ++
Sbjct: 161 NTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV-AYCYQQKEPLFTPTKSATYANI 219

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+SS C+ L+ + CSG +C Y+V YGDGS++ G  A +T+TLG  T     +    FGC
Sbjct: 220 SCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGC 274

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF----GTNGI 262
           G  N GLF  K  G++GLG G  S+  Q     +G F+YC +P +S+   F         
Sbjct: 275 GEKNRGLFG-KAAGLMGLGRGKTSVPVQAYDKYSGVFAYC-IPATSSGTGFLDFGPGAPA 332

Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYN 317
            +   +    +    TFY + +  I VG   L +     S    ++DSGT +T LP    
Sbjct: 333 AANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFSDAGALVDSGTVITRLPPSAY 392

Query: 318 SNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGA---DVKLSRS 369
             L S  +  +E      A     L+ CY          +P V++ F+G    DV  S  
Sbjct: 393 EPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACLDVDASGI 452

Query: 370 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +   VS+  +          + I GN  Q  + V YD+ ++ V F P  C
Sbjct: 453 LYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 139/435 (31%), Positives = 201/435 (46%), Gaps = 62/435 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           FS++L  RDS     +N+    Y+ L    L+R  +R+         + S+  ++D+ P 
Sbjct: 76  FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131

Query: 87  -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
                               +  Y  R+ +G P      V DTGSD+ W QC+PC  + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+FDP+ SS++ SLPC S QC +L    C    C Y VSYGDGSF+ G    ET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETL 249

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G++      +  +  GCG +N GLF      +   GG  +SL SQM+   A  FSYCL
Sbjct: 250 TFGNSG----MINNVAVGCGHDNEGLFVGSAGLLGLGGGS-LSLTSQMK---ASSFSYCL 301

Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---- 300
           V   S+  +       +    V+ PL K+    TFY + +  +SVG Q L +  P+    
Sbjct: 302 VDRDSSSSSDLEFNSAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSI-PPNLFQM 360

Query: 301 -------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSL 349
                  I++DSGT +T L  Q YN    ++  + +   P    T    L   CY  +S 
Sbjct: 361 DDSGYGGIIVDSGTAITRLQTQAYN----TLRDAFVSRTPYLKKTNGFALFDTCYDLSSQ 416

Query: 350 SQV--PEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
           S+V  P V+  F G   ++L   N+ + V S    C  F   T+S+ I GN+ Q    V 
Sbjct: 417 SRVTIPTVSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVH 476

Query: 406 YDIEQQTVSFKPTDC 420
           YD+    V F P  C
Sbjct: 477 YDLANSVVGFSPHKC 491


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 170/352 (48%), Gaps = 32/352 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP++   + DTGSD+ W QC PC  + CY Q  P+F+P  S+++ +L 
Sbjct: 146 SGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPC--ADCYQQADPIFEPASSASFSTLS 203

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C++ QC SL+   C    C Y VSYGDGS++ G+  TET+TLGS     VA+     GCG
Sbjct: 204 CNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI-----GCG 258

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
            NN GLF      +   GG  +S  SQ+  T    FSYCLV   S   +         P 
Sbjct: 259 HNNEGLFVGAAGLLGLGGGS-LSFPSQINAT---SFSYCLVDRDSESASTLEFNSTLPPN 314

Query: 268 VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLTFL 312
            VS PL +     TFY + +  +SVG +   VS P+            +++DSGT +T L
Sbjct: 315 AVSAPLLRNHHLDTFYYVGLTGLSVGGEL--VSIPESAFQIDESGNGGVIVDSGTAITRL 372

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRS 369
                ++L           P  +     + CY  +S    +VP V+ HF  G ++ L   
Sbjct: 373 QTDVYNSLRDAFVKRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAK 432

Query: 370 NFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           N+ V + SE   C  F    +S+ I GN+ Q    V YD+    V F P  C
Sbjct: 433 NYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 124/360 (34%), Positives = 180/360 (50%), Gaps = 40/360 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
           GF ++L H D+       +S T  Q L  A+ RS  R+      +     +    A++  
Sbjct: 28  GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+TY
Sbjct: 82  VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           ++LPC SS+CASL+  SC    C Y   YGD + + G LA ET T G+     V    I 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199

Query: 204 FGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINFG- 258
           FGCG+ N G L NS  +G+VG G G +SL+SQ+  +   +FSYCL   +  + +++ FG 
Sbjct: 200 FGCGSLNAGDLANS--SGMVGFGRGPLSLVSQLGPS---RFSYCLTSYLSATPSRLYFGV 254

Query: 259 -----TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPD 300
                +    SG  V STP          Y L++ AIS+G + L +           T  
Sbjct: 255 YANLSSTNTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGG 314

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 360
           ++IDSGT++T+L Q     +   + S I    + D    L+ C+ +     V      FR
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLTAMNDTDIGLDTCFQWPPPPNVTVTVPDFR 374


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/356 (33%), Positives = 177/356 (49%), Gaps = 36/356 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P++DP +S++Y ++
Sbjct: 159 GSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATV 216

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C S +C  L+  +C  S  +C Y V+YGDGS++ G+ ATET+TLG +      +  +  
Sbjct: 217 GCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS----APVSNVAI 272

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNG 261
           GCG +N GLF      ++ LGGG +S  SQ+  T    FSYCLV     SS+ + FG   
Sbjct: 273 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQISATT---FSYCLVDRDSPSSSTLQFGD-- 326

Query: 262 IVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
             S    V+ PL ++    TFY + +  ISVG + L +           +  +++DSGT 
Sbjct: 327 --SEQPAVTAPLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTA 384

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVK 365
           +T L  G    L        ++ P A      + CY     S  QVP V + F  G ++K
Sbjct: 385 VTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALWFEGGGELK 444

Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L   N+ + V +    C  F G +  V I GN+ Q    V +D  + TV F    C
Sbjct: 445 LPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 137/434 (31%), Positives = 207/434 (47%), Gaps = 52/434 (11%)

Query: 25  QTGGFSVELIHRDSPKSPF--YNSSETPYQRLRDALTRSL-NRLNHF----NQNSSISSS 77
           + G   +E+ H+DS       +N     +  + D   RSL +R+       N + S+ + 
Sbjct: 62  ENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAP 121

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + I     NY++ + +G    +   + DTGSDL W QC+PC   +CY Q  P+F+P
Sbjct: 122 IPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--KRCYNQQDPVFNP 177

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLG 190
             S +Y+++ CSS  C SL   + + GV      +C Y V+YGDGS++ G L TE + LG
Sbjct: 178 STSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDLG 237

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           ++T    A+    FGCG NN GLF    +G+VGLG   +SLISQ      G FSYCL P+
Sbjct: 238 NST----AVNNFIFGCGRNNQGLFGG-ASGLVGLGRSSLSLISQTSAMFGGVFSYCL-PI 291

Query: 251 SSTKIN----FGTNGIVSGPGVVSTPLTKAKT-------FYVLTIDAISVGNQRLGVSTP 299
           + T+ +     G N  V      +TP++  +        FY L +  I+VG+  + V  P
Sbjct: 292 TETEASGSLVMGGNSSVYKN---TTPISYTRMIPNPQLPFYFLNLTGITVGS--VAVQAP 346

Query: 300 D-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-- 352
                 ++IDSGT +T LP      L           P A     L+ C++ +   +V  
Sbjct: 347 SFGKDGMMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEI 406

Query: 353 PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYD 407
           P + +HF G    +V ++   +FVK     VC     ++  N V I GN  Q N  V YD
Sbjct: 407 PNIKMHFEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYD 466

Query: 408 IEQQTVSFKPTDCT 421
            +   + F    CT
Sbjct: 467 TKGSMLGFAAEACT 480


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  172 bits (436), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 193/393 (49%), Gaps = 43/393 (10%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
           D +TR L+ L   N ++  ++S A Q  ++      +  Y  R+ IG+P  +   V DTG
Sbjct: 129 DGVTR-LD-LRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTG 186

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYS 169
           SD+ W QC+PC  + CY Q  P+FDP +S++Y ++ C S +C  L+  +C      C Y 
Sbjct: 187 SDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYE 244

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
           V+YGDGS++ G+ ATET+TLG +T     +  +  GCG +N GLF      ++ LGGG +
Sbjct: 245 VAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAG-LLALGGGPL 299

Query: 230 SLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVL 282
           S  SQ+    A  FSYCLV    P +ST + FG     +  G V+ PL ++    TFY +
Sbjct: 300 SFPSQIS---ASTFSYCLVDRDSPAAST-LQFGDGAAEA--GTVTAPLVRSPRTSTFYYV 353

Query: 283 TIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
            +  ISVG Q L +            +  +++DSGT +T L     + L         + 
Sbjct: 354 ALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSL 413

Query: 332 PVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGI 387
           P        + CY  +  +  +VP V++ F G   ++L   N+ + V      C  F   
Sbjct: 414 PRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPT 473

Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +V I GN+ Q    V +D  +  V F P  C
Sbjct: 474 NAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 506


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 141/419 (33%), Positives = 205/419 (48%), Gaps = 32/419 (7%)

Query: 22  IEAQTGGFSVELIHRDSPKS--PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA 79
           +   +G  +V L HR  P S  P  N+        RD L  +     +   N S    + 
Sbjct: 50  VAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEG 109

Query: 80  SQADIIP------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           S   +        +   YLI + +G+P   +  + DTGSD+ W QC+PC  SQC+ Q   
Sbjct: 110 SDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPC--SQCHSQADS 167

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           LFDP  SSTY +  C+S+ CA L Q+ CS   CQY+V YGDGS  +G  +++T+ LGS+T
Sbjct: 168 LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSST 227

Query: 194 GQAVALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
                +    FGC  + +G L   +T G++GLGGG  SL +Q   T    FSYCL P   
Sbjct: 228 -----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282

Query: 253 TKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----VIDS 305
           +   F T G  +   VV TP+   T+  ++Y + + AI VG ++L +         ++DS
Sbjct: 283 SS-GFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDS 341

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD 363
           GT +T LP+   S L S   + ++  P A P G  + C+ F+  S V  P V + F G  
Sbjct: 342 GTIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGA 401

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V    S+  +  S    C  F   ++  S+ I GN+ Q  F V YD+    V FK   C
Sbjct: 402 VVDLASDGIILGS----CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  172 bits (435), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 177/358 (49%), Gaps = 37/358 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ IG+P  E   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y ++
Sbjct: 165 GSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAV 222

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C S +C  L+  +C      C Y V+YGDGS++ G+ ATET+TLG +T     +  +  
Sbjct: 223 SCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST----PVTNVAI 278

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTN 260
           GCG +N GLF      ++ LGGG +S  SQ+    A  FSYCLV    P +ST + FG +
Sbjct: 279 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQIS---ASTFSYCLVDRDSPAAST-LQFGAD 333

Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----------STPDIVIDSG 306
           G  +    V+ PL ++    TFY + +  ISVG Q L +            +  +++DSG
Sbjct: 334 GAEA--DTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSG 391

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD- 363
           T +T L     + L         + P        + CY  +  +  +VP V++ F G   
Sbjct: 392 TAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGA 451

Query: 364 VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           ++L   N+ + V      C  F     +V I GN+ Q    V +D  +  V F P  C
Sbjct: 452 LRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNKC 509


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  171 bits (434), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 130/356 (36%), Positives = 175/356 (49%), Gaps = 34/356 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI IGTP  E+  V DTGSD++W QCEPC   +CY Q  P+F+P  S ++ ++ 
Sbjct: 5   SGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--RECYSQADPIFNPSSSVSFSTVG 62

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S+ C+ L+   C G  C Y VSYGDGS++ G+ ATET+T G+T+ Q VA+     GCG
Sbjct: 63  CDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAI-----GCG 117

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VSSTKINFGTNGIVS 264
            +N GLF      ++GLG G +S  +Q+ T     FSYCLV     SS  + FG   +  
Sbjct: 118 HDNVGLFVGAAG-LLGLGAGSLSFPAQLGTQTGRAFSYCLVDRDSESSGTLEFGPESVPI 176

Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD-------------IVIDSGTT 308
           G   + TPL       TFY L++ AISVG   L  S P              I+IDSGT 
Sbjct: 177 GS--IFTPLVANPFLPTFYYLSMVAISVGGVILD-SVPSEAFRIDETTGRGGIIIDSGTA 233

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVK 365
           +T L       L     +  +  P AD     + CY  ++L  V  P V  HF  GA   
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFHFSNGAGFI 293

Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L   N  + + S    C  F    +++ I GNI Q    V +D     V F    C
Sbjct: 294 LPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 131/380 (34%), Positives = 193/380 (50%), Gaps = 56/380 (14%)

Query: 84  IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           ++ N+A  Y + +SIGTPP     +ADTGS LIWTQC PC  ++C  + +P F P  SST
Sbjct: 82  LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139

Query: 143 YKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +  LPC+SS C  L     +C+   C Y   YG G F+ G LATET+ +G       + P
Sbjct: 140 FSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVG-----GASFP 193

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINF 257
           G+ FGC T NG    + ++GIVGLG   +SL+SQ+     G+FSYCL        + I F
Sbjct: 194 GVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---GRFSYCLRSDADAGDSPILF 248

Query: 258 GTNGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVSTPDI----------- 301
           G+   V+G  V STPL +     + ++Y + +  I+VG   L V++              
Sbjct: 249 GSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLV 308

Query: 302 ---VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSL---SQ 351
              ++DSGTTLT+L  +GY     + +S M  A       G+    +LC+   +    S 
Sbjct: 309 GGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSG 368

Query: 352 VPEVTIHFR---GADVKLSRSNFFVKVSED------IVCSVFKGITN--SVPIYGNIMQT 400
           VP  T+  R   GA+  + R ++   V+ D      + C +    +   S+ I GN+MQ 
Sbjct: 369 VPVPTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQM 428

Query: 401 NFLVGYDIEQQTVSFKPTDC 420
           +  V YD++    SF P DC
Sbjct: 429 DLHVLYDLDGGMFSFAPADC 448


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 144/433 (33%), Positives = 201/433 (46%), Gaps = 58/433 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           FS++L     P+    N     Y+ L    L R   R+N  N    ++ S  +++D+ P 
Sbjct: 77  FSLQL----HPRETLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPT 132

Query: 88  N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                                   Y  R+ +G P      V DTGSD+ W QC+PC  S 
Sbjct: 133 ETELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPC--SD 190

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
           CY Q  P+FDP  SS+Y  L C + QC  L   +C    C Y VSYGDGSF+ G   TET
Sbjct: 191 CYQQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTET 250

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
           V+ G+ +   VA+     GCG +N GLF   + G++GLGGG +SL SQ++ T    FSYC
Sbjct: 251 VSFGAGSVNRVAI-----GCGHDNEGLF-VGSAGLLGLGGGPLSLTSQIKAT---SFSYC 301

Query: 247 LVPVSSTKIN-FGTNGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTPD--- 300
           LV   S K +    N    G  VV+  L   K  TFY + +  +SVG + + V  P+   
Sbjct: 302 LVDRDSGKSSTLEFNSPRPGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVP-PETFA 360

Query: 301 --------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
                   +++DSGT +T L  Q YNS   +        +P A+     + CY  +SL  
Sbjct: 361 VDQSGAGGVIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRP-AEGVALFDTCYDLSSLQS 419

Query: 351 -QVPEVTIHFRGADV-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            +VP V+ HF G     L   N+ + V      C  F   T+S+ I GN+ Q    V +D
Sbjct: 420 VRVPTVSFHFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFD 479

Query: 408 IEQQTVSFKPTDC 420
           +    V F P  C
Sbjct: 480 LANSLVGFSPNKC 492


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 125/355 (35%), Positives = 180/355 (50%), Gaps = 38/355 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP  SSTY  + C
Sbjct: 18  GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTC 75

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
            S QC+SL   SC    C Y V+YGDGS++ G+ ATE+V+ G++     ++  +  GCG 
Sbjct: 76  QSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG----SVKNVALGCGH 131

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSG 265
           +N GLF     G++GLGGG +SL +Q++ T    FSYCLV      S+ ++F  N    G
Sbjct: 132 DNEGLF-VGAAGLLGLGGGPLSLTNQLKAT---SFSYCLVNRDSAGSSTLDF--NSAQLG 185

Query: 266 PGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLT 310
              V+ PL K +   TFY + +  +SVG Q   VS P+            I++D GT +T
Sbjct: 186 VDSVTAPLMKNRKIDTFYYVGLSGMSVGGQM--VSIPESTFRLDESGNGGIIVDCGTAIT 243

Query: 311 FLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKL 366
            L  Q YN  L      M +   +       + CY  +  +  +VP V+ HF  G    L
Sbjct: 244 RLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNL 302

Query: 367 SRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +N+ + V S    C  F   T+S+ I GN+ Q    V +D+    + F P  C
Sbjct: 303 PAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 138/428 (32%), Positives = 203/428 (47%), Gaps = 46/428 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 61  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 120

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G  +++ +TL 
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 239

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-- 247
             +G  V + G  FGC     G   + KT G++GLGG   SL+SQ        FSYCL  
Sbjct: 240 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARYGKSFSYCLPA 296

Query: 248 VPVSSTKINFGTNGIVSGPGV---VSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI 301
            P SS  +  G      G G     +TP+ ++K   T+Y   ++ I+VG ++LG+S P +
Sbjct: 297 TPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS-PSV 355

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
                ++DSGT +T LP    + L S   + +     A+P G L+ C++F  L +V  P 
Sbjct: 356 FAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 415

Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQT 412
           V + F G  V    ++  V       C  F    +  +    GN+ Q  F V YD+    
Sbjct: 416 VALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYDVGGGV 471

Query: 413 VSFKPTDC 420
             F+   C
Sbjct: 472 FGFRAGAC 479


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  171 bits (434), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 128/358 (35%), Positives = 174/358 (48%), Gaps = 37/358 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP  S ++ ++P
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIP 199

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITF 204
           C S  C  L+   CS     C Y VSYGDGSF+ G  +TET+T  G+  G+ V       
Sbjct: 200 CGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVV------L 253

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
           GCG +N GLF      ++GLG G +S  SQ+      KFSYCL   S++      + IV 
Sbjct: 254 GCGHDNEGLFVGAAG-LLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASS---RPSSIVF 309

Query: 265 GPGVVS-----TPLT---KAKTFYVLTIDAISVGNQRL-GVSTP----------DIVIDS 305
           G   +S     TPL    K  TFY + +  ISVG  R+ G+S             ++IDS
Sbjct: 310 GDSAISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDS 369

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGAD 363
           GT++T L +     L             A      + C+  +  ++  VP V +HFRGAD
Sbjct: 370 GTSVTRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGAD 429

Query: 364 VKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V L  SN+ + V      C  F G  + + I GNI Q  F V YD+    V F P  C
Sbjct: 430 VPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 119/415 (28%), Positives = 193/415 (46%), Gaps = 58/415 (13%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIPN-------NANYLIRISIGTPPTERLAV 107
           R+ L R   R     +++ + S +A+ A + P        +  YL+ ++IGTPP     +
Sbjct: 70  RELLHRMAARSK--ARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLI 127

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------ 161
            DTGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      
Sbjct: 128 LDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWG 185

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTT 219
           +G+ C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  T
Sbjct: 186 NGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNET 244

Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVS 264
           GI G   G +S+ +Q++      FSYC   ++ ++                  G +G+V 
Sbjct: 245 GIAGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQ 301

Query: 265 GPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQ 314
              ++    ++ K +Y+ ++  ++VG  RL +           T   ++DSGT +T LP+
Sbjct: 302 STALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPE 360

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNF 371
              + +     +  +   V + T SL +LC+S    +   VP + +HF GA + L R N+
Sbjct: 361 AVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENY 419

Query: 372 FVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             ++ E     + C         + + GN  Q N  V YD+    +SF P  C K
Sbjct: 420 MFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 473


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 139/434 (32%), Positives = 205/434 (47%), Gaps = 57/434 (13%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----------NSSISSSKASQ 81
            ++HRD+     + ++ T  + LR  L R   R    ++          N + S   A  
Sbjct: 72  RVVHRDA-----FAANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVA 126

Query: 82  ADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           A ++   A     Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q  P+FDP
Sbjct: 127 APVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC--RRCYDQSGPVFDP 184

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           + SS+Y ++ C++  C  L+   C      C Y V+YGDGS + G+ ATET+T     G 
Sbjct: 185 RRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG--GA 242

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            VA   +  GCG +N GLF +    ++GLG G +S  +Q+       FSYCLV  +S+  
Sbjct: 243 RVAR--VALGCGHDNEGLFVAAAG-LLGLGRGSLSFPTQISRRYGKSFSYCLVDRTSSSS 299

Query: 256 NFG---------TNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL-GVSTPD-- 300
           +           T G  S      TP+    + +TFY + +  ISVG  R+ GV+  D  
Sbjct: 300 SGAASRSRSSTVTFGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLR 359

Query: 301 ---------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NS 348
                    +++DSGT++T L +   S L     +      ++    SL + CY      
Sbjct: 360 LDPSTGRGGVIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRK 419

Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
           + +VP V++HF  GA+  L   N+ + V S    C  F G    V I GNI Q  F V +
Sbjct: 420 VVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVF 479

Query: 407 DIEQQTVSFKPTDC 420
           D + Q V F P  C
Sbjct: 480 DGDGQRVGFAPKGC 493


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  171 bits (433), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 120/413 (29%), Positives = 190/413 (46%), Gaps = 52/413 (12%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           + LR    RS  R       + +S      S  D +P+   YL+ ++IGTPP     + D
Sbjct: 71  ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 129

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
           TGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      +G
Sbjct: 130 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 187

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
           + C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  TGI
Sbjct: 188 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 246

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVSGP 266
            G   G +S+ +Q++      FSYC   ++ ++                  G +G+V   
Sbjct: 247 AGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQST 303

Query: 267 GVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGY 316
            ++    ++ K +Y+ ++  ++VG  RL +           T   ++DSGT +T LP+  
Sbjct: 304 ALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAV 362

Query: 317 NSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNFFV 373
            + +     +  +   V + T SL +LC+S    +   VP + +HF GA + L R N+  
Sbjct: 363 YNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMF 421

Query: 374 KVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           ++ E     + C         + + GN  Q N  V YD+    +SF P  C K
Sbjct: 422 EIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 473


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 176/344 (51%), Gaps = 28/344 (8%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC- 153
           + +GTP T+ + V DTGS L W QC PC  S C+ Q  P+F+PK SSTY S+ CS+ QC 
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQSGPVFNPKSSSTYASVGCSAQQCS 59

Query: 154 ----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
               A+LN  +CS  N C Y  SYGD SFS G L+ +TV+ GST+     LP   +GCG 
Sbjct: 60  DLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGV 268
           +N GLF  ++ G++GL    +SL+ Q+  ++   F+YCL    S+  +   +     PG 
Sbjct: 115 DNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFTYCL---PSSSSSGYLSLGSYNPGQ 170

Query: 269 VS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSN 319
            S TP+  +    + Y + +  ++V    L VS+        +IDSGT +T LP    S 
Sbjct: 171 YSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTIIDSGTVITRLPTSVYSA 230

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE 377
           L   +++ ++    A     L+ C+    S    P VT+ F  GA +KLS  N  V V +
Sbjct: 231 LSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDD 290

Query: 378 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
              C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 291 STTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  171 bits (432), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 131/434 (30%), Positives = 201/434 (46%), Gaps = 40/434 (9%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
           V +P +A     ++ ++H   P SP  +    P     + L R  +R++   +  +  ++
Sbjct: 52  VCTPTKAAPSSSALTVVHGHGPCSPQESRRGAPSHT--EILGRDQDRVDAIRRKVAAVTT 109

Query: 78  KASQADI--IP---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
            AS +    +P         +  NY   + +GTP T+ L   DTGSD  W QC+PCP   
Sbjct: 110 AASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--D 167

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNL 182
           CY Q   LFDP  SSTY  + CSS +C  L   ++ +CS    C Y ++Y D S++ GNL
Sbjct: 168 CYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNL 227

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           A +T+TL  T     A+PG  FGCG NN G F  +  G++GLG G  SL SQ+       
Sbjct: 228 ARDTLTLSPTD----AVPGFVFGCGHNNAGSFG-EIDGLLGLGRGKASLSSQVAARYGAG 282

Query: 243 FSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-- 296
           FSYCL   P ++  ++F      +      T +   +  +FY L +  I+V  + + V  
Sbjct: 283 FSYCLPSSPSATGYLSFSGAAAAAPTNAQFTEMVAGQHPSFYYLNLTGITVAGRAIKVPP 342

Query: 297 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLS 350
               +    +IDSGT  + LP    + L S + S +     A  +   + CY    +   
Sbjct: 343 SVFATAAGTIIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETV 402

Query: 351 QVPEVTIHFR-GADVKLSRSNF---FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY 406
           ++P V + F  GA V L  S     +  VS+  +  +      S+ + GN  Q    V Y
Sbjct: 403 RIPSVALVFADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIY 462

Query: 407 DIEQQTVSFKPTDC 420
           D++ Q V F    C
Sbjct: 463 DVDNQKVGFGANGC 476


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 120/413 (29%), Positives = 190/413 (46%), Gaps = 52/413 (12%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           + LR    RS  R       + +S      S  D +P+   YL+ ++IGTPP     + D
Sbjct: 45  ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 103

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
           TGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      +G
Sbjct: 104 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 161

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
           + C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  TGI
Sbjct: 162 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 220

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---------------INFGTNGIVSGP 266
            G   G +S+ +Q++      FSYC   ++ ++                  G +G+V   
Sbjct: 221 AGFSRGALSMPAQLKVD---NFSYCFTAITGSEPSPVFLGVPPNLYSDAAGGGHGVVQST 277

Query: 267 GVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTFLPQGY 316
            ++    ++ K +Y+ ++  ++VG  RL +           T   ++DSGT +T LP+  
Sbjct: 278 ALIRYHSSQLKAYYI-SLKGVTVGTTRLPIPESVFALKEDGTGGTIVDSGTGMTMLPEAV 336

Query: 317 NSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFRGADVKLSRSNFFV 373
            + +     +  +   V + T SL +LC+S    +   VP + +HF GA + L R N+  
Sbjct: 337 YNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLHFEGATLDLPRENYMF 395

Query: 374 KVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           ++ E     + C         + + GN  Q N  V YD+    +SF P  C K
Sbjct: 396 EIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLSFVPARCNK 447


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 172/354 (48%), Gaps = 30/354 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTPP     V DTGSD++W QC PC    CY Q  P+F+P  S ++  + 
Sbjct: 126 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 183

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C +  C  L    C+    C Y VSYGDGS++ G   TET+T   T  + VAL     GC
Sbjct: 184 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 238

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNGI 262
           G +N GLF      ++GLG G +S  SQ   T   KFSYCLV  S+    + + FG N  
Sbjct: 239 GHDNEGLFVGAAG-LLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFG-NSA 296

Query: 263 VSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-GVSTPD----------IVIDSGTTL 309
           VS     +  LT  +  TFY + +  ISVG   + G++             ++ID GT++
Sbjct: 297 VSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 356

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLS 367
           T L +     L     +   +   A      + CY  +  +  +VP V +HFRGADV L 
Sbjct: 357 TRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLP 416

Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            SN+ + V      C  F G T+ + I GNI Q  F V YD+    V F P  C
Sbjct: 417 ASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 470


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 129/380 (33%), Positives = 188/380 (49%), Gaps = 48/380 (12%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           SS + QA +      Y + IS+GTP      VADTGSDLIWTQC PC  ++C+ Q +P F
Sbjct: 71  SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128

Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            P  SST+  LPC+SS C  L  + ++C+   C Y+  YG G ++ G LATET+ +G   
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PV 250
               + P + FGC T NG    + T+GI GLG G +SLI Q+     G+FSYCL      
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQLGV---GRFSYCLRSGSAA 237

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI----- 301
            ++ I FG+   ++   V STP         ++Y + +  I+VG   L V+T        
Sbjct: 238 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQN 297

Query: 302 ------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---- 350
                 ++DSGTTLT+L + GY     + +S   +   V + T  L+LC+          
Sbjct: 298 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGI 356

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNF 402
            VP + + F G   + +   +F  V  D   SV       +P        + GN+MQ + 
Sbjct: 357 AVPSLVLRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDM 415

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            + YD++    SF P DC K
Sbjct: 416 HLLYDLDGGIFSFAPADCAK 435


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  171 bits (432), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 124/353 (35%), Positives = 173/353 (49%), Gaps = 34/353 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP+    V DTGSD+ W QC PC  ++CY Q  P+F+P  S+++ SL 
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPIFEPTSSASFTSLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TETVTLGST     +L  I  GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
            NN GLF      +   GG  +S  SQ+    A  FSYCLV     S++ ++F  N  ++
Sbjct: 261 HNNEGLFIGAAGLLGLGGGS-LSFPSQLN---ASSFSYCLVDRDSDSTSTLDF--NSPIT 314

Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTF 311
            P  V+ PL +     TF+ L +  +SVG   L +              I++DSGT +T 
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 368
           L     + L             A      + CY  +S S  +VP V+ HF  G ++ L  
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433

Query: 369 SNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N+ + V SE   C  F    +++ I GN  Q    VG+D+    V F P  C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 135/427 (31%), Positives = 203/427 (47%), Gaps = 53/427 (12%)

Query: 33  LIHRD----SPKSPFYNSSETPYQRLRDALTRSL-NRLNHF---NQNSSISSSKASQADI 84
           + HRD    S KS  +N        L D   RSL +R+      N   ++ S     + +
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                NY++ + IG        + DTGSDL W QC+PC    CY Q  PLF+P  S +Y+
Sbjct: 61  RLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPC--RLCYNQQDPLFNPSGSPSYQ 116

Query: 145 SLPCSSSQCASLNQKSCS----GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++ C+SS C SL   + +    G N   C Y V+YGDGS++ G+L  E + LG+T     
Sbjct: 117 TILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT----- 171

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
            +    FGCG NN GLF    +G++GLG  D+SL+SQ      G FSYCL    +T  + 
Sbjct: 172 HVSNFIFGCGRNNKGLFGG-ASGLMGLGKSDLSLVSQTSAIFEGVFSYCL---PTTAADA 227

Query: 258 GTNGIVSGPGVV---STPLTKAK--------TFYVLTIDAISVGNQRLGVSTPD-----I 301
             + I+ G   V   +TP++  +        TFY L +  IS+G   + +  P+     I
Sbjct: 228 SGSLILGGNSSVYKNTTPISYTRMIANPQLPTFYFLNLTGISIGG--VALQAPNYRQSGI 285

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           +IDSGT +T LP     +L +         P A P   L+ C++ N   +V  P + + F
Sbjct: 286 LIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQF 345

Query: 360 RG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
            G     V ++   +FVK     VC     ++  + +PI GN  Q N  V Y+ ++  + 
Sbjct: 346 EGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLG 405

Query: 415 FKPTDCT 421
           F    C+
Sbjct: 406 FAAEACS 412


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 137/418 (32%), Positives = 205/418 (49%), Gaps = 35/418 (8%)

Query: 31  VELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
           ++++H+  P S     + +E  Y  L+D  +R  +  +  +++S +S  KA+ A  +P  
Sbjct: 85  LKVVHKHGPCSDLRQGHKAEAQYILLQDQ-SRVDSIHSKLSKDSGLSDVKATAATTLPAK 143

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q   +F+P  S+
Sbjct: 144 DGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKS-CYNQKEAIFNPSQST 202

Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +Y ++ C S+ C SL     N  +C+   C Y + YGD SFS G    E ++L +T    
Sbjct: 203 SYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
                  FGCG NN GL      G++GLG   +SL+SQ        FSYCL P SS+   
Sbjct: 260 -VFNDFYFGCGQNNKGL-FGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCL-PSSSSSTG 316

Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTT 308
           F T G  +      TPL   +   +FY L +  ISVG ++L +     ST   +IDSGT 
Sbjct: 317 FLTFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTV 376

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVK 365
           +T LP    S L S    ++   P A     L+ C+ F++     VP++ + F G   V 
Sbjct: 377 ITRLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVD 436

Query: 366 LSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           + ++  F       VC  F G +++  V I+GN+ Q    V YD     V F P  C+
Sbjct: 437 IDKTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 113/295 (38%), Positives = 162/295 (54%), Gaps = 19/295 (6%)

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C S  C  L+   CS    C Y+  YGD S + G LA +T T  S TG+ V+L    FGC
Sbjct: 21  CDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFGC 80

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCLVPVS-----STKINFGTN 260
           G NN G FN    G++GLGGG  SLISQ+     G KFS CLVP       S++++FG  
Sbjct: 81  GHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKG 140

Query: 261 GIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLP 313
             V G GVV+TPL + +   T Y +T+  ISV +  L     +   ++++DSGT    LP
Sbjct: 141 SQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILP 200

Query: 314 QG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
           Q  Y+   + V +++       DP+   +LCY   +  + P +T HF GA++ L+    F
Sbjct: 201 QQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTPIQTF 260

Query: 373 VKVSED---IVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
           +  + +   + C      TNS   +YGN  Q+N+L+G+D+++Q VSFK TDCTKQ
Sbjct: 261 IPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATDCTKQ 315


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 116/363 (31%), Positives = 179/363 (49%), Gaps = 40/363 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y   + +GTPPT  L V DTGSD++W QC+PC    CY Q SPL+DP+ SSTY   P
Sbjct: 96  SGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTP 153

Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CS  QC   N ++C G    C Y + YGD S ++GNLAT+ +   + T    ++  +T G
Sbjct: 154 CSPPQCR--NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT----SVGNVTLG 207

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTN 260
           CG +N GLF S   G++G+  G+ S  +Q+  +    F+YCL        SS+ + FG  
Sbjct: 208 CGHDNEGLFGS-AAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGSSSSYLVFGRT 266

Query: 261 GIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------GVSTPDIVIDS 305
                P  V TPL    +  + Y + +   SVG + +                  +V+DS
Sbjct: 267 A-PEPPSSVFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDS 325

Query: 306 GTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR 360
           GT++T F    Y +  +     ++ +  + V       + CY     +++  P V +HF 
Sbjct: 326 GTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFA 385

Query: 361 -GADVKLSRSNFFV-KVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
            GADV L   N+ V + S    C   +    + + + GN++Q  F V +D+E + V F+P
Sbjct: 386 GGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENERVGFEP 445

Query: 418 TDC 420
             C
Sbjct: 446 NGC 448


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 123/378 (32%), Positives = 183/378 (48%), Gaps = 44/378 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
            YL+ ++ GTPP E L +ADTGSDLIW QC     PP+ C  +     P F    S+T  
Sbjct: 52  QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111

Query: 145 SLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +PCS++QC  +      G        V C Y+  Y DGS + G LA +T T+ + T   
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
            A+ G+ FGCGT N G   S T G++GLG G +S  +Q  +  A  FSYCL+ +   +  
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQSGSLFAQTFSYCLLDLEGGRRG 231

Query: 257 FGTNGIVSG-----PGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------- 301
             ++ +  G          TPL     A TFY + + AI VGN+ L V   +        
Sbjct: 232 RSSSFLFLGRPERRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAIDVLGN 291

Query: 302 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQ---- 351
              VIDSG+TLT+L  G   +L+S  ++ +    +         LELCY+ +S S     
Sbjct: 292 GGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSAPA 351

Query: 352 ---VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
               P +TI F +G  ++L   N+ V V++D+ C   +   +  +  + GN+MQ  + V 
Sbjct: 352 NGGFPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVE 411

Query: 406 YDIEQQTVSFKPTDCTKQ 423
           +D     + F  T+C   
Sbjct: 412 FDRASARIGFARTECVAH 429


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/367 (32%), Positives = 179/367 (48%), Gaps = 41/367 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +   +GTPP +   + D+GSDL+W QC PC   QCY QD+PL+ P  SST+  +P
Sbjct: 62  SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVP 119

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C S +C  +        +      C Y   Y D S S G  A E+ T+       V +  
Sbjct: 120 CLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD-----VRIDK 174

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS-STKIN 256
           + FGCG +N G F +   G++GLG G +S  SQ+      KF+YCLV    P S S+ + 
Sbjct: 175 VAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYLDPTSVSSWLI 233

Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VI 303
           FG   I +   +  TP+   ++  T Y + I+ + VG + L +S              + 
Sbjct: 234 FGDELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIF 293

Query: 304 DSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
           DSGTT+T+ LP  Y  N+L+     +   P A     L+LC     + Q   P  TI   
Sbjct: 294 DSGTTVTYWLPPAYR-NILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIVLG 351

Query: 361 GADV-KLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVSFK 416
           G  V +  + N+FV V+ ++ C    G+ +SV  +   GN++Q NFLV YD E+  + F 
Sbjct: 352 GGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFA 411

Query: 417 PTDCTKQ 423
           P  C+  
Sbjct: 412 PAKCSSH 418


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  170 bits (430), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 173/355 (48%), Gaps = 30/355 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  GTP      + DTGSD+ W QC PC    CY Q  P+FDP  S+TY  +
Sbjct: 131 DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSVV 189

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC   QCA+ +   CS   C Y V YGDGS S G L+ ET++L ST     ALPG  FGC
Sbjct: 190 PCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTR----ALPGFAFGC 245

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
           G  N G F     G++GLG G +SL SQ   +  G FSYCL   ++T   +  G     S
Sbjct: 246 GQTNLGDFG-DVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPAS 304

Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS----TPD-IVIDSGTTLTFL-PQG 315
              V  T + + +   +FY + + +I +G   L V     T D   +DSGT LT+L P+ 
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPEA 364

Query: 316 YNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF- 372
           Y +  +      +  +  P  DP    + CY F   S +    + F+ +D  +   +FF 
Sbjct: 365 YTALRDRFKFTMTQYKPAPAYDP---FDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421

Query: 373 VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + +  D     I C  F    +++P  I GN+ Q N  V YD+  + + F    C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 197/394 (50%), Gaps = 60/394 (15%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
           SSS   QA +      Y + IS+GTPP +   + DTGS+LIW QC PC  ++C+ +   +
Sbjct: 75  SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
           P+  P  SST+  LPC+ S C  L      ++C+    C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T+G  T      P + FGC T NG      ++GIVGLG G +SL+SQ+     G+FSYCL
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCL 240

Query: 248 ----VPVSSTKINFGT-NGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVS 297
                   ++ I FG+   +  G  V STPL K       T Y + +  I+V +  L V+
Sbjct: 241 RSDMADGGASPILFGSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVT 300

Query: 298 TPDI-----------VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLEL 342
                          ++DSGTTLT+L + GY        S M+++ +  P +     L+L
Sbjct: 301 GSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDL 360

Query: 343 CYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSED------IVCSVFKGITNS 390
           CY  ++       +VP + + F  GA   +   N+F  V  D      + C +    T+ 
Sbjct: 361 CYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD 420

Query: 391 VP--IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +P  I GN+MQ +  + YDI+    SF P DC K
Sbjct: 421 LPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 182/366 (49%), Gaps = 34/366 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S++Y+++ 
Sbjct: 147 SGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMASTSYRNVT 204

Query: 148 CSSSQC-------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C  ++C       A    +S     C Y   YGD S + G+LA E  T+  T   +  + 
Sbjct: 205 CGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVD 264

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINF 257
           G+  GCG  N GLF+     ++GLG G +S  SQ+R      FSYCLV   S   +KI F
Sbjct: 265 GVVLGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAVGSKIVF 323

Query: 258 G-TNGIVSGPGVVST---PLTKAKTFYVLTIDAISVGNQRL-------GVSTPD----IV 302
           G  N ++S P +  T   P     TFY + +  I VG + L       GVS  D     +
Sbjct: 324 GDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGTI 383

Query: 303 IDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 359
           IDSGTTL++ P+  Y +   + +  M +A P+      L  CY+ + +   +VPE ++ F
Sbjct: 384 IDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVPEFSLLF 443

Query: 360 -RGADVKLSRSNFFVKV-SEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA       N+F+++ +E I+C    G   S + I GN  Q NF V YD+    + F 
Sbjct: 444 ADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRLGFA 503

Query: 417 PTDCTK 422
           P  C +
Sbjct: 504 PRRCAE 509


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 128/425 (30%), Positives = 193/425 (45%), Gaps = 45/425 (10%)

Query: 29  FSVELIHRDS-PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
           +++ L+HRD  P   + N     + R+R    R    L   +    ++SS +        
Sbjct: 59  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFG 118

Query: 82  ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +D++      +  Y +RI +G+PP ++  V D+GSD++W QC+PC    CY Q  P+FDP
Sbjct: 119 SDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDP 176

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             S +Y  + C SS C  +    C    C+Y V YGDGS++ G LA ET+T   T  + V
Sbjct: 177 AKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNV 236

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTK 254
           A+     GCG  N G+F      ++G+GGG +S + Q+     G F YCLV     S+  
Sbjct: 237 AM-----GCGHRNRGMFIGAAG-LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGS 290

Query: 255 INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD----------- 300
           + FG   +  G   V  PL    +A +FY + +  + VG  R  +  PD           
Sbjct: 291 LVFGREALPVGASWV--PLVRNPRAPSFYYVGLKGLGVGGVR--IPLPDGVFDLTETGDG 346

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTI 357
            +V+D+GT +T LP G  +       S     P A      + CY  +     +VP V+ 
Sbjct: 347 GVVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSF 406

Query: 358 HF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           +F  G  + L   NF + V +    C  F      + I GNI Q    V +D     V F
Sbjct: 407 YFTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGF 466

Query: 416 KPTDC 420
            P  C
Sbjct: 467 GPNVC 471


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 139/435 (31%), Positives = 205/435 (47%), Gaps = 61/435 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLR-DALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           FS+EL     P+   +  S   Y+ L    L R   R+   N    ++ S   ++D++P 
Sbjct: 80  FSLEL----HPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPM 135

Query: 88  N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           +                       Y +R+ IG P      V DTGSD+ W QC+PC    
Sbjct: 136 DTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPC--DD 193

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
           CY Q  P+FDP  SS++  L C + QC +L+  +C   +C Y VSYGDGS++ G+ ATET
Sbjct: 194 CYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATET 253

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
           V+ G++     ++  +  GCG +N GLF      ++GLGGG +SL SQ++   A  FSYC
Sbjct: 254 VSFGNSG----SVDKVAIGCGHDNEGLFVGAAG-LIGLGGGPLSLTSQIK---ASSFSYC 305

Query: 247 LV---PVSSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL------ 294
           LV    V S+ + F +         V+ P+   +K  TFY + I  +SVG ++L      
Sbjct: 306 LVNRDSVDSSTLEFNS---AKPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSI 362

Query: 295 ----GVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 349
               G     I++D GT +T L  Q YN+ L      + +  P        + CY+ +S 
Sbjct: 363 FEVDGSGKGGIIVDCGTAVTRLQTQAYNA-LRDTFVKLTKDLPSTSGFALFDTCYNLSSR 421

Query: 350 S--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 405
           +  +VP V   F G   + L  SN+ + V S    C  F   T S+ I GN+ Q    V 
Sbjct: 422 TSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVT 481

Query: 406 YDIEQQTVSFKPTDC 420
           YD+    VSF    C
Sbjct: 482 YDLANSQVSFSSRKC 496


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 145/424 (34%), Positives = 198/424 (46%), Gaps = 47/424 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++PC    CA L      +CS   C Y VSYGDGS + G  +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
               A+ G  FGCG    GLFN    G++GLG    SL+ Q   T  G FSYCL   P +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301

Query: 252 STKINFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDI----VI 303
           +  +  G  G   + PG  +T   P   A T+YV+ +  ISVG Q+L V         V+
Sbjct: 302 AGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           D+GT +T LP    + L S   S + +   P A   G L+ CY+F     V  P V + F
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421

Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L              C  F   G    + I GN+ Q +F V   I+  +V FK
Sbjct: 422 GSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474

Query: 417 PTDC 420
           P+ C
Sbjct: 475 PSSC 478


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 124/353 (35%), Positives = 172/353 (48%), Gaps = 34/353 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP+    V DTGSD+ W QC PC  ++CY Q  P F+P  S+++ SL 
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPXFEPTSSASFTSLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TETVTLGST     +L  I  GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
            NN GLF      +   GG  +S  SQ+    A  FSYCLV     S++ ++F  N  ++
Sbjct: 261 HNNEGLFIGAAGLLGLGGGS-LSFPSQLN---ASSFSYCLVDRDSDSTSTLDF--NSPIT 314

Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTF 311
            P  V+ PL +     TF+ L +  +SVG   L +              I++DSGT +T 
Sbjct: 315 -PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTR 373

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 368
           L     + L             A      + CY  +S S  +VP V+ HF  G ++ L  
Sbjct: 374 LQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFHFANGNELPLPA 433

Query: 369 SNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N+ + V SE   C  F    +++ I GN  Q    VG+D+    V F P  C
Sbjct: 434 KNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 172/354 (48%), Gaps = 30/354 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTPP     V DTGSD++W QC PC    CY Q  P+F+P  S ++  + 
Sbjct: 39  SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 96

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C +  C  L    C+    C Y VSYGDGS++ G   TET+T   T  + VAL     GC
Sbjct: 97  CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 151

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNGI 262
           G +N GLF      ++GLG G +S  SQ   T   KFSYCLV  S+    + + FG N  
Sbjct: 152 GHDNEGLFVGAAG-LLGLGRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSSVVFG-NSA 209

Query: 263 VSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRL-GVSTPD----------IVIDSGTTL 309
           VS     +  LT  +  TFY + +  ISVG   + G++             ++ID GT++
Sbjct: 210 VSRTARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 269

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLS 367
           T L +     L     +   +   A      + CY  +  +  +VP V +HFRGADV L 
Sbjct: 270 TRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGADVSLP 329

Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            SN+ + V      C  F G T+ + I GNI Q  F V YD+    V F P  C
Sbjct: 330 ASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGC 383


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  169 bits (429), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 188/359 (52%), Gaps = 33/359 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +G+P      + DTGS   W QC+PC    C++Q+ P+F+P  S TYK++P
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVP 158

Query: 148 C-----SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C     SS + A+LN+ +CS  +  C Y  SYGD SFS G L+ + +TL  T  Q   L 
Sbjct: 159 CSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQ--TLS 214

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK------ 254
              +GCG +N GLF  +T GI+GL   ++S++SQ+       FSYCL    ST       
Sbjct: 215 SFVYGCGQDNQGLFG-RTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEG 273

Query: 255 -INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
            ++ GT+ +        TPL K     + Y + +++I+V  + LGV+        +IDSG
Sbjct: 274 FLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPTIIDSG 333

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQV-PEVTIHFR-G 361
           T +T LP    + L +   +++  +    P  S L+ C+  S   +S+V P++ I F+ G
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGG 393

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           AD++L   N  V++   I C    G ++S+ I GN  Q    V YD+    V F P  C
Sbjct: 394 ADLQLKGHNSLVELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 133/394 (33%), Positives = 198/394 (50%), Gaps = 60/394 (15%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
           SSS   QA +      Y + IS+GTPP +   + DTGS+LIW QC PC  ++C+ +   +
Sbjct: 75  SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
           P+  P  SST+  LPC+ S C  L      ++C+    C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T+G  T      P + FGC T NG      ++GIVGLG G +SL+SQ+     G+FSYCL
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL---AVGRFSYCL 240

Query: 248 ----VPVSSTKINFGTNGIVSGPGVV-STPLTK-----AKTFYVLTIDAISVGNQRLGVS 297
                   ++ I FG+   ++   VV STPL K       T Y + +  I+V +  L V+
Sbjct: 241 RSDMADGGASPILFGSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVT 300

Query: 298 TPDI-----------VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLEL 342
                          ++DSGTTLT+L + GY        S M+++ +  P +     L+L
Sbjct: 301 GSTFGFTQTGLGGGTIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDL 360

Query: 343 CYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSED------IVCSVFKGITNS 390
           CY  ++       +VP + + F  GA   +   N+F  V  D      + C +    T+ 
Sbjct: 361 CYKPSAGGGGKAVRVPRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDD 420

Query: 391 VP--IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +P  I GN+MQ +  + YDI+    SF P DC K
Sbjct: 421 LPISIIGNLMQMDMHLLYDIDGGMFSFAPADCAK 454


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 129/441 (29%), Positives = 203/441 (46%), Gaps = 64/441 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
             + ++HRD+   P   +    + R R A         H  Q  S+ S+ A+ AD++   
Sbjct: 30  LHIPVVHRDAVFPPRRGAPPGSF-RCRHAAP-------HTAQLESLHSATAA-ADLLRSP 80

Query: 87  -------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
                  ++  Y   I +G PPT  L V DTGSDLIW QC PC   +CY Q +PL+DP+ 
Sbjct: 81  VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPC--RRCYRQVTPLYDPRN 138

Query: 140 SSTYKSLPCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           S T++ +PC+S QC   L    C      C Y V YGDGS S+G+LAT+T+ L   T   
Sbjct: 139 SKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT--- 195

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL------VPV 250
             +  +T GCG +N GL  S   G++G G G +S  +Q+       FSYCL         
Sbjct: 196 -RVHNVTLGCGHDNEGLLAS-AAGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARN 253

Query: 251 SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------G 295
           SS+ + FG    +  P    TPL    +  + Y + +   SVG +R+             
Sbjct: 254 SSSYLVFGRTPEL--PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPA 311

Query: 296 VSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS-- 350
                +V+DSGT ++ F    Y +   + +S    A  + + +     + CY  +     
Sbjct: 312 TGRGGVVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPG 371

Query: 351 ---QVPEVTIHF-RGADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNF 402
              +VP + +HF   AD+ L ++N+ + V         C   +   + + + GN+ Q  F
Sbjct: 372 TGVRVPSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGF 431

Query: 403 LVGYDIEQQTVSFKPTDCTKQ 423
            V +D+E+  + F P  C+ +
Sbjct: 432 GVVFDVERGRIGFTPNGCSGE 452


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  169 bits (428), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 177/373 (47%), Gaps = 46/373 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           ++  Y   I++G PPT  L V DTGSDLIW QC PC    CY Q +PL+DP+ SST++ +
Sbjct: 84  DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPC--RHCYRQVTPLYDPRSSSTHRRI 141

Query: 147 PCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC+S +C   L    C      C Y V YGDGS S+G+LAT+ +     T     +  +T
Sbjct: 142 PCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT----HVHNVT 197

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
            GCG +N GL  S   G++G+G G +S  +Q+       FSYCL    S   N G++ +V
Sbjct: 198 LGCGHDNVGLLES-AAGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQN-GSSYLV 255

Query: 264 SG-----PGVVSTPLT---KAKTFYVLTIDAISVGNQRL------------GVSTPDIVI 303
            G     P    TPL    +  + Y + +   SVG +R+                  IV+
Sbjct: 256 FGRTPEPPSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVV 315

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCYSFN------SLSQVP 353
           DSGT ++   +   + +     S   A    + +A      + CY         +  +VP
Sbjct: 316 DSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVP 375

Query: 354 EVTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
            + +HF  GAD+ L ++N+ + V         C   +   + + + GN+ Q  F + +D+
Sbjct: 376 SIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDV 435

Query: 409 EQQTVSFKPTDCT 421
           E+  + F P  C+
Sbjct: 436 ERGRIGFTPNGCS 448


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 116/351 (33%), Positives = 176/351 (50%), Gaps = 26/351 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q  PLFDP  S+++  + 
Sbjct: 40  SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CSS+ C  +    C+   C+Y VSYGDGS++ G LA ET+T G T  + VA+     GCG
Sbjct: 98  CSSAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAI-----GCG 152

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP- 266
            +N G+F      ++GLGGG +S + Q+       FSYCLV   +    F   G  + P 
Sbjct: 153 HSNRGMFVGAAG-LLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPV 211

Query: 267 GVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLP 313
           G    PL    +A +FY + +  + VG+ R+ VS          +  +V+D+GT +T  P
Sbjct: 212 GAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFP 271

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSN 370
                   +      +  P A      + CY+ F  LS +VP V+ +F G  +  +  +N
Sbjct: 272 TVAYEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANN 331

Query: 371 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F + V +    C  F    + + I GNI Q    +  D   + V F P  C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 168/338 (49%), Gaps = 32/338 (9%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDLIWTQC PC    C  Q +P FD K S+TY++LPC SS+CASL+  SC    C Y
Sbjct: 2   DTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVY 59

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
              YGD + + G LA ET T G+     V    I FGCG+ N G   + ++G+VG G G 
Sbjct: 60  QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118

Query: 229 ISLISQMRTTIAGKFSYCL---VPVSSTKINFG------TNGIVSGPGVVSTPLT---KA 276
           +SL+SQ+  +   +FSYCL   +  + +++ FG      +    SG  V STP       
Sbjct: 119 LSLVSQLGPS---RFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVINPAL 175

Query: 277 KTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
              Y L++ AIS+G + L +           T  ++IDSGT++T+L Q     +   + S
Sbjct: 176 PNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRGLVS 235

Query: 327 MIEAQPVADPTGSLELCYSF----NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
            I    + D    L+ C+ +    N    VP++  HF  A++ L   N+ +  S      
Sbjct: 236 AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIASTTGYLC 295

Query: 383 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +    T    I GN  Q N  + YDI    +SF P  C
Sbjct: 296 LVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 132/408 (32%), Positives = 199/408 (48%), Gaps = 53/408 (12%)

Query: 53  RLRDALTRSLNRLNHFNQNSSI------SSSKASQADIIPNNANYLIRISIGTPPTERLA 106
           +  +A+ R  +R+   +  ++       +SS + QA +      Y + IS+GTP      
Sbjct: 42  KYSEAVRRDSHRIAFLSDATAAGKATTTNSSVSFQALLENGVGGYNMNISVGTPLLTFPV 101

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV 164
           VADTGSDLIWTQC PC  ++C+ Q +P F P  SST+  LPC+SS C  L  + ++C+  
Sbjct: 102 VADTGSDLIWTQCAPC--TKCFQQPAPPFQPASSSTFSKLPCTSSFCQFLPNSIRTCNAT 159

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y+  YG G ++ G LATET+ +G       + P + FGC T NG    + T+GI GL
Sbjct: 160 GCVYNYKYGSG-YTAGYLATETLKVGD-----ASFPSVAFGCSTENG--VGNSTSGIAGL 211

Query: 225 GGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVSTPLTK----AK 277
           G G +SLI Q+     G+FSYCL       ++ I FG+   ++   V STP         
Sbjct: 212 GRGALSLIPQLGV---GRFSYCLRSGSAAGASPILFGSLANLTDGNVQSTPFVNNPAVHP 268

Query: 278 TFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQ-GYNSNLLSVMS 325
           ++Y + +  I+VG   L V+T              ++DSGTTLT+L + GY     + +S
Sbjct: 269 SYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGTTLTYLAKDGYEMVKQAFLS 328

Query: 326 SMIEAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCS 382
                  V + T  L+LC+          VP + + F G   + +   +F  V  D   S
Sbjct: 329 QTANVTTV-NGTRGLDLCFKSTGGGGGIAVPSLVLRFDGG-AEYAVPTYFAGVETDSQGS 386

Query: 383 VFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           V       +P        + GN+MQ +  + YD++    SF P DC K
Sbjct: 387 VTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLDGGIFSFSPADCAK 434


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  168 bits (426), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 134/357 (37%), Positives = 185/357 (51%), Gaps = 42/357 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P  E   V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y+ L 
Sbjct: 148 SGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC +L    C    C Y VSYGDGS++ G+ ATET+T+GST  Q VA+     GCG
Sbjct: 206 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 260

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
            +N GLF     G++GLGGG ++L SQ+ TT    FSYCLV     S++ + FGT+    
Sbjct: 261 HSNEGLF-VGAAGLLGLGGGLLALPSQLNTT---SFSYCLVDRDSDSASTVEFGTS---L 313

Query: 265 GPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
            P  V  PL +     TFY L +  ISVG + L +           +  I+IDSGT +T 
Sbjct: 314 PPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTR 373

Query: 312 LPQG-YNS---NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-V 364
           L  G YNS   + L   S + +A  VA      + CY+ ++ +  +VP V  HF G   +
Sbjct: 374 LQTGIYNSLRDSFLKGTSDLEKAAGVA----MFDTCYNLSAKTTIEVPTVAFHFPGGKML 429

Query: 365 KLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L   N+ + V S    C  F    +S+ I GN+ Q    V +D+    + F    C
Sbjct: 430 ALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 126/354 (35%), Positives = 179/354 (50%), Gaps = 36/354 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P  E   V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y+ L 
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 202

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC +L    C    C Y VSYGDGS++ G+ ATET+T+GST  Q VA+     GCG
Sbjct: 203 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 257

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTNGIVS 264
            +N GLF      +   GG  ++L SQ+ TT    FSYCLV     S++ ++FGT+    
Sbjct: 258 HSNEGLFVGAAGLLGLGGGL-LALPSQLNTT---SFSYCLVDRDSDSASTVDFGTS---L 310

Query: 265 GPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLTF 311
            P  V  PL +     TFY L +  ISVG + L +           +  I+IDSGT +T 
Sbjct: 311 SPDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTR 370

Query: 312 LP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLS 367
           L  + YNS   S +   ++ +  A      + CY+ ++ +  +VP V  HF G   + L 
Sbjct: 371 LQTEIYNSLRDSFVKGTLDLEKAAG-VAMFDTCYNLSAKTTVEVPTVAFHFPGGKMLALP 429

Query: 368 RSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N+ + V S    C  F    +S+ I GN+ Q    V +D+    + F    C
Sbjct: 430 AKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 116/336 (34%), Positives = 172/336 (51%), Gaps = 28/336 (8%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC 161
           + DTGS L W QC+PC    C+ Q  PL+DP +S TYK L C+S +C     A+LN   C
Sbjct: 2   ILDTGSSLSWLQCQPCA-VYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 162 SGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
              +  C Y+ SYGD SFS G L+ + +TL S+      LP  T+GCG +N GLF  +  
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCGQDNQGLFG-RAA 115

Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPL---T 274
           GI+GL    +S+++Q+ T     FSYCL      S+   F + G +S      TP+   +
Sbjct: 116 GIIGLARDKLSMLAQLSTKYGHAFSYCLPTANSGSSGGGFLSIGSISPTSYKFTPMLTDS 175

Query: 275 KAKTFYVLTIDAISVGNQRLGVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
           K  + Y L + AI+V  + L ++        +IDSGT +T LP    + L      ++  
Sbjct: 176 KNPSLYFLRLTAITVSGRPLDLAAAMYRVPTLIDSGTVITRLPMSMYAALRQAFVKIMST 235

Query: 331 QPVADPTGS-LELCY--SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG 386
           +    P  S L+ C+  S  S+S VPE+ + F+ GAD+ L   +  ++  + I C  F G
Sbjct: 236 KYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRAPSILIEADKGITCLAFAG 295

Query: 387 I--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              TN + I GN  Q  + + YD+    + F P  C
Sbjct: 296 SSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 118/367 (32%), Positives = 184/367 (50%), Gaps = 40/367 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  YL+ +++GTPP    A+ DTGSDLIWTQC PC  + C  Q  P+F P  SS+Y+ +
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPC--ASCLPQPDPIFSPGASSSYEPM 157

Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VALPG 201
            C+   C  +   SC   + C Y  SYGDG+ + G  ATE  T  S++       ++ P 
Sbjct: 158 RCAGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP- 216

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFG 258
           + FGCGT N G  N+  +GIVG G   +SL+SQ+      +FSYCL P +S +   + FG
Sbjct: 217 LGFGCGTMNKGSLNNG-SGIVGFGRAPLSLVSQLAIR---RFSYCLTPYASGRKSTLLFG 272

Query: 259 T--NGI--VSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVS------TPD----I 301
           +   G+   +   V +T L +++   TFY +    ++VG +RL +        PD     
Sbjct: 273 SLRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGA 332

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-------VPE 354
           ++DSGT LT  P    + ++    S +     A+ +   +    F + +        VP 
Sbjct: 333 IVDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPR 392

Query: 355 VTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           +  H +GAD+ L R N+ +    +  +C +     +S    GN +Q +  V YD+E  T+
Sbjct: 393 MVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTL 452

Query: 414 SFKPTDC 420
           SF P  C
Sbjct: 453 SFAPAQC 459


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 124/358 (34%), Positives = 175/358 (48%), Gaps = 30/358 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY ++I +GTP      + DTGS L W QC+PC    C++Q  P+F P +S TYK+L 
Sbjct: 104 SGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSVSKTYKALS 162

Query: 148 C-----SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C     SS + ++LN   CS     C Y  SYGD SFS G L+ + +TL   T  A    
Sbjct: 163 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAPSS 219

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
           G  +GCG +N GLF  ++ GI+GL    +S++ Q+       FSYCL    S + N   +
Sbjct: 220 GFVYGCGQDNQGLFG-RSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVS 278

Query: 261 GIVSGPGVVS-------TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSG 306
           G +S             TPL    K  + Y L +  I+V  + LGVS        +IDSG
Sbjct: 279 GFLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPTIIDSG 338

Query: 307 TTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GA 362
           T +T LP   YN+   S +  M +    A     L+ C+  S   +S VPE+ I FR GA
Sbjct: 339 TVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA 398

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            ++L   N  V++ +   C      +N + I GN  Q  F V YD+    + F P  C
Sbjct: 399 GLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 143/430 (33%), Positives = 210/430 (48%), Gaps = 47/430 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-------------RDALTRSLNRLNHFNQNSSI 74
           G  + L H  SP SP    ++ P+  +             R A T S +R     + SS 
Sbjct: 40  GLHLTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPS-SRPTKLRRGSSS 98

Query: 75  SSSKASQADII--PNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           S    S A +   P  +    NY+ R+ +GTP    + V DTGS L W QC PC  S C+
Sbjct: 99  SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVS-CH 157

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNL 182
            Q  P+F+P+ SS+Y S+ CS+ QC     A+LN  +CS  N C Y  SYGD SFS G L
Sbjct: 158 RQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYL 217

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           + +TV+ GST+     +P   +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   
Sbjct: 218 SKDTVSFGSTS-----VPNFYYGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYS 271

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVST 298
           FSYCL   +S+  +   +     PG  S TP+ K+    + Y + +  I+V  + L VS 
Sbjct: 272 FSYCL--PTSSSSSGYLSIGSYNPGQYSYTPMAKSSLDDSLYFIKMTGITVAGKPLSVSA 329

Query: 299 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV 352
                   +IDSGT +T LP    S L   ++  ++  P A     L+ C+    S  +V
Sbjct: 330 SAYSSLPTIIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRV 389

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           P+V++ F  GA +KL  +N  V V     C  F     S  I GN  Q  F V YD++  
Sbjct: 390 PQVSMAFAGGAALKLKATNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNS 448

Query: 412 TVSFKPTDCT 421
            + F    C+
Sbjct: 449 KIGFAAGGCS 458


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  167 bits (424), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 130/422 (30%), Positives = 193/422 (45%), Gaps = 54/422 (12%)

Query: 40  KSPFYNSSETPYQRLRDA-LTRSLNRLNHFNQNSSISSSKASQADIIP------------ 86
           ++  + SS   Y+ L  A L R  +R+        ++ +  +++D+ P            
Sbjct: 83  RTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALET 142

Query: 87  --------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
                    +  Y  R+ IG+PP     V DTGSD+ W QC PC  + CY Q  P+F+P 
Sbjct: 143 PLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPS 200

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            SS+Y  L C + QC SL+   C   +C Y VSYGDGS++ G+ ATET+TL  +     +
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGS----AS 256

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           L  +  GCG +N GLF      +   GG  +S  SQ+    A  FSYCLV     S++ +
Sbjct: 257 LNNVAIGCGHDNEGLFVGAAGLLGLGGGS-LSFPSQIN---ASSFSYCLVNRDTDSASTL 312

Query: 256 NFGT---NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----------IV 302
            F +   +  V+ P + +  L    TFY L +  I VG Q L +              I+
Sbjct: 313 EFNSPIPSHSVTAPLLRNNQL---DTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGII 369

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF- 359
           +DSGT +T L     ++L        +  P        + CY  +S S  +VP V+ HF 
Sbjct: 370 VDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFHFP 429

Query: 360 RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            G  + L   N+ + V S    C  F   T+++ I GN+ Q    V YD+    V F P 
Sbjct: 430 DGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPN 489

Query: 419 DC 420
            C
Sbjct: 490 GC 491


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 118/432 (27%), Positives = 204/432 (47%), Gaps = 70/432 (16%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSK----ASQADIIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS +RL         +SS+     ++A ++     YL+++ +GTP    
Sbjct: 42  TDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCF 101

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC   +CY Q  P+F+P  S++Y  +PC+S  C  L+   C+  
Sbjct: 102 TAAIDTASDLIWTQCQPC--VKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159

Query: 165 N-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
                   CQY+ SYG  + + G LA + + +G    +     G+ FGC +++ G    +
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFR-----GVVFGCSSSSVGGPPPQ 214

Query: 218 TTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---VSSTKINFGTNG---IVSGPGVVST 271
            +G+VGLG G +SL+SQ+      +F YCL P    S+ ++  G +    + +    V  
Sbjct: 215 VSGVVGLGRGALSLVSQLSVR---RFMYCLPPPVSRSAGRLVLGADAAATVRNASERVVV 271

Query: 272 PL---TKAKTFYVLTIDAISVGNQ--------RLGVSTP--------------------- 299
           P+   ++  ++Y L +D IS+G++        R+  +TP                     
Sbjct: 272 PMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTAAGAPASPVSGSGDGDGSG 331

Query: 300 ------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LS 350
                  ++ID  +T+TFL +     ++  +   I     +     L+LC+       +S
Sbjct: 332 TGPDAYGMIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMS 391

Query: 351 QV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
           +V  P V++ F G  ++L +   FV+     +  +  G T+ V I GN  Q N  V Y++
Sbjct: 392 RVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNL 451

Query: 409 EQQTVSFKPTDC 420
            +  ++F  T C
Sbjct: 452 RRGRITFIKTAC 463


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 131/416 (31%), Positives = 195/416 (46%), Gaps = 49/416 (11%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH--------FNQNSSISSSKASQAD 83
           +LIHRDS  SP+Y S++T   R    +  SL RL++        F+ N    +   S ++
Sbjct: 40  KLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPSASE 99

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSST 142
            +     +L+  S+G PP  +LA+ DTGS L+W QC PC    C  Q   P+FDP +SST
Sbjct: 100 PL-----FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPC--KSCSQQIIGPMFDPSISST 152

Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y SL C +  C       C S   C Y+ +Y +G  S G +ATE +  GS+     A+  
Sbjct: 153 YDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNN 212

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           + FGC   NG   + + TG+ GLG G  S+++QM      KFSYC+  ++    ++  N 
Sbjct: 213 VLFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQM----GSKFSYCIGNIADP--DYSYNQ 266

Query: 262 IVSGPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTT 308
           +V   GV     STPL      Y + ++ ISVG  RL +             ++IDSGT 
Sbjct: 267 LVLSEGVNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTA 326

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHF-RGADV 364
            T+L +     L   + ++++         S  LCY       L   P VT HF  GAD+
Sbjct: 327 PTWLAENEYRALEREVRNLLDRFLTPFMRESF-LCYKGKVGQDLVGFPAVTFHFAEGADL 385

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                   V  +E    SV+        + G + Q  + V YD+ +  + F+  DC
Sbjct: 386 --------VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/396 (30%), Positives = 187/396 (47%), Gaps = 30/396 (7%)

Query: 48  ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
           E    RLR    R  +  +   +  S+  +    + +   +  Y  R+ IG+P       
Sbjct: 2   ERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLE 61

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
            DTGSD+ W QC PC  S CY Q  P++DP  SS+Y+ + C S+ C +L+  +C G+ C 
Sbjct: 62  LDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y V YGD S S+G+L  E+  LG  +  + A+  I FGCG +N GLF  +   ++G+GGG
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCGHSNSGLFRGEAG-LLGMGGG 176

Query: 228 DISLISQMRTTIAGKFSYCLVPV------SSTKINFGTNGIVSGPGVVSTPLTK---AKT 278
            +S  SQ+  +I   FSYCLV         S+ + FG   I        TPL K     T
Sbjct: 177 TLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPFAARF--TPLLKNPRIDT 234

Query: 279 FYVLTIDAISVGNQRL----------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
           FY   +  ISVG   L          G  T   ++DSGT++T +     + L     +  
Sbjct: 235 FYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAAS 294

Query: 329 EAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVF 384
              P A     L+ C++F  L   Q+P + +HF    D+ L   N  + V      C  F
Sbjct: 295 RNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAF 354

Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              +  + + GN+ Q  F +G+D+++  ++  P +C
Sbjct: 355 APSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 390


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  166 bits (421), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 130/424 (30%), Positives = 195/424 (45%), Gaps = 44/424 (10%)

Query: 29  FSVELIHRDS-PKSPFYNSSETPYQRLR---DALTRSLNRLNHFNQNSSISSSKASQ--A 82
           +++ L+HRD  P   + N     + R+R   D ++  L R++     SS S  + +   +
Sbjct: 59  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 118

Query: 83  DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           DI+      +  Y +RI +G+PP ++  V D+GSD++W QC+PC    CY Q  P+FDP 
Sbjct: 119 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDPA 176

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S +Y  + C SS C  +    C    C+Y V YGDGS++ G LA ET+T   T  + VA
Sbjct: 177 KSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 236

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           +     GCG  N G+F      ++G+GGG +S + Q+     G F YCLV     S+  +
Sbjct: 237 M-----GCGHRNRGMFIGAAG-LLGIGGGSMSFVGQLSGQTGGAFGYCLVSRGTDSTGSL 290

Query: 256 NFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD------------ 300
            FG   +  G   V  PL    +A +FY + +  + VG  R  +  PD            
Sbjct: 291 VFGREALPVGASWV--PLVRNPRAPSFYYVGLKGLGVGGVR--IPLPDGVFDLTETGDGG 346

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 358
           +V+D+GT +T LP            S     P A      + CY  +     +VP V+ +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406

Query: 359 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F  G  + L   NF + V +    C  F      + I GNI Q    V +D     V F 
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466

Query: 417 PTDC 420
           P  C
Sbjct: 467 PNVC 470


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 133/433 (30%), Positives = 197/433 (45%), Gaps = 49/433 (11%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--ISSSKA 79
           +E  +   SV L+HR  P +     S+ P     + L  S  R N+    +S  ++S+  
Sbjct: 48  LEPSSATLSVPLVHRYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPD 106

Query: 80  SQADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             A  +P       ++  Y++ +  GTP   ++ + DTGSD+ W QC PC  ++CY Q  
Sbjct: 107 DAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKD 166

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           PLFDP  SSTY  + C +  C  L     N  +  G  C Y V YGDGS + G  + ET+
Sbjct: 167 PLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETI 226

Query: 188 TLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
           T           PGIT     FGCG +  G  + K  G++GLGG   SL+ Q  +   G 
Sbjct: 227 TFA---------PGITVKDFHFGCGHDQRGP-SDKFDGLLGLGGAPESLVVQTASVYGGA 276

Query: 243 FSYCLVPVSSTKINFGTNGI-----VSGPGVVSTP---LTKAKTFYVLTIDAISVGNQRL 294
           FSYCL P  +++  F   G+      +    V TP   L    T Y++ +  ISVG + L
Sbjct: 277 FSYCL-PALNSEAGFLALGVRPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPL 335

Query: 295 GVSTP----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
            +        ++IDSGT +T LP+   + L + +     A P+   +   + CY+F   S
Sbjct: 336 DIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMV-ASEDFDTCYNFTGYS 394

Query: 351 Q--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
              VP V + F  GA + L   N  +   +D +     G    + I GN+ Q    V YD
Sbjct: 395 NVTVPRVALTFSGGATIDLDVPNGILV--KDCLAFRESGPDVGLGIIGNVNQRTLEVLYD 452

Query: 408 IEQQTVSFKPTDC 420
                V F+   C
Sbjct: 453 AGHGKVGFRAGAC 465


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  166 bits (420), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 176/357 (49%), Gaps = 32/357 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P        DTGSD+ W QC PC  S CY Q  P++DP  SS+Y+ + 
Sbjct: 9   SGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVY 66

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S+ C +L+  +C G+ C Y V YGD S S+G+L  E+  LG  +  + A+  I FGCG
Sbjct: 67  CGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCG 124

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------SSTKINFGTNG 261
            +N GLF  +   ++G+GGG +S  SQ+  +I   FSYCLV         S+ + FG   
Sbjct: 125 HSNSGLFRGEAG-LLGMGGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTA 183

Query: 262 IVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGTT 308
           I        TPL K     TFY   +  ISVG   L          G  T   ++DSGT+
Sbjct: 184 IPF--AARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTS 241

Query: 309 LT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADV 364
           +T  +P  Y + L     +     P A     L+ C++F  L   Q+P + +HF  G D+
Sbjct: 242 VTRVVPPAY-AVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQIPSLVLHFDNGVDM 300

Query: 365 KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L   N  + V      C  F   +  + + GN+ Q  F +G+D+++  ++  P +C
Sbjct: 301 VLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAPREC 357


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 113/308 (36%), Positives = 153/308 (49%), Gaps = 34/308 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFG 258
           FGCG  N G+F S  TGI G G G +SL SQ++    G FS+C   V+  K     ++  
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTAVNGLKPSTVLLDLP 252

Query: 259 TNGIVSGPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDS 305
            +   SG G V STPL +     TFY L++  I+VG+ RL V          T   +IDS
Sbjct: 253 ADLYKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDS 312

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGAD 363
           GT +T LP      +    ++ ++   V+  T     C S    +   VP++ +HF GA 
Sbjct: 313 GTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGAT 372

Query: 364 VKLSRSNF 371
           + L R N+
Sbjct: 373 MDLPRENY 380


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 177/355 (49%), Gaps = 34/355 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ +G+P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y S+
Sbjct: 159 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 216

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C + +C  L+  +C  S   C Y V+YGDGS++ G+ ATET+TLG +      +  +  
Sbjct: 217 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 272

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--T 259
           GCG +N GLF      ++ LGGG +S  SQ+  T    FSYCLV     SS+ + FG   
Sbjct: 273 GCGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---TFSYCLVDRDSPSSSTLQFGDAA 328

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL----------GVSTPDIVIDSGTTL 309
           +  V+ P ++ +P T   TFY + +  ISVG Q L          G     +++DSGT +
Sbjct: 329 DAEVTAP-LIRSPRT--STFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTAV 385

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKL 366
           T L     + L        ++ P        + CY  +  +  +VP V++ F  G +++L
Sbjct: 386 TRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRL 445

Query: 367 SRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              N+ + V      C  F     +V I GN+ Q    V +D  + TV F    C
Sbjct: 446 PAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSNKC 500


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  166 bits (419), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 141/412 (34%), Positives = 200/412 (48%), Gaps = 47/412 (11%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRS-LNRLNHFNQNSSIS---SSKASQADIIPNNA 89
           +HRDS +     +  T  Q + + +++S L  L    Q   +S   SS  SQ      + 
Sbjct: 106 LHRDSSR---VQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTPVSSGTSQG-----SG 157

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y  R+ +G P      V DTGSD+ W QC+PC  S CY Q  P+F P  SS+Y  L C 
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPC--SDCYQQSDPIFTPAASSSYSPLTCD 215

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGT 208
           S QC SL   SC    C+Y V+YGDGSF+ G+  TET++  GS T  ++AL     GCG 
Sbjct: 216 SQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL-----GCGH 270

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSG 265
           +N GLF      ++GLGGG +SL SQ++ T    FSYCLV     +S+ ++F  N    G
Sbjct: 271 DNEGLFVGAAG-LLGLGGGPLSLTSQLKAT---SFSYCLVNRDSAASSTLDF--NSAPVG 324

Query: 266 PGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFL- 312
             V++  L  +K  TFY + +  +SVG + L +              +++D GT +T L 
Sbjct: 325 DSVIAPLLKSSKIDTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAITRLQ 384

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRS 369
            + YNS L     SM             + CY  +  S  +VP V+ HF G     L  +
Sbjct: 385 SEAYNS-LRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSWDLPAA 443

Query: 370 NFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           N+ + V S    C  F   T+S+ I GN+ Q    V +D+    V F    C
Sbjct: 444 NYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G FSYCL  +   +S  +
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 303

Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
           +FG +  V  +   V  TPL    + ++FY+L +   S+G   L  S+    I+IDSGT 
Sbjct: 304 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 363

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
           +T LP      +           P A     L+ C++  S     +P + + F+G    +
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +     +C
Sbjct: 424 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 482


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  165 bits (418), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 130/436 (29%), Positives = 194/436 (44%), Gaps = 52/436 (11%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL-TRSLNRLNHFNQNSSISSSKASQ 81
             + G   +E+  R               Q + D L  RS+   NH  + +S S    S 
Sbjct: 48  RKEKGAIILEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQ--NHIRKRTSSSQIADSS 105

Query: 82  ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              +P          NY++ + +G+       + DTGSDL W QCEPC    CY Q+ PL
Sbjct: 106 ETQVPLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPC--RSCYNQNGPL 161

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTL 189
           F P  S +Y+ + C+S+ C SL   +C     +   C Y V+YGDGS+++G L  E +  
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF 221

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
           G      +++    FGCG NN GLF    +G++GLG  ++S+ISQ   T  G FSYCL  
Sbjct: 222 G-----GISVSNFVFGCGRNNKGLFGG-ASGLMGLGRSELSMISQTNATFGGVFSYCL-- 273

Query: 250 VSSTKINFGTNGIVSG--PGVVS--TPLTKAK--------TFYVLTIDAISVGNQRLGVS 297
             ST     +  +V G   GV    TP+   +         FY+L +  I VG   L V 
Sbjct: 274 -PSTDQAGASGSLVMGNQSGVFKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQ 332

Query: 298 TPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
                   +++DSGT ++ L       L +         P A     L+ C++     QV
Sbjct: 333 ASSFGNGGVILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQV 392

Query: 353 --PEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGNIMQTNFLVG 405
             P ++++F G A++ +  +  F  V ED   VC     +++   + I GN  Q N  V 
Sbjct: 393 NIPTISMYFEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVL 452

Query: 406 YDIEQQTVSFKPTDCT 421
           YD +   V F    CT
Sbjct: 453 YDAKLSQVGFAKEPCT 468


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 86  NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 141

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 142 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G FSYCL  +   +S  +
Sbjct: 197 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 255

Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
           +FG +  V  +   V  TPL    + ++FY+L +   S+G   L  S+    I+IDSGT 
Sbjct: 256 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 315

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
           +T LP      +           P A     L+ C++  S     +P + + F+G    +
Sbjct: 316 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 375

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +     +C
Sbjct: 376 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVGENC 434


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  165 bits (417), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 181/359 (50%), Gaps = 38/359 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI 255
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G FSYCL  +   +S  +
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNGVFSYCLPSLEDGASGSL 303

Query: 256 NFGTNGIV--SGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVST--PDIVIDSGTT 308
           +FG +  V  +   V  TPL    + ++FY+L +   S+G   L  S+    I+IDSGT 
Sbjct: 304 SFGNDSSVYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELKSSSFGRGILIDSGTV 363

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRG---AD 363
           +T LP      +           P A     L+ C++  S     +P + + F+G    +
Sbjct: 364 ITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMIFQGNAELE 423

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +     +C
Sbjct: 424 VDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVGENC 482


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 132/396 (33%), Positives = 190/396 (47%), Gaps = 41/396 (10%)

Query: 56  DALTRSL-NRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
           D   RS+ NR+     + ++ +S+      + I     NY++ + +G+  T    + DTG
Sbjct: 26  DLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTG 83

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN- 165
           SDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS C SL     N  +C G N 
Sbjct: 84  SDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGAC-GSNP 140

Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
             C Y V+YGDGS++NG L  E ++ G      V++    FGCG NN GLF    +G++G
Sbjct: 141 STCNYVVNYGDGSYTNGELGVEQLSFG-----GVSVSDFVFGCGRNNKGLFGG-VSGLMG 194

Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK------ 277
           LG   +SL+SQ   T  G FSYCL    S        G  S      TP+T  +      
Sbjct: 195 LGRSYLSLVSQTNATFGGVFSYCLPTTESGASGSLVMGNESSVFKNVTPITYTRMLPNPQ 254

Query: 278 --TFYVLTIDAISVGNQRLGV---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 332
              FY+L +  I V    L V       ++IDSGT +T LP      L ++        P
Sbjct: 255 LSNFYILNLTGIDVDGVALQVPSFGNGGVLIDSGTVITRLPSSVYKALKALFLKQFTGFP 314

Query: 333 VADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGI 387
            A     L+ C++     +V  P +++HF G A++K+  +  F  V ED   VC     +
Sbjct: 315 SAPGFSILDTCFNLTGYDEVSIPTISMHFEGNAELKVDATGTFYVVKEDASQVCLALASL 374

Query: 388 TNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +++    I GN  Q N  V YD +Q  V F    C+
Sbjct: 375 SDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEESCS 410


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  164 bits (416), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 127/407 (31%), Positives = 186/407 (45%), Gaps = 27/407 (6%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
           S  LIH  S  SPF   + T    + + +    NRL    + S  S   A +   +   +
Sbjct: 53  SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGS 112

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y+I++  GTP      + DTGSD+ W  C+ C   Q     +P+FDP  SS+YK   C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
            S  C  ++        CQ+ V YGDG+  +G LA++ +TLGS       LP  +FGC  
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCAE 224

Query: 209 N-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSG 265
           + +   ++S     +G G   +   +       G FSYCL     SS  +  G    VS 
Sbjct: 225 SLSEDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAVSS 284

Query: 266 PGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFL-PQG 315
             +  T L K     TFY +T+ AISVGN R+ V   +I      +IDSGTT+T+L P  
Sbjct: 285 SSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSA 344

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV 373
           Y     +    +   QP   P   ++ CY  +S S  VP +T+H  R  D+ L + N  +
Sbjct: 345 YKDLRDAFRQQLSSLQPT--PVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILI 402

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                + C  F   T+S  I GN+ Q N+ + +D+    V F    C
Sbjct: 403 TQESGLSCLAFSS-TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 133/440 (30%), Positives = 215/440 (48%), Gaps = 67/440 (15%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYN--SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           +A+     +E +HR + +S      +S +P + L + +  ++                  
Sbjct: 97  KAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATV------------------ 138

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YLI + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 139 ESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 196

Query: 141 STYKSLPCSSSQCASL----NQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           S+Y+++ C   +C  +      ++C   +  +C Y   YGD S + G+LA E+ T+  T 
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256

Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
            G +  + G+ FGCG  N GLF+     ++GLG G +S  SQ+R      FSYCLV   S
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCLVEHGS 315

Query: 253 ---TKINFGTNGIV-SGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVS--TPDI- 301
              +K+ FG + +V + P +  T      + A TFY + +  + VG   L +S  T D+ 
Sbjct: 316 DAGSKVVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVG 375

Query: 302 -------VIDSGTTLT-FLPQGYN------SNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
                  +IDSGTTL+ F+   Y        +L+S +  +I   PV +P      CY+ +
Sbjct: 376 KDGSGGTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRLYPLIPDFPVLNP------CYNVS 429

Query: 348 SLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNF 402
            +   +VPE+++ F  GA       N+FV++  D I+C   +G   + + I GN  Q NF
Sbjct: 430 GVERPEVPELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNF 489

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            V YD++   + F P  C +
Sbjct: 490 HVVYDLQNNRLGFAPRRCAE 509


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  164 bits (415), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 117/383 (30%), Positives = 181/383 (47%), Gaps = 61/383 (15%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC------PPSQCYMQDSPLFDPKMSS 141
           +  Y + I +GTPP   L VADTGSDL+W +C  C      PPS  ++       P+ SS
Sbjct: 85  SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL-------PRHSS 137

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           ++    C    C  L        N       C++  SY DGS S+G  + ET TL S +G
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197

Query: 195 QAVALPGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
             + L G++FGCG        +G  FN    G++GLG G IS  SQ+      KFSYCL+
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFNG-ARGVMGLGRGSISFSSQLGRRFGNKFSYCLM 256

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--------------KTFYVLTIDAISVGNQRL 294
             + +     T+ ++ G G+ S PLT A               TFY +TI +I++   +L
Sbjct: 257 DYTLSPPP--TSFLMIGGGLHSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKL 314

Query: 295 GVSTPDI-----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
            ++ P +           V+DSGTTLT+L +     +L  +   ++    A+ T   +LC
Sbjct: 315 PIN-PAVWEIDEQGNGGTVVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLC 373

Query: 344 YSFNSLSQVPEV-TIHFR---GADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNI 397
            + +  S+ P +  + FR   GA       N+F++  E ++C   + +   N   + GN+
Sbjct: 374 VNASGESRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNL 433

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
           MQ  FL+ +D E+  + F    C
Sbjct: 434 MQQGFLLEFDKEESRLGFTRRGC 456


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 161/347 (46%), Gaps = 28/347 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y++ +S+GTP   +    DTGSD+ W QC+PC    C  Q   LFDP  SSTY ++PC 
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
           +  C+ L   +  CSG  C Y VSYGDGS + G   ++T+ L  G+T G         FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG    G+F +   G++ LG   +SL SQ      G FSYCL    S        G  S 
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSA 314

Query: 266 PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNS 318
            G  +T L     A TFY++ +  ISVG Q++ V         V+D+GT +T LP    +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374

Query: 319 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
            L S     I     P A   G L+ CY F+    V  P V + F  GA + L       
Sbjct: 375 ALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +S   +     G      I GN+ Q +F V +D    TV F P  C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  164 bits (415), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 145/460 (31%), Positives = 216/460 (46%), Gaps = 72/460 (15%)

Query: 8   VFILFFLCFYVVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
           VF+L  LCF       + TG G  ++L H D        +  T  +R+R A+  S  RL 
Sbjct: 6   VFLLVLLCFRASLVTSSSTGAGLRMKLTHVDD------KAGYTTEERVRRAVAVSRERLA 59

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPS 125
           +  Q   + +S    A +      Y+    IG PP    A+ DTGS+LIWTQC   C   
Sbjct: 60  YTQQQQQLRASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLK 119

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGN 181
            C  QD P ++   SST+ ++PC+ S   CA+     C G++  C ++ SYG GS   G+
Sbjct: 120 ACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLC-GLDGSCTFAASYGAGSV-FGS 177

Query: 182 LATETVTLGSTTGQAVALPGITFGC--------GTNNGGLFNSKTTGIVGLGGGDISLIS 233
           L TE  T  S   +      + FGC        G  NG       +G++GLG G +SL+S
Sbjct: 178 LGTEAFTFQSGAAK------LGFGCVSLTRITKGALNG------ASGLIGLGRGRLSLVS 225

Query: 234 QMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPG--VVSTPLTKA------KTFY 280
           Q   T A KFSYCL P      +S+ +  G +  +SG G  V S P  K+       TFY
Sbjct: 226 Q---TGATKFSYCLTPYLRNHGASSHLFVGASASLSGGGGAVTSIPFVKSPEDYPYSTFY 282

Query: 281 VLTIDAISVGNQRL--------------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
            L +  ISVG  +L              G  +  ++ID+G+ +T L +   S L   ++ 
Sbjct: 283 YLPLVGISVGETKLPIPSAAFELRRVAAGYWSGGVIIDTGSPVTSLAEAAYSALSDEVAR 342

Query: 327 MIE---AQPVADPTGSLELCYSFNSLSQ-VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC 381
            +     QP AD TG L+LC +   + + VP +  HF  GAD+ +S  +++  V +   C
Sbjct: 343 QLNRSLVQPPAD-TG-LDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTAC 400

Query: 382 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + +       I GN  Q +  + YDI +  +SF+  DC+
Sbjct: 401 MLIEEGGYETVI-GNFQQQDVHLLYDIGKGELSFQTADCS 439


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 137/450 (30%), Positives = 209/450 (46%), Gaps = 58/450 (12%)

Query: 14  LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           L FY+ + I + T         + +LIHR+S   P Y+ +ET   R +   T S+ R + 
Sbjct: 17  LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76

Query: 68  FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               S I   K+    +++ +IP N  + +L+ +SIG+PP  +L V DTGS L+W QC P
Sbjct: 77  LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
           C    C+ Q +  FDP  S ++K+L C       +N   C+  N  +Y + Y  G  S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192

Query: 181 NLATETVTLGSTTGQAVALPGITFGCG-----TNNGGLFNSKTTGIVGLGGGDISLISQM 235
            LA E++   +     +    ITFGCG     TNN   +N    G+ GLG      I+ M
Sbjct: 193 ILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYN----GVFGLGA--YPHIT-M 245

Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGN 291
            T +  KFSYC+  +++    +  N +V G G      STPL      Y +T+ +ISVG+
Sbjct: 246 ATQLGNKFSYCIGDINNPL--YTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGS 303

Query: 292 QRLGVS----------TPDIVIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPT 337
           + L +           +  ++IDSG T T L  G        ++ +M  ++E  P     
Sbjct: 304 KTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKF 363

Query: 338 GSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF---KGITNS 390
               LC+       L   P VT HF  GAD+ L   + F +   D  C           +
Sbjct: 364 EG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLN 421

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + + G + Q N+ VG+D+EQ  V F+  DC
Sbjct: 422 LSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 115/351 (32%), Positives = 173/351 (49%), Gaps = 26/351 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q  PLFDP  S+++  + 
Sbjct: 40  SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CSS+ C  ++   C+   C+Y VSYGDGS + G LA ET+TLG T  Q VA+     GCG
Sbjct: 98  CSSAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAI-----GCG 152

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP- 266
             N G+F      ++GLGGG +S + Q+       FSYCLV   +    F   G  + P 
Sbjct: 153 HMNQGMFVGAAG-LLGLGGGSMSFVGQLSRERGNAFSYCLVSRVTNSNGFLEFGSEAMPV 211

Query: 267 GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLP 313
           G    PL +   + ++Y + +  + VG+ ++ +S             +V+D+GT +T  P
Sbjct: 212 GAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVTRFP 271

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSN 370
                             P A      + CY+ F  LS +VP V+ +F G  +  L  +N
Sbjct: 272 TVAYEAFRDAFIDQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANN 331

Query: 371 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F + V +    C  F    + + I GNI Q    +  D   + V F P  C
Sbjct: 332 FLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 133/429 (31%), Positives = 202/429 (47%), Gaps = 63/429 (14%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  R  +       S+ S+   +A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               +  ISIG PP  +L V DTGSD++W  C PC  + C      LFDP MSST+  L 
Sbjct: 98  GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C+      C  +   ++V+Y D S ++G    +TV   +T      +P + F
Sbjct: 156 KTPCDFKGCS-----RCDPI--PFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLF 208

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
           GCG N G   +    GI+GL  G  SL     T I  KFSYC+  ++    N+  + ++ 
Sbjct: 209 GCGHNIGQDTDPGHNGILGLNNGPDSLA----TKIGQKFSYCIGDLADPYYNY--HQLIL 262

Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLT 310
           G G      STP      FY +T++ ISVG +RL ++          T  ++ID+G+T+T
Sbjct: 263 GEGADLEGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322

Query: 311 FLPQGYN-------SNLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTI 357
           FL    +        NLL  S   + IE  P          C+ + S+S+     P VT 
Sbjct: 323 FLVDSVHRLLSKEVRNLLGWSFRQTTIEKSPWMQ-------CF-YGSISRDLVGFPVVTF 374

Query: 358 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGIT----NSVP-IYGNIMQTNFLVGYDIEQQ 411
           HF  GAD+ L   +FF ++++++ C     ++     S P + G + Q ++ VGYD+  Q
Sbjct: 375 HFADGADLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQ 434

Query: 412 TVSFKPTDC 420
            V F+  DC
Sbjct: 435 FVYFQRIDC 443


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 143/432 (33%), Positives = 205/432 (47%), Gaps = 53/432 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
           FS+ L  R +  +P Y    T  + RL RDA         L RSLN   HF +  SI+ S
Sbjct: 69  FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126

Query: 78  KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
               +   P        + A YL +I +G P      V DTGSD+ W QC+PC     CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDPK SS+Y  L C+S QC  L++ +C+   C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
            G++     ++P +  GCG +N GLF      I   GG  ISL SQ++   A  FSYCLV
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGA-ISLSSQLK---ASSFSYCLV 298

Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLTKAKTFY---VLTIDAISVGNQRLGVSTPD-- 300
            +   SS+ + F +N        +++PL K   F+    + +  ISVG + L +S     
Sbjct: 299 NLDSDSSSTLEFNSNMPSDS---LTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFE 355

Query: 301 --------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
                   I++DSGT ++ LP     +L      +  +   A      + CY+F+  S V
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNV 415

Query: 353 PEVTIHF---RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
              TI F    G  ++L   N+ + + +    C  F    +S+ I G+  Q    V YD+
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475

Query: 409 EQQTVSFKPTDC 420
               V F    C
Sbjct: 476 TNSLVGFSTNKC 487


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/415 (30%), Positives = 197/415 (47%), Gaps = 49/415 (11%)

Query: 50  PYQRLRDALTRSLNRLNHF----NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
           P+     AL+   +RL+ F    +   S+ S   S A     +  Y + + +GTPP + L
Sbjct: 46  PFTTPSQALSFDSHRLSFFFSALHTPQSLKSPVVSGAST--GSGQYFVDLRLGTPPQKLL 103

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCS 162
            VADTGSDL+W +C  C     +   S  F  + S+T+    C  S C  +       C+
Sbjct: 104 LVADTGSDLVWVKCSACRNCTRHTPGS-AFLARHSTTFSPNHCYDSACQLVPLPKHHRCN 162

Query: 163 GVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN------NGG 212
                  C+Y  SYGDGS ++G  + ET TL +++G+   L GI FGC         +G 
Sbjct: 163 HARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGA 222

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKINFGTNGIVSG 265
            FN    G++GLG G ISL SQ+      KFSYCL+       P S   I    N +  G
Sbjct: 223 SFNG-AHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSPTSYLLIGSTQNDVAPG 281

Query: 266 PGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLT 310
              +  TPL     + TFY + I+++SV   +L ++ P +           ++DSGTTLT
Sbjct: 282 KRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPIN-PSVWALDELGNGGTIVDSGTTLT 340

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-KLS 367
           FLP+     +L+V+   +     A+PT   +LC + + +   ++P+++    G  V    
Sbjct: 341 FLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKLGGDSVFSPP 400

Query: 368 RSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N+FV   ED+ C   + +   +   + GN+MQ  FL+ +D ++  + F    C
Sbjct: 401 PRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFSRHGC 455


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/385 (32%), Positives = 180/385 (46%), Gaps = 63/385 (16%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
            YL+ +S+GTPP       DTGSDL+WTQC PC    C+ Q + P+ DP  SST+ ++ C
Sbjct: 93  EYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAAVRC 150

Query: 149 SSSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVA 198
            +  C +L   SC          +C Y   YGD S + G LA++  T G   +  G  V+
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
              +TFGCG  N G+F +  TGI G G G  SL SQ+  T    FSYC   +  +  +  
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVT---SFSYCFTSMFESTSSLV 267

Query: 259 TNGIVSGP-----GVVSTPLTK---AKTFYVLTIDAISVGNQRLGV-------STPDIVI 303
           T G+          V STPL +     + Y L++ AI+VG  R+ +            +I
Sbjct: 268 TLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASAII 327

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGS-LELCYSFNSLS--------- 350
           DSG ++T LP+    ++   + +   AQ   PV+   GS L+LC++  S +         
Sbjct: 328 DSGASITTLPE----DVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWR 383

Query: 351 ----------QVPEVTIHF-RGADVKLSRSNF-FVKVSEDIVCSVFKGIT---NSVPIYG 395
                     +VP +  H   GAD +L R N+ F      ++C V    T   +   + G
Sbjct: 384 WRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIG 443

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           N  Q N  V YD+E   +SF P  C
Sbjct: 444 NYQQQNTHVVYDLENDVLSFAPARC 468


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 161/347 (46%), Gaps = 28/347 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y++ +S+GTP   +    DTGSD+ W QC+PC    C  Q   LFDP  SSTY ++PC 
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
           +  C+ L   +  CSG  C Y VSYGDGS + G   ++T+ L  G+T G         FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG    G+F +   G++ LG   +SL SQ      G FSYCL    S        G  S 
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSA 314

Query: 266 PGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI----VIDSGTTLTFLPQGYNS 318
            G  +T L     A TFY++ +  ISVG Q++ V         V+D+GT +T LP    +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374

Query: 319 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
            L S     I     P A   G L+ CY F+    V  P V + F  GA + L       
Sbjct: 375 ALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +S   +     G      I GN+ Q +F V +D    TV F P  C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 177/354 (50%), Gaps = 34/354 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +G+P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y S+ 
Sbjct: 164 SGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVA 221

Query: 148 CSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C + +C  L+  +C  S   C Y V+YGDGS++ G+ ATET+TLG +      +  +  G
Sbjct: 222 CDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAIG 277

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFG--TN 260
           CG +N GLF      ++ LGGG +S  SQ+  T    FSYCLV     SS+ + FG   +
Sbjct: 278 CGHDNEGLFVGAAG-LLALGGGPLSFPSQISAT---TFSYCLVDRDSPSSSTLQFGDAAD 333

Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTTLT 310
             V+ P ++ +P T   TFY + +  +SVG Q L +              +++DSGT +T
Sbjct: 334 AEVTAP-LIRSPRT--STFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVT 390

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLS 367
            L     + L        ++ P        + CY  +  +  +VP V++ F  G +++L 
Sbjct: 391 RLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLP 450

Query: 368 RSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N+ + V      C  F     +V I GN+ Q    V +D  + TV F    C
Sbjct: 451 AKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 113/344 (32%), Positives = 167/344 (48%), Gaps = 31/344 (9%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+    G + LGGG  SL+ Q  T     FSYCL P +S+ + F   G+        P  
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 336

Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLL 321
           VSTPL   + A TFY + + AI V  + L V     +   VIDS T ++ LP      L 
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALR 396

Query: 322 SVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSED 378
           +   S +     A P   L+ CY F  +  +  P + + F  GA V L  +   +     
Sbjct: 397 AAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLG---- 452

Query: 379 IVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
             C  F    ++ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 453 -SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 138/409 (33%), Positives = 200/409 (48%), Gaps = 31/409 (7%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI--IPN 87
           S  LIH  S  SPF   + T    + + +    NRL  F + +S SS + + A++     
Sbjct: 53  SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRL-RFLKRTSRSSKQDANANVPVRSG 111

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y+I++  GTP      + DTGSD+ W  C+ C   Q     +P+FDP  SS+YK   
Sbjct: 112 SGEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFA 168

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S  C  ++        CQ+ VSYGDG+  +G LA++ +TLGS       LP  +FGC 
Sbjct: 169 CDSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCL--VPVSSTKINFGTNGIV 263
             +     S + G++GLGGG +SL++Q  T     G FSYCL     SS  +  G    V
Sbjct: 224 -ESLSEDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCLPSSSTSSGSLVLGKEAAV 282

Query: 264 SGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFL-P 313
           S   +  T L K     TFY +T+ AISVGN R+ V   +I      +IDSGTT+T L P
Sbjct: 283 SSSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVP 342

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNF 371
             Y +   +    +   QP   P   ++ CY  +S S  VP +T+H  R  D+ L + N 
Sbjct: 343 SAYTALRDAFRQQLSSLQPT--PVEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENI 400

Query: 372 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +     + C  F   T+S  I GN+ Q N+ + +D+    V F    C
Sbjct: 401 LITQESGLACLAFSS-TDSRSIIGNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  163 bits (412), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 136/422 (32%), Positives = 198/422 (46%), Gaps = 38/422 (9%)

Query: 29  FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
            S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  
Sbjct: 60  LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 117

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P
Sbjct: 118 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 176

Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
             S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
                      FGCG  N         G++GLG   ++L SQ   T    FSYCL   SS
Sbjct: 237 N----VFKNFLFGCGQQN-NGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSS 291

Query: 253 TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
           +K      G VS   V  TPL+    +  FY L I  +SVG ++L +     +   VIDS
Sbjct: 292 SKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDS 350

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA- 362
           GT +T L     S L S   +++   P        + CY F+     ++P+V + F+G  
Sbjct: 351 GTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGV 410

Query: 363 DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           ++ +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F P  
Sbjct: 411 EMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGG 470

Query: 420 CT 421
           C+
Sbjct: 471 CS 472


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 138/422 (32%), Positives = 200/422 (47%), Gaps = 38/422 (9%)

Query: 29  FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
            S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  
Sbjct: 48  LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 105

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P
Sbjct: 106 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 164

Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
             S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+
Sbjct: 165 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 224

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
                      FGCG  N GL      G++GLG   ++L SQ   T    FSYCL   SS
Sbjct: 225 N----VFKNFLFGCGQQNNGL-FGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSS 279

Query: 253 TKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDS 305
           +K      G VS   V  TPL+    +  FY L I  +SVG ++L +     +   VIDS
Sbjct: 280 SKGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRKLSIDESAFSAGTVIDS 338

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA- 362
           GT +T L     S L S   +++   P        + CY F+     ++P+V + F+G  
Sbjct: 339 GTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGV 398

Query: 363 DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           ++ +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F P  
Sbjct: 399 EMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGG 458

Query: 420 CT 421
           C+
Sbjct: 459 CS 460


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  162 bits (411), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 137/425 (32%), Positives = 194/425 (45%), Gaps = 50/425 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDAL-TRSLNRLNHFNQNSSISSSKASQADI 84
           G +V L HR  P SP  ++ E     L  RD L  + +      N  S     + S A  
Sbjct: 52  GTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAIT 111

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P       +   Y+I +SIGTP   +  + DTGSD+ W  C     ++     S  FDP
Sbjct: 112 LPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDP 167

Query: 138 KMSSTYKSLPCSSSQCASLNQKS--CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
             SSTY    CSS+ C  L  +   CS    CQY+V YGDGS + G   ++T+ L ST  
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTE- 226

Query: 195 QAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
               +    FGC   +    GL   +T G++GLGGG  SL+SQ   T    FSYCL P +
Sbjct: 227 ---KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCL-PAT 282

Query: 252 STKINFGTNGIVSG-PGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----V 302
           +    F T G  +G  G V+TP+    +A TFY + +  I+VG   + +S P +     +
Sbjct: 283 TRSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAIS-PTVFAAGSI 341

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
           +DSGT +T LP    S L +   + +   P A     L+ C+ F     V  P V + F 
Sbjct: 342 MDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFS 401

Query: 361 GADVKLSRSNFFVKVSEDIV----CSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSF 415
           G  V        V +  D +    C  F   T  +  I GN+ Q  F V +D+ Q  + F
Sbjct: 402 GGAV--------VDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGF 453

Query: 416 KPTDC 420
           +P  C
Sbjct: 454 RPGAC 458


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 118/360 (32%), Positives = 166/360 (46%), Gaps = 34/360 (9%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y   + +GTP  +   V DTGSD+ W QC PC  + CY Q   LF+P  SS++K L C
Sbjct: 14  GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDC 71

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
           SSS C +L+   C    C Y   YGDGSF+ G L T+ V L    G   V L  I  GCG
Sbjct: 72  SSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCG 131

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
            +N G F +   GI+GLG G +S  + +  +    FSYCL P   +  N  +  +     
Sbjct: 132 HDNEGTFGT-AAGILGLGRGPLSFPNNLDASTRNIFSYCL-PDRESDPNHKSTLVFGDAA 189

Query: 268 VVSTPLTKAK-----------TFYVLTIDAISVGNQRLGVSTPDI-----------VIDS 305
           +  T     K           T+Y + I  ISVG   L      +           + DS
Sbjct: 190 IPHTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDS 249

Query: 306 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG- 361
           GTT+T L  + Y +   +  ++ +     AD     + CY F  ++   VP VT HF+G 
Sbjct: 250 GTTITRLEARAYTAVRDAFRAATMHLTSAAD-FKIFDTCYDFTGMNSISVPTVTFHFQGD 308

Query: 362 ADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            D++L  SN+ V VS  +I C  F   +    + GN+ Q +F V YD   + +   P  C
Sbjct: 309 VDMRLPPSNYIVPVSNNNIFCFAFAA-SMGPSVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  162 bits (410), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 174/360 (48%), Gaps = 36/360 (10%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           I   + +Y  RI +GTP      VADTGSD+ W QC PC   +CY Q  P+F+P +SS++
Sbjct: 74  IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 131

Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           K L C+SS C  L  K CS  N C Y VSYGDGSF+ G+ +TET++ G    ++VA+   
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 188

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
             GCG NN GLF+     ++GLG G +S  SQ  T+ A  FSYCL P   + I      +
Sbjct: 189 --GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAI---AASL 241

Query: 263 VSGPGVVST--------PLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
           V GP  V          P  +  T+Y + +  I V    + +           T  +++D
Sbjct: 242 VFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVD 301

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR-G 361
           SGT ++ L     + L     S++   P A      + CY  +S+  + +P V + F  G
Sbjct: 302 SGTAISRLTTPAYTALRDAFRSLVTF-PSAPGISLFDTCYDLSSMKTATLPAVVLDFDGG 360

Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A + L      V V  E   C  F     +  I GN+ Q  F +  D +++ +   P  C
Sbjct: 361 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 420


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  162 bits (409), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 136/421 (32%), Positives = 198/421 (47%), Gaps = 38/421 (9%)

Query: 30  SVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADII 85
           S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  +
Sbjct: 1   SLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATTL 58

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P 
Sbjct: 59  PVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPS 117

Query: 139 MSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+ 
Sbjct: 118 TSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN 177

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
                     FGCG  N         G++GLG   ++L SQ   T    FSYCL   SS+
Sbjct: 178 ----VFKNFLFGCGQQN-NGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 232

Query: 254 KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
           K      G VS   V  TPL+    +  FY L I  +SVG ++L +     +   VIDSG
Sbjct: 233 KGYLSLGGQVS-KSVKFTPLSADFDSTPFYGLDITGLSVGGRQLSIDESAFSAGTVIDSG 291

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-D 363
           T +T L     S L S   +++   P        + CY F+     ++P+V + F+G  +
Sbjct: 292 TVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTFKGGVE 351

Query: 364 VKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F P  C
Sbjct: 352 MDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGFAPGGC 411

Query: 421 T 421
           +
Sbjct: 412 S 412


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  162 bits (409), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 174/360 (48%), Gaps = 36/360 (10%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           I   + +Y  RI +GTP      VADTGSD+ W QC PC   +CY Q  P+F+P +SS++
Sbjct: 7   IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 64

Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           K L C+SS C  L  K CS  N C Y VSYGDGSF+ G+ +TET++ G    ++VA+   
Sbjct: 65  KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 121

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
             GCG NN GLF+     ++GLG G +S  SQ  T+ A  FSYCL P   + I      +
Sbjct: 122 --GCGRNNQGLFHGAAG-LLGLGRGPLSFPSQTGTSYASVFSYCL-PRRESAI---AASL 174

Query: 263 VSGPGVVST--------PLTKAKTFYVLTIDAISVGNQRLGV----------STPDIVID 304
           V GP  V          P  +  T+Y + +  I V    + +           T  +++D
Sbjct: 175 VFGPSAVPEKARFTKLLPNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVD 234

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR-G 361
           SGT ++ L     + L     S++   P A      + CY  +S+  + +P V + F  G
Sbjct: 235 SGTAISRLTTPAYTALRDAFRSLVTF-PSAPGISLFDTCYDLSSMKTATLPAVVLDFDGG 293

Query: 362 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A + L      V V  E   C  F     +  I GN+ Q  F +  D +++ +   P  C
Sbjct: 294 ASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAPDQC 353


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  161 bits (408), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 144/432 (33%), Positives = 204/432 (47%), Gaps = 53/432 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
           FS+ L  R +  +P Y    T  + RL RDA         L RSLN   HF +  SI+ S
Sbjct: 69  FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126

Query: 78  KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
               +   P        + A YL +I +G P      V DTGSD+ W QC+PC     CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDPK SS+Y  L C+S QC  L++ +C+   C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
            G++     ++P +  GCG +N GLF      I   GG  ISL SQ++   A  FSYCLV
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLFAGGAGLIGLGGGA-ISLSSQLK---ASSFSYCLV 298

Query: 249 PV---SSTKINFGTNGIVSGPGVVSTPLTKAKTFY---VLTIDAISVGNQRLGVSTPD-- 300
            +   SS+ + F  N  +    + S PL K   F+    + +  ISVG + L +S     
Sbjct: 299 NLDSDSSSTLEF--NSYMPSDSLTS-PLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFE 355

Query: 301 --------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
                   I++DSGT ++ LP     +L      +  +   A      + CY+F+  S V
Sbjct: 356 IDESGLGGIIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNV 415

Query: 353 PEVTIHF---RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
              TI F    G  ++L   N+ + + +    C  F    +S+ I G+  Q    V YD+
Sbjct: 416 EVPTIAFVLSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDL 475

Query: 409 EQQTVSFKPTDC 420
               V F    C
Sbjct: 476 TNSIVGFSTNKC 487


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  161 bits (408), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 134/415 (32%), Positives = 197/415 (47%), Gaps = 46/415 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 34  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 93

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 94  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G  +++ +TL 
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 212

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL-- 247
             +G  V + G  FGC     G   + KT G++GLGG   S +SQ        F YCL  
Sbjct: 213 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQTAARYGKSFFYCLPA 269

Query: 248 VPVSSTKINFGTNGIVSGPGV---VSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPDI 301
            P SS  +  G      G G     +TP+ ++K   T+Y   ++ I+VG ++LG+S P +
Sbjct: 270 TPASSGFLTLGAPASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLS-PSV 328

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
                ++DSGT +T LP    + L S   + +     A+P G L+ C++F  L +V  P 
Sbjct: 329 FAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPT 388

Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
           V + F G  V    ++  V       C  F    +  +    GN+ Q  F V YD
Sbjct: 389 VALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 439


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  161 bits (407), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 187 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P SS+  +   +   
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 299

Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
             PG  S TP+  +    + Y + +  I V  + L VS+        +IDSGT +T LP 
Sbjct: 300 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 359

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
           G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F G       + N  
Sbjct: 360 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 419

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           V V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 420 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 139/445 (31%), Positives = 207/445 (46%), Gaps = 65/445 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNS------------SISS 76
             V L+HRDS     +  + TP Q L   L R   R     + +             +SS
Sbjct: 61  LHVRLLHRDS-----FAVNATPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSS 115

Query: 77  SKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             A  A ++      +  Y+ +I++GTP  E L   DTGSD+ W QC+PC   +CY Q  
Sbjct: 116 GGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPC--RRCYPQSG 173

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQK---SCSGVNCQYSVSYG-DGSFSNGNLATETVT 188
           P+FDP+ S++Y+ +   +  C +L +        + C Y+V YG DGS + G+   ET+T
Sbjct: 174 PVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLT 233

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--AGKFSYC 246
                   V +P ++ GCG +N GLF +   GI+GLG G IS  SQ+         FSYC
Sbjct: 234 FAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQIAALGYNVTSFSYC 289

Query: 247 LVP---------VSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL 294
           L           VSST +  G       P    TP  +     TFY + +  +SVG  R+
Sbjct: 290 LADFFLSSPGRSVSST-LTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRV 348

Query: 295 GVSTPD------------IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVA--DPTGS 339
              T D            +++DSGT +T L  + Y +   +  ++ ++   V+   P+G 
Sbjct: 349 PGVTEDDLKLDPYTGRGGVILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGF 408

Query: 340 LELCYSFNSLS-QVPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYG 395
            + CY+    + +VP V++HF G  ++ L   N+ + V S   VC  F G  + SV I G
Sbjct: 409 FDTCYTMGGRAMKVPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIG 468

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           NI Q  F V Y+I    V F P  C
Sbjct: 469 NIQQQGFRVVYNIGGGRVGFAPNSC 493


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  161 bits (407), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P SS+  +   +   
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 297

Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
             PG  S TP+  +    + Y + +  I V  + L VS+        +IDSGT +T LP 
Sbjct: 298 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 357

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
           G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F G       + N  
Sbjct: 358 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 417

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           V V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 418 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAGCS 465


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 138/427 (32%), Positives = 203/427 (47%), Gaps = 51/427 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-IP-- 86
           S+ L HR  P +P   SS   +  L + L R   R +H  + +  S    + +D+ IP  
Sbjct: 61  SMPLAHRHGPCAPATTSS---WPSLAERLRRDRARRDHITRKAKASGRTTTLSDVSIPTS 117

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                ++  Y++ + IGTP  ++  + DTGSDL W QC+PC  S CY Q  PL+DP  SS
Sbjct: 118 LGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASS 177

Query: 142 TYKSLPCSSSQCASL----NQKSC---SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
           TY  +PC S  C  L        C   SG + CQY + YG+   + G  +TET+TL    
Sbjct: 178 TYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---- 233

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
              V++    FGCG    G F+     +   G  + SL+SQ   T  G FSYCL P +ST
Sbjct: 234 SPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPE-SLVSQTAETYGGAFSYCLPPGNST 292

Query: 254 ----KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVS----TPDIV 302
                +   TN   +  G + TPL    +  TFY++ +  +SVG + L +     +  ++
Sbjct: 293 TGFLALGAPTNNNDTA-GFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVLSGGMI 351

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--VPEVTIH 358
           IDSGT +T LP    S L +   + + A P+  P     L+ CY+F  ++   VP V + 
Sbjct: 352 IDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVALT 411

Query: 359 FRGADVKLSRSNFFVKVSEDIV---CSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
           F G       +   + V   ++   C  F G  +   V I GN+ Q  F V YD  +  V
Sbjct: 412 FDGG------ATIDLDVPSGVLIQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGHV 465

Query: 414 SFKPTDC 420
            F+P  C
Sbjct: 466 GFRPGAC 472


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 134/421 (31%), Positives = 196/421 (46%), Gaps = 38/421 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-----ISSSKASQADI 84
           S+ L++R  P +P  +++ T      + L R   R NH  + +S     +  S  +    
Sbjct: 57  SMPLMYRHGPCAP-ASAAATNRPSPAEMLRRDRARRNHILRKASGRRITLGVSIPTSLGA 115

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
             ++  Y++ +  GTP   ++ + DTGSDL W QC+PC  S CY Q  P+FDP  SSTY 
Sbjct: 116 FVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYA 175

Query: 145 SLPCSSSQCASLN--------QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            +PC S  C  L+          S SG + CQY + YG+G  + G  +TET+TL      
Sbjct: 176 PVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTLSPEA-- 233

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
           A  +   +FGCG    G+F+     +   G  + SL+SQ   T  G FSYCL   +ST  
Sbjct: 234 ATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPE-SLVSQTTGTYGGAFSYCLPAGNSTAG 292

Query: 256 NFGTNGIVSG----PGVVSTPLTKAK-TFYVLTIDAISVGNQRLGVS----TPDIVIDSG 306
                   +G     G   TPL   + TFY++ +  ISVG ++L +        ++IDSG
Sbjct: 293 FLALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSG 352

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQVPEVTIHFRGA 362
           T +T LP+   S L +   S + A P+  P     L+ CY F  N+   VP V + F G 
Sbjct: 353 TIVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGG 412

Query: 363 ---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
              D+ +  S   +      V     G T    I GN+ Q  F V YD  +  V F+   
Sbjct: 413 VTIDLDVP-SGVLLDGCLAFVAGASDGDTG---IIGNVNQRTFEVLYDSARGHVGFRAGA 468

Query: 420 C 420
           C
Sbjct: 469 C 469


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 126/349 (36%), Positives = 179/349 (51%), Gaps = 26/349 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P SS+  +   +   
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 297

Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
             PG  S TP+  +    + Y + +  I V  + L VS+        +IDSGT +T LP 
Sbjct: 298 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 357

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
           G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F G       + N  
Sbjct: 358 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 417

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           V V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 418 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 465


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 132/435 (30%), Positives = 197/435 (45%), Gaps = 52/435 (11%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           G +++++HR   +S    +    +      L R  NR+   ++  + +   A+    IP 
Sbjct: 59  GNTIQIVHRACLQSGDRKTVPDHHPHYTGILRRDHNRVRSIHRRLTGAGDTAA---TIPA 115

Query: 87  ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
                 ++  Y++ I IGTP      + DTGSDL W QC+PC  S CY Q  PLFDP  S
Sbjct: 116 SLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDS-CYQQQEPLFDPSKS 174

Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           STY  +PC + QC        +C G  C+YSV YGD S + GNLA E  TL  +   A  
Sbjct: 175 STYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA- 233

Query: 199 LPGITFGCGTN-----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSS 252
             G+ FGC         G        G++GLG GD S++SQ R   +G  FSYCL P  S
Sbjct: 234 --GVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGDVFSYCLPPRGS 291

Query: 253 TKINFGTNGIVSGP--GVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTPDI----V 302
           +   + T G  + P   +  TPL    ++  + YV+ +  ISV    L +         V
Sbjct: 292 SA-GYLTIGAAAPPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYIGTV 350

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG---SLELCYSF--NSLSQVPEVTI 357
           IDSGT +T +P      L       +    +  P G   SL+ CY    + +   P V +
Sbjct: 351 IDSGTVITHMPAAAYYVLRDEFRRHMGGYTML-PEGHVESLDTCYDVTGHDVVTAPPVAL 409

Query: 358 HF-RGADVKLSRSNFFVKVSED-------IVCSVFKGITNSVP---IYGNIMQTNFLVGY 406
            F  GA + +  S   +  + D       + C  F  +  ++P   I GN+ Q  + V +
Sbjct: 410 EFGGGARIDVDASGILLVFAVDASGQSLTLACLAF--VPTNLPGFVIIGNMQQRAYNVVF 467

Query: 407 DIEQQTVSFKPTDCT 421
           D+E + + F    C+
Sbjct: 468 DVEGRRIGFGANGCS 482


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  160 bits (404), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 182/377 (48%), Gaps = 46/377 (12%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC+PC    C+ Q+   + PK SSTY+++ C
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISC 226

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVA 198
              +C  ++     + C   N  C Y   Y DGS + G+ A+ET T+  T      +   
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SST 253
           +  + FGCG  N G F    +G++GLG G IS  SQ+++     FSYCL  +      S+
Sbjct: 287 VVDVMFGCGHWNKGFFYG-ASGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTSVSS 345

Query: 254 KINFG------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------- 299
           K+ FG       N  ++   +++   T  +TFY L I +I VG + L +S          
Sbjct: 346 KLIFGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEG 405

Query: 300 -------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQ 351
                    +IDSG+TLTF P      +       I+ Q +A     +  CY+ + ++ Q
Sbjct: 406 AAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIAADDFVMSPCYNVSGAMMQ 465

Query: 352 V--PEVTIHFRGADV-KLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVG 405
           V  P+  IHF    V      N+F +   D ++C ++ K   +S + I GN++Q NF + 
Sbjct: 466 VELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHIL 525

Query: 406 YDIEQQTVSFKPTDCTK 422
           YD+++  + + P  C +
Sbjct: 526 YDVKRSRLGYSPRRCAE 542


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 133/425 (31%), Positives = 197/425 (46%), Gaps = 63/425 (14%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ++LIH +S  SP YNS +T +      + +            + S+   S     P    
Sbjct: 45  IKLIHHESSLSP-YNSKDTIWDHYSHKILKQ-----------TFSNDYISNLVPSPRYVV 92

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           +L+  SIG PP  +LAV DTGS L W  C PC  S C  Q  P+FDP  SSTY +L CS 
Sbjct: 93  FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC--SSCSQQSVPIFDPSKSSTYSNLSCSE 150

Query: 151 -SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-- 207
            ++C  +N +      C YSV Y     S G  A E +TL +     + +P + FGCG  
Sbjct: 151 CNKCDVVNGE------CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRK 204

Query: 208 ---TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
              ++NG  +     G+ GLG G  SL+     +   KFSYC+  + +T  N+  N +V 
Sbjct: 205 FSISSNGYPYQG-INGVFGLGSGRFSLLP----SFGKKFSYCIGNLRNT--NYKFNRLVL 257

Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTL 309
           G        ST L      Y + ++AIS+G ++L +           +   ++IDSG   
Sbjct: 258 GDKANMQGDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADH 317

Query: 310 TFLPQGYNSNLLSV-MSSMIEAQPV---ADPTGSLELCYS---FNSLSQVPEVTIHF-RG 361
           T+L + Y   +LS  + +++E   V    D      LCYS      LS  P VT HF  G
Sbjct: 318 TWLTK-YGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEG 376

Query: 362 ADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           A + L  ++ F++ +E+  C      + F     S    G + Q N+ VGYD+ +  V F
Sbjct: 377 AVLDLDVTSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYF 436

Query: 416 KPTDC 420
           +  DC
Sbjct: 437 QRIDC 441


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  159 bits (403), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/417 (28%), Positives = 192/417 (46%), Gaps = 61/417 (14%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
           T ++ +R A+ RSL+R     +N     +   +A ++P    YL+++ IGTP     A  
Sbjct: 49  TDHELIRRAVQRSLDRPGVAARNRK---AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAI 105

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
           DT SDL+W QC+PC    CY Q  P+F+P++SS+Y  +PCSS  C+ L+   C   +   
Sbjct: 106 DTASDLVWLQCQPC--VSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQA 163

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C+Y+  Y   + +NG LA + + +G     AV L     GC  ++ G    + +G+VGL 
Sbjct: 164 CRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVL-----GCSDSSVGGPPPQASGLVGLA 218

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVSGPGVVSTPL-------TK 275
            G +SL+SQ+      +F YCL P  S    K+  G          VS  +       T+
Sbjct: 219 RGPLSLLSQLSVR---RFMYCLPPPMSRTPGKLVLGAGAGADAVRNVSDRVTVTMSSSTR 275

Query: 276 AKTFYVLTIDAISVGNQ-----RLGVSTP--------------------DIVIDSGTTLT 310
             ++Y L  D ++VG+Q     R   S P                     +++D  +T++
Sbjct: 276 YPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGGGGDGGSGANAYGMIVDVASTIS 335

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNS-----LSQVPEVTIHFRGAD 363
           FL       L   +   I   P A P+    L+LC+            VP V++ F G  
Sbjct: 336 FLEASLYDELADDLEEEIRL-PRATPSTRLGLDLCFILPEGVGIDRVYVPTVSMSFDGRW 394

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           ++L R   F++    ++C +  G T+ V I GN  Q N  V Y++ +  ++F    C
Sbjct: 395 LELERDRLFLEDGR-MMC-LMIGRTSGVSILGNYQQQNMHVLYNLRRGKITFAKASC 449


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 140/442 (31%), Positives = 203/442 (45%), Gaps = 59/442 (13%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR-DA-----LTRSLNRLNHFNQNSSISS 76
            A++G   +EL H  S  S   + +E  +  L  DA     L R +        + + S+
Sbjct: 35  RAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94

Query: 77  SKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           SK +Q  +         NY+  + IG    E   + DT S+L W QCEPC    C+ Q  
Sbjct: 95  SKLAQVPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPC--DACHDQQE 150

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL------NQKSCSG--VNCQYSVSYGDGSFSNGNLAT 184
           PLFDP  S +Y ++PC+SS C +L      + ++C      C Y++SY DGS+S G LA 
Sbjct: 151 PLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAH 210

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           + ++L     Q     G  FGCGT+N G F   T+G++GLG   +SLISQ      G FS
Sbjct: 211 DRLSLAGEDIQ-----GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFS 264

Query: 245 YCLVPV---SSTKINFGTNGIV---SGP----GVVSTPLTKAKTFYVLTIDAISVGNQRL 294
           YCL P    SS  +  G +  V   S P     +VS PL     FY+  +  I+VG +  
Sbjct: 265 YCLPPKESGSSGSLVLGDDASVYRNSTPIVYTAMVSDPLQ--GPFYLANLTGITVGGED- 321

Query: 295 GVSTP--------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
            V +P          ++DSGT +T  +P  Y +     +S + E  P A P   L+ C+ 
Sbjct: 322 -VQSPGFSAGGGGKAIVDSGTIITSLVPSVYAAVRAEFVSQLAE-YPQAAPFSILDTCFD 379

Query: 346 FNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIM 398
              L   QVP + + F  GA+V++        V+ D   VC     + +    PI GN  
Sbjct: 380 LTGLREVQVPSLKLVFDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQ 439

Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
           Q N  V +D     + F    C
Sbjct: 440 QKNLRVIFDTVGSQIGFAQETC 461


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 103/287 (35%), Positives = 144/287 (50%), Gaps = 26/287 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ +++GTPP       DTGSDL+WTQC PC    C+ Q  PL DP  SSTY +LPC 
Sbjct: 85  EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCG 142

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-----TGQAVALPGITF 204
           + +C +L   SC G +C Y   YGD S + G +AT+  T G        G   A   +TF
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTF 202

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
           GCG  N G+F S  TGI G G G  SL SQ+  T    FSYC   +  +K +  T G   
Sbjct: 203 GCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNAT---SFSYCFTSMFDSKSSIVTLGGAP 259

Query: 265 GP--------GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---VIDSGTTLT 310
                      V +TPL K     + Y L++  ISVG  RL V        +IDSG ++T
Sbjct: 260 AALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSGASIT 319

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEV 355
            LP+     + +  ++ +   P      +L++C++   ++L + P V
Sbjct: 320 TLPEEVYEAVKAEFAAQVGLPPSGVEGSALDVCFALPVSALWRRPAV 366


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  159 bits (402), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 125/349 (35%), Positives = 179/349 (51%), Gaps = 26/349 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+L+  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 187 AQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++   FSYCL P SS+  +   +   
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSMGYSFSYCL-PTSSSSSSGYLSIGS 299

Query: 264 SGPGVVS-TPLTKA---KTFYVLTIDAISVGNQRLGVSTPD-----IVIDSGTTLTFLPQ 314
             PG  S TP+  +    + Y + +  I V  + L VS+        +IDSGT +T LP 
Sbjct: 300 YNPGQYSYTPMASSSLDDSLYFIKMTGIKVAGKPLSVSSSAYSSLPTIIDSGTVITRLPT 359

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRGADVKLSRS-NFF 372
           G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F G       + N  
Sbjct: 360 GVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFAGGAALKLAARNLL 419

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           V V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 420 VDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGGCS 467


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 134/432 (31%), Positives = 190/432 (43%), Gaps = 53/432 (12%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN---------HFNQNSS 73
           ++ + G +V L HR  P SP   S +       + L R   R N         H+ +   
Sbjct: 52  DSSSSGATVPLNHRHGPCSPV-PSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGG 110

Query: 74  ISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +  S+A+       + N   Y+I +SIG+P        DTGSD+ W +C+          
Sbjct: 111 LQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------- 160

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETV 187
            S L+DP  SSTY    CS+  CA L ++     SG  C YSV YGDGS + G   ++T+
Sbjct: 161 -SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTL 219

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           TL  T+   ++  G  FGC     G     T G++GLGG   S +SQ   T    FSYCL
Sbjct: 220 TLAGTSEPLIS--GFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAATYGSAFSYCL 277

Query: 248 VPV--SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRL----GVST 298
            P   SS  +  G     +     +TP+ ++K   TFY L +  ISVG + L     V +
Sbjct: 278 PPTWNSSGFLTLGAPSSSTSAAFSTTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVFS 337

Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSFNSLSQ---- 351
              ++DSGT +T L P  Y +   +    M   + QP A P G L+ C+ F    +    
Sbjct: 338 AGSIVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA-PRGLLDTCFDFTGHGEGNNF 396

Query: 352 -VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDI 408
            VP V +   G  V     N  V+      C  F    +     I GN+ Q  F V YD+
Sbjct: 397 TVPSVALVLDGGAVVDLHPNGIVQDG----CLAFAATDDDGRTGIIGNVQQRTFEVLYDV 452

Query: 409 EQQTVSFKPTDC 420
            Q    F+P  C
Sbjct: 453 GQSVFGFRPGAC 464


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 113/333 (33%), Positives = 161/333 (48%), Gaps = 29/333 (8%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-- 164
           V DT SD+ W QC PCP  QC++Q  PL+DP  SST+  +PC S  C  L     +G   
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231

Query: 165 ---NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
               C+Y V+YGDG  + G   T+T+T+  T    + +    FGC     G F+++  GI
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGSFSNQNAGI 287

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS-TPLTK---AK 277
           + LGGG  SL+ Q        FSYC +P  S+       G V      S TPL K   A 
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYC-IPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAP 346

Query: 278 TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQP 332
           TFY++ ++AI V  ++L V         V+DSG  +T L PQ Y +   +  S+M    P
Sbjct: 347 TFYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGP 406

Query: 333 VADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-- 387
           +A P  +L+ CY F      +VP+V++ F  GA + L  ++  +       C  F     
Sbjct: 407 LAAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIILD-----GCLAFAATPG 461

Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             SV   GN+ Q  + V YD+    V F+   C
Sbjct: 462 EESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  159 bits (401), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 133/417 (31%), Positives = 190/417 (45%), Gaps = 45/417 (10%)

Query: 38  SPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NY 91
           S K   +N        L D   RS+   N   + +S  + +ASQ  I  ++       NY
Sbjct: 8   SEKKIDWNRRLQKQLILDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNY 65

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + +G+       + DTGSDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS
Sbjct: 66  IVTMGLGSK--NMTVIIDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 152 QCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            C SL     N  +C   N   C Y V+YGDGS++NG L  E ++ G      V++    
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSFG-----GVSVSDFV 176

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           FGCG NN GLF    +G++GLG   +SL+SQ   T  G FSYCL    +        G  
Sbjct: 177 FGCGRNNKGLFGG-VSGLMGLGRSYLSLVSQTNATFGGVFSYCLPTTEAGSSGSLVMGNE 235

Query: 264 SGPGVVSTPLTKAK--------TFYVLTIDAISVGNQRLGV----STPDIVIDSGTTLTF 311
           S     + P+T  +         FY+L +  I VG   L          I+IDSGT +T 
Sbjct: 236 SSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALKAPLSFGNGGILIDSGTVITR 295

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSR 368
           LP      L +         P A     L+ C++     +V  P +++ F G A + +  
Sbjct: 296 LPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLRFEGNAQLNVDA 355

Query: 369 SNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +  F  V ED   VC     ++++    I GN  Q N  V YD +Q  V F    C+
Sbjct: 356 TGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKVGFAEEPCS 412


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 125/425 (29%), Positives = 198/425 (46%), Gaps = 59/425 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNR----LNHFNQNSSISSSKASQ-- 81
           +  +L HRD+      N  +T ++ R    + R + R    LN  N+N+    +  +   
Sbjct: 58  WKTKLFHRDN-----INLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEA 112

Query: 82  ---ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              +D++      +  Y +RI IG+P   +  V D+GSD++W QCEPC   QCY Q  P+
Sbjct: 113 SFGSDVVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPC--DQCYNQTDPI 170

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           F+P  S+++  + CSS+ C  L+   +C    C Y V+YGDGS++ G LA ET+T+G T 
Sbjct: 171 FNPATSASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTV 230

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----P 249
            Q  A+     GCG  N G+F     G++GLGGG +S + Q+     G F YCLV    P
Sbjct: 231 IQDTAI-----GCGHWNEGMF-VGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMP 284

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TP 299
           V +  +            ++  P     +FY +++  ++VG  R+ +S          T 
Sbjct: 285 VGAMWVP-----------LIHNPF--YPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTG 331

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTI 357
            +V+D+GT +T LP    +       +     P A      + CY  N     +VP V+ 
Sbjct: 332 GVVMDTGTAITRLPTVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSF 391

Query: 358 HFRGADVKLSRSNFFVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           +F G  +    +  F+  ++D+   C  F    + + I GNI Q    V  D     V F
Sbjct: 392 YFSGGQILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGF 451

Query: 416 KPTDC 420
            P  C
Sbjct: 452 GPNVC 456


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 139/441 (31%), Positives = 207/441 (46%), Gaps = 61/441 (13%)

Query: 23  EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRL-------NHFNQNS 72
             + G   +E+  R   S +   +N          D   RS+ NR+       N   Q+S
Sbjct: 57  RKEKGAIVLEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSS 116

Query: 73  SISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            I    AS  ++     NY++ I +G        + DTGSDL W QC+PC    CY Q  
Sbjct: 117 EIQIPLASGINL--ETLNYIVTIGLGNQ--NMTVIIDTGSDLTWVQCDPCMS--CYSQQG 170

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLAT 184
           P+F+P  SS+Y SL C+SS C +L     N ++C   N   C ++VSYGDGSF++G L  
Sbjct: 171 PVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGV 230

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           E ++ G      +++    FGCG NN GLF    +GI+GLG  ++S+ISQ  TT  G FS
Sbjct: 231 EHLSFG-----GISVSNFVFGCGRNNKGLFGG-VSGIMGLGRSNLSMISQTNTTFGGVFS 284

Query: 245 YCL----------VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL 294
           YCL          + + +    F     ++   +VS P  +   FYVL +  I VG    
Sbjct: 285 YCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNP--QLSNFYVLNLTGIDVG---- 338

Query: 295 GVSTPD-------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
           GV+  D       I+IDSGT +T L P  YN+ L +         P+A     L+ C++ 
Sbjct: 339 GVAIQDTSFGNGGILIDSGTVITRLAPSLYNA-LKAEFLKQFSGYPIAPALSILDTCFNL 397

Query: 347 NSLSQV--PEVTIHFR-GADVKLSRSN-FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQT 400
             + +V  P +++HF    D+ +      ++      VC     ++  N + I GN  Q 
Sbjct: 398 TGIEEVSIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQR 457

Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
           N  V YD +Q  + F   DC+
Sbjct: 458 NQRVIYDAKQSKIGFAREDCS 478


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  159 bits (401), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 122/369 (33%), Positives = 179/369 (48%), Gaps = 48/369 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           ++ + +SIGTPP  R  + DTGSDLIWTQC+     Q   ++ PL+DP  SS++ + PC 
Sbjct: 88  HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCD 145

Query: 150 SSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
              C   S N K+CS   C Y+ +YG  + + G LA+ET T G     +V+L    FGCG
Sbjct: 146 GRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSL---DFGCG 201

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKINFG----- 258
               G      +GI+G+    +SL+SQ++     +FSYCL P     +++ I FG     
Sbjct: 202 KLTSGSL-PGASGILGISPDRLSLVSQLQIP---RFSYCLTPFLDRNTTSHIFFGAMADL 257

Query: 259 ----TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI-DSGTTLTFLP 313
               T G +    +V+ P   +  +Y + +  ISVG +RL V      I   G+  TF+ 
Sbjct: 258 SKYRTTGPIQTTSLVTNP-DGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVD 316

Query: 314 QGYNSNLLS--VMSSMIEAQ------PVADPTG---SLELCYSF--------NSLSQVPE 354
            G  + +L   VM ++ EA       PV + T      ELC+           +  QVP 
Sbjct: 317 SGDTTGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPP 376

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           +  HF  GA + L R ++ V+VS   +C V         I GN  Q N  V +D+E    
Sbjct: 377 LVYHFDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGA-IIGNYQQQNMHVLFDVENHEF 435

Query: 414 SFKPTDCTK 422
           SF PT C +
Sbjct: 436 SFAPTQCNQ 444


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 117/357 (32%), Positives = 168/357 (47%), Gaps = 54/357 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 88  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 145

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           S+ C  L   S                       ++  T     G   ++PG+ FGCG  
Sbjct: 146 STLCQGLPVASLP--------------------RSDKFTF---VGAGASVPGVAFGCGLF 182

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVS 264
           N G+F S  TGI G G G +SL SQ++    G FS+C   +     S+  ++   +   +
Sbjct: 183 NNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHCFTTITGAIPSTVLLDLPADLFSN 239

Query: 265 GPGVV-STPLTK---AKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTF 311
           G G V +TPL +     TFY L++  I+VG+ RL V          T   +IDSGT +T 
Sbjct: 240 GQGAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTS 299

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRS 369
           LP      +    ++ ++   V+  T     C S    +   VP++ +HF GA + L R 
Sbjct: 300 LPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRE 359

Query: 370 NFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N+  +V +    I+C ++ +G    V   GN  Q N  V YD++   +SF P  C K
Sbjct: 360 NYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDK 414


>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
          Length = 739

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 105/240 (43%), Positives = 145/240 (60%), Gaps = 17/240 (7%)

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----S 251
           +V+ P I  GCG NN G F+SK  GIVGLGGG +SLIS +  +I  K+SYCLVP+    S
Sbjct: 55  SVSFPKIPIGCGLNNAGTFDSKCFGIVGLGGGVVSLISHIGLSIDSKYSYCLVPLFEFNS 114

Query: 252 STKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTP--------DI 301
           ++KINFG N +V G G VSTP+      TFY L ++ +SVG++R+             +I
Sbjct: 115 TSKINFGENAVVEGLGTVSTPIIPGSFDTFYYLKLEGMSVGSKRIDFVDASTSNELKGNI 174

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 359
           +IDSGTTLT L + + + L + + + I  + V      L LCY    N+  +VP +T HF
Sbjct: 175 IIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPIITTHF 234

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            G D+ L+  N FV V +D +   F  +  S  I+GN+ Q N LVGYD+ ++TVSFKPTD
Sbjct: 235 AGVDIVLNSLNTFVSVFDDAMWFAFAPVA-SGSIFGNLAQMNHLVGYDLLRKTVSFKPTD 293


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 120/379 (31%), Positives = 182/379 (48%), Gaps = 40/379 (10%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +A YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 136 ESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 193

Query: 141 STYKSLPCSSSQCASLNQKSCSGVN---------CQYSVSYGDGSFSNGNLATETVTLGS 191
           S+Y++L C   +C  +                  C Y   YGD S S G+LA E+ T+  
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253

Query: 192 TT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP 249
           T  G +  + G+ FGCG  N GLF+     ++GLG G +S  SQ+R    G  FSYCLV 
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGHTFSYCLVD 312

Query: 250 VSS---TKINFGTN---GIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTP 299
             S   +K+ FG +    + + P +  T      + A TFY + +  + VG + L +S+ 
Sbjct: 313 HGSDVASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSD 372

Query: 300 ----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
                       +IDSGTTL+ F+   Y     + +  M  + P       L  CY+ + 
Sbjct: 373 TWDASEGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSG 432

Query: 349 LS--QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFL 403
           +   +VPE+++ F  GA       N+F+++  D I+C    G   + + I GN  Q NF 
Sbjct: 433 VERPEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFH 492

Query: 404 VGYDIEQQTVSFKPTDCTK 422
           V YD+    + F P  C +
Sbjct: 493 VAYDLHNNRLGFAPRRCAE 511


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  158 bits (399), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 167/338 (49%), Gaps = 37/338 (10%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
           V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y ++ C S +C  L+  +C     
Sbjct: 2   VLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 59

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y V+YGDGS++ G+ ATET+TLG +T     +  +  GCG +N GLF      ++ L
Sbjct: 60  ACLYEVAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAG-LLAL 114

Query: 225 GGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTKA---K 277
           GGG +S  SQ+    A  FSYCLV    P +ST + FG     +  G V+ PL ++    
Sbjct: 115 GGGPLSFPSQIS---ASTFSYCLVDRDSPAAST-LQFGDGAAEA--GTVTAPLVRSPRTS 168

Query: 278 TFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
           TFY + +  ISVG Q L +            +  +++DSGT +T L     + L      
Sbjct: 169 TFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQ 228

Query: 327 MIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCS 382
              + P        + CY  +  +  +VP V++ F G   ++L   N+ + V      C 
Sbjct: 229 GAPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCL 288

Query: 383 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            F     +V I GN+ Q    V +D  +  V F P  C
Sbjct: 289 AFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 139/464 (29%), Positives = 210/464 (45%), Gaps = 73/464 (15%)

Query: 14  LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           L FY+ + I + T         + +LIHR+S   P Y+ +ET   R +   T S+ R + 
Sbjct: 17  LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76

Query: 68  FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               S I   K+    +++ +IP N  + +L+ +SIG+PP  +L V DTGS L+W QC P
Sbjct: 77  LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
           C    C+ Q +  FDP  S ++K+L C       +N   C+  N  +Y + Y  G  S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192

Query: 181 NLATETVTLG-------------STTGQAVALPGITFGCG-----TNNGGLFNSKTTGIV 222
            LA E++                ST    +    ITFGCG     TNN   +N    G+ 
Sbjct: 193 ILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYN----GVF 248

Query: 223 GLGGG-DISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV----STPLTKAK 277
           GLG    I+    M T +  KFSYC+  +++    +  N +V G G      STPL    
Sbjct: 249 GLGAYPHIT----MATQLGNKFSYCIGDINNPL--YTHNHLVLGQGSYIEGDSTPLQIHF 302

Query: 278 TFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFLPQG----YNSNLLSV 323
             Y +T+ +ISVG++ L +           +  ++IDSG T T L  G        ++ +
Sbjct: 303 GHYYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDL 362

Query: 324 MSSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDI 379
           M  ++E  P         LC+       L   P VT HF  GAD+ L   + F +   D 
Sbjct: 363 MKGLLERIPTQRKFEG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDR 420

Query: 380 VCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            C           ++ + G + Q N+ VG+D+EQ  V F+  DC
Sbjct: 421 FCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 137/423 (32%), Positives = 202/423 (47%), Gaps = 38/423 (8%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS------KASQAD 83
           S++++H+  P S       +      + L +  +R+   +   S S +      K + + 
Sbjct: 75  SLKVVHKHGPCSKLSQDEASAAPTHTEILLQDQSRVKSIHSRLSNSKTSGGKDVKVTDST 134

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
            IP        + NY++ + +GTP  +   + DTGSD+ WTQC+PC  S CY Q   +FD
Sbjct: 135 TIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS-CYKQKEQIFD 193

Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
           P  S++Y ++ CSSS C SL     N   C+   C Y + YGD SFS G   TE +TL S
Sbjct: 194 PSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTS 253

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
           T     A   I FGCG NN       + G++GLG   +S++SQ        FSYCL P S
Sbjct: 254 TD----AFNNIYFGCGQNN-QGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCL-PSS 307

Query: 252 STKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----STPDIVI 303
           S+   F T G  +      TPL   +   +FY L    ISVG ++L +     ST   +I
Sbjct: 308 SSSTGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAII 367

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-R 360
           DSGT +T LP    S L +   +++   P+      L+ CY F+S +   VP++   F  
Sbjct: 368 DSGTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSS 427

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           G +V +  +      S   VC  F G +++  V I+GN+ Q    V YD     V F P 
Sbjct: 428 GIEVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPG 487

Query: 419 DCT 421
            C+
Sbjct: 488 GCS 490


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 141/429 (32%), Positives = 200/429 (46%), Gaps = 50/429 (11%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-------YQRLRDA-LTRSLNRLNHFN 69
           V S   A + G +V L HR  P SP   S++ P       + +LR   + R L+  +   
Sbjct: 52  VCSVTPASSSGTTVPLNHRYGPCSP-APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQ 110

Query: 70  Q-NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             + ++ ++  S  D +     Y+I + IG+P   +  + DTGSD+ W +C         
Sbjct: 111 PLDLTVPTTLGSALDTM----EYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------- 159

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
                LFDP  S+TY    CSS+ CA L  N   CS   CQY V YGDGS + G  +++T
Sbjct: 160 TDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDT 219

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
           + L ++      +    FGC  +       K  G++GLGG   SL+SQ   T    FSYC
Sbjct: 220 LALSASD----TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAATYGKSFSYC 275

Query: 247 LVPVSSTK--INFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
           L P + T   + FG     SG G V+TP+    KA T Y + +  ISVG   LG+  P +
Sbjct: 276 LPPTNRTSGFLTFGAPNGTSG-GFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQ-PSV 333

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVM-SSMIE-AQPVADPTGSLELCYSFNSLSQV-- 352
                V+DSGT +T+LP+   S L S   SSM       A P G L+ CY F  L  V  
Sbjct: 334 LSNGSVMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSI 393

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           P V++    GA V L  +   ++      C  F   T+   I GN+ Q  F V +D+ Q 
Sbjct: 394 PAVSLVLDGGAVVDLDGNGIMIQ-----DCLAFAA-TSGDSIIGNVQQRTFEVLHDVGQG 447

Query: 412 TVSFKPTDC 420
              F+   C
Sbjct: 448 VFGFRSGAC 456


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 127/415 (30%), Positives = 187/415 (45%), Gaps = 66/415 (15%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 88  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTP TE   + DTGS + WTQC+ C    C    +  FD   SSTY    C  S
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSASSTYSFGSCIPS 186

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
           G F S   G++GLG G +S +SQ  +     FSYCL    S   + FG            
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKF 291

Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
             +V+GPG +     +   +Y + +  ISVGN+RL +     ++P  +IDS T +T LPQ
Sbjct: 292 TSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 346

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLS 367
              S L +     +   P+++        L+ CY+ +    V  PE+ +HF  GADV+L+
Sbjct: 347 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLN 406

Query: 368 RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            +N         +C  F G T+ + I GN  Q +  V YDI+ + + F    C+K
Sbjct: 407 GTNIVWGSDASRLCLAFAG-TSELTIIGNRQQLSLTVLYDIQGRRIGFGGNGCSK 460


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  157 bits (398), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 113/373 (30%), Positives = 178/373 (47%), Gaps = 43/373 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
           +  Y + + IGTPP   L VADTGSDLIW +C PC    C +      F  + S+TY ++
Sbjct: 83  SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAI 140

Query: 147 PCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
            C S QC  +     +  N       C+Y  +Y D S + G  + E +TL ++TG+   L
Sbjct: 141 HCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKL 200

Query: 200 PGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----- 248
            G++FGCG         G  F     G++GLG   IS  SQ+      KFSYCL+     
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEG-AQGVMGLGRAPISFSSQLGRRFGSKFSYCLMDYTLS 259

Query: 249 --PVSSTKINFGTNGIVSGPGVVS-TPL---TKAKTFYVLTIDAISVGNQRLGV-----S 297
             P S   I    N  VS  G++S TPL     + TFY + I  + V   +L +     S
Sbjct: 260 PPPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWS 319

Query: 298 TPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
             D+     +IDSGTTLTF+ +   + +L      ++    A+PT   +LC + + +++ 
Sbjct: 320 IDDLGNGGTIIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRP 379

Query: 352 -VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYD 407
            +P ++ +  G  V      N+F++  + I C   + ++      + GN+MQ  FL+ +D
Sbjct: 380 ALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFD 439

Query: 408 IEQQTVSFKPTDC 420
            ++  + F    C
Sbjct: 440 RDKSRLGFTRRGC 452


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/451 (28%), Positives = 197/451 (43%), Gaps = 47/451 (10%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
           T LSC+     L   +      +     ++L HRD+  PK         P  R+ D +  
Sbjct: 26  TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 73

Query: 61  SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              R  L    +NS++       + I    A Y   I +GTP  +   V DTGS+L W  
Sbjct: 74  DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 133

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
           C      +    +  +F    S ++K++ C +  C        SL         C Y   
Sbjct: 134 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           Y DGS + G  A ET+T+G T G+   LPG   GC ++  G       G++GL   D S 
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250

Query: 232 ISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTP--LTKAKTFYVLTI 284
            S   +    KFSYCLV   S K     + FG++         +TP  LT+   FY + +
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 310

Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
             IS+G   L +        S    ++DSGT+LT L    Y   +  +   ++E + V  
Sbjct: 311 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 370

Query: 336 PTGSLELCYSFNS---LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
               +E C+SF S   +S++P++T H + GA  +  R ++ V  +  + C  F    T +
Sbjct: 371 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 430

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             + GNIMQ N+L  +D+   T+SF P+ CT
Sbjct: 431 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/380 (30%), Positives = 178/380 (46%), Gaps = 46/380 (12%)

Query: 66  NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
            H  +   +    A       N   Y   I++G+PP +   V DTGSDL W +C+PC P 
Sbjct: 99  RHLAEEEEVEHDLAQTPVSFTNGGVYYSSITLGSPPKDFSLVMDTGSDLTWVRCDPCSP- 157

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
            C    S  FD   S+TYK+L C+              +     +      F +G    +
Sbjct: 158 DC----SSTFDRLASNTYKALTCADD------------LRLPVLLRLWRRLFHSGRSLRD 201

Query: 186 TVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFS 244
           T+ + G+ + +    PG  FGCG+   GL  S   GI+ L  G +S  SQ+      KFS
Sbjct: 202 TLKMAGAASDELEEFPGFVFGCGSLLKGLI-SGEVGILALSPGSLSFPSQIGEKYGNKFS 260

Query: 245 YCLV------PVSSTKINFGTNGI-VSGPG------VVSTPLTKAKTFYVLTIDAISVGN 291
           YCL+       +  + + FG   + +  PG      +  TP+ ++  +Y + +D ISVGN
Sbjct: 261 YCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYTPIGESSIYYTVRLDGISVGN 320

Query: 292 QRLGVS--------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
           QRL +S            + DSGTTLT LP G   ++   ++SM+         G L+ C
Sbjct: 321 QRLDLSPSTFLNGQDKPTIFDSGTTLTMLPSGVCDSIKQSLASMVSGAEFVAIKG-LDAC 379

Query: 344 YSF--NSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 400
           +    +S   +P++T HF  GAD     SN+ + +   + C +F   TN V I+GN+ Q 
Sbjct: 380 FRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFVP-TNEVSIFGNLQQQ 437

Query: 401 NFLVGYDIEQQTVSFKPTDC 420
           +F V +D++ + + FK TDC
Sbjct: 438 DFFVLHDMDNRRIGFKETDC 457


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/451 (28%), Positives = 197/451 (43%), Gaps = 47/451 (10%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
           T LSC+     L   +      +     ++L HRD+  PK         P  R+ D +  
Sbjct: 4   TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 51

Query: 61  SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              R  L    +NS++       + I    A Y   I +GTP  +   V DTGS+L W  
Sbjct: 52  DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 111

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
           C      +    +  +F    S ++K++ C +  C        SL         C Y   
Sbjct: 112 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           Y DGS + G  A ET+T+G T G+   LPG   GC ++  G       G++GL   D S 
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228

Query: 232 ISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTP--LTKAKTFYVLTI 284
            S   +    KFSYCLV   S K     + FG++         +TP  LT+   FY + +
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 288

Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
             IS+G   L +        S    ++DSGT+LT L    Y   +  +   ++E + V  
Sbjct: 289 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 348

Query: 336 PTGSLELCYSFNS---LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
               +E C+SF S   +S++P++T H + GA  +  R ++ V  +  + C  F    T +
Sbjct: 349 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 408

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             + GNIMQ N+L  +D+   T+SF P+ CT
Sbjct: 409 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/432 (29%), Positives = 193/432 (44%), Gaps = 42/432 (9%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           ++  +  YV    E       V L+HR  P +P   S  T  +   D   RS  R ++  
Sbjct: 1   MILHIYIYVSVKPEQNGSTVYVPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIV 59

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           +   +S        ++  +  Y++R+S GTP   ++ V DTGSD+ W QC+PC   QC+ 
Sbjct: 60  RGKKVSVPAHLGTSVM--SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP 117

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLAT 184
           Q  PL+DP  SSTY ++PC+S  C  L   +      SG  C +++SY DG+ + G  + 
Sbjct: 118 QKDPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQ 177

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           + +TL         +    FGCG       GLF+    G++GLG     L   +     G
Sbjct: 178 DKLTL----APGAIVQNFYFGCGHGKHAVRGLFD----GVLGLG----RLRESLGARYGG 225

Query: 242 KFSYCLVPVSSTKINFGTNGIVSGP-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS 297
            FSYCL P  S+K  F   G    P G V TP+       TF  +T+  I+VG ++L + 
Sbjct: 226 VFSYCL-PSVSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLR 284

Query: 298 ----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-- 351
               +  +++DSGT +T L       L S     +EA  +  P G L+ CY+        
Sbjct: 285 PSAFSGGMIVDSGTVITGLQSTAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVV 343

Query: 352 VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDI 408
           VP++ + F  GA + L   N  +       C  F   G   S  + GN+ Q  F V +D 
Sbjct: 344 VPKIALTFTGGATINLDVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDT 399

Query: 409 EQQTVSFKPTDC 420
                 F+   C
Sbjct: 400 STSKFGFRAKAC 411


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 128/395 (32%), Positives = 193/395 (48%), Gaps = 52/395 (13%)

Query: 61  SLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           S+ RL +    ++  I +  +    IIP    +L+ ISIG+PP  +L   DT SDL+W Q
Sbjct: 55  SVERLEYLKAKATGDIIAHLSPNVPIIPQA--FLVNISIGSPPVTQLLHMDTASDLLWLQ 112

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSGVNCQYSVSYGD 174
           C PC    CY Q  P+FDP  S T+++  C +SQ +      N K+ S   C+YS+ Y D
Sbjct: 113 CRPC--INCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS---CEYSMRYMD 167

Query: 175 GSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           G+ S G LA E +   +   +  + AL  + FGCG +N G      TGI+GLG G+ SL+
Sbjct: 168 GTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLV 226

Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAI 287
            +  T    KFSYC   +     ++  N +V G         +TPL     FY +TI+AI
Sbjct: 227 HRFGT----KFSYCFGSLDDP--SYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAI 280

Query: 288 SVG-----------NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
           SV            N+         +ID+G +LT L +     L + +    E +  A  
Sbjct: 281 SVDGIILPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAAD 340

Query: 337 TGSLEL----CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFK 385
               ++    CY+ N       S  P VT HF  GA++ L   + F+K+S ++ C +V  
Sbjct: 341 VNQDDMFKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTP 400

Query: 386 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           G  NS+   G   Q ++ +GYD+E + +SF+  DC
Sbjct: 401 GNMNSI---GATAQQSYNIGYDLEAKKISFERIDC 432


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/387 (31%), Positives = 188/387 (48%), Gaps = 48/387 (12%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 141 ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 198

Query: 141 STYKSLPCSSSQCASL---------NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVT 188
           S+Y+++ C   +C  +         + ++C       C Y   YGD S + G+LA E+ T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258

Query: 189 LGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           +  T  G +  + G+ FGCG  N GLF+     ++GLG G +S  SQ+R      FSYCL
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCL 317

Query: 248 VPVSS---TKINFGTN----GIVSGPGVVSTPL-------TKAKTFYVLTIDAISVGNQR 293
           V   S   +K+ FG +     + + P +  T         + A TFY + +  + VG + 
Sbjct: 318 VDHGSDVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGEL 377

Query: 294 LGVS--TPDI--------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           L +S  T D+        +IDSGTTL+ F+   Y     + M  M  + P+      L  
Sbjct: 378 LNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSYPLVPEFPVLSP 437

Query: 343 CYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSED---IVCSVFKGITNS-VPIYG 395
           CY+ + +   +VPE+++ F  GA       N+F+++  D   I+C    G   + + I G
Sbjct: 438 CYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMSIIG 497

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N  Q NF V YD++   + F P  C +
Sbjct: 498 NFQQQNFHVVYDLQNNRLGFAPRRCAE 524


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 118/349 (33%), Positives = 169/349 (48%), Gaps = 43/349 (12%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
           V DTGSD++W QC PC   +CY Q  P+FDP+ SS+Y ++ C ++ C  L+   C     
Sbjct: 2   VLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRG 59

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y V+YGDGS + G+  TET+T     G  VA   +  GCG +N GLF +    ++GL
Sbjct: 60  ACMYQVAYGDGSVTAGDFVTETLTF--AGGARVAR--VALGCGHDNEGLFVAAAG-LLGL 114

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTK------------INFGTNGIVSGPGVVSTP 272
           G G +S  +Q+       FSYCLV  +S+             ++FG  G V       TP
Sbjct: 115 GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGA-GSVGASSASFTP 173

Query: 273 LT---KAKTFYVLTIDAISVGNQRL-GVSTPD-----------IVIDSGTTLTFLPQGYN 317
           +    + +TFY + +  ISVG  R+ GV+  D           +++DSGT++T L +   
Sbjct: 174 MVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASY 233

Query: 318 SNLLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFF 372
           S L     +         P G    + CY      + +VP V++HF  GA+  L   N+ 
Sbjct: 234 SALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYL 293

Query: 373 VKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + V S    C  F G    V I GNI Q  F V +D + Q V F P  C
Sbjct: 294 IPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 125/411 (30%), Positives = 187/411 (45%), Gaps = 42/411 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           V L+HR  P +P   S  T  +   D   RS  R ++  +   +S        ++  +  
Sbjct: 56  VPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLE 112

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R+S GTP   ++ V DTGSD+ W QC+PC   QC+ Q  PL+DP  SSTY ++PC+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172

Query: 151 SQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C  L   +      SG  C +++SY DG+ + G  + + +TL         +    FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228

Query: 206 CGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
           CG       GLF+    G++GLG     L   +     G FSYCL P  S+K  F   G 
Sbjct: 229 CGHGKHAVRGLFD----GVLGLG----RLRESLGARYGGVFSYCL-PSVSSKPGFLALGA 279

Query: 263 VSGP-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ 314
              P G V TP+       TF  +T+  I+VG ++L +     +  +++DSGT +T L  
Sbjct: 280 GKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQS 339

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNF 371
                L S     +EA  +  P G L+ CY+        VP++ + F  GA + L   N 
Sbjct: 340 TAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNG 398

Query: 372 FVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +       C  F   G   S  + GN+ Q  F V +D       F+   C
Sbjct: 399 ILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 176/367 (47%), Gaps = 48/367 (13%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y++ +SIGTPP    A+ DTGSDL+W +C+ C           +F    SS+YK LP
Sbjct: 2   EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
           C+S+ C+ +   S +G+       C+Y   YGDGS ++G++ ++ ++    G+       
Sbjct: 62  CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSST 253
             G  FGCG    G +N  T G++GLG    SLI Q+   +  KFSYCLV     P + +
Sbjct: 119 FDGFLFGCGRKLKGDWNF-TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177

Query: 254 KINFGTNGIVSGPGVVSTPLTKA----KTFYVLTIDAISVG-------NQRLGVSTP--- 299
            +  G++  + G  VVSTP+       +T Y + + +I+VG       ++  G +T    
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGP 237

Query: 300 ----DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCY--SFNSL 349
                 VIDSGTT T L P  Y +     M   IE Q   P    +  L+LC+  S ++ 
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEA-----MRKSIEEQVILPTLGNSAGLDLCFNSSGDTS 292

Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
              P VT +F     + L   N F   S D+VC         + I GN+ Q NF + YD+
Sbjct: 293 YGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352

Query: 409 EQQTVSF 415
               +SF
Sbjct: 353 VASQISF 359


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 186/375 (49%), Gaps = 36/375 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 199

Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            +Y+++ C   +C  +      ++C   +   C Y   YGD S + G+LA E  T+  T 
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
            G +  +  + FGCG +N GLF+     ++GLG G +S  SQ+R      FSYCLV   S
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318

Query: 253 ---TKINFGTNGIVSG-PGVVST-----PLTKAKTFYVLTIDAISVGNQRLGV--STPDI 301
              +KI FG +  + G P +  T         A TFY + +  + VG ++L +  ST D+
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378

Query: 302 --------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 350
                   +IDSGTTL++  +  Y     + +  M +A P+      L  CY+ + +   
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438

Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYD 407
           +VPE ++ F  GA       N+FV++  D I+C    G   S + I GN  Q NF V YD
Sbjct: 439 EVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYD 498

Query: 408 IEQQTVSFKPTDCTK 422
           ++   + F P  C +
Sbjct: 499 LQNNRLGFAPRRCAE 513


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 107/355 (30%), Positives = 180/355 (50%), Gaps = 34/355 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
             +        +GIVGLG    SL++Q   T    FSYCL P  + K   +  G++  ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGKNSALFLGSSAKLA 217

Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
           G G   STP             +Y + ++ +  G+  + +  S   +++D+ + ++FL  
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
           G    +   ++  + A P+A P    +LC+  +  S   P++   FR GA + ++ SN+ 
Sbjct: 278 GAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYL 337

Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +      VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 186/375 (49%), Gaps = 36/375 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPATS 199

Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            +Y+++ C   +C  +      ++C   +   C Y   YGD S + G+LA E  T+  T 
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 194 -GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS 252
            G +  +  + FGCG +N GLF+     ++GLG G +S  SQ+R      FSYCLV   S
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAG-LLGLGRGALSFASQLRAVYGHAFSYCLVDHGS 318

Query: 253 ---TKINFGTNGIVSG-PGVVST-----PLTKAKTFYVLTIDAISVGNQRLGV--STPDI 301
              +KI FG +  + G P +  T         A TFY + +  + VG ++L +  ST D+
Sbjct: 319 SVGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDV 378

Query: 302 --------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 350
                   +IDSGTTL++  +  Y     + +  M +A P+      L  CY+ + +   
Sbjct: 379 GKDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERV 438

Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYD 407
           +VPE ++ F  GA       N+FV++  D I+C    G   S + I GN  Q NF V YD
Sbjct: 439 EVPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYD 498

Query: 408 IEQQTVSFKPTDCTK 422
           ++   + F P  C +
Sbjct: 499 LQNNRLGFAPRRCAE 513


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 124/421 (29%), Positives = 200/421 (47%), Gaps = 47/421 (11%)

Query: 35  HRDSPKSPFYNSSETPYQRL-------RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           H+DS      + ++   +RL       R   +R  N +   N + S+ +     + I   
Sbjct: 3   HKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQ 62

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY++ + +G    +   + DTGSDL W QC+PC  ++CY Q  P+F+P  S +Y+++ 
Sbjct: 63  SLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--NRCYNQQDPVFNPSKSPSYRTVL 118

Query: 148 CSSSQCASLNQKSC-SGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+S  C SL   +  SGV       C Y V+YGDGS+++G +  E + LG+TT     + 
Sbjct: 119 CNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VN 173

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
              FGCG  N GLF    +G+VGLG  D+SLISQ+     G FSYCL    +T+     +
Sbjct: 174 NFIFGCGRKNQGLFGG-ASGLVGLGRTDLSLISQISPMFGGVFSYCL---PTTEAEASGS 229

Query: 261 GIVSGPGVV---STPLTKAKT-------FYVLTIDAISVGN---QRLGVSTPDIVIDSGT 307
            ++ G   V   +TP++  +        FY L +  I+VG    Q        ++IDSGT
Sbjct: 230 LVMGGNSSVYKNTTPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAPSFGKDRMIIDSGT 289

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADV 364
            ++ LP      L +         P A     L+ C++ +   +V  P++ ++F G A++
Sbjct: 290 VISRLPPSIYQALKAEFVKQFSGYPSAPSFMILDSCFNLSGYQEVKIPDIKMYFEGSAEL 349

Query: 365 KLSRSNFFVKVSEDI--VCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +  +  F  V  D   VC     +   + V I GN  Q N  + YD +   + F    C
Sbjct: 350 NVDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEAC 409

Query: 421 T 421
           +
Sbjct: 410 S 410


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 117/424 (27%), Positives = 195/424 (45%), Gaps = 65/424 (15%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-----SQADIIPNNANYLIRISIGTPPTE 103
           T  + +R A+ RSL+R     ++   ++ +A     S+A ++P    YL+++  GTP   
Sbjct: 45  TDQELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGEYLVKLGTGTPQHF 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
             A  DT SDL+W QC+PC    CY Q  P+F+PK+SS+Y  +PC+S  CA L+   C  
Sbjct: 105 FSAAIDTASDLVWMQCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHE 162

Query: 164 VN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
            +   CQY+  Y     + G LA + + +G     AV      FGC  ++ G   ++ +G
Sbjct: 163 DDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASG 217

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL-- 273
           +VGLG G +SL+SQ+      +F YCL P  S       +  G + + +    V+  +  
Sbjct: 218 LVGLGRGPLSLVSQLSVH---RFMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSS 274

Query: 274 -TKAKTFYVLTIDAISVGNQ-----RLGVSTPD------------------------IVI 303
            T+  ++Y L +D ++VG+Q     R   S P                         +++
Sbjct: 275 STRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIV 334

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF-----NSLSQVPEVT 356
           D  +T++FL       L   +   I   P A P+    L+LC+            VP V+
Sbjct: 335 DVASTISFLETSLYDELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVS 393

Query: 357 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           + F G  ++L R   F  V++  +  +  G T+ V I GN    N  V +++ +  ++F 
Sbjct: 394 LSFDGRWLELDRDRLF--VTDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFA 451

Query: 417 PTDC 420
              C
Sbjct: 452 KASC 455


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 129/420 (30%), Positives = 185/420 (44%), Gaps = 41/420 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSSISSSKAS 80
           SV L HR+ P SP     E P   +   L R   R  +           Q+++ + S  +
Sbjct: 62  SVPLAHRNGPCSPVRGKGELPRAEM---LRRDRERTEYIIRRASRSRRLQDNNDAVSVPT 118

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q     ++  Y+  + +GTP   +  + DTGS L W QC+PC  SQCY Q  PLFDP  S
Sbjct: 119 QLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTS 178

Query: 141 STYKSLPCSSSQC----ASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           S+Y  +PC S +C    A ++   C+      C Y + YG G+   G  +T+ +TLG   
Sbjct: 179 SSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGP-- 236

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVP--V 250
                +    FGCG +          G++GLG    SL  Q      G  FS+CL P  V
Sbjct: 237 --GAIVKRFHFGCGHHQQRGKFDMADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTGV 294

Query: 251 SSTKINFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL----GVSTPDIVI 303
           S+  +  G     S    V TPL        FY L   AISV  Q L     V    ++ 
Sbjct: 295 STGFLALGAPHDTS--AFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVIT 352

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR- 360
           DSGT L+ L +   + L +   S +   P+A P G L+ C++F       VP V++ FR 
Sbjct: 353 DSGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRG 412

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           GA V L  S+    V  D   + +        + G++ Q    V YD+  + V F+   C
Sbjct: 413 GATVHLDASS---GVLMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 136/436 (31%), Positives = 205/436 (47%), Gaps = 45/436 (10%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQR---LRDALTRSLNRLNH 67
           L  + F +  P +  +  F++ L H  S K+      E+P  +   L    T + +RL+ 
Sbjct: 13  LLIILFALTCPKQCTSYRFTLRL-HTKSIKT-----KESPKIKPGYLHSKSTPAPSRLD- 65

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
            N  ++  +   S    IPN A +L  ISIG PP  +L + DTGSDL W QC PC   +C
Sbjct: 66  -NLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KC 121

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
           Y Q  P F P  SSTY++  C S+  A   + +   +G NC+Y + Y D S + G LA E
Sbjct: 122 YPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTG-NCRYHLRYRDFSNTRGILAKE 180

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
            +T  ++    ++ P I FGCG +N G   ++ +G++GLG G  S++++       KFSY
Sbjct: 181 KLTFQTSDEGLISKPNIVFGCGQDNSGF--TQYSGVLGLGPGTFSIVTR---NFGSKFSY 235

Query: 246 CLVPVSSTKINFGTNGIVSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGV----- 296
           C    S     +  N ++ G G       TPL   +  Y L + AIS+G + L +     
Sbjct: 236 CF--GSLIDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIF 293

Query: 297 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN--- 347
               S    VID+G + T L +     L   +  ++    + V D       CY  N   
Sbjct: 294 QRYRSKGGTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKL 353

Query: 348 SLSQVPEVTIHFR-GADVKLSRSNFFV-KVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLV 404
            L   P VT HF  GA++ L   + FV   S D  C      T + + + G + Q N+ V
Sbjct: 354 DLYGFPVVTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNV 413

Query: 405 GYDIEQQTVSFKPTDC 420
           GY++    V F+ TDC
Sbjct: 414 GYNLRTMKVYFQRTDC 429


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 169/352 (48%), Gaps = 33/352 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
               CA L      +CS   C Y VSYGDGS + G  +++T+TL +++    A+ G  FG
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 254

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV 263
           CG    GLFN    G++GLG    SL+ Q   T  G FSYCL   P ++  +  G  G  
Sbjct: 255 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 313

Query: 264 -SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQG 315
            + PG  +T   P   A T+YV+ +  ISVG Q+L V        +          LP  
Sbjct: 314 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 373

Query: 316 YNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 370
             + L S   S + +   P A   G L+ CY+F     V  P V + F  GA V L    
Sbjct: 374 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 433

Query: 371 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                     C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 434 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 140/424 (33%), Positives = 191/424 (45%), Gaps = 47/424 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++PC    CA L      +CS   C Y VSYGDGS + G  +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVS 251
               A+ G  FGCG    GLFN    G++GLG    SL+ Q   T  G FSYCL   P +
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301

Query: 252 STKINFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGT 307
           +  +  G  G   + PG  +T   P   A T+YV+ +  ISVG Q+L V        +  
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361

Query: 308 TLTF----LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
                   LP    + L S   S + +   P A   G L+ CY+F     V  P V + F
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421

Query: 360 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             GA V L              C  F   G    + I GN+ Q +F V   I+  +V FK
Sbjct: 422 GSGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474

Query: 417 PTDC 420
           P+ C
Sbjct: 475 PSSC 478


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/348 (33%), Positives = 171/348 (49%), Gaps = 39/348 (11%)

Query: 84  IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           ++ N+A  Y + +SIGTPP     +ADTGS LIWTQC PC  ++C  + +P F P  SST
Sbjct: 82  LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139

Query: 143 YKSLPCSSSQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +  LPC+SS C  L    ++C+   C Y   YG G F+ G LATET+ +G  +      P
Sbjct: 140 FSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGAS-----FP 193

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---VPVSSTKINF 257
           G+TFGC T NG    + ++GIVGLG   +SL+SQ+      +FSYCL        + I F
Sbjct: 194 GVTFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---ARFSYCLRSNADAGDSPILF 248

Query: 258 GTNGIVSGPGVVSTPLTK-----AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL 312
           G+   V+G  V STPL +     + ++Y + +  I+VG   L ++  ++   +GT   F 
Sbjct: 249 GSLAKVTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPMAMANLTTVNGTRFGF- 307

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
              +++        +     V    G  E      S   V EV    R A   L      
Sbjct: 308 DLCFDATAAGGGGGVPVPTLVLRFAGGAEYAVRRRSYFGVVEVDSQGRAAVECL----LV 363

Query: 373 VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  SE +          S+ I GN+MQ +  V YD++    SF P DC
Sbjct: 364 LPASEKL----------SISIIGNVMQMDLHVLYDLDGGMFSFAPADC 401


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 176/370 (47%), Gaps = 51/370 (13%)

Query: 83  DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
           D  PNN       N+L+ ++ GTPP +   + DTGS + WTQC+PC   +C       FD
Sbjct: 148 DHTPNNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFD 205

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  S TY    C  S            V   Y+++YGD S S GN   +T+TL      +
Sbjct: 206 PSASLTYSLGSCIPST-----------VGNTYNMTYGDKSTSVGNYGCDTMTLE----HS 250

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KI 255
              P   FGCG NN G F S   G++GLG G +S +SQ  +     FSYCL    S   +
Sbjct: 251 DVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSL 310

Query: 256 NFGTNG-----------IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STP 299
            FG              +V+GPG  ++ L ++  ++V  +D ISVGN+RL +     ++P
Sbjct: 311 LFGEKATSQSSSLKFTSLVNGPG--TSGLEESGYYFVKLLD-ISVGNKRLNIPSSVFASP 367

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--P 353
             +IDSGT +T LPQ   S L +     +   P+++        L+ CY+ +    V  P
Sbjct: 368 GTIIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLP 427

Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           E+ +HF  GADV+L+            +C  F G  + + I GN  Q +  V YDI+   
Sbjct: 428 EIVLHFGEGADVRLNGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGR 486

Query: 413 VSFKPTDCTK 422
           + F    C+K
Sbjct: 487 IGFGGNGCSK 496


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 136/436 (31%), Positives = 199/436 (45%), Gaps = 70/436 (16%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI---- 84
             V L+HRDS      N+S        D L R L R     + + I +  A+ AD     
Sbjct: 66  LQVRLVHRDSFA---VNASAA------DLLARRLQR--DMRRAAWIITKAATPADPENGT 114

Query: 85  ----IPNNANYLIRISIGTPPT-----ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
                P +  Y+ +I++GTP       E L   D GSD+ W QC PC   +CY Q  P++
Sbjct: 115 VVTGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVY 172

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLG 190
           +   SS+   + C +  C +L   S  G       CQY V YGDGS S G+   ET+T  
Sbjct: 173 NRLKSSSASDVGCYAPACRALG--SSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP 230

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
                 V +PG+  GCG++N GLF +   GI+GLG G +S  SQ+       FSYCL   
Sbjct: 231 P----GVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQIAGRYGRSFSYCLAGQ 286

Query: 251 S----STKINFGTNGIVSGPGVVSTP----LTKAK--TFYVLTIDAISVGNQRL-GVSTP 299
                S+ + FG+    +            LT ++  TFY + +  ISVG  R+ GV+  
Sbjct: 287 GTGGRSSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVRVRGVTES 346

Query: 300 D-----------IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS---LELCY 344
           D           +++DSGT +T L    Y +   +   + ++      P G     + CY
Sbjct: 347 DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAFRVAAVKELGWPSPGGPFAFFDTCY 406

Query: 345 S---FNSLSQVPEVTIHFRGA-DVKLSRSNFFVKV--SEDIVCSVFKGITN-SVPIYGNI 397
           S      + +VP V++HF G  +VKL   N+ + V  ++  +C  F G  +  V I GNI
Sbjct: 407 SSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNI 466

Query: 398 MQTNFLVGYDIEQQTV 413
               F V YD++ Q V
Sbjct: 467 QLQGFRVVYDVDGQRV 482


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  155 bits (393), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 130/430 (30%), Positives = 200/430 (46%), Gaps = 66/430 (15%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  RL +       S+ S+   +A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               +  ISIG PP  +L V DTGSD++W  C PC  + C      LFDP  SST+  L 
Sbjct: 98  GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C       C  +   ++V+Y D S ++G    +TV   +T      +  + F
Sbjct: 156 KTPCDFEGC------RCDPI--PFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLF 207

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
           GCG N G   +    GI+GL  G  SL+    T +  KFSYC+  ++    N+  + ++ 
Sbjct: 208 GCGHNIGHDTDPGHNGILGLNNGPDSLV----TKLGQKFSYCIGNLADPYYNY--HQLIL 261

Query: 265 GPGV----VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD-----------IVIDSGTTL 309
           G G      STP      FY +T++ ISVG +RL ++ P+           ++ID+G+T+
Sbjct: 262 GEGADLEGYSTPFEVYNGFYYVTMEGISVGEKRLDIA-PETFEMKENRAGGVIIDTGSTI 320

Query: 310 TFLPQGYNS-------NLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVT 356
           TFL    +        NLL  S   + IE  P          C+ + S+S+     P VT
Sbjct: 321 TFLVDSVHKLLSKEVRNLLGWSFRQATIEKSPWMQ-------CF-YGSISRDLVGFPVVT 372

Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQ 410
            HF  GAD+ L   +FF ++++++ C          I +   + G + Q ++ VGYD+  
Sbjct: 373 FHFSDGADLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVN 432

Query: 411 QTVSFKPTDC 420
           Q V F+  DC
Sbjct: 433 QFVYFQRIDC 442


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/382 (31%), Positives = 182/382 (47%), Gaps = 60/382 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +GTP  E + + DTGSD+ W QC PC    C     P F+P+ SS++  LPC+
Sbjct: 137 EYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 194

Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
           SS C ++ Q      S SG  C +S+ YGDGS S+G LA ET+  G+T     G+ V L 
Sbjct: 195 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 253

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---- 256
            IT GC   +     +  +G++G+    IS  SQ+ +  A KFS+C  P     +N    
Sbjct: 254 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDKIAHLNSSGL 312

Query: 257 --FGTNGIVSGPGVVSTPLTK-------AKTFYVLTIDAISVGNQRLGVSTPDI------ 301
             FG + I+S P +  TPL +       +  +Y + +  ISV   RL +S  +       
Sbjct: 313 VFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVT 371

Query: 302 -----VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
                +IDSGT  T+L     Q      L+  S + +     D       CY+  S +  
Sbjct: 372 GSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK----VDDNSGFTPCYNITSGTAA 427

Query: 352 -----VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQ 399
                +P +T+HFRG  DV L +++  + VS    +  +C  F+ ++  +P  I GN  Q
Sbjct: 428 LESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNYQQ 486

Query: 400 TNFLVGYDIEQQTVSFKPTDCT 421
            N  V YD+E+  +   P  C 
Sbjct: 487 QNLWVEYDLEKLRLGIAPAQCA 508


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 180/355 (50%), Gaps = 34/355 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
             +        +GIVGLG    SL++Q   T    FSYCL P  + +   +  G++  ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGRNSALFLGSSAKLA 217

Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
           G G   STP             +Y + ++ +  G+  + +  S   +++D+ + ++FL  
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
           G    +   +++ + A P+A P    +LC+  +  S   P++   FR GA + +  +N+ 
Sbjct: 278 GAYQAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYL 337

Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +      VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 195/431 (45%), Gaps = 53/431 (12%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
           +HRDS  SP+  ++ T +  +R+ L R   RL   +   S+  +   K+S  + + N   
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 88  -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
                            +  Y + + +GTPP     VADTGSD++W QC PC    CY Q
Sbjct: 61  FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
             PLF+P  SST++S+ C SS C  L  + C    C Y VSYGDGSF+ G  +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           S    +VA+     GCG NN GLF +   G++GLG G +S  SQ+       FSYCL   
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF-TGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232

Query: 251 SST---KINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV--------- 296
            ST    + FG   + S     +T LT  K  TFY + +  I VG   + +         
Sbjct: 233 ESTGSVPLIFGNQAVASN-AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDS 291

Query: 297 --STPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV- 352
                 +++DSGT +T L    YN    +  + M     +       + CY  +  S + 
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351

Query: 353 -PEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            P V+  F  GA + L   N  V V      C  F   + +  I GNI Q +F + +D  
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDST 411

Query: 410 QQTVSFKPTDC 420
              V      C
Sbjct: 412 GNRVGIGANQC 422


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  155 bits (392), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 125/352 (35%), Positives = 169/352 (48%), Gaps = 33/352 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 47  NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106

Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
               CA L      +CS   C Y VSYGDGS + G  +++T+TL +++    A+ G  FG
Sbjct: 107 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 162

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV 263
           CG    GLFN    G++GLG    SL+ Q   T  G FSYCL   P ++  +  G  G  
Sbjct: 163 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 221

Query: 264 -SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQG 315
            + PG  +T   P   A T+YV+ +  ISVG Q+L V        +          LP  
Sbjct: 222 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 281

Query: 316 YNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 370
             + L S   S + +   P A   G L+ CY+F     V  P V + F  GA V L    
Sbjct: 282 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 341

Query: 371 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                     C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 342 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 386


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 136/431 (31%), Positives = 195/431 (45%), Gaps = 53/431 (12%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
           +HRDS  SP+  ++ T +  +R+ L R   RL   +   S+  +   K+S  + + N   
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 88  -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
                            +  Y + + +GTPP     VADTGSD++W QC PC    CY Q
Sbjct: 61  FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
             PLF+P  SST++S+ C SS C  L  + C    C Y VSYGDGSF+ G  +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV 250
           S    +VA+     GCG NN GLF +   G++GLG G +S  SQ+       FSYCL   
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF-TGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTR 232

Query: 251 SST---KINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV--------- 296
            ST    + FG   + S     +T LT  K  TFY + +  I VG   + +         
Sbjct: 233 ESTGSVPLIFGNQAVASN-AQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDS 291

Query: 297 --STPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV- 352
                 +++DSGT +T L    YN    +  + M     +       + CY  +  S + 
Sbjct: 292 STGNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIM 351

Query: 353 -PEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            P V+  F  GA + L   N  V V      C  F   + +  I GNI Q +F + +D  
Sbjct: 352 LPAVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDST 411

Query: 410 QQTVSFKPTDC 420
              V      C
Sbjct: 412 GNRVGIGANQC 422


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  155 bits (391), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 143/435 (32%), Positives = 204/435 (46%), Gaps = 48/435 (11%)

Query: 30  SVELIHRDSPKSPFYNSS-ETPYQRL-RDALT-----RSLNRLNHFNQNSSISSS--KAS 80
           +V L HR  P SP  N    T  +RL RD L      R L+R        +      + S
Sbjct: 63  TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQQS 122

Query: 81  QADIIP-------NNANYLIRISIGTPPTE-RLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            A  +P       +   Y+I + +G+PP + +  + DTGSD+ W +C+PC   QC  Q  
Sbjct: 123 HAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW-QQCRPQVD 181

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGV-NCQYSVSYGDGSF-SNGNLATET 186
           PLFDP +SSTY    CSS+ CA L    N   CS    CQY   YGDGS  + G  +++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA-GKFSY 245
           + LGS +   V +    FGC     G+       +   GG   SL+SQ   T     FSY
Sbjct: 242 LALGSNS-NTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQ-SLVSQTAGTFGTTAFSY 299

Query: 246 CLVPVSSTK--INFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVST-- 298
           CL P  S+   +  G  G  S  G V TP+ ++     FY + ++AI VG ++L + T  
Sbjct: 300 CLPPTPSSSGFLTLGAAG-TSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTV 358

Query: 299 --PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV- 352
               +++DSGT +T LP    S+L S   + ++  P A  +   G L+ C+  +  S V 
Sbjct: 359 FSAGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVS 418

Query: 353 -PEVTIHFRGAD---VKLSRSNFFVKV-SEDIVCSVFKGITN--SVPIYGNIMQTNFLVG 405
            P V + F GA    V L  S   +++ +  I C  F   ++  S  I GN+ Q  F V 
Sbjct: 419 MPTVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVL 478

Query: 406 YDIEQQTVSFKPTDC 420
           YD+    V FK   C
Sbjct: 479 YDVAGGAVGFKAGAC 493


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 142/468 (30%), Positives = 218/468 (46%), Gaps = 69/468 (14%)

Query: 6   SCVFILFFLCFYVVSPI------------EAQTGGFSVELIHRDSPKSPFYNSSETPYQR 53
           S +F LF L  ++  P+            + +  GF   LIH  SP+SPFY  + TP + 
Sbjct: 8   SAIFRLFLLILHIPFPLSSSFSLPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPGEL 67

Query: 54  LRDALTRSLNRLNHFNQ--NSSISSSK---ASQADIIPNNANYLIRISIGTPPTERLAVA 108
           +R ++  S  R +   +  +S IS+S+    S+  II  +  Y+++ +IG+PP E  A+ 
Sbjct: 68  MRASVRTSRARGDRIRKIRSSGISNSRKYPVSRISII--DKVYVMKFNIGSPPVETYAIP 125

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS--------LNQKS 160
           DTGS+++W QC     + CY Q  PLF+P  SSTY    C   +C          L  KS
Sbjct: 126 DTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKS 185

Query: 161 CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP-GITFGCGTNN----GGLFN 215
              V C+Y +SY D SFS G ++T+ +T      +       + FGCG NN    G   N
Sbjct: 186 SVQV-CRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPN 244

Query: 216 SKTT-GIVGLGGGDISLISQMRTTIAGKFSYCL------VPVSSTKINFGTNGIVSGPGV 268
           S T  G+VGLG    SL+ Q+     G+FSYC+       P  + +I FG    +SG   
Sbjct: 245 SFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNGTIEIRFGLAASISGH-- 299

Query: 269 VSTPLT-KAKTFYVL-TIDAISVGNQRLGVSTPD------------IVIDSGTTLTFLPQ 314
            ST L    + +Y+   +D I V + ++    P+            +++DSGTT T L  
Sbjct: 300 -STALANNLEGWYIFQNVDGIYVDDTKVK-GYPEWVFQFAEGGIGGLIMDSGTTYTELYF 357

Query: 315 GYNSNLLSVMSSMIEAQP-VADPTGS-LELCYSFNS--LSQVPEVTIHF---RGADVKLS 367
                L+  +   IE  P   D + S   LCY+  +  L+ VP + + F   + A    +
Sbjct: 358 SALDALIGELKEQIELAPDTQDHSNSNYSLCYNAANFLLTYVPAIELKFTDNKEAYFPFT 417

Query: 368 RSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
             N ++    D  C    G T+ + I G     +  +GYD++   VSF
Sbjct: 418 LRNAWIDNGNDQYCLAMFG-TSGISIIGIYQHRDIKIGYDLKYNLVSF 464


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 174/367 (47%), Gaps = 43/367 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +  S+GTP  +   + DTGSDL + QC PC    CY QD PL+ P  SST+  +P
Sbjct: 31  SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC--DLCYEQDGPLYQPSNSSTFTPVP 88

Query: 148 CSSSQ-----------CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           C S++           C+S   +S     C Y   YGD S + G  A ET T+G      
Sbjct: 89  CDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGIRVNH 148

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---- 252
           VA     FGCG  N G F S   G++GLG G +S  SQ       KF+YCL    S    
Sbjct: 149 VA-----FGCGNRNQGSFVS-AGGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSV 202

Query: 253 -TKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL----------GVST 298
            + + FG + + +   +  TPL       + Y + I  I  G + L           V  
Sbjct: 203 FSSLIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGN 262

Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
              + DSGTT+T+  PQ Y   + +   S+   +    P G L LC + + +     P  
Sbjct: 263 GGTIFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQG-LPLCVNVSGIDHPIYPSF 321

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           TI F +GA  + ++ N+F++VS +I C ++ +  ++   + GNI+Q N+LV YD E+  +
Sbjct: 322 TIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRI 381

Query: 414 SFKPTDC 420
            F   +C
Sbjct: 382 GFAHANC 388


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  154 bits (390), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 114/368 (30%), Positives = 179/368 (48%), Gaps = 39/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C++Q+ P +DPK SS++K++ C
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGC 247

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++     + C   N  C Y   YGD S + G+ A ET T+  T+     +   
Sbjct: 248 HDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKR 307

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV  +     S+
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 366

Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVS------TPD- 300
           K+ FG +  +++ P V  T L   K     TFY + I +I VG + L +       +P+ 
Sbjct: 367 KLIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEG 426

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
               ++DSGTTL++  +     +       ++  PV      L+ CY+ + +   ++PE 
Sbjct: 427 AGGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEF 486

Query: 356 TIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
            I F  GA       N+F+K+  E+IVC    G   S + I GN  Q NF + YD ++  
Sbjct: 487 RILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQQNFHILYDTKKSR 546

Query: 413 VSFKPTDC 420
           + + P  C
Sbjct: 547 LGYAPMKC 554


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score =  154 bits (389), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 175/367 (47%), Gaps = 48/367 (13%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y++ +SIGTPP    A+ DTGSDL+W +C+ C           +F    SS+YK LP
Sbjct: 2   EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
           C+S+ C+ +   S +G+       C+Y   YGDGS ++G++ ++ ++    G+       
Sbjct: 62  CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSST 253
             G  FGC     G +N  T G++GLG    SLI Q+   +  KFSYCLV     P + +
Sbjct: 119 FDGFLFGCARKLKGDWNF-TQGLIGLGQKSHSLIQQLGDKLGYKFSYCLVSYDSPPSAKS 177

Query: 254 KINFGTNGIVSGPGVVSTPLTKA----KTFYVLTIDAISVG-------NQRLGVSTP--- 299
            +  G++  + G  VVSTP+       +T Y + + +I++G       ++  G +T    
Sbjct: 178 FLFLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGP 237

Query: 300 ----DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCY--SFNSL 349
                 VIDSGTT T L P  Y +     M   IE Q   P    +  L+LC+  S ++ 
Sbjct: 238 FLANKTVIDSGTTYTLLTPPVYEA-----MRKSIEEQVILPTLGNSAGLDLCFNSSGDTS 292

Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
              P VT +F     + L   N F   S D+VC         + I GN+ Q NF + YD+
Sbjct: 293 YGFPSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDL 352

Query: 409 EQQTVSF 415
               +SF
Sbjct: 353 VASQISF 359


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 122/362 (33%), Positives = 180/362 (49%), Gaps = 48/362 (13%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + + +GTPP     + D GSDL+WTQC    P+    Q  P+FD   SS++  LPC S  
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTA--KQLEPVFDAARSSSFSVLPCDSKL 166

Query: 153 C--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C   +   K+C+   C Y   YG  + + G LATET T G+  G +  L   TFGCG   
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMT-ATGVLATETFTFGAHHGVSANL---TFGCGKLA 222

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN---GIVS 264
            G   ++ +GI+GL  G +S++ Q+  T   KFSYCL P +  K   + FG     G   
Sbjct: 223 NGTI-AEASGILGLSPGPLSMLKQLAIT---KFSYCLTPFADRKTSPVMFGAMADLGKYK 278

Query: 265 GPGVVST-PLTK---AKTFYVLTIDAISVGNQRLGVS------TPD----IVIDSGTTLT 310
             G V T PL K      +Y + +  +SVG++RL V        PD     V+DS TTL 
Sbjct: 279 TTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLA 338

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLS----QVPEVTIHFRG-AD 363
           +L +   + L   +   I+  PVA+ +     +C+     +S    QVP + +HF G A+
Sbjct: 339 YLVEPAFTELKKAVMEGIKL-PVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAE 397

Query: 364 VKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           + L R N+F + S  ++C     + F+G  N   + GN+ Q N  V YD+  +  S+ PT
Sbjct: 398 MSLPRDNYFQEPSPGMMCLAVMQAPFEGAPN---VIGNVQQQNMHVLYDVGNRKFSYAPT 454

Query: 419 DC 420
            C
Sbjct: 455 KC 456


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 120/381 (31%), Positives = 181/381 (47%), Gaps = 60/381 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +GTP  E + + DTGSD+ W QC PC    C     P F+P+ SS++  LPC+
Sbjct: 138 EYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 195

Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
           SS C ++ Q      S SG  C +S+ YGDGS S+G LA ET+  G+T     G+ V L 
Sbjct: 196 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 254

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---- 256
            IT GC   +     +  +G++G+    IS  SQ+ +  A KFS+C  P     +N    
Sbjct: 255 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF-PDKIAHLNSSGL 313

Query: 257 --FGTNGIVSGPGVVSTPLTK-------AKTFYVLTIDAISVGNQRLGVSTPDI------ 301
             FG + I+S P +  TPL +       +  +Y + +  ISV   RL +S  +       
Sbjct: 314 VFFGESDIIS-PYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVT 372

Query: 302 -----VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
                +IDSGT  T+L     Q      L+  S + +     D       CY+  S +  
Sbjct: 373 GSGGTIIDSGTAFTYLKKPAFQAMRREFLARTSHLAK----VDDNSGFTPCYNITSGTAA 428

Query: 352 -----VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQ 399
                +P +T+HFRG  DV L +++  + VS    +  +C  F  ++  +P  I GN  Q
Sbjct: 429 LESTILPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNYQQ 487

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            N  V YD+E+  +   P  C
Sbjct: 488 QNLWVEYDLEKLRLGIAPAQC 508


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  154 bits (389), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 39/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+ P +DPK SS++K++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITC 250

Query: 149 SSSQCASLNQ----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
              +C  ++     + C G   +C Y   YGD S + G+ A ET T+  TT +       
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+     ++GLG G +S  +Q+++     FSYCLV  +     S+
Sbjct: 311 VENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFATQLQSLYGHSFSYCLVDRNSNSSVSS 369

Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
           K+ FG +  ++S P +  T     K     TFY + I +I VG + L +           
Sbjct: 370 KLIFGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQG 429

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
               +IDSGTTLT+  +     +       I+  P+ +    L+ CY+ + +   ++PE 
Sbjct: 430 GGGTIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEF 489

Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
            I F  GA       N+F+++  ED+VC    G   S + I GN  Q NF + YD+++  
Sbjct: 490 AILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSR 549

Query: 413 VSFKPTDC 420
           + + P  C
Sbjct: 550 LGYAPMKC 557


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 117/388 (30%), Positives = 182/388 (46%), Gaps = 83/388 (21%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFY 280
           +VGLG G +SL+SQ+     G     ++ ++ST I F                       
Sbjct: 216 VVGLGRGPLSLVSQLSVRRYGM----IIDIAST-ITF----------------------- 247

Query: 281 VLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 340
                        L  S  D +++       LP+G  S+L                   L
Sbjct: 248 -------------LEASLYDELVNDLEVEIRLPRGTGSSL------------------GL 276

Query: 341 ELCY------SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVP 392
           +LC+      +F+ +  VP V + F G  ++L ++  F +  E  ++C  V +    SV 
Sbjct: 277 DLCFILPDGVAFDRV-YVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVS 335

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           I GN  Q N  V Y++ +  V+F  + C
Sbjct: 336 ILGNFQQQNMQVLYNLRRGRVTFVQSPC 363


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 131/415 (31%), Positives = 205/415 (49%), Gaps = 32/415 (7%)

Query: 30  SVELIHRDSPKSPFYNS-SETPY---QRLR-DALTRSLNRLNHFNQNSSISSSKASQADI 84
           S++++H+  P     N  S   +    +LR D++   L++++       + +   +Q+ I
Sbjct: 69  SLQVLHKYGPCMQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEMVTKLPAQSGI 128

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                NY++ + +GTP  +   V DTGS + WTQC+PC  S CY Q    FDP  S++Y 
Sbjct: 129 AIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGS-CYPQKEQKFDPTKSTSYN 187

Query: 145 SLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           ++ CSS+ C  L  +++ CS  N  C Y + YGD S+S G  ATET+T+ S+        
Sbjct: 188 NVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD----VFT 243

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFG 258
              FGCG +N GLF  +  G++GL    +SL SQ       +FSYCL   P S+  +NFG
Sbjct: 244 NFLFGCGQSNNGLFG-QAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNFG 302

Query: 259 TNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
             G VS      TP++ A  +FY + I  ISV   +L +     +T   +IDSGT +T L
Sbjct: 303 --GKVSQTAGF-TPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRL 359

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA-DVKLSRS 369
           P      L       +   P  +    L+ CY F++ + V  P+V++ F+G  +V +  S
Sbjct: 360 PPTAYKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDAS 419

Query: 370 NFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                V+   +VC  F    +     I+GN  Q  + V YD  +  + F    C+
Sbjct: 420 GILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 135/444 (30%), Positives = 202/444 (45%), Gaps = 61/444 (13%)

Query: 27  GGFSV-ELIHRDSPKSPFYNSSETPYQRLRDALTR--SLN-RLNHFNQNSSISSSK---- 78
           GG +V EL H     +P  +  E     L     R  SL  R+ H+   ++ SS++    
Sbjct: 65  GGATVLELRHHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVT 124

Query: 79  ASQADI-IPNNA-----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           AS+A + + + A     NY+  + +G    E   + DT S+L W QC PC    C+ Q  
Sbjct: 125 ASKAQVPVSSGARLRTLNYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQG 180

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-------------CQYSVSYGDGSFSN 179
           PLFDP  S +Y ++PC S  C +L Q+  +G               C Y++SY DGS+S 
Sbjct: 181 PLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSR 240

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G LA + ++L         + G  FGCGT+N G     T+G++GLG   +SL+SQ     
Sbjct: 241 GVLAHDRLSLAGEV-----IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTVDQF 295

Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVV--STPLTKAKT-----------FYVLTIDA 286
            G FSYCL P+S      G+  +   P     STP+                FY++ +  
Sbjct: 296 GGVFSYCL-PLSRESDASGSLVLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTG 354

Query: 287 ISVGNQRLGVS--TPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
           I+VG Q +  +  +   ++DSGT +T  +P  YN+     MS + E  P A     L+ C
Sbjct: 355 ITVGGQEVESTGFSARAIVDSGTVITSLVPSVYNAVRAEFMSQLAE-YPQAPGFSILDTC 413

Query: 344 YSFNSLS--QVPEVTIHFR-GADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGN 396
           ++   L   QVP +T+ F  GA+V++      +FV      VC     +   +   I GN
Sbjct: 414 FNMTGLKEVQVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGN 473

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
             Q N  V +D     V F    C
Sbjct: 474 YQQKNLRVVFDTSASQVGFAQETC 497


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 168/356 (47%), Gaps = 53/356 (14%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 180 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 237

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 238 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 292

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
            GLF   T G++GLG  ++SL+SQ      G FSYCL   +S        G +S  G  S
Sbjct: 293 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGD----AAGSLSLGGDTS 347

Query: 271 -----TPLTKAKT--------FYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-P 313
                TP++  +         FY + +             G+   ++++DSGT +T L P
Sbjct: 348 SYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAP 407

Query: 314 QGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRS 369
             Y +           E  P A P   L+ CY+     +  VP +T+    GAD+ +  +
Sbjct: 408 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 467

Query: 370 NFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                  +D   VC     ++  +  PI GN  Q N  V YD     + F   DC+
Sbjct: 468 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 523


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  154 bits (388), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 168/356 (47%), Gaps = 53/356 (14%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
            GLF   T G++GLG  ++SL+SQ      G FSYCL   +S        G +S  G  S
Sbjct: 292 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGGVFSYCLPAATSGD----AAGSLSLGGDTS 346

Query: 271 -----TPLTKAKT--------FYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-P 313
                TP++  +         FY + +             G+   ++++DSGT +T L P
Sbjct: 347 SYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAP 406

Query: 314 QGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRS 369
             Y +           E  P A P   L+ CY+     +  VP +T+    GAD+ +  +
Sbjct: 407 SVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAA 466

Query: 370 NFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                  +D   VC     ++  +  PI GN  Q N  V YD     + F   DC+
Sbjct: 467 GMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 522


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 100/283 (35%), Positives = 143/283 (50%), Gaps = 25/283 (8%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 71  GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 186

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+    G + LGGG  SL+ Q  T     FSYCL P +S+ + F   G+        P  
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 245

Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNL 320
           VSTPL   + A TFY + + AI V  + L V  P +     VIDS T ++ LP      L
Sbjct: 246 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVP-PAVFSASSVIDSSTIISRLPPTAYQAL 304

Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG 361
            +   S +     A P   L+ CY F  +  +  P + + F G
Sbjct: 305 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDG 347



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/288 (25%), Positives = 113/288 (39%), Gaps = 62/288 (21%)

Query: 156 LNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
           + QK+  G      CQ+ ++YGDGS + G  + + +TLG        LP           
Sbjct: 381 VQQKTLEGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLP----------- 429

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----P 266
            L  +   G V                    FSYC +P S + + F T G+        P
Sbjct: 430 -LRTATQYGRV--------------------FSYC-IPPSPSSLGFITLGVPPQRAALVP 467

Query: 267 GVVSTPLTKAK----TFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYN 317
             VSTPL  +     TFY + + AI V  + L V  P +     VI S T ++ LP    
Sbjct: 468 TFVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVP-PTVFSTSSVIASTTVISRLPPTAY 526

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVK 374
             L +     +     A P   L+ CY F  +  +  P + + F G A V L  +   ++
Sbjct: 527 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 586

Query: 375 VSEDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 C  F    T+ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 587 G-----CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 629


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 135/448 (30%), Positives = 203/448 (45%), Gaps = 80/448 (17%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  G  +EL H D+ ++    S+E   +R+R A  R+  RL    + S+      SQ   
Sbjct: 20  RAAGLRLELTHVDAKQN---CSTE---ERMRRATERTHRRLASMGEASAPVHWAESQ--- 70

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                 Y+    IG PP +  A+ DTGS+LIWTQC  C P+ C+ Q+   +DP  S T +
Sbjct: 71  ------YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTAR 124

Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
            + C+ + CA  ++  C+  N  C    +YG G    G L TE  T    + + V+L   
Sbjct: 125 PVACNDTACALGSETRCARDNKACAVLTAYGAGVI-GGVLGTEAFTFQPQS-ENVSL--- 179

Query: 203 TFGC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
            FGC        G+ +G       +GI+GLG G++SL+SQ+      KFSYCL P  S  
Sbjct: 180 AFGCIAATRLTPGSLDG------ASGIIGLGRGNLSLVSQLGDN---KFSYCLTPYFSQS 230

Query: 255 INF------GTNGIVSGPG-VVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPDI 301
            N        + G+ SG     S P  K        TFY L +  I+VG+ +L V     
Sbjct: 231 TNTSRLFVGASAGLSSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAF 290

Query: 302 -------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF 346
                        +IDSG+  T L       L   +   + A  V  P G+  L+LC + 
Sbjct: 291 DLRQVATGLWAGTLIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAV 350

Query: 347 ---NSLSQVPEVTIHF--RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----I 393
              +    VP + +HF   G DV +   N++  V +   C V     G  +++P     I
Sbjct: 351 AHGDVGKLVPPLVLHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTI 410

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            GN MQ +  + YD+E+  +SF+P DC+
Sbjct: 411 IGNYMQQDMHLLYDLEKGMLSFQPADCS 438


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/296 (34%), Positives = 148/296 (50%), Gaps = 26/296 (8%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+    G + LGGG  SL+ Q  T     FSYCL P +S+ + F   G+        P  
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASS-LGFLVLGVPPERAQLIPSF 336

Query: 269 VSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNL 320
           VSTPL   + A TFY + + AI V  + L V  P +     VIDS T ++ LP      L
Sbjct: 337 VSTPLLSSSMAPTFYRVLLRAIIVAGRPLAVP-PAVFSASSVIDSSTIISRLPPTAYQAL 395

Query: 321 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 373
            +   S +     A P   L+ CY F  +  +  P + + F  GA V L  +   +
Sbjct: 396 RAAFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILL 451



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 74/288 (25%), Positives = 113/288 (39%), Gaps = 62/288 (21%)

Query: 156 LNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
           + QK+  G      CQ+ ++YGDGS + G  + + +TLG        LP           
Sbjct: 472 VQQKTLEGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYDVDRQGLP----------- 520

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----P 266
            L  +   G V                    FSYC +P S + + F T G+        P
Sbjct: 521 -LRTATQYGRV--------------------FSYC-IPPSPSSLGFITLGVPPQRAALVP 558

Query: 267 GVVSTPLTKAK----TFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYN 317
             VSTPL  +     TFY + + AI V  + L V  P +     VI S T ++ LP    
Sbjct: 559 TFVSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPVP-PTVFSTSSVIASTTVISRLPPTAY 617

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVK 374
             L +     +     A P   L+ CY F  +  +  P + + F G A V L  +   ++
Sbjct: 618 QALRAAFRRAMTMYRTAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLQ 677

Query: 375 VSEDIVCSVFKGI-TNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 C  F    T+ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 678 G-----CLAFAPTATDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 720


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  153 bits (386), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/398 (31%), Positives = 185/398 (46%), Gaps = 69/398 (17%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            ADI   ++ YLI +SIGTP  +R+A+  DTGSDL+WTQC  C    C+ Q  P FD   
Sbjct: 93  DADI---DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-C--HVCFAQPFPTFDALA 146

Query: 140 SSTYKSLPCSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL------ 189
           S T  ++PCS   C S    L+  + +   C Y   Y D S ++G +  +T T       
Sbjct: 147 SQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGN 206

Query: 190 -GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
            GS     VA+P + FGCG  N G+F S  +GI G   G +SL SQ++     +FS+C  
Sbjct: 207 NGSKAHAGVAVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKV---ARFSHCFT 263

Query: 249 PVSSTKI------------NFGTNGIVSGPGVVSTPLTKAK-TFYVLTIDAISVGNQRLG 295
            ++  +             N G +   +GP V STP   +  + Y LT+  I+VG  RL 
Sbjct: 264 AIADARTSPVFLGGAPGPDNLGAH--ATGP-VQSTPFANSNGSLYYLTLKGITVGKTRLP 320

Query: 296 VST------------PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-- 341
           ++                +IDSGT +  LP     +L +   + ++  PVA+ + +    
Sbjct: 321 LNALAFAGKGTGSGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAES 379

Query: 342 -LCYS---------FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI------VCSVFK 385
            LC+                +P+V +H  GAD  L R ++ + + ED       +C V  
Sbjct: 380 TLCFEAARSASLPPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMN 439

Query: 386 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
              +S + I GN  Q N  V YD+E+  + F P  C K
Sbjct: 440 SAGDSDLTIIGNFQQQNMHVAYDLEKNKLVFVPARCDK 477


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 121/359 (33%), Positives = 172/359 (47%), Gaps = 34/359 (9%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           IPN A +L  ISIG PP  +L + DTGSDL W  C PC   +CY Q  P F P  SSTY+
Sbjct: 72  IPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC---KCYPQTIPFFHPSRSSTYR 128

Query: 145 SLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           +  C S+  A   + +   +G NCQY + Y D S + G LA E +T  ++    ++   I
Sbjct: 129 NASCVSAPHAMPQIFRDEKTG-NCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNI 187

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
            FGCG +N G   +K +G++GLG G  S++++       KFSYC    S T   +  N +
Sbjct: 188 VFGCGQDNSGF--TKYSGVLGLGPGTFSIVTR---NFGSKFSYCF--GSLTNPTYPHNIL 240

Query: 263 VSGPGVV----STPLTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTL 309
           + G G       TPL   +  Y L + AIS G + L +         S    VID+G + 
Sbjct: 241 ILGNGAKIEGDPTPLQIFQDRYYLDLQAISFGEKLLDIEPGTFQRYRSQGGTVIDTGCSP 300

Query: 310 TFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GAD 363
           T L +     L   +  ++    + V D       CY  N    L   P VT HF  GA+
Sbjct: 301 TILAREAYETLSEEIDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAE 360

Query: 364 VKLSRSNFFV-KVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + L   + FV   S D  C      T + + + G + Q N+ VGY++    V F+ TDC
Sbjct: 361 LALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 181/368 (49%), Gaps = 39/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + +GTPP     + DTGSDL W QC PC   +C+ Q+ P +DP  SS+Y+++ C
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YECFEQNGPHYDPGQSSSYRNIGC 236

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
             S+C  ++     + C   N  C Y   YGD S + G+ A ET    +T+ S   +   
Sbjct: 237 HDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRR 296

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+     ++GLG G +S  SQ+++     FSYCLV  +     S+
Sbjct: 297 VENVMFGCGHWNRGLFHGAAG-LLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDANVSS 355

Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
           K+ FG +  ++S P +  T L   K     TFY + I +I VG + + +           
Sbjct: 356 KLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDG 415

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEV 355
               +IDSGTTL++  +     +     + ++  PV      LE CY+   + Q  +P+ 
Sbjct: 416 SGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEPCYNVTGVEQPDLPDF 475

Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQT 412
            I F  GA       N+F+++   ++VC    G   +++ I GN  Q NF + YD ++  
Sbjct: 476 GIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSR 535

Query: 413 VSFKPTDC 420
           + F PT C
Sbjct: 536 LGFAPTKC 543


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  152 bits (385), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 186/382 (48%), Gaps = 37/382 (9%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R +N  +  N  ++  +S ASQ         Y  RI +G P      V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212

Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           +PC     CY Q  P+FDPK SS+Y  L C S QC  L++ +C   +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G LATET +   +     ++P +  GCG +N GLF     G++GLGGG ISL SQ+  T
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF-VGADGLIGLGGGAISLSSQLEAT 327

Query: 239 IAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQ 292
               FSYCLV +   SS+ ++F  +        +++PL K     TF  + +  +SVG +
Sbjct: 328 ---SFSYCLVDLDSESSSTLDFNADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGGK 381

Query: 293 RLGVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
            L +S+            I++DSGTT+T +P      L      + +  P A      + 
Sbjct: 382 PLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDT 441

Query: 343 CYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIM 398
           CY  +S S  +VP +     G + ++L   N  ++V S    C  F   T  + I GN+ 
Sbjct: 442 CYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQ 501

Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
           Q    V YD+    V F    C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 116/364 (31%), Positives = 174/364 (47%), Gaps = 38/364 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++ 
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 176

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  G
Sbjct: 177 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 232

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---------PVSSTKIN 256
           CG +N GLF + +  ++GLG G +S  SQ+  +    FSYCLV            S+ + 
Sbjct: 233 CGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291

Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
           FG   + +  G   TP+    +  TFY + +   SVG  R+ GVS  D           +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 351

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
           ++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++H
Sbjct: 352 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411

Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
              GA V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F 
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471

Query: 417 PTDC 420
           P  C
Sbjct: 472 PKSC 475


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 91/260 (35%), Positives = 145/260 (55%), Gaps = 26/260 (10%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVPVSST---KINFGTNGIVS--GPGVVSTPLTK 275
           +VGLG G +SL+SQ+      +F+YCL P +S    K+  G +   +      ++ P+ +
Sbjct: 216 VVGLGRGPLSLVSQLSVR---RFAYCLPPPASRIPGKLVLGADADAARNATNRIAVPMRR 272

Query: 276 A---KTFYVLTIDAISVGNQ 292
                ++Y L +D + +G++
Sbjct: 273 DPRYPSYYYLNLDGLLIGDR 292


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 128/421 (30%), Positives = 194/421 (46%), Gaps = 49/421 (11%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  RL +       S+  +    A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               L+ +SIG P   +L V DTGSD++W  C PC  + C      LFDP MSST+  L 
Sbjct: 98  GRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPC--TNCDNHLGLLFDPSMSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C       C  +   +++SY D S ++G    + +   +T      +  +  
Sbjct: 156 KTPCGFKGC------KCDPI--PFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVII 207

Query: 205 GCGTNNGGLFNSK--TTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
           GCG N G  FNS     GI+GL  G  SL +Q    I  KFSYC+  ++    N+    +
Sbjct: 208 GCGHNIG--FNSDPGYNGILGLNNGPNSLATQ----IGRKFSYCIGNLADPYYNYNQLRL 261

Query: 263 VSGPGV--VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLT 310
             G  +   STP      FY +T++ ISVG +RL ++          T  +++DSGTT+T
Sbjct: 262 GEGADLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTIT 321

Query: 311 FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADV 364
           +L    +  L + + ++++   + V       +LCY       L   P VT HF  GAD+
Sbjct: 322 YLVDSAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADL 381

Query: 365 KLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            L   +FF +  +DI C     +     T S  + G + Q ++ VGYD+  Q V F+  D
Sbjct: 382 ALDTGSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRID 440

Query: 420 C 420
           C
Sbjct: 441 C 441


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 176/364 (48%), Gaps = 38/364 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++ 
Sbjct: 125 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 182

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  G
Sbjct: 183 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 238

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVS--STKIN 256
           CG +N GLF + +  ++GLG G +S  SQ+  +    FSYCLV       P S  S+ + 
Sbjct: 239 CGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 297

Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
           FG   + +  G   TP+    +  TFY + +   SVG  R+ GVS  D           +
Sbjct: 298 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 357

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
           ++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++H
Sbjct: 358 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 417

Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
              GA V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F 
Sbjct: 418 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 477

Query: 417 PTDC 420
           P  C
Sbjct: 478 PKSC 481


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  151 bits (382), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 136/448 (30%), Positives = 212/448 (47%), Gaps = 54/448 (12%)

Query: 1   MATFL-SCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
           MA F  S +F L  LCF +     + +    + L+H        Y+        +++A  
Sbjct: 1   MAIFFTSPLFFLIILCFSISVVHLSASPTLVLNLVH----SYHIYSRKPPHVYHIKEA-- 54

Query: 60  RSLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
            S+ RL +    ++  I +  +    IIP    +L+ ISIG+PP  +L   DT SDL+W 
Sbjct: 55  -SVERLEYLKAKTTGDIIAHLSPNVPIIPQA--FLVNISIGSPPITQLLHMDTASDLLWI 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGS 176
           QC PC    CY Q  P+FDP  S T+++  C +SQ +  + K + +  +C+YS+ Y D +
Sbjct: 112 QCLPC--INCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDT 169

Query: 177 FSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            S G LA E +   +   +  + AL  + FGCG +N G      TGI+GLG G+ SL+ +
Sbjct: 170 GSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHR 228

Query: 235 MRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAISV 289
                  KFSYC   +     ++  N +V G         +TPL     FY +TI+AISV
Sbjct: 229 F----GKKFSYCFGSLDDP--SYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISV 282

Query: 290 G-----------NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
                       N+         +ID+G +LT L +     L + +  + E +  A    
Sbjct: 283 DGIILPIDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVS 342

Query: 339 SLEL----CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGI 387
             ++    CY+ N       S  P VT HF  GA++ L   + F+K+S ++ C +V  G 
Sbjct: 343 QDDMIKMECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGN 402

Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            NS+   G   Q ++ +GYD+E   VSF
Sbjct: 403 LNSI---GATAQQSYNIGYDLEAMEVSF 427


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 114/368 (30%), Positives = 183/368 (49%), Gaps = 39/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C+ Q+ P +DPK SS+++++ C
Sbjct: 88  GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--HDCFEQNGPYYDPKESSSFRNIGC 145

Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQA--VA 198
              +C  ++       C   N  C Y   YGD S + G+ ATE  TV L S TG++    
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+  +  +    G  +S  SQ+++     FSYCLV  +     S+
Sbjct: 206 VENVMFGCGHWNRGLFHGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 264

Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI---- 301
           K+ FG +  +++ P +  T L   K     TFY + I +I VG + L +  ST ++    
Sbjct: 265 KLIFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDG 324

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 355
               ++DSGTTL++  +     +       ++  P+      L+ CY+ + + ++  P+ 
Sbjct: 325 VGGTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDPCYNVSGVEKIDLPDF 384

Query: 356 TIHF-RGADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
            I F  GA       N+F+++  E++VC    G   S + I GN  Q NF V YD ++  
Sbjct: 385 GILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKSR 444

Query: 413 VSFKPTDC 420
           + + P +C
Sbjct: 445 LGYAPMNC 452


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  151 bits (382), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/375 (31%), Positives = 185/375 (49%), Gaps = 50/375 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
           ++  + + + IGTPP  R  + DTGSDLIWTQC+    +    +    P++DP  SST+ 
Sbjct: 87  SDQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFA 146

Query: 145 SLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            LPCS   C     + K+C+  N C Y   YG  + + G LA+ET T G+   +AV+L  
Sbjct: 147 FLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAVSLR- 202

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FG 258
           + FGCG  + G      TGI+GL    +SLI+Q++     +FSYCL P +  K +   FG
Sbjct: 203 LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFG 258

Query: 259 ---------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST------PD--- 300
                    T   +    +VS P+     +Y + +  IS+G++RL V        PD   
Sbjct: 259 AMADLSRHKTTRPIQTTAIVSNPVK--TVYYYVPLVGISLGHKRLAVPAASLAMRPDGGG 316

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS-------- 350
             ++DSG+T+ +L +     +   +  ++   PVA+ T    ELC+     +        
Sbjct: 317 GTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAV 375

Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
           QVP + +HF  GA + L R N+F +    ++C      T+   V I GN+ Q N  V +D
Sbjct: 376 QVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFD 435

Query: 408 IEQQTVSFKPTDCTK 422
           ++    SF PT C +
Sbjct: 436 VQHHKFSFAPTQCDQ 450


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/388 (30%), Positives = 187/388 (48%), Gaps = 58/388 (14%)

Query: 79  ASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-----PSQCYMQDS 132
           A+   + P ++  + + + IGTPP  R  + DTGSDLIWTQC          +    Q  
Sbjct: 71  AADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQRE 130

Query: 133 PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
           PL++P+ SS++  LPCS   C     + K+C+  N C Y   YG    + G LA+ET T 
Sbjct: 131 PLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAE-AGGVLASETFTF 189

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
           G      V+LP + FGCG  + G      +G++GL  G +SL+SQ+      +FSYCL P
Sbjct: 190 G--VNAKVSLP-LGFGCGALSAGDLVG-ASGLMGLSPGIMSLVSQLSVP---RFSYCLTP 242

Query: 250 VSSTKI------------NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQR---- 293
            +  K              + T G V    ++  P  +   +YV  +  +S+G +R    
Sbjct: 243 FAERKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLV-GLSLGTKRLDVP 301

Query: 294 ---LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGS----L 340
              LG+  PD     ++DSG+T+++L +   +   +V  +++EA   PVA+ T       
Sbjct: 302 ATSLGMIKPDGSGGTIVDSGSTMSYLEE---TAFRAVKKAVVEAVRLPVANGTDEDYDDY 358

Query: 341 ELCYSFNS-----LSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVP 392
           ELC++  +       + P + +HF  GA + L R N+F +    ++C       +   V 
Sbjct: 359 ELCFALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVS 418

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           I GN+ Q N  V +D+  Q  SF PT C
Sbjct: 419 IIGNVQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 184/378 (48%), Gaps = 52/378 (13%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
           +  Y + + +GTP  +   + DTGSDL W QC P         PP       +P +D   
Sbjct: 56  SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 108

Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
           SS+Y+ +PC+  +C  L      SCS  +   C Y+  Y D S + G LA ET+++    
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168

Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGK 242
                 G+   + + +  +  GC   + G      +G++GLG G ISL +Q R T + G 
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGI 228

Query: 243 FSYCLVPV--SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GV 296
           FSYCLV     S   +F   G      +  TP+ +   A++FY + +  ++V  + + G+
Sbjct: 229 FSYCLVDYLRGSNASSFLVMGRTHWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288

Query: 297 STPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
           ++ D           + DSGTTL++L +   S +L  +++ I      +     ELCY+ 
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 348

Query: 347 NSLSQ-VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNF 402
             + + +P++ + F+G  V +L  +N+ V V+E++ C   + +  TN   I GN++Q + 
Sbjct: 349 TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDH 408

Query: 403 LVGYDIEQQTVSFKPTDC 420
            + YD+ +  + FK + C
Sbjct: 409 HIEYDLAKARIGFKWSPC 426


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 124/347 (35%), Positives = 173/347 (49%), Gaps = 32/347 (9%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           + +G P      V DTGSD+ W QC PC   + CY Q +P+FDP++SS+Y  + C S QC
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60

Query: 154 ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL 213
             L++  C+  +C Y V YGDGSF+ G LATET+T   +     ++P I+ GCG +N GL
Sbjct: 61  QLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS--T 271
           F     G++GLGGG IS+ SQ++   A  FSYCLV + S   +F T    + P   S  +
Sbjct: 117 F-VGADGLIGLGGGAISISSQLK---ASSFSYCLVDIDSP--SFSTLDFNTDPPSDSLIS 170

Query: 272 PLTKAKTF----YVLTIDAISVGNQRLGVSTPD----------IVIDSGTTLTFLPQGYN 317
           PL K   F    YV  I  +SVG + L +S+            I++DSGTT+T LP    
Sbjct: 171 PLVKNDRFPSFRYVKVI-GMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLPSDVY 229

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 374
             L      +    P A      + CY  +S S  +VP +     G + ++L   N  ++
Sbjct: 230 EVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLIQ 289

Query: 375 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           V S    C  F   T  + I GN  Q    V YD+    V F    C
Sbjct: 290 VDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 115/364 (31%), Positives = 174/364 (47%), Gaps = 38/364 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++ 
Sbjct: 119 SGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAVD 176

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  G
Sbjct: 177 CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAIG 232

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---------PVSSTKIN 256
           CG +N GLF + +  ++GLG G +S  +Q+  +    FSYCLV            S+ + 
Sbjct: 233 CGHDNEGLFIAASG-LLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVT 291

Query: 257 FGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL-GVSTPD-----------I 301
           FG   + +  G   TP+    +  TFY + +   SVG  R+ GVS  D           +
Sbjct: 292 FGAGAVAAAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGV 351

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 358
           ++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++H
Sbjct: 352 ILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMH 411

Query: 359 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
              GA V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F 
Sbjct: 412 LAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFV 471

Query: 417 PTDC 420
           P  C
Sbjct: 472 PKSC 475


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  151 bits (381), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 130/382 (34%), Positives = 185/382 (48%), Gaps = 37/382 (9%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R +N  +  N  ++  +S ASQ         Y  RI +G P      V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212

Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           +PC     CY Q  P+FDPK SS+Y  L C S QC  L++ +C   +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G LATET +   +     ++P +  GCG +N GLF     G++GLGGG ISL SQ+  T
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF-VGAAGLIGLGGGAISLSSQLEAT 327

Query: 239 IAGKFSYCLVPV---SSTKINFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQ 292
               FSYCLV +   SS+ ++F  +        +++PL K     TF  + +  +SVG +
Sbjct: 328 ---SFSYCLVDLDSESSSTLDFNADQPSDS---LTSPLVKNDRFPTFRYVKVIGMSVGGK 381

Query: 293 RLGVSTPD----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
            L +S+            I++DSGTT+T +P      L      + +  P A      + 
Sbjct: 382 PLPISSSSFEIDESGSGGIIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDT 441

Query: 343 CYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIM 398
           CY  +S S  +VP +     G + ++L   N   +V S    C  F   T  + I GN+ 
Sbjct: 442 CYDLSSQSNVEVPTIAFILPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQ 501

Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
           Q    V YD+    V F    C
Sbjct: 502 QQGIRVSYDLANSLVGFSTDKC 523


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 120/370 (32%), Positives = 171/370 (46%), Gaps = 56/370 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   V DT S+L W QC+PC    C+ Q  PLFDP  S +Y ++PC+
Sbjct: 119 NYVATVGLGA--AEATVVVDTASELTWVQCQPC--ESCHDQQDPLFDPSSSPSYAAVPCN 174

Query: 150 SSQCASLNQKSCSGVN-----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C +L     +G +           C Y++SY DGS+S G LA + + L    GQ + 
Sbjct: 175 SSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL---AGQDIE 231

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTK 254
             G  FGCGT+N G     T+G++GLG   +SL+SQ      G FSYCL P+    SS  
Sbjct: 232 --GFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYCL-PMRESGSSGS 288

Query: 255 INFGTN-------------GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL---GVST 298
           +  G +              +VS  G +  P      FY L +  I+VG Q +     S 
Sbjct: 289 LVLGDDSSAYRNSTPIVYTAMVSDSGPLQGP------FYFLNLTGITVGGQEVESPWFSA 342

Query: 299 PDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
             ++IDSGT +T L P  YN+     +S + E  P A     L+ C++   L   QVP +
Sbjct: 343 GRVIIDSGTIITTLVPSVYNAVRAEFLSQLAE-YPQAPAFSILDTCFNLTGLKEVQVPSL 401

Query: 356 TIHFRGA-DVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQ 410
              F G+ +V++        VS D   VC     + +     I GN  Q N  V +D   
Sbjct: 402 KFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFDTLG 461

Query: 411 QTVSFKPTDC 420
             + F    C
Sbjct: 462 SQIGFAQETC 471


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 140/459 (30%), Positives = 199/459 (43%), Gaps = 65/459 (14%)

Query: 17  YVVSPIEAQTGGFSVELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFN----- 69
           + VSP  + +GG    L H  SP SP      S  P + L   L    +R  H       
Sbjct: 58  HRVSP--SSSGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSG 115

Query: 70  -------------QNSSISSSKASQADIIPNNANYLIRISI-----------GTPPTERL 105
                        Q++ ++SS A+  ++  ++ +      I             P   + 
Sbjct: 116 NAAPMDDAGEETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQS 175

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSG 163
            V DT SD+ W QC PCP  QCY Q   L+DP  S      PCSS QC SL + +  C+G
Sbjct: 176 MVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTG 235

Query: 164 VN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSK 217
                 CQY V Y DGS ++G   ++ +TL +    AV+     FGC       G FN+K
Sbjct: 236 AGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNNK 293

Query: 218 TTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTK--INFGTNGIVSGPGVVSTPL 273
           T G + LG G  SL SQ + T +    FSYCL P  S K  ++ G     +    V TP+
Sbjct: 294 TAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAV-TPM 352

Query: 274 TKAK---TFYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSS 326
            K+K     Y++ +  I V  QRL     V   +  +DS T +T LP      L +   +
Sbjct: 353 LKSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRA 412

Query: 327 MIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSV 383
            + A     P G L+ CY F  +  V  P+VT+ F R A V+L  S   +       C  
Sbjct: 413 QMRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD-----SCLA 467

Query: 384 FKGITNS-VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F    N  +P I GN+ Q    V Y+++  +V F+   C
Sbjct: 468 FAPNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 110/368 (29%), Positives = 173/368 (47%), Gaps = 38/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ C
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITC 247

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++     K C   N  C Y   YGD S + G+ A ET T+  TT     +   
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV  +     S+
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFASQLQSIYGHSFSYCLVDRNSDTSVSS 366

Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI------ 301
           K+ FG +  ++S P +  T     +     TFY + I +I V  + L +           
Sbjct: 367 KLIFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEG 426

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
               +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+ 
Sbjct: 427 GGGTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDF 486

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 413
            I F  GA       N+F+++  D+VC    G   S + I GN  Q NF + YD+++  +
Sbjct: 487 GILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRL 546

Query: 414 SFKPTDCT 421
            + P  CT
Sbjct: 547 GYAPMKCT 554


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 123/355 (34%), Positives = 178/355 (50%), Gaps = 29/355 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
           +   +++ + +GTP      + DTGSDL W QC+PC  S  C+ Q  PLFDP  SSTY +
Sbjct: 140 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 199

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + C   QCA+     CS  N  C Y V YGDGS + G L+ +T+ L S+     AL G  
Sbjct: 200 VHCGEPQCAAAGDL-CSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSR----ALTGFP 254

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           FGCGT N G F  +  G++GLG G++SL SQ   +    FSYCL P S++   + T G  
Sbjct: 255 FGCGTRNLGDFG-RVDGLLGLGRGELSLPSQAAASFGAVFSYCL-PSSNSTTGYLTIGAT 312

Query: 264 ----SGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTF 311
               +G    +  L K +  +FY + + +I +G   L V  P +      ++DSGT LT+
Sbjct: 313 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVP-PAVFTRGGTLLDSGTVLTY 371

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
           LP    + L       +E    A P   L+ CY F   S+V    + FR  D  +   +F
Sbjct: 372 LPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRFGDGAVFELDF 431

Query: 372 F---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F   + + E++ C  F  + T  +P  I GN  Q +  V YD+  + + F P  C
Sbjct: 432 FGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 486


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 121/380 (31%), Positives = 178/380 (46%), Gaps = 64/380 (16%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           SS + QA +      Y + IS+GTP      VADTGSDLIWTQC PC  ++C+ Q +P F
Sbjct: 71  SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128

Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            P  SST+  LPC+SS C  L  + ++C+   C Y+  YG G ++ G LATET+ +G   
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV---PV 250
               + P + FGC T N            GLG  D+ +         G+FSYCL      
Sbjct: 186 ---ASFPSVAFGCSTEN------------GLGQLDLGV---------GRFSYCLRSGSAA 221

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI----- 301
            ++ I FG+   ++   V STP         ++Y + +  I+VG   L V+T        
Sbjct: 222 GASPILFGSLANLTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQN 281

Query: 302 ------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---- 350
                 ++DSGTTLT+L + GY     + +S   +   V + T  L+LC+          
Sbjct: 282 GLGGGTIVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGI 340

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNF 402
            VP + + F G   + +   +F  V  D   SV       +P        + GN+MQ + 
Sbjct: 341 AVPSLVLRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDM 399

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            + YD++    SF P DC K
Sbjct: 400 HLLYDLDGGIFSFAPADCAK 419


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 136/450 (30%), Positives = 196/450 (43%), Gaps = 89/450 (19%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
           G  +EL H D+ +   Y   E    R+R A  R+  RL       + I     SQ     
Sbjct: 22  GIRLELTHVDAKE--HYTVEE----RVRRATERTHRRLASMGGVTAPIHWGGQSQ----- 70

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y+    IG PP    A+ DTGS+LIWTQC  C P+ C+ Q+ P +DP  S   +++
Sbjct: 71  ----YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPT-CFRQNLPYYDPSRSRAARAV 125

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C+ + CA  ++  C   N  C     YG G+ + G LATE +T  S T   V      F
Sbjct: 126 GCNDAACALGSETQCLSDNKTCAVVTGYGAGNIA-GTLATENLTFQSETVSLV------F 178

Query: 205 GC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP-----VS 251
           GC        G+ NG       +GI+GLG G +SL SQ+  T   +FSYCL P     + 
Sbjct: 179 GCIVVTKLSPGSLNGA------SGIIGLGRGKLSLPSQLGDT---RFSYCLTPYFEDTIE 229

Query: 252 STKINFGT-----NGIVSGPGVVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPD 300
            + +  G      NG  S   V + P  ++       TFY L +  I+ G  +L V +  
Sbjct: 230 PSHMVVGASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAA 289

Query: 301 I-------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCY 344
                          IDSG  LT L       L + ++  + A   QP+A  TG  +LC 
Sbjct: 290 FDLRQVAPGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTG-FDLCV 348

Query: 345 SFNSLSQ-VPEVTIHF-----RGADVKLSRSNFFVKVSEDIVCS-VFKGI------TNSV 391
           +     + VP + +HF      G D+ +  +N++  V     C  VF  +       N  
Sbjct: 349 ALKDAERLVPPLVLHFGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNET 408

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + GN MQ N  V YD+    +SF+P DC+
Sbjct: 409 TVIGNYMQQNMHVLYDLAGGVLSFQPADCS 438


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 184/378 (48%), Gaps = 52/378 (13%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
           +  Y + + +GTP  +   + DTGSDL W QC P         PP       +P +D   
Sbjct: 24  SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 76

Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
           SS+Y+ +PC+  +C  L      SCS  +   C Y+  Y D S + G LA ET+++    
Sbjct: 77  SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136

Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-TTIAGK 242
                 G+   + + +  +  GC   + G      +G++GLG G ISL +Q R T + G 
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALGGI 196

Query: 243 FSYCLVPV--SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRL-GV 296
           FSYCLV     S   +F   G      +  TP+ +   A++FY + +  ++V  + + G+
Sbjct: 197 FSYCLVDYLRGSNASSFLVMGRTRWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256

Query: 297 STPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 346
           ++ D           + DSGTTL++L +   S +L  +++ I      +     ELCY+ 
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNV 316

Query: 347 NSLSQ-VPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNF 402
             + + +P++ + F+G  V +L  +N+ V V+E++ C   + +  TN   I GN++Q + 
Sbjct: 317 TRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDH 376

Query: 403 LVGYDIEQQTVSFKPTDC 420
            + YD+ +  + FK + C
Sbjct: 377 HIEYDLAKARIGFKWSPC 394


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 176/368 (47%), Gaps = 38/368 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISC 250

Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++       C   N  C Y   YGDGS + G+ A ET T+  TT     +   
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+     +    G  +S  SQM++     FSYCLV  +     S+
Sbjct: 311 VENVMFGCGHWNRGLFHGAAGLLGLGKGP-LSFASQMQSLYGQSFSYCLVDRNSNASVSS 369

Query: 254 KINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTP-------- 299
           K+ FG +  ++S P +  T     K     TFY + I+++ V ++ L +           
Sbjct: 370 KLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEETWHLSSEG 429

Query: 300 --DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 355
               +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+ 
Sbjct: 430 AGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDF 489

Query: 356 TIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            I F  GA       N+F+++  D+VC ++     +++ I GN  Q NF + YD+++  +
Sbjct: 490 GILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRL 549

Query: 414 SFKPTDCT 421
            + P  C 
Sbjct: 550 GYAPMKCA 557


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 104/355 (29%), Positives = 177/355 (49%), Gaps = 34/355 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C   +C+ Q +PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
             +        +GIVGLG    SL++Q   T    FSYCL P  + K   +  G++  ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGKNSALFLGSSAKLA 217

Query: 265 GPG-VVSTPLT-------KAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFLPQ 314
           G G   STP             +Y + ++ +  G+  + +  S   +++D+ + ++FL  
Sbjct: 218 GGGKAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVD 277

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFF 372
           G    +   ++  + A P+A P    +LC+  +  S   P++   FR GA + +  +N+ 
Sbjct: 278 GAYQAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYL 337

Query: 373 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +      VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 338 LDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/369 (29%), Positives = 175/369 (47%), Gaps = 40/369 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y I + +GTPP     + DTGSDL W QC+PC    C+ Q+ P ++P  SS+Y+++ C  
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227

Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
            +C  ++     + C   N  C Y   Y DGS + G+ A ET T+  T      +   + 
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVV 287

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKI 255
            + FGCG  N G F+     +    G  +S  SQ+++     FSYCL  +      S+K+
Sbjct: 288 DVMFGCGHWNKGFFHGAGGLLGLGRGP-LSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKL 346

Query: 256 NFGTNG-IVSGPGVVSTPL-----TKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
            FG +  +++   +  T L     T   TFY L I +I VG + L +             
Sbjct: 347 IFGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVG 406

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
             +IDSG+TLTF P      +       I+ Q +A     +  CY+ +   QV  P+  I
Sbjct: 407 GTIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGI 466

Query: 358 HF-RGADVKLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 413
           HF  GA       N+F +   D ++C ++ K   +S + I GN++Q NF + YD+++  +
Sbjct: 467 HFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRL 526

Query: 414 SFKPTDCTK 422
            + P  C +
Sbjct: 527 GYSPRRCAE 535


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 121/425 (28%), Positives = 192/425 (45%), Gaps = 56/425 (13%)

Query: 40  KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
           KSPF + ++      R     SL R       S + S  AS       +  Y + + IG 
Sbjct: 39  KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92

Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           PP   L +ADTGSDL+W +C  C    C +   + +F P+ SST+    C    C  + +
Sbjct: 93  PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150

Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
              + +         C Y   Y DGS ++G  A ET +L +++G+   L  + FGCG   
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRI 210

Query: 211 GGLFNSKTT-----GIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKINFG 258
            G   S T+     G++GLG G IS  SQ+      KFSYCL+       P S   I  G
Sbjct: 211 SGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLSPPPTSYLIIGNG 270

Query: 259 TNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI-----------VID 304
            +GI     +  TPL     + TFY + + ++ V   +L +  P I           V+D
Sbjct: 271 GDGISK---LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRID-PSIWEIDDSGNGGTVVD 326

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVTIHF 359
           SGTTL FL +    ++++ +   ++  P+AD  T   +LC + + +++    +P +   F
Sbjct: 327 SGTTLAFLAEPAYRSVIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLKFEF 385

Query: 360 RGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVSFK 416
            G  V +    N+F++  E I C   + +   V   + GN+MQ  FL  +D ++  + F 
Sbjct: 386 SGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFS 445

Query: 417 PTDCT 421
              C 
Sbjct: 446 RRGCA 450


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 118/207 (57%), Gaps = 10/207 (4%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            IL  + F   + I    G F+  L HRDS  SP   SS + Y RL +A  RSL+R    
Sbjct: 11  LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
              ++ + +   QA + P +  YL+ +SIGTPP + + +ADTGSDL+W QC PC   +CY
Sbjct: 70  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL--KCY 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
            Q  P+FDP  S+++  +PC+S  C +++   C     C YS +YGD +++ G+L  E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T+GS++ ++V       GCG  +GG F
Sbjct: 188 TIGSSSVKSV------IGCGHESGGGF 208


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 125/436 (28%), Positives = 200/436 (45%), Gaps = 54/436 (12%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A      +E +HR + +S    +  +P    R AL+  +                  ++ 
Sbjct: 98  ADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERM--------------VATVESG 143

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  SS+Y
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSY 201

Query: 144 KSLPCSSSQCASLN----QKSCSGV---NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQ 195
           +++ C   +C  +      ++C      +C Y   YGD S + G+LA E+ T+  T  G 
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS--- 252
           +  +  + FGCG  N GLF+     ++GLG G +S  SQ+R      FSYCLV   S   
Sbjct: 262 SRRVDDVVFGCGHWNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGHTFSYCLVDHGSDVA 320

Query: 253 TKINFG----TNGIVSGPGVVSTPL----TKAKTFYVLTIDAISVGNQRLGVSTP----- 299
           +K+ FG         + P +  T      + A TFY + +  + VG + L +S+      
Sbjct: 321 SKVVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVG 380

Query: 300 -------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 350
                    +IDSGTTL+ F+   Y     + +  M  + P+      L  CY+ + +  
Sbjct: 381 EGEGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDR 440

Query: 351 -QVPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGY 406
            +VPE+++ F  GA       N+F+++  D I+C    G   + + I GN  Q NF V Y
Sbjct: 441 PEVPELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVVY 500

Query: 407 DIEQQTVSFKPTDCTK 422
           D++   + F P  C +
Sbjct: 501 DLKNNRLGFAPRRCAE 516


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 127/419 (30%), Positives = 187/419 (44%), Gaps = 68/419 (16%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 87  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 127

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTPP +   + DTGS + WTQC+ C    C       FD   SSTY    C  S
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGSCIPS 185

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 186 T-----------VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNE 230

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
           G F S   G++GLG G +S +SQ  +     FSYCL   +S   + FG            
Sbjct: 231 GDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKF 290

Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
             +V+GPG  ++ L ++  ++V  +D ISVGN+RL +     ++P  +IDSGT +T LPQ
Sbjct: 291 TSLVNGPG--TSGLEESGYYFVKLLD-ISVGNKRLNIPSSVFASPGTIIDSGTVITRLPQ 347

Query: 315 GYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLS 367
              S L +     +   P+++        L+ CY+ +    V  PE  +HF  GADV+L+
Sbjct: 348 RAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLN 407

Query: 368 RSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                       +C  F G + S     + I GN  Q +  V YDI  + + F    C+
Sbjct: 408 GKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  149 bits (376), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 131/422 (31%), Positives = 195/422 (46%), Gaps = 37/422 (8%)

Query: 30  SVELIHRDSPKSPFYNS-SETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           S+ ++HR  P SP  +  S  P     + L R  +R++   +  + SS+K      +  N
Sbjct: 72  SLTVVHRHGPCSPLRSRGSGAPSHT--EILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN 129

Query: 89  -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                   NY+  + +GTP TE +   DTGSD  W QC+PC  + CY Q  P+FDP  SS
Sbjct: 130 WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASS 187

Query: 142 TYKSLPCSSSQCASLNQKSCSGV-------NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           TY ++PC + +C  L   S S         NC Y VSY D S + G+LA +T+TL  +  
Sbjct: 188 TYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPS 247

Query: 195 QAVA--LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPV 250
            + A  +PG  FGCG +N G F  +  G++GLG G  SL SQ+       FSYCL   P 
Sbjct: 248 PSPADTVPGFVFGCGHSNAGTFG-EVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPS 306

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVID 304
           ++  ++FG     +          +  T Y L +  I V  + + V      +    +ID
Sbjct: 307 AAGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIID 366

Query: 305 SGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR 360
           SGT  + L P  Y +   S  S+M   +    P+  + + CY F  +   ++P V + F 
Sbjct: 367 SGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFA 426

Query: 361 -GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            GA V L  S      + D+  +    + N  + I GN  Q    V YD+  Q + F   
Sbjct: 427 DGATVHLHPSGVLYTWN-DVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRK 485

Query: 419 DC 420
            C
Sbjct: 486 GC 487


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 121/354 (34%), Positives = 175/354 (49%), Gaps = 27/354 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
           +   +++ + +GTP      + DTGSDL W QC+PC  S  C+ Q  PLFDP  SSTY +
Sbjct: 145 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 204

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + C   QCA+     CS  N  C Y V YGDGS + G L+ +T+ L S+     AL G  
Sbjct: 205 VHCGEPQCAAAGGL-CSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSR----ALAGFP 259

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           FGCGT N G F  +  G++GLG G++SL SQ   +    FSYCL P S++   + T G  
Sbjct: 260 FGCGTRNLGDFG-RVDGLLGLGRGELSLPSQAAASFGAVFSYCL-PSSNSTTGYLTIGAT 317

Query: 264 ----SGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
               +G    +  L K +  +FY + + +I +G   L V     +    ++DSGT LT+L
Sbjct: 318 PATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFTRGGTLLDSGTVLTYL 377

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 372
           P      L       +E    A P   L+ CY F   S+V    + FR  D  +   +FF
Sbjct: 378 PAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRFGDGAVFELDFF 437

Query: 373 ---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              + + E++ C  F  +    +P  I GN  Q +  V YD+  + + F P  C
Sbjct: 438 GVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFVPASC 491


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  149 bits (375), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 127/459 (27%), Positives = 201/459 (43%), Gaps = 50/459 (10%)

Query: 5   LSCVF----ILFFLCFYVVSPIEAQTGGFSVELIHRDSPK--SPFYNSSETPYQRLRDAL 58
           + C F    +LF    Y V     +    +++LIHR+S    +P      TP   ++   
Sbjct: 1   MECSFQTSLLLFITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLT 60

Query: 59  TRSLNRLNHFNQNSSISSSKAS--QADIIP--NNANYLIRISIGTPPTERLAVADTGSDL 114
             S  R  +  QNS      +S  Q D+      + +L+  S+G PP  +L + DTGS L
Sbjct: 61  DISSARFKYL-QNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSL 119

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
           +W QC+PC          P+F+P +SST+    C    C       C   N C Y   Y 
Sbjct: 120 LWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYI 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
            G+ S G LA E +T  +  G  V    I FGCG  NG    S  TGI+GLG    SL  
Sbjct: 180 SGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAV 239

Query: 234 QMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP--GVVSTP----LTKAKTFYVLTIDAI 287
           Q+      KFSYC+  +++   N+G N +V G    ++  P         + Y + ++ I
Sbjct: 240 QL----GSKFSYCIGDLANK--NYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGI 293

Query: 288 SVGNQRLGVS---------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
           SVG+ +L +             +++DSGT  T+L       L + + S+++  P  +   
Sbjct: 294 SVGDTQLNIEPVVFKRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILD--PKLERFW 351

Query: 339 SLE-LCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSE----DIVCSVFK---- 385
             + LCY       L   P VT HF  GA++ +  ++ F  +SE    ++ C   K    
Sbjct: 352 FRDFLCYHGRVSEELIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKE 411

Query: 386 --GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             G        G + Q  + +GYD++++ +  +  DC +
Sbjct: 412 HGGEYKEFTAIGLMAQQYYNIGYDLKEKNIYLQRIDCVQ 450


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 179/365 (49%), Gaps = 42/365 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 77  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKASSTSKN 134

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C  + C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKI 255
            + FGCG N  G      S   GI+G G  + S+ISQ+    ++   FS+CL  ++   I
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGI 254

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
            F   G V  P V +TPL   +  Y + +  + V  +       L  +  D   +IDSGT
Sbjct: 255 -FAI-GEVESPVVKTTPLVPNQVHYNVILKGMDVDGEPIDLPPSLASTNGDGGTIIDSGT 312

Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
           TL +LPQ  YNS +  + +       +   T +   C+SF  N+    P V +HF  + +
Sbjct: 313 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 368

Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           KLS    ++   + ED+ C  ++  G+T      V + G+++ +N LV YD+E + + + 
Sbjct: 369 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 428

Query: 417 PTDCT 421
             +C+
Sbjct: 429 DHNCS 433


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  148 bits (374), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 166/358 (46%), Gaps = 39/358 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             C S+ + S  C+   C Y      G  + G   T+T  +G+      A   + FGC  
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGMAGTDTFAIGA------AKETLGFGCVV 167

Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
                  +    +GIVGLG    SL++QM  T    FSYCL   SS  +  G T   ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224

Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
               STP             +  +Y++ +  I  G   L  ++     +++D+ +  ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYL 284

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
             G    L   +++ +  QPVA P    +LC+S       PE+   F  GA + +  +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 372 FVKVSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +      VC            G      I G++ Q N  V +D++++T+SFKP DC+
Sbjct: 345 LLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 121/444 (27%), Positives = 205/444 (46%), Gaps = 52/444 (11%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           Q  G ++ELIH+DSP+SP Y  +  P +++          L+H  Q S +S++KA    +
Sbjct: 10  QLDGLTMELIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHH--QTSMMSTNKAVMNRM 67

Query: 85  IPNNANY------LIRISIGT--PPTERLAVA------DTGSDLIWTQCEPC--PPSQCY 128
           +    +Y      L ++ +G+    + R          DTG++L W QCE C    + C+
Sbjct: 68  MSPLTSYGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCF 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
               P +    S +YK + C+       NQ  C    C Y+V+YG GS+++GNLA ET T
Sbjct: 128 PHKDPPYTSSQSKSYKPVSCNQHSFCEPNQ--CKEGLCAYNVTYGPGSYTSGNLANETFT 185

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF------NSKTTGIVGLGGGDISLISQMRTTIAGK 242
             S  G+  AL  I+FGC T++  +        +  +G++G+G G  S ++Q+ +   GK
Sbjct: 186 FYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHGK 245

Query: 243 FSYCLVP--VSSTKINFGTNGIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVST 298
           FSYC+      +T + FG + +V    + +T + + K    Y + +  ISV   +L ++ 
Sbjct: 246 FSYCITANNTHNTYLRFGKH-VVKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITK 304

Query: 299 PDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCY 344
            D+          +ID+GT  T L +     L + +S+ + +    +         +LCY
Sbjct: 305 TDLAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHKDLCY 364

Query: 345 ---SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIM 398
              S      +P VT H   AD+++     F+      +++ C       +S  I G   
Sbjct: 365 EQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLS-DDSKTIIGAYQ 423

Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
           Q      YD + + +SF P DC K
Sbjct: 424 QMKQKFVYDTKARVLSFGPEDCEK 447


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  147 bits (372), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 130/413 (31%), Positives = 190/413 (46%), Gaps = 67/413 (16%)

Query: 30  SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
           S+E++H+  P S   P   +S +  Q L    +R  +  +   +N +  S+ KAS+A + 
Sbjct: 18  SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 77

Query: 86  PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             +A      NY++ + +G+P  +   + DTGSDL WTQCEPC    CY Q   +FDP  
Sbjct: 78  SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 136

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S +Y ++ C S  C  L     N   CS   C Y + YGDGS+S G  A E ++L ST  
Sbjct: 137 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 195

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
                    FGCG NN GLF   T G++GL    +SL+SQ        FSYCL P SS+ 
Sbjct: 196 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCL-PSSSSS 250

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQ 314
             + + G  SG G      +KA  F                  TP            LP 
Sbjct: 251 TGYLSFG--SGDGD-----SKAVKF------------------TPR-----------LPP 274

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSN- 370
              S++  V   ++   P       L+ CY  +     +VP++ ++F  GA++ L+    
Sbjct: 275 TVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGAEMDLAPEGI 334

Query: 371 -FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            + +KVS+  VC  F G +  + V I GN+ Q    V YD  +  V F P+ C
Sbjct: 335 IYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPSGC 385


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 118/362 (32%), Positives = 180/362 (49%), Gaps = 37/362 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +GTP  +     DTGSD++W  C     CP     ++ +P +D   SST KS+ 
Sbjct: 85  YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSVS 143

Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
           CS + C+ +NQ+S C SG  CQY + YGDGS +NG L  + V L   TG  Q  +  G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTI 203

Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINF 257
            FGCG+   G      +   GI+G G  + S ISQ+ +   +   F++CL   +   I F
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI-F 262

Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGTTL 309
               +VS P V +TP+      Y + ++AI VGN  L +S+          ++IDSGTTL
Sbjct: 263 AIGEVVS-PKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTL 321

Query: 310 TFLPQG-YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVK 365
            +LP   YN  +  +++S  E     V D   S    +  + L + P VT  F +   + 
Sbjct: 322 VYLPDAVYNPLMNQILASHQELNLHTVQD---SFTCFHYIDRLDRFPTVTFQFDKSVSLA 378

Query: 366 LSRSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           +    +  +V ED  C  ++  G+      S+ I G++  +N LV YDIE Q + +   +
Sbjct: 379 VYPQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHN 438

Query: 420 CT 421
           C+
Sbjct: 439 CS 440


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  147 bits (372), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 107/357 (29%), Positives = 178/357 (49%), Gaps = 32/357 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + A Y++ ++IGTPP    A+ D G +L+WTQC + C   +C+ QD PLFD   SST++ 
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
            PC ++ C S+  +SC+G            SF    G + T+ V +G+      A   + 
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN 260
           FGC   +       ++G VGLG  ++SL +QM  T    FSYCL P  + K   +  G +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGAS 216

Query: 261 GIVSGP--GVVSTPLTKAKT--------FYVLTIDAISVGNQRLGV--STPDIVIDSGTT 308
             ++G   G  +TP  K  T         Y+L ++AI  GN  + +  S   I++ + T 
Sbjct: 217 AKLAGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMVSTATP 276

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKL 366
           +T L      +L   ++  + A PV  P  + +LC+   S S   P++ + F+ GA++ +
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTV 336

Query: 367 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             S++      D  C    G      V I G++ Q N  + +D++++T+SF+P DC+
Sbjct: 337 PVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 111/388 (28%), Positives = 181/388 (46%), Gaps = 60/388 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
            Y +R  +GTP    L VADTGSDL W +C P   +      S           F P+ S
Sbjct: 94  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKS 153

Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----- 190
            T+  +PC+S  C+     SL+     G  C Y   Y DGS + G + TE+ T+      
Sbjct: 154 KTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSS 213

Query: 191 ---STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
                  +   L G+  GC G+  G  F + + G++ LG  ++S  S   +   G+FSYC
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFEA-SDGVLSLGYSNVSFASHAASRFGGRFSYC 272

Query: 247 LV----PVSSTK-INFGTNGIVS-------GPGVVSTPL---TKAKTFYVLTIDAISVGN 291
           LV    P ++T  + FG N  +S       GPG   TPL   ++ + FY ++I AISV  
Sbjct: 273 LVDHLSPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVDG 332

Query: 292 QRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLE 341
           + L +            +++DSGT+LT L +     +++ +   +   P    DP    E
Sbjct: 333 ELLKIPRDVWEVDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAMDP---FE 389

Query: 342 LCYSFNSLSQ------VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPI 393
            CY++ S S+      +P++ +HF G A ++    ++ +  +  + C  V +G    + +
Sbjct: 390 YCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWPGISV 449

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            GNI+Q   L  +D++ + + FK + CT
Sbjct: 450 IGNILQQEHLWEFDLKNRRLRFKRSRCT 477


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 179/366 (48%), Gaps = 39/366 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + + +G+PP     + DTGSDL W QC PC    C+ Q+   +DPK S++YK++ C+ 
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--HDCFQQNGAFYDPKASASYKNITCND 212

Query: 151 SQCASLN----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVALP 200
            +C  ++     K C   N  C Y   YGD S + G+ A ET T+  TT     +   + 
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVE 272

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKI 255
            + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV  +     S+K+
Sbjct: 273 NMMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 331

Query: 256 NFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI------ 301
            FG +  ++S P +  T     K     TFY + I +I V  + L +   T +I      
Sbjct: 332 IFGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAG 391

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVT 356
             +IDSGTTL++  +     + + ++   + + PV      L+ C++ + +   Q+PE+ 
Sbjct: 392 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELG 451

Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVS 414
           I F  GA       N F+ ++ED+VC    G   S   I GN  Q NF + YD ++  + 
Sbjct: 452 IAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSRLG 511

Query: 415 FKPTDC 420
           + PT C
Sbjct: 512 YAPTKC 517


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 165/358 (46%), Gaps = 39/358 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             C S+ + S  C+   C Y      G  + G   T+T  +G+      A   + FGC  
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167

Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
                  +    +GIVGLG    SL++QM  T    FSYCL   SS  +  G T   ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224

Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
               STP             +  +Y++ +  I  G   L  ++     +++D+ +  ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYL 284

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
             G    L   +++ +  QPVA P    +LC+        PE+   F  GA + +  +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 372 FVKVSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +      VC            G      I G++ Q N  V +D++++T+SFKP DC+
Sbjct: 345 LLASGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  147 bits (371), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 124/433 (28%), Positives = 197/433 (45%), Gaps = 56/433 (12%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF----NQNSSI---------SS 76
           + +LIHRDS  SP YN +++   R +  L  S  R ++      +NS++         ++
Sbjct: 36  TTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAA 95

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             A +A ++     +L+  SIG PP  + AV DTGS L W QCEPC    C+ Q  PL++
Sbjct: 96  DDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYN 153

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  SSTY S         +    +  G +C YS +Y D + + G  A E +   +     
Sbjct: 154 PSSSSTYVSCSDFDRTDTTFT--ATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGI 211

Query: 197 VALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS-- 252
             +  + FGCG NN  L       +G+ GLG    S+IS++       FSYC+  +    
Sbjct: 212 TIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKL----GFGFSYCIGNIGDPL 267

Query: 253 ---TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS------------ 297
               ++  G    + G    STPL     +Y+ T+  IS+G +RL +             
Sbjct: 268 YGFHRLTLGNKLKIEG---YSTPLVPRGLYYI-TLVGISIGQERLDIDPIVFQRVDLNGI 323

Query: 298 TPDIVIDSGTTLTFLP-QGYN---SNLLSVMSSMIEAQPVADPTGSLELCY--SFN-SLS 350
           +  IVIDSG TL+++P Q YN     + S++S  +           L LCY    N  L 
Sbjct: 324 SSRIVIDSGATLSYIPRQAYNVVRDKVSSILSGFLSRYRYI--ARHLSLCYIGKLNQDLQ 381

Query: 351 QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYD 407
             P+ T H   GAD+       F + +++++C       +     + G + Q  + V YD
Sbjct: 382 GFPDATFHLADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYD 441

Query: 408 IEQQTVSFKPTDC 420
           ++QQ + F+  +C
Sbjct: 442 LKQQKLYFQRIEC 454


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 107/357 (29%), Positives = 177/357 (49%), Gaps = 32/357 (8%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + A Y++ ++IGTPP    A+ D G +L+WTQC + C   +C+ QD PLFD   SST++ 
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
            PC ++ C S+  +SC+G            SF    G + T+ V +G+      A   + 
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTN 260
           FGC   +       ++G VGLG  ++SL +QM  T    FSYCL P  + K   +  G +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNAT---AFSYCLAPPDTGKSSALFLGAS 216

Query: 261 GIVSGP--GVVSTPLTKAKT--------FYVLTIDAISVGNQRLGV--STPDIVIDSGTT 308
             ++G   G  +TP  K  T         Y+L ++AI  GN  + +  S   I + + T 
Sbjct: 217 AKLAGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITVSTATP 276

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKL 366
           +T L      +L   ++  + A PV  P  + +LC+   S S   P++ + F+ GA++ +
Sbjct: 277 VTALVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTV 336

Query: 367 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             S++      D  C    G      V I G++ Q N  + +D++++T+SF+P DC+
Sbjct: 337 PVSSYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 176/371 (47%), Gaps = 40/371 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + IG+PP     + DTGSDL W QC PC    C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251

Query: 149 SSSQCASLNQ----KSC--SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
           +  +C  ++     + C     +C Y   YGD S + G+ A ET T+    STTG++   
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
            +  + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV        S
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRDSDTSVS 370

Query: 253 TKINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
           +K+ FG +  +++ P +  T L   K     TFY L I +I VG ++L +   +      
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
                +IDSGTTL++        +       ++   + +    L  CY+ +   ++  PE
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPE 490

Query: 355 VTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQ 411
             I F  GA       N+F+++ + DIVC    G   S + I GN  Q NF + YD +  
Sbjct: 491 FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNS 550

Query: 412 TVSFKPTDCTK 422
            + + P  C +
Sbjct: 551 RLGYAPMRCAE 561


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 175/369 (47%), Gaps = 38/369 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y + + +GTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ 
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNIS 251

Query: 148 CSSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAV 197
           C   +C  ++     K C   N  C Y   YGDGS + G+ A ET T+  T    T +  
Sbjct: 252 CHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELK 311

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
            +  + FGCG  N GLF+     +    G  +S  SQM++     FSYCLV  +     S
Sbjct: 312 HVENVMFGCGHWNRGLFHGAAGLLGLGKGP-LSFASQMQSLYGQSFSYCLVDRNSNASVS 370

Query: 253 TKINFGTNG-IVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
           +K+ FG +  ++S P +  T     K     TFY + I ++ V ++ L +          
Sbjct: 371 SKLIFGEDKELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEETWHLSSE 430

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 354
                +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+
Sbjct: 431 GAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPD 490

Query: 355 VTIHFRGADV-KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
             I F    V      N+F+ +  ++VC ++     +++ I GN  Q NF + YD+++  
Sbjct: 491 FGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSR 550

Query: 413 VSFKPTDCT 421
           + + P  C 
Sbjct: 551 LGYAPMKCA 559


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 121/450 (26%), Positives = 197/450 (43%), Gaps = 62/450 (13%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
           +ELIHR SP+       +T  QRL++ +     R L  L H  +   I   KA +     
Sbjct: 3   LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59

Query: 82  -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
                  A  +P +         Y +   +GTP  + + VADTGSDL W  C+  C    
Sbjct: 60  SGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
           C       ++   +F   +SS++K++PC +  C        SL         C Y   Y 
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DGS + G  A ETVT+    G+ + L  +  GC  +  G       G++GLG    S   
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 234 QMRTTIAGKFSYCLVPVSSTK-----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTI 284
           +      GKFSYCLV   S K     + FG+      +++        L    +FY + +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 285 DAISVGNQRLGVSTP--DI------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
             IS+G   L + +   D+      ++DSG++LTFL +  Y   + ++  S+++ + V  
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 336 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 391
             G LE C++     +  VP +  HF  GA+ +    ++ +  ++ + C  F  +     
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + GNIMQ N L  +D+  + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/354 (32%), Positives = 172/354 (48%), Gaps = 44/354 (12%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           +GTPP       + G++LIW    P P  +C+ Q  P F+P   S  + LP +S  C S 
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSP--ECFEQAFPYFEPLTFS--RGLPFAS--CGS- 53

Query: 157 NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
             K      C Y+ SYGD S + G L  +  T     G   ++PG+ FGCG  N G+F S
Sbjct: 54  -PKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFNNGVFKS 109

Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNGIVSGPGVV-S 270
             TGI G G G +SL SQ++    G FS+C   +     S+  ++   +   +G G V +
Sbjct: 110 NETGIAGFGRGPLSLPSQLK---VGNFSHCFTTITGAIPSTVLLDLPADLFSNGQGAVQT 166

Query: 271 TPLTK-AK-----TFYVLTIDAISVGNQRLGV---------STPDIVIDSGTTLTFLPQG 315
           TPL + AK     T Y L++  I+VG+ RL V          T   +IDSGT++T LP  
Sbjct: 167 TPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQ 226

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFV 373
               +    ++ I+   V         C+S  S ++  VP++ +HF GA + L R N+  
Sbjct: 227 VYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVF 286

Query: 374 KVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +V +D    I+C ++ KG  +   I GN  Q N  V YD++   +SF    C K
Sbjct: 287 EVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLSFVAAQCDK 338


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 111/371 (29%), Positives = 176/371 (47%), Gaps = 40/371 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + IG+PP     + DTGSDL W QC PC    C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251

Query: 149 SSSQCASLNQ----KSC--SGVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
           +  +C  ++     + C     +C Y   YGD S + G+ A ET T+    STTG++   
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----S 252
            +  + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV        S
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRDSDTSVS 370

Query: 253 TKINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVSTPDI----- 301
           +K+ FG +  +++ P +  T L   K     TFY L I +I VG ++L +   +      
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430

Query: 302 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PE 354
                +IDSGTTL++        +       ++   + +    L  CY+ +   ++  PE
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPE 490

Query: 355 VTIHF-RGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQ 411
             I F  GA       N+F+++ + DIVC    G   S + I GN  Q NF + YD +  
Sbjct: 491 FLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNS 550

Query: 412 TVSFKPTDCTK 422
            + + P  C +
Sbjct: 551 RLGYAPMRCAE 561


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 39/366 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + + +G+PP     + DTGSDL W QC PC    C+ Q+   +DPK S++YK++ C+ 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227

Query: 151 SQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
            +C  ++       C   N  C Y   YGD S + G+ A ET T+  TT     +   + 
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 287

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKI 255
            + FGCG  N GLF+     +    G  +S  SQ+++     FSYCLV  +     S+K+
Sbjct: 288 NMMFGCGHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKL 346

Query: 256 NFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI------ 301
            FG +  ++S P +  T     K     TFY + I +I V  + L +   T +I      
Sbjct: 347 IFGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAG 406

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVT 356
             +IDSGTTL++  +     + + ++   + + PV      L+ C++ + +   Q+PE+ 
Sbjct: 407 GTIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELG 466

Query: 357 IHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVS 414
           I F  GA       N F+ ++ED+VC    G   S   I GN  Q NF + YD ++  + 
Sbjct: 467 IAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLG 526

Query: 415 FKPTDC 420
           + PT C
Sbjct: 527 YAPTKC 532


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score =  146 bits (369), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/360 (32%), Positives = 179/360 (49%), Gaps = 33/360 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +GTP  +     DTGSD++W  C     CP     ++ +P +D   SST KS+ 
Sbjct: 85  YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSVS 143

Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
           CS + C+ +NQ+S C SG  CQY + YGDGS +NG L  + V L   TG  Q  +  G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI 203

Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINF 257
            FGCG+   G      +   GI+G G  + S ISQ+ +   +   F++CL   +   I F
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGI-F 262

Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGTTL 309
               +VS P V +TP+      Y + ++AI VGN  L +S+          ++IDSGTTL
Sbjct: 263 AIGEVVS-PKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTL 321

Query: 310 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKLS 367
            +LP   YN  L  +++S  E   +     S    +  + L + P VT  F +   + + 
Sbjct: 322 VYLPDAVYNPLLNEILASHPEL-TLHTVQESFTCFHYTDKLDRFPTVTFQFDKSVSLAVY 380

Query: 368 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
              +  +V ED  C  ++  G+      S+ I G++  +N LV YDIE Q + +   +C+
Sbjct: 381 PREYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 187/366 (51%), Gaps = 38/366 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +G PP   L + DTGSDL W QC+PC    C+ Q  P+FDP  S+++K +PC+
Sbjct: 86  EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 143

Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
           ++ C  +    C       S   C+Y   YGD S ++G+LA E++++  S    ++ +  
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS-----STKI 255
           +  GCG +N GL      G++GLG G +S  SQ+R++  G+ FSYCLV  +     S+ I
Sbjct: 204 MVIGCGHSNKGL-FQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 262

Query: 256 NFGTNGIVSG--PGVVSTPLTK----AKTFYVLTIDAISVGN-------QRLGVSTP--- 299
           +FG    +S     +  TP  +     +TFY L I  I +         +R  ++T    
Sbjct: 263 SFGAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSG 322

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
             +IDSGTTLT+L +     + S   + I + P ADP   L +CY+    + V  P ++I
Sbjct: 323 GTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPFPALSI 381

Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            F+ GA++ L + N+F++            + T+ + I GN  Q N    YD++   + F
Sbjct: 382 VFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGF 441

Query: 416 KPTDCT 421
             TDC+
Sbjct: 442 ANTDCS 447


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/353 (33%), Positives = 170/353 (48%), Gaps = 31/353 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            +++ +  GTP      + DTGSD+ W QC PC    CY Q  P+FDP  S+TY ++PC 
Sbjct: 119 EFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSAVPCG 177

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
             QCA+   K  S   C Y V YGDGS + G L+ ET++L S    A ALPG  FGCG  
Sbjct: 178 HPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTS----ARALPGFAFGCGET 233

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSG-P 266
           N G F     G++GLG G +SL SQ   +    FSYCL   +++   +  GT    SG  
Sbjct: 234 NLGDFG-DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSD 292

Query: 267 GVVSTPLTKAK---TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL-PQGYN 317
           GV  T + + +   +FY + + +I VG   L V     +    ++DSGT LT+L P+ Y 
Sbjct: 293 GVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEAYT 352

Query: 318 S--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD-VKLSRSNFFVK 374
           +  +      +  +  P  DP    + CY F   + +    + F+ +D      S F V 
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDP---FDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVL 409

Query: 375 VSEDIV-----CSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  D       C  F    +++P  I GN  Q N  + YD+  + + F    C
Sbjct: 410 IFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 107/346 (30%), Positives = 171/346 (49%), Gaps = 34/346 (9%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 75  GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134

Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ C +   CQ+ ++Y +G+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+    G + LGGG  S + Q  +  +  FSYC VP S++   F   G+        P  
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGVPPQRAALVPTF 249

Query: 269 VSTPL----TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSN 319
           VSTPL    T + TFY + + +I V  + L V     +   VIDS T ++ + P  Y + 
Sbjct: 250 VSTPLLSSSTMSPTFYRVLLRSIIVAGRPLPVPPTVFSASSVIDSATVISRIPPTAYQAL 309

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVS 376
             +  S+M   +P A P   L+ CY F+ +  +  P + + F  GA V L  +   ++  
Sbjct: 310 RAAFRSAMTMYRP-APPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQ-- 366

Query: 377 EDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 420
               C  F    ++ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 367 ---GCLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 115/357 (32%), Positives = 170/357 (47%), Gaps = 38/357 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC  + C+ Q  PLFDP  S +Y  LPC+
Sbjct: 126 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 181

Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           SS C +L   + S           +C Y++SY DGS+S G LA + ++L         + 
Sbjct: 182 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 236

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKIN 256
           G  FGCGT+N G F   T+G++GLG   +SLISQ      G FSYCL P+    SS  + 
Sbjct: 237 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESESSGSLV 294

Query: 257 FGTNGIV---SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT- 310
            G +  V   S P V +T ++      FY + +  I++G Q +  S   +++DSGT +T 
Sbjct: 295 LGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITS 354

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLS 367
            +P  YN+     +S   E  P A     L+ C++       Q+P +   F G  +V++ 
Sbjct: 355 LVPSVYNAVKAEFLSQFAE-YPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 413

Query: 368 RSN--FFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            S   +FV      VC     + +     I GN  Q N  V +D     + F    C
Sbjct: 414 SSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 470


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/357 (32%), Positives = 170/357 (47%), Gaps = 38/357 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC  + C+ Q  PLFDP  S +Y  LPC+
Sbjct: 125 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 180

Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           SS C +L   + S           +C Y++SY DGS+S G LA + ++L         + 
Sbjct: 181 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 235

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----SSTKIN 256
           G  FGCGT+N G F   T+G++GLG   +SLISQ      G FSYCL P+    SS  + 
Sbjct: 236 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGGVFSYCL-PLKESESSGSLV 293

Query: 257 FGTNGIV---SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT- 310
            G +  V   S P V +T ++      FY + +  I++G Q +  S   +++DSGT +T 
Sbjct: 294 LGDDTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVESSAGKVIVDSGTIITS 353

Query: 311 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLS 367
            +P  YN+     +S   E  P A     L+ C++       Q+P +   F G  +V++ 
Sbjct: 354 LVPSVYNAVKAEFLSQFAE-YPQAPGFSILDTCFNLTGFREVQIPSLKFVFEGNVEVEVD 412

Query: 368 RSN--FFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            S   +FV      VC     + +     I GN  Q N  V +D     + F    C
Sbjct: 413 SSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQIGFAQETC 469


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  145 bits (367), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 178/365 (48%), Gaps = 42/365 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 78  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C    C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
            + FGCG N  G     +S   GI+G G  + S+ISQ+    + K  FS+CL  ++   I
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 255

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
            F   G V  P V +TP+   +  Y + +  + V          L  +  D   +IDSGT
Sbjct: 256 -FAV-GEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGT 313

Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
           TL +LPQ  YNS +  + +       +   T +   C+SF  N+    P V +HF  + +
Sbjct: 314 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 369

Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           KLS    ++   + ED+ C  ++  G+T      V + G+++ +N LV YD+E + + + 
Sbjct: 370 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 429

Query: 417 PTDCT 421
             +C+
Sbjct: 430 DHNCS 434


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/365 (31%), Positives = 178/365 (48%), Gaps = 42/365 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 74  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 131

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C    C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
            + FGCG N  G     +S   GI+G G  + S+ISQ+    + K  FS+CL  ++   I
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 251

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD--IVIDSGT 307
            F   G V  P V +TP+   +  Y + +  + V          L  +  D   +IDSGT
Sbjct: 252 -FAV-GEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGT 309

Query: 308 TLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADV 364
           TL +LPQ  YNS +  + +       +   T +   C+SF  N+    P V +HF  + +
Sbjct: 310 TLAYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-L 365

Query: 365 KLSR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           KLS    ++   + ED+ C  ++  G+T      V + G+++ +N LV YD+E + + + 
Sbjct: 366 KLSVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWA 425

Query: 417 PTDCT 421
             +C+
Sbjct: 426 DHNCS 430


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 121/450 (26%), Positives = 197/450 (43%), Gaps = 62/450 (13%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
           +ELIHR SP+       +T  QRL++ +     R L  L H  +   I   KA +     
Sbjct: 3   LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59

Query: 82  -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
                  A  +P +         Y +   +GTP  + + VADTGSDL W  C+  C    
Sbjct: 60  SGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
           C       ++   +F   +SS++K++PC +  C        SL         C Y   Y 
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DGS + G  A ETVT+    G+ + L  +  GC  +  G       G++GLG    S   
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 234 QMRTTIAGKFSYCLVPVSSTK-----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTI 284
           +      GKFSYCLV   S K     + FG+      +++        L    +FY + +
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 285 DAISVGNQRLGVSTP--DI------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVAD 335
             IS+G   L + +   D+      ++DSG++LTFL +  Y   + ++  S+++ + V  
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 336 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 391
             G LE C++     +  VP +  HF  GA+ +    ++ +  ++ + C  F  +     
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + GNIMQ N L  +D+  + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/426 (27%), Positives = 189/426 (44%), Gaps = 44/426 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNN 88
           ++L HRD+           P  R+ D +     R +  ++             + I    
Sbjct: 33  LKLAHRDT-------LWPNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGT 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           A Y   + +GTP  +   V DTGS+L W  C      +  +++  +F  + S ++K++ C
Sbjct: 86  AQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGC 145

Query: 149 SSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            +  C        SL+        C Y   Y DGS + G  A ET+T+G T G+   L G
Sbjct: 146 FTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRG 205

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----IN 256
           +  GC ++  G       G++GL   D S  S   +    K SYCLV   S K     + 
Sbjct: 206 LLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSNKNISNYLI 265

Query: 257 FG----TNGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTP--------DIV 302
           FG    +    + PG  +TP  LT    FY + I  IS+G+  L + T           +
Sbjct: 266 FGYSSSSTSTKTAPG-RTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTGGGTI 324

Query: 303 IDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTI 357
           +DSGT+LT L +  Y   +  +   ++E + V      +E C+S    FN  S++P++T 
Sbjct: 325 LDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNE-SKLPQLTF 383

Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           H + GA  +  R ++ V  +  + C  F    T +  + GNIMQ N+L  +D+   T+SF
Sbjct: 384 HLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMASTLSF 443

Query: 416 KPTDCT 421
            P+ CT
Sbjct: 444 APSTCT 449


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  145 bits (365), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 111/332 (33%), Positives = 160/332 (48%), Gaps = 47/332 (14%)

Query: 18  VVSPIEAQTGGFSVELIHRD-------SPKSPFYNSSETPYQRLRDALTRSLN-RLNHFN 69
            + P   Q+GG     IH         +P+ P   S    +    DA  ++LN RL    
Sbjct: 28  ALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSFSDVLAWD---DARVKTLNSRLTR-- 82

Query: 70  QNSSISSSKASQADI-------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLI 115
           +++    S  ++ DI       +P N        NY +++  G+P      + DTGS L 
Sbjct: 83  KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLS 142

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC--SGVNCQY 168
           W QC+PC    C++Q  PLFDP  S TYKSL C+SSQC     A+LN   C  S   C Y
Sbjct: 143 WLQCKPCV-VYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVY 201

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
           + SYGD S+S G L+ + +TL  +      LPG  +GCG ++ GLF  +  GI+GLG   
Sbjct: 202 TASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVYGCGQDSDGLFG-RAAGILGLGRNK 256

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTID 285
           +S++ Q+ +     FSYCL               ++G     TP+T      + Y L + 
Sbjct: 257 LSMLGQVSSKFGYAFSYCLPTRGGGGFLSIGKASLAGSAYKFTPMTTDPGNPSLYFLRLT 316

Query: 286 AISVGNQRLGVSTPDI----VIDSGTTLTFLP 313
           AI+VG + LGV+        +IDSGT +T LP
Sbjct: 317 AITVGGRALGVAAAQYRVPTIIDSGTVITRLP 348


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 178/366 (48%), Gaps = 46/366 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 43  NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC +  C S    +CSG  C Y  +     D   + G + TET  +G+ T        + 
Sbjct: 97  PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTN 260
           FGC   +       T+G +GLG    SL++QM+ T   KFSYCL P     S+++  G++
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSS 207

Query: 261 GIVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF-- 311
             ++G       P + ++P   +  +Y+L++DAI  GN  +  +    ++   T   F  
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSL 267

Query: 312 -LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADV 364
            +   Y +   +V  ++  A  QP+A P    +LC+      S +  P++   F+G A +
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAAL 327

Query: 365 KLSRSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            +  + + + V E  D  C+    +          V + G++ Q +    YD++++T+SF
Sbjct: 328 TVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSF 387

Query: 416 KPTDCT 421
           +P DC+
Sbjct: 388 EPADCS 393


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 115/348 (33%), Positives = 154/348 (44%), Gaps = 31/348 (8%)

Query: 94  RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           R S   P   +L + DT SD+ W QC PCP SQCY Q   L+DP  S + +S  CSS  C
Sbjct: 172 RRSRLRPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTC 231

Query: 154 ASL-------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
             L       +  S S   CQY V Y DGS ++G L  + ++L  T+     +P   FGC
Sbjct: 232 RQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGC 287

Query: 207 GTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--V 263
                G F  SKT GI+ LG G  SL+SQ  T     FSYC  P +S K  F   G+   
Sbjct: 288 SHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHK-GFFVLGVPRR 346

Query: 264 SGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSN 319
           S      TP+ K    Y + ++AI+V  QRL V          +DS T +T LP      
Sbjct: 347 SSSRYAVTPMLKTPMLYQVRLEAIAVAGQRLDVPPTVFAAGAALDSRTVITRLPPTAYQA 406

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR--GADVKLSRSNFFVKV 375
           L S     +     A   G L+ CY F  +S +  P +++ F   GA V+L  S      
Sbjct: 407 LRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG- 465

Query: 376 SEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                C  F    G   +  I G +      V Y++   +V F+   C
Sbjct: 466 ----SCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 109/331 (32%), Positives = 161/331 (48%), Gaps = 28/331 (8%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
           V D+ SD+ W QC PCP   C+ Q    +DP  S T  +  CSS  C +L      C+  
Sbjct: 32  VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            CQY V Y DGS ++G    + +TL +  G AV+  G  FGC     G F+++  GI+ L
Sbjct: 92  QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 147

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---AKTF 279
           GGG  SL+SQ  +     FSYC +P +++   F T G+   +    V TP+ +   A TF
Sbjct: 148 GGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATF 206

Query: 280 YVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
           Y + +  I+VG QRLGV+ P +     V+DS T +T LP      L +   S +     A
Sbjct: 207 YGVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSA 265

Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
            P G L+ CY F  +   ++P++++ F R A + L  S           C  F     + 
Sbjct: 266 PPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDR 320

Query: 391 VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +P + G++ Q    V YD+    V F+   C
Sbjct: 321 MPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  144 bits (364), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 181/366 (49%), Gaps = 38/366 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +G PP   L + DTGSDL W QC+PC    C+ Q  P+FDP  S+++K +PC+
Sbjct: 170 EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 227

Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
           ++ C  +    C       S   C+Y   YGD S ++G+LA E++++  S    ++ +  
Sbjct: 228 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 287

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS-----STKI 255
           +  GCG +N GLF      +    G  +S  SQ+R++  G+ FSYCLV  +     S+ I
Sbjct: 288 MVIGCGHSNKGLFQGAGGLLGLGQGA-LSFPSQLRSSPIGQSFSYCLVDRTNNLSVSSAI 346

Query: 256 NFGTNGIVSG--PGVVSTPLTK----AKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
           +FG    +S     +  TP  +     +TFY L I  I +  + L +             
Sbjct: 347 SFGAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSG 406

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 357
             +IDSGTTLT+L +     + S   + I + P ADP   L +CY+    + V  P ++I
Sbjct: 407 GTIIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSI 465

Query: 358 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            F+ GA++ L + N+F++            + T+ + I GN  Q N    YD++   + F
Sbjct: 466 VFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGF 525

Query: 416 KPTDCT 421
             TDC+
Sbjct: 526 ANTDCS 531


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  144 bits (363), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 150/342 (43%), Gaps = 51/342 (14%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 226

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G + LGG
Sbjct: 227 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 282

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPL 273
           G  SL+SQ   T    FSYC+   SS+                F    +V  P ++    
Sbjct: 283 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII---- 338

Query: 274 TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI 328
               T Y++ +  I VG +RL V         V+DS   +T L P  Y +  L+  S+M 
Sbjct: 339 ---PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 395

Query: 329 EAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 386
               VA     L+ CY F   +   VP V++ F G  V          V  D +  + +G
Sbjct: 396 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEG 445

Query: 387 ITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               VP          GN+ Q    V YD+   +V F+   C
Sbjct: 446 CLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 113/370 (30%), Positives = 176/370 (47%), Gaps = 41/370 (11%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+   +DPK S+++K++ C
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITC 217

Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
           +  +C+ ++      Q      +C Y   YGD S + G+ A ET T+  TT +  +    
Sbjct: 218 NDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYK 277

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+  +  +    G  +S  SQ+++     FSYCLV  +     S+
Sbjct: 278 VENMMFGCGHWNRGLFSGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSS 336

Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGVS------TPD- 300
           K+ FG +  +++   +  T     K     TFY + I +I VG + L +       +PD 
Sbjct: 337 KLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDG 396

Query: 301 ---IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----V 352
               +IDSGTTL++  +  Y          M E   V      L+ C++ + + +    +
Sbjct: 397 AGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHL 456

Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQ 410
           PE+ I F  GA       N F+ +SED+VC    G   S   I GN  Q NF + YD + 
Sbjct: 457 PELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKM 516

Query: 411 QTVSFKPTDC 420
             + F PT C
Sbjct: 517 SRLGFTPTKC 526


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 113/382 (29%), Positives = 173/382 (45%), Gaps = 67/382 (17%)

Query: 43  FYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA-DIIPNNANYLIRISIGTPP 101
           +Y+ + T   R   A  RS+  LN+    +S SSS    +  ++P    Y++   +G P 
Sbjct: 8   YYDHNMTSTDRSIWAADRSIAXLNYLLSVTSSSSSLGDISSKLVPEYYEYIMMYYLGVPS 67

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
           T    +ADTGS+LIW QC PC  + CY Q  P+FDP  S TY+++   S  C ++ + SC
Sbjct: 68  TLVYGIADTGSELIWLQCLPC--THCYNQTPPIFDPAESYTYETVSSDSPICNAVRRISC 125

Query: 162 S--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
                +C Y  +YGDG+ + G L+T+       T   V +  +TFGC  +          
Sbjct: 126 REGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKGHQA 185

Query: 220 GIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTKINFGTNGIVSGPGVVSTPLTK 275
           G+VGL     SL+SQ++     KFSYC+V      S +++ FG+  ++ G     TPL K
Sbjct: 186 GVVGLNRHPNSLVSQLKVK---KFSYCMVIPDDHGSGSRMYFGSRAVILGG---KTPLLK 239

Query: 276 AK-TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
              + Y +T+  ISVG ++      D +  +G  +TF                       
Sbjct: 240 GDYSHYFVTLKGISVGEEK---GRSDELASAGPDITF----------------------- 273

Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVP 392
                                  HF GAD  L++   +V+V + + C        T  + 
Sbjct: 274 -----------------------HFYGADFILTKXTTYVEVEKGLWCLAMLSSNSTRKLS 310

Query: 393 IYGNIMQTNFLVGYDIEQQTVS 414
           I GNI Q N+ VGYD+E Q V+
Sbjct: 311 ILGNIQQQNYHVGYDLEAQEVA 332



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 3/112 (2%)

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFS-NGN 181
           +QC+ Q  P+FDP  SSTY ++P  +  C      +C     +C Y +SYG GS S  G 
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           ++ +           V +  + FGC     G F     GIVGL    +SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 137/426 (32%), Positives = 196/426 (46%), Gaps = 50/426 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPY--------QRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           + L HR  P +    S+  P         +R  + + R ++           +++ +S++
Sbjct: 425 LRLTHRHGPCAGPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKS 484

Query: 83  DIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             IP N         Y++ +S+GTP   +    DTGSD+ W QC PC    CY Q   LF
Sbjct: 485 VTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLF 544

Query: 136 DPKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           DP  SS+Y ++PC++  C+ L+       +G  C Y VSYGDGS + G   ++T+TL   
Sbjct: 545 DPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTL--- 601

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVS 251
              A A+ G  FGCG    GLF +   G++ LG   +SL SQ      G  FSYCL P  
Sbjct: 602 -TDADAVTGFLFGCGHAQAGLF-AGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSP 659

Query: 252 STKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRL-GVSTPDI----VI 303
           S+       G  S  G  +T L  A    TFY++ +  I VG Q+L GV         V+
Sbjct: 660 SSTGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVV 719

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHF 359
           D+GT +T LP    + L +   + +     P A  TG L+ CY+F     V  P V++ F
Sbjct: 720 DTGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTF 779

Query: 360 R-GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
             GA +KL    F         C  F   TNS      I GN+ Q +F V +D    +V 
Sbjct: 780 SGGATLKLDAPGFLSS-----GCLAFA--TNSGDGDPAILGNVQQRSFAVRFD--GSSVG 830

Query: 415 FKPTDC 420
           F P  C
Sbjct: 831 FMPHSC 836


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 168/368 (45%), Gaps = 43/368 (11%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C++  C  L+   C      C Y V+YGDGS + G+ ATET+T  S       +P +  
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP---------VSSTKI 255
           GCG +N GLF +    ++GLG G +S  SQ+       FSYCLV            S+ +
Sbjct: 257 GCGHDNEGLFVAAAG-LLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTV 315

Query: 256 NFGTNGIVS-GPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST---------PD----- 300
            FG+    + G  V+     + +   VL   A   G+QR   +          PD     
Sbjct: 316 TFGSGARGALGRRVLHPDGEEPQDGDVLLRAAH--GHQRRRRARPGRGRVRPPPDPSTGR 373

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLS--QVPE 354
             +++DSG       +   +   +  S    A     P G    + CY  + L   +VP 
Sbjct: 374 GGVIVDSGRPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVPT 433

Query: 355 VTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           V++HF  GA+  L   N+ + V S    C  F G    V I GNI Q  F V +D + Q 
Sbjct: 434 VSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQR 493

Query: 413 VSFKPTDC 420
           + F P  C
Sbjct: 494 LGFVPKGC 501


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 150/342 (43%), Gaps = 51/342 (14%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G + LGG
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 266

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPL 273
           G  SL+SQ   T    FSYC+   SS+                F    +V  P ++    
Sbjct: 267 GRQSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII---- 322

Query: 274 TKAKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMI 328
               T Y++ +  I VG +RL V         V+DS   +T L P  Y +  L+  S+M 
Sbjct: 323 ---PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMA 379

Query: 329 EAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 386
               VA     L+ CY F   +   VP V++ F G  V          V  D +  + +G
Sbjct: 380 AYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEG 429

Query: 387 ITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               VP          GN+ Q    V YD+   +V F+   C
Sbjct: 430 CLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 109/331 (32%), Positives = 160/331 (48%), Gaps = 28/331 (8%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
           V D+ SD+ W QC PCP   C+ Q    +DP  S +     CSS  C +L      C+  
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            CQY V Y DGS ++G    + +TL +  G AV+  G  FGC     G F+++  GI+ L
Sbjct: 222 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 277

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---AKTF 279
           GGG  SL+SQ  +     FSYC +P +++   F T G+   +    V TP+ +   A TF
Sbjct: 278 GGGPESLLSQTASRYGNAFSYC-IPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATF 336

Query: 280 YVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
           Y + +  I+VG QRLGV+ P +     V+DS T +T LP      L S   S +     A
Sbjct: 337 YGVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSA 395

Query: 335 DPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNS 390
            P G L+ CY F  +   ++P++++ F R A + L  S           C  F     + 
Sbjct: 396 PPKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDR 450

Query: 391 VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +P + G++ Q    V YD+    V F+   C
Sbjct: 451 MPGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/333 (35%), Positives = 159/333 (47%), Gaps = 33/333 (9%)

Query: 109 DTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSGV 164
           DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC    CA L      +CS  
Sbjct: 4   DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 63

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y VSYGDGS + G  +++T+TL +++    A+ G  FGCG    GLFN    G++GL
Sbjct: 64  QCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFGCGHAQSGLFNG-VDGLLGL 118

Query: 225 GGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV-SGPGVVST---PLTKAKT 278
           G    SL+ Q   T  G FSYCL   P ++  +  G  G   + PG  +T   P   A T
Sbjct: 119 GREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178

Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQGYNSNLLSVMSSMIEA--QP 332
           +YV+ +  ISVG Q+L V        +          LP    + L S   S + +   P
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYP 238

Query: 333 VADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFK--GI 387
            A   G L+ CY+F     V  P V + F  GA V L              C  F   G 
Sbjct: 239 TAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPSGS 293

Query: 388 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 294 DGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 168/363 (46%), Gaps = 41/363 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
            Y +++ +GTP  E   VADTGSDL W +C    PP +       +F PK S ++  +PC
Sbjct: 115 QYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR-------VFRPKTSRSWAPIPC 167

Query: 149 SSSQCA-----SLNQKSCSGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGI 202
           SS  C      +L   S     C Y   Y +GS  + G + TE+ T+    G+   L  +
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDV 227

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI 262
             GC +++ G       G++ LG   IS  +Q      G FSYCL  V        T  +
Sbjct: 228 VLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCL--VDHLAPRNATGYL 285

Query: 263 VSGPGVV-STPLTKAK-------TFYVLTIDAISVGNQRLGV-------STPDIVIDSGT 307
             GPG V  TP T+ K        FY + +DAI V  + L +        +  +++DSG 
Sbjct: 286 AFGPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGN 345

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFNSLSQ-----VPEVTIHFRG 361
           TLT L       +++ +S  ++  P V+ P    E CY++ +        +P++ + F G
Sbjct: 346 TLTVLAAPAYKAVVAALSKHLDGVPKVSFPP--FEHCYNWTARRPGAPEIIPKLAVQFAG 403

Query: 362 -ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            A ++    ++ + V   + C  V +G    + + GNIMQ   L  +D++   V FK ++
Sbjct: 404 SARLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSN 463

Query: 420 CTK 422
           CT+
Sbjct: 464 CTR 466


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 115/428 (26%), Positives = 189/428 (44%), Gaps = 46/428 (10%)

Query: 30  SVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           +++LI R+S    +P      TP   ++     S  R  +  QNS +    +S   +  +
Sbjct: 2   AMKLIRRESVVRHNPDARVPVTPEDHIQHMTDISSARFKYL-QNSIVKELGSSDFQVDVH 60

Query: 88  NAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
            A     + +  S+G PP  +  + DTGS L+W QC PC          P+F+P +SST+
Sbjct: 61  QAIKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTF 120

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
               C    C       CS   C Y   Y  G+ S G LA E +T  +  G  V    I 
Sbjct: 121 VECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           FGCG  NG    S+ TGI+GLG    SL  Q+      KFSYC+  +++   N+G N +V
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQL----GSKFSYCIGDLANK--NYGYNQLV 234

Query: 264 SGP--GVVSTP----LTKAKTFYVLTIDAISVGNQRLGV---------STPDIVIDSGTT 308
            G    ++  P           Y + ++ ISVG+++L +         S   +++D+GT 
Sbjct: 235 LGEDADILGDPTPIEFETENGIYYMNLEGISVGDKQLNIEPVVFKRRGSRTGVILDTGTL 294

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYS---FNSLSQVPEVTIHFR-GAD 363
            T+L       L + + S+++  P  +     + LCY       L   P VT HF  GA+
Sbjct: 295 YTWLADIAYRELYNEIKSILD--PKLERFWFRDFLCYHGRVNEELIGFPVVTFHFAGGAE 352

Query: 364 VKLSRSNFFVKVSE-----DIVCSVFKGITNSVPIY------GNIMQTNFLVGYDIEQQT 412
           + +  ++ F  ++E     ++ C   +  T     Y      G + Q  + + YD++++ 
Sbjct: 353 LAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTAIGLMAQQYYNIAYDLKERN 412

Query: 413 VSFKPTDC 420
           +  +  DC
Sbjct: 413 IYLQRIDC 420


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 138/396 (34%), Positives = 188/396 (47%), Gaps = 41/396 (10%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNA-NYLIRISIGTPPTERLAV 107
           L+D L R  +    F+  ++ S  K  QADI     IP  A NYL+++++GTP       
Sbjct: 3   LQDQL-RVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSG 163
            DTGSD+ WTQCEPC  S CY Q    FDP+ SS+YK++ CSSS C     S   + C  
Sbjct: 62  LDTGSDITWTQCEPCVGS-CYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
             C Y V YGDGS+S G  ATE +T+  +      +    FGCG  N G F  +  G++G
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSD----VISNFLFGCGQQNAGRFG-RIAGLLG 175

Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLT---KAKTFY 280
           LG G +SL  Q        F+YCL   SS+     T G      V  TPL+   K   FY
Sbjct: 176 LGRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFY 235

Query: 281 VLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
            + I  +SVG   L +     S    +IDSGT +T L     S L S    +++  P  D
Sbjct: 236 GIDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTD 295

Query: 336 PTGSLELCYSF--NSLSQVPEVTIHFRGA---DVKLSRSNFF----VKVSEDIVCSVF-- 384
               L+ CY F  N    VP ++  F+G    D+K     FF    V  + D VC  F  
Sbjct: 296 GFSILDTCYDFSGNESISVPRISFFFKGGVEVDIK-----FFGILTVINAWDKVCLAFAP 350

Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                   ++GN  Q  + V +D+ +  + F P+ C
Sbjct: 351 NDDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  143 bits (360), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 127/442 (28%), Positives = 213/442 (48%), Gaps = 39/442 (8%)

Query: 10  ILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           +L  +CF ++ SP     + + GFS  LIH  SP SP+ N       +   AL  +L+R 
Sbjct: 7   LLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALESTLSRH 65

Query: 66  NHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
            +    Q  ++  +      +I + + +L  +SIG PPT    V DTGSDL W QCEPC 
Sbjct: 66  AYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPC- 124

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGSFSNGN 181
              CY Q  P+++   S +Y  + C+   C SL ++  CS   +C Y  +Y DG+ ++G 
Sbjct: 125 -DVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGL 183

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRT--T 238
           L+ E V   S          + FGCG  N     S +  G++GLG G +SL+SQ+     
Sbjct: 184 LSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGK 243

Query: 239 IAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYV-LTIDAISVGNQR 293
           ++  F+YC   +S+      + FG    ++G     TP+  A+ +YV L    + VG  R
Sbjct: 244 VSKSFAYCFGNISNPNAGGFLVFGDATYLNGD---MTPMVIAEFYYVNLLGIGLGVGEPR 300

Query: 294 LGVST------PD----IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
           L +++      PD    ++IDSG+TL+ F P+ Y     +V+  + +   ++  T S + 
Sbjct: 301 LDINSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD- 359

Query: 343 CYS---FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 399
           C+       L   P + ++     +   R + F++  +++ C  F      + I G + Q
Sbjct: 360 CFEGKIERDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTLAQ 418

Query: 400 TNFLVGYDIEQQTVSFKPT-DC 420
            ++  GY++E  T+S +   DC
Sbjct: 419 QSYKFGYNLELSTLSIESNPDC 440


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/289 (33%), Positives = 141/289 (48%), Gaps = 31/289 (10%)

Query: 23  EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRLNHFNQNSSISSSKA 79
             + G   +E+  R   S K   ++        L D   RS+ NRL     + S+  S+ 
Sbjct: 71  RQEKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQI 130

Query: 80  S---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
                + +     NY++ + +G    +   + DTGSDL W QCEPC    CY Q  P+F 
Sbjct: 131 QIPLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPC--MSCYNQQGPVFK 186

Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTL 189
           P  SS+Y+S+PC+SS C SL     N  +C     NC Y+V+YGDGS++NG L  E ++ 
Sbjct: 187 PSTSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSF 246

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
           G      +++    FGCG NN GLF    +G++GLG  ++SLISQ  +T  G FSYCL P
Sbjct: 247 G-----GISVSNFVFGCGKNNKGLFGG-VSGLMGLGRSNLSLISQTNSTFGGVFSYCLPP 300

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAK--------TFYVLTIDAISVG 290
             +        G  S      TP+   +         FY+L +  I VG
Sbjct: 301 TDAGASGSLAMGNESSVFKNLTPIAYTRMVPNPQLSNFYMLNLTGIDVG 349


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 165/362 (45%), Gaps = 39/362 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
            Y +++ +GTP  E   VADTGS+L W +C     PP         +F P+ S ++  +P
Sbjct: 90  QYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-------VFRPEASKSWAPVP 142

Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN-GNLATETVTLGSTTGQAVALPG 201
           CSS  C      SL   S S   C Y   Y +GS    G + T++ T+    G+   L  
Sbjct: 143 CSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQD 202

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +  GC + + G       G++ LG   IS  S+      G FSYCL  V        T  
Sbjct: 203 VVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCL--VDHLAPRNATGY 260

Query: 262 IVSGPGVV-STPLTKAK-------TFYVLTIDAISVGNQRLGV-------STPDIVIDSG 306
           +  GPG V  TP T+ K        FY + +DA+ V  Q L +        +  +++DSG
Sbjct: 261 LAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDSG 320

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRG- 361
           TTLT L       +++ ++ ++   P  D     E CY++ +      ++P++ + F G 
Sbjct: 321 TTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTGC 379

Query: 362 ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A ++    ++ + V   + C  + +G    V + GNIMQ   L  +D++   V F P+ C
Sbjct: 380 ARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPSTC 439

Query: 421 TK 422
           T+
Sbjct: 440 TR 441


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 112/370 (30%), Positives = 176/370 (47%), Gaps = 41/370 (11%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+   +DPK S+++K++ C
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITC 215

Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
           +  +C+ ++      Q      +C Y   YGD S + G+ A ET T+  TT +       
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----ST 253
           +  + FGCG  N GLF+  +  +    G  +S  SQ+++     FSYCLV  +     S+
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSNTNVSS 334

Query: 254 KINFGTN-GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI---- 301
           K+ FG +  +++   +  T     K     TFY + I +I VG + L +   T +I    
Sbjct: 335 KLIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDG 394

Query: 302 ----VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----V 352
               +IDSGTTL++  +  Y          M E  P+      L+ C++ + + +    +
Sbjct: 395 DGGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHL 454

Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQ 410
           PE+ I F  G        N F+ +SED+VC    G   S   I GN  Q NF + YD ++
Sbjct: 455 PELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKR 514

Query: 411 QTVSFKPTDC 420
             + F PT C
Sbjct: 515 SRLGFTPTKC 524


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  142 bits (357), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 160/340 (47%), Gaps = 33/340 (9%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCS 162
           V DT SD+ W QC PCP   C+ Q   L+DP  SS+  + PCSS  C +L    N  + +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTG 220
           G  CQY V Y DGS S G   ++ +TL      A A+    FGC       G F++KT+G
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAK-PASAISEFRFGCSHALLQPGSFSNKTSG 277

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
           I+ LG G  SL +Q + T    FSYCL   PV S     G   + +    V TP+ ++K 
Sbjct: 278 IMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAV-TPMLRSKA 336

Query: 279 ---FYVLTIDAISVGNQRL----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
               Y++ + AI V  +RL     V     V+DS T +T LP      L +   + + A 
Sbjct: 337 APMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAY 396

Query: 332 PVADPTGSLELCYSFNSLS-------QVPEVTIHFRGAD--VKLSRSNFFVKVSEDIVCS 382
             A P   L+ CY F+  +       ++P++T+ F G +  V+L  S   +       C 
Sbjct: 397 RAAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLD-----GCL 451

Query: 383 VFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            F   T+     I GN+ Q    V Y+++  TV F+   C
Sbjct: 452 AFAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  142 bits (357), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 175/363 (48%), Gaps = 43/363 (11%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 67  NVANF----TIGTPPQPASAIIDVAGELVWTQCSMC--SRCFKQDLPLFVPNASSTFRPE 120

Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           PC +  C S+   +CS   C Y  +++   G  + G +AT+T  +G+ T        + F
Sbjct: 121 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 174

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNG 261
           GC   +G       +G++GLG    SL+SQM  T   KFSYCL P  S   +++  G++ 
Sbjct: 175 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSA 231

Query: 262 IVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFL 312
            ++G       P V ++P      +Y + +D I  G+  + +  S   +++ +   ++FL
Sbjct: 232 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFL 291

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSR 368
                  L   ++  + A P A P    +LC+    LS    P++   F+   A + +  
Sbjct: 292 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPP 351

Query: 369 SNFFVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             + + V E+   VC             +  ++ I G++ Q N     D+E++T+SF+P 
Sbjct: 352 PKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPA 411

Query: 419 DCT 421
           DC+
Sbjct: 412 DCS 414


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/353 (32%), Positives = 170/353 (48%), Gaps = 27/353 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  GTP      + DTGSDL W QC+PC    CY Q  P FDP  SS+Y ++
Sbjct: 133 DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAV 191

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  CA+     C+G  C Y V YGDGS + G L+ +T+T  S++       G TFGC
Sbjct: 192 PCGTPVCAAAGGM-CNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSS----KFTGFTFGC 246

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVS 264
           G  N G F  +  G++GLG G +SL SQ   +  G FSYCL   ++T   +N G     S
Sbjct: 247 GEKNIGDFG-EVDGLLGLGRGKLSLPSQAAPSFGGVFSYCLPSYNTTPGYLNIGATKPTS 305

Query: 265 GPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI------VIDSGTTLTFLPQG 315
              V  T + K     +FY + + +I++G   L V  P +      ++DSGT LT+LP  
Sbjct: 306 TVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVP-PSVFTKTGTLLDSGTILTYLPPP 364

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF-VK 374
             ++L       ++    A P   L+ CY F     +    + F  +D  +   +F+ + 
Sbjct: 365 AYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGAVFDLDFYGIM 424

Query: 375 VSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  D     I C  F     ++P  I GN  Q    V YD+  Q + F P  C
Sbjct: 425 IFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIPISC 477


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 101/288 (35%), Positives = 147/288 (51%), Gaps = 37/288 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
           +SVE++HRD+       ++   Y+R       R+A     L R + R    N++      
Sbjct: 74  WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133

Query: 78  KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             ++ D          +   +  Y  RI +GTP  E+  V DTGSD+ W QCEPC   +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S+++ ++ C S+ C+ L+   C    C Y  SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           T G+T+   VA+     GCG  N GLF      ++GLG G +S  +Q+ T     FSYCL
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLFIGAAG-LLGLGAGALSFPNQIGTQTGHTFSYCL 305

Query: 248 VPV---SSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISV 289
           V     SS  + FG   +  G   + TPL K     TFY L++ AIS+
Sbjct: 306 VDRESDSSGPLQFGPKSVPVGS--IFTPLEKNPHLPTFYYLSVTAISI 351


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 160/366 (43%), Gaps = 48/366 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + IG   +    + DTGSDL W QC PC    CY Q  PLF+P  SS++ SLPC+
Sbjct: 144 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 199

Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           S  C +L   +     CS  N   C Y + YGDGS+S G L  E +TLG T      +  
Sbjct: 200 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 254

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI--- 255
             FGCG NN GLF    +G++GL   ++SL+SQ  +     FSYCL      SS  +   
Sbjct: 255 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLG 313

Query: 256 -----NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------V 302
                NF     +S   ++  P  +   FY L +  IS+G   L V  P +        +
Sbjct: 314 GADFSNFKNISPISYTRMIQNP--QMSNFYFLNLTGISIGGVNLNV--PRLSSNEGVLSL 369

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
           +DSGT +T L         +                 L  C++     +V  P V   F 
Sbjct: 370 LDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE 429

Query: 361 GAD---VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           G     V +    +FVK     +C  F   G  +   I GN  Q N  V Y+ ++  V F
Sbjct: 430 GNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGF 489

Query: 416 KPTDCT 421
               C+
Sbjct: 490 AGEPCS 495


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 129/447 (28%), Positives = 216/447 (48%), Gaps = 39/447 (8%)

Query: 5   LSCVFILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           ++ V +L  +CF ++ SP     + + GFS  LIH  SP SP+ N       +   AL  
Sbjct: 15  MASVNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALES 73

Query: 61  SLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           +L+R  +    Q  ++  +      +I + + +L  +SIG PPT    V DTGSDL W Q
Sbjct: 74  TLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQ 133

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGS 176
           CEPC    CY Q  P+++   S +Y  + C+   C SL ++  CS   +C Y  SY DGS
Sbjct: 134 CEPC--DVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGS 191

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQM 235
            ++G L+ E V   S          + FGCG  N     +S+  G++GLG G +SL+SQ+
Sbjct: 192 RTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQL 251

Query: 236 RT--TIAGKFSYCLVPVSSTK----INFGTNGIVSGPGVVSTPLTKAKTFYV-LTIDAIS 288
                ++  F+YC   +S+      + FG    ++G     TP+  A+ +YV L    + 
Sbjct: 252 SAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYLNGD---MTPMVIAEFYYVNLLGIGLG 308

Query: 289 VGNQRLGVST------PD----IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPT 337
           V   RL +++      PD    ++IDSG+TL+ F P+ Y     +V+  + +   ++  T
Sbjct: 309 VEEPRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLT 368

Query: 338 GSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 394
            S + C+       L   P + ++     +   R + F++  +++ C  F      + I 
Sbjct: 369 SSPD-CFEGKIGRDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSII 426

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPT-DC 420
           G + Q ++  GY++E  T+S +   DC
Sbjct: 427 GTLAQQSYKFGYNLELSTLSIESNPDC 453


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  141 bits (356), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 106/376 (28%), Positives = 170/376 (45%), Gaps = 48/376 (12%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
           +  Y + + IG PP   L +ADTGSDL+W +C  C    C +   + +F P+ SST+   
Sbjct: 80  SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPA 137

Query: 147 PCSSSQCASLNQ----KSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            C    C  + +      C+       C Y   Y DGS ++G  A ET +L +++G+   
Sbjct: 138 HCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAK 197

Query: 199 LPGITFGCGTNNGGLFNSKTT-----GIVGLGGGDISLISQMRTTIAGKFSYCLV----- 248
           L  + FGCG    G   S T+     G++GLG G IS  SQ+      KFSYCL+     
Sbjct: 198 LKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQLGRRFGNKFSYCLMDYTLS 257

Query: 249 --PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI-- 301
             P S   I  G + +     +  TPL     + TFY + + ++ V   +L +  P I  
Sbjct: 258 PPPTSYLIIGDGGDAVSK---LFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRID-PSIWE 313

Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ- 351
                    V+DSGTTL FL       +++ +   I+     + T   +LC + + +++ 
Sbjct: 314 IDDSGNGGTVMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKP 373

Query: 352 ---VPEVTIHFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVG 405
              +P +   F G  V +    N+F++  E I C   + +   V   + GN+MQ  FL  
Sbjct: 374 EKILPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFE 433

Query: 406 YDIEQQTVSFKPTDCT 421
           +D ++  + F    C 
Sbjct: 434 FDRDRSRLGFSRRGCA 449


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  141 bits (355), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 160/366 (43%), Gaps = 48/366 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + IG   +    + DTGSDL W QC PC    CY Q  PLF+P  SS++ SLPC+
Sbjct: 65  NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 120

Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           S  C +L   +     CS  N   C Y + YGDGS+S G L  E +TLG T      +  
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 175

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKI--- 255
             FGCG NN GLF    +G++GL   ++SL+SQ  +     FSYCL      SS  +   
Sbjct: 176 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQTSSLFGSVFSYCLPTTGVGSSGSLTLG 234

Query: 256 -----NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------V 302
                NF     +S   ++  P  +   FY L +  IS+G   L V  P +        +
Sbjct: 235 GADFSNFKNISPISYTRMIQNP--QMSNFYFLNLTGISIGGVNLNV--PRLSSNEGVLSL 290

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 360
           +DSGT +T L         +                 L  C++     +V  P V   F 
Sbjct: 291 LDSGTVITRLSPSIYKAFKAEFEKQFSGYRTTPGFSILNTCFNLTGYEEVNIPTVKFIFE 350

Query: 361 GAD---VKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           G     V +    +FVK     +C  F   G  +   I GN  Q N  V Y+ ++  V F
Sbjct: 351 GNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKESKVGF 410

Query: 416 KPTDCT 421
               C+
Sbjct: 411 AGEPCS 416


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 133/445 (29%), Positives = 192/445 (43%), Gaps = 69/445 (15%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  +EL H D+ ++       T  +R+R A  R+  RL      S       + A I  N
Sbjct: 32  GLRLELTHVDAKQN------CTTKERMRRATERTHRRLA-----SMAGGGGEASAPIHWN 80

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y+    IG PP +  A+ DTGS+LIWTQC  C  + C+ QD   +DP  S T K + 
Sbjct: 81  ETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVA 140

Query: 148 CSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+ + C   ++  C+  G  C    +YG G+   G L TE  T G        +  + FG
Sbjct: 141 CNDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNV-SLAFG 198

Query: 206 CGTNN----GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           C T +    G L     +GI+GLG G +SL SQ+      KFSYCL P  S   N  T  
Sbjct: 199 CITASRLTPGSL--DGASGIIGLGRGKLSLPSQLGDN---KFSYCLTPYFSDAANTSTLF 253

Query: 262 IVSGPG-------VVSTPLTKA------KTFYVLTIDAISVGNQRLGVSTPDI------- 301
           + +  G         S P  K        +FY L +  I+VG  +L V            
Sbjct: 254 VGASAGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAP 313

Query: 302 ------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYS----FNSL 349
                 +IDSG+  T L       L   +   + A  V  P G+  L+LC       ++ 
Sbjct: 314 AKWGGTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAG 373

Query: 350 SQVPEVTIHF-----RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGN 396
             VP + +HF      G DV +   N++  V +   C V     G  +++P     I GN
Sbjct: 374 KLVPPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGN 433

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDCT 421
            MQ +  + YD+ Q  +SF+P DC+
Sbjct: 434 YMQQDMHLLYDLGQGVLSFQPADCS 458


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 185/379 (48%), Gaps = 53/379 (13%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMS 140
           +I+ ++  + + + I  P   R  + DTGSDLIWTQC+    +    +    P++DP  S
Sbjct: 8   NILLSDQGHSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGES 64

Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ST+  LPCS   C     + K+C+  N C Y   YG  + + G LA+ET T G+   +AV
Sbjct: 65  STFAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAV 121

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
           +L  + FGCG  + G      TGI+GL    +SLI+Q++     +FSYCL P +  K + 
Sbjct: 122 SLR-LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQ---RFSYCLTPFADKKTSP 176

Query: 257 --FG---------TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST------P 299
             FG         T   +    +VS P+     +Y + +  IS+G++RL V        P
Sbjct: 177 LLFGAMADLSRHKTTRPIQTTAIVSNPVE--TVYYYVPLVGISLGHKRLAVPAASLAMRP 234

Query: 300 D----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS---- 350
           D     ++DSG+T+ +L +     +   +  ++   PVA+ T    ELC+     +    
Sbjct: 235 DGGGGTIVDSGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAA 293

Query: 351 ----QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFL 403
               QVP + +HF  GA + L R N+F +    ++C      T+   V I GN+ Q N  
Sbjct: 294 MEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMH 353

Query: 404 VGYDIEQQTVSFKPTDCTK 422
           V +D++    SF PT C +
Sbjct: 354 VLFDVQHHKFSFAPTQCDQ 372


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  140 bits (354), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 111/349 (31%), Positives = 168/349 (48%), Gaps = 25/349 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            +++ +  G+P      + DTGSDL W QC+PC    CY Q  P+FDP  SS+Y  +PC 
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCS-GHCYKQHDPVFDPAKSSSYAVVPCG 169

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           +++CA+   + C+G  C Y V YGDGS + G LA ET+T  S++       G  FGCG  
Sbjct: 170 TTECAAAGGE-CNGTTCVYGVEYGDGSSTTGVLARETLTFSSSS----EFTGFIFGCGET 224

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK--INFGTNGIVSGPG 267
           N G F  +  G++GLG G +SL SQ      G FSYCL   ++T   ++ G   +     
Sbjct: 225 NLGDFG-EVDGLLGLGRGSLSLSSQAAPAFGGIFSYCLPSYNTTPGYLSIGATPVTGQIP 283

Query: 268 VVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSN 319
           V  T +       +FY + + +I++G   L V   +      ++DSGT LT+LP    + 
Sbjct: 284 VQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKTGTLLDSGTILTYLPPPAYTA 343

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF--VKVSE 377
           L       ++    A P   L+ CY F   S +    + F  +D  +   NFF  +   +
Sbjct: 344 LRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGAVFNLNFFGIMTFPD 403

Query: 378 D----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           D    + C  F      +P  + G+  Q +  V YD+  Q + F P  C
Sbjct: 404 DTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIPASC 452


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 171/358 (47%), Gaps = 42/358 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  ++IGTPP    A+     + +WTQC PC   +C+ QD PLF+   SSTY+  PC +
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFNRSASSTYRPEPCGT 85

Query: 151 SQCASLNQKSCSGVN-CQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           + C S+   +CSG   C Y V   +GD S   G   T+T  +G+ T        + FGC 
Sbjct: 86  ALCESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCA 136

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNG-I 262
            ++        +G+VGLG    SL+ QM  T    FSYCL P  +    + +  G +  +
Sbjct: 137 MDSNIKQLLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKL 193

Query: 263 VSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFLPQGY 316
             G    +TPL       + Y++ ++ I  G+  +    P+   +++D+   ++FL    
Sbjct: 194 AGGKSAATTPLVNTSDDSSDYMIHLEGIKFGDVII-APPPNGSVVLVDTIFGVSFLVDAA 252

Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRG-ADVKLSR 368
              +   ++  + A P+A PT   +LC+         NS   +P+V + F+G A + +  
Sbjct: 253 FQAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 312

Query: 369 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           S +        VC     S    +T  + I G + Q N    +D++++T+SF+P DC+
Sbjct: 313 SKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 370


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 107/380 (28%), Positives = 164/380 (43%), Gaps = 60/380 (15%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ +
Sbjct: 82  ESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRV 139

Query: 147 PCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PCSS QC +L    C     +G  C+Y V+YGDGS S G+LAT+ +   + T     +  
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNN 195

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +T GCG +N GLF+S          G +   +  R     ++     P SST    G   
Sbjct: 196 VTLGCGRDNEGLFDSAA--------GLLGRRAAARYPSRRRWPRRTAPSSSTASATGRRA 247

Query: 262 IVSG----------------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP------ 299
             +                 P    T   +A T +     A S      G  TP      
Sbjct: 248 QRAARTSCSAARRSRRPRRSPPCCRTRGARACTTWTWPGSA-SAARGSPGSRTPASRWTR 306

Query: 300 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--L 349
                 +V+DSGT ++   +   + L     +   A  +    G     + CY       
Sbjct: 307 RRGRGGVVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPA 366

Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTN 401
           +  P + +HF G AD+ L   N+F+       + +    C  F+   + + + GN+ Q  
Sbjct: 367 ASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQG 426

Query: 402 FLVGYDIEQQTVSFKPTDCT 421
           F V +D+E++ + F P  CT
Sbjct: 427 FRVVFDVEKERIGFAPKGCT 446


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/397 (28%), Positives = 187/397 (47%), Gaps = 45/397 (11%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDL 114
            +L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD+
Sbjct: 38  KKLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97

Query: 115 IWTQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYS 169
           +W  C+PCP  PS+  +     LFD   SST K + C    C+ ++Q  SC   V C Y 
Sbjct: 98  LWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYH 157

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVG 223
           + Y D S S GN   + +TL   TG     P    + FGCG++  G     +S   G++G
Sbjct: 158 IVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMG 217

Query: 224 LGGGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV 281
            G  + S++SQ+  T   K  FS+CL  V    I F   G+V  P V +TP+   +  Y 
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYN 275

Query: 282 LTIDAISVGNQRLGVSTPDI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA- 334
           + +  + V    L +  P I      ++DSGTTL + P+    +L+    +++  QPV  
Sbjct: 276 VMLMGMDVDGTALDLP-PSIMRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKL 331

Query: 335 DPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK----- 385
                   C+SF+    V  P V+  F  + VKL+    ++   + +++ C  ++     
Sbjct: 332 HIVEDTFQCFSFSENVDVAFPPVSFEFEDS-VKLTVYPHDYLFTLEKELYCFGWQAGGLT 390

Query: 386 -GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            G    V + G+++ +N LV YD+E + + +   +C+
Sbjct: 391 TGERTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  140 bits (352), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 92/247 (37%), Positives = 127/247 (51%), Gaps = 40/247 (16%)

Query: 90  NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           NY+  IS+G    +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY +
Sbjct: 91  NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 148

Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           + C++S CA           S          C Y+++YGDGSFS G LAT+TV LG    
Sbjct: 149 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG---- 204

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
              +L G  FGCG +N GLF   T G++GLG  ++SL+SQ  +   G FSYCL P +++ 
Sbjct: 205 -GASLGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGGVFSYCL-PAATSG 261

Query: 255 INFGTNGIVSGPGVVS-----TPLTKAKT--------FYVLTIDAISVGNQRL---GVST 298
              G+  +  G    S     TP+   +         FY L +   +VG   L   G+  
Sbjct: 262 DASGSLSLGGGDDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALAAQGLGA 321

Query: 299 PDIVIDS 305
            +++IDS
Sbjct: 322 SNVLIDS 328


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 110/386 (28%), Positives = 173/386 (44%), Gaps = 59/386 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL---------FDPKMS 140
            Y +R  +GTP    L VADTGSDL W +C    P+      SP          F P+ S
Sbjct: 96  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRR--PASANSSLSPADSGPGPGRAFRPEDS 153

Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTLGSTT 193
            T+  + C+S  C      SL      G  C Y   Y DGS + G + TE  T+ L    
Sbjct: 154 RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRE 213

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----P 249
            +   L G+  GC ++  G     + G++ LG   IS  S   +   G+FSYCLV    P
Sbjct: 214 ERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGRFSYCLVDHLSP 273

Query: 250 VSSTK-INFGTNGIVSGP------------GVVSTPL---TKAKTFYVLTIDAISVGNQR 293
            ++T  + FG N  VS P                TPL    + + FY +++ AISV  + 
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333

Query: 294 LGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELC 343
           L +            +++DSGT+LT L +     +++ +S  +   P    DP    E C
Sbjct: 334 LKIPRAVWDVEAGGGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMDP---FEYC 390

Query: 344 YSFNSLS------QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYG 395
           Y++ S S       VP++ +HF G A ++    ++ +  +  + C  + +G    + + G
Sbjct: 391 YNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIG 450

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
           NI+Q   L  +DI+ + + F+ + CT
Sbjct: 451 NILQQEHLWEFDIKNRRLKFQRSRCT 476


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 113/352 (32%), Positives = 165/352 (46%), Gaps = 49/352 (13%)

Query: 100 PPTERLAVADTGSDLI-WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           PP+ +  +A+   D I WTQC+PC   +C       FDP  S TY    C  S       
Sbjct: 83  PPSPQEILAEMNPDSITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPST------ 134

Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
                V   Y+++YGD S S GN   +T+TL  +       P   FGCG NN G F S  
Sbjct: 135 -----VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185

Query: 219 TGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG----------IVSGPG 267
            G++GLG G +S +SQ  +     FSYCL    S   + FG             +V+GPG
Sbjct: 186 DGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNGPG 245

Query: 268 VVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLS 322
             ++ L ++  ++V  +D ISVGN+RL V     ++P  +IDSGT +T LPQ   S L +
Sbjct: 246 --TSGLEESGYYFVKLLD-ISVGNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTA 302

Query: 323 VMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKV 375
                +   P+++        L+ CY+ +    V  PE+ +HF  GADV+L+        
Sbjct: 303 AFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGN 362

Query: 376 SEDIVCSVFKG-----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
               +C  F G     + + + I GN  Q +  V YDI+   + F    C+K
Sbjct: 363 DASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 111/364 (30%), Positives = 166/364 (45%), Gaps = 38/364 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     CE CP       D  L+DPK SST   + 
Sbjct: 86  YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVM 145

Query: 148 CSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C  + CA+       K  + V C+YSV+YGDGS + G+  T+ +     T      P   
Sbjct: 146 CDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANA 205

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            + FGCG   GG     N    GI+G G  + S++SQ+  T AGK    F++CL  +   
Sbjct: 206 SVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQL--TTAGKVKKIFAHCLDTIKGG 263

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
            I F    +V  P V +TPL   K  Y + +  I VG   L +             +IDS
Sbjct: 264 GI-FSIGDVVQ-PKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDS 321

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
           GTTLT+LP+     ++  + +  +     D  G L   Y  +     P +T HF   D+ 
Sbjct: 322 GTTLTYLPELVFKEVMLAVFNKHQDITFHDVQGFLCFQYPGSVDDGFPTITFHFE-DDLA 380

Query: 366 LSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           L      +F     D+ C  F+ G + S     + + G+++ +N LV YD+E + + +  
Sbjct: 381 LHVYPHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTD 440

Query: 418 TDCT 421
            +C+
Sbjct: 441 YNCS 444


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 170/371 (45%), Gaps = 53/371 (14%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC    C+ Q  PLFDP  S +Y ++PC+
Sbjct: 152 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQDPLFDPSSSPSYAAVPCN 207

Query: 150 SSQCASLN---------QKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           SS C +L            +C G +     C Y++SY DGS+S G LA + ++L      
Sbjct: 208 SSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEV-- 265

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV----S 251
              + G  FGCGT+N G     T+G++GLG   +SL+SQ      G FSYCL P+    S
Sbjct: 266 ---IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL-PLKESDS 321

Query: 252 STKINFGTNGIV---SGP----GVVSTPLTKAKTFYVLTIDAISVGNQRL-------GVS 297
           S  +  G +  V   S P     +VS PL     FY + +  I+VG Q +       G  
Sbjct: 322 SGSLVIGDDSSVYRNSTPIVYASMVSDPLQGP--FYFVNLTGITVGGQEVESSGFSSGGG 379

Query: 298 TPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 354
               +IDSGT +T  +P  YN+     +S   E  P A     L+ C++   L   QVP 
Sbjct: 380 GGKAIIDSGTVITSLVPSIYNAVKAEFLSQFAE-YPQAPGFSILDTCFNMTGLREVQVPS 438

Query: 355 VTIHFRGA-DVKLSRSN--FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIE 409
           + + F G  +V++      +FV      VC     + +     I GN  Q N  V +D  
Sbjct: 439 LKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTS 498

Query: 410 QQTVSFKPTDC 420
              V F    C
Sbjct: 499 GSQVGFAQETC 509


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 104/332 (31%), Positives = 155/332 (46%), Gaps = 29/332 (8%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS---- 162
           DT  D+ W QC PCP  QCY Q  PLFDP  SST  ++ C S  C SL      CS    
Sbjct: 153 DTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSA 212

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
              C+Y + Y D   + G   T+T+T+  TT    A+    FGC     G F+  T G +
Sbjct: 213 NAECRYLIEYSDDRATAGTYMTDTLTISGTT----AVRNFRFGCSHAVRGRFSDLTAGTM 268

Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-INFGTNGIVSGPGV-VSTPLTKAK--- 277
            LGGG  SL++Q   ++   FSYC+   S++  ++ G     +   V  +TPL ++    
Sbjct: 269 SLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINP 328

Query: 278 TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 333
           + Y++ +  I V  +RLG+     +   V+DS   +T LP      L     + + A P 
Sbjct: 329 SLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPR 388

Query: 334 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS 390
           +  TG+L+ CY F  L+  +VP V++ F  GA V L      +       C  F   ++ 
Sbjct: 389 SGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG-----GCLAFTATSSD 443

Query: 391 VPI--YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + +   GN+ Q    V YD+    V F+   C
Sbjct: 444 LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  139 bits (350), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 119/407 (29%), Positives = 172/407 (42%), Gaps = 87/407 (21%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 32  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 91

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 92  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G          
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT--------- 202

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
                     G  FGC     G   + KT G++GLGG   SL+SQ               
Sbjct: 203 ----------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQ--------------- 237

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-----VID 304
                               +    K  T+Y   ++ I+VG ++LG+S P +     ++D
Sbjct: 238 -------------------TAARSKKVPTYYFAALEDIAVGGKKLGLS-PSVFAAGSLVD 277

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA 362
           SGT +T LP    + L S   + +     A+P G L+ C++F  L +V  P V + F G 
Sbjct: 278 SGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCFNFTGLDKVSIPTVALVFAGG 337

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 407
            V    ++  V       C  F    +  +    GN+ Q  F V YD
Sbjct: 338 AVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQRTFEVLYD 380


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  139 bits (349), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 170/370 (45%), Gaps = 42/370 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSL 146
            Y ++  +GTP    + VADTGSDL W +C       P    +    +F P  S ++  +
Sbjct: 109 QYFVQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPI 168

Query: 147 PCSSSQCAS---LNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTL---GSTTGQ 195
           PCSS  C S    +  +CS        C Y   Y D S + G + T+  T+   GS + +
Sbjct: 169 PCSSDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDR 228

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS 251
              L  +  GC T+  G     + G++ LG  +IS  S+      G+FSYCLV    P +
Sbjct: 229 KAKLQEVVLGCTTSYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 288

Query: 252 STK-INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP--DI---- 301
           +T  + FG  G    P    TPL    +   FY +T+DA+SV  + L +     D+    
Sbjct: 289 ATSYLTFGPVGAAHSPS--RTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWDVKKNG 346

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLSQ---VPE 354
             ++DSGT+LT L       +++ +S  +   P    DP    E CY++ +  +   VP 
Sbjct: 347 GAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTMDP---FEYCYNWTATRRPPAVPR 403

Query: 355 VTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           + + F G A ++    ++ +  +  + C  + +G+   V + GNI+Q   L  +D+  + 
Sbjct: 404 LEVRFAGSARLRPPTKSYVIDAAPGVKCIGLQEGVWPGVSVIGNILQQEHLWEFDLANRW 463

Query: 413 VSFKPTDCTK 422
           + F+ + C  
Sbjct: 464 LRFQESRCAH 473


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 167/369 (45%), Gaps = 36/369 (9%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQC------YMQDSPLFDPKMSS 141
             Y +   +GTP  + + VADTGSDL W  C+  C    C       ++   +F   +SS
Sbjct: 10  GQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSS 69

Query: 142 TYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           ++K++PC +  C        SL         C Y   Y DGS + G  A ETVT+    G
Sbjct: 70  SFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEG 129

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
           + + L  +  GC  +  G       G++GLG    S   +      GKFSYCLV   S K
Sbjct: 130 RKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHK 189

Query: 255 -----INFGT----NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI-- 301
                + FG+      +++        L    +FY + +  IS+G   L + +   D+  
Sbjct: 190 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 249

Query: 302 ----VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
               ++DSG++LTFL +  Y   + ++  S+++ + V    G LE C++     +  VP 
Sbjct: 250 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPR 309

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 412
           +  HF  GA+ +    ++ +  ++ + C  F  +      + GNIMQ N L  +D+  + 
Sbjct: 310 LVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKK 369

Query: 413 VSFKPTDCT 421
           + F P+ CT
Sbjct: 370 LGFAPSSCT 378


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  138 bits (348), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 106/348 (30%), Positives = 158/348 (45%), Gaps = 37/348 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y   + +GTPPT  L V DTGSD++W QC PC   QCY Q   +FDP+ S +Y ++
Sbjct: 138 GSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPC--RQCYAQSGRVFDPRRSRSYAAV 195

Query: 147 PCSSSQC-----ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            C +  C                 C Y V+YGDGS + G+LATET+       +   +P 
Sbjct: 196 RCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF----ARGARVPR 251

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +  GCG +N GLF +    ++GLG G +SL +Q       +FSYC               
Sbjct: 252 VAVGCGHDNEGLFVAAAG-LLGLGRGRLSLPTQTARRYGRRFSYCF-------------- 296

Query: 262 IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ--RLGVSTPD--IVIDSGTTLTFLPQGYN 317
              G  +    + +    +V       VG +  RL  ST    +++DSGT++T L +   
Sbjct: 297 --QGSDLDHRTIIRTVHQHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVY 354

Query: 318 SNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR-GADVKLSRSNFFV 373
             +     +      +A    SL + CY      + +VP V++H   GA+V L   N+ +
Sbjct: 355 VAVREAFRAAAGGLRLAPGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLI 414

Query: 374 KV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            V +    C    G    V I GNI Q  F V +D ++Q V+  P  C
Sbjct: 415 PVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 120/428 (28%), Positives = 188/428 (43%), Gaps = 53/428 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  ++L H D+        + T  +R+R A+  S       N  S+ +      A +   
Sbjct: 33  GIRMKLTHVDA------KGNYTAPERVRRAIALS----RQINLASTRAEGGGVSAPVHWA 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y+    +G PP    A+ DTGS LIWTQC  C    C  QD P F+   S ++  +P
Sbjct: 83  TRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVP 142

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C    CA      C+    C + V+YG G    G L T+  T  S  G  +A   ++F  
Sbjct: 143 CQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQS-GGATLAFGCVSFTR 200

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINFGTNG 261
                 L  +  +G++GLG G +SL SQ   T A +FSYCL P      +S+ +  G   
Sbjct: 201 FAAPDVLHGA--SGLIGLGRGRLSLASQ---TGAKRFSYCLTPYFHNNGASSHLFVGAAA 255

Query: 262 IVSGPG--VVSTPLTKA------KTFYVLTIDAISVGNQRL--------------GVSTP 299
            +SG G  V+S    ++       TFY L +  I+VG  +L              G    
Sbjct: 256 SLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWEG 315

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGSLELCYSFNSLSQ-VPEV 355
            ++IDSG+  T L +     L+  ++  +      P  +  G + LC +   L + VP +
Sbjct: 316 GVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTL 375

Query: 356 TIHFR-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            +HF  GAD+ L   N++  + +   C ++ +G   S  I GN  Q N  + +D+    +
Sbjct: 376 VLHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQS--IIGNFQQQNMHILFDVGGGRL 433

Query: 414 SFKPTDCT 421
           SF+  DC+
Sbjct: 434 SFQNADCS 441


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 120/427 (28%), Positives = 192/427 (44%), Gaps = 38/427 (8%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN-QNSSISSSKASQADIIP 86
           GFS+E++HR S +SPFY  + T Y+R+   +  S  R ++     SS  S +A +  I  
Sbjct: 27  GFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSPEAFRLRISQ 86

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           ++  YL+++ IG+P      V DTGS L WTQCEPC  ++ + Q  P+F+   S TY+ L
Sbjct: 87  DDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPC--TRRFRQLPPIFNSTASRTYRDL 144

Query: 147 PCSSSQCA-SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           PC    C  + N   C    C Y ++Y  GS + G  A + +     + +   +P   FG
Sbjct: 145 PCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDIL----QSAENDRIP-FYFG 199

Query: 206 CGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL------VPVSSTK- 254
           C  +N        + K  GI+GL    +SL+ QM      +FSYCL       P  +T  
Sbjct: 200 CSRDNQNFSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSL 259

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTF--YVLTIDAISVGNQRLGVS------TPD----IV 302
           + FG +   S    +STP    +    Y L +  +SV   R+ +        PD     +
Sbjct: 260 LRFGNDIRKSRRKYLSTPFVSPRGMPNYFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTI 319

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSF--NSLSQVPEVTIH 358
           IDSGT +T++ Q     +++   +  +          L   +CY    ++    P +  H
Sbjct: 320 IDSGTAVTYISQTAYFPVITAFKNYFDQHGFQRVNIQLSGYICYKQQGHTFHNYPSMAFH 379

Query: 359 FRGADVKLSRSNFFVKVSED-IVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F+GAD  +     ++ V +    C   + I+     I G + Q N    YD   + + F 
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFT 439

Query: 417 PTDCTKQ 423
           P +C   
Sbjct: 440 PENCQDH 446


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  138 bits (347), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 112/379 (29%), Positives = 174/379 (45%), Gaps = 49/379 (12%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y + I +G+PP   L VADTGSDL W +C  C  +         F  + S+T+    
Sbjct: 80  SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTH 139

Query: 148 CSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C SS C  + Q + +  N       C+Y   Y DGS ++G  + ET TL +++G+ + L 
Sbjct: 140 CFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLK 199

Query: 201 GITFGCGTNNGGL------FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV------ 248
            I FGCG +  G       FN   +G++GLG G IS  SQ+       FSYCL+      
Sbjct: 200 SIAFGCGFHASGPSLIGSSFNG-ASGVMGLGRGPISFASQLGRRFGRSFSYCLLDYTLSP 258

Query: 249 -PVSSTKINFGTNGIVSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-- 301
            P S   I    +       ++S TPL    +A TFY ++I  + V   +L +  P +  
Sbjct: 259 PPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHID-PSVWS 317

Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-----LELCYSFN 347
                    VIDSGTTLTFL +     +LS     ++  P   P G+      +LC +  
Sbjct: 318 LDELGNGGTVIDSGTTLTFLTEPAYREILSAFKREVKL-PSPTPGGASTRSGFDLCVNVT 376

Query: 348 SLS--QVPEVTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI---TNSVPIYGNIMQTN 401
            +S  + P +++   G  +      N+F+ +SE I C   + +   +    + GN+MQ  
Sbjct: 377 GVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQG 436

Query: 402 FLVGYDIEQQTVSFKPTDC 420
           FL+ +D  +  + F    C
Sbjct: 437 FLLEFDRGKSRLGFSRRGC 455


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score =  137 bits (346), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 154/335 (45%), Gaps = 33/335 (9%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
           V DT SD+ W QC PCP   CY Q   L+DP  SS+     C+S  C  L   +  C+  
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
           N CQY V Y DG+ + G   ++ +T+   T    A+    FGC  G      F S   GI
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 287

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---- 275
           + LGGG  SL+SQ   T    FS+C  P   T+  F T G+  V+    V TP+ K    
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPP--PTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 345

Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 330
             TFY++ ++AI+V  QR+ V          +DS T +T LP   Y +   +    M   
Sbjct: 346 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 405

Query: 331 QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KG 386
           QP A P G L+ CY    +    +P +T+ F + A V+L  S    +      C  F  G
Sbjct: 406 QP-APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAG 459

Query: 387 ITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             + VP I GNI      V Y+I    V F+   C
Sbjct: 460 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score =  137 bits (346), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 154/335 (45%), Gaps = 33/335 (9%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
           V DT SD+ W QC PCP   CY Q   L+DP  SS+     C+S  C  L   +  C+  
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
           N CQY V Y DG+ + G   ++ +T+   T    A+    FGC  G      F S   GI
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 262

Query: 222 VGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGI--VSGPGVVSTPLTK---- 275
           + LGGG  SL+SQ   T    FS+C  P   T+  F T G+  V+    V TP+ K    
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPP--PTRRGFFTLGVPRVAAWRYVLTPMLKNPAI 320

Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 330
             TFY++ ++AI+V  QR+ V          +DS T +T LP   Y +   +    M   
Sbjct: 321 PPTFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMY 380

Query: 331 QPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KG 386
           QP A P G L+ CY    +    +P +T+ F + A V+L  S    +      C  F  G
Sbjct: 381 QP-APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAG 434

Query: 387 ITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             + VP I GNI      V Y+I    V F+   C
Sbjct: 435 PNDQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score =  137 bits (345), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 166/366 (45%), Gaps = 42/366 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     CE CP       D   +DPK SS+  ++ 
Sbjct: 84  YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVS 143

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+       G    V C+YSV YGDGS + G   T+ +     TG     PG  
Sbjct: 144 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNA 203

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            +TFGCG   GG     N    GI+G G  + S++SQ+    AGK    F++CL  +   
Sbjct: 204 TVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAA--AGKVKKIFAHCLDTIKGG 261

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
            I F    +V  P V +TPL      Y + + +I VG   L +             +IDS
Sbjct: 262 GI-FAIGNVVQ-PKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDS 319

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGAD 363
           GTTLT+LP+     +++ + +  + Q +        +C+ +  +     P +T HF   D
Sbjct: 320 GTTLTYLPELVFKEVMAAIFN--KHQDIVFHNVQDFMCFQYPGSVDDGFPTITFHFE-DD 376

Query: 364 VKLS--RSNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSF 415
           + L      +F     D+ C  F+ G   S     + + G+++ +N LV YD+E Q + +
Sbjct: 377 LALHVYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGW 436

Query: 416 KPTDCT 421
              +C+
Sbjct: 437 TDYNCS 442


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 187/394 (47%), Gaps = 43/394 (10%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
           L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD++W
Sbjct: 40  LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99

Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
             C+PCP  P++  +     LFD   SST K + C    C+ ++Q  SC   + C Y + 
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
           Y D S S+G    + +TL   TG     P    + FGCG++  G     +S   G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219

Query: 226 GGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV-- 281
             + S++SQ+  T   K  FS+CL  V    I F   G+V  P V +TP+   +  Y   
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYNVM 277

Query: 282 ---LTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 337
              + +D  S+   R  V     ++DSGTTL + P+    +L+    +++  QPV     
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334

Query: 338 GSLELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT--- 388
                C+SF  N     P V+  F  + VKL+    ++   + E++ C  ++  G+T   
Sbjct: 335 EETFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDE 393

Query: 389 -NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            + V + G+++ +N LV YD++ + + +   +C+
Sbjct: 394 RSEVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 167/365 (45%), Gaps = 40/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   + +GTPP       DTGSD++W     C+ CP       D  L+DPK SST  ++ 
Sbjct: 88  YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147

Query: 148 CSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA         CS  V C+YSV+YGDGS + G+   + +     TG     P   
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            + FGCG   GG   S +    GI+G G  + S++SQ+ T  AGK    F++CL  +   
Sbjct: 208 SVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLAT--AGKVKKIFAHCLDTIKGG 265

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
            I F    +V  P V +TPL   K  Y + +  I VG   L +   DI         +ID
Sbjct: 266 GI-FAIGDVVQ-PKVKTTPLVADKPHYNVNLKTIDVGGTTLELPA-DIFKPGEKRGTIID 322

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADV 364
           SGTTLT+LP+     ++  + +  +     D    L   YS +     P +T HF   D+
Sbjct: 323 SGTTLTYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFPTLTFHFE-DDL 381

Query: 365 KLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFK 416
            L      +F     D+ C  F+ G   S     + + G+++ +N LV YD+E + + + 
Sbjct: 382 ALHVYPHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWT 441

Query: 417 PTDCT 421
             +C+
Sbjct: 442 DYNCS 446


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 116/373 (31%), Positives = 176/373 (47%), Gaps = 41/373 (10%)

Query: 85  IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMS 140
           IP +   Y  +I IGTP        DTGSD++W     C+ CP       D  L+DP  S
Sbjct: 82  IPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTAS 141

Query: 141 STYKSLPCSSSQCASLNQK----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           ++ K++ C    CA+        SC+  + CQYS++YGDGS + G    + +     +G 
Sbjct: 142 ASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGD 201

Query: 196 A---VALPGITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSY 245
               +A   +TFGCG   GG   S      GI+G G  + S++SQ+  T AGK    FS+
Sbjct: 202 GQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQL--TSAGKVTKIFSH 259

Query: 246 CLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL---------GV 296
           CL  V+   I F    +V  P V +TPL      Y + +  I VG   L         G 
Sbjct: 260 CLDTVNGGGI-FAIGNVVQ-PKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGG 317

Query: 297 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVT 356
            +   +IDSGTTL +LP+     +LS + S      + +    L   YS +  +  PEVT
Sbjct: 318 GSRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVT 377

Query: 357 IHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDI 408
            HF G D+ L     ++  + +ED+ C  F+  G+       + + G++  +N LV YD+
Sbjct: 378 FHFDG-DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDL 436

Query: 409 EQQTVSFKPTDCT 421
           E Q + +   +C+
Sbjct: 437 ENQVIGWTNYNCS 449


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score =  136 bits (342), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 112/368 (30%), Positives = 167/368 (45%), Gaps = 48/368 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y+    IG PP    A+ DTGSDL+WTQC  C    C  Q  P ++   SST+  +PC+
Sbjct: 89  QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148

Query: 150 SSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +  CA+ +     C     C     YG G  + G L TE     S T +      + FGC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAGVVA-GTLGTEAFAFQSGTAE------LAFGC 201

Query: 207 GT----NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SSTKINF 257
            T      G L  +  +G++GLG G +SL+SQ   T A KFSYCL P      ++  +  
Sbjct: 202 VTFTRIVQGALHGA--SGLIGLGRGRLSLVSQ---TGATKFSYCLTPYFHNNGATGHLFV 256

Query: 258 GTNGIVSGPG-VVSTPLTKAKT---FYVLTIDAISVGNQRL--------------GVSTP 299
           G +  + G G V++T   K      FY L +  ++VG  RL              G+ + 
Sbjct: 257 GASASLGGHGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSG 316

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQ-VPEVT 356
            ++IDSG+  T L       L S +++ +    VA P  + +  LC +   + + VP V 
Sbjct: 317 GVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRVVPAVV 376

Query: 357 IHFR-GADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            HFR GAD+ +   +++  V +    +     G      + GN  Q N  V YD+     
Sbjct: 377 FHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDF 436

Query: 414 SFKPTDCT 421
           SF+P DC+
Sbjct: 437 SFQPADCS 444


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 105/403 (26%), Positives = 188/403 (46%), Gaps = 51/403 (12%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
           L R L+R     +  + +++      ++P   + A Y+   +IGTPP     + D   +L
Sbjct: 26  LRRGLDRQGMRGRILADATAAPPGGAVVPLHWSGACYVANFTIGTPPQAVSGIVDLSGEL 85

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
           +WTQC  C  S C+ Q+ P+FDP  S+TY++  C S  C S+  ++CSG   C Y     
Sbjct: 86  VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
           +GD   + G  +T+ + +G+  G+      + FGC   + G  +      +G VGLG   
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVSGPGVVS--TPL---------- 273
            SL+ Q   T    FSYCL P    K   +  G +  ++G G  +  TPL          
Sbjct: 197 WSLVGQSNVT---AFSYCLAPHGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSD 253

Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
             +  +Y + ++ I  G+  +  ++        + +++   L++LP      L  V+++ 
Sbjct: 254 DGSDPYYTVQLEGIKAGDVAVAAASSGGGAITILQLETFRPLSYLPDAAYQALEKVVTAA 313

Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--------ADVKLSRSNFFVKVSEDI 379
           + +  +A+P    +LC+   ++S VP++   F+G        +   L   N    V   I
Sbjct: 314 LGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAPPSKYLLGDGNGNGTVCLSI 373

Query: 380 VCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           + S       + V I G+++Q N    +D+E++T+SF+P DC+
Sbjct: 374 LSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  135 bits (341), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 116/408 (28%), Positives = 179/408 (43%), Gaps = 36/408 (8%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           V L+HR  P +P  ++   P   + +   RS  RL++      +S        +   +  
Sbjct: 56  VPLLHRHGPCAPSLSTDTPP--SMSEMFRRSHARLSYIVSGKKVSVPAHLGTSV--KSLE 111

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  +S GTP   ++ V DTGSDL W QC+PC   QC  Q  PLFDP  SSTY ++PC+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171

Query: 151 SQCASLNQKS----CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            +C  L   +    CS G  C +++SY DG+ + G    + +TL         +    FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG 265
           CG +   L       +      + SL +Q        FSYCL P  ++K  F   G    
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSE-SLGAQYGGGGG--FSYCL-PAVNSKPGFLAFGAGRN 283

Query: 266 P-GVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYN 317
           P G V TP+ +     TF  +T+  I+VG ++L +     +  +++DSGT +T L     
Sbjct: 284 PSGFVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQSTVY 343

Query: 318 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK 374
             L +     ++A  +    G L+ CY         VP++ + F  GA + L   N  + 
Sbjct: 344 RALRAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILV 401

Query: 375 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 C  F   G   +  + GN+ Q  F V +D       F+   C
Sbjct: 402 NG----CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score =  135 bits (339), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 101/403 (25%), Positives = 191/403 (47%), Gaps = 51/403 (12%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
           L R L++     +  + +++      ++P   + A+Y+   +IGTPP     + D   +L
Sbjct: 26  LRRGLDQQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQAVSGIVDLSGEL 85

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
           +WTQC  C  S C+ Q+ P+FDP  S+TY++  C S  C S+  ++CSG   C Y     
Sbjct: 86  VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
           +GD   + G  +T+ + +G+  G+      + FGC   + G  +      +G VGLG   
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196

Query: 229 ISLISQMRTTIAGKFSYCLV---PVSSTKINFGTNGIVSGPGVVS--TPL---------- 273
            SL+ Q   T    FSYCL    P   + +  G +  ++G G  +  TPL          
Sbjct: 197 WSLVGQSNVT---AFSYCLALHGPGKKSALFLGASAKLAGAGKSNPPTPLLGQHASNTSD 253

Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSM 327
             +  +Y + ++ I  G+  +  ++        + +++   L++LP      L  V+++ 
Sbjct: 254 DGSDPYYTVQLEGIKAGDVAVAAASSGGGAITVLQLETFRPLSYLPDAAYQALEKVVTAA 313

Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFV--------KVSEDI 379
           + +  +A+P    +LC+   ++S VP++   F+G     ++ + ++         V   I
Sbjct: 314 LGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQPSKYLLGDGNGNGTVCLSI 373

Query: 380 VCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           + S       + V I G+++Q N    +D+E++T+SF+P DC+
Sbjct: 374 LSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPADCS 416


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 176/367 (47%), Gaps = 47/367 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 43  NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC +  C S    +CSG  C Y  +     D   + G + TET  +G+ T        + 
Sbjct: 97  PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV---SSTKINFGTN 260
           FGC   +       T+G +GLG    SL++QM+ T   KFSYCL P     S+++  G++
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT---KFSYCLSPRGTGKSSRLFLGSS 207

Query: 261 GIVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF-- 311
             ++G       P + ++P   +  +Y+L++DAI  GN  +  +    ++   T   F  
Sbjct: 208 AKLAGGESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSL 267

Query: 312 -LPQGYNSNLLSVMSSM--IEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRGADVK 365
            +   Y +   +V  ++    A P+A P    +LC+      S +  P++   F+G    
Sbjct: 268 LVDSAYRAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGGGAA 327

Query: 366 LS--RSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVS 414
           L+   + + + V E  D  C+    +          V + G++ Q N    YD++++T+S
Sbjct: 328 LTVPPAKYLIDVGEEKDTACAAILSMARLNRTGLEGVSVLGSLQQENVHFLYDLKKETLS 387

Query: 415 FKPTDCT 421
           F+P DC+
Sbjct: 388 FEPADCS 394


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 165/372 (44%), Gaps = 46/372 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYKSLP 147
            Y +R  +GTP    + VADTGSDL W +C     +      SP  +F    S ++  + 
Sbjct: 100 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIA 159

Query: 148 CSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG-----------S 191
           CSS  C S     L   S     C Y   Y DGS + G + T++ T+            S
Sbjct: 160 CSSDTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDS 219

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
           + G+   L G+  GC     G     + G++ LG  +IS  S+      G+FSYCL  V 
Sbjct: 220 SGGRRAKLQGVVLGCAATYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCL--VD 277

Query: 252 STKINFGTNGIVSGPGVVS----TPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI--- 301
                  T+ +  GPG  +    TPL    +   FY +T+DA+ V  + L +   D+   
Sbjct: 278 HLAPRNATSYLTFGPGATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPA-DVWDV 336

Query: 302 ------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS--Q 351
                 ++DSGT+LT L       +++ +S  +   P    DP    E CY++      +
Sbjct: 337 DRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTMDP---FEYCYNWTDAGALE 393

Query: 352 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           +P++ +HF G A ++    ++ +  +  + C  V +G    V + GNI+Q   L  +D+ 
Sbjct: 394 IPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWEFDLR 453

Query: 410 QQTVSFKPTDCT 421
            + + FK T C 
Sbjct: 454 DRWLRFKHTRCA 465


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score =  134 bits (338), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 110/388 (28%), Positives = 184/388 (47%), Gaps = 43/388 (11%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
           L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD++W
Sbjct: 40  LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99

Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
             C+PCP  P++  +     LFD   SST K + C    C+ ++Q  SC   + C Y + 
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
           Y D S S+G    + +TL   TG     P    + FGCG++  G     +S   G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219

Query: 226 GGDISLISQMRTTIAGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV-- 281
             + S++SQ+  T   K  FS+CL  V    I F   G+V  P V +TP+   +  Y   
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGI-FAV-GVVDSPKVKTTPMVPNQMHYNVM 277

Query: 282 ---LTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 337
              + +D  S+   R  V     ++DSGTTL + P+    +L+    +++  QPV     
Sbjct: 278 LMGMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334

Query: 338 GSLELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT--- 388
                C+SF  N     P V+  F  + VKL+    ++   + E++ C  ++  G+T   
Sbjct: 335 EETFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDE 393

Query: 389 -NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
            + V + G+++ +N LV YD++ + + +
Sbjct: 394 RSEVILLGDLVLSNKLVVYDLDNEVIGW 421


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 158/359 (44%), Gaps = 35/359 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  G+P        DTGSD+ W QC PC    CY Q  P+FDP  S+TY ++
Sbjct: 157 DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCS-GHCYKQHDPVFDPTKSATYSAV 215

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC   QCA+   K  +   C Y V+YGDGS + G L+ ET++L ST      LPG  FGC
Sbjct: 216 PCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGC 271

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
           G  N G F      +    G  +SL SQ   T    FSYCL P   T   + T G  +  
Sbjct: 272 GQTNLGEFGGVDGLVGLGRGA-LSLPSQAAATFGATFSYCL-PSYDTTHGYLTMGSTTPA 329

Query: 267 G------VVSTPLTKAKTF---YVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFL 312
                  V  T + + + +   Y + + +I +G   L V     +    + DSGT LT+L
Sbjct: 330 ASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFTRDGTLFDSGTILTYL 389

Query: 313 -PQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKL 366
            P+ Y S  +      +  +  P  DP    + CY F   + +  P V   F  GA   L
Sbjct: 390 PPEAYASLRDRFKFTMTQYKPAPAYDP---FDTCYDFTGHNAIFMPAVAFKFSDGAVFDL 446

Query: 367 SRSNFFV---KVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S     +     +    C  F    +++P  I GN  Q    V YD+  + + F    C
Sbjct: 447 SPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDVAAEKIGFGQFTC 505


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/322 (32%), Positives = 158/322 (49%), Gaps = 46/322 (14%)

Query: 139 MSSTYKSLPCSSSQC---ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
           MSST+K++ C    C   + ++  +C+  N  C Y  SYGD S + G++  +T T  S  
Sbjct: 1   MSSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPN 60

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
           G  VA+  + FGCG  N GLF S  +GI G G G  SL SQ++    G+FSYCL  V+ +
Sbjct: 61  GVPVAVSELAFGCGDYNTGLFVSNESGIAGFGRGPQSLPSQLK---VGRFSYCLTLVTES 117

Query: 254 KINF----------GTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVS--- 297
           K +           G     +GP   STP+       TFY L+++ I+VG  RL      
Sbjct: 118 KSSVVILGTPPDPDGLRAHTTGP-FQSTPIIYNPLIPTFYYLSLEGITVGKTRLPFDKSV 176

Query: 298 -------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGSL--ELCYS 345
                  +   VIDSGT+LT LP+     +  ++   + AQ   P  D T  +   LC+ 
Sbjct: 177 FALKKDGSGGTVIDSGTSLTTLPEA----VFELLQEELVAQFPLPRYDNTPEVGDRLCFR 232

Query: 346 FNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFKGITN-SVPIYGNIMQT 400
                +   VP++ +H  GAD+ L R N+FV+  +  ++C    G  + ++ + GN  Q 
Sbjct: 233 RPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGNFQQQ 292

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           N  V YD+E   + F P  C K
Sbjct: 293 NMHVVYDVENNKLLFAPAQCDK 314


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 106/359 (29%), Positives = 168/359 (46%), Gaps = 37/359 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           +NY+I++  GTPP     V DTGS++ W  C PC  S C  +  P F+P  SSTY  L C
Sbjct: 122 SNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPC--SGCSSKQQP-FEPSKSSTYNYLTC 178

Query: 149 SSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +S QC  L    KS + VNC  +  YGD S  +  L++ET+++GS       +    FGC
Sbjct: 179 ASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVFGC 233

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN----FGTNGI 262
                GL   +T  +VG G   +S +SQ  T     FSYCL  + S+        G   +
Sbjct: 234 SNAARGLIQ-RTPSLVGFGRNPLSFVSQTATLYDSTFSYCLPSLFSSAFTGSLLLGKEAL 292

Query: 263 VSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
            S  G+  TPL   ++  +FY + ++ ISVG + + +    +          +IDSGT +
Sbjct: 293 -SAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSGTVI 351

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLS 367
           T L +   + +     S +    +A PT   + CY+  S   + P +T+HF    D+ L 
Sbjct: 352 TRLVEPAYNAMRDSFRSQLSNLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNLDLTLP 411

Query: 368 RSNFFVKVSED--IVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N     ++D  ++C  F     G  + +  +GN  Q    + +D+ +  +     +C
Sbjct: 412 LDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 105/336 (31%), Positives = 158/336 (47%), Gaps = 52/336 (15%)

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLA 183
           +C  + +P F P  SST+  LPC+SS C  L     +C+   C Y   YG G F+ G LA
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           TET+ +G  +      PG+ FGC T NG    + ++GIVGLG   +SL+SQ+     G+F
Sbjct: 146 TETLHVGGAS-----FPGVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQVGV---GRF 195

Query: 244 SYCL---VPVSSTKINFGTNGIVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
           SYCL        + I FG+   V+G    P ++  P   + ++Y + +  I+VG   L V
Sbjct: 196 SYCLRSDADAGDSPILFGSLAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPV 255

Query: 297 STPDI--------------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-- 339
           ++                 ++DSGTTLT+L  +GY     + +S M  A       G+  
Sbjct: 256 TSTTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRF 315

Query: 340 -LELCYSFNSLSQ-----VPEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKG 386
             +LC+  N+        VP + + F G      R   +V V E        + C +   
Sbjct: 316 GFDLCFDANAAGGGSGVPVPTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLP 375

Query: 387 ITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +   S+ I GN+MQ +  V YD++    SF P DC
Sbjct: 376 ASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADC 411


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 109/402 (27%), Positives = 192/402 (47%), Gaps = 51/402 (12%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTG 111
            D L R L +       +   ++ A  A ++P   +   Y+   +IGTPP    A+ D  
Sbjct: 25  HDDLRRGLEQATRGRLLAD--ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVA 82

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
            +L+WTQC  C   +C+ QD P+F P  SST+K  PC ++ C S+  +SCSG  C Y   
Sbjct: 83  GELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK-- 138

Query: 172 YGDGSFSNGN----LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
            G  +   GN     AT+T  +G+ T +      + FGC   +        +G +GLG  
Sbjct: 139 -GPPTQLRGNTSGFAATDTFAIGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRT 191

Query: 228 DISLISQMRTTIAGKFSYCLVPVS---STKINFGTNGIVSG-------PGVVSTPLTKAK 277
             SL++QM+ T   +FSYCL P +   S+++  G++  ++G       P + ++P   + 
Sbjct: 192 PWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGSSAKLAGGESTSTAPFIKTSPDDDSH 248

Query: 278 TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF---LPQGYNSNLLSVMSSM--IEAQP 332
            +Y+L++DAI  GN  +  +    ++   T   F   +   Y +   +V  ++    A P
Sbjct: 249 HYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVDSAYRAFKKAVTEAVGGAAAPP 308

Query: 333 VADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSE--DIVCSVFKG 386
           +A P    +LC+      S +  P++   F+G A + +  + + + V E  D  C+    
Sbjct: 309 MATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILS 368

Query: 387 IT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +          V + G++ Q +    YD++++T+SF+P DC+
Sbjct: 369 MAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 410


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  132 bits (333), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 167/356 (46%), Gaps = 36/356 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A  D   +L+WTQC  C    C+ QD P+F P  SST+K  
Sbjct: 54  NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 107

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  C S+    C+   C Y    G G  + G +AT+T  +G+      A   + FGC
Sbjct: 108 PCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 162

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNGIV 263
              +        +G +GLG    SL++QM+ T   +FSYCL P  +   +++  G +  +
Sbjct: 163 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKL 219

Query: 264 SG-----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT----FLPQ 314
           +G     P V ++P      +Y + ++ I  G+  + +      +   T +      +  
Sbjct: 220 AGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS 279

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFF 372
            Y     +VM+S + A P A P G+  E+C+    +S  P++   F+ GA + +  +N+ 
Sbjct: 280 VYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYL 338

Query: 373 VKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             V  D VC     I        + + I G+  Q N  + +D+++  +SF+P DC+
Sbjct: 339 FDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 109/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            ++FGCG   GG   S      GI+G G  + S++SQ+    AGK    F++CL  V+  
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAA--AGKVRKMFAHCLDTVNGG 267

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
            I F    +V  P V +TPL      Y + +  I VG   LG+ T           +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVSDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325

Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
           GTTL ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF 
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380

Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQT 412
           G DV L  S  ++  +  +++ C  F+  G+       + + G+++ +N LV YD+E Q 
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQA 439

Query: 413 VSFKPTDCT 421
           + +   +C+
Sbjct: 440 IGWADYNCS 448


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 177/363 (48%), Gaps = 46/363 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    A+ D   +L+WTQC  C   +C+ QD P+F P  SST+K  PC +
Sbjct: 45  YVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGT 102

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVTLGSTTGQAVALPGITFGC 206
           + C S+  +SCSG  C Y    G  +   GN     AT+T  +G+ T +      + FGC
Sbjct: 103 AVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFAIGTATVR------LAFGC 153

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS---STKINFGTNGIV 263
              +        +G +GLG    SL++QM+ T   +FSYCL P +   S+++  G++  +
Sbjct: 154 VVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT---RFSYCLSPRNTGKSSRLFLGSSAKL 210

Query: 264 SG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF---LP 313
           +G       P + ++P      +Y+L++DAI  GN  +  +    ++   T   F   + 
Sbjct: 211 AGSESTSTAPFIKTSPDDDGSNYYLLSLDAIRAGNTTIATAQSGGILVMHTVSPFSLLVD 270

Query: 314 QGYNSNLLSVMSSM--IEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRG-ADVKLS 367
             Y +   +V  ++    A P+A P    +LC+      S +  P++   F+G A + + 
Sbjct: 271 SAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVP 330

Query: 368 RSNFFVKVSE--DIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            + + + V E  D  C+    +          V + G++ Q +    YD++++T+SF+P 
Sbjct: 331 PAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPA 390

Query: 419 DCT 421
           DC+
Sbjct: 391 DCS 393


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 175/367 (47%), Gaps = 44/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 89  YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 148

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 149 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 208

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            +TFGCG+  GG     N    GI+G G  + S++SQ+  + AGK    F++CL  ++  
Sbjct: 209 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 266

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
            I F    +V  P V +TPL      Y + + +I VG   L +             +IDS
Sbjct: 267 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 324

Query: 306 GTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 362
           GTTLT+LP+  Y   +L+V +   + + +        LC+ +        P++T HF   
Sbjct: 325 GTTLTYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN- 380

Query: 363 DVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
           D+ L+    ++F +  +++ C  F+  G+ +     + + G+++ +N LV YD+E Q + 
Sbjct: 381 DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIG 440

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 441 WTEYNCS 447


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 106/367 (28%), Positives = 175/367 (47%), Gaps = 44/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 4   YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 64  CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            +TFGCG+  GG     N    GI+G G  + S++SQ+  + AGK    F++CL  ++  
Sbjct: 124 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 181

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
            I F    +V  P V +TPL      Y + + +I VG   L +             +IDS
Sbjct: 182 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 239

Query: 306 GTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 362
           GTTLT+LP+  Y   +L+V +   + + +        LC+ +        P++T HF   
Sbjct: 240 GTTLTYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN- 295

Query: 363 DVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVS 414
           D+ L+    ++F +  +++ C  F+  G+ +     + + G+++ +N LV YD+E Q + 
Sbjct: 296 DLPLNVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIG 355

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 356 WTEYNCS 362


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score =  132 bits (331), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 109/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            ++FGCG   GG   S      GI+G G  + S++SQ+    AGK    F++CL  V+  
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAA--AGKVRKMFAHCLDTVNGG 267

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
            I F    +V  P V +TPL      Y + +  I VG   LG+ T           +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325

Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
           GTTL ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF 
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380

Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQT 412
           G DV L  S  ++  +  +++ C  F+  G+       + + G+++ +N LV YD+E Q 
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQA 439

Query: 413 VSFKPTDCT 421
           + +   +C+
Sbjct: 440 IGWADYNCS 448


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 107/376 (28%), Positives = 162/376 (43%), Gaps = 46/376 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     C  CP       D   +DPK SS+  ++ 
Sbjct: 87  YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVS 146

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+       G    V C+YSV YGDGS + G   T+ +     TG     PG  
Sbjct: 147 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNA 206

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTKI 255
            ITFGCG   GG     N    GI+G G  + S++SQ+      K  F++CL  +    I
Sbjct: 207 TITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGI 266

Query: 256 --------------NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----- 296
                          F  +G+++ P  +   +  ++  Y + + +I VG   L +     
Sbjct: 267 FAIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVF 326

Query: 297 ---STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP 353
                   +IDSGTTLT+LP+     ++ V+ S        +    L   YS +     P
Sbjct: 327 ETGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQYSGSVDDGFP 386

Query: 354 EVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVG 405
            +T HF   D+ L      +F     DI C  F+ G   S     + + G+++ +N LV 
Sbjct: 387 TITFHFE-DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVV 445

Query: 406 YDIEQQTVSFKPTDCT 421
           YD+E Q + +   +C+
Sbjct: 446 YDLENQVIGWTDYNCS 461


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 122/441 (27%), Positives = 187/441 (42%), Gaps = 51/441 (11%)

Query: 28  GFSVELIHRDSPK----SPFYNSSETPYQRLR------DALTRSLNRLNHFNQNSSISSS 77
           G   E+ H  SPK    S F    ++     R      +A  + ++ L H  +  +   S
Sbjct: 42  GVWFEMFHMHSPKLKSQSKFLGPPKSRLDGTRQLLQSDNARRQMISSLRHGTRRKAFEVS 101

Query: 78  KASQADIIPN----NANYLIRISIGTP-PTERLAVADTGSDLIWTQCE----PCPPSQCY 128
             +Q  I        + Y + I IGTP P + + V DTGSDL W  CE     CP    +
Sbjct: 102 HTAQIPIHSGADSGQSQYFVSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPH 161

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGN 181
                +F    SS+++++PCSS  C        SL +       C +   Y +G  + G 
Sbjct: 162 --PGRVFRANDSSSFRTIPCSSDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGV 219

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            A ETVT+G    + + L  +  GC T +    N    G++GLG    SL  ++      
Sbjct: 220 FANETVTVGLNDHKKIRLFDVLIGC-TESFNETNGFPDGVMGLGYRKHSLALRLAEIFGN 278

Query: 242 KFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRL 294
           KFSYCLV   S+      ++FG    +  P +  T L       FY + +  ISVG   L
Sbjct: 279 KFSYCLVDHLSSSNHKNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSML 338

Query: 295 GVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---C 343
            +S+          +++DSGT+LT L       ++  +  + +      P    EL   C
Sbjct: 339 SISSDIWNVTGVGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFC 398

Query: 344 YSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQ 399
           +      +  VP + IHF  GA  K    ++ + V+E I C  + K       I GN+MQ
Sbjct: 399 FEDKGFDRAAVPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQ 458

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            N L  YD+ +  + F P+ C
Sbjct: 459 QNHLWEYDLGRGKLGFGPSSC 479


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 187/396 (47%), Gaps = 53/396 (13%)

Query: 57  ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           A+ RS +RL+        N+  +  +++Q  +   + +Y +   IGTP T     ADTGS
Sbjct: 54  AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
           DLIWT+C  C  ++C  + SP + P  SS+   + C    C  L +  CS V        
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
           NC Y  +YG+      ++ G L TET T G     A A PGI FGC   + G F +  +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVP--VSSTKINFGTNGIVSGPG--------VVS 270
           +VGLG G +SL++Q+       F Y L     + + I+FG+   V+G          +++
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284

Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNS 318
            P+ +   FY + +  ISVG + + +               ++ DSGTTLT LP   Y  
Sbjct: 285 NPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344

Query: 319 NLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV- 375
               ++S M   +P         +C++  +S +  P + +HF  GAD+ LS  N+  ++ 
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404

Query: 376 ---SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
               E   C      + ++ I GNIMQ +F V +D+
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 124/425 (29%), Positives = 205/425 (48%), Gaps = 64/425 (15%)

Query: 52  QRLRDALTR-----SLNRLNHFN-QNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
           +++R++L+R       N+ NH + + +  +S   S    + + A + +++ IG+      
Sbjct: 55  EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-- 163
           A+ DTGS+ +  QC          +  P+FDP  S +Y+ +PC S  C ++ Q++ +G  
Sbjct: 115 AIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSS 166

Query: 164 -------VNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTN-NGGL 213
                    C YS+SYGD   S G+ + + + L ST  +GQAV    + FGC  +  G L
Sbjct: 167 QPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFL 226

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCL-----VPVSSTKINFGTNGI----V 263
            +  + GIVG   G++SL SQ++  + G KFSYC       P ++  I  G +G+    V
Sbjct: 227 VDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGVIFLGDSGLSKSKV 286

Query: 264 SGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV---------STPD--IVIDSGTTLT- 310
               ++  P+T A++  Y + + +ISV  + L +         ST D   V+DSGTT T 
Sbjct: 287 GYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGTVLDSGTTFTR 346

Query: 311 FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFR-GADV 364
            +   Y +  N  +  +     + V    G  + CY+    +SL  VPEV +  +    +
Sbjct: 347 VVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNISAGSSLPGVPEVRLSLQNNVRL 405

Query: 365 KLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           +L   + FV VS    E  VC    S  K     + + GN  Q+N+LV YD E+  V F+
Sbjct: 406 ELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYLVEYDNERSRVGFE 465

Query: 417 PTDCT 421
             DC+
Sbjct: 466 RADCS 470


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  131 bits (330), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 105/365 (28%), Positives = 173/365 (47%), Gaps = 38/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W     C+ CP       D  L+DPK SS+  ++ 
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146

Query: 148 CSSSQCASL---NQK--SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---A 198
           C +  CA+     +K   C +G  C+Y   YGDGS + G+  ++++     +G A    A
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSST 253
              + FGCG   GG     N    GI+G G  + S +SQ+ +   +   FS+CL  +   
Sbjct: 207 KANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGG 266

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
            I F    +V  P V STPL    + Y + + +I V    L +  P I         +ID
Sbjct: 267 GI-FAIGEVVQ-PKVKSTPLLPNMSHYNVNLQSIDVAGNALQLP-PHIFETSEKRGTIID 323

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADV 364
           SGTTLT+LP+    ++L+ +    +        G L   YS +     P++T HF   D+
Sbjct: 324 SGTTLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEYSESVDDGFPKITFHFE-DDL 382

Query: 365 KLSR--SNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
            L+    ++F +  +++ C  F+           + + G+++ +N +V YD+E+Q + + 
Sbjct: 383 GLNVYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWT 442

Query: 417 PTDCT 421
             +C+
Sbjct: 443 DYNCS 447


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 108/393 (27%), Positives = 179/393 (45%), Gaps = 41/393 (10%)

Query: 67  HFNQNSSISSSKASQADI------IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ- 118
           H   +S+      + AD+      +P +   Y   I IGTPP +     DTGSD++W   
Sbjct: 52  HLTHDSNRRGRLLAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC 111

Query: 119 --CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSY 172
             C  CP       D  L+DPK SS+  ++ C    CA+       G    + C+YSV Y
Sbjct: 112 ISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMY 171

Query: 173 GDGSFSNGNLATETVTLGSTTGQAV---ALPGITFGCGTNNGGLF---NSKTTGIVGLGG 226
           GDGS + G   ++++     +G      A   + FGCG   GG     N    GI+G G 
Sbjct: 172 GDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQ 231

Query: 227 GDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTI 284
            + S++SQ+     +   FS+CL  +    I F    +V  P V STPL      Y + +
Sbjct: 232 SNTSMLSQLAAAGEVKKIFSHCLDTIKGGGI-FAIGDVVQ-PKVKSTPLVPDMPHYNVNL 289

Query: 285 DAISVGNQRLGV--------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
           ++I+VG   L +             +IDSGTTLT+LP+    ++L+ + +          
Sbjct: 290 ESINVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSV 349

Query: 337 TGSLELCYSFNSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT---- 388
              L + Y  +     P++T HF   D+ L+    ++F +  +++ C  F+  G+     
Sbjct: 350 QDFLCIQYFQSVDDGFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDG 408

Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             + + G+++ +N +V YD+E Q V +   +C+
Sbjct: 409 KDMVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 106/400 (26%), Positives = 173/400 (43%), Gaps = 72/400 (18%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-------------PSQCYMQDSPLFD 136
            Y +R  +GTP    L VADTGSDL W +C                 P+         F 
Sbjct: 86  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFR 145

Query: 137 PKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTL 189
           P  S T+  +PCSS+ C      SL   +     C Y   Y DGS + G +  +  T+ L
Sbjct: 146 PDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIAL 205

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV- 248
                +   L G+  GC T+  G     + G++ LG  +IS  S+  +   G+FSYCLV 
Sbjct: 206 SGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGRFSYCLVD 265

Query: 249 ---PVSSTK-INFGTNGIVS----GPGVVS-------------------TPLT---KAKT 278
              P ++T  + FG N   S      G+ S                   TPL    + + 
Sbjct: 266 HLAPRNATSYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRTRP 325

Query: 279 FYVLTIDAISVGNQRLGV--STPDI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
           FY +T+  +SV  + L +  +  D+      ++DSGT+LT L +     +++ +S  +  
Sbjct: 326 FYAVTVKGVSVAGELLKIPRAVWDVEQGGGAILDSGTSLTMLAKPAYRAVVAALSKRLAG 385

Query: 331 QP--VADPTGSLELCYSFNSLS------QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC 381
            P    DP    + CY++ S S       +P + +HF G A ++    ++ +  +  + C
Sbjct: 386 LPRVTMDP---FDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKC 442

Query: 382 -SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             + +G    + + GNI+Q   L  YD++ + + FK + C
Sbjct: 443 IGLQEGPWPGLSVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 187/396 (47%), Gaps = 53/396 (13%)

Query: 57  ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           A+ RS +RL+        N+  +  +++Q  +   + +Y +   IGTP T     ADTGS
Sbjct: 54  AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
           DLIWT+C  C  ++C  + SP + P  SS+   + C    C  L +  CS V        
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
           NC Y  +YG+      ++ G L TET T G     A A PGI FGC   + G F +  +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227

Query: 221 IVGLGGGDISLISQMRTTIAGKFSYCLVP--VSSTKINFGTNGIVSGPG--------VVS 270
           +VGLG G +SL++Q+       F Y L     + + I+FG+   V+G          +++
Sbjct: 228 LVGLGRGKLSLVTQLNVE---AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLT 284

Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNS 318
            P+ +   FY + +  ISVG + + +               ++ DSGTTLT LP   Y  
Sbjct: 285 NPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTL 344

Query: 319 NLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV- 375
               ++S M   +P         +C++  +S +  P + +HF  GAD+ LS  N+  ++ 
Sbjct: 345 VRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQ 404

Query: 376 ---SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
               E   C      + ++ I GNIMQ +F V +D+
Sbjct: 405 GQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score =  131 bits (329), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 124/423 (29%), Positives = 189/423 (44%), Gaps = 47/423 (11%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN--SSISSSK 78
           P+     GF  EL H      P+  SS   +   R +   S  R+          +S   
Sbjct: 30  PVAGSDAGFRAELHH------PYAGSSLPVHDMWRRSARASKARVARLEARLTGDMSVPL 83

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A  +D       Y + I IGTPP     +ADT SDL WTQC     +    Q  PLFDP 
Sbjct: 84  ARISD-----EGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLF--NDTAKQVEPLFDPA 136

Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            SS++  + CSS  C   N   K CS   C+Y   Y     + G LA E+ TL S   Q 
Sbjct: 137 KSSSFAFVTCSSKLCTEDNPGTKRCSNKTCRYVYPYVSVE-AAGVLAYESFTL-SDNNQH 194

Query: 197 VALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK- 254
           + +    FGCG   +G L  +  +GI+G+    +S++SQ+      KFSYCL P +  K 
Sbjct: 195 ICM-SFGFGCGALTDGNLLGA--SGILGMSPAILSMVSQLAIP---KFSYCLTPYTDRKS 248

Query: 255 --INFGTNGIVSGPGVVSTPLTKAKTF-YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF 311
             + FG    + G    + P+ K+ TF Y + +  +S+G +RL V      +  G T+  
Sbjct: 249 SPLFFGAWADL-GRYKTTGPIQKSLTFYYYVPLVGLSLGTRRLDVPAATFALKQGGTVVD 307

Query: 312 LPQGYNSNLLSVMSSMIEA------QPVADPT-GSLELCYSFNS-----LSQVPEVTIHF 359
           L            +++ EA       P+ + T    ++C++  S       Q P + ++F
Sbjct: 308 LGCTVGQLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYF 367

Query: 360 R-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
             GAD+ L R N+F + +  ++C ++  G    + I GN+ Q NF + +D+      F P
Sbjct: 368 DGGADMVLPRDNYFQEPTAGLMCLALVPG--GGMSIIGNVQQQNFHLLFDVHDSKFLFAP 425

Query: 418 TDC 420
           T C
Sbjct: 426 TIC 428


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 166/358 (46%), Gaps = 40/358 (11%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A  D   +L+WTQC  C    C+ QD P+F P  SST+K  
Sbjct: 24  NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 77

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  C S+    C+   C +    G G  + G +AT+T  +G+      A   + FGC
Sbjct: 78  PCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 132

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNGIV 263
              +        +G +GLG    SL++QM+ T   +FSYCL P  +   +++  G +  +
Sbjct: 133 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLT---RFSYCLAPHDTGKNSRLFLGASAKL 189

Query: 264 SG-----PGVVSTPLTKAKTFYVLTIDAISVGNQ-------RLGVSTPDIVIDSGTTLTF 311
           +G     P V ++P      +Y + ++ I  G+        R  V     V+     +  
Sbjct: 190 AGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDS 249

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 370
           + Q +   +++ + +   A PV +P    E+C+    +S  P++   F+ GA + +  +N
Sbjct: 250 VYQEFKKAVMASVGAAPTATPVGEP---FEVCFPKAGVSGAPDLVFTFQAGAALTVPPAN 306

Query: 371 FFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +   V  D VC     I        + + I G+  Q N  + +D+++  +SF+P DC+
Sbjct: 307 YLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 364


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 168/370 (45%), Gaps = 41/370 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD---SPLFDPKMSSTYKSL 146
            Y +R+ +GTP    + VADTGSDL W +C     S           +F P  S ++  L
Sbjct: 103 QYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPL 162

Query: 147 PCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSFSNG--NLATETVTLGSTTG-QAVA 198
           PC S  C S    +  +CS     C Y   Y D S + G   L + TV+L    G +   
Sbjct: 163 PCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAK 222

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
           L  +  GC T+  G     + G++ LG  +IS  S+  +   G+FSYCLV    P ++T 
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282

Query: 255 -INFGTNGIVSGPGVV--STPL-----TKAKTFYVLTIDAISVGNQRLGVSTPDI----- 301
            + FG      G       TPL      + + FY +++DA++V  +RL +  PD+     
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAGERLEI-LPDVWDFRK 341

Query: 302 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVPE 354
               ++DSGT+LT L       ++  +S      P    DP    E CY++  +S ++P 
Sbjct: 342 NGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP---FEYCYNWTGVSAEIPR 398

Query: 355 VTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           + + F G A +     ++ +  +  + C  V +G    V + GNI+Q   L  +D+  + 
Sbjct: 399 MELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDLANRW 458

Query: 413 VSFKPTDCTK 422
           + FK + C  
Sbjct: 459 LRFKQSRCAH 468


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 163/367 (44%), Gaps = 41/367 (11%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM---QDSPLFDPKMSSTY 143
           +   + + IS+GTPP   L   DTGS L W  C+ C  S C+    +   +FDP  S+TY
Sbjct: 71  HEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQIS-CHTTAPEAGSVFDPDKSTTY 129

Query: 144 KSLPCSSSQCASLNQKSCSGVN-------CQYSVSYG---DGSFSNGNLATETVTLGSTT 193
           + + CSS  CA + +   +          C YS+ YG    G +S G L T+ +TL S++
Sbjct: 130 ELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSS 189

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSS 252
                + G  FGC  ++   F    +G++G GG + S  +Q+ R T    FSYC  P   
Sbjct: 190 S---IIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCF-PGDH 243

Query: 253 TKINFGTNGIVSGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPD-----IVID 304
           T   F + G      +V T   P    ++ Y L    + V   RL V   +     +V+D
Sbjct: 244 TAEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEYTKRMMVVD 303

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-----PEVTIHF 359
           SGT  TFL           M+S ++A+     T   E C+  N    V     P V + F
Sbjct: 304 SGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGDSVDSGDLPTVEMRF 363

Query: 360 RGADVKLSRSNFFVKV--SEDIVCSVFK----GITNSVPIYGNIMQTNFLVGYDIEQQTV 413
            G  +KL   N F  +  S D +C  FK    G+ N V I GN    +F V YD++    
Sbjct: 364 IGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRN-VQILGNKATXSFRVVYDLQAMYF 422

Query: 414 SFKPTDC 420
            F+   C
Sbjct: 423 GFQAGAC 429


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 128/234 (54%), Gaps = 21/234 (8%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA-----LTRSLNRLNHFNQNSSI 74
           SP  + T   S++L  R S  S     S T  +  RD+     +T  LN+  + ++ S  
Sbjct: 61  SPFTSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNFNTDKLSGP 120

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
             S  SQ      +  Y  RI IG PP++   V DTGSD+ W QC PC  + CY Q  P+
Sbjct: 121 IISGTSQG-----SGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPC--ADCYRQADPI 173

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           F+P  S++Y  L C ++QC  L+Q  C   NC Y VSYGDGS++ G+  TETVT+G    
Sbjct: 174 FEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNKV 233

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
           + VAL     GCG NN GLF     G++GLGGG +S  +Q+ +T    FSYCLV
Sbjct: 234 KNVAL-----GCGHNNEGLF-VGAAGLIGLGGGPLSFPAQLNST---SFSYCLV 278


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score =  130 bits (326), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/394 (27%), Positives = 172/394 (43%), Gaps = 69/394 (17%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP------------LFD 136
            Y +R  +GTP    + +ADTGSDL W +C     PS      SP            +F 
Sbjct: 109 QYFVRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFR 168

Query: 137 PKMSSTYKSLPCSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG- 190
           P  S T+  +PCSS  C S     L   S S   C Y   Y D S + G + T++ T+  
Sbjct: 169 PGDSKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVAL 228

Query: 191 -------STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
                      +   L G+  GC T + G     + G++ LG  +IS  S+  +   G+F
Sbjct: 229 SGGRGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASDGVLSLGYSNISFASRAASRFGGRF 288

Query: 244 SYCLV----PVSSTK-INFGTNGIVSGPGVVS---------TPL---TKAKTFYVLTIDA 286
           SYCLV    P ++T  + FG     +GP   S         TPL    + + FY + +D+
Sbjct: 289 SYCLVDHLAPRNATSYLTFG-----AGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDS 343

Query: 287 ISVGNQRLGV--------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADP 336
           +SV    L +        S    +IDSGT+LT L       +++ +S  +   P    DP
Sbjct: 344 VSVDGVALDIPAEVWDVGSNGGTIIDSGTSLTVLATPAYKAVVAALSEQLAGLPRVAMDP 403

Query: 337 TGSLELCYSFNSLSQ------VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGIT 388
               + CY++ +         VP++ + F G A ++    ++ +  +  + C  V +G  
Sbjct: 404 ---FDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAW 460

Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             V + GNI+Q   L  +D+  + + F+ T CT+
Sbjct: 461 PGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCTQ 494


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 111/349 (31%), Positives = 149/349 (42%), Gaps = 53/349 (15%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
               CA L                         +   +    +  G   A+ G  FGCG 
Sbjct: 199 GGPVCAGL------------------------GIYAASACSAAQCG---AVQGFFFGCGH 231

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTKINFGTNGIV-SG 265
              GLFN    G++GLG    SL+ Q   T  G FSYCL   P ++  +  G  G   + 
Sbjct: 232 AQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAA 290

Query: 266 PGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTF----LPQGYNS 318
           PG  +T   P   A T+YV+ +  ISVG Q+L V        +          LP    +
Sbjct: 291 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYA 350

Query: 319 NLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFV 373
            L S   S + +   P A   G L+ CY+F     V  P V + F  GA V L       
Sbjct: 351 ALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL- 409

Query: 374 KVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                  C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 410 ----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 176/384 (45%), Gaps = 72/384 (18%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    I ++IG+PP     V DTGS+L W  C+  P        +  F+P +SS+Y   
Sbjct: 55  HNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 108

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PC+SS C +  +      SC   N  C   VSY D S + G LA ET +L        A 
Sbjct: 109 PCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 163

Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
           PG  FGC  + G       ++KTTG++G+  G +SL++QM   +  KFSYC+    S + 
Sbjct: 164 PGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI----SGED 216

Query: 256 NFGTNGIVSGPGVVS----TPLTKAKT--------FYVLTIDAISVGNQRL----GVSTP 299
            FG   +  GP   S    TPL  A T         Y + ++ I V  + L     V  P
Sbjct: 217 AFGVLLLGDGPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVP 276

Query: 300 D------IVIDSGTTLTFLPQG-YNS---NLLSVMSSMIEAQPVADPT----GSLELCYS 345
           D       ++DSGT  TFL    YNS     L     ++    + DP     G+++LCY 
Sbjct: 277 DHTGAGQTMVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTR--IEDPNFVFEGAMDLCYH 334

Query: 346 F-NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFK-----GITNSVPIYGN 396
              SL+ VP VT+ F GA++++S      +VS+    + C  F      GI   V   G+
Sbjct: 335 APASLAAVPAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYV--IGH 392

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
             Q N  + +D+ +  V F  T C
Sbjct: 393 HHQQNVWMEFDLVKSRVGFTETTC 416


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/374 (28%), Positives = 161/374 (43%), Gaps = 84/374 (22%)

Query: 83  DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
           D  PNN       N+L+ ++ GTPP     + DTGS + WTQC+ C              
Sbjct: 114 DHTPNNKLFDEDGNFLVDVAFGTPPQNFTLILDTGSSITWTQCKACT------------- 160

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
                                      V   Y+++YGD S S GN   +T+TL  +    
Sbjct: 161 ---------------------------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD--- 190

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KI 255
                  FG G NN G F S   G++GLG G +S +SQ  +     FSYCL    S   +
Sbjct: 191 -VFQKFQFGRGRNNKGDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSL 249

Query: 256 NFGTNG-----------IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STP 299
            FG              +V+GPG +     +   +Y + +  ISVGN+RL +     ++P
Sbjct: 250 LFGEKATSQSSSLKFTSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASP 304

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--P 353
             +IDS T +T LPQ   S L +     +   P+++        L+ CY+ +    V  P
Sbjct: 305 GTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLP 364

Query: 354 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYD 407
           E+ +HF  GADV+L+ +N      E  +C  F G + S     + I GN  Q +  V YD
Sbjct: 365 EIVLHFGGGADVRLNGTNIVWGSDESRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYD 424

Query: 408 IEQQTVSFKPTDCT 421
           I+   + F+   C+
Sbjct: 425 IQGGRIGFRSNGCS 438


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 126/421 (29%), Positives = 171/421 (40%), Gaps = 67/421 (15%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  SS+Y ++PC    CA L                         +   +    +  G  
Sbjct: 187 PAQSSSYAAVPCGGPVCAGL------------------------GIYAASACSAAQCG-- 220

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL--VPVSSTK 254
            A+ G  FGCG    GLFN    G++GLG    SL+ Q   T  G FSYCL   P ++  
Sbjct: 221 -AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 278

Query: 255 INFGTNGIV-SGPGVVST---PLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLT 310
           +  G  G   + PG  +T   P   A T+YV+ +  ISVG Q+L V        +     
Sbjct: 279 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG 338

Query: 311 F----LPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF-RG 361
                LP    + L S   S + +   P A   G L+ CY+F     V  P V + F  G
Sbjct: 339 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 398

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           A V L              C  F   G    + I GN+ Q +F V   I+  +V FKP+ 
Sbjct: 399 ATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSS 451

Query: 420 C 420
           C
Sbjct: 452 C 452


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 105/386 (27%), Positives = 170/386 (44%), Gaps = 57/386 (14%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
            Y +R  +GTP    L VADTGSDL W +C       S+        F P+ S T+  + 
Sbjct: 93  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPIS 152

Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----STTGQAVA 198
           C+S  C      SL      G  C Y   Y DGS + G + TE+ T+         +   
Sbjct: 153 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAK 212

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVSSTK 254
           L G+  GC ++  G     + G++ LG  D+S  S   +  AG+FSYCLV    P ++T 
Sbjct: 213 LKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGRFSYCLVDHLSPRNATS 272

Query: 255 -INFGTN--------------------GIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
            + FG N                         P    TPL    + + FY + + A+SV 
Sbjct: 273 YLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVSVA 332

Query: 291 NQRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSL 340
            Q L +            +++DSGT+LT L +     +++ +S  +   P    DP    
Sbjct: 333 GQFLKIPRAVWDVDAGGGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMDP---F 389

Query: 341 ELCYSFNSLS---QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYG 395
           E CY++ S S    +P++ +HF G A ++    ++ +  +  + C  + +G    + + G
Sbjct: 390 EYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIG 449

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
           NI+Q   L  +DI+ + + F+ + CT
Sbjct: 450 NILQQEHLWEFDIKNRRLKFQRSRCT 475


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 84/198 (42%), Positives = 109/198 (55%), Gaps = 12/198 (6%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQA----DIIPNNANYLIRISIGTPPTERLAVA 108
           R  +A   S++     N    +S +K+++      II  + NY++ I IGTP  +   + 
Sbjct: 92  RRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLMF 151

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDL WTQCEPC  S CY Q  P F+P  SS+Y ++ CSS  C   N +SCS  NC Y
Sbjct: 152 DTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSSYHNVSCSSPMCG--NPESCSASNCLY 208

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
            + YGDGS + G LA E  TL ++      L  I FGCG NN G+F   + GI+GLG G 
Sbjct: 209 GIGYGDGSVTVGFLAKEKFTLTNSD----VLDDIYFGCGENNKGVFIG-SAGILGLGPGK 263

Query: 229 ISLISQMRTTIAGKFSYC 246
            S   Q  TT    FSYC
Sbjct: 264 FSFPLQTTTTYNNIFSYC 281


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 174/379 (45%), Gaps = 54/379 (14%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
           A +      Y+    IG+PP    A+ DTGSDLIWTQC   C P  C  Q  P ++   S
Sbjct: 77  AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136

Query: 141 STYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           ST+  +PC+     CA+     C G++  C +  SYG G    G+L TE+    S T   
Sbjct: 137 STFVPVPCADKAGFCAANGVHLC-GLDGSCTFIASYGAGRVI-GSLGTESFAFESGT--- 191

Query: 197 VALPGITFGCGT----NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-- 250
                + FGC +     +G L  +  +G++GLG G +SL+SQ+  T   +FSYCL P   
Sbjct: 192 ---TSLAFGCVSLTRITSGAL--NDASGLIGLGRGRLSLVSQIGAT---RFSYCLTPYFH 243

Query: 251 --SSTKINFGTNGIVSGPGVVSTPLTKA------KTFYVLTIDAISVGNQRL-------- 294
              ++   F       G G  S P  K+       TFY L ++ I+VG  RL        
Sbjct: 244 SSGASSHLFVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTF 303

Query: 295 -------GVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS-LELCYS 345
                  G     ++ID+G+ LT L    Y +    V + +     V  P  S LELC +
Sbjct: 304 QLRQLFKGYWAGGVIIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGLELCVA 363

Query: 346 FNSLSQ-VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNF 402
                + VP +  HF  GAD+ +  ++++  V +   C  + +G  +S  I GN  Q + 
Sbjct: 364 REGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDS--IIGNFQQQDM 421

Query: 403 LVGYDIEQQTVSFKPTDCT 421
            + YD+ +   SF+  DCT
Sbjct: 422 HLLYDLRRGRFSFQTADCT 440


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 175/383 (45%), Gaps = 65/383 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++G+PP     V DTGS+L W  C+  P          +FDP  SS+Y  +
Sbjct: 52  HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 105

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC+S  C +  +      SC     C   +SY D S   GNLA++T  +G++     A+P
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 160

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
              FGC   G ++    +SKTTG++G+  G +S ++QM      KFSYC+    S+ I  
Sbjct: 161 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILL 217

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
           FG +       +  TPL +  T         Y + ++ I V N  L     V  PD    
Sbjct: 218 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 277

Query: 301 --IVIDSGTTLTFLPQGYNSNLLS--VMSSMIEAQPVADPT----GSLELCYSF----NS 348
              ++DSGT  TFL     + L +  V  +    + + DP     G+++LCY       +
Sbjct: 278 GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRT 337

Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNI 397
           L  +P VT+ FRGA++ +S      +V      S+ + C  F      G+ +   I G+ 
Sbjct: 338 LPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESY--IIGHH 395

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q N  + +D+ +  V F    C
Sbjct: 396 HQQNVWMEFDLAKSRVGFAEVRC 418


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 107/365 (29%), Positives = 171/365 (46%), Gaps = 39/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W    QC+ CP       D  L+D K SS+ K +P
Sbjct: 85  YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVP 144

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C    C  +N    +G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 145 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 204

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G  +S       GI+G G  + S+ISQ+ ++  +   F++CL  V+   
Sbjct: 205 SIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 264

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
           I F    +V  P V  TPL   +  Y + + A+ VG+  L +ST           +IDSG
Sbjct: 265 I-FAIGHVVQ-PKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSG 322

Query: 307 TTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 362
           TTL +LP+G Y   +  ++S    ++ + + D     +  YS +     P VT +F  G 
Sbjct: 323 TTLAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQ--YSESVDDGFPAVTFYFENGL 380

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
            +K+   ++    S D  C  ++        + ++ + G+++ +N LV YD+E Q + + 
Sbjct: 381 SLKVYPHDYLFP-SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWT 439

Query: 417 PTDCT 421
             +C+
Sbjct: 440 EYNCS 444


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score =  129 bits (323), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 169/365 (46%), Gaps = 39/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W    QC+ CP       D  L+D K SS+ K +P
Sbjct: 83  YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVP 142

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C    C  +N    +G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 143 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 202

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G  +S       GI+G G  + S+ISQ+ ++  +   F++CL  V+   
Sbjct: 203 SIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 262

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
           I F    +V  P V  TPL   +  Y + + A+ VG+  L +ST           +IDSG
Sbjct: 263 I-FAIGHVVQ-PKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSG 320

Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 362
           TTL +LP+G    L+  M S    ++ Q + D     +  YS +     P VT  F  G 
Sbjct: 321 TTLAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTCFQ--YSESVDDGFPAVTFFFENGL 378

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
            +K+   ++    S +  C  ++        + ++ + G+++ +N LV YD+E Q + + 
Sbjct: 379 SLKVYPHDYLFP-SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWA 437

Query: 417 PTDCT 421
             +C+
Sbjct: 438 EYNCS 442


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 110/383 (28%), Positives = 175/383 (45%), Gaps = 65/383 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++G+PP     V DTGS+L W  C+  P          +FDP  SS+Y  +
Sbjct: 59  HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 112

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC+S  C +  +      SC     C   +SY D S   GNLA++T  +G++     A+P
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 167

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
              FGC   G ++    +SKTTG++G+  G +S ++QM      KFSYC+    S+ I  
Sbjct: 168 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQMGLQ---KFSYCISGQDSSGILL 224

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
           FG +       +  TPL +  T         Y + ++ I V N  L     V  PD    
Sbjct: 225 FGESSFSWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGA 284

Query: 301 --IVIDSGTTLTFLPQGYNSNLLS--VMSSMIEAQPVADPT----GSLELCYSF----NS 348
              ++DSGT  TFL     + L +  V  +    + + DP     G+++LCY       +
Sbjct: 285 GQTMVDSGTQFTFLLGPVYTALKNEFVRQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRT 344

Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNI 397
           L  +P VT+ FRGA++ +S      +V      S+ + C  F      G+ +   I G+ 
Sbjct: 345 LPPLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESY--IIGHH 402

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q N  + +D+ +  V F    C
Sbjct: 403 HQQNVWMEFDLAKSRVGFAEVRC 425


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score =  128 bits (322), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 115/419 (27%), Positives = 181/419 (43%), Gaps = 91/419 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSP------------- 133
            Y +R  +GTP    L VADTGSDL W +C   +   P+  Y   +P             
Sbjct: 106 QYFVRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAA 165

Query: 134 ---------LFDPKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN 179
                    +F P  S T+  +PCSS  C      SL      G  C Y   Y DGS + 
Sbjct: 166 AASSSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAAR 225

Query: 180 GNLATETVTL-----GSTTGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G + T++ T+     G+   Q  A L G+  GC T+  G     + G++ LG  +IS  S
Sbjct: 226 GTVGTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDGVLSLGYSNISFAS 285

Query: 234 QMRTTIAGKFSYCLV----PVSSTK-INFGTNGIVSG---------------------PG 267
           +      G+FSYCLV    P ++T  + FG N  VS                       G
Sbjct: 286 RAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGPGG 345

Query: 268 VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQ 314
              TPL    + + FY +T++ ISV  + L +  P +V          +DSGT+LT L  
Sbjct: 346 ARQTPLLLDHRMRPFYAVTVNGISVDGELLRI--PRLVWDVAKGGGAILDSGTSLTVLVS 403

Query: 315 GYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS-------QVPEVTIHFRG-ADV 364
                +++ ++  +   P    DP    + CY++ S S        +PE+ +HF G A +
Sbjct: 404 PAYRAVVAALNKKLAGLPRVTMDP---FDYCYNWTSPSTGEDLTVAMPELAVHFAGSARL 460

Query: 365 KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +    ++ +  +  + C  + +G    V + GNI+Q   L  +D++ + + FK + CT+
Sbjct: 461 QPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCTQ 519


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 106/343 (30%), Positives = 149/343 (43%), Gaps = 74/343 (21%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVS 270
            GLF   T G++GLG                                  +G ++G     
Sbjct: 292 RGLFGG-TAGLMGLG---------------------------------PDGALAG----- 312

Query: 271 TPLTKAKTFYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYNSNLLSVMSS 326
            P      FY + +             G+   ++++DSGT +T L P  Y +        
Sbjct: 313 LPDGAPPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQ 372

Query: 327 M-IEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSED--IV 380
              E  P A P   L+ CY+     +V  P +T+   G AD+ +  +       +D   V
Sbjct: 373 FGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEGGADMTVDAAGMLFMARKDGSQV 432

Query: 381 CSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           C     ++  +  PI GN  Q N  V YD     + F   DC+
Sbjct: 433 CLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 475


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  128 bits (321), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 149/327 (45%), Gaps = 37/327 (11%)

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSN 179
           Q  M   P FD   SST     C S+ C  L   SC          C Y+  Y D S + 
Sbjct: 168 QQNMHALPYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTT 227

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G L  +  T G+      ++PG+ FGCG  N G+F S  TGI G G G +SL SQ++   
Sbjct: 228 GLLEVDKFTFGA----GASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV-- 281

Query: 240 AGKFSYCLVPVSSTK-----INFGTNGIVSGPGVV-STPLTKAK---TFYVLTIDAISVG 290
            G FS+C   V+  K     ++   +   +G G V STPL +     T Y L++  I+VG
Sbjct: 282 -GNFSHCFTAVNGLKQSTVLLDLLADLYKNGRGAVQSTPLIQNSANPTLYYLSLKGITVG 340

Query: 291 NQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           + RL V          T   +IDSGT++T LP      +    ++ I+   V        
Sbjct: 341 STRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPY 400

Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVCSVFKGITNSVPIYG 395
            C+S  S ++  VP++ +HF GA + L R N+  +V +D    ++C     + +     G
Sbjct: 401 TCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSMICLAINELGDERATIG 460

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           N  Q N  V YD++   +SF    C K
Sbjct: 461 NFQQQNMHVLYDLQNNMLSFVAAQCDK 487



 Score = 46.6 bits (109), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 40/132 (30%), Positives = 63/132 (47%), Gaps = 18/132 (13%)

Query: 287 ISVGNQRLGV---------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT 337
           I+VG+ RL V          T   +IDSGT++T LP      +    ++ I+   V    
Sbjct: 42  ITVGSTRLPVPESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNA 101

Query: 338 GSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNS 390
                C+S  S ++  VP++ +HF GA + L R N+  +V +D    I+C ++ KG  + 
Sbjct: 102 TGPYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DE 159

Query: 391 VPIYGNIMQTNF 402
             I GN  Q N 
Sbjct: 160 TTIIGNFQQQNM 171


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score =  128 bits (321), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 104/373 (27%), Positives = 174/373 (46%), Gaps = 48/373 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           ++  IGTPP E L + DT S+L W Q   C  + C     P F+P +SS++ S PC+SS 
Sbjct: 1   MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58

Query: 153 CASLN----QKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C   +    Q +C  S  +C + V+Y DGS + G +A E  +L S  G A  L  + FGC
Sbjct: 59  CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGC 118

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM----RTTIAGKFSYCLVPVSSTKIN------ 256
            + +       ++G +GL  G  S  +Q+    ++ ++ +FSYC  P  +  +N      
Sbjct: 119 ASKDLQRPVDFSSGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF-PNRAEHLNSSGVII 177

Query: 257 FGTNGIVSGPGVV-----STPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVI-------- 303
           FG +GI +            P+     FY + +  ISVG + L +      I        
Sbjct: 178 FGDSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNGGT 237

Query: 304 --DSGTTLTFLPQGYNSNLLSVM-SSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 356
             DSGTT++FL +  ++ L+      ++     +    + ELCY   +    L   P VT
Sbjct: 238 YFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVT 297

Query: 357 IHFR-GADVKLSRSNFFVKVSED----IVCSVFKG----ITNSVPIYGNIMQTNFLVGYD 407
           +HF+   D++L  ++ +V ++       +C  F          V + GN  Q ++L+ +D
Sbjct: 298 LHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHD 357

Query: 408 IEQQTVSFKPTDC 420
           +E+  + F P +C
Sbjct: 358 LERSRIGFAPANC 370


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score =  127 bits (320), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 114/394 (28%), Positives = 185/394 (46%), Gaps = 38/394 (9%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R  L R  +RL H        SS A     D +  N  Y  R+ IG+PP E   + DTGS
Sbjct: 52  RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SSTY+ + C++      N     GV C Y   Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISL 231
            + S S+G LA + ++ G  +   +      FGC T  +G L+  +  GI+GLG G +S+
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221

Query: 232 ISQM--RTTIAGKFSYCL--VPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDA 286
           + Q+  +  ++  FS C   + V    +  G  GI S PG+V +    +++ +Y + +  
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLG--GISSPPGMVFSHSDPSRSPYYNIELKE 279

Query: 287 ISVGNQ--RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS 339
           I V  +  +L   T D     ++DSGTT  + P+  Y +   ++M  +   + ++ P  +
Sbjct: 280 IHVAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPN 339

Query: 340 L-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGIT 388
             ++C+S        L +V PEV + F  G  + LS  N+     KVS      +FK   
Sbjct: 340 FKDICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGN 399

Query: 389 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +   + G I+  N LV Y+ E  T+ F  T+C++
Sbjct: 400 DQTTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score =  127 bits (320), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 182/392 (46%), Gaps = 34/392 (8%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R  L R  +RL H        SS A     D +  N  Y  R+ IG+PP E   + DTGS
Sbjct: 52  RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SSTY+ + C++      N     GV C Y   Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISL 231
            + S S+G LA + ++ G  +   +      FGC T  +G L+  +  GI+GLG G +S+
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221

Query: 232 ISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAIS 288
           + Q+  +  ++  FS C   +          GI S PG+V +    +++ +Y + +  I 
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281

Query: 289 VGNQ--RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 340
           V  +  +L   T D     ++DSGTT  + P+  Y +   ++M  +   + ++ P  +  
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341

Query: 341 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 390
           ++C+S        L +V PEV + F  G  + LS  N+     KVS      +FK   + 
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             + G I+  N LV Y+ E  T+ F  T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 172/367 (46%), Gaps = 41/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTG+D++W    QC+ CP       D  L++ K SS+ K +P
Sbjct: 73  YYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVP 132

Query: 148 CSSSQCASLNQKSCSGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           C    C  +N    +G       +C Y   YGDGS + G    + V     +G    A A
Sbjct: 133 CDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASA 192

Query: 199 LPGITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSS 252
              + FGCG    G           GI+G G  + S+ISQ+ ++  +   F++CL  V+ 
Sbjct: 193 NGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNG 252

Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVID 304
             I F    +V  P V +TPL   +  Y + + AI VG+  L +ST           +ID
Sbjct: 253 GGI-FAIGHVVQ-PTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIID 310

Query: 305 SGTTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 360
           SGTTL +LP G Y   +  ++S    ++ Q + D     +  YS +     P VT +F  
Sbjct: 311 SGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ--YSGSVDDGFPNVTFYFEN 368

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           G  +K+   ++   +SE++ C  ++        + ++ + G+++ +N LV YD+E Q + 
Sbjct: 369 GLSLKVYPHDYLF-LSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIG 427

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 428 WTEYNCS 434


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 117/437 (26%), Positives = 195/437 (44%), Gaps = 52/437 (11%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A+ GG   + IH  +P+S    +        +     S +      +N +  + ++S   
Sbjct: 35  ARGGGIGFKAIHVAAPQSRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +      Y   I +G+P  E + + DTGS+L W QC PC    C      ++D   S++Y
Sbjct: 93  LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPC--KVCAPSVDTIYDAARSASY 150

Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
           + + C++SQ C++ +Q +      G  CQ++  YGDGSFS G+L+T+T+ + +   G+ V
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
            +    FGC   +  L  +  +GI+GL  G ++L  Q+      KFS+C  P  S+ +N 
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNS 269

Query: 257 -----FGTNGI----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---IVID 304
                FG   +    V    V  T     + FY + +  +S+ +  L V  P    +++D
Sbjct: 270 TGVVFFGNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHEL-VFLPRGSVVILD 328

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFN----------- 347
           SG++ +   + ++S L     + ++ +P        D  G L  C+  +           
Sbjct: 329 SGSSFSSFVRPFHSQL---REAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTL 385

Query: 348 -SLSQVPE--VTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNFL 403
            SLS V E  VTI      V L  + F   V    +C  F+ G  N V + GN  Q N  
Sbjct: 386 PSLSLVFEDGVTIGIPSIGVLLPVARFQNHVK---MCFAFEDGGPNPVNVIGNYQQQNLW 442

Query: 404 VGYDIEQQTVSFKPTDC 420
           V YDI++  V F    C
Sbjct: 443 VEYDIQRSRVGFARASC 459


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 51/371 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP +     DTGSD++W     C  CP       D  L++PK SST   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G      CQY V YGDGS + G    + + L    G         
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
            I FGCG    G   S +    GI+G G  + S+ISQ+  T  +   F++CL  +S   I
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGT 307
            F    +V  P + +TP+   +  Y + ++ + VG+  L +             +IDSGT
Sbjct: 253 -FAIGEVVE-PKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGT 310

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTI 357
           TL +LP+   S  L +M  ++ AQP       L+L        C+ F  N     P VT 
Sbjct: 311 TLAYLPE---SIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTF 361

Query: 358 HFRGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQ 410
            F  + +  +    +  ++ +D+ C  ++         N V + G+++  N LV Y++E 
Sbjct: 362 KFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLEN 421

Query: 411 QTVSFKPTDCT 421
           QT+ +   +C+
Sbjct: 422 QTIGWTEYNCS 432


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score =  127 bits (319), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/371 (28%), Positives = 165/371 (44%), Gaps = 51/371 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP +     DTGSD++W     C  CP       D  L++PK SST   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G      CQY V YGDGS + G    + + L    G         
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
            I FGCG    G   S +    GI+G G  + S+ISQ+  T  +   F++CL  +S   I
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS--------TPDIVIDSGT 307
            F    +V  P + +TP+   +  Y + ++ + VG+  L +             +IDSGT
Sbjct: 253 -FAIGEVVE-PKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGT 310

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTI 357
           TL +LP   +S  L +M  ++ AQP       L+L        C+ F  N     P VT 
Sbjct: 311 TLAYLP---DSIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTF 361

Query: 358 HFRGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQ 410
            F  + +  +    +  ++ +D+ C  ++         N V + G+++  N LV Y++E 
Sbjct: 362 KFEESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLEN 421

Query: 411 QTVSFKPTDCT 421
           QT+ +   +C+
Sbjct: 422 QTIGWTEYNCS 432


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 118/384 (30%), Positives = 175/384 (45%), Gaps = 46/384 (11%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q ++I  S     D I  N  + + IS+GTP    L   DTGS + W QC+ C    CY 
Sbjct: 3   QAANIPDSAVIGDDSIRKN-QFFMGISLGTPAVFNLVTIDTGSTISWVQCQYC-IVHCYT 60

Query: 130 QDS---PLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV-----NCQYSVSYGDGSFSN 179
           QD    P F+   SSTY+ + CS+  C  ++  Q   SG      +C YS+ Y  G +S 
Sbjct: 61  QDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSA 120

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTT 238
           G L+ + +TL ++     ++    FGCG++N   +N  + GI+G G    S  +Q+ + T
Sbjct: 121 GYLSQDRLTLANS----YSIQKFIFGCGSDN--RYNGHSAGIIGFGNKSYSFFNQIAQLT 174

Query: 239 IAGKFSYCLVPVSSTKINFGTNGIVSGPGVV-STPLTKAKTF--------YVLTIDAISV 289
               FSYC     S + N G   I  GP V  S  L   + F        Y L    + V
Sbjct: 175 NYSAFSYCF---PSNQENEGFLSI--GPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMV 229

Query: 290 GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
              RL V  P       V+DSGT  TF+       L   ++  + A+     + S E+C+
Sbjct: 230 NGMRLQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICF 289

Query: 345 SFN----SLSQVPEVTIHFRGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP---IYGN 396
             N      S++P V I F  + +KL   N F+ + S+  +CS F+     VP   I GN
Sbjct: 290 HSNGDSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGN 349

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
               +F V +DI+Q+   F+   C
Sbjct: 350 RATRSFRVVFDIQQRNFGFEAGAC 373


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 118/396 (29%), Positives = 177/396 (44%), Gaps = 67/396 (16%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           SSSK +   +  +N      ++IGTPP     V DTGS+L W +C+  P        + +
Sbjct: 51  SSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEP------NFTSI 104

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           F+P  S TY  +PCSS  C +         +C     C + +SY D S   G+LA ET  
Sbjct: 105 FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFR 164

Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
            GS T      P   FGC   G+++    ++KTTG++G+  G +S ++QM      KFSY
Sbjct: 165 FGSLTR-----PATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSY 216

Query: 246 CLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTFYVLTIDAISVGNQRL-- 294
           C+  + ST          S        P V +STPL    +  Y + ++ I V N+ L  
Sbjct: 217 CISGLDSTGFLLLGEARYSWLKPLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPL 276

Query: 295 --GVSTPD------IVIDSGTTLTFLPQGYNSNLLS--------VMSSMIEAQPVADPTG 338
              V  PD       ++DSGT  TFL     S L          V+  + E Q V    G
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRKEFLLQTAGVLRVLNEPQYVFQ--G 334

Query: 339 SLELCYSFNS----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGIT 388
           +++LCY  +S    L  +P V + FRGA++ +S      +V       + + C  F G +
Sbjct: 335 AMDLCYLIDSTSSTLPNLPVVKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTF-GNS 393

Query: 389 NSVPI----YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + + I     G+  Q N  + YD+E   + F    C
Sbjct: 394 DELGISSFLIGHHQQQNVWMEYDLENSRIGFAELRC 429


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 105/368 (28%), Positives = 172/368 (46%), Gaps = 45/368 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G PP +     DTGSD++W     C+ CP          L+DP+ S++   + 
Sbjct: 82  YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    CA+    + Q     + CQYSV YGDGS + G    + +     TG    + A  
Sbjct: 142 CDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANG 201

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            + FGCG    G   + +    GI+G G  + S+ISQ+    AGK    F++CL  V   
Sbjct: 202 SVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQL--AAAGKVKRVFAHCLDNVKGG 259

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
            I F    +VS P V +TP+   +  Y + +  I VG   L + T DI         +ID
Sbjct: 260 GI-FAIGEVVS-PKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPT-DIFDTGDRRGTIID 316

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSF--NSLSQVPEVTIHFR 360
           SGTTL +LP+       S+M+ ++  QP        E   C+ +  N     P V  HF 
Sbjct: 317 SGTTLAYLPEVVYE---SMMTKIVSEQPGLKLHTVEEQFTCFQYTGNVNEGFPVVKFHFN 373

Query: 361 GA-DVKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTV 413
           G+  + ++  ++  ++ E++ C  ++  G+       + + G+++ +N LV YD+E Q +
Sbjct: 374 GSLSLTVNPHDYLFQIHEEVWCFGWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAI 433

Query: 414 SFKPTDCT 421
            +   +C+
Sbjct: 434 GWTDYNCS 441


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 111/382 (29%), Positives = 176/382 (46%), Gaps = 64/382 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP +   V DTGS+L W  C+  P        + +F+P  SS+Y  +
Sbjct: 36  HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 89

Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSS  C +  +   + V       C   VSY D S   GNLA++   +GS+     ALP
Sbjct: 90  PCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 144

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
           G  FGC   G ++    ++KTTG++G+  G +S ++Q+      KFSYC+    S+ +  
Sbjct: 145 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLL 201

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
           FG + +     +  TPL +  T         Y + +D I VGN+ L     +  PD    
Sbjct: 202 FGDSHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGA 261

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF---NSL 349
              ++DSGT  TFL     + L +      +    P+ DP     G+++LCY       L
Sbjct: 262 GQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYRVPAGGKL 321

Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFK-----GITNSVPIYGNIM 398
            ++P V++ FRGA++ +       KV       E + C  F      GI   V   G+  
Sbjct: 322 PELPAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFGNSDLLGIEAFV--IGHHH 379

Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
           Q N  + +D+ +  V F  T C
Sbjct: 380 QQNVWMEFDLVKSRVGFVETRC 401


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 176/413 (42%), Gaps = 85/413 (20%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC---------YMQDSP------- 133
            Y +R  +GTP    L VADTGSDL W +C                 Y   +P       
Sbjct: 54  QYFVRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSS 113

Query: 134 ----------LFDPKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFS 178
                     +F P  S T+  +PCSS  C      SL      G  C Y   Y DGS +
Sbjct: 114 VSAAASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAA 173

Query: 179 NGNLATETVTL---GSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
            G + T++ T+   G   G+      L G+  GC T+  G     + G++ LG  ++S  
Sbjct: 174 RGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDGVLSLGYSNVSFA 233

Query: 233 SQMRTTIAGKFSYCLV----PVSSTK-INFGTN--------------GIVSGPGVVSTPL 273
           S+      G+FSYCLV    P ++T  + FG N              G  + PG   TPL
Sbjct: 234 SRAAARFGGRFSYCLVDHLAPRNATSYLTFGPNPAVSSASASRTACAGSAAAPGARQTPL 293

Query: 274 ---TKAKTFYVLTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFL-PQGYNSN 319
               + + FY + ++ +SV  + L +  P +V          +DSGT+LT L    Y + 
Sbjct: 294 LLDHRMRPFYAVAVNGVSVDGELLRI--PRLVWDVQKGGGAILDSGTSLTVLVSPAYRAV 351

Query: 320 LLSVMSSMIEAQPVA-DPTGSLELCYSFNS-------LSQVPEVTIHFRG-ADVKLSRSN 370
           + ++   ++    VA DP    + CY++ S          VP + +HF G A ++    +
Sbjct: 352 VAALGKKLVGLPRVAMDP---FDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPPPKS 408

Query: 371 FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           + +  +  + C  + +G    V + GNI+Q   L  +D++ + + FK + C +
Sbjct: 409 YVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRCMQ 461


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  126 bits (317), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/167 (41%), Positives = 97/167 (58%), Gaps = 13/167 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++  G+P      + DTGS L W QC+PC    C++Q  PLFDP  S TYKSL 
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 173

Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+SSQC     A+LN   C  S   C Y+ SYGD S+S G L+ + +TL  +      LP
Sbjct: 174 CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 229

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           G  +GCG ++ GLF  +  GI+GLG   +S++ Q+ +     FSYCL
Sbjct: 230 GFVYGCGQDSDGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCL 275


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 183/377 (48%), Gaps = 58/377 (15%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IG+      A+ DTGS+ +  QC          +  P+FDP  S +Y+ +PC S  
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQL 52

Query: 153 CASLNQKSCSG-----VN----CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPG 201
           C ++ Q++ +G     VN    C YS+SYGD   S G+ + + + L ST  + QAV    
Sbjct: 53  CLAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRD 112

Query: 202 ITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAG-KFSYCL-----VPVSSTK 254
           + FGC  +  G L +  + GIVG   G++SL SQ++  + G KFSYC       P ++  
Sbjct: 113 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSKFSYCFPSQPWQPRATGV 172

Query: 255 INFGTNGI----VSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV---------STPD 300
           I  G +G+    VS   ++  P+T A++  Y + + +ISV  + L +         ST D
Sbjct: 173 IFLGDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGD 232

Query: 301 --IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQV 352
              V+DSGTT T  +   Y +  N  +  +     + V    G  + CY+    +SL  V
Sbjct: 233 GGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNISAGSSLPGV 291

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYGNIMQTNFL 403
           PEV +  +    ++L   + FV VS    E  VC    S  K     + + GN  Q+N+L
Sbjct: 292 PEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQSNYL 351

Query: 404 VGYDIEQQTVSFKPTDC 420
           V YD E+  V F+  DC
Sbjct: 352 VEYDNERSRVGFERADC 368


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 120/413 (29%), Positives = 191/413 (46%), Gaps = 65/413 (15%)

Query: 51  YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           Y+ LR+   R L R+        + +   S  D       Y  RI +GTPP +     DT
Sbjct: 13  YRTLREHDQRRLRRIL-----PEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDT 67

Query: 111 GSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKSLPCSSSQCASLNQKSCS--G 163
           GSD+ W  C PC  + C    +      +FDP+ S++  S+ C+  +C   +   CS   
Sbjct: 68  GSDVAWVNCVPC--TNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNS 125

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
           ++C YS  YGDGS + G L  + ++     +G + A  G   +TFGCG+N  G +   T 
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW--LTD 183

Query: 220 GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG----PGVVSTPL 273
           G+VG G  ++SL SQ+  +      F++CL        N G+  +V G    PG+V TP+
Sbjct: 184 GLVGFGQAEVSLPSQLSKQNVSVNIFAHCL-----QGDNKGSGTLVIGHIREPGLVYTPI 238

Query: 274 TKAKTFYVLTIDAISVGNQRLGVSTP---------DIVIDSGTTLTFLPQ-GYNSNLLSV 323
              ++ Y   ++ +++G     V+TP          +++DSGTTLT+L Q  Y+     V
Sbjct: 239 VPKQSHY--NVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKV 296

Query: 324 MSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK--VSED 378
              M         +G L + + F    +   P VT++F  GA + LS S++  K  ++  
Sbjct: 297 RDCM--------RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTG 348

Query: 379 IVCSVFKGITNSVPIYGNIMQTNF--------LVGYDIEQQTVSFKPTDCTKQ 423
           +    F  +  S  +YG +  T F        LV YD     + +K  DCTK+
Sbjct: 349 LSAYCFSWL-ESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCTKE 400


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 107/367 (29%), Positives = 175/367 (47%), Gaps = 39/367 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI +GTPP       DTGSD++W  C+P   CP +         FDP+ SST   L 
Sbjct: 41  YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C  S+C S NQ S S       C YS  YGDGS + G   ++         Q V   A  
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
            ITFGC  N  G     +    GI G G  D+S++SQ+ +  +A K FS+CL        
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGADPGG- 219

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
                G ++ PG+V TP+  ++  Y L +  I+V  Q+L +        +T   +ID GT
Sbjct: 220 GILVLGEITEPGMVYTPIVPSQPHYNLNLQGIAVNGQQLSIDPQVFATTNTRGTIIDCGT 279

Query: 308 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADV 364
           TL +L  + Y   + ++++++ ++ QP         L  + +S+ ++ P VT++F GA +
Sbjct: 280 TLAYLAEEAYEPFVNTIIAAVSQSTQPFMLKGNPCFL--TVHSIDEIFPSVTLYFEGAPM 337

Query: 365 KLSRSNFFVKV----SEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
            L   ++ ++     S  + C  ++        ++ + I G+++  + +  YD+E Q + 
Sbjct: 338 DLKPKDYLIQQLSPDSSPVWCIGWQKSGQQATDSSKMTILGDLVLKDKVFVYDLENQRIG 397

Query: 415 FKPTDCT 421
           +   DC+
Sbjct: 398 WTSFDCS 404


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 169/376 (44%), Gaps = 56/376 (14%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP     V DTGS+L W  C+  P        +  F+P +SS+Y   
Sbjct: 56  HNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 109

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PC+SS C +  +      SC   N  C   VSY D S + G LA ET +L        A 
Sbjct: 110 PCNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 164

Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
           PG  FGC  + G       +SKTTG++G+  G +SL++QM      KFSYC+    +  +
Sbjct: 165 PGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLP---KFSYCISGEDALGV 221

Query: 256 NFGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD--- 300
               +G  +   +  TPL  A T         Y + ++ I V  + L     V  PD   
Sbjct: 222 LLLGDGTDAPSPLQYTPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTG 281

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF-NSLS 350
               ++DSGT  TFL     S+L        +     + DP     G+++LCY    S +
Sbjct: 282 AGQTMVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFA 341

Query: 351 QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKG---ITNSVPIYGNIMQTNFLV 404
            VP VT+ F GA++++S      +VS+    + C  F     +     + G+  Q N  +
Sbjct: 342 AVPAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWM 401

Query: 405 GYDIEQQTVSFKPTDC 420
            +D+ +  V F  T C
Sbjct: 402 EFDLLKSRVGFTQTTC 417


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 114/385 (29%), Positives = 171/385 (44%), Gaps = 46/385 (11%)

Query: 60  RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           RS+N      ++   S         D +  +  +L+ +  GTP  +   + DTGSD  W 
Sbjct: 96  RSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWI 155

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
           QC  C    C+ + +  F+P +SS+Y +  C  S             +  Y++ Y D S+
Sbjct: 156 QCNSCSLGNCHNKKT--FNPSLSSSYSNRSCIPS------------TDTNYTMKYEDNSY 201

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD-ISLISQMR 236
           S G    + VTL     +    P   FGCG + GG F +  +G++GL  G+  SLISQ  
Sbjct: 202 SKGVFVCDEVTL-----KPDVFPKFQFGCGDSGGGEFGT-ASGVLGLAKGEQYSLISQTA 255

Query: 237 TTIAGKFSYCLVPVSST--KINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQ 292
           +    KFSYC  P   T   + FG   I + P +  T L    +   Y + +  ISV  +
Sbjct: 256 SKFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKK 315

Query: 293 RLGVS-----TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS--LELCY 344
           RL VS     +P  +IDSGT +T LP   Y +   +    M+    ++ P     L+ CY
Sbjct: 316 RLNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCY 375

Query: 345 SFNSLS----QVPEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYG 395
           +         ++PE+ +HF G  DV L  S   +  + D+   C  F   +N   V I G
Sbjct: 376 NLKGCGGRNIKLPEIVLHFVGEVDVSLHPSG-ILWANGDLTQACLAFARKSNPSHVTIIG 434

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           N  Q +  V YDIE   + F   DC
Sbjct: 435 NRQQVSLKVVYDIEGGRLGFG-NDC 458


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 117/438 (26%), Positives = 189/438 (43%), Gaps = 53/438 (12%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           ATF   +F L F     V P   Q+    + +I   S  SPF    +  +  +   +T +
Sbjct: 8   ATFF--LFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
                     S+++  K +   I P       ANY++R+ +GTP  +   V DT +D  W
Sbjct: 64  SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
             C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ SYG
Sbjct: 124 VPC-----SGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
             S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231

Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYVLTI 284
           SQ     +G FSYCL    S K  + +  +  GP      + +TPL +     + Y + +
Sbjct: 232 SQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNL 288

Query: 285 DAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
             +SVG  ++ + +  +V          IDSGT +T   Q     +       +   P++
Sbjct: 289 TGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PIS 347

Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSV-- 391
              G+ + C++  + ++ P +T+HF G ++ L   N  +  S   + C       N+V  
Sbjct: 348 S-LGAFDTCFAATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNS 406

Query: 392 --PIYGNIMQTNFLVGYD 407
              +  N+ Q N  + +D
Sbjct: 407 VLNVIANLQQQNLRIMFD 424


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score =  125 bits (313), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 110/369 (29%), Positives = 174/369 (47%), Gaps = 48/369 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            ++FGCG   GG   S      GI+G G  + S++SQ+    AGK    F++CL  V+  
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQL--AAAGKVRKMFAHCLDTVNGG 267

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDS 305
            I F    +V  P V +TPL      Y + +  I VG   LG+ T           +IDS
Sbjct: 268 GI-FAIGNVVQ-PKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDS 325

Query: 306 GTTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFR 360
           GTTL ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF 
Sbjct: 326 GTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFE 380

Query: 361 GADVKL--SRSNFFVKVSEDIVCSVFK---GITNS---VPIYGNIMQTNFLVGYDIEQQT 412
           G DV L  S  ++  +  +++ C  F+   G T     + + G+++ +N LV YD+E Q 
Sbjct: 381 G-DVSLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQA 439

Query: 413 VSFKPTDCT 421
           + +   +C+
Sbjct: 440 IGWADYNCS 448


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 118/427 (27%), Positives = 188/427 (44%), Gaps = 72/427 (16%)

Query: 51  YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           ++ +  A   SL+R  H  +  +++  K +      +   Y +  S+GTPP +   V DT
Sbjct: 35  WESINLAALSSLSRARHLKRPPTLTG-KVTLPAYPRSYGGYSVIFSLGTPPQKVSLVLDT 93

Query: 111 GSDLIWT---------QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
           GS L+WT          C+ C  S       P++    SST +SLPC S +C   N    
Sbjct: 94  GSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLPCRSPKC---NWVFG 150

Query: 162 SGVNCQ-------YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
           S +NC        Y + YG GS + G L ++ + L         +P   FGC      + 
Sbjct: 151 SDLNCSTTKRCPYYGLEYGLGS-TTGQLVSDVLGLSKLN----RIPDFLFGCSL----VS 201

Query: 215 NSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKI--------NFGT 259
           N +  GI G G G  S+ +Q+  T   KFSYCLV       P S   +        +   
Sbjct: 202 NRQPEGIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGDLVLHRGRRHADAAA 258

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQ------RLGVSTPD----IVIDSGTTL 309
           NG+   P   S  L+    +Y +++  I VG +      R  V + +    +++DSG+T 
Sbjct: 259 NGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKEGDGGMIVDSGSTF 318

Query: 310 TFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GA 362
           TF+ +         L   M+    A+ + D +G L  CY+    S+  VP++T  F+ GA
Sbjct: 319 TFMERIIFDPVARELEKHMTKYKRAKEIEDSSG-LGPCYNITGQSEVDVPKLTFSFKGGA 377

Query: 363 DVKLSRSNFFVKVSEDIVCSVF-------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           ++ L  +++F  V++ +VC             T    I GN  Q NF + YD+++Q   F
Sbjct: 378 NMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFYIEYDLKKQRFGF 437

Query: 416 KPTDCTK 422
           KP  C +
Sbjct: 438 KPQQCDR 444


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 171/363 (47%), Gaps = 37/363 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP        DTGSD++W     C+ CP       +  L+DP  SS+   + 
Sbjct: 81  YFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVT 140

Query: 148 CSSSQCASLNQK---SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA---VALP 200
           C    C + +     SC     CQYS+SYGDGS + G   T+ +     +G +   +A  
Sbjct: 141 CGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANT 200

Query: 201 GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            ITFGCG   GG   S +    GI+G G  + S++SQ+    AGK    F++CL  ++  
Sbjct: 201 SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA--AGKVRKVFAHCLDTINGG 258

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDS 305
            I F    +V  P V +TPL      Y + ++AI VG  +L + T   DI      +IDS
Sbjct: 259 GI-FAIGDVVQ-PKVSTTPLVPGMPHYNVNLEAIDVGGVKLQLPTNIFDIGESKGTIIDS 316

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DV 364
           GTTL +LP    + ++S + +     P+ +        YS +     P +T HF G   +
Sbjct: 317 GTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQCFRYSGSVDDGFPIITFHFEGGLPL 376

Query: 365 KLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
            +   ++  +  E + C  F+  G+       + + G++  +N LV YD+E Q + +   
Sbjct: 377 NIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIGWTDY 435

Query: 419 DCT 421
           +C+
Sbjct: 436 NCS 438


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 110/370 (29%), Positives = 171/370 (46%), Gaps = 48/370 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP       DTGSD++W     C+ CP       +   +DP  S T  ++ 
Sbjct: 85  YYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVG 142

Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           C    C +    + SGV          CQ+ ++YGDGS + G   T+ V     +G    
Sbjct: 143 CEQEFCVA--NSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQT 200

Query: 199 LP---GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
            P    ITFGCG   GG   S +    GI+G G  D S++SQ+     +   F++CL  V
Sbjct: 201 TPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTV 260

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
               I F    +V  P V +TPL    T Y + +  ISVG   L + T           +
Sbjct: 261 RGGGI-FAIGNVVQPPIVKTTPLVPNATHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 319

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFN-SL-SQVPEVTIHF 359
           IDSGTTL +LP+     LL   +++ +  P +A       +C+ F+ SL  + P +T  F
Sbjct: 320 IDSGTTLAYLPREVYRTLL---TAVFDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFSF 376

Query: 360 RGADVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQ 411
            G D+ L+    ++  +   D+ C  F   G+       + + G+++ +N LV YD+E+Q
Sbjct: 377 EG-DLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQ 435

Query: 412 TVSFKPTDCT 421
            + +   +C+
Sbjct: 436 VIGWTDYNCS 445


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score =  124 bits (312), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 173/359 (48%), Gaps = 43/359 (11%)

Query: 96  SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
           +IGTPP    A  D G  L+WTQC  C  S C+ Q+ P FDP  SSTY+  PC ++ C  
Sbjct: 29  TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCEF 88

Query: 156 L--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGG 212
              + ++CSG  C Y  S      ++G + T+ V +G+ T  +VA     FGC   ++  
Sbjct: 89  FPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDIK 143

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP--------------VSSTKINFG 258
           L +   +G VGL    +SL++QM  T    FS+CL P               ++     G
Sbjct: 144 LMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAAAKLAGGG 200

Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLPQ 314
            +  ++ P V S+P      +Y++ ++ I  G++ + ++ P     +++ + + ++FL  
Sbjct: 201 KSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAI-ITVPQSGRTVLLQTFSPVSFLVD 259

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRG-ADVKLSRS 369
           G   +L   +++ +   P A P    +    LC+    +S  P+V + F+G A + +  +
Sbjct: 260 GVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQGAAALTVPPT 318

Query: 370 NFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           N+ + V +D VC                + I G + Q N    YD+E++T+SF+  DC+
Sbjct: 319 NYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKETLSFEAADCS 377


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 116/440 (26%), Positives = 184/440 (41%), Gaps = 70/440 (15%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--------- 81
           +EL+HR   +           + ++  + R   R    NQ   + S+  S+         
Sbjct: 35  LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94

Query: 82  -ADI-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            A++ +P ++        Y   + +G+P      V DTGS+  W  C             
Sbjct: 95  PAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------- 141

Query: 133 PLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
                  S +++++ C+S +C        SL+        C Y +SY DGS + G   T+
Sbjct: 142 -------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTD 194

Query: 186 TVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           ++T+G T G+   L  +T GC  +  NG  FN +T GI+GLG    S I +       KF
Sbjct: 195 SITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKF 254

Query: 244 SYCLVPVSSTK-------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV 296
           SYCLV   S +       I    N  + G  +  T L     FY + +  IS+G Q L +
Sbjct: 255 SYCLVDHLSHRSVSSNLTIGGHHNAKLLGE-IRRTELILFPPFYGVNVVGISIGGQMLKI 313

Query: 297 --------STPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSF 346
                   +    +IDSGTTLT  L   Y +   ++  S+ + + V  +   +LE C+  
Sbjct: 314 PPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDA 373

Query: 347 NSL--SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTN 401
                S VP +  HF  GA  +    ++ + V+  + C     I       + GNIMQ N
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQN 433

Query: 402 FLVGYDIEQQTVSFKPTDCT 421
            L  +D+   TV F P+ CT
Sbjct: 434 HLWEFDLSTNTVGFAPSTCT 453


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  124 bits (311), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 109/355 (30%), Positives = 153/355 (43%), Gaps = 71/355 (20%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 125 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 182

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 183 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 237

Query: 211 GGLFNSKTT---------GIVGLGGGDISL---ISQMRTTIAGKFSYCLVPVSSTKINFG 258
            GL    +          G  G   G +SL    S  R            PVS T+    
Sbjct: 238 RGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNA---------TPVSYTR---- 284

Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTI---DAISVGNQRLGVSTPDIVIDSGTTLTFL-PQ 314
              +++ P            FY + +             G+   ++++DSGT +T L P 
Sbjct: 285 ---MIADPA--------QPPFYFMNVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPS 333

Query: 315 GYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSN 370
            Y +           E  P A P   L+ CY+     +  VP +T+    GAD+ +  + 
Sbjct: 334 VYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAG 393

Query: 371 FFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                 +D   VC     ++  +  PI GN  Q N  V YD     + F   DC+
Sbjct: 394 MLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 94/310 (30%), Positives = 142/310 (45%), Gaps = 31/310 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             C S+ + S  C+   C Y      G  + G   T+T  +G+      A   + FGC  
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167

Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG-TNGIVSG 265
                  +    +GIVGLG    SL++QM  T    FSYCL   SS  +  G T   ++G
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT---AFSYCLAGKSSGALFLGATAKQLAG 224

Query: 266 PGVVSTPLT----------KAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLTFL 312
               STP             +  +Y++ +  I  G   L  ++     +++D+ +  ++L
Sbjct: 225 GKNSSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYL 284

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNF 371
             G    L   +++ +  QPVA P    +LC+        PE+   F  GA + +  +N+
Sbjct: 285 ADGAYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANY 344

Query: 372 FVKVSEDIVC 381
            +      VC
Sbjct: 345 LLASGNGTVC 354


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 148/338 (43%), Gaps = 36/338 (10%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN- 165
           DT  D+ W QC PC   QCY Q +  FDP+ SST   + C S  C +L      CS  N 
Sbjct: 164 DTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNS 223

Query: 166 ---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
              C Y + Y D   + G   T+T+T+  +T          FGC     G F+++ +G +
Sbjct: 224 TGDCLYRIEYSDHRLTLGTYMTDTLTISPST----TFLNFRFGCSHAVRGKFSAQASGTM 279

Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP------GVVSTPLTKA 276
            LGGG  SL+SQ        FSYC VP  S        G V+G          +TPL ++
Sbjct: 280 SLGGGPQSLLSQTARAYGNAFSYC-VPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRS 338

Query: 277 K-----TFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 327
                 T YV+ +  I V  +RL V     +   V+DS   +T LP      L     + 
Sbjct: 339 ANVINPTIYVVRLQGIEVAGRRLNVPPVVFSGGTVMDSSAVITQLPPTAYRALRLAFRNA 398

Query: 328 IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF 384
           + A     PTG+L+ C+ F  +S+  VP V++ F  GA ++L   +  +       C  F
Sbjct: 399 MRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVFDGGAVIELGLLSVLLD-----SCLAF 453

Query: 385 KGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +    ++   GN+ Q    V YD+    V F+   C
Sbjct: 454 APMAADFALGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  123 bits (309), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 178/384 (46%), Gaps = 35/384 (9%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            RL  F  + ++S+++    D +  N  Y  R+ IGTPP +   + DTGS + +  C  C
Sbjct: 55  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
              QC     P FDP+ SSTYK + C+    C S       GV C Y   Y + S S+G 
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166

Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RT 237
           L  + ++ G+   Q+  +P    FGC     G LF+ +  GI+GLG GD+SL+ Q+  + 
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223

Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV 296
            I   FS C   +          GI     ++ T     ++ +Y + +  I V  ++L +
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283

Query: 297 STP------DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 345
           S+         V+DSGTT  +LP + +++   ++M  +   + +  P  +  ++C+S   
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343

Query: 346 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 398
                LS + P V + F  G  + L+  N+F    KV       +F+   +   + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403

Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
             N LV YD     + F  T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 178/384 (46%), Gaps = 35/384 (9%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            RL  F  + ++S+++    D +  N  Y  R+ IGTPP +   + DTGS + +  C  C
Sbjct: 55  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
              QC     P FDP+ SSTYK + C+    C S       GV C Y   Y + S S+G 
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166

Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RT 237
           L  + ++ G+   Q+  +P    FGC     G LF+ +  GI+GLG GD+SL+ Q+  + 
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223

Query: 238 TIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV 296
            I   FS C   +          GI     ++ T     ++ +Y + +  I V  ++L +
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283

Query: 297 STP------DIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 345
           S+         V+DSGTT  +LP + +++   ++M  +   + +  P  +  ++C+S   
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343

Query: 346 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 398
                LS + P V + F  G  + L+  N+F    KV       +F+   +   + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403

Query: 399 QTNFLVGYDIEQQTVSFKPTDCTK 422
             N LV YD     + F  T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 123/451 (27%), Positives = 197/451 (43%), Gaps = 49/451 (10%)

Query: 4   FLSCVFILFFLCFYVVSP----IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
           F+ C+  L  LCF    P    ++    GF V L+H  S +SPFY  + T  +  + ++ 
Sbjct: 9   FMICIQTL--LCFSSSLPDHVLLKDNRLGFKVPLLHWLSTESPFYEPNLTLAELTQASIR 66

Query: 60  RSLNR---LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
            S  R   +      +  SS K   + +   +  Y+++ SIG+P  +  A+ D+GS L+W
Sbjct: 67  TSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVW 126

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK--SCSGVN--CQYSVS 171
            QC       CY Q  PLF+P  S TY    C++++C  +L  +   C   N  C+Y   
Sbjct: 127 LQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHED 186

Query: 172 YGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
           Y D S++ G ++T+  T     +G       I FGCG NN    +    G+VGL     S
Sbjct: 187 YLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKAS 246

Query: 231 LISQMRTTIAGKFSYCLVPVS------STKINFGTNGIVSGPGVVSTPLTKAKTFYVL-T 283
           L+ QM      +FSYC+   +      S +I FG    +SG      P   +  +Y+   
Sbjct: 247 LVGQMD---VDQFSYCVSIDTEQNLKGSMEIRFGLAASISGHSTQLVP--NSDGWYIFKN 301

Query: 284 IDAISVGNQRLGVSTPDIV------------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           +D I V N+      P  V            +D+GTT T L       L+ ++   I   
Sbjct: 302 VDGIYV-NEFEVEGYPAWVFKYTEGGQGGLTMDTGTTYTELHNSVMDPLIKLLEEHITIV 360

Query: 332 PVADPTGS-LELCYSFNSL--SQVPEVTIHF---RGADVKLSRSNFFVKVSEDIVC-SVF 384
           P  D + S  ELCY  +    + +P++ + F   +      +  N +       +C ++F
Sbjct: 361 PEKDYSNSGFELCYFSDDFLGATLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMF 420

Query: 385 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           +  TN + I G     +  +GYD+    VSF
Sbjct: 421 R--TNGMSIIGMHQLRDIKIGYDLHHNIVSF 449


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 183/412 (44%), Gaps = 87/412 (21%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           RS N+L HF+ N S++                 + +++GTPP     V DTGS+L W +C
Sbjct: 72  RSPNKL-HFHHNVSLT-----------------VSLTVGTPPQNVSMVLDTGSELSWLRC 113

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYG 173
                 Q        FDP  SS+Y  +PCSS  C    +      SC S   C   +SY 
Sbjct: 114 NKTQTFQT------TFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYA 167

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-----GTNNGGLFNSKTTGIVGLGGGD 228
           D S S GNLA++T  +G++      +PG  FGC      TN     +SK TG++G+  G 
Sbjct: 168 DASSSEGNLASDTFYIGNSD-----MPGTIFGCMDSSFSTNTEE--DSKNTGLMGMNRGS 220

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTF 279
           +S +SQM      KFSYC+     + +    +   S        P + +STPL    +  
Sbjct: 221 LSFVSQMDFP---KFSYCISDSDFSGVLLLGDANFSWLMPLNYTPLIQISTPLPYFDRVA 277

Query: 280 YVLTIDAISVGNQRL----GVSTPD------IVIDSGTTLTFLP----QGYNSNLLSVMS 325
           Y + ++ I V ++ L     V  PD       ++DSGT  TFL         +  L+  S
Sbjct: 278 YTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALRNEFLNQTS 337

Query: 326 SMIEAQPVADPT----GSLELCY----SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV-- 375
            ++    + DP     G ++LCY    S  SL  +P V++ FRGA++K+S      +V  
Sbjct: 338 QILRV--LEDPNYVFQGGMDLCYRVPLSQTSLPWLPTVSLMFRGAEMKVSGDRLLYRVPG 395

Query: 376 ----SEDIVCSVFKG---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
               S+ + C  F     +     + G+  Q N  + +D+E+  + F    C
Sbjct: 396 EVRGSDSVYCFTFGNSDLLAVEAYVIGHHHQQNVWMEFDLEKSRIGFAQVQC 447


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 180/381 (47%), Gaps = 43/381 (11%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           QNS + +++    D + +N  Y  R+ IGTPP E   + DTGS + +  C  C   QC  
Sbjct: 56  QNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSC--EQCGK 113

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
              P F P +SSTY+ + C+ S C   ++    G  C Y   Y + S S+G +A + V+ 
Sbjct: 114 HQDPRFQPDLSSTYRPVKCNPS-CNCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSF 168

Query: 190 GSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYC 246
           G+ +   +      FGC     G L++ +  GI+GLG G +S++ Q+  +  I   FS C
Sbjct: 169 GNES--ELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLC 226

Query: 247 LVPVSSTKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPD 300
                   ++ G   +V G     P +V +     ++ +Y + +  + V  + L +  P 
Sbjct: 227 Y-----GGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLK-PK 280

Query: 301 I-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----F 346
           +       V+DSGTT  + P+  +++   ++M  +   + +  P  +  ++C+S      
Sbjct: 281 VFDEKHGTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREV 340

Query: 347 NSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTN 401
           + LS+V PEV + F  G  + LS  N+     KVS      +F+   +   + G I+  N
Sbjct: 341 SHLSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRN 400

Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
            LV YD E   + F  T+C++
Sbjct: 401 TLVTYDRENDKIGFWKTNCSE 421


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  122 bits (307), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 113/396 (28%), Positives = 178/396 (44%), Gaps = 67/396 (16%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           ++SK +   +  +N    + ++ GTP      V DTGS+L W  C+  P        + +
Sbjct: 51  TTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP------NFNSI 104

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           F+P  S TY  +PCSS  C +  +      SC     C + +SY D S   GNLA ET  
Sbjct: 105 FNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFR 164

Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           +GS TG     P   FGC   G ++    ++KTTG++G+  G +S ++QM      KFSY
Sbjct: 165 VGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQMGFR---KFSY 216

Query: 246 CLVPVSSTKINFGTNGIVSG-------PGV-VSTPLTK-AKTFYVLTIDAISVGNQRL-- 294
           C+    S+ +        S        P V +STPL    +  Y + ++ I V ++ L  
Sbjct: 217 CISDRDSSGVLLLGEASFSWLKPLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSL 276

Query: 295 --GVSTPD------IVIDSGTTLTF--------LPQGYNSNLLSVMSSMIEAQPVADPTG 338
              V  PD       ++DSGT  TF        L Q +      V+  + E + V    G
Sbjct: 277 PKSVFVPDHTGAGQTMVDSGTQFTFLLGPVYSALKQEFLLQTKGVLRVLNEPRYVFQ--G 334

Query: 339 SLELCYSFN----SLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGIT 388
           +++LCY       +L  +P V + FRGA++ +S      +V       + + C  F G +
Sbjct: 335 AMDLCYLIEPTRAALPNLPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTF-GNS 393

Query: 389 NSVPI----YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           +S+ I     G+  Q N  + YD+E+  + F    C
Sbjct: 394 DSLGIESFVIGHHQQQNVWMEYDLEKSRIGFAEVRC 429


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 171/371 (46%), Gaps = 50/371 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y   I +G+P  E + + DTGS+L W +C PC    C      ++D   S +YK + C+
Sbjct: 99  EYYTSIKLGSPGQEAILIVDTGSELTWLKCLPC--KVCAPSVDTIYDAARSVSYKPVTCN 156

Query: 150 SSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGIT 203
           +SQ C++ +Q +      G  CQ++  YGDGSFS G+L+T+T+ + +   G+ V +    
Sbjct: 157 NSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPVTVQDFA 216

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN------F 257
           FGC   +  L  +  +GI+GL  G ++L  Q+      KFS+C  P  S+ +N      F
Sbjct: 217 FGCAQGDLELVPTGASGILGLNAGKMALPMQLGQRFGWKFSHCF-PDRSSHLNSTGVVFF 275

Query: 258 GTNGI----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD---IVIDSGTTLT 310
           G   +    V    V  T     + FY + +  +S+ +  L V  P    +++DSG++ +
Sbjct: 276 GNAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHEL-VLLPRGSVVILDSGSSFS 334

Query: 311 FLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFN------------SLSQV 352
              + ++S L     + ++ +P        D  G L  C+  +            SLS V
Sbjct: 335 SFVRPFHSQL---REAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSLSLV 391

Query: 353 PE--VTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNFLVGYDIE 409
            E  VTI      V L  + +   V    +C  F+ G  N V + GN  Q N  V YDI+
Sbjct: 392 FEDGVTIGIPSIGVLLPVARYQNHVK---MCFAFEDGGPNPVNVIGNYQQQNLWVEYDIQ 448

Query: 410 QQTVSFKPTDC 420
           +  V F    C
Sbjct: 449 RSRVGFARASC 459


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 116/438 (26%), Positives = 188/438 (42%), Gaps = 53/438 (12%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           ATF   +  L F     V P   Q+    + +I   S  SPF    +  +  +   +T +
Sbjct: 8   ATFF--LVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
                     S+++  K +   I P       ANY++R+ +GTP  +   V DT +D  W
Sbjct: 64  SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
             C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ SYG
Sbjct: 124 VPC-----SGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
             S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231

Query: 233 SQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYVLTI 284
           SQ     +G FSYCL    S K  + +  +  GP      + +TPL +     + Y + +
Sbjct: 232 SQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYYVNL 288

Query: 285 DAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 334
             +SVG  ++ + +  +V          IDSGT +T   Q     +       +   P++
Sbjct: 289 TGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG-PIS 347

Query: 335 DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNSV-- 391
              G+ + C++  + ++ P +T+HF G ++ L   N  +  S   + C       N+V  
Sbjct: 348 S-LGAFDTCFAATNEAEAPAITLHFEGLNLVLPMENSLIHSSSGSLACLSMAAAPNNVNS 406

Query: 392 --PIYGNIMQTNFLVGYD 407
              +  N+ Q N  + +D
Sbjct: 407 VLNVIANLQQQNLRIMFD 424


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/346 (28%), Positives = 163/346 (47%), Gaps = 32/346 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPCS 149
           ++  I  G+P  ++    DTGS L WTQC PC  S CY Q   P + P  S TY+   C 
Sbjct: 58  FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPC--SDCYAQKIYPKYRPAASITYRDAMCE 115

Query: 150 SSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            S   S    +   +   C Y   Y D +   G LA E +T+ +  G    + G+ FGC 
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCN 175

Query: 208 T-NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL----VPVSSTKINFGTNGI 262
           T ++G  F    TGI+GLG G  S+I +       KFS+CL     P +S  +  G    
Sbjct: 176 TLSDGSYFTG--TGILGLGVGKYSIIGEF----GSKFSFCLGEISEPKASHNLILGDGAN 229

Query: 263 VSG-PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 321
           V G P V++  +T+  T  +  +++I VG +        + +D+G+TL+ L        +
Sbjct: 230 VQGHPTVIN--ITEGHT--IFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFV 285

Query: 322 SVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVS- 376
                +I ++P++ +PT    LCY  +++ ++ ++ + F+   GA++ ++  N F++   
Sbjct: 286 DAFDDLIGSRPLSYEPT----LCYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGP 341

Query: 377 EDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +I C   +    S    I G I    + VGYD+  +T      DC
Sbjct: 342 PEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 106/378 (28%), Positives = 177/378 (46%), Gaps = 37/378 (9%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
            NS + ++     D + +N  Y  R+ IGTPP E   + DTGS + +  C  C   QC  
Sbjct: 67  HNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTC--EQCGK 124

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
              P F P+ SSTYK + C+ S C   ++    G  C Y   Y + S S+G LA + ++ 
Sbjct: 125 HQDPRFQPESSSTYKPMQCNPS-CNCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSF 179

Query: 190 GSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYC 246
           G+ +   +      FGC T   G LF+ +  GI+GLG G +S++ Q+  +  +   FS C
Sbjct: 180 GNES--ELTPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC 237

Query: 247 LVPVSSTKINFGTNGIVSGPGVV---STPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-- 301
              +           I   P +V   S P   A  +Y + +  + V  +RL ++ P +  
Sbjct: 238 YGGMDVVGGAMVLGNIPPPPDMVFAHSDPYRSA--YYNIELKELHVAGKRLKLN-PRVFD 294

Query: 302 -----VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSL 349
                V+DSGTT  +LP + + +   +++  +   + +  P  S  ++C+S      + L
Sbjct: 295 GKHGTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQL 354

Query: 350 SQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 404
           S++ PEV + F  G  + LS  N+     KVS      +F+   +   + G I+  N LV
Sbjct: 355 SKIFPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLV 414

Query: 405 GYDIEQQTVSFKPTDCTK 422
            YD +   + F  T+C++
Sbjct: 415 TYDRDNDKIGFWKTNCSE 432


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 172/368 (46%), Gaps = 60/368 (16%)

Query: 87   NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +N    + +++G+PP +   V DTGS+L W  C+  P        + +F+P  SS+Y  +
Sbjct: 996  HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 1049

Query: 147  PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            PCSS  C +  +   + V       C   VSY D S   GNLA++   +GS+     ALP
Sbjct: 1050 PCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 1104

Query: 201  GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI-N 256
            G  FGC   G ++    ++KTTG++G+  G +S ++Q+      KFSYC+    S+ +  
Sbjct: 1105 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQLGLP---KFSYCISGRDSSGVLL 1161

Query: 257  FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
            FG   +     +  TPL +  T         Y + +D I VGN+ L     +  PD    
Sbjct: 1162 FGDLHLSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGA 1221

Query: 301  --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS---L 349
               ++DSGT  TFL     + L +      +    P+ DP     G+++LCYS  +   L
Sbjct: 1222 GQTMVDSGTQFTFLLGPVYTALRNEFLEQTKGVLAPLGDPNFVFQGAMDLCYSVAAGGKL 1281

Query: 350  SQVPEVTIHFRGA------DVKLSRSNFFVKVSEDIVCSVFKG---ITNSVPIYGNIMQT 400
              +P V++ FRGA      +V L R    +K +E + C  F     +     + G+  Q 
Sbjct: 1282 PTLPSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQ 1341

Query: 401  NFLVGYDI 408
            N  + +D+
Sbjct: 1342 NVWMEFDL 1349


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 105/337 (31%), Positives = 156/337 (46%), Gaps = 39/337 (11%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--C-SG 163
           V DT  D+ W +C PC  +QC       +DP  SSTY + PC+SS C  L + +  C + 
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220

Query: 164 VNCQYSV-SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
             CQY V + GD   ++G  +++ +T+ S  G  V   G  FGC  N  G F ++  GI+
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTINS--GDRVE--GFRFGCSQNEQGSFENQADGIM 276

Query: 223 GLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG--VVSTPLTK----- 275
            LG G  SL++Q  +T    FSYCL P  +TK  F   G+  G     V+TP+ K     
Sbjct: 277 ALGRGVQSLMAQTSSTYGDAFSYCLPPTETTK-GFFQIGVPIGASYRFVTTPMLKERGGA 335

Query: 276 ---AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 328
              A T Y   + AI+V  + L V         V+DS T +T LP      L +   + +
Sbjct: 336 SAAAATLYRALLLAITVDGKELNVPAEVFAAGTVMDSRTIITRLPVTAYGALRAAFRNRM 395

Query: 329 EAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK 385
             + VA P   L+ CY    +   ++P + + F G A V++ RS   +       C  F 
Sbjct: 396 RYR-VAPPQEELDTCYDLTGVRYPRLPRIALVFDGNAVVEMDRSGILLN-----GCLAFA 449

Query: 386 GITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              +  S  I GN+ Q    V +D+    + F+   C
Sbjct: 450 SNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score =  121 bits (304), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 100/308 (32%), Positives = 147/308 (47%), Gaps = 40/308 (12%)

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATET 186
           P FD   SST     C S+ C  L   SC          C Y+  Y D S + G +  + 
Sbjct: 23  PYFDRSTSSTLLLTSCDSTLCQGLLVASCGNTKFWPNQTCVYTYYYNDKSVTTGLIEVDK 82

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYC 246
            T G+      ++PG+ FGCG  N G+F S  TGI G G G +SL SQ++    G FS+C
Sbjct: 83  FTFGA----GASVPGVAFGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLKV---GNFSHC 135

Query: 247 LVPVSSTK-----INFGTNGIVSGPGVV-STPLTKAK---TFYVLTIDAISVGNQRLGV- 296
              V+  K     ++   +   +G G V STPL +     TFY L++  I+VG+ RL V 
Sbjct: 136 FTAVNGLKQSTVLLDLPADLYKNGRGAVQSTPLIQNSANPTFYYLSLKGITVGSTRLPVP 195

Query: 297 --------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
                    T   +IDSGT++T LP      +    ++ I+   V         C+S  S
Sbjct: 196 ESAFALTNGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGPYTCFSAPS 255

Query: 349 LSQ--VPEVTIHFRGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTN 401
            ++  VP++ +HF GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N
Sbjct: 256 QAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQN 313

Query: 402 FLVGYDIE 409
             V YD++
Sbjct: 314 MHVLYDLQ 321


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score =  121 bits (304), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 18/212 (8%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 155 GTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLG 214

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS  V CQ+  +Y DG+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 215 PYRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADRGST 270

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+   +G + LGGG  S + Q  T     FSYC +P S + + F T G+        P  
Sbjct: 271 FSFDVSGTLALGGGAQSFVQQTATQYGRVFSYC-IPPSPSSLGFITLGVPPQRAALVPTF 329

Query: 269 VSTPLTKAK----TFYVLTIDAISVGNQRLGV 296
           VSTPL  +     TFY + + AI V  + L V
Sbjct: 330 VSTPLLSSSSMPPTFYRVLLRAIIVAGRPLPV 361


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 69/161 (42%), Positives = 91/161 (56%), Gaps = 10/161 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG+PP     V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y  L 
Sbjct: 50  SGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPSFSSSYAPLT 107

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C   +C Y VSYGDGS++ G+ ATET+TL      + +L  +  GCG
Sbjct: 108 CETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG----SASLNNVAIGCG 163

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
            +N GLF      +   GG  +S  SQ+    A  FSYCLV
Sbjct: 164 HDNEGLFVGAAGLLGLGGGS-LSFPSQIN---ASSFSYCLV 200


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score =  121 bits (303), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 166/362 (45%), Gaps = 35/362 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +GTPP E     DTGSD++W   + C  CP +         FD   SST + +P
Sbjct: 81  YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140

Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS   C S  Q + +        C Y+  YGDGS ++G   ++T    +  G+++   + 
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTK 254
             I FGC T   G     +    GI G G G++S+ISQ+ +       FS+CL    S  
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCLKGEDSGG 260

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
                 G +  PG+V +PL  ++  Y L + +I+V  Q L +        S    +ID+G
Sbjct: 261 -GILVLGEILEPGIVYSPLVPSQPHYNLDLQSIAVSGQLLPIDPAAFATSSNRGTIIDTG 319

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLSQV-PEVTIHFRGAD 363
           TTL +L +      +S +++ +    +A PT      CY   NS+S+V P V+ +F G  
Sbjct: 320 TTLAYLVEEAYDPFVSAITAAVSQ--LATPTINKGNQCYLVSNSVSEVFPPVSFNFAGGA 377

Query: 364 VKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             L +   ++          + C  F+ I   + I G+++  + +  YD+  Q + +   
Sbjct: 378 TMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRIGWANY 437

Query: 419 DC 420
           DC
Sbjct: 438 DC 439


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 104/389 (26%), Positives = 171/389 (43%), Gaps = 41/389 (10%)

Query: 65  LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
            +HFN    +  S +           D +  N  Y  R+ IGTPP     + DTGS + +
Sbjct: 59  FSHFNPRRQLKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTY 118

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
             C  C    C     P F P+ S TY+ + C + QC   N +      C Y   Y + S
Sbjct: 119 VPCSTC--RHCGSHQDPKFRPEDSETYQPVKC-TWQCNCDNDRK----QCTYERRYAEMS 171

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
            S+G L  + V+ G+ T   ++     FGC  +  G ++N +  GI+GLG GD+S++ Q+
Sbjct: 172 TSSGALGEDVVSFGNQT--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229

Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
             +  I+  FS C   +          GI     +V T     ++ +Y + +  I V  +
Sbjct: 230 VEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGK 289

Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
           RL ++ P +       V+DSGTT  +LP+  + +   ++M      + ++ P     ++C
Sbjct: 290 RLHLN-PKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDIC 348

Query: 344 YSFNSL--SQV----PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPI 393
           +S   +  SQ+    P V + F  G  + LS  N+     KV       VF    +   +
Sbjct: 349 FSGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTL 408

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            G I+  N LV YD E   + F  T+C++
Sbjct: 409 LGGIVVRNTLVMYDREHTKIGFWKTNCSE 437


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 113/364 (31%), Positives = 164/364 (45%), Gaps = 50/364 (13%)

Query: 57  ALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDL 114
           A  RS  RL+ +      +S   ++A +  +     Y+++ SIG PP    A  DTGSDL
Sbjct: 57  AAERSRRRLSVY------TSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDL 110

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-----KSCSGVN--CQ 167
           +W +C PC  + C    SPL+DP  S +   LPCSS  C +L +       CS     C 
Sbjct: 111 MWVKCSPC--NGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCG 168

Query: 168 YSVSYGD-GSFS-NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           Y  +YG  G  S  G L TET T     G       ++FG      G     T G+VGLG
Sbjct: 169 YHYAYGHSGDHSTQGVLGTETFTF----GDGYVANNVSFGRSDTIDGSQFGGTAGLVGLG 224

Query: 226 GGDISLISQMRTTIAGKFSYCLV--PVSSTKINFG-------TNGIVSGPGVVSTPLTKA 276
            G +SL+SQ+    AG+F+YCL   P   + I FG       + G VS   +V+ P    
Sbjct: 225 RGHLSLVSQLG---AGRFAYCLAADPNVYSTILFGSLAALDTSAGDVSSTPLVTNPKPDR 281

Query: 277 KTFYVLTIDAISVGNQRLGVSTPDIVIDS-GTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 335
            T Y + +  ISVG  RL +      I+S G+   F   G     L   +  +  Q +  
Sbjct: 282 DTHYYVNLQGISVGGSRLPIKDGTFAINSDGSGGVFFDSGAIDTSLKDAAYQVVRQAITS 341

Query: 336 PTGSL------ELCY---SFNSLSQVPEVTIHF-RGADVKLSRSNFFVKV----SEDIVC 381
               L      + C+   +  +++Q+P + +HF  GAD+ L+  N+        SE +VC
Sbjct: 342 EIQRLGYDAGDDTCFVAANQQAVAQMPPLVLHFDDGADMSLNGRNYLKTSTKGPSEVLVC 401

Query: 382 SVFK 385
              K
Sbjct: 402 MAIK 405


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 166/367 (45%), Gaps = 44/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP       DTGSD++W    +C+ CP       +   +DP  S T  ++ 
Sbjct: 84  YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVG 141

Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-- 196
           C    C +    S  GV          CQ+ ++YGDGS + G   T+ V     +G    
Sbjct: 142 CEQEFCVA---NSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQT 198

Query: 197 -VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
             +   ITFGCG   GG     N    GI+G G  D S++SQ+     +   F++CL  V
Sbjct: 199 TTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTV 258

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
               I F    +V  P V +TPL    T Y + +  ISVG   L + T           +
Sbjct: 259 RGGGI-FAIGNVVQ-PKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 316

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           IDSGTTL +LP+     LL+ +    +  P+ +    +   +S +     P +T  F+G 
Sbjct: 317 IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFVCFQFSGSIDDGFPVITFSFKG- 375

Query: 363 DVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 414
           D+ L+    ++  +   D+ C  F   G+       + + G+++ +N LV YD+E++ + 
Sbjct: 376 DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 436 WTDYNCS 442


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 68/165 (41%), Positives = 89/165 (53%), Gaps = 12/165 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q   +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189

Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T          +  + 
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
            GCG +N GLF      +    GG +S  SQ +    GKFSYCLV
Sbjct: 245 LGCGHDNEGLFVGAAGLLGLGRGG-LSFPSQTKNRYNGKFSYCLV 288


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 103/389 (26%), Positives = 173/389 (44%), Gaps = 41/389 (10%)

Query: 65  LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
           L+HFN    +  S++           D +  N  Y  R+ IGTPP     + DTGS + +
Sbjct: 59  LSHFNPRRHLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTY 118

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
             C  C    C     P F P+ S TY+ + C + QC   + +      C Y   Y + S
Sbjct: 119 VPCSTC--KHCGSHQDPKFRPEASETYQPVKC-TWQCNCDDDRK----QCTYERRYAEMS 171

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
            S+G L  + V+ G+ +   ++     FGC  +  G ++N +  GI+GLG GD+S++ Q+
Sbjct: 172 TSSGVLGEDVVSFGNQS--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229

Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
             +  I+  FS C   +          GI     +V T     ++ +Y + +  I V  +
Sbjct: 230 VEKKVISDAFSLCYGGMGVGGGAMVLGGISPPADMVFTHSDPVRSPYYNIDLKEIHVAGK 289

Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
           RL ++ P +       V+DSGTT  +LP+  + +   ++M      + ++ P     ++C
Sbjct: 290 RLHLN-PKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDIC 348

Query: 344 YS-----FNSLSQ-VPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPI 393
           +S      + LS+  P V + F  G  + LS  N+     KV       VF    +   +
Sbjct: 349 FSGAEINVSQLSKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTL 408

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            G I+  N LV YD E   + F  T+C++
Sbjct: 409 LGGIVVRNTLVMYDREHSKIGFWKTNCSE 437


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC T   G     +    GI G G G +S++SQ+  R      FS+CL    S   
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
            F    I+  PG+V +PL  ++  Y L + +I V  Q L +        +T   ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 338

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
           TLT+L +      L+ +S+ + +Q V     + E CY  + S+S + P V+++F G    
Sbjct: 339 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 397

Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + R      ++ +     + C  F+       I G+++  + +  YD+ +Q + +   DC
Sbjct: 398 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457

Query: 421 T 421
           +
Sbjct: 458 S 458


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC T   G     +    GI G G G +S++SQ+  R      FS+CL    S   
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 284

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
            F    I+  PG+V +PL  ++  Y L + +I V  Q L +        +T   ++D+GT
Sbjct: 285 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 343

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
           TLT+L +      L+ +S+ + +Q V     + E CY  + S+S + P V+++F G    
Sbjct: 344 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 402

Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + R      ++ +     + C  F+       I G+++  + +  YD+ +Q + +   DC
Sbjct: 403 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 462

Query: 421 T 421
           +
Sbjct: 463 S 463


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 171/360 (47%), Gaps = 32/360 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC T   G     +    GI G G G +S++SQ+  R      FS+CL    S   
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
            F    I+  PG+V +PL  ++  Y L + +I V  Q L +        +T   ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLVPSQPHYNLNLLSIGVNGQMLPLDAAVFEASNTRGTIVDTGT 338

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFRGADVK 365
           TLT+L +      L+ +S+ + +Q V     + E CY  + S+S + P V+++F G    
Sbjct: 339 TLTYLVKEAYDLFLNAISNSV-SQLVTPIISNGEQCYLVSTSISDMFPSVSLNFAGGASM 397

Query: 366 LSRS-----NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + R      ++ +     + C  F+       I G+++  + +  YD+ +Q + +   DC
Sbjct: 398 MLRPQDYLFHYGIYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWASYDC 457


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 162/371 (43%), Gaps = 52/371 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
           + FGCG    G   S +    GI+G G  + S++SQ+ ++  +   FS+CL  V    I 
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 333

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSGTT 308
           F    +V  P V  TPL + +  Y + +  I VG   L V +           +IDSGTT
Sbjct: 334 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 392

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHF- 359
           L + PQ        V   +IE      P   L        C+ +  N     P VT+HF 
Sbjct: 393 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFD 445

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQ 410
           +   + +    +  +V E   C    G  NS         + + G+++ +N LV YD+E+
Sbjct: 446 KSISLTVYPHEYLFQVKEFEWCI---GWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEK 502

Query: 411 QTVSFKPTDCT 421
           Q + +   +C+
Sbjct: 503 QGIGWVEYNCS 513


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 117/224 (52%), Gaps = 26/224 (11%)

Query: 38  SPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NY 91
           S K   +N        L D   RS+   N   + +S  + +ASQ  I  ++       NY
Sbjct: 8   SEKKIDWNRRLQKQLILDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNY 65

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + +G+       + DT SDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS
Sbjct: 66  IVTMGLGSK--NMTVIIDTRSDLTWVQCEPCMS--CYNQQGPIFKPSTSSSYQSVSCNSS 121

Query: 152 QCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            C SL     N  +C   N   C Y V+YGDGS++NG+L  E ++ G      V++    
Sbjct: 122 TCQSLQFATGNTGACGSSNPSTCNYVVNYGDGSYTNGDLGVEALSFG-----GVSVSDFV 176

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           FGCG NN GLF    +G++GLG   +SL+SQ   T  G FSYCL
Sbjct: 177 FGCGRNNKGLFGG-VSGLMGLGRSYLSLVSQTNATFGGVFSYCL 219


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 106/413 (25%), Positives = 174/413 (42%), Gaps = 48/413 (11%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPP 101
           P QR  +   RSL+ +   +        +   A  +P   N        Y  ++ +G+P 
Sbjct: 26  PVQRKFNGPHRSLDAIKAHDDRRR---GRFLAAIDVPLGGNGLPSSTGLYYTKVGLGSPA 82

Query: 102 TERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
            E     DTGSD++W  C     CP       D  L+DP  S T  ++PC    C     
Sbjct: 83  KEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYS 142

Query: 159 KSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNG 211
              SG    ++C YS++YGDGS ++G+   +++T    +G     P    + FGCG    
Sbjct: 143 GPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQS 202

Query: 212 GLFNSKT----TGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSG 265
           G  +S +     GI+G G  + S++SQ+  +  +   FS+CL       I   + G V  
Sbjct: 203 GSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCLDSHHGGGIF--SIGQVME 260

Query: 266 PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQG-Y 316
           P   +TPL      Y + +  + V  + + +        S    +IDSGTTL +LP   Y
Sbjct: 261 PKFNTTPLVPRMAHYNVILKDMDVDGEPILLPLYLFDSGSGRGTIIDSGTTLAYLPLSIY 320

Query: 317 NSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVK 374
           N  L  V+     ++   V D        YS       P V  HF G  + +   ++   
Sbjct: 321 NQLLPKVLGRQPGLKLMIVEDQFTCFH--YSDKLDEGFPVVKFHFEGLSLTVHPHDYLFL 378

Query: 375 VSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             EDI C  ++  +        + + G+++ +N LV YD+E   + +   +C+
Sbjct: 379 YKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMVIGWTNFNCS 431


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 115/428 (26%), Positives = 179/428 (41%), Gaps = 57/428 (13%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
           + H   P SP    S     R  DA    L  L+     + +SS+  +     P+   Y+
Sbjct: 27  VYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS---YV 80

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +R  +G+P  + L   DT +D  W  C PC    C    S LF P  SS+Y SLPCSSS 
Sbjct: 81  VRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSSSW 136

Query: 153 CASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           C     ++C                  C +S  + D SF    LA++T+ LG       A
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD-----A 190

Query: 199 LPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           +P  TFGC ++  G   N    G++GLG G ++L+SQ  +   G FSYCL P   +    
Sbjct: 191 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL-PSYRSYYFS 249

Query: 258 GTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVST----------PD 300
           G+  + +G G    V  TP+ +     + Y + +  +SVG+  + V              
Sbjct: 250 GSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATGAG 309

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 358
            V+DSGT +T       + L       + A       G+ + C++ + ++    P VT+H
Sbjct: 310 TVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPAVTVH 369

Query: 359 FRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQT 412
             G  D+ L   N  +  S   + C       + + + V +  N+ Q N  V +D+    
Sbjct: 370 MDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDVANSR 429

Query: 413 VSFKPTDC 420
           V F    C
Sbjct: 430 VGFAKESC 437


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/157 (40%), Positives = 87/157 (55%), Gaps = 11/157 (7%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q    SSS  S   +   +  Y  R+ +GTPP     V DTGSD++W QC PC   +CY 
Sbjct: 155 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 210

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q  P+FDPK S ++ S+ C S  C  L+   C S  +C Y V+YGDGSF+ G  +TET+T
Sbjct: 211 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 270

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
                 +   +P +  GCG +N GLF     G++GLG
Sbjct: 271 F-----RGTRVPKVALGCGHDNEGLFVG-AAGLLGLG 301


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 165/367 (44%), Gaps = 44/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP       DTGSD++W    +C+ CP       +   +DP  S T  ++ 
Sbjct: 84  YYTRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVG 141

Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-- 196
           C    C +    S  GV          CQ+ ++YGDGS + G   T+ V     +G    
Sbjct: 142 CEQEFCVA---NSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQT 198

Query: 197 -VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPV 250
             +   ITFGCG   GG     N    GI+G G  D S++SQ+     +   F++CL  V
Sbjct: 199 TTSNASITFGCGAQLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTV 258

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIV 302
               I F    +V  P V +TPL    T Y + +  ISVG   L + T           +
Sbjct: 259 RGGGI-FAIGNVVQ-PKVKTTPLVPNVTHYNVNLQGISVGGATLQLPTSTFDSGDSKGTI 316

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           IDSGTTL +LP+     LL+ +    +  P+ +    +   +S +     P +T  F G 
Sbjct: 317 IDSGTTLAYLPREVYRTLLAAVFDKYQDLPLHNYQDFVCFQFSGSIDDGFPVITFSFEG- 375

Query: 363 DVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 414
           D+ L+    ++  +   D+ C  F   G+       + + G+++ +N LV YD+E++ + 
Sbjct: 376 DLTLNVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVLSNKLVVYDLEKEVIG 435

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 436 WTDYNCS 442


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 120/444 (27%), Positives = 180/444 (40%), Gaps = 32/444 (7%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHR------DSPKSP--FYNSSETPYQRLRDALT 59
            FIL F+   V     A    FS  LIHR       S KSP  F       Y RL  ++ 
Sbjct: 6   AFILLFILSLVSEKSLASL--FSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSID 63

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNN---ANYLIRISIGTPPTERLAVADTGSDLIW 116
               ++N   +  S+  S+ S+  I P N     +   I IGTP    L   D+GSDL+W
Sbjct: 64  SRRQKMNLGAKFQSLVPSEGSKT-ISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLW 122

Query: 117 TQCE--PCPP------SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
             C    C P      S    +D   FDP  S+T K  PCS   C S          C Y
Sbjct: 123 IPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCPY 182

Query: 169 SVSYG-DGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFNSKTT--GIVGL 224
           +V+Y  + + S+G L  + + L  +   + ++   +  GCG    G F       G++GL
Sbjct: 183 TVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGL 242

Query: 225 GGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVL 282
           G G+IS+ S +     +   FS C     S +I FG  G  +       P       Y +
Sbjct: 243 GPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYFV 302

Query: 283 TIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
            ++   VGN  L  S+   +IDSG + TFLP+     +   + S I A       G  E 
Sbjct: 303 GVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEY 362

Query: 343 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
           CY  +   +VP + + F   +  +     FV    + +      I+ S    G ++  N+
Sbjct: 363 CYETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGTGGVIGQNY 422

Query: 403 LVGY----DIEQQTVSFKPTDCTK 422
           + GY    D E   + +  + C +
Sbjct: 423 MAGYRIVFDRENMKLGWSASKCQE 446


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score =  119 bits (297), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 102/373 (27%), Positives = 163/373 (43%), Gaps = 48/373 (12%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           +N L    IG  P +     DTGSD +W  C     CP       D  L+DP +S T K+
Sbjct: 72  SNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKA 131

Query: 146 LPCSSSQCASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
           +PC    C S      S    G++C YS++YGDGS ++G+   + +T     G    +P 
Sbjct: 132 VPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191

Query: 201 --GITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPV 250
              + FGCG+   G  +S T     GI+G G  + S++SQ+    AGK    FS+CL  +
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAA--AGKVKRIFSHCLDSI 249

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIV 302
           S   I F    +V  P V +TPL +    Y + +  I V    + +        S    +
Sbjct: 250 SGGGI-FAIGEVVQ-PKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTI 307

Query: 303 IDSGTTLTFLPQGYNSNLLS---VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV--TI 357
           IDSGTTL +LP      LL       S ++   V D       C+ ++    V ++  T+
Sbjct: 308 IDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCFHYSDEESVDDLFPTV 363

Query: 358 HF---RGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDI 408
            F    G  +     ++     ED+ C  ++           + + G+++  N LV YD+
Sbjct: 364 KFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDL 423

Query: 409 EQQTVSFKPTDCT 421
           +   + +   +C+
Sbjct: 424 DNMAIGWADYNCS 436


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 116/432 (26%), Positives = 180/432 (41%), Gaps = 61/432 (14%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA    L  L+     + +SS+  +     P+   
Sbjct: 27  LSVYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS--- 80

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +G+P  + L   DT +D  W  C PC    C    S LF P  SS+Y SLPCSS
Sbjct: 81  YVVRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSS 136

Query: 151 SQCASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           S C     ++C                  C +S  + D SF    LA++T+ LG      
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD---- 191

Query: 197 VALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
            A+P  TFGC ++  G   N    G++GLG G ++L+SQ  +   G FSYCL P   +  
Sbjct: 192 -AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQAGSLYNGVFSYCL-PSYRSYY 249

Query: 256 NFGTNGIVSGPG----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP--------- 299
             G+  + +G G    V  TP+ +     + Y + +  +SVG  R  V  P         
Sbjct: 250 FSGSLRLGAGGGQPRSVRYTPMLRNPHRSSLYYVNVTGLSVG--RAWVKVPAGSFAFDAA 307

Query: 300 ---DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 354
                V+DSGT +T       + L       + A       G+ + C++ + ++    P 
Sbjct: 308 TGAGTVVDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPA 367

Query: 355 VTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDI 408
           VT+H  G  D+ L   N  +  S   + C       + + + V +  N+ Q N  V +D+
Sbjct: 368 VTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVVFDV 427

Query: 409 EQQTVSFKPTDC 420
               + F    C
Sbjct: 428 ANSRIGFAKESC 439


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 161/361 (44%), Gaps = 45/361 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
           Y  ++ +GTPP       DTGSDL+W  C PC     +     P+  +D K S++   +P
Sbjct: 36  YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           CS   C  + Q S SG N    C YS  YGDGS + G L  + +          A   + 
Sbjct: 96  CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  ++      GI+G G  D+S  SQ+     GK    F++CL         
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DGGERGGG 207

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDI----VIDSGTT 308
               G V  P +  TPL    + Y + + +ISV N  L +     + D+    + DSGTT
Sbjct: 208 ILVLGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTT 267

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
           L +LP          +S ++    + D   S  +   F      P V ++F GA + L+ 
Sbjct: 268 LAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTP 321

Query: 369 SNFFVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           + + ++     +  I C  ++ + ++       I+G+++  N LV YD+E+  + ++P D
Sbjct: 322 AEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFD 381

Query: 420 C 420
           C
Sbjct: 382 C 382


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 105/376 (27%), Positives = 169/376 (44%), Gaps = 51/376 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYKSLP 147
            Y +R  +GTP    + VADTGSDL W +C           D+P  +F    S ++  + 
Sbjct: 111 QYFVRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDG---TGDAPRRVFRAAASRSWAPIA 167

Query: 148 CSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL---GSTT----GQ 195
           CSS  C S     L   S     C Y   Y DGS + G + T++ T+   GS +    G+
Sbjct: 168 CSSDTCTSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGR 227

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV----PVS 251
              L G+  GC  +  G     + G++ LG  +IS  S+      G+FSYCLV    P +
Sbjct: 228 RAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRN 287

Query: 252 STK-INFGTNGIVSG--------PGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP 299
           +T  + FG  G   G             TPL    +   FY + +DA+ V  + L +   
Sbjct: 288 ATSYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPA- 346

Query: 300 DI---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNS 348
           D+         ++DSGT+LT L       +++ +S  +   P    DP    E CY++ +
Sbjct: 347 DVWDVARGGGAILDSGTSLTVLATPAYRAVVAALSERLAGLPRVSMDP---FEYCYNWTA 403

Query: 349 LS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVG 405
            + ++P + + F G A ++    ++ V  +  + C  V +G    V + GNI+Q + L  
Sbjct: 404 AALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHLWE 463

Query: 406 YDIEQQTVSFKPTDCT 421
           +D+  + + FK T C 
Sbjct: 464 FDLRDRWLRFKHTRCA 479


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score =  118 bits (296), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 165/361 (45%), Gaps = 35/361 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP       +  L+D K S T K + 
Sbjct: 98  YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157

Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C ++N        + ++C Y+  Y DGS S G    + V     +G      A  
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217

Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
            + FGC     G  +S+    GI+G G  + S+ISQ+ ++  +   F++CL  ++   I 
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGI- 276

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDSGTT 308
           F    IV  P V +TPL   +T Y + + A+ VG   L + T   D+      +IDSGTT
Sbjct: 277 FAIGHIVQ-PKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTT 335

Query: 309 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
           L +LP+     LLS + S    ++   + D     +  YS +     P VT HF  +   
Sbjct: 336 LAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYL 393

Query: 366 LSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
               + ++   + + C  ++  G+      ++ + G++  +N LV YD+E Q + +   +
Sbjct: 394 KVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYN 453

Query: 420 C 420
           C
Sbjct: 454 C 454


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 106/371 (28%), Positives = 161/371 (43%), Gaps = 52/371 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 74  YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 134 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 193

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
           + FGCG    G   S +    GI+G G  + S++SQ+ ++  +   FS+CL  V    I 
Sbjct: 194 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 252

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTT 308
           F    +V  P V  TPL + +  Y + +  I VG   L V             +IDSGTT
Sbjct: 253 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 311

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHF- 359
           L + PQ        V   +IE      P   L        C+ +  N     P VT+HF 
Sbjct: 312 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHFD 364

Query: 360 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQ 410
           +   + +    +  +V E   C    G  NS         + + G+++ +N LV YD+E+
Sbjct: 365 KSISLTVYPHEYLFQVKEFEWCI---GWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDLEK 421

Query: 411 QTVSFKPTDCT 421
           Q + +   +C+
Sbjct: 422 QGIGWVEYNCS 432


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 162/364 (44%), Gaps = 37/364 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP +     D  L++   S T K +P
Sbjct: 78  YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C  +N     G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 138 CDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANG 197

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            + FGCG    G   S       GI+G G  + S+ISQ+  T  +   F++CL   +   
Sbjct: 198 SVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCLDGTNGGG 257

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
           I     G V  P V  TPL   +  Y + + A+ VG++ L + T           +IDSG
Sbjct: 258 IF--VIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGHEFLSLPTDVFEAGDRKGAIIDSG 315

Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
           TTL +LP+     L+S + S    ++   V D     +  YS +     P VT HF  + 
Sbjct: 316 TTLAYLPEMVYKPLVSKIISQQPDLKVHTVRDEYTCFQ--YSDSLDDGFPNVTFHFENSV 373

Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           +     + ++   E + C  ++  G+      ++ + G+++ +N LV YD+E Q + +  
Sbjct: 374 ILKVYPHEYLFPFEGLWCIGWQNSGVQSRDRRNMTLLGDLVLSNKLVLYDLENQAIGWTE 433

Query: 418 TDCT 421
            +C+
Sbjct: 434 YNCS 437


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score =  118 bits (295), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 166/362 (45%), Gaps = 35/362 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP       +  L+D K S T K + 
Sbjct: 98  YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157

Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C ++N        + ++C Y+  Y DGS S G    + V     +G      A  
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217

Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
            + FGC     G  +S+    GI+G G  + S+ISQ+ ++  +   F++CL  ++   I 
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGI- 276

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DI------VIDSGTT 308
           F    IV  P V +TPL   +T Y + + A+ VG   L + T   D+      +IDSGTT
Sbjct: 277 FAIGHIVQ-PKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTT 335

Query: 309 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 365
           L +LP+     LLS + S    ++   + D     +  YS +     P VT HF  +   
Sbjct: 336 LAYLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYL 393

Query: 366 LSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
               + ++   + + C  ++  G+      ++ + G++  +N LV YD+E Q + +   +
Sbjct: 394 KVHPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYN 453

Query: 420 CT 421
           C+
Sbjct: 454 CS 455


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 94/350 (26%), Positives = 156/350 (44%), Gaps = 60/350 (17%)

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGD 174
           QC+PC    CY Q  P+F+PK+SS+Y  +PC+S  CA L+   C   +   CQY+  Y  
Sbjct: 2   QCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSG 59

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
              + G LA + + +G     AV      FGC  ++ G   ++ +G+VGLG G +SL+SQ
Sbjct: 60  HGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ 114

Query: 235 MRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDA 286
           +      +F YCL P  S       +  G + + +    V+  +   T+  ++Y L +D 
Sbjct: 115 LSVH---RFMYCLPPPMSRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDG 171

Query: 287 ISVGNQ-----RLGVSTPD------------------------IVIDSGTTLTFLPQGYN 317
           ++VG+Q     R   S P                         +++D  +T++FL     
Sbjct: 172 LAVGDQTPGTTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLY 231

Query: 318 SNLLSVMSSMIEAQPVADPTGS--LELCYSF-----NSLSQVPEVTIHFRGADVKLSRSN 370
             L   +   I   P A P+    L+LC+            VP V++ F G  ++L R  
Sbjct: 232 DELADDLEEEIRL-PRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDR 290

Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            F  V++  +  +  G T+ V I GN    N  V +++ +  ++F    C
Sbjct: 291 LF--VTDGRMMCLMIGRTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
          Length = 335

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 72/205 (35%), Positives = 111/205 (54%), Gaps = 18/205 (8%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 75  GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134

Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ C +   CQ+ ++Y +G+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGV 268
           F+    G + LGGG  S + Q  +  +  FSYC VP S++   F   G+        P  
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQYSRVFSYC-VPPSTSSFGFIMFGVPPQRAALVPTF 249

Query: 269 VSTPL----TKAKTFYVLTIDAISV 289
           VSTPL    T + TFY +T+ +I++
Sbjct: 250 VSTPLLSSSTMSPTFYSITLPSIAL 274


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 180/389 (46%), Gaps = 36/389 (9%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L+ S   L     +S+ ++      D+IP    Y  RI IGTPP     + DTGS L + 
Sbjct: 60  LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
            C  C   QC     P F P  SSTY+ L C S +C   ++     ++C Y   Y + S 
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171

Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+G L  + V+ G    Q+   P  T FGC     G +++ +  GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228

Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
             +  I   FS C   +          GI    G+V T    A++ +Y + +  I +  +
Sbjct: 229 VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288

Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
           +L ++ P +       ++DSGTT  +LP+  + +   ++M  +   + +  P  +  ++C
Sbjct: 289 QLPIN-PMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDIC 347

Query: 344 YS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPI 393
           +S      + LS+  P V + F  G  + LS  N+  + S+        +F+   +   +
Sbjct: 348 FSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTL 407

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            G I+  N LV YD E   + F  T+C++
Sbjct: 408 LGGIIVRNTLVMYDREHLKIGFWKTNCSE 436


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/389 (27%), Positives = 180/389 (46%), Gaps = 36/389 (9%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L+ S   L     +S+ ++      D+IP    Y  RI IGTPP     + DTGS L + 
Sbjct: 60  LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
            C  C   QC     P F P  SSTY+ L C S +C   ++     ++C Y   Y + S 
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171

Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+G L  + V+ G    Q+   P  T FGC     G +++ +  GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228

Query: 236 --RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ 292
             +  I   FS C   +          GI    G+V T    A++ +Y + +  I +  +
Sbjct: 229 VEKGVIGNSFSLCYGGMDVGGGAMVLGGISPPAGMVFTHSDPARSAYYNIDLKEIHIAGK 288

Query: 293 RLGVSTPDI-------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELC 343
           +L ++ P +       ++DSGTT  +LP+  + +   ++M  +   + +  P  +  ++C
Sbjct: 289 QLPIN-PMVFDGKYGTILDSGTTYAYLPEPAFKAFKDAIMKELNSLKLIQGPDRNYNDIC 347

Query: 344 YS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPI 393
           +S      + LS+  P V + F  G  + LS  N+  + S+        +F+   +   +
Sbjct: 348 FSGVGSDVSQLSKTFPAVDLVFSNGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTL 407

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            G I+  N LV YD E   + F  T+C++
Sbjct: 408 LGGIIVRNTLVMYDREHLKIGFWKTNCSE 436


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 120/495 (24%), Positives = 193/495 (38%), Gaps = 86/495 (17%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           IL  +  +++ P+   +    +EL+HR   +           + ++  + R   R    N
Sbjct: 16  ILITITLHLILPVAVNS--MRLELVHRHHERFSGGGGDVDQVEAVKGFVNRDGLRRQRMN 73

Query: 70  QNSSISSSKASQADI---------IPNNA-------NYLIRISIGTPPTERLAVADTGSD 113
           Q   +S+    +  +         +P  A        Y   + +G+P       ADTGS+
Sbjct: 74  QRWGVSNYDRRRKGLETTTTTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFWLAADTGSE 133

Query: 114 LIWTQC---------------------------------EPCPPSQCYMQDSP---LFDP 137
             W  C                                       +   + +P   +F P
Sbjct: 134 FTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCP 193

Query: 138 KMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
             S +++++ C+S +C        SL+        C Y +SY DGS + G   T+T+T+ 
Sbjct: 194 HRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTDTITVD 253

Query: 191 STTGQAVALPGITFGC--GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV 248
              G+   L  +T GC     NG  FN  T GI+GLG    S I +       KFSYCLV
Sbjct: 254 LKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLV 313

Query: 249 PVSSTK-------INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV----- 296
              S +       I    N  + G  +  T L     FY + +  IS+G Q L +     
Sbjct: 314 DHLSHRNVSSYLTIGGHHNAKLLGE-IKRTELILFPPFYGVNVVGISIGGQMLKIPPQVW 372

Query: 297 ---STPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSL-- 349
              S    +IDSGTTLT  L   Y     +++ S+ + + V  +  G+L+ C+       
Sbjct: 373 DFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGFDD 432

Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGY 406
           S VP +  HF  GA  +    ++ + V+  + C     I       + GNIMQ N L  +
Sbjct: 433 SVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEF 492

Query: 407 DIEQQTVSFKPTDCT 421
           D+   T+ F P+ CT
Sbjct: 493 DLSTNTIGFAPSICT 507


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 162/360 (45%), Gaps = 63/360 (17%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + + +G+PP     + DTGSDL W QC PC    C+ Q+                   
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQND------------------ 209

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALPGITFGC 206
                 NQ      +C Y   YGD S + G+ A ET T+  TT     +   +  + FGC
Sbjct: 210 ------NQ------SCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGC 257

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-----STKINFGTN- 260
           G  N GLF+     +    G  +S  SQ+++     FSYCLV  +     S+K+ FG + 
Sbjct: 258 GHWNRGLFHGAAGLLGLGRGP-LSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLIFGEDK 316

Query: 261 GIVSGPGVVSTPLTKAK-----TFYVLTIDAISVGNQRLGV--STPDI--------VIDS 305
            ++S P +  T     K     TFY + I +I V  + L +   T +I        +IDS
Sbjct: 317 DLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDS 376

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIHF-RG 361
           GTTL++  +     + + ++   + + PV      L+ C++ + +   Q+PE+ I F  G
Sbjct: 377 GTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIAFADG 436

Query: 362 ADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           A       N F+ ++ED+VC    G   S   I GN  Q NF + YD ++  + + PT C
Sbjct: 437 AVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKC 496


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 160/361 (44%), Gaps = 45/361 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
           Y  ++ +GTPP       DTGSDL+W  C PC     +     P+  +D K S++   +P
Sbjct: 36  YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           CS   C  + Q S SG N    C YS  YGDGS + G L  + +          A   + 
Sbjct: 96  CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  ++      GI+G G  D+S  SQ+     GK    F++CL         
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQ--GKTPNVFAHCL-DGGERGGG 207

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS----TPDI----VIDSGTT 308
               G V  P +  TPL      Y + + +ISV N  L +     + D+    + DSGTT
Sbjct: 208 ILVLGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTT 267

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
           L +LP          +S ++    + D   S  +   F      P V ++F GA + L+ 
Sbjct: 268 LAYLPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTP 321

Query: 369 SNFFVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           + + ++     +  I C  ++ + ++       I+G+++  N LV YD+E+  + ++P D
Sbjct: 322 AEYLIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFD 381

Query: 420 C 420
           C
Sbjct: 382 C 382


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score =  117 bits (293), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 166/363 (45%), Gaps = 35/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP +     +   +D + S+T K + 
Sbjct: 87  YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C    C  +N    SG    ++C Y   YGDGS + G    + V     +G  +  A  G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206

Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G   S       GI+G G  + S+ISQ+ +T  +   F++CL   +   
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGG 266

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSG 306
           I F    +V  P V  TPL   +  Y + +  + VG+  L +S            +IDSG
Sbjct: 267 I-FAMGHVVQ-PKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSG 324

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
           TTL +LP+     L++ + S      V    G  + C+ ++       P V  HF  + +
Sbjct: 325 TTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYK-CFQYSERVDDGFPPVIFHFENSLL 383

Query: 365 KLSRSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
                + ++   E++ C  ++  G+      +V ++G+++ +N LV YD+E QT+ +   
Sbjct: 384 LKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEY 443

Query: 419 DCT 421
           +C+
Sbjct: 444 NCS 446


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 89/277 (32%), Positives = 130/277 (46%), Gaps = 30/277 (10%)

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C Y+++YGDGSF+ G L  E +  G+     + +    FGCG NN GLF    +G++GLG
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 186

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV---STPLTKAK----- 277
             D+SLISQ      G FSYCL    ST+     + I+ G   V   S+P++ AK     
Sbjct: 187 RSDLSLISQTSGIFGGVFSYCL---PSTERKGSGSLILGGNSSVYRNSSPISYAKMIENP 243

Query: 278 ---TFYVLTIDAISVGN---QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
               FY + +  IS+G    Q   V    I++DSGT +T LP      L +         
Sbjct: 244 QLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 303

Query: 332 PVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKG 386
           P A     L+ C++ ++  +V  P + +HF G     V ++   +FVK     VC     
Sbjct: 304 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 363

Query: 387 IT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +   + V I GN  Q N  V YD ++  V F    C+
Sbjct: 364 LEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 164/372 (44%), Gaps = 53/372 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ IGTP  +     DTGSD++W    QC  CP +     +  L++ K S + K +P
Sbjct: 86  YYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVP 145

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C    C  +N    SG    ++C Y   YGDGS + G    + V     +G  Q  +  G
Sbjct: 146 CDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNG 205

Query: 202 -ITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVPVSSTK 254
            + FGCG    G           GI+G G  + S+ISQ+  T   K  F++CL  ++   
Sbjct: 206 SVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGG 265

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSG 306
           I F    +V  P V  TPL   +  Y + + A+ VG   L + T +         +IDSG
Sbjct: 266 I-FAIGHVVQ-PKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSG 323

Query: 307 TTLTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 363
           TTL +LP+     L+S + S    ++   V D     +  YS +     P VT HF    
Sbjct: 324 TTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ--YSGSVDDGFPNVTFHF---- 377

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI--------------TNSVPIYGNIMQTNFLVGYDIE 409
                ++ F+KV        F+G+                ++ + G+++ +N LV YD+E
Sbjct: 378 ----ENSVFLKVHPHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLE 433

Query: 410 QQTVSFKPTDCT 421
            Q + +   +C+
Sbjct: 434 NQAIGWTEYNCS 445


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 172/361 (47%), Gaps = 32/361 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC T   G     +    GI G G G +S++SQ+  R      FS+CL    S   
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQLSSRGITPPVFSHCLKGDGSGGG 279

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
            F    I+  PG+V +PL  ++  Y L + +I V  Q L +        +T   ++D+GT
Sbjct: 280 VFVLGEILV-PGMVYSPLLPSQPHYNLNLLSIGVNGQILPIDAAVFEASNTRGTIVDTGT 338

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR-GADV 364
           TLT+L +      L+ +S+ + +Q V     + E CY  + S+S + P V+++F  GA +
Sbjct: 339 TLTYLVKEAYDPFLNAISNSV-SQLVTLIISNGEQCYLVSTSISDMFPPVSLNFAGGASM 397

Query: 365 KLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            L   ++           + C  F+       I G+++  + +  YD+ +Q + +   DC
Sbjct: 398 MLRPQDYLFHYGFYDGASMWCIGFQKAPEEQTILGDLVLKDKVFVYDLARQRIGWANYDC 457

Query: 421 T 421
           +
Sbjct: 458 S 458


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 118/443 (26%), Positives = 191/443 (43%), Gaps = 61/443 (13%)

Query: 8   VFILFFLCFYVVSPIEA---------QTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDA 57
           +F  F     VVS  +A         ++ G  + +IH     SPF       +   + + 
Sbjct: 3   IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIRISIGTPPTERLAVADTGS 112
            ++   R+ + +  S ++S KA+   I     + N  NY++R+ +GTP      V DT  
Sbjct: 63  ASKDPARVTYLS--SLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSR 120

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYS 169
           D  W  C  C  + C    SP F P  SSTY SL CS  QC  +   SC       C ++
Sbjct: 121 DAAWVPCADC--AGC---SSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFN 175

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
            +YG  S  +  L+ +++ L   T     LP  +FGC     G       G++GLG G +
Sbjct: 176 QTYGGDSSFSAMLSQDSLGLAVDT-----LPSYSFGCVNAVSG-STLPPQGLLGLGRGPM 229

Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
           SL+SQ  +  +G FSYC     S K  + +  +  GP      + +TPL +     T Y 
Sbjct: 230 SLLSQSGSLYSGVFSYCF---PSFKSYYFSGSLRLGPLGQPKNIRTTPLLRNPHRPTLYY 286

Query: 282 LTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 330
           + +  +SVG   + V+ P++           +IDSGT +T   +   + +       ++ 
Sbjct: 287 VNLTGVSVGRVLVPVA-PELLAFDPNTGAGTIIDSGTVITRFVEPVYAAIRDEFRKQVKG 345

Query: 331 QPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITN 389
            P A   G+ + C++  +    P VT HF G D+KL   N  +  S   + C       N
Sbjct: 346 -PFAT-IGAFDTCFAATNEDIAPPVTFHFTGMDLKLPLENTLIHSSAGSLACLAMAAAPN 403

Query: 390 SV----PIYGNIMQTNFLVGYDI 408
           +V     +  N+ Q N  + +D+
Sbjct: 404 NVNSVLNVIANLQQQNLRIMFDV 426


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/356 (29%), Positives = 160/356 (44%), Gaps = 45/356 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD------SPLFDPKMSSTY 143
            Y  ++ +GTP T  L V DTGSD++W      PP    ++       +P   P+ +   
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--- 177

Query: 144 KSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
               C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  
Sbjct: 178 ----CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQR 229

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +  GCG +N GLF + +  ++GLG G +S  SQ+  +    FSYCLV  +S++    +  
Sbjct: 230 VAIGCGHDNEGLFIAASG-LLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRR 288

Query: 262 IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-GVSTPD-----------IVIDSGTTL 309
               P        +  TFY + +   SVG  R+ GVS  D           +++DSGT++
Sbjct: 289 WGGTP--------RMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSV 340

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR-GADVK 365
           T L +     +     +      V+    SL + CY+     + +VP V++H   GA V 
Sbjct: 341 TRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVA 400

Query: 366 LSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F P  C
Sbjct: 401 LPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 103/385 (26%), Positives = 172/385 (44%), Gaps = 44/385 (11%)

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYM 129
           +SSI++      D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C  
Sbjct: 38  SSSIAAVFPLYGDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNE 94

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNL 182
              PL+ P  S   K +PC    CASL+     +  C   +  C Y + Y D   S G L
Sbjct: 95  VPHPLYRPTKS---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 151

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
             ++  L  T G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++   
Sbjct: 152 INDSFALRLTNG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG 210

Query: 240 AGK--FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLG 295
             K    +CL       + FG + +V       TP+ ++  + +Y     ++  G++ LG
Sbjct: 211 VTKNVVGHCLSLRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLG 269

Query: 296 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQ 351
           V    +V DSG++ T+        L++ +   +      +P  SL LC+     F S+  
Sbjct: 270 VRLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLD 329

Query: 352 VPE----VTIHFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNI 397
           V +    + ++F       +++   N+ +       C    GI N        + I G+I
Sbjct: 330 VRKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDI 386

Query: 398 MQTNFLVGYDIEQQTVSFKPTDCTK 422
              + +V YD E+  + +    C +
Sbjct: 387 TMQDHMVIYDNEKGKIGWIRAPCDR 411


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/365 (28%), Positives = 170/365 (46%), Gaps = 40/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IG+P        DTGSD++W    +C+ CP +     +   +DP  S T  ++ 
Sbjct: 85  YYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVG 142

Query: 148 CSSSQCASLNQK----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
           C    C + +      +C   +  CQ+ ++YGDGS + G   +++V     +G     P 
Sbjct: 143 CDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202

Query: 201 --GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSST 253
              ITFGCG   GG   S +    GI+G G  D S++SQ+     +   F++CL  V   
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPD------IVIDS 305
            I F    +V  P V +TPL +  T Y + +  ISVG   L +  ST D       +IDS
Sbjct: 263 GI-FAIGNVVQ-PKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDS 320

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA- 362
           GTTL +LP+     LL+ +    + Q +A       +C+ F+       P VT  F G  
Sbjct: 321 GTTLAYLPREVYRTLLTAV--FDKYQDLALHNYQDFVCFQFSGSIDDGFPVVTFSFEGEI 378

Query: 363 DVKLSRSNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
            + +   ++  +   D+ C  F   G+       + + G+++ +N LV YD+E+Q + + 
Sbjct: 379 TLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWA 438

Query: 417 PTDCT 421
             +C+
Sbjct: 439 DYNCS 443


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  116 bits (290), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 173/379 (45%), Gaps = 64/379 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 61  HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA ET  +GS T      
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+  +   KFSYC+    S+   
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGFL 226

Query: 257 FGTNGIVSGPGVV--------STPLTK-AKTFYVLTIDAISVGNQRL----GVSTPD--- 300
              +   S  G +        STPL    +  Y + ++ I VG++ L     V  PD   
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADP----TGSLELCYSFNS--- 348
               ++DSGT  TFL     + L +   +  ++  + V DP     G+++LCY   S   
Sbjct: 287 AGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRLVDDPDFVFQGTMDLCYKVGSTTR 346

Query: 349 --LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIYGN 396
              S +P V++ FRGA++ +S      +V+       E++ C  F     +     + G+
Sbjct: 347 PNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 406

Query: 397 IMQTNFLVGYDIEQQTVSF 415
             Q N  + +D+ +  V F
Sbjct: 407 HHQQNVWMEFDLAKSRVGF 425


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 172/375 (45%), Gaps = 43/375 (11%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + C+   C   + K+     C Y   Y + S S+G L  + V+ G+ +  
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
            +      FGC  +  G LF+    GI+GLG G +S++ Q+  +  I   FS C      
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 238

Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI----- 301
             ++ G   +V G     PG++ T     ++ +Y + +  + V  + L V  P I     
Sbjct: 239 GGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVD-PRIFDGKH 297

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV 352
             V+DSGTT  +LP+         +SS +   + +  P  +  ++C++      + LS+V
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEV 357

Query: 353 -PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
            P+V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417

Query: 408 IEQQTVSFKPTDCTK 422
              + + F  T+C++
Sbjct: 418 RHNEKIGFWKTNCSE 432


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/381 (27%), Positives = 172/381 (45%), Gaps = 68/381 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 61  HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA ET  +GS T      
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+  +   KFSYC+    S+   
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSVFL 226

Query: 257 FGTNGIVSGPGVV--------STPLTK-AKTFYVLTIDAISVGNQRL----GVSTPD--- 300
              +   S  G +        STPL    +  Y + ++ I VG++ L     V  PD   
Sbjct: 227 LLGDASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 286

Query: 301 ---IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCYSFNS- 348
               ++DSGT  TFL         +  ++   S++    V DP     G+++LCY   S 
Sbjct: 287 AGQTMVDSGTQFTFLMGPVYTALKNEFITQTKSVLRL--VDDPDFVFQGTMDLCYKVGST 344

Query: 349 ----LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIY 394
                S +P V++ FRGA++ +S      +V+       E++ C  F     +     + 
Sbjct: 345 TRPNFSGLPMVSLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVI 404

Query: 395 GNIMQTNFLVGYDIEQQTVSF 415
           G+  Q N  + +D+ +  V F
Sbjct: 405 GHHHQQNVWMEFDLAKSRVGF 425


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 107/387 (27%), Positives = 166/387 (42%), Gaps = 23/387 (5%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
           LR  L R   RL   NQ  S+S   ++ +        Y   + +GTP T  L   DTGSD
Sbjct: 63  LRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSD 122

Query: 114 LIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
           L W  C+   C P   Y     +D  ++ P  S+T + LPCS   C   +  +     C 
Sbjct: 123 LFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCT 182

Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTGIVGL 224
           Y++ Y  + + S+G L  +++ L S  G A     +  GCG    G  L      G++GL
Sbjct: 183 YNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNASVIIGCGRKQSGDYLDGIAPDGLLGL 242

Query: 225 GGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVL 282
           G  DIS+ S +     +   FS C    SS +I FG  G+ S       PL      Y +
Sbjct: 243 GMADISVPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAV 302

Query: 283 TIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 342
            +D   +G++ L  S+   ++DSGT+ T LP        +     I A  V     + + 
Sbjct: 303 NVDKSCIGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKY 362

Query: 343 CYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKGITNSVPIYGNI 397
           CYS + L    VP + + F  A+      N  +  +++   +       + ++ PI   I
Sbjct: 363 CYSASPLEMPDVPTIILAF-AANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPI--GI 419

Query: 398 MQTNFLVGY----DIEQQTVSFKPTDC 420
           +  NFLVGY    D E   + +  ++C
Sbjct: 420 IGQNFLVGYHVVFDRESMKLGWYRSEC 446


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 116/425 (27%), Positives = 182/425 (42%), Gaps = 70/425 (16%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADI------IP-------NNANYLIRISIG 98
           +R RD   R      H    S ++S +   AD+      +P           Y +R  +G
Sbjct: 59  ERARDDARR------HAYIRSQLASRRRRAADVGASAFAMPLSSGAYTGTGQYFVRFRVG 112

Query: 99  TPPTERLAVADTGSDLIWTQCEPC--PPSQCYMQDSPL--FDPKMSSTYKSLPCSSSQCA 154
           TP    + VADTGSDL W +C     PP+     D P   F    S ++  L CSS  C 
Sbjct: 113 TPAQPFVLVADTGSDLTWVKCRGAAGPPAS----DPPAREFRASESRSWAPLACSSDTCT 168

Query: 155 S-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----------STTGQAVAL 199
           S     L   S     C Y   Y DGS + G + T+  T+              G+   L
Sbjct: 169 SYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRAKL 228

Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV-----SST 253
            G+  GC  T +G  F S + G++ LG  +IS  S+      G+FSYCLV       +S+
Sbjct: 229 QGVVLGCTATYDGQSFQS-SDGVLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNASS 287

Query: 254 KINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI--------- 301
            + FG      G     TPL    +   FY + +DA+ V  + L +   D+         
Sbjct: 288 YLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA-DVWDVGRGGGA 346

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNS-LSQVPEVTIH 358
           ++DSGT+LT L       +++ +   + A P    DP    E CY++ +   ++P++ + 
Sbjct: 347 ILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FEYCYNWTAGAPEIPKLEVS 403

Query: 359 FRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           F G A ++    ++ +  +  + C  V +G    V + GNI+Q   L  +D+  + + FK
Sbjct: 404 FAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDLRDRWLRFK 463

Query: 417 PTDCT 421
            T C 
Sbjct: 464 HTRCA 468


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 108/221 (48%), Gaps = 23/221 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN----HFNQNSSISSSKASQADII 85
           S+E+IH+  P S           R +  L +  +R+N       +N +           +
Sbjct: 67  SLEVIHKHGPCSKLSQDKGRSPSRTQ-MLDQDESRVNSIRSRLAKNPADGGKLKGSKVTL 125

Query: 86  PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P+ +       NY++ + +GTP  +   + DTGSDL WTQCEPC    CY Q  P+F+P 
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPS 184

Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            S++Y ++ CSS  C  L     N  SCS   C Y + YGD S+S G  A + + L ST 
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
                     FGCG NN GLF     G++GLG   +SL+S+
Sbjct: 245 ----VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSK 280



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 9/111 (8%)

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 371
           G   N LS+MS      P A P   L+ CY F+      VP++ ++F  GA++ L  S  
Sbjct: 269 GLGRNALSLMSKY----PKAAPASILDTCYDFSQYDTVDVPKINLYFSDGAEMDLDPSGI 324

Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F  ++   VC  F G +++  + I GN+ Q  F V YD+    + F P  C
Sbjct: 325 FYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPGGC 375


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 105/383 (27%), Positives = 177/383 (46%), Gaps = 65/383 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N + ++ +++GTPP     V DTGS+L W  C         +     FDP  S++Y+++
Sbjct: 27  HNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTI 80

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSS  C +  Q      SC   N C  ++SY D S S+GNLA++   +GS+      + 
Sbjct: 81  PCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD-----IS 135

Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-STKIN 256
           G+ FGC     ++    +SK+TG++G+  G +S +SQ+      KFSYC+     S  + 
Sbjct: 136 GLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQLGFP---KFSYCISGTDFSGLLL 192

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRLGVST----PD---- 300
            G + +     +  TPL +  T         Y + ++ I V ++ L +      PD    
Sbjct: 193 LGESNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGA 252

Query: 301 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SF 346
              ++DSGT  TFL         S  L+  SS++    + DP     G+++LCY    S 
Sbjct: 253 GQTMVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRV--LEDPDFVFQGAMDLCYLVPLSQ 310

Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNI 397
             L  +P VT+ FRGA++ +S      +V      ++ + C  F     +     + G+ 
Sbjct: 311 RVLPLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHH 370

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q N  + +D+E+  +      C
Sbjct: 371 HQQNVWMEFDLEKSRIGLAQVRC 393


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 98/288 (34%), Positives = 134/288 (46%), Gaps = 40/288 (13%)

Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
           + CSG +C Y V YGDGS++ G  A +T+TL S      A+ G  FGCG  N GLF  + 
Sbjct: 14  RGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHD----AIKGFRFGCGERNEGLFG-EA 68

Query: 219 TGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK- 277
            G++GLG G  SL  Q      G F++C    SS     GT  +  GPG  S+P   AK 
Sbjct: 69  AGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSS-----GTGYLEFGPG--SSPAVSAKL 121

Query: 278 -----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLL 321
                      TFY + +  I VG + L +     +    ++DSGT +T LP    S+L 
Sbjct: 122 STTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLR 181

Query: 322 SVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNFFVK 374
           S  ++ + A+    A     L+ CY     S+V  P V++ F+G    DV  S   +   
Sbjct: 182 SAFAASMAARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAAS 241

Query: 375 VSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           VS+   C  F G    + V I GN     F V YDI  + V F P  C
Sbjct: 242 VSQ--ACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 165/375 (44%), Gaps = 45/375 (12%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 56  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 112

Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGST 192
              K +PC    CASL+     G +        C Y + Y D   S G L  ++  L  T
Sbjct: 113 ---KLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLT 169

Query: 193 TGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
            G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++     K    +CL
Sbjct: 170 NG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 228

Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDS 305
                  + FG + +V       TP+ ++  + +Y     ++  G++ LGV    +V DS
Sbjct: 229 SLRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDS 287

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTI 357
           G++ T+        L++ +   +      +P  SL LC+     F S+  V +    + +
Sbjct: 288 GSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVL 347

Query: 358 HFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYD 407
           +F       +++   N+ +       C    GI N        + I G+I   + +V YD
Sbjct: 348 NFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYD 404

Query: 408 IEQQTVSFKPTDCTK 422
            E+  + +    C +
Sbjct: 405 NEKGKIGWIRAPCDR 419


>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
 gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
          Length = 414

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 114/428 (26%), Positives = 172/428 (40%), Gaps = 76/428 (17%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           ++  GF ++LIHRDSP+SPFY    T  +R+   +  S  R ++F+   S  SS+A +  
Sbjct: 27  SKPNGFRLQLIHRDSPESPFYPGKLTNSERISRLVEFSKIRAHNFD---SGFSSEAFRPP 83

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  +   YL+++ IG P      V DTGS LIWT                          
Sbjct: 84  VFQDFTCYLVKVRIGNPGIPLYLVPDTGSALIWT-------------------------- 117

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
                      + N   C    C Y+  Y DGS + G  A +   L S   + +      
Sbjct: 118 ---------VNNQNIFQCRNNKCSYTRRYDDGSITTGVAAQDI--LQSEGSERIPF---Y 163

Query: 204 FGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-------S 252
           FGC  +N          K+ G++GL    +SL+ Q+      +FSYCL P         S
Sbjct: 164 FGCSRDNQNFSVFEHTGKSGGVMGLNTSPVSLLQQLSHITQRRFSYCLNPYQHGSEPPPS 223

Query: 253 TKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGV----------STPD 300
           + + FG +         STPL  +  +  Y L +  ++V  QRL +           T  
Sbjct: 224 SLLRFGNDIRKGRRRFQSTPLMSSPDRPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGG 283

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSF---NSLSQVPE 354
            +IDSGT LTF+ Q     L+S   +  +    Q V  P    +LCYSF   ++      
Sbjct: 284 TIIDSGTGLTFITQTAYPRLISAFQNYFDHRGFQRVHIP--EFDLCYSFRGNHTFHDHAS 341

Query: 355 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQT 412
           +T HF  AD  +     ++ + +D    V    T      + G I Q N    YD     
Sbjct: 342 MTFHFERADFTVQADYVYLPMEDDNAFCVALQPTPPQQRTVIGAINQGNTRFIYDAAAHQ 401

Query: 413 VSFKPTDC 420
           + F   +C
Sbjct: 402 LLFIAENC 409


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 50/377 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P        + +  F P+ S T+ S+
Sbjct: 62  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121

Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC S+QC S +  S   C G +  C+ S+SY DGS S+G LATE  T+G       A   
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 181

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +     T+  G+    T G++G+  G +S +SQ  T    +FSYC+       +    + 
Sbjct: 182 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHS 235

Query: 262 IVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------IVI 303
            +    +  TPL +         +  Y + +  I VG + L     V  PD       ++
Sbjct: 236 DLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMV 295

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPT----GSLELCYSFNS----LSQVP 353
           DSGT  TFL     S L +  S   +    A  DP      + + C+         +++P
Sbjct: 296 DSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLP 355

Query: 354 EVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPI----YGNIMQTNFL 403
            VT+ F GA + ++      KV       + + C  F G  + VPI     G+  Q N  
Sbjct: 356 AVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPITAYVIGHHHQMNVW 414

Query: 404 VGYDIEQQTVSFKPTDC 420
           V YD+E+  V   P  C
Sbjct: 415 VEYDLERGRVGLAPIRC 431


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  115 bits (288), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 47/349 (13%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
           ++   RL +    S+++  K +   I P       ANY++R+ +GTP  +   V DT +D
Sbjct: 11  SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
             W  C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ 
Sbjct: 68  AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
           SYG  S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175

Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
           SLISQ     +G FSYCL    S K  + +  +  GP      + +TPL +     + Y 
Sbjct: 176 SLISQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYY 232

Query: 282 LTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           + +  +SVG  ++ + +  +V          IDSGT +T   Q     +       +   
Sbjct: 233 VNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG- 291

Query: 332 PVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIV 380
           P++   G+ + C++  + ++ P VT+HF G ++ L   N  +  S   V
Sbjct: 292 PISS-LGAFDTCFAATNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 107/377 (28%), Positives = 165/377 (43%), Gaps = 50/377 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P        + +  F P+ S T+ S+
Sbjct: 61  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120

Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC S+QC S +  S   C G +  C+ S+SY DGS S+G LATE  T+G       A   
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 180

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG 261
           +     T+  G+    T G++G+  G +S +SQ  T    +FSYC+       +    + 
Sbjct: 181 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVLLLGHS 234

Query: 262 IVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------IVI 303
            +    +  TPL +         +  Y + +  I VG + L     V  PD       ++
Sbjct: 235 DLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQTMV 294

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPT----GSLELCYSFNS----LSQVP 353
           DSGT  TFL     S L +  S   +    A  DP      + + C+         +++P
Sbjct: 295 DSGTQFTFLLGDAYSALKAEFSRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPPARLP 354

Query: 354 EVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPI----YGNIMQTNFL 403
            VT+ F GA + ++      KV       + + C  F G  + VPI     G+  Q N  
Sbjct: 355 AVTLLFNGAQMTVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPITAYVIGHHHQMNVW 413

Query: 404 VGYDIEQQTVSFKPTDC 420
           V YD+E+  V   P  C
Sbjct: 414 VEYDLERGRVGLAPIRC 430


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score =  115 bits (288), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 108/373 (28%), Positives = 163/373 (43%), Gaps = 57/373 (15%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKIN 256
           + FGCG    G   S +    GI+G G  + S++SQ+ ++  +   FS+CL  V    I 
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGI- 333

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VIDSGTT 308
           F    +V  P V  TPL + +  Y + +  I VG   L V +           +IDSGTT
Sbjct: 334 FAIGEVVE-PKVNITPLVQNQAHYNVVMKEIEVGGDPLDVPSDAFESGDRKGTIIDSGTT 392

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE------LCYSF--NSLSQVPEVTIHFR 360
           L + PQ        V   +IE      P   L        C+ +  N     P VT+HF 
Sbjct: 393 LAYFPQ-------EVYVPLIEKILSQQPDLRLHTVEQAFTCFDYTGNVDDGFPTVTLHF- 444

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFK---GITNS---------VPIYGNIMQTNFLVGYDI 408
             D  +S +   V   E +    F+   G  NS         + + G+++ +N LV YD+
Sbjct: 445 --DKSISLT---VYPHEYLFQHEFEWCIGWQNSGAQTKDGKDLTLLGDLVLSNKLVVYDL 499

Query: 409 EQQTVSFKPTDCT 421
           E+Q + +   +C+
Sbjct: 500 EKQGIGWVEYNCS 512


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 159/368 (43%), Gaps = 39/368 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P    +   DTGSD++W  C P   CP          ++DP+ SST   + 
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
           CS   C       +  CS    NC+Y  SYGDGS S G    + +     S+ G A    
Sbjct: 62  CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121

Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            + FGC     G  ++      GI+G G  ++S+ +Q+  +  I   FS+CL        
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGG 180

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGT 307
                G ++ PG+  TPL      Y + +  ISV + RL +   D        +++DSGT
Sbjct: 181 GILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGT 240

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKL 366
           TL + P G  +  +  +     A PV       +       LS + P VT++F G  ++L
Sbjct: 241 TLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMEL 300

Query: 367 SRSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQT 412
              N+ +        + D+ C  ++  ++S        + I G+I+  + LV YD++   
Sbjct: 301 QPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSR 360

Query: 413 VSFKPTDC 420
           + +   +C
Sbjct: 361 IGWMSYNC 368


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 109/407 (26%), Positives = 179/407 (43%), Gaps = 40/407 (9%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNAN-YLIRISIGTPPTERLAV 107
            R+  A  ++ +R  H      ++        Q    PN+   Y  ++ +GTPP E    
Sbjct: 35  HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQ 94

Query: 108 ADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---C 161
            DTGSD++W  C     CP S     +   FD   SST   +PCS   C S  Q +   C
Sbjct: 95  IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAEC 154

Query: 162 S-GVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PGITFGCGTNNGGLF-- 214
           S  VN C Y+  YGDGS ++G   ++ +      GQ  A+     I FGC  +  G    
Sbjct: 155 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTK 214

Query: 215 -NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVST 271
            +    GI G G G +S++SQ+  R      FS+CL              I+  P +V +
Sbjct: 215 TDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCLKGDGDGGGVLVLGEILE-PSIVYS 273

Query: 272 PLTKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTLTFLPQGYNSNLL 321
           PL  ++  Y L + +I+V  Q L ++ P +          ++D GTTL +L Q     L+
Sbjct: 274 PLVPSQPHYNLNLQSIAVNGQLLPIN-PAVFSISNNRGGTIVDCGTTLAYLIQEAYDPLV 332

Query: 322 SVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR-GADVKLSRSNFFVK---- 374
           + +++ + +Q           CY  + S+  + P V+++F  GA + L    + +     
Sbjct: 333 TAINTAV-SQSARQTNSKGNQCYLVSTSIGDIFPSVSLNFEGGASMVLKPEQYLMHNGYL 391

Query: 375 VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
              ++ C  F+       I G+++  + +V YDI QQ + +   DC+
Sbjct: 392 DGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIGWANYDCS 438


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 99/349 (28%), Positives = 158/349 (45%), Gaps = 47/349 (13%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
           ++   RL +    S+++  K +   I P       ANY++R+ +GTP  +   V DT +D
Sbjct: 11  SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
             W  C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ 
Sbjct: 68  AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
           SYG  S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175

Query: 230 SLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP-----GVVSTPLTK---AKTFYV 281
           SLISQ     +G FSYCL    S K  + +  +  GP      + +TPL +     + Y 
Sbjct: 176 SLISQAGAMYSGVFSYCL---PSFKSYYFSGSLKLGPVGQPKSIRTTPLLRNPHRPSLYY 232

Query: 282 LTIDAISVGNQRLGVSTPDIV----------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           + +  +SVG  ++ + +  +V          IDSGT +T   Q     +       +   
Sbjct: 233 VNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVITRFVQPVYFAIRDEFRKQVNG- 291

Query: 332 PVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIV 380
           P++   G+ + C++  + ++ P VT+HF G ++ L   N  +  S   V
Sbjct: 292 PISS-LGAFDTCFAETNEAEAPAVTLHFEGLNLVLPMENSLIHSSSGSV 339


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 118/448 (26%), Positives = 190/448 (42%), Gaps = 55/448 (12%)

Query: 8   VFILFFLCFYVVSPIEA------QTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTR 60
           +F L FL F +   +        Q  G ++++ H  SP SPF+ S    ++  +     +
Sbjct: 5   LFSLAFLFFTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAK 64

Query: 61  SLNRLNHFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
              RL      SS+ + K     AS   I+  +  Y++R  IGTP    L   DT +D  
Sbjct: 65  DQARLQFL---SSLVARKSVVPIASGRQIV-QSPTYIVRAKIGTPAQTMLLAMDTSNDAA 120

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG 175
           W  C     S C    S +F+   S+T+K++ C + QC  +    C G  C ++++YG  
Sbjct: 121 WIPC-----SGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSS 175

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S +  NL+ + VTL + +     +P  TFGC T   G  +    G++GLG G +SL+SQ 
Sbjct: 176 SIA-ANLSQDVVTLATDS-----IPSYTFGCLTEATG-SSIPPQGLLGLGRGPMSLLSQT 228

Query: 236 RTTIAGKFSYCLVPVSSTKINFGTN---GIVSGPG-VVSTPLTK---AKTFYVLTIDAIS 288
           +      FSYCL    S  +NF  +   G V  P  + +TPL K     + Y + + AI 
Sbjct: 229 QNLYQSTFSYCLPSFRS--LNFSGSLRLGPVGQPKRIKTTPLLKNPRRSSLYYVNLMAIR 286

Query: 289 VGNQRLGVSTPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
           VG + + +    +          + DSGT  T L     + +       +    V    G
Sbjct: 287 VGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLVAPAYTAVRDAFRKRVGNATVTS-LG 345

Query: 339 SLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSV----PI 393
             + CY+  S    P +T  F G +V L   N  +   +  I C       ++V     +
Sbjct: 346 GFDTCYT--SPIVAPTITFMFSGMNVTLPPDNLLIHSTASSITCLAMAAAPDNVNSVLNV 403

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N+ Q N  + +D+    +      CT
Sbjct: 404 IANMQQQNHRILFDVPNSRLGVAREPCT 431


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 121/415 (29%), Positives = 189/415 (45%), Gaps = 44/415 (10%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
           P  +  E    R RD L     R     Q+SS     + Q    P     Y  ++ +GTP
Sbjct: 33  PTNHGVELSQLRARDEL-----RHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTKVQLGTP 87

Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           P E     DTGSD++W  C     CP +         FDP  SST   + CS  +C +  
Sbjct: 88  PVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGK 147

Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
           Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + A P + FGC 
Sbjct: 148 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTA-P-VVFGCS 205

Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKINFGTNGI 262
               G     +    GI G G  ++S+ISQ+ +  IA + FS+CL   SS         I
Sbjct: 206 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEI 265

Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ 314
           V  P +V T L  A+  Y L + +ISV  Q L +        ++   ++DSGTTL +L +
Sbjct: 266 VE-PNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAE 324

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNF 371
                 +S +++ I  Q V         CY   +S++ V P+V+++F  GA + L   ++
Sbjct: 325 EAYDPFVSAITAAIP-QSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDY 383

Query: 372 FVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            ++ +      + C  F+ I    + I G+++  + +V YD+  Q + +   DC+
Sbjct: 384 LIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 172/375 (45%), Gaps = 43/375 (11%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + C+   C   + K+     C Y   Y + S S+G L  + V+ G+ +  
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
            +      FGC  +  G LF+    GI+GLG G +S++ Q+  +  I   FS C      
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 238

Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI----- 301
             ++ G   +V G     PG++ T     ++ +Y + +  + V  + L V  P I     
Sbjct: 239 GGMDIGGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVD-PRIFDGKH 297

Query: 302 --VIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV 352
             V+DSGTT  +LP+         +SS +   + +  P  +  ++C++      + LS+V
Sbjct: 298 GTVLDSGTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEV 357

Query: 353 -PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
            P+V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD
Sbjct: 358 FPKVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYD 417

Query: 408 IEQQTVSFKPTDCTK 422
              + + F  T+C++
Sbjct: 418 RHNEKIGFWKTNCSE 432


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 171/374 (45%), Gaps = 41/374 (10%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 70  SARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 127

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + CS+  C   + KS     C Y   Y + S S+G L  + V+ G  T  
Sbjct: 128 QPDLSSTYSPVKCSAD-CTCDSDKS----QCTYERQYAEMSSSSGVLGEDIVSFG--TES 180

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
            +      FGC  +  G LF+    GI+GLG G +S++ Q+  +  I   FS C      
Sbjct: 181 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCY----- 235

Query: 253 TKINFGTNGIVSG-----PGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPD 300
             ++ G   +V G     P +V +     ++ +Y + +  I V  + L +      S   
Sbjct: 236 GGMDIGGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHG 295

Query: 301 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV- 352
            V+DSGTT  +LP Q + +   +V S +   + +  P  +  ++C++      + LSQ  
Sbjct: 296 TVLDSGTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAF 355

Query: 353 PEVTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDI 408
           P+V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD 
Sbjct: 356 PDVDMVFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDR 415

Query: 409 EQQTVSFKPTDCTK 422
             + + F  T+C++
Sbjct: 416 HNEKIGFWKTNCSE 429


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score =  115 bits (287), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 192/415 (46%), Gaps = 44/415 (10%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
           P  ++ E    R RDAL     R     Q+S+     + Q    P     Y  ++ +GTP
Sbjct: 30  PTNHTVELSQLRARDAL-----RHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTP 84

Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           P E     DTGSD++W  C     CP +         FDP  SST   + CS  +C +  
Sbjct: 85  PVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGI 144

Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
           Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + A P + FGC 
Sbjct: 145 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTA-P-VVFGCS 202

Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKINFGTNGI 262
               G     +    GI G G  ++S+ISQ+ +  IA + FS+CL   SS         I
Sbjct: 203 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEI 262

Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGTTLTFLPQ 314
           V  P +V T L  A+  Y L + +I+V  Q L +        ++   ++DSGTTL +L +
Sbjct: 263 VE-PNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAE 321

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNF 371
                 +S +++ I  Q V         CY   +S+++V P+V+++F  GA + L   ++
Sbjct: 322 EAYDPFVSAITASIP-QSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDY 380

Query: 372 FVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            ++ +      + C  F+ I    + I G+++  + +V YD+  Q + +   DC+
Sbjct: 381 LIQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 172/368 (46%), Gaps = 43/368 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W  C     CP S     +   FD   SST   +P
Sbjct: 84  YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143

Query: 148 CSSSQCASLNQKS---CS-GVN-CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
           CS   CAS  Q +   CS  VN C Y+  Y DGS ++G   ++     + LG +T   VA
Sbjct: 144 CSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVA 203

Query: 199 LPG-ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
               I FGC T   G     +    GI+G G G++S++SQ+  R      FS+CL     
Sbjct: 204 SSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCL----- 258

Query: 253 TKINFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPD-- 300
            K +    GI     +  P +V +PL  ++  Y L + +I+V  Q L +     +T D  
Sbjct: 259 -KGDGNGGGILVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQVLSINPAVFATSDKR 317

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIH 358
             +IDSGTTL++L Q     L++ + + +     +  +   +      S+    P V+ +
Sbjct: 318 GTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQCYLVLTSIDDSFPTVSFN 377

Query: 359 FR-GADVKLSRSNFFV----KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           F  GA + L  S + +    +    + C  F+ +   V I G+++  + +V YD+ +Q +
Sbjct: 378 FEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQKVQEGVTILGDLVLKDKIVVYDLARQQI 437

Query: 414 SFKPTDCT 421
            +   DC+
Sbjct: 438 GWTNYDCS 445


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 109/386 (28%), Positives = 175/386 (45%), Gaps = 68/386 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P      +   S  F P+ SST+ ++
Sbjct: 81  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAV 138

Query: 147 PCSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC+S+QC S +  S   C G    C  S+SY DGS S+G LAT+   +GS      A   
Sbjct: 139 PCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAA--- 195

Query: 202 ITFGCGTNNGGLFNS-----KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI- 255
             FGC ++    F+S      + G++G+  G +S +SQ  T    +FSYC+       + 
Sbjct: 196 --FGCMSSA---FDSSPDGVASAGLLGMNRGALSFVSQASTR---RFSYCISDRDDAGVL 247

Query: 256 NFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD--- 300
             G + + +   +  TP+ +         +  Y + +  I VG + L     V  PD   
Sbjct: 248 LLGHSDLPTFLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTG 307

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA----DPT----GSLELCYSFNS- 348
               ++DSGT  TFL     S L +  +   +A+P+     DP+     + + C+     
Sbjct: 308 AGQTMVDSGTQFTFLLGDAYSALKAEFTR--QARPLLPALDDPSFAFQEAFDTCFRVPQG 365

Query: 349 ----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVPIY---- 394
                +++P VT+ F GA++ ++      KV       + + C  F G  + VPI     
Sbjct: 366 RSPPTARLPGVTLLFNGAEMAVAGDRLLYKVPGERRGGDGVWCLTF-GNADMVPIMAYVI 424

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDC 420
           G+  Q N  V YD+E+  V   P  C
Sbjct: 425 GHHHQMNVWVEYDLERGRVGLAPVRC 450


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 153/355 (43%), Gaps = 32/355 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
           Y + IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + 
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 64

Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CS+  C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++ 
Sbjct: 65  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 120

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGT 259
              FGCG +N  L+N    GI+G G    S  +Q+ + T    FSYC       + +   
Sbjct: 121 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI 178

Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
                   ++ T L     K  Y +    + V   RL +      +   ++DSGT  T++
Sbjct: 179 GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYI 238

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSR 368
                  L   M+  ++A+          +C+  NS     +  P V +    + +KL  
Sbjct: 239 LSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPV 298

Query: 369 SNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N F + S +++CS F         V + GN    +F + +DI+     FK   C
Sbjct: 299 ENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 353


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 110/414 (26%), Positives = 181/414 (43%), Gaps = 45/414 (10%)

Query: 36  RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
           R +P  P +      Y    RL  +L R L    H N       ++    D +  N  Y 
Sbjct: 37  RPAPGPPLFLPLTRSYPNASRLAASLRRGLGDGVHPN-------ARMRLHDDLLTNGYYT 89

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
            R+ IGTPP E   + D+GS + +  C  C   QC     P F P +SS+Y  + C+   
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCSSC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 147

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
               ++K C+     Y   Y + S S+G L  + V+ G  +   +      FGC  +  G
Sbjct: 148 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETG 200

Query: 212 GLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV 269
            LF+    GI+GLG G +S++ Q+  +  I+  FS C   +          G+++ P ++
Sbjct: 201 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260

Query: 270 ---STPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVIDSGTTLTFLP-QGYNSN 319
              S PL     +Y + +  I V  + L V      S    V+DSGTT  +LP Q + + 
Sbjct: 261 FSNSDPLRSP--YYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAF 318

Query: 320 LLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNF 371
             +V S +   + +  P  S  ++C++      + L +V P+V + F  G  + L+  N+
Sbjct: 319 KEAVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENY 378

Query: 372 FV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
                KV       VF+   +   + G I+  N LV YD   + + F  T+C++
Sbjct: 379 LFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSE 432


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 166/374 (44%), Gaps = 44/374 (11%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C   +  C Y + Y D   S G L  ++  L  T 
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171

Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
           G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + +V       TP+ ++  + +Y     ++  G++ LGV    +V DSG
Sbjct: 231 LRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
           ++ T+        L++ +   +      +P  SL LC+     F S+  V +    + ++
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLN 349

Query: 359 FRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
           F       +++   N+ +       C    GI N        + I G+I   + +V YD 
Sbjct: 350 FASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDN 406

Query: 409 EQQTVSFKPTDCTK 422
           E+  + +    C +
Sbjct: 407 EKGKIGWIRAPCDR 420


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 124/444 (27%), Positives = 186/444 (41%), Gaps = 50/444 (11%)

Query: 22  IEAQTG-GFSVELIHR--DSPKS--------------PFYNSSETPYQRLRDALTRSLNR 64
            EA  G  FS +LIHR  D  KS              P   S E     L + L R   +
Sbjct: 20  FEASIGLTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMK 79

Query: 65  LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE-- 120
           L    +N  +  S+ SQA    N  ++L    I IGTP    L   D GSDL+W  C+  
Sbjct: 80  LGS-QKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCI 138

Query: 121 PCPP-SQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGD 174
            C P S  Y      +D   + P +SST + L C    C   +        C Y  +Y D
Sbjct: 139 QCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDD 198

Query: 175 --GSFSNGNLATETVTL---GSTTGQAVALPGITFGCGTNNGGLF--NSKTTGIVGLGGG 227
              + S G L  + + L   G  T + +    +  GCG   GG F   +   G++GLG G
Sbjct: 199 FENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPG 258

Query: 228 DISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTID 285
           DIS+ S +     I   FS C     S +I FG  G  S       P+      Y + ++
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVE 318

Query: 286 AISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
           +  VGN  L  S    ++DSG++ T+LP    + L+S     + A+ ++   G  + CY+
Sbjct: 319 SYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378

Query: 346 FNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSVPIYGNIMQT 400
            +S  L  +P + + F R  +  +    + +   +   + C   +    S   YG I Q 
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS---YGIIGQ- 434

Query: 401 NFLVGY----DIEQQTVSFKPTDC 420
           NF++GY    DIE   + +  + C
Sbjct: 435 NFMIGYRMVFDIENLKLGWSNSSC 458


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 98/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   S  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
              I FGC  +  G     +    GI G G   +S++SQ+ +  ++ K FS+CL   S  
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
                  G +  PG+V TPL  ++  Y L +++I V  Q+L +        +T   ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
           GTTL +L  G     ++ +++ +    V         C+  +S   S  P V+++F G  
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 388

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
               +   ++     I  +V   I         + I G+++  + +  YD+    + +  
Sbjct: 389 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 448

Query: 418 TDCT 421
            DC+
Sbjct: 449 YDCS 452


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 104/379 (27%), Positives = 174/379 (45%), Gaps = 64/379 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 57  HNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 110

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA +T  +GS T      
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVT-----R 165

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN 256
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+  +   KFSYC+    S+ I 
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQLGFS---KFSYCISGSDSSGIL 222

Query: 257 FGTNGIVSGPGVVS-TPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD--- 300
              +   S  G +  TPL    T         Y + ++ I VG++ L     V  PD   
Sbjct: 223 LLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 282

Query: 301 ---IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS--- 348
               ++DSGT  TFL     + L +   +  ++  + V DP     G+++LCY   S   
Sbjct: 283 AGQTMVDSGTQFTFLMGPVYTALKNEFIAQTKSVLRIVDDPNFVFQGTMDLCYRVGSSTR 342

Query: 349 --LSQVPEVTIHFRGADVKLSRSNFFVKVS-------EDIVCSVFKG---ITNSVPIYGN 396
              + +P +++ FRGA++ +S      +V+       E++ C  F     +     + G+
Sbjct: 343 PNFTGLPVISLMFRGAEMSVSGQKLLYRVNGAGSEGKEEVYCFTFGNSDLLGIEAFVIGH 402

Query: 397 IMQTNFLVGYDIEQQTVSF 415
             Q N  + +D+ +  V F
Sbjct: 403 HHQQNVWMEFDLAKSRVGF 421


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score =  114 bits (285), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 97/355 (27%), Positives = 153/355 (43%), Gaps = 32/355 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
           Y + IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + 
Sbjct: 25  YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 83

Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CS+  C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++ 
Sbjct: 84  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 139

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGT 259
              FGCG +N  L+N    GI+G G    S  +Q+ + T    FSYC       + +   
Sbjct: 140 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTI 197

Query: 260 NGIVSGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFL 312
                   ++ T L     K  Y +    + V   RL +      +   ++DSGT  T++
Sbjct: 198 GPYARDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYI 257

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSR 368
                  L   M+  ++A+          +C+  NS     +  P V +    + +KL  
Sbjct: 258 LSPVFDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPV 317

Query: 369 SNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N F + S +++CS F         V + GN    +F + +DI+     FK   C
Sbjct: 318 ENAFYESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 372


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 108/387 (27%), Positives = 178/387 (45%), Gaps = 60/387 (15%)

Query: 83  DIIP--NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           D +P  +N +  + +++GTPP     V DTGS+L W  C     SQ     S  F+P  S
Sbjct: 63  DKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCN---TSQNSSSSSSTFNPVWS 119

Query: 141 STYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S+Y  +PCSSS C    +      SC S   C  ++SY D S S GNLAT+T  +GS+  
Sbjct: 120 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSS-- 177

Query: 195 QAVALPGITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS 251
               +P + FGC     ++    +SK TG++G+  G +S +SQM      KFSYC+    
Sbjct: 178 ---GIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISEYD 231

Query: 252 STKI------NFGTNGIVSGPGVV--STPLTK-AKTFYVLTIDAISVGNQRL----GVST 298
            + +      NF     ++   ++  STPL    +  Y + ++ I V ++ L     V  
Sbjct: 232 FSGLLLLGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFE 291

Query: 299 PD------IVIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPT-----GSLELCYSF 346
           PD       ++DSGT  TF L   Y +     ++    +  V + +     G+++LCY  
Sbjct: 292 PDHTGAGQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGAMDLCYRV 351

Query: 347 ----NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPI 393
                 L  +P VT+ FRGA++ ++      +V      ++ I C  F     +     +
Sbjct: 352 PTNQTRLPPLPSVTLVFRGAEMTVTGDRILYRVPGERRGNDSIHCFTFGNSDLLGVEAFV 411

Query: 394 YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            G++ Q N  + +D+++  +      C
Sbjct: 412 IGHLHQQNVWMEFDLKKSRIGLAEIRC 438


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 153/369 (41%), Gaps = 51/369 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +Y++R  +G+P    L   DT +D  W  C PC    C    S LF P  S++Y  LPCS
Sbjct: 76  SYVVRAGLGSPAQPILLALDTSADATWAHCSPC--GTCPSSGS-LFAPANSTSYAPLPCS 132

Query: 150 SSQCASLNQKSCSGVN----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           S+ C  L  + C   +          C ++  + D SF   +LA++ + LG       A+
Sbjct: 133 STMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKD-----AI 186

Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS----STK 254
           P   FGC    +G   N    G++GLG G ++L+SQ+     G FSYCL        S  
Sbjct: 187 PNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQVGNMYNGVFSYCLPSYKSYYFSGS 246

Query: 255 INFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTP------------ 299
           +  G  G     GV  TP+ K     + Y + +  +SVG  R  V  P            
Sbjct: 247 LRLGAAGQPR--GVRYTPMLKNPNRSSLYYVNVTGLSVG--RAPVKVPAGSFAFDPATGA 302

Query: 300 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 357
             V+DSGT +T       + L       + A       G+ + C++ + ++    P VT+
Sbjct: 303 GTVVDSGTVITRWTPPVYAALREEFRRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPAVTV 362

Query: 358 HFRGA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           H  G  D+ L   N  +  S   + C       + +   V +  N+ Q N  V +D+   
Sbjct: 363 HMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDVANS 422

Query: 412 TVSFKPTDC 420
            V F    C
Sbjct: 423 RVGFARESC 431


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
              I FGC  +  G     +    GI G G   +S++SQ+ +  ++ K FS+CL   S  
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 295

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
                  G +  PG+V TPL  ++  Y L +++I V  Q+L +        +T   ++DS
Sbjct: 296 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 355

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
           GTTL +L  G     ++ +++ +    V         C+  +S   S  P V+++F G  
Sbjct: 356 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 414

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
               +   ++     I  +V   I         + I G+++  + +  YD+    + +  
Sbjct: 415 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 474

Query: 418 TDCT 421
            DC+
Sbjct: 475 YDCS 478


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 48/370 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP +     DTGSD++W    +C  CP       D  L+DPK S T   + 
Sbjct: 70  YFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVS 129

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G    + C YS++YGDGS + G    + +T     G     P   
Sbjct: 130 CDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNS 189

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G   S +     GI+G G  + S++SQ+  +  +   FS+CL  V    
Sbjct: 190 SIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDNVRGGG 249

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL--------GVSTPDIVIDSG 306
           I F    +V  P V +TPL      Y + + +I V    L         V+    VIDSG
Sbjct: 250 I-FAIGEVVE-PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPSDIFDSVNGKGTVIDSG 307

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC--------YSFNSLSQVPEVTIH 358
           TTL +LP         V   +I+      P   L L         Y+ N     P V +H
Sbjct: 308 TTLAYLPD-------IVYDELIQKVLARQPGLKLYLVEQQFRCFLYTGNVDRGFPVVKLH 360

Query: 359 FRGA-DVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQ 411
           F+ +  + +   ++  +  + I C  ++           + + G+++ +N LV YD+E  
Sbjct: 361 FKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDLENM 420

Query: 412 TVSFKPTDCT 421
            + +   +C+
Sbjct: 421 VIGWTDYNCS 430


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 97/364 (26%), Positives = 164/364 (45%), Gaps = 35/364 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
              I FGC  +  G     +    GI G G   +S++SQ+ +  ++ K FS+CL   S  
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
                  G +  PG+V TPL  ++  Y L +++I V  Q+L +        +T   ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
           GTTL +L  G     ++ +++ +    V         C+  +S   S  P V+++F G  
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAVSPS-VRSLVSKGNQCFVTSSSVDSSFPTVSLYFMGGV 388

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
               +   ++     I  +V   I         + I G+++  + +  YD+    + +  
Sbjct: 389 AMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFVYDLANMRMGWTD 448

Query: 418 TDCT 421
            DC+
Sbjct: 449 YDCS 452


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 159/368 (43%), Gaps = 39/368 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P    +   DTGSD++W  C P   CP          ++DP+ SST   + 
Sbjct: 29  YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88

Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
           CS   C       +  CS    NC+Y  SYGDGS S G    + +     S+ G A    
Sbjct: 89  CSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 148

Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            + FGC     G  ++      GI+G G  ++S+ +Q+  +  I   FS+CL        
Sbjct: 149 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-EGEKRGG 207

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGT 307
                G ++ PG+  TPL      Y + +  ISV + RL +   D        +++DSGT
Sbjct: 208 GILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGT 267

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKL 366
           TL + P G  +  +  +     A PV       +       LS + P VT++F G  ++L
Sbjct: 268 TLAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMEL 327

Query: 367 SRSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQT 412
              N+ +        + D+ C  ++  ++S        + I G+I+  + LV YD++   
Sbjct: 328 QPDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSR 387

Query: 413 VSFKPTDC 420
           + +   +C
Sbjct: 388 IGWMSYNC 395


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 85/261 (32%), Positives = 125/261 (47%), Gaps = 30/261 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 33  YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 92

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 93  CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 152

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
            +TFGCG+  GG     N    GI+G G  + S++SQ+  + AGK    F++CL  ++  
Sbjct: 153 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL--SAAGKVKKIFAHCLDTINGG 210

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
            I F    +V  P V +TPL      Y + + +I VG   L +             +IDS
Sbjct: 211 GI-FAIGNVVQ-PKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDS 268

Query: 306 GTTLTFLPQ-GYNSNLLSVMS 325
           GTTLT+LP+  Y   +L+V +
Sbjct: 269 GTTLTYLPEIVYKEIMLAVFA 289


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 166/374 (44%), Gaps = 51/374 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--PPSQCYMQDSPL--FDPKMSSTYKS 145
            Y +R  +GTP    + VADTGSDL W +C     PP+     D P   F    S ++  
Sbjct: 13  QYFVRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPAS----DPPAREFRASESRSWAP 68

Query: 146 LPCSSSQCAS-----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG---------- 190
           L CSS  C S     L   S     C Y   Y DGS + G + T+  T+           
Sbjct: 69  LACSSDTCTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGS 128

Query: 191 STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP 249
              G+   L G+  GC  T +G  F S + G++ LG  +IS  S+      G+FSYCLV 
Sbjct: 129 GGGGRRAKLQGVVLGCTATYDGQSFQS-SDGVLSLGNSNISFASRAAARFGGRFSYCLVD 187

Query: 250 V-----SSTKINFGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI 301
                 +S+ + FG      G     TPL    +   FY + +DA+ V  + L +   D+
Sbjct: 188 HLAPRNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPA-DV 246

Query: 302 ---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNS-L 349
                    ++DSGT+LT L       +++ +   + A P    DP    E CY++ +  
Sbjct: 247 WDVGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAMDP---FEYCYNWTAGA 303

Query: 350 SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 407
            ++P++ + F G A ++    ++ +  +  + C  V +G    V + GNI+Q   L  +D
Sbjct: 304 PEIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFD 363

Query: 408 IEQQTVSFKPTDCT 421
           +  + + FK T C 
Sbjct: 364 LRDRWLRFKHTRCA 377


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 115/388 (29%), Positives = 177/388 (45%), Gaps = 50/388 (12%)

Query: 65  LNHFNQNSSISSSKASQA--DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
             +    +S  S+KA Q   D     + Y+I + +GTP   ++   DTGS   W  CE C
Sbjct: 54  FRYITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C 112

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSF 177
               C+         + S+T   + C +S C         Q S +  +C + VSY DGS 
Sbjct: 113 --DGCHTNPRTFLQSR-STTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSA 169

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN-SKTTGIVGLGGGDISLISQMR 236
           S G L  +T+T          +PG +FGC  ++ G        G++G+G G +S++ Q  
Sbjct: 170 SYGILYQDTLTFSDVQ----KIPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSS 225

Query: 237 TTIAGKFSYCLVPVSSTKINF--GTNGIVSGPGVVST----------PLTKAKTFYVLTI 284
            T    FSYCL P+  ++  F   T G  S  G V+T             K    + + +
Sbjct: 226 PTFDC-FSYCL-PLQKSERGFFSKTTGYFS-LGKVATRTDVRYTKMVARKKNTELFFVDL 282

Query: 285 DAISVGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADP 336
            AISV  +RLG+     S   +V DSG+ L+++P       LSV+S  I     +  A  
Sbjct: 283 TAISVDGERLGLSPSVFSRKGVVFDSGSELSYIPD----RALSVLSQRIRELLLKRGAAE 338

Query: 337 TGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVS---EDIVCSVFKGITNS 390
             S   CY   S+ +  +P +++HF  GA   L     FV+ S   +D+ C  F   T S
Sbjct: 339 EESERNCYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF-APTES 397

Query: 391 VPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           V I G++MQT+  V YD+++Q +   P+
Sbjct: 398 VSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score =  113 bits (283), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           K  QA +   N  +L++++IG P     A+ DTGSDL WTQC PC  S CY Q +P++DP
Sbjct: 8   KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPC--SDCYKQPTPIYDP 65

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            +SSTY ++ C SS C +L   +C    C+Y  +YGD S + G L+ ET TL S +    
Sbjct: 66  SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121

Query: 198 ALPGITFGCGTNNGG 212
            +P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 165/358 (46%), Gaps = 33/358 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P +SSTY+S+ 
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSC--EQCGRHQDPKFQPDLSSTYQSVK 67

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC- 206
           C+   C   ++K      C Y   Y + S S+G L  + ++ G+ +  A+A     FGC 
Sbjct: 68  CNID-CNCDDEKQ----QCVYERQYAEMSTSSGVLGEDIISFGNLS--ALAPQRAVFGCE 120

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
               G L++    GI+G+G GD+S++  +  +  I   FS C   +          GI  
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYGGMGIGGGAMVLGGISP 180

Query: 265 GPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQ-G 315
              +V +     ++ +Y + +  I V  + L ++ P +       ++DSGTT  +LP+  
Sbjct: 181 PSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLN-PTVFDGKHGTILDSGTTYAYLPEAA 239

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RGADVKLS 367
           + S   ++M  +   +P+  P  +  ++C+S   + +SQ+    P V + F  G  + LS
Sbjct: 240 FVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFPAVEMVFGNGQKLLLS 299

Query: 368 RSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
             N+     KV       +F+   +   + G I+  N LV YD E   + F  T+C++
Sbjct: 300 PENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFWKTNCSE 357


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 109/360 (30%), Positives = 166/360 (46%), Gaps = 48/360 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+I + +GTP   ++   DTGS   W  CE C    C+         + S+T   + C +
Sbjct: 82  YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 137

Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C         Q S +  +C + VSY DGS S G L  +T+T          +P  TFG
Sbjct: 138 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPSFTFG 193

Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
           C  ++ G        G++G+G G +S++ Q      G FSYCL P+  ++  F   T G 
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDG-FSYCL-PLQKSERGFFSKTTGY 251

Query: 263 VSGPGVVST----------PLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGT 307
            S  G V+T             K    + + + AISV  +RLG+     S   +V DSG+
Sbjct: 252 FS-LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGS 310

Query: 308 TLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-RG 361
            L+++P       LSV+S  I     +  A    S   CY   S+ +  +P +++HF  G
Sbjct: 311 ELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDG 366

Query: 362 ADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           A   L     FV+ S   +D+ C  F   T SV I G++MQT+  V YD+++Q +   P+
Sbjct: 367 ARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 91/363 (25%), Positives = 170/363 (46%), Gaps = 33/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G P  E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 89  YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148

Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
           CS  +C +  Q     C   +     C Y+ +YGDGS ++G   ++T+   +  G    A
Sbjct: 149 CSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTA 208

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
            +   + FGC  +  G     +    GI G G   +S++SQ+ +  ++ K FS+CL   S
Sbjct: 209 NSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHCL-KGS 267

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
                    G +  PG+V TPL  ++  Y L +++I+V  Q+L +        +T   ++
Sbjct: 268 DNGGGILVLGEIVEPGLVFTPLVPSQPHYNLNLESIAVSGQKLPIDSSLFATSNTQGTIV 327

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFRGA 362
           DSGTTL +L  G     ++ +++ +     +  +  ++   + +S+ S  P  T++F+G 
Sbjct: 328 DSGTTLVYLVDGAYDPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVDSSFPTATLYFKGG 387

Query: 363 -DVKLSRSNFFVK---VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             + +   N+ ++   V  +++  +    +  + I G+++  + +  YD+    + +   
Sbjct: 388 VSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFVYDLANMRMGWADY 447

Query: 419 DCT 421
           DC+
Sbjct: 448 DCS 450


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 73/380 (19%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +I + IGTPP  +  V DTGS L W QC  +  PP     +    FDP +SS++ +LPCS
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127

Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
              C       +L     S   C YS  Y DG+F+ GNL  E +T  +T       P + 
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-- 261
            GC T      +S   GI+G+  G +S +SQ + +   KFSYC +P  S +  F   G  
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIS---KFSYC-IPPKSNRPGFTPTGSF 234

Query: 262 -IVSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD--- 300
            +   P           TF             Y + +  I  G ++L +S     PD   
Sbjct: 235 YLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGG 294

Query: 301 ---IVIDSGTTLTFL-PQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVP 353
               ++DSG+  T L    Y+     +M+ +   ++   V    G+ ++C+  N ++ +P
Sbjct: 295 SGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYG--GTADMCFDGN-VAMIP 351

Query: 354 E-----VTIHFRGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNF 402
                 V +  RG ++ + +    V V   I C      S+    +N   I GN+ Q N 
Sbjct: 352 RLIGDLVFVFTRGVEILVPKERVLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNL 408

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            V +D+  + V F   DC++
Sbjct: 409 WVEFDVTNRRVGFAKADCSR 428


>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
          Length = 304

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 104/351 (29%), Positives = 159/351 (45%), Gaps = 69/351 (19%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP-----LFDPKMSSTYKSLP 147
           + +++GTPP    A+    SDL W +C PC  S C    +P     L+D   SS++   P
Sbjct: 1   MELAVGTPPVTVQALFGI-SDLCWVECTPC--SGCNNNAAPPAGARLYDRANSSSFS--P 55

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + ++C         G    Y  +  D ++  G L TET+  GS    A  +   TFGC 
Sbjct: 56  LADTEC---------GYRYVYGATDTDRNYVKGILGTETIKFGSN--DAATVQSFTFGC- 103

Query: 208 TN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV--PVSSTKINFGTNGI 262
           TN      LF+  T G+VGLG   +SL+ Q+      +FSYCL   P  ++ + FG+   
Sbjct: 104 TNTVYRNDLFDGNT-GVVGLGRSKLSLVGQLGLD---RFSYCLASNPNVASPVLFGSTAS 159

Query: 263 VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
           + G GV STPL      Y + +  ISV   RL +                      N  +
Sbjct: 160 MDGNGVSSTPLLPDDANYYVNLLGISVDGTRLAIP---------------------NDTA 198

Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSE- 377
            MS   EA       GS  LC+  +  S+    VP +T+HF G D++L   N+F    + 
Sbjct: 199 RMSRTYEAV-----NGSGLLCFLVDDASKNVVTVPTMTMHFDGMDMELLFGNYFAYTGKQ 253

Query: 378 ------DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
                 D++C +  G +++    GN +Q +F V Y+++   +S +P DC K
Sbjct: 254 SGGGGGDVLC-LMIGKSSTGSRIGNYLQMDFHVLYELKNSVLSVQPADCGK 303


>gi|255576064|ref|XP_002528927.1| pepsin A, putative [Ricinus communis]
 gi|223531629|gb|EEF33456.1| pepsin A, putative [Ricinus communis]
          Length = 493

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 123/515 (23%), Positives = 214/515 (41%), Gaps = 125/515 (24%)

Query: 1   MATFLSC-VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-YQRLRDAL 58
           MAT  SC  F+ F LCF  +S   ++     + L H  S      N+  T  +  L+   
Sbjct: 1   MAT--SCYAFLCFILCFSCISVSISEI--LYLPLTHSLS------NTQFTSTHHLLKSTS 50

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV-ADTGSDLIWT 117
           +RS +R  H +Q   + +       + P  ++Y +  ++ + P + +++  DTGSDL+W 
Sbjct: 51  SRSASRFQHQHQKRHLRNRHQVSLPLSPG-SDYTLSFTLNSNPPQHVSLYLDTGSDLVWF 109

Query: 118 QCEPCPPSQCYMQDSPLFD-------PKMSSTYKSLPCSSSQCA---------------- 154
              PC P +C + +    +       P++SST +S+ C SS C+                
Sbjct: 110 ---PCKPFECILCEGKAENTTASTPPPRLSSTARSVHCKSSACSAAHSNLPTSDLCAIAD 166

Query: 155 ----SLNQKSCSGVNC-QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
               S+    C   +C  +  +YGDGS     L  +++ L   T  +++L   TFGC   
Sbjct: 167 CPLESIETSDCHSFSCPSFYYAYGDGSLV-ARLYHDSIKLPLAT-PSLSLHNFTFGCAHT 224

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRT---TIAGKFSYCLVP----------------- 249
                 ++  G+ G G G +SL +Q+ +    +  +FSYCLV                  
Sbjct: 225 A----LAEPVGVAGFGRGVLSLPAQLASFAPQLGNRFSYCLVSHSFNSDRLRLPSPLILG 280

Query: 250 --------VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD- 300
                   V+   + F    ++  P        K   FY + ++ IS+G ++  +  P+ 
Sbjct: 281 HSDDKEKRVNKDDVQFVYTSMLDNP--------KHPYFYCVGLEGISIGKKK--IPAPEF 330

Query: 301 -----------IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCY 344
                      +V+DSGTT T LP    +++++   + +      A+ V D TG L  CY
Sbjct: 331 LKRVDREGSGGVVVDSGTTFTMLPASLYNSVVAEFDNRVGRVYERAKEVEDKTG-LGPCY 389

Query: 345 SFNSLSQVPEVTIHFRGAD--VKLSRSNFF---------VKVSEDIVCSVFKGITNSVPI 393
            ++++  +P + +HF G +  V L + N+F         V+    + C +         +
Sbjct: 390 YYDTVVNIPSLVLHFVGNESSVVLPKKNYFYDFLDGGDGVRRKRRVGCLMLMNGGEEAEL 449

Query: 394 -------YGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                   GN  Q  F V YD+EQ+ V F    C 
Sbjct: 450 TGGPGATLGNYQQHGFEVVYDLEQRRVGFARRKCA 484


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 156/367 (42%), Gaps = 42/367 (11%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           L    IG  P +     DTGSD +W  C     CP       +  L+DP  S T K +PC
Sbjct: 76  LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135

Query: 149 SSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
               C S      SG    ++C YS++YGDGS ++G+   + +T     G    +P    
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195

Query: 202 ITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSST 253
           + FGCG+   G  +S T     GI+G G  + S++SQ+    AGK    FS+CL  V+  
Sbjct: 196 VIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAA--AGKVKRVFSHCLDTVNGG 253

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VID 304
            I F    +V  P V +TPL      Y + +  I V    + + T DI         +ID
Sbjct: 254 GI-FAIGEVVQ-PKVKTTPLVPRMAHYNVVLKDIEVAGDPIQLPT-DIFDSTSGRGTIID 310

Query: 305 SGTTLTFLPQGYNSNLLS---VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-R 360
           SGTTL +LP      LL       S +E   V D           +     P V   F  
Sbjct: 311 SGTTLAYLPVSIYDQLLEKTLAQRSGMELYLVEDQFTCFHYSDEKSLDDAFPTVKFTFEE 370

Query: 361 GADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVS 414
           G  +     ++     ED+ C  ++  T        + + G+++ TN L  YD++  ++ 
Sbjct: 371 GLTLTAYPHDYLFPFKEDMWCIGWQKSTAQTKDGKDLILLGDLVLTNKLFIYDLDNMSIG 430

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 431 WTDYNCS 437


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 104/397 (26%), Positives = 171/397 (43%), Gaps = 43/397 (10%)

Query: 42  PFYNSSETPYQRLRDAL-------TRSLNRLNHFNQNS-SISSSKASQADIIPNNANYLI 93
           PF+N  E P      +          SL   +H ++N  S+  + +    I    +N+L+
Sbjct: 130 PFHNQEEFPQTFSSSSSFKLKLYPAASLYNTHHQHKNYYSLDLNASLNPGITTGTSNFLV 189

Query: 94  RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           +I +G PP +   + D  +D  W QC+PC   +CY Q   +FDP  SS+Y  L C +  C
Sbjct: 190 QIGVGGPPQKFYMIFDLQTDFTWLQCQPCI--KCYDQPDSIFDPSQSSSYTLLSCETKHC 247

Query: 154 ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
             L   SCS    C+Y+++Y DG+ + G L  ETV+  S+      +  ++ GC   N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG----WVDRVSLGCSNKNQG 303

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP----VSSTKINFGT---NGIVSG 265
            F   + G  GLG G +S  S++    A   SYCLV      SS+ + F +   +G V  
Sbjct: 304 PF-VGSDGTFGLGRGSLSFPSRIN---ASSMSYCLVESKDGYSSSTLEFNSPPCSGSVKA 359

Query: 266 PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVID----------SGTTLTFLPQG 315
             ++  P  KA+  Y + +  I VG +++ V      ID          S + +T L   
Sbjct: 360 K-LLQNP--KAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSSLITMLEND 416

Query: 316 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK---LSRSNFF 372
             + +     +  +           + CY+ +S + V    + F   D K   L + ++ 
Sbjct: 417 TYNVVRDAFVAKTQHLERLKAFLQFDTCYNLSSNNTVELPILEFEVNDGKSWLLPKESYL 476

Query: 373 VKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
             V ++   C  F     S  I G + Q    V +D+
Sbjct: 477 YAVDKNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 164/380 (43%), Gaps = 73/380 (19%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +I + IGTPP  +  V DTGS L W QC  +  PP     +    FDP +SS++ +LPCS
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127

Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
              C       +L     S   C YS  Y DG+F+ GNL  E +T  +T       P + 
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG-- 261
            GC T      +S   GI+G+  G +S +SQ + +   KFSYC +P  S +  F   G  
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIS---KFSYC-IPPKSNRPGFTPTGSF 234

Query: 262 -IVSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD--- 300
            +   P           TF             Y + +  I  G ++L +S     PD   
Sbjct: 235 YLGDNPNSHGFKYVSLLTFPESQRMPNLDPLAYTVPMIGIRFGLKKLNISGSVFRPDAGG 294

Query: 301 ---IVIDSGTTLTFL-PQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVP 353
               ++DSG+  T L    Y+     +M+ +   ++   V    G+ ++C+  N ++ +P
Sbjct: 295 SGQTMVDSGSEFTHLVDAAYDKVRAEIMTRVGRRLKKGYVYG--GTADMCFDGN-VAMIP 351

Query: 354 E-----VTIHFRGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNF 402
                 V +  RG ++ + +    V V   I C      S+    +N   I GN+ Q N 
Sbjct: 352 RLIGDLVFVFTRGVEIFVPKERVLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNL 408

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            V +D+  + V F   DC++
Sbjct: 409 WVEFDVTNRRVGFAKADCSR 428


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           K  QA +   N  +L++++IG P     A+ DTGSDL WTQC PC  S CY Q +P++DP
Sbjct: 8   KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPC--SDCYKQPTPIYDP 65

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            +SSTY ++ C SS C +L   +C    C+Y  +YGD S + G L+ ET TL S +    
Sbjct: 66  SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121

Query: 198 ALPGITFGCGTNNGG 212
            +P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score =  112 bits (281), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 118/446 (26%), Positives = 196/446 (43%), Gaps = 49/446 (10%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELI---HRDSPKSPFYNSSETPYQRLRDALTR 60
            +S + IL F+  Y  S  +    G    +I   +  SPKS  +          R A+  
Sbjct: 8   LISAIVILSFVTIYSSSASQIPNRGVRRPMIFPLYFASPKSSGH----------RQAIEG 57

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S  R +  +      +++    D + +N  Y  R+ IGTPP E   + DTGS + +  C 
Sbjct: 58  SYWRRHLKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCS 117

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFSN 179
            C    C     P F P  SSTY  + C+    C         GVNC Y   Y + S S+
Sbjct: 118 DC--EHCGKHQDPRFQPDESSTYHPVKCNMDCNCDH------DGVNCVYERRYAEMSSSS 169

Query: 180 GNLATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM-- 235
           G L  + ++ G+   Q+  +P    FGC     G L++ +  GI+GLG G +S++ Q+  
Sbjct: 170 GVLGEDIISFGN---QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVD 226

Query: 236 RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQ-- 292
           +  I   FS C   +          GI   P +V +     ++ +Y + +  I V  +  
Sbjct: 227 KNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPL 286

Query: 293 RLGVSTPD----IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS- 345
           +L  ST D     V+DSGTT  +LP + + +   +++      + +  P  +  ++C+S 
Sbjct: 287 KLSPSTFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSG 346

Query: 346 ----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGN 396
                + LS+  PEV + F  G  + L+  N+     KV       +F+   +S  + G 
Sbjct: 347 AGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN-GDSTTLLGG 405

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDCTK 422
           I+  N LV YD E + + F  T+C++
Sbjct: 406 IIVRNTLVTYDRENEKIGFWKTNCSE 431


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 106/388 (27%), Positives = 171/388 (44%), Gaps = 69/388 (17%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  +N +  + +++G+PP     V DTGS+L W  C+     +    +S +F+P  S TY
Sbjct: 62  LFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHCK-----KTQFLNS-VFNPLSSKTY 115

Query: 144 KSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             +PC S  C +  +      SC     C   VSY D +   GNLA ET  LGS T    
Sbjct: 116 SKVPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--- 172

Query: 198 ALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK 254
             P   FGC   G ++    +SKTTG++G+  G +S ++QM      KFSYC+    S  
Sbjct: 173 --PATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQMGYP---KFSYCISGFDSAG 227

Query: 255 INFGTNGIVSGPGV----------VSTPLTK-AKTFYVLTIDAISVGNQRL----GVSTP 299
           +    N   S P +          +STPL    +  Y + ++ I V N+ L     V  P
Sbjct: 228 VLLLGNA--SFPWLKPLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVP 285

Query: 300 D------IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYS 345
           D       ++DSGT  TFL         +  LS    +++   + D      G+++LCY 
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLGPVYTALKNEFLSQTRGILKV--LNDDNFVFQGAMDLCYL 343

Query: 346 FNS----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVP 392
            +S    L  +P V++ F+GA++ +S      +V       + + C  F     +     
Sbjct: 344 LDSSRPNLQNLPVVSLMFQGAEMSVSGERLLYRVPGEVRGRDSVWCFTFGNSDLLGVEAF 403

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + G+  Q N  + +D+E+  +      C
Sbjct: 404 VIGHHHQQNVWMEFDLEKSRIGLADVRC 431


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 169 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 228

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 229 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 280

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
                                         F    +V  P ++        T Y++ +  
Sbjct: 281 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 303

Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           I VG +RL V         V+DS   +T L P  Y +  L+  S+M     VA     L+
Sbjct: 304 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 363

Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
            CY F   +   VP V++ F G  V          V  D +  + +G    VP       
Sbjct: 364 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 413

Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              GN+ Q    V YD+   +V F+   C
Sbjct: 414 GFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 151/347 (43%), Gaps = 63/347 (18%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 88  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTPP   + + DTGS + WTQC+ C    C       F+   SSTY S  C   
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSASSTYSSGSCIPG 186

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST-KINFGTNG--------- 261
           G F S   G++GLG G +S +SQ  +     FSYCL    S   + FG            
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQTASKFNKVFSYCLPEEDSIGSLLFGEKATSQSSSLKF 291

Query: 262 --IVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV-----STPDIVIDSGTTLTFLPQ 314
             +V+GPG +     +   +Y + +  ISVGN+RL +     ++P  +IDS T +T LPQ
Sbjct: 292 TSLVNGPGTL-----QESGYYFVNLSDISVGNERLNIPSSVFASPGTIIDSRTVITRLPQ 346

Query: 315 GYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQVPEVTI 357
              S L +     +   P+++        L+ CY+       PE+TI
Sbjct: 347 RAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYN-XXXXXXPELTI 392


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 262

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
                                         F    +V  P ++        T Y++ +  
Sbjct: 263 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 285

Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           I VG +RL V         V+DS   +T L P  Y +  L+  S+M     VA     L+
Sbjct: 286 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 345

Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
            CY F   +   VP V++ F G  V          V  D +  + +G    VP       
Sbjct: 346 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 395

Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              GN+ Q    V YD+   +V F+   C
Sbjct: 396 GFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 115/420 (27%), Positives = 171/420 (40%), Gaps = 87/420 (20%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNA----NY----------LIRISIGTPPTERLAVADT 110
           L+  ++NS  SSS ASQ    PN      NY          ++ + IGTPP  +  V DT
Sbjct: 38  LSSHSKNSLFSSSLASQFKQNPNTKTTSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDT 97

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGV 164
           GS L W QC+  PP          FDP +SS++  LPC+ S C       +L        
Sbjct: 98  GSQLSWIQCK-VPPK----TPPTAFDPLLSSSFSVLPCNHSLCKPRVPDYTLPTSCDQNR 152

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  Y DG+++ GNL  E  T  S+       P +  GC T+     +S T GI+G+
Sbjct: 153 LCHYSYFYADGTYAEGNLVREKFTFSSSQ----TTPPLILGCATD-----SSDTQGILGM 203

Query: 225 GGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF----- 279
             G +S  S  + +   KFSYC+ P  S   +  T     GP   S              
Sbjct: 204 NLGRLSFSSLAKIS---KFSYCVPPRRSQSGSSPTGSFYLGPNPSSAGFKYVNLMTYRQS 260

Query: 280 ----------YVLTIDAISVGNQRLGVSTP----------DIVIDSGTTLTFLPQGYNSN 319
                     Y L +  I +  ++L +ST             +IDSGT  TFL     S 
Sbjct: 261 QRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPSGAGQTLIDSGTWFTFLVDEAYSK 320

Query: 320 LLSVMSSMIEAQPVADPT--------GSLELCYSFNSL---SQVPEVTIHFR-GADVKLS 367
           +        E   +A P         GSL++C+  +++     +  +   F  G ++ + 
Sbjct: 321 VKE------EIVKLAGPKLKKGYVYGGSLDMCFDGDAMVIGRMIGNMAFEFENGVEIVVE 374

Query: 368 RSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           R      V   + C     S   G+ ++  I GN  Q +  V +D+  + V F  TDC++
Sbjct: 375 REKMLADVGGGVQCLGIGRSDLLGVASN--IIGNFHQQDLWVEFDLVGRRVGFGRTDCSR 432


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  112 bits (280), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 92/329 (27%), Positives = 132/329 (40%), Gaps = 72/329 (21%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTM---- 262

Query: 227 GDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA 286
                                         F    +V  P ++        T Y++ +  
Sbjct: 263 ------------------------------FARTPLVRNPSII-------PTLYLVRLRG 285

Query: 287 ISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
           I VG +RL V         V+DS   +T L P  Y +  L+  S+M     VA     L+
Sbjct: 286 IEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYPRVAGGRAGLD 345

Query: 342 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------- 392
            CY F   +   VP V++ F G  V          V  D +  + +G    VP       
Sbjct: 346 TCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCLAFVPTPGDFAL 395

Query: 393 -IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              GN+ Q    V YD+   +V F+   C
Sbjct: 396 GFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 185/425 (43%), Gaps = 44/425 (10%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQR--LRDALTRSLNRLNHFNQNSSISSSKA----S 80
            G ++++ H   P SP    +  P     L D  +R  +RL + +  ++   ++A    +
Sbjct: 40  AGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPIA 99

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
               +     Y++R  +GTPP + L   DT +D  W  C  C  + C    +P FDP  S
Sbjct: 100 SGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAAS 157

Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           ++Y+S+PC S  CA     +C   G  C +S++Y D S     L+ +++ +    G AV 
Sbjct: 158 TSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVA---GDAVK 213

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---- 254
               TFGC     G   +   G++GLG G +S +SQ R    G FSYCL    S      
Sbjct: 214 T--YTFGCLQKATGT-AAPPQGLLGLGRGPLSFLSQTRDMYQGTFSYCLPSFKSLNFSGT 270

Query: 255 INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
           +  G NG    P + +TPL       + Y + +  I VG + + +  P +          
Sbjct: 271 LRLGRNG--QPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGT 328

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
           V+DSGT  T L       +   +   + A PV+   G  + C++  +++  P VT+ F G
Sbjct: 329 VLDSGTMFTRLVAPAYVAVRDEVRRRVGA-PVSS-LGGFDTCFNTTAVAW-PPVTLLFDG 385

Query: 362 ADVKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
             V L   N  +      +S   + +   G+   + +  ++ Q N  V +D+    V F 
Sbjct: 386 MQVTLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFA 445

Query: 417 PTDCT 421
              CT
Sbjct: 446 RERCT 450


>gi|297794561|ref|XP_002865165.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297795163|ref|XP_002865466.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311000|gb|EFH41424.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311301|gb|EFH41725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 134

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 77/138 (55%), Gaps = 18/138 (13%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           + + C F+ FF                +VELIH DSP SP YN   T    L  A  RS+
Sbjct: 7   SLVDCDFLFFF----------NDWENLTVELIHSDSPHSPLYNPHHTVSDGLNAAFLRSI 56

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   FN  + +      Q+ +I N   Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 57  SRSRRFNTKTDL------QSGLISNGGEYFMSISIGTPPSKVLAIADTGSDLTWVQCKPC 110

Query: 123 PPSQCYMQDSPLFDPKMS 140
              QCY Q+SPLFD K+S
Sbjct: 111 --QQCYKQNSPLFDKKIS 126


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score =  112 bits (279), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 96/351 (27%), Positives = 151/351 (43%), Gaps = 32/351 (9%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLPCSSS 151
           IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + CS+ 
Sbjct: 3   ISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVGCSTE 61

Query: 152 QCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++    F
Sbjct: 62  ACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SIDNFIF 117

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM-RTTIAGKFSYCLVPVSSTKINFGTNGIV 263
           GCG +N  L+N    GI+G G    S  +Q+ + T    FSYC       + +       
Sbjct: 118 GCGEDN--LYNGVNAGIIGFGTKSYSFFNQVCQQTDYTAFSYCFPRDHENEGSLTIGPYA 175

Query: 264 SGPGVVSTPLT--KAKTFYVLTIDAISVGNQRLGVS-----TPDIVIDSGTTLTFLPQGY 316
               ++ T L     K  Y +    + V   RL +      +   ++DSGT  T++    
Sbjct: 176 RDINLMWTKLIYYDHKPAYAIQQLDMMVNGIRLEIDPYIYISKMTIVDSGTADTYILSPV 235

Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRGADVKLSRSNFF 372
              L   M+  ++A+          +C+  NS     +  P V +    + +KL   N F
Sbjct: 236 FDALDKAMTKEMQAKGYTRGWDERRICFISNSGSANWNDFPTVEMKLIRSTLKLPVENAF 295

Query: 373 VKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            + S +++CS F         V + GN    +F + +DI+     FK   C
Sbjct: 296 YESSNNVICSTFLPDDAGVRGVQMLGNRAVRSFKLVFDIQAMNFGFKARAC 346


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 171/366 (46%), Gaps = 40/366 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +GTPP +     DTGSD++W     C  CP +         FDP  S T   + 
Sbjct: 52  YYTRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLIS 111

Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS  +C+   Q S   CS  N  C Y+  YGDGS ++G   ++ +   +  G +V   + 
Sbjct: 112 CSDQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSS 171

Query: 200 PGITFGC-GTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
             I FGC     G L  S     GI G G  D+S++SQ+ +  I+ + FS+CL    S  
Sbjct: 172 APIVFGCSALQTGDLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHCLKGDDSGG 231

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
                  IV  P +V TPL  ++  Y L + +ISV  Q L +        S+   +IDSG
Sbjct: 232 GILVLGEIVE-PNIVYTPLVPSQPHYNLNMQSISVNGQTLAIDPSVFGTSSSQGTIIDSG 290

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY----SFNSLSQVPEVTIHFR- 360
           TTL +L +      +S ++S++   P   P  S    CY    S N +   P+V+++F  
Sbjct: 291 TTLAYLAEAAYDPFISAITSIVS--PSVRPYLSKGNHCYLISSSINDI--FPQVSLNFAG 346

Query: 361 GADVKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           GA + L   ++ ++ S      + C  F+ I    + I G+++  + +  YDI  Q + +
Sbjct: 347 GASMILIPQDYLIQQSSIGGAALWCIGFQKIQGQGITILGDLVLKDKIFVYDIANQRIGW 406

Query: 416 KPTDCT 421
              DC+
Sbjct: 407 ANYDCS 412


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 103/367 (28%), Positives = 171/367 (46%), Gaps = 41/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W  C     CP S         FD   SS+   + 
Sbjct: 79  YFTKVKLGTPPMEFTVQIDTGSDILWVNCNSCNGCPRSSGLGIQLNFFDASSSSSSSLVS 138

Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS   C S  Q + +        C Y+  YGDGS ++G   +E++      GQ++   + 
Sbjct: 139 CSDPICNSAFQTTATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQSMIANSS 198

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTK 254
             + FGC T   G     +    GI G G GD+S+ISQ+  R      FS+CL      +
Sbjct: 199 ASVVFGCSTYQSGDLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL----KGE 254

Query: 255 INFG---TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------V 302
            N G     G V  PG+V +PL  ++  Y L + +ISV  Q L +  P +         +
Sbjct: 255 GNGGGILVLGEVLEPGIVYSPLVPSQPHYNLYLQSISVNGQTLPID-PSVFATSINRGTI 313

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHFR 360
           IDSGTTL +L +   +  +S +++ + +Q V         CY  + S+ ++ P V+++F 
Sbjct: 314 IDSGTTLAYLVEEAYTPFVSAITAAV-SQSVTPTISKGNQCYLVSTSVGEIFPLVSLNFA 372

Query: 361 G-ADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           G A + L    + + +       + C  F+ +   V I G+++  + +  YD+ +Q + +
Sbjct: 373 GSASMVLKPEEYLMHLGFYDGAALWCIGFQKVQEGVTILGDLVMKDKIFVYDLARQRIGW 432

Query: 416 KPTDCTK 422
              DC++
Sbjct: 433 ASYDCSQ 439


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 83/278 (29%), Positives = 131/278 (47%), Gaps = 23/278 (8%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C   +  C Y + Y D   S G L  ++  L  T 
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171

Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
           G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + +V       TP+ ++  + +Y     ++  G++ LGV    +V DSG
Sbjct: 231 LRGGGFLFFGDD-LVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 289

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 344
           ++ T+        L++ +   +      +P  SL LC+
Sbjct: 290 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCW 327


>gi|115475303|ref|NP_001061248.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|45735815|dbj|BAD12851.1| unknown protein [Oryza sativa Japonica Group]
 gi|113623217|dbj|BAF23162.1| Os08g0207800 [Oryza sativa Japonica Group]
 gi|125602549|gb|EAZ41874.1| hypothetical protein OsJ_26419 [Oryza sativa Japonica Group]
          Length = 449

 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 97/383 (25%), Positives = 161/383 (42%), Gaps = 55/383 (14%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           I  +  YL  + IG    ++  + DTGS L+WTQC+ CP   C++ D P +    S T++
Sbjct: 76  IYEDVVYLAEMEIGERQQKQYLLIDTGSSLVWTQCDECP--HCHIGDVPPYGRSQSRTFQ 133

Query: 145 SLPCSSSQCASLNQK--------------SCSGVNCQYSVSY---GDGSFSNGNLATETV 187
            + C         +                C    C +   Y   G G    G ++ +T 
Sbjct: 134 EVSCGDDDDNDKEEAIASYCPAKPPGYITLCVNGRCMFKALYNLTGQGETVQGYMSMDTF 193

Query: 188 T-LGSTTGQAVALPGITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIAGKFS 244
             +        A   + FGC      +  +  + TGI+GLG GD S + Q   T   KFS
Sbjct: 194 HFIDDRRFDYQAKFRMVFGCAHQENIVLTAVKECTGILGLGMGDASFLRQTGIT---KFS 250

Query: 245 YCLVP-------VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
           YC+ P          + + FG++  +SG  V   PL      Y L + AI+     L   
Sbjct: 251 YCVPPRMPGYSYRRHSWLRFGSHAQISGKKV---PLVMRWGKYYLPLTAITYTYNELMSP 307

Query: 298 TP-----------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELCYS 345
            P            +++D+GT+L  LP   + +L+  M ++I+++ + +  T   + CY 
Sbjct: 308 VPIIAYKSQEDYLHMMVDTGTSLLSLPTSLHDDLIKEMEAIIKSENIMEGATRWPKHCYK 367

Query: 346 FNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSED---IVCSVFKGITN-SVPIYGNIM 398
             ++ +V ++T+      G D++L  S  F+K        VC     + + S  I G   
Sbjct: 368 -RTMDEVKDITVTLSFDGGLDIELFTSALFIKTETTKGPAVCLAVNRVDDSSKAILGMFA 426

Query: 399 QTNFLVGYDIEQQTVSFKPTDCT 421
           QTN  VGYD+  + ++  P  C 
Sbjct: 427 QTNINVGYDLLSREIAMDPIRCA 449


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 86/266 (32%), Positives = 126/266 (47%), Gaps = 30/266 (11%)

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C Y+++YGDGSF+ G L  E +  G+     + +    FGCG NN GLF    +G++GLG
Sbjct: 76  CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 129

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVV---STPLTKAK----- 277
             D+SLISQ      G FSYCL    ST+     + I+ G   V   S+P++ AK     
Sbjct: 130 RSDLSLISQTSGIFGGVFSYCL---PSTERKGSGSLILGGNSSVYRNSSPISYAKMIENP 186

Query: 278 ---TFYVLTIDAISVGNQRL---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
               FY + +  IS+G   L    V    I++DSGT +T LP      L +         
Sbjct: 187 QLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIYKALKAEFLKQFTGF 246

Query: 332 PVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFFVKVSEDIVCSVFKG 386
           P A     L+ C++ ++  +V  P + +HF G     V ++   +FVK     VC     
Sbjct: 247 PPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYFVKSDASQVCLALAS 306

Query: 387 IT--NSVPIYGNIMQTNFLVGYDIEQ 410
           +   + V I GN  Q N  V YD ++
Sbjct: 307 LEYQDEVAILGNYQQKNLRVIYDTKE 332


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score =  111 bits (277), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 99/363 (27%), Positives = 155/363 (42%), Gaps = 57/363 (15%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G P  +     DTGSD++W     C+ CP          L+DP  S +   + 
Sbjct: 27  YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C S    L       + CQY+V YGDGS + G   ++ V     TG     ++  
Sbjct: 87  CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNG 146

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
            +TFGCG    G   +    + G               I G F++CL  V+   I F   
Sbjct: 147 TVTFGCGAQQSGGLGTSGEALDG---------------ILGAFAHCLDNVNGGGI-FAIG 190

Query: 261 GIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--------DIVIDSGTTLTFL 312
            +VS P V +TP+   +  Y + +  I VG   L + T           +IDSGTTL +L
Sbjct: 191 ELVS-PKVNTTPMVPNQAHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSGTTLAYL 249

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLE------LC--YSFNSLSQVPEVTIHFRGA-D 363
           P+        V  SM+       P  SL       +C  YS N     P++  HF+ +  
Sbjct: 250 PE-------VVYDSMMNEIRSQQPGLSLHTVEEQFICFKYSGNVDDGFPDIKFHFKDSLT 302

Query: 364 VKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           + +   ++  ++SEDI C  ++  G+       + + G+++ +N LV YDIE Q + +  
Sbjct: 303 LTVYPHDYLFQISEDIWCFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTE 362

Query: 418 TDC 420
            +C
Sbjct: 363 YNC 365


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score =  111 bits (277), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 120/459 (26%), Positives = 189/459 (41%), Gaps = 91/459 (19%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA 89
           ++ L H  + + PF    +  YQ+L   +T SL R  H     +  ++  +      +  
Sbjct: 10  TIPLQHPQTNQIPF----QDQYQKLNHLVTTSLARARHLKNPQTTPATTTTAPLFSHSYG 65

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPL------FDPKMS 140
            Y + +S GTPP     + DTGSD++W  C     C    C    S        F PK S
Sbjct: 66  GYSVSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLC--KHCSFSSSSPSSRIQPFIPKES 123

Query: 141 STYKSLPCSSSQCASLNQ-----------KSCSGVNC-QYSVSYGDGSFSNGNLATETVT 188
           S+ K L C + +C+ ++            KSC    C  Y + YG G+ + G   +ET+ 
Sbjct: 124 SSSKLLGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGT-TGGVALSETLH 182

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCL 247
           L      +++ P    GC      +F+S +  GI G G G  SL SQ+     GKFSYCL
Sbjct: 183 L-----HSLSKPNFLVGC-----SVFSSHQPAGIAGFGRGLSSLPSQLGL---GKFSYCL 229

Query: 248 ----------------VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF---YVLTIDAIS 288
                           + +     +  TN +V  P V +  +    +F   Y L +  I+
Sbjct: 230 LSHRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRIT 289

Query: 289 VGNQRLGV----------STPDIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVA 334
           VG   + V              ++IDSGTT TF+     +  +   +  +      + + 
Sbjct: 290 VGGHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIE 349

Query: 335 DPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSV 391
           D  G L  C++ +    V  PE+ ++F+ GADV L   N+F  V  ++ C     +T+ V
Sbjct: 350 DAIG-LRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTV--VTDGV 406

Query: 392 P----------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                      I GN    NF V YD+  + + FK   C
Sbjct: 407 AGPERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 108/385 (28%), Positives = 176/385 (45%), Gaps = 66/385 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  I I++GTPP     V DTGS+L W  C     +       P F+P +SS+Y  +
Sbjct: 62  HNVSLTISITVGTPPQNMSMVIDTGSELSWLHCN---TNTTATIPYPFFNPNISSSYTPI 118

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            CSS  C +  +      SC   N C  ++SY D S S GNLA++T   GS+       P
Sbjct: 119 SCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-----P 173

Query: 201 GITFGC-----GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKI 255
           GI FGC      TN+    +S TTG++G+  G +SL+SQ++     KFSYC+     + I
Sbjct: 174 GIVFGCMNSSYSTNSES--DSNTTGLMGMNLGSLSLVSQLKIP---KFSYCISGSDFSGI 228

Query: 256 ------NFGTNGIVSGPGVV--STPLTK-AKTFYVLTIDAISVGNQRLGVS----TPD-- 300
                 NF   G ++   +V  STPL    ++ Y + ++ I + ++ L +S     PD  
Sbjct: 229 LLLGESNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHT 288

Query: 301 ----IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTG----SLELCYSF-- 346
                + D GT  ++L            L+  +  + A  + DP      +++LCY    
Sbjct: 289 GAGQTMFDLGTQFSYLLGPVYNALRDEFLNQTNGTLRA--LDDPNFVFQIAMDLCYRVPV 346

Query: 347 --NSLSQVPEVTIHFRGADVK------LSRSNFFVKVSEDIVCSVFKG---ITNSVPIYG 395
             + L ++P V++ F GA+++      L R   FV  ++ + C  F     +     I G
Sbjct: 347 NQSELPELPSVSLVFEGAEMRVFGDQLLYRVPGFVWGNDSVYCFTFGNSDLLGVEAFIIG 406

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDC 420
           +  Q +  + +D+ +  V      C
Sbjct: 407 HHHQQSMWMEFDLVEHRVGLAHARC 431


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/391 (26%), Positives = 176/391 (45%), Gaps = 49/391 (12%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            R  H +++    +++    D +  N  Y  R+ IGTPP     + DTGS + +  C  C
Sbjct: 53  RRQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC 112

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P +SSTY+ + C+   C   N +    + C Y   Y + S S+G L
Sbjct: 113 --EQCGRHQDPKFQPDLSSTYQPVKCTLD-CNCDNDR----MQCVYERQYAEMSTSSGVL 165

Query: 183 ATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
             + V+ G+ +   +A     FGC     G L++    GI+GLG GD+S++ Q+  +  +
Sbjct: 166 GEDVVSFGNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVV 223

Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT------FYVLTIDAISVGNQR 293
           +  FS C        ++ G   +V G     + +  A++      +Y + +  I V  +R
Sbjct: 224 SDSFSLCY-----GGMDVGGGAMVLGGISPPSDMVFAQSDPVRSPYYNIDLKEIHVAGKR 278

Query: 294 LGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVMSSMI-EAQPVADPTGSL----E 341
           L ++ P +       V+DSGTT  +LP+      L+   +++ E Q  +  +G      +
Sbjct: 279 LPLN-PSVFDGKHGSVLDSGTTYAYLPE---EAFLAFKEAIVKELQSFSQISGPDPNYND 334

Query: 342 LCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSV 391
           LC+S      + LS+  P V + F  G    LS  N+     KV       +F+   +  
Sbjct: 335 LCFSGAGIDVSQLSKTFPVVDMIFGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPT 394

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
            + G I+  N LV YD EQ  + F  T+C +
Sbjct: 395 TLLGGIVVRNTLVLYDREQTKIGFWKTNCAE 425


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 120/479 (25%), Positives = 195/479 (40%), Gaps = 98/479 (20%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A      +EL H D+      N   T  +R+R A  R+ +R    + +++ ++   +   
Sbjct: 18  AGGAALRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPL 70

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLF 135
                  Y+    IG PP    AV DTGSDL+WTQC  C            C+ Q+ P +
Sbjct: 71  RWSGKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYY 130

Query: 136 DPKMSSTYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATE 185
           +  +S T +++PC          + + A   +   SG + C  + SYG G  + G L T+
Sbjct: 131 NFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTD 189

Query: 186 TVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
             T  S++   +A     FGC +    + G  N   +GI+GLG G +SL+SQ+  T   +
Sbjct: 190 AFTFPSSSSVTLA-----FGCVSQTRISPGALNG-ASGIIGLGRGALSLVSQLNAT---E 240

Query: 243 FSYCLVP-----VSSTKINFGTNGIVSGPG-----------VVSTPLTKA------KTFY 280
           FSYCL P     VS + +  G   +                V + P  K        TFY
Sbjct: 241 FSYCLTPYFRDTVSPSHLFVGDGELAGLSAAAGGGGGGGAPVTTVPFAKNPKDSPFSTFY 300

Query: 281 VLTIDAISVGNQRLGV---------STPDI-----VIDSGTTLTFLPQGYNSNLLSVMSS 326
            L +  ++ GN  + +         + P +     +IDSG+  T L    +  L   ++ 
Sbjct: 301 YLPLVGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELAR 360

Query: 327 MIEAQ-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSRSN 370
            +        P A   G+LELC          + + VP + + F      G ++ +    
Sbjct: 361 QLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPAEK 420

Query: 371 FFVKVSEDIVCSVFKG--------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           ++ +V     C              TN   I GN MQ +  V YD+    +SF+P +C+
Sbjct: 421 YWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 479


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 92/311 (29%), Positives = 150/311 (48%), Gaps = 43/311 (13%)

Query: 146 LPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI-- 202
           + C+ + C+ +   SC   + C Y  +YGDG+ + G  ATE  T  S+ G  +    +  
Sbjct: 1   MRCAGTLCSDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPL 60

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-------- 254
            FGCG+ N G  N+  +GIVG G   +SL+SQ+      +FSYCL   +S +        
Sbjct: 61  GFGCGSVNVGSLNNG-SGIVGFGRNPLSLVSQLSIR---RFSYCLTSYASRRQSTLLFGS 116

Query: 255 INFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGVS------TPD----I 301
           ++ G  G  +G  V +TPL ++    TFY +    ++VG +RL +        PD    +
Sbjct: 117 LSDGVYGDATGR-VQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGV 175

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCY---------SFNSLS 350
           ++DSGT LT LP    + ++      +   P A+  G+ E  +C+         S  S  
Sbjct: 176 IVDSGTALTLLPAAVLAEVVRAFRQQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQM 233

Query: 351 QVPEVTIHFRGADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
            VP + +HF+GAD+ L R N+ +       +C +     +     GN++Q +  V YD+E
Sbjct: 234 PVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLE 293

Query: 410 QQTVSFKPTDC 420
            +T+S  P  C
Sbjct: 294 AETLSIAPARC 304


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 128/263 (48%), Gaps = 26/263 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSST 253
              I FGC  +  G     +    GI G G   +S++SQ+ +  ++ K FS+CL   S  
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL-KGSDN 269

Query: 254 KINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDS 305
                  G +  PG+V TPL  ++  Y L +++I V  Q+L +        +T   ++DS
Sbjct: 270 GGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIVVNGQKLPIDSSLFTTSNTQGTIVDS 329

Query: 306 GTTLTFLPQGYNSNLLSVMSSMI 328
           GTTL +L  G     ++ +++ +
Sbjct: 330 GTTLAYLADGAYDPFVNAITAAV 352


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score =  110 bits (276), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 106/383 (27%), Positives = 170/383 (44%), Gaps = 63/383 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++GTPP     V DTGS+L W  C+           + +F+P +SS+Y  +
Sbjct: 66  HNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKK------QQNINSVFNPHLSSSYTPI 119

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC S  C +  +      SC   N C  +VSY D +   GNLA++T  + S +GQ    P
Sbjct: 120 PCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAI-SGSGQ----P 174

Query: 201 GITFG---CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF 257
           GI FG    G ++    +SKTTG++G+  G +S ++QM      KFSYC+    ++ +  
Sbjct: 175 GIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQMGFP---KFSYCISGKDASGVLL 231

Query: 258 GTNGIVSGPGVVS-TPLTKAKT--------FYVLTIDAISVGNQRLGVS----TPD---- 300
             +      G +  TPL K  T         Y + +  I VG++ L V      PD    
Sbjct: 232 FGDATFKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGA 291

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSFNS---L 349
              ++DSGT  TFL     + L +   +        + DP     G+++LC+       +
Sbjct: 292 GQTMVDSGTRFTFLLGSVYTALRNEFVAQTRGVLTLLEDPNFVFEGAMDLCFRVRRGGVV 351

Query: 350 SQVPEVTIHFRGADVKLSRSNFFVKV---------SEDIVCSVFKG---ITNSVPIYGNI 397
             VP VT+ F GA++ +S      +V         + D+ C  F     +     + G+ 
Sbjct: 352 PAVPAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHH 411

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q N  + +D+    V F  T C
Sbjct: 412 HQQNVWMEFDLVNSRVGFADTKC 434


>gi|224115494|ref|XP_002332148.1| predicted protein [Populus trichocarpa]
 gi|222875198|gb|EEF12329.1| predicted protein [Populus trichocarpa]
          Length = 483

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 174/398 (43%), Gaps = 74/398 (18%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
           YLI +SIGTPP       DTGSDL W  C      C     Y  +               
Sbjct: 80  YLISLSIGTPPQVIQVYMDTGSDLTWAPCGNISFDCIECDNYRNNRMMASFSPSHSSSSH 139

Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
                SP      SS     PC+ + C  ++L + +CS     ++ +YG G    G L  
Sbjct: 140 RDSCTSPFCIDVHSSDNPLDPCTMAGCSLSTLVKATCSWPCPPFAYTYGAGGVVTGTLTR 199

Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           +T+ + G   G    +P   FGC  ++      +  GI G G G +SL SQ+     G F
Sbjct: 200 DTLRVHGRNLGVTQEIPRFCFGCVASS----YREPIGIAGFGRGALSLPSQLGFLRKG-F 254

Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN-- 291
           S+C +       P  S+ +  G   + S   +  TP+ K+     +Y + ++AI+VGN  
Sbjct: 255 SHCFLAFKYANNPNISSPLIIGDIALTSKDDMQFTPMLKSPMYPNYYYVGLEAITVGNVS 314

Query: 292 ---------QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD---PTGS 339
                    +   +    +++DSGTT T LP+ + S +LSV+ S+I      D    TG 
Sbjct: 315 ATEVPSSLREFDSLGNGGMLVDSGTTYTHLPEPFYSQVLSVLQSIINYPRATDMEMRTG- 373

Query: 340 LELCYSF----NSL---SQVPEVTIHF-RGADVKLSRSNFFVKVSED-----IVCSVFKG 386
            +LCY      NS+     +P +T HF   A + LSR + F  +S       + C +F+ 
Sbjct: 374 FDLCYKVPCQNNSILTGDLLPSITFHFLNNASLVLSRGSHFYAMSAPSNSTVVKCLLFQS 433

Query: 387 ITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + +       + G+  Q +  V YD+E++ + F+P DC
Sbjct: 434 MDDGDYGPAGVLGSFQQQDVEVVYDMEKERIGFRPMDC 471


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/414 (26%), Positives = 186/414 (44%), Gaps = 63/414 (15%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           P   SS  P  R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP
Sbjct: 34  PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
            E   + DTGS + +  C  C   QC     P F P++S++Y++L C+   C   ++   
Sbjct: 87  QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
            G  C Y   Y + S S+G L+ + ++ G+ +   ++     FGC     G LF+ +  G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197

Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVV---S 270
           I+GLG G +S++ Q+  +  I   FS C        +  G   +V G     PG+V   S
Sbjct: 198 IMGLGRGKLSVVDQLVDKGVIEDVFSLCY-----GGMEVGGGAMVLGKISPPPGMVFSHS 252

Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSV 323
            P      +Y + +  + V  + L ++ P +       V+DSGTT  + P+      +++
Sbjct: 253 DPFRSP--YYNIDLKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAI 306

Query: 324 MSSMIEAQPV------ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSN 370
             ++I+  P        DP    ++C+S     ++++    PE+ + F  G  + LS  N
Sbjct: 307 KDAVIKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPEN 365

Query: 371 FF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +     KV       +F    +S  + G I+  N LV YD E   + F  T+C+
Sbjct: 366 YLFRHTKVRGAYCLGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/414 (26%), Positives = 186/414 (44%), Gaps = 63/414 (15%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           P   SS  P  R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP
Sbjct: 34  PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
            E   + DTGS + +  C  C   QC     P F P++S++Y++L C+   C   ++   
Sbjct: 87  QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
            G  C Y   Y + S S+G L+ + ++ G+ +   ++     FGC     G LF+ +  G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197

Query: 221 IVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSG-----PGVV---S 270
           I+GLG G +S++ Q+  +  I   FS C        +  G   +V G     PG+V   S
Sbjct: 198 IMGLGRGKLSVVDQLVDKGVIEDVFSLCY-----GGMEVGGGAMVLGKISPPPGMVFSHS 252

Query: 271 TPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSV 323
            P      +Y + +  + V  + L ++ P +       V+DSGTT  + P+      +++
Sbjct: 253 DPFRSP--YYNIDLKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAI 306

Query: 324 MSSMIEAQPV------ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSN 370
             ++I+  P        DP    ++C+S     ++++    PE+ + F  G  + LS  N
Sbjct: 307 KDAVIKEIPSLKRIHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIAMEFGNGQKLILSPEN 365

Query: 371 FF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +     KV       +F    +S  + G I+  N LV YD E   + F  T+C+
Sbjct: 366 YLFRHTKVRGAYCLGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 418


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 155/356 (43%), Gaps = 45/356 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   Y+    IGTPP +     D  SDL+WT C    P          F+P  S+T   +
Sbjct: 96  NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGIT 203
           PC+   C     ++C      C Y+  YG G+  + G L TE  T G T      + G+ 
Sbjct: 146 PCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----IDGVV 200

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----INFGT 259
           FGCG  N G F S  +G++GLG G++SL+SQ++     +FSY   P  S      I FG 
Sbjct: 201 FGCGLKNVGDF-SGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFILFGD 256

Query: 260 NGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV--STPDIVIDSGT------- 307
           +        +ST L  +    + Y + +  I V  + L +   T D+    G+       
Sbjct: 257 DATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSI 316

Query: 308 --TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGAD 363
              +T L +     L   ++S I    V      L+LCY+  SL  ++VP + + F G  
Sbjct: 317 TDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGA 376

Query: 364 V-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           V +L   N F++  +  + C ++         + G+++Q    + YDI    + F+
Sbjct: 377 VMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFE 432


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/362 (27%), Positives = 167/362 (46%), Gaps = 31/362 (8%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC     P F P +SST
Sbjct: 83  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQDPRFQPDLSST 140

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           Y  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  +   +     
Sbjct: 141 YSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKES--ELKPQRA 193

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGT 259
            FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+  FS C   +         
Sbjct: 194 VFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253

Query: 260 NGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIVIDSGTTLTFL 312
            G+ + P +V +     ++ +Y + +  I V  + L +      S    V+DSGTT  +L
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313

Query: 313 P-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGAD 363
           P Q + +   +V + +   + +  P  +  ++C++      + LS+V P+V + F  G  
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQK 373

Query: 364 VKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   + + F  T+C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433

Query: 421 TK 422
           ++
Sbjct: 434 SE 435


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 112/394 (28%), Positives = 177/394 (44%), Gaps = 53/394 (13%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 91  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 261

Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
           Q+      ++ K FSYCL P   TK  +   G      +    TPL ++  +  Y LT++
Sbjct: 262 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 320

Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
            +    QRL  S+ ++++DSG        +T   L +         GY+    +   S I
Sbjct: 321 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 380

Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
                 D +G       F++ S +P + I F  GA + LS  N F       +C  F + 
Sbjct: 381 CYLSEHDYSGWNGTITPFSNWSALPPLEIGFAGGAALALSPRNVFYNDPHRGLCMTFAQN 440

Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 I GN +  +F   +DI+ +   FK   C
Sbjct: 441 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 474


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 99/364 (27%), Positives = 167/364 (45%), Gaps = 45/364 (12%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P+ SSTY+ + 
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 166

Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+   C      +C G  + C Y   Y + S S+G L  + ++ G+ +   +A     FG
Sbjct: 167 CTID-C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFG 217

Query: 206 C-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGI 262
           C     G L++    GI+GLG GD+S++ Q+  +  I+  FS C        ++ G   +
Sbjct: 218 CENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCY-----GGMDVGGGAM 272

Query: 263 VSGPGVVSTPLTKAKT------FYVLTIDAISVGNQRLGVST------PDIVIDSGTTLT 310
           V G     + +T A +      +Y + +  + V  +RL ++          V+DSGTT  
Sbjct: 273 VLGGISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYA 332

Query: 311 FLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RG 361
           +LP+  + +   +++  +   + ++ P  +  ++C+S   N +SQ+    P V + F  G
Sbjct: 333 YLPEAAFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNG 392

Query: 362 ADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
               LS  N+     KV       +F+   +   + G I+  N LV YD EQ  + F  T
Sbjct: 393 HKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKT 452

Query: 419 DCTK 422
           +C +
Sbjct: 453 NCAE 456


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 164/363 (45%), Gaps = 39/363 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
            +Y + ++IG PP       DTGSDL W QC+  P   C      L+ PK +     +PC
Sbjct: 66  GHYSVILNIGNPPKAFDLDIDTGSDLTWVQCD-APCKGCTKPLDKLYKPKNN----RVPC 120

Query: 149 SSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +SS C ++   +C      C Y V Y D   S G L ++   L    G  +  P I FGC
Sbjct: 121 ASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAFGC 179

Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTNG 261
           G +    G      T GI+GLG G  S++SQ+RT         +C   V+   + FG + 
Sbjct: 180 GYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDH- 238

Query: 262 IVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
           ++   G+  TP+ +  + T Y      +  G +  G+    ++ DSG++ T+       +
Sbjct: 239 LLPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQS 298

Query: 320 LLSVMSSMIEAQPVAD--PTGSLELCYS--------FNSLSQVPEVTIHF---RGADVKL 366
           +L+++   +   P+ D     +L +C+          +  S    +TI+F   +   ++L
Sbjct: 299 ILNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKNVQLQL 358

Query: 367 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
           +  ++ +   +  VC    GI N       ++ + G+I   + +V YD E+Q + + PT+
Sbjct: 359 APEDYLIITKDGNVCL---GILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTN 415

Query: 420 CTK 422
           C +
Sbjct: 416 CNR 418


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 110/351 (31%), Positives = 153/351 (43%), Gaps = 30/351 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPL----FDPKMSSTYK 144
           Y   +SIGTP    L   DTGSDL W  CE   CP       +       +    SST  
Sbjct: 104 YYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDNGKFWLNHYSSNASSTSI 163

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALP-GI 202
            +PCSSS C   NQ S +  +C Y   Y  + S S G L  + + + +   Q   +   +
Sbjct: 164 RVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKPVDVKV 223

Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDIS----LISQMRTTIAGKFSYCLVPVSSTKIN 256
           T GCG    G F++ T   G++GLG G +S    L SQ  TT    FS C       +I+
Sbjct: 224 TLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTT--DSFSMCFGYYGYGRID 281

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGY 316
           FG  G V   G   TP   A   Y +TI  I V N+   V    I IDSG + T+L   +
Sbjct: 282 FGDIGPV---GQRETPFNPASLSYNVTILQIIVTNRPTNVHLTAI-IDSGASFTYLTDPF 337

Query: 317 NSNLLSVMSSMIEAQPV-ADPTGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFV 373
            S +   M + +E + + +D     E CY  S  ++ Q P +     G   K      +V
Sbjct: 338 YSIITENMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGR-KFDVITSYV 396

Query: 374 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI----EQQTVSFKPTDC 420
            V  D   ++   I  S  I  N++  NF  GY +    E+ T+ +K  DC
Sbjct: 397 SVDTDDGPALCLAIVKSTDI--NVIGHNFFGGYRVVFNREKMTLGWKEVDC 445


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 119/423 (28%), Positives = 204/423 (48%), Gaps = 73/423 (17%)

Query: 44  YNSSETPY--QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           Y++   P+  + +   L + L    +  Q   +    AS A         +I I++GTP 
Sbjct: 44  YSAKSRPWVSKLVAGFLKKQLRNRGNKQQQQQLGGEAASGA-----APPLVINITVGTPV 98

Query: 102 TERLA-VADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
            + ++ + D  S  +W QC PC        PP+         F P  S+T+  LPCSS  
Sbjct: 99  AQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATA-------FRPNGSATFSPLPCSSDM 151

Query: 153 CASLNQKSC----------SGVNCQ-YSVSYGDGSFSN--GNLATETVTLGSTTGQAVAL 199
           C  + +++C          +G  C  YS++YG GS +N  G LAT+T T G+T     A+
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYG-GSAANTSGYLATDTFTFGAT-----AV 205

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
           PG+ FGC   + G F +  +G++G+G G++SLISQ++    GKFSY L+   +T      
Sbjct: 206 PGVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSAD 261

Query: 255 --INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----------ST 298
             I FG + +       STPL   T    FY + +  + V   RL              T
Sbjct: 262 SVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGT 321

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLS--QVPE 354
             +++ S T +T+L Q     + + ++S I   P  + + +LE  LCY+ +S++  +VP+
Sbjct: 322 GGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPK 380

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           +T+ F  GAD+ LS +N+F   ++  +  +    +    + G ++QT   + YD++   +
Sbjct: 381 LTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRL 440

Query: 414 SFK 416
           +F+
Sbjct: 441 TFE 443


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  109 bits (273), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 111/415 (26%), Positives = 177/415 (42%), Gaps = 44/415 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H +SP SPF   +   ++     L +   RL + +  +   S   +    I  +  
Sbjct: 34  LRVFHVNSPCSPFKQPNTVSWE---STLLKDKARLQYLSSLAKKPSVPIASGRAIVQSPT 90

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C +
Sbjct: 91  YIVRANIGTPAQPMLVALDTSNDAAWVPCSGC----VGCASSVLFDPSKSSSSRNLQCDA 146

Query: 151 SQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
            QC      +C +G +C ++++YG GS    +L  +T+TL +       +   TFGC + 
Sbjct: 147 PQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG-- 267
             G  +    G++GLG G +SLISQ +      FSYCL   +S   NF +  +  GP   
Sbjct: 201 ATGT-SLPAQGLMGLGRGPLSLISQTQNLYMSTFSYCL--PNSKSSNF-SGSLRLGPKYQ 256

Query: 268 ---VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTLTF 311
              + +TPL K     + Y + +  I VGN+ + + T  +          + DSGT  T 
Sbjct: 257 PVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVFTR 316

Query: 312 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 371
           L +     + +     I+    A   G  + CYS + +   P VT  F G +V L   N 
Sbjct: 317 LVEPAYVAVRNEFRRRIK-NANATSLGGFDTCYSGSVV--YPSVTFMFAGMNVTLPPDNL 373

Query: 372 FVKVSE-DIVCSVFKGITNSV----PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +  S     C       N+V     +  ++ Q N  V  D+    +      CT
Sbjct: 374 LIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETCT 428


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 119/423 (28%), Positives = 204/423 (48%), Gaps = 73/423 (17%)

Query: 44  YNSSETPY--QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           Y++   P+  + +   L + L    +  Q   +    AS A         +I I++GTP 
Sbjct: 44  YSAKSRPWVSKLVAGFLKKQLRNRGNKQQQQQLGGEAASGA-----APPLVINITVGTPV 98

Query: 102 TERLA-VADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
            + ++ + D  S  +W QC PC        PP+         F P  S+T+  LPCSS  
Sbjct: 99  AQTVSGLVDITSYFVWAQCAPCAAAAGCLPPPATA-------FRPNGSATFSPLPCSSDM 151

Query: 153 CASLNQKSC----------SGVNCQ-YSVSYGDGSFSN--GNLATETVTLGSTTGQAVAL 199
           C  + +++C          +G  C  YS++YG GS +N  G LAT+T T G+T     A+
Sbjct: 152 CLPVLRETCGRAGAAANATAGARCDSYSLTYG-GSAANTSGYLATDTFTFGAT-----AV 205

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----- 254
           PG+ FGC   + G F +  +G++G+G G++SLISQ++    GKFSY L+   +T      
Sbjct: 206 PGVVFGCSDASYGDF-AGASGVIGIGRGNLSLISQLQF---GKFSYQLLAPEATDDGSAD 261

Query: 255 --INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGV-----------ST 298
             I FG + +       STPL   T    FY + +  + V   RL              T
Sbjct: 262 SVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTGVRVDGNRLDAIPAGTFDLRANGT 321

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLS--QVPE 354
             +++ S T +T+L Q     + + ++S I   P  + + +LE  LCY+ +S++  +VP+
Sbjct: 322 GGVILSSTTPVTYLEQAAYDVVRAAVASRI-GLPAVNGSAALELDLCYNASSMAKVKVPK 380

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           +T+ F  GAD+ LS +N+F   ++  +  +    +    + G ++QT   + YD++   +
Sbjct: 381 LTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPSQGGSVLGTLLQTGTNMIYDVDAGRL 440

Query: 414 SFK 416
           +F+
Sbjct: 441 TFE 443


>gi|413924530|gb|AFW64462.1| hypothetical protein ZEAMMB73_591827, partial [Zea mays]
          Length = 469

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
           Y   + +GTP T  L   DTGSDL W  C+   C P   Y     +D  ++ P  S+T +
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C S+   +     C Y++ Y  + + S+G L  +T+ L            + 
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++GLG  DIS+ S +     +   FS C    SS +I FG 
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+ S       PL      Y + +D   +G++ L  ++   ++DSGT+ T LP      
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
                   + A  V     + + CYS + L    VP +T+ F  AD  L   N  +  ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394

Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
               +       + ++ PI   I+  NFLVGY    D E   + +  ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 101/369 (27%), Positives = 170/369 (46%), Gaps = 37/369 (10%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 44  GDVYPT-GHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCD-APCQSCNKVPHPLYKPTKN- 100

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K      C Y + Y D + S G L T+  TL      
Sbjct: 101 --KLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
           +V  P  TFGCG +      G+  + T G++GLG G +SL+SQ++     K    +CL  
Sbjct: 159 SVR-PSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLST 217

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSGT 307
                + FG N +V        P+ ++ +  +Y      +    + LGV   ++V DSG+
Sbjct: 218 NGGGFLFFGDN-VVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276

Query: 308 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTI 357
           T T F  Q Y + + ++ + + ++ Q V+DP  SL LC+     F S+S V      + +
Sbjct: 277 TYTYFAAQPYQATVSALKAGLSKSLQQVSDP--SLPLCWKGQKVFKSVSDVKNDFKSLFL 334

Query: 358 HF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 413
            F + + +++   N+ +       C  +  G    +   I G+I   + L+ YD E+  +
Sbjct: 335 SFVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQDQLIIYDNERGQL 394

Query: 414 SFKPTDCTK 422
            +    C++
Sbjct: 395 GWIRGSCSR 403


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 96/371 (25%), Positives = 158/371 (42%), Gaps = 50/371 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP +     DTGSD++W    +C  CP       D  L+DPK S T + + 
Sbjct: 70  YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G    + C YS++YGDGS + G    + +T           P   
Sbjct: 130 CDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNS 189

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G  +S +     GI+G G  + S++SQ+  +  +   FS+CL  +    
Sbjct: 190 SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCLDNIRGGG 249

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDS 305
           I F    +V  P V +TPL      Y + + +I V    L + + DI         +IDS
Sbjct: 250 I-FAIGEVVE-PKVSTTPLVPRMAHYNVVLKSIEVDTDILQLPS-DIFDSGNGKGTIIDS 306

Query: 306 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC--------YSFNSLSQVPEVTI 357
           GTTL +LP         V   +I       P   L L         Y+ N     P V +
Sbjct: 307 GTTLAYLPA-------IVYDELIPKVMARQPRLKLYLVEQQFSCFQYTGNVDRGFPVVKL 359

Query: 358 HFRGA-DVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQ 410
           HF  +  + +   ++  +  + I C  ++           + + G+++ +N LV YD+E 
Sbjct: 360 HFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYDLEN 419

Query: 411 QTVSFKPTDCT 421
             + +   +C+
Sbjct: 420 MAIGWTDYNCS 430


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 102/353 (28%), Positives = 162/353 (45%), Gaps = 65/353 (18%)

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
           R  + DTGSDLIWTQC                  K+SS+  +     S   S    + +G
Sbjct: 53  RKLIVDTGSDLIWTQC------------------KLSSSTAAAARHGSPPLSRTAPARTG 94

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
               ++ +    + + G LA+ET T G+   +AV+L  + FGCG  + G      TGI+G
Sbjct: 95  A---FTRTCTASAAAVGVLASETFTFGAR--RAVSLR-LGFGCGALSAGSLIG-ATGILG 147

Query: 224 LGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN---FG---------TNGIVSGPGVVST 271
           L    +SLI+Q++     +FSYCL P +  K +   FG         T   +    +VS 
Sbjct: 148 LSPESLSLITQLKIQ---RFSYCLTPFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSN 204

Query: 272 PLTKAKTFYVLTIDAISVGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLL 321
           P+     +Y + +  IS+G++RL V        PD     ++DSG+T+ +L +     + 
Sbjct: 205 PVET--VYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVK 262

Query: 322 SVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEVTIHFRG-ADVKLSRSNF 371
             +  ++   PVA+ T    ELC+     +        QVP + +HF G A + L R N+
Sbjct: 263 EAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNY 321

Query: 372 FVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           F +    ++C      T+   V I GN+ Q N  V +D++    SF PT C +
Sbjct: 322 FQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQCDQ 374


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 113/398 (28%), Positives = 176/398 (44%), Gaps = 61/398 (15%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259

Query: 234 QMRTTIAG--------KFSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYV 281
           Q+    AG         FSYCL P   TK  +   G      +    TPL ++  +  Y 
Sbjct: 260 QL----AGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYS 314

Query: 282 LTIDAISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVM 324
           LT++ +    QRL  S+ ++++DSG        +T   L +         GY+    +  
Sbjct: 315 LTMEMLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ 374

Query: 325 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSV 383
            S I      D +G       F++ S +P + I F  GA + LS  N F       +C  
Sbjct: 375 ESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALSPRNVFYNDPHRGLCMT 434

Query: 384 F-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F +       I GN +  +F   +DI+ +   FK   C
Sbjct: 435 FAQNPALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|195619700|gb|ACG31680.1| pepsin A [Zea mays]
          Length = 485

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
           Y   + +GTP T  L   DTGSDL W  C+   C P   Y     +D  ++ P  S+T +
Sbjct: 66  YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 125

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C S+   +     C Y++ Y  + + S+G L  +T+ L            + 
Sbjct: 126 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 185

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++GLG  DIS+ S +     +   FS C    SS +I FG 
Sbjct: 186 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 245

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+ S       PL      Y + +D   +G++ L  ++   ++DSGT+ T LP      
Sbjct: 246 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPLDVYKA 305

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
                   + A  V     + + CYS + L    VP +T+ F  AD  L   N  +  ++
Sbjct: 306 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 364

Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
               +       + ++ PI   I+  NFLVGY    D E   + +  ++C
Sbjct: 365 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 412


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 103/371 (27%), Positives = 161/371 (43%), Gaps = 49/371 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G+P  +     DTGSD++W    +C  CP          L+DPK S T + + 
Sbjct: 69  YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C  + C+S  +     C   N C YS+SYGDGS + G    + +T     G    A    
Sbjct: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNS 188

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            I FGCG    G F S +     GI+G G  + S++SQ+  +  +   FS+CL     T 
Sbjct: 189 SIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL----DTN 244

Query: 255 INFG--TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI--------VID 304
           +  G  + G V  P V +TPL      Y + +  I V    L + +           VID
Sbjct: 245 VGGGIFSIGEVVEPKVKTTPLVPNMAHYNVILKNIEVDGDILQLPSDTFDSENGKGTVID 304

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQP------VADPTGSLELCYSFNSLSQVPEVTIH 358
           SGTTL +LP+     L   MS ++  QP      V +     +  Y+ N  S  P V +H
Sbjct: 305 SGTTLAYLPRIVYDQL---MSKVLAKQPRLKVYLVEEQYSCFQ--YTGNVDSGFPIVKLH 359

Query: 359 FRGA-DVKLSRSNFFVKVSEDIVCSVFKGITNS-------VPIYGNIMQTNFLVGYDIEQ 410
           F  +  + +   ++      D    +    + S       + + G+ + +N LV YD+E 
Sbjct: 360 FEDSLSLTVYPHDYLFNYKGDSYWCIGWQKSASETKNGKDMTLLGDFVLSNKLVVYDLEN 419

Query: 411 QTVSFKPTDCT 421
            T+ +   +C+
Sbjct: 420 MTIGWTDYNCS 430


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 94/393 (23%), Positives = 157/393 (39%), Gaps = 68/393 (17%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP---------------- 133
           YL+ +  GTP      V DT +DL W  C       + Y + S                 
Sbjct: 140 YLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRQSSKTMSVGGDDDVVAALA 199

Query: 134 -------LFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNL 182
                   + P  SS+++ + CS  QCA L   +C       +C Y     DG+ + G  
Sbjct: 200 KKEARKNWYRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIGIY 259

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
             E  T+  + G+   LPG+  GC     G       G++ LG G +S          G+
Sbjct: 260 GNEKATVTVSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGGR 319

Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
           FS+CL+  +S++     + FG N  V GPG + T +      K  Y   + A+ VG +RL
Sbjct: 320 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERL 379

Query: 295 GVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
            +  PD            +++D+ T++T  +P+ Y   L++ +   +   P     G  E
Sbjct: 380 DI--PDDVWNIDKGLGSGVILDTSTSVTSLVPEAYEP-LVAALDRHLAHLPRESFAG-FE 435

Query: 342 LCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGI-T 388
            CY +              +P+VT+   G   +L   ++S    +V   + C  F+ +  
Sbjct: 436 YCYRWTFTGDGVDPAHNVTIPKVTVEMTGG-ARLEPEAKSVVMPEVGHGVACLAFRKLPW 494

Query: 389 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              P I GN++   ++   D  + T  F+   C
Sbjct: 495 GGGPCIIGNVLMQEYIWEIDHSKATFRFRKDKC 527


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 98/360 (27%), Positives = 155/360 (43%), Gaps = 49/360 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   Y+    IGTPP +     D  SDL+WT C    P          F+P  S+T   +
Sbjct: 96  NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145

Query: 147 PCSSSQCASLNQKSCSG------VNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVAL 199
           PC+   C     ++C          C Y+  YG G+  + G L TE  T G T      +
Sbjct: 146 PCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----I 200

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK----I 255
            G+ FGCG  N G F S  +G++GLG G++SL+SQ++     +FSY   P  S      I
Sbjct: 201 DGVVFGCGLQNVGDF-SGVSGVIGLGRGNLSLVSQLQVD---RFSYHFAPDDSVDTQSFI 256

Query: 256 NFGTNGIVSGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGV--STPDIVIDSGT--- 307
            FG +        +ST L  +    + Y + +  I V  + L +   T D+    G+   
Sbjct: 257 LFGDDATPQTSHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGV 316

Query: 308 ------TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHF 359
                  +T L +     L   ++S I    V      L+LCY+  SL  ++VP + + F
Sbjct: 317 FLSITDLVTVLEEAAYKPLRQAVASKIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVF 376

Query: 360 RGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
            G  V +L   N F++  +  + C ++         + G+++Q    + YDI    + F+
Sbjct: 377 AGGAVMELELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVFE 436


>gi|226495123|ref|NP_001141522.1| uncharacterized protein LOC100273634 precursor [Zea mays]
 gi|194704920|gb|ACF86544.1| unknown [Zea mays]
 gi|223949445|gb|ACN28806.1| unknown [Zea mays]
 gi|413924531|gb|AFW64463.1| pepsin A [Zea mays]
          Length = 515

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 97/350 (27%), Positives = 150/350 (42%), Gaps = 23/350 (6%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
           Y   + +GTP T  L   DTGSDL W  C+   C P   Y     +D  ++ P  S+T +
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C S+   +     C Y++ Y  + + S+G L  +T+ L            + 
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++GLG  DIS+ S +     +   FS C    SS +I FG 
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLGLGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+ S       PL      Y + +D   +G++ L  ++   ++DSGT+ T LP      
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
                   + A  V     + + CYS + L    VP +T+ F  AD  L   N  +  ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394

Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
               +       + ++ PI   I+  NFLVGY    D E   + +  ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 171/364 (46%), Gaps = 36/364 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G PP +     DTGSD++W  C     CP +         FDP  S+T   + 
Sbjct: 83  YYTRVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVS 142

Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVAL 199
           CS   CA   Q S S        C Y   YGDGS ++G    + + L     ++  + + 
Sbjct: 143 CSDQICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSS 202

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
             + FGC T+  G     +    GI G G  D+S+ISQ+ +  IA K FS+CL    S  
Sbjct: 203 ASVVFGCSTSQTGDLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHCLKGDDSGG 262

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
                  IV  P VV TPL  ++  Y L + +ISV  Q L +        S+   +IDSG
Sbjct: 263 GILVLGEIVE-PNVVYTPLVPSQPHYNLNLQSISVNGQVLPISPAVFATSSSQGTIIDSG 321

Query: 307 TTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQV-PEVTIHFR-GA 362
           TTL +L  + YN+ +++V +  I +Q           CY + +S+S + P+V+++F  GA
Sbjct: 322 TTLAYLAEEAYNAFVVAVTN--IVSQSTQSVVLKGNRCYVTSSSVSDIFPQVSLNFAGGA 379

Query: 363 DVKLSRSNFFVKVSE----DIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
            + L   ++ ++ +      + C  F+ I    + I G+++  + +  YD+  Q + +  
Sbjct: 380 SLVLGAQDYLIQQNSVGGTTVWCIGFQKIPGQGITILGDLVLKDKIFIYDLANQRIGWTN 439

Query: 418 TDCT 421
            DC+
Sbjct: 440 YDCS 443


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 165/363 (45%), Gaps = 35/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W     C  CP S         FDP  SST   + 
Sbjct: 68  YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 127

Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
           CS  +C+   Q S   CS  G  C Y+  YGDGS ++G   ++ +   +  G +V  +  
Sbjct: 128 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 187

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
            I FGC  +  G     +    GI G G  D+S+ISQM +  I  K FS+CL        
Sbjct: 188 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGG 247

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDSG 306
                 IV    +V +PL  ++  Y L + +ISV  + L +  P++         ++DSG
Sbjct: 248 ILVLGEIVE-EDIVYSPLVPSQPHYNLNLQSISVNGKSLAID-PEVFATSTNRGTIVDSG 305

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-D 363
           TTL +L +      +S ++  + +Q V         CY   S  +   P V+++F G   
Sbjct: 306 TTLAYLAEEAYDPFVSAITEAV-SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVS 364

Query: 364 VKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           + L   ++ ++ +      + C  F+ I    + I G+++  + +  YD+  Q + +   
Sbjct: 365 MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANY 424

Query: 419 DCT 421
           DC+
Sbjct: 425 DCS 427


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 110/416 (26%), Positives = 181/416 (43%), Gaps = 58/416 (13%)

Query: 41  SPFYN-SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIR 94
           SPF    SE+    + D  ++   R+ +    SS+++ K   A I     + N  NY++R
Sbjct: 42  SPFTAPKSESWMNTVIDMASKDPARIRYL---SSLTAQKTVAAPIASGQQVLNVGNYVVR 98

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA 154
           + +GTP      V DT +D  W  C  C         +  F  + SST+ +L CS  +C 
Sbjct: 99  VQLGTPGQTMYMVLDTSNDAAWAPCSGC----IGCSSTTTFSAQNSSTFATLDCSKPECT 154

Query: 155 SLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                SC     V+C ++ +YG  S  +  L  +++ LG        +P  +FGC ++  
Sbjct: 155 QARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHLGPNV-----IPNFSFGCISSAS 209

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP----- 266
           G  +    G++GLG G +SLISQ  +  +G FSYCL    S K  + +  +  GP     
Sbjct: 210 G-SSIPPQGLMGLGRGPLSLISQSGSLYSGLFSYCL---PSFKSYYFSGSLKLGPVGQPK 265

Query: 267 GVVSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLT-F 311
            + +TPL       + Y + +  ISVG   + +S P++           +IDSGT +T F
Sbjct: 266 AIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPIS-PELLAFDPNTGAGTIIDSGTVITRF 324

Query: 312 LPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 369
           +P  Y     + +      Q      P G+ + C++ N+    P +T+H  G D+KL   
Sbjct: 325 VPAIY-----TAVRDEFRKQVGGSFSPLGAFDTCFATNNEVSAPAITLHLSGLDLKLPME 379

Query: 370 NFFVKVSE-DIVCSVFKGI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           N  +  S   + C           + V +  N+ Q N  + +DI    +      C
Sbjct: 380 NSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARELC 435


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 121/445 (27%), Positives = 183/445 (41%), Gaps = 81/445 (18%)

Query: 44  YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTE 103
           ++S   P+  L+ A + SL R +H    ++ S S A+      +   Y I +++GTPP  
Sbjct: 45  HSSDSDPFHSLKFAASASLTRAHHLKHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQT 104

Query: 104 RLAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCAS 155
              V DTGS L+W  C     C  S C   +      P F PK SST K L C + +C  
Sbjct: 105 SPFVLDTGSSLVWFPCTSRYLC--SHCNFPNIDTTKIPTFIPKNSSTAKLLGCRNPKCGY 162

Query: 156 L--------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           +                ++CS     Y + YG GS + G L  + +     T     +P 
Sbjct: 163 IFGSDVQFRCPQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFPGKT-----VPQ 216

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTK 254
              GC      L   + +GI G G G  SL SQM      +FSYCLV       P SS  
Sbjct: 217 FLVGCSI----LSIRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDDTPQSSDL 269

Query: 255 I-------NFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGNQRLGV-------- 296
           +       +  TNG+   P   S P T     K +Y LT+  + VG + + +        
Sbjct: 270 VLQISSTGDTKTNGLSYTP-FRSNPSTNNPAFKEYYYLTLRKVIVGGKDVKIPYTFLEPG 328

Query: 297 --STPDIVIDSGTTLTFLPQG-YN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 350
                  ++DSG+T TF+ +  YN      +  +         A+    L  C++ + + 
Sbjct: 329 SDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAETQSGLSPCFNISGVK 388

Query: 351 QV--PEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVP--------IYGNIM 398
            V  PE+T  F+ GA +     N+F  V + ++VC        + P        I GN  
Sbjct: 389 TVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGPPKTTGPAIILGNYQ 448

Query: 399 QTNFLVGYDIEQQTVSFKPTDCTKQ 423
           Q NF + YD+E +   F P  C ++
Sbjct: 449 QQNFYIEYDLENERFGFGPRSCRRK 473


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 165/363 (45%), Gaps = 35/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W     C  CP S         FDP  SST   + 
Sbjct: 83  YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 142

Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
           CS  +C+   Q S   CS  G  C Y+  YGDGS ++G   ++ +   +  G +V  +  
Sbjct: 143 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 202

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKI 255
            I FGC  +  G     +    GI G G  D+S+ISQM +  I  K FS+CL        
Sbjct: 203 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGG 262

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------VIDSG 306
                 IV    +V +PL  ++  Y L + +ISV  + L +  P++         ++DSG
Sbjct: 263 ILVLGEIVE-EDIVYSPLVPSQPHYNLNLQSISVNGKSLAID-PEVFATSTNRGTIVDSG 320

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGA-D 363
           TTL +L +      +S ++  + +Q V         CY   S  +   P V+++F G   
Sbjct: 321 TTLAYLAEEAYDPFVSAITEAV-SQSVRPLLSKGTQCYLITSSVKGIFPTVSLNFAGGVS 379

Query: 364 VKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
           + L   ++ ++ +      + C  F+ I    + I G+++  + +  YD+  Q + +   
Sbjct: 380 MNLKPEDYLLQQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVYDLAGQRIGWANY 439

Query: 419 DCT 421
           DC+
Sbjct: 440 DCS 442


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score =  108 bits (270), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 119/441 (26%), Positives = 189/441 (42%), Gaps = 88/441 (19%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTERLAVA 108
           P++ +   L+ SLNR  H     S S++      + P +   Y + ++ GTPP     + 
Sbjct: 90  PFKTINLLLSASLNRAQHLKTPQSKSNTSIQNVSLFPRSYGAYSVSLAFGTPPQNLSFIF 149

Query: 109 DTGSDLIWTQCEP---CPPSQC---YMQDSPL--FDPKMSSTYKSLPCSSSQCASL---- 156
           DTGS L+W  C     C  S+C   Y+  + +  F PK+SS+ K + C + +CA +    
Sbjct: 150 DTGSSLVWFPCTAGYRC--SRCSFPYVDPATISKFVPKLSSSVKVVGCRNPKCAWIFGPN 207

Query: 157 --------NQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
                   N KS  CS     Y + YG G+ + G L +ET+ L     +   +P    GC
Sbjct: 208 LKSRCRNCNSKSRKCSDSCPGYGLQYGSGA-TAGILLSETLDL-----ENKRVPDFLVGC 261

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTKI---- 255
                 +   +  GI G G G  SL SQMR     +FS+CLV       PVSS  +    
Sbjct: 262 SV----MSVHQPAGIAGFGRGPESLPSQMRLK---RFSHCLVSRGFDDSPVSSPLVLDSG 314

Query: 256 ----NFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVS----TPD----- 300
                  T   +  P   +  ++ A  + +Y L++  I +G + +        PD     
Sbjct: 315 SESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRILIGGKPVKFPYKYLVPDSTGNG 374

Query: 301 -IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-------LELCYSF---NSL 349
             +IDSG+T TFL    +  +   ++  +E Q V  P          L  C++       
Sbjct: 375 GAIIDSGSTFTFL----DKPIFEAIADELEKQLVKYPRAKDVEAQSGLRPCFNIPKEEES 430

Query: 350 SQVPEVTIHFR-GADVKLSRSNFFVKVS-EDIVC-------SVFKGITNSVPIYGNIMQT 400
           ++ P+V + F+ G  + L+  N+   V+ E +VC       +V  G      I G   Q 
Sbjct: 431 AEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAVVGGGGGPAIILGAFQQQ 490

Query: 401 NFLVGYDIEQQTVSFKPTDCT 421
           N LV YD+ +Q + F+   CT
Sbjct: 491 NVLVEYDLAKQRIGFRKQKCT 511


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 167/363 (46%), Gaps = 36/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP +     DTGSD++W   + C  CP S         FDP  S T   + 
Sbjct: 90  YYTRLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLIS 149

Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS  +C+   Q S   C+  N  C Y+  YGDGS ++G   ++ +   +  G +V   + 
Sbjct: 150 CSDQRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSS 209

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTK 254
             I FGC T   G     +    GI G G  D+S+ISQ+ +       FS+CL    S  
Sbjct: 210 APIVFGCSTLQTGDLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHCLKGDDSGG 269

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
                  IV  P +V TPL  ++  Y L + +I V  Q L +        S    +IDSG
Sbjct: 270 GILVLGEIVE-PNIVYTPLVPSQPHYNLNLQSIYVNGQTLAIDPSVFATSSNQGTIIDSG 328

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNSLSQV-PEVTIHFRGA- 362
           TTL +L +      +S ++S +   P   P  S    CY + +S++ V P+V+++F G  
Sbjct: 329 TTLAYLTEAAYDPFISAITSTVS--PSVSPYLSKGNQCYLTSSSINDVFPQVSLNFAGGT 386

Query: 363 DVKLSRSNFFVKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
            + L   ++ ++ S      + C  F+ I    + I G+++  + +  YDI  Q + +  
Sbjct: 387 SMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYDIAGQRIGWAN 446

Query: 418 TDC 420
            DC
Sbjct: 447 YDC 449


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 161/367 (43%), Gaps = 43/367 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +GTP  +     DTGSD++W  C     CP       +  L+ P  SST   + 
Sbjct: 74  YFAKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVT 133

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C+   C S       G      C+Y V+YGDGS + G    + V L   TG  Q  +  G
Sbjct: 134 CNQDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNG 193

Query: 202 -ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKI 255
            I FGCG    G   + +    GI+G G  + S+ISQ+ ++  +   F++CL  ++   I
Sbjct: 194 SIVFGCGAQQSGQLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGI 253

Query: 256 NFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST--------PDIVIDSGT 307
            F    +V  P V +TPL   +  Y + + AI V N+ L + T           +IDSGT
Sbjct: 254 -FAIGEVVQ-PKVRTTPLVPQQAHYNVFMKAIEVDNEVLNLPTDVFDTDLRKGTIIDSGT 311

Query: 308 TLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-D 363
           TL + P      L+S +    S ++   V +     E  Y  N     P VT HF  +  
Sbjct: 312 TLAYFPDVIYEPLISKIFARQSTLKLHTVEEQFTCFE--YDGNVDDGFPTVTFHFEDSLS 369

Query: 364 VKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVS 414
           + +    +   +  +  C    G  NS         + + G+++  N LV YD+E QT+ 
Sbjct: 370 LTVYPHEYLFDIDSNKWCV---GWQNSGAQSRDGKDMILLGDLVLQNRLVMYDLENQTIG 426

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 427 WTEYNCS 433


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score =  108 bits (269), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 154/367 (41%), Gaps = 44/367 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM-QDSPLFDPKMSSTYKSLPC 148
           +Y+ R  +GTPP   L   D  +D  W  C  C    C     SP FDP  SSTY+ + C
Sbjct: 99  SYVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRC 156

Query: 149 SSSQCASLNQKSCS-----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            + QCA +   + S     G +C +++SY   +  +  L  + ++L  + G AV     T
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYT 215

Query: 204 FGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN 260
           FGC    T +GG  +    G+VG G G +S +SQ + T    FSYCL    S+  NF + 
Sbjct: 216 FGCLRVVTGSGG--SVPPQGLVGFGRGPLSFLSQTKATYGSIFSYCLPSYKSS--NF-SG 270

Query: 261 GIVSGPG-----VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------- 301
            +  GP      + +TPL       + Y + +  + V  + + +    +           
Sbjct: 271 TLRLGPAGQPRRIKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGT 330

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 360
           ++D+GT  T L     + L +     + A P A   G  + CY  N    VP V   F  
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSA-PAAPALGGFDTCYYVNGTKSVPAVAFVFAG 389

Query: 361 GADVKLSRSNFFV-KVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           GA V L   N  +   S  + C         G+   + +  ++ Q N  V +D+    V 
Sbjct: 390 GARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVG 449

Query: 415 FKPTDCT 421
           F    CT
Sbjct: 450 FSRELCT 456


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 106/389 (27%), Positives = 160/389 (41%), Gaps = 27/389 (6%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
           +R  L R   RL    ++  +S SK     IIP   +    Y   + +GTP T  +   D
Sbjct: 170 VRSDLQRQKRRLGG-GKHQLLSFSK--DGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALD 226

Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
           TGSDL W  C+   C P   Y     +D  ++ P  S+T + LPCS   C   +  +   
Sbjct: 227 TGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQK 286

Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
             C Y+  Y  + + S+G L  + + L S    A     +  GCG    G  L      G
Sbjct: 287 QPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASVIIGCGRKQSGSYLDGIAPDG 346

Query: 221 IVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
           ++GLG  DIS+ S +     +   FS C     S +I FG  G+ +       PL     
Sbjct: 347 LLGLGMADISVPSFLARAGLVRNSFSMCFT-KDSGRIFFGDQGVSTQQSTPFVPLYGKLQ 405

Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
            Y + +D   VG++    ++   ++DSGT+ T LP      +       + A  +     
Sbjct: 406 TYTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEAT 465

Query: 339 SLELCYSFNSL--SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYG 395
           S + CYS + L    VP VT+ F G    +     F +   E  V      +  S    G
Sbjct: 466 SFDYCYSASPLVMPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIG 525

Query: 396 NIMQTNFLVGY----DIEQQTVSFKPTDC 420
            I Q NFL+GY    D E   + +  ++C
Sbjct: 526 IIAQ-NFLLGYHVVFDRENMKLGWYRSEC 553


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 165/370 (44%), Gaps = 47/370 (12%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F P +SS+
Sbjct: 81  DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSS 138

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           Y  + C+       ++K C+     Y   Y + S S+G L  + V+ G  +   +     
Sbjct: 139 YSPVKCNVDCTCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRA 191

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGT 259
            FGC  +  G LF+    GI+GLG G +S++ Q+  +  I+  FS C        ++ G 
Sbjct: 192 VFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCY-----GGMDIGG 246

Query: 260 NGIVSGPGV---------VSTPLTKAKTFYVLTIDAISVGNQRLGV------STPDIVID 304
             +V G GV          S PL     +Y + +  I V  + L V      S    V+D
Sbjct: 247 GAMVLG-GVPAPSDMVFSHSDPLRSP--YYNIELKEIHVAGKALRVDSRVFNSKHGTVLD 303

Query: 305 SGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVT 356
           SGTT  +LP Q + +   +V S +   + +  P  +  ++C++      + L +V P+V 
Sbjct: 304 SGTTYAYLPEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVD 363

Query: 357 IHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
           + F  G  + L+  N+     KV       VF+   +   + G I+  N LV YD   + 
Sbjct: 364 MVFGNGQKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEK 423

Query: 413 VSFKPTDCTK 422
           + F  T+C++
Sbjct: 424 IGFWKTNCSE 433


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 98/376 (26%), Positives = 160/376 (42%), Gaps = 64/376 (17%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ + IGTPP  +  + DTGS L W QC    P +     S +FDP +SS++  LPC+  
Sbjct: 83  LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRK--PPPSSVFDPSLSSSFSVLPCNHP 140

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L         C YS  Y DG+ + GNL  E +T      ++ + P +  G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITF----SRSQSTPPLILG 196

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
           C        +S   GI+G+  G +S  SQ + T   KFSYC VP    +  F   G   +
Sbjct: 197 CAEE-----SSDAKGILGMNLGRLSFASQAKLT---KFSYC-VPTRQVRPGFTPTGSFYL 247

Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----TPD----- 300
              P           TF             Y + +  I +GNQ+L +      PD     
Sbjct: 248 GENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPDPSGAG 307

Query: 301 -IVIDSGTTLTFL-PQGYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QV 352
             +IDSG+  T+L  + YN     ++ ++ + ++   V    G  ++C++ N++     +
Sbjct: 308 QTMIDSGSEFTYLVDEAYNKVREEVVRLVGARLKKGYVYG--GVSDMCFNGNAIEIGRLI 365

Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGY 406
             +   F +G ++ + +      V   + C     S   G  ++  I GN  Q N  V +
Sbjct: 366 GNMVFEFDKGVEIVVEKERVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNIWVEF 423

Query: 407 DIEQQTVSFKPTDCTK 422
           D+  + V F   DC++
Sbjct: 424 DLANRRVGFGKADCSR 439


>gi|297818124|ref|XP_002876945.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322783|gb|EFH53204.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 206

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 76/134 (56%), Gaps = 9/134 (6%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +F   +  L+   F   S I A     +VELIHRDSP SP YN   T    L     RS+
Sbjct: 70  SFFEVILHLYTAIFCFSSTI-ANRENLTVELIHRDSPHSPLYNPHHTVSDGLNATFLRSI 128

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   FN  + +      Q+ +I N   YL+ ISIGTPP++ LA+ADTGSDL W QC+P 
Sbjct: 129 SRSRRFNTKTDL------QSGLISNGGEYLMSISIGTPPSKVLAIADTGSDLTWVQCKPY 182

Query: 123 PPSQCYMQDSPLFD 136
              QCY Q+SPLFD
Sbjct: 183 --QQCYKQNSPLFD 194


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score =  107 bits (268), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 169/372 (45%), Gaps = 40/372 (10%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P   
Sbjct: 44  QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98

Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           +  + +PC+++ C +L      N K  S   C Y + Y D + S G L  ++ +L   + 
Sbjct: 99  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158

Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
                PG+TFGCG +      G   +   G++GLG G +SL+SQ++     K    +CL 
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + +V    V   P+ +  +  +Y      +    + LGV   ++V DSG
Sbjct: 217 TNGGGFLFFGDD-VVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275

Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVT 356
           +T T F  Q Y + + ++   + ++ + V+DPT  L LC+         F+  ++   + 
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMF 333

Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQ 410
           + F   + A +++   N+ +      VC  +  G     S  + G+I   + +V YD E+
Sbjct: 334 LSFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEK 393

Query: 411 QTVSFKPTDCTK 422
             + +    CT+
Sbjct: 394 SQLGWARGACTR 405


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 169/372 (45%), Gaps = 40/372 (10%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P   
Sbjct: 44  QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98

Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           +  + +PC+++ C +L      N K  S   C Y + Y D + S G L  ++ +L   + 
Sbjct: 99  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158

Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
                PG+TFGCG +      G   +   G++GLG G +SL+SQ++     K    +CL 
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKAKT--FYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + +V    V   P+ +  +  +Y      +    + LGV   ++V DSG
Sbjct: 217 TNGGGFLFFGDD-VVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275

Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVT 356
           +T T F  Q Y + + ++   + ++ + V+DPT  L LC+         F+  ++   + 
Sbjct: 276 STYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMF 333

Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQ 410
           + F   + A +++   N+ +      VC  +  G     S  + G+I   + +V YD E+
Sbjct: 334 LSFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEK 393

Query: 411 QTVSFKPTDCTK 422
             + +    CT+
Sbjct: 394 SQLGWARGACTR 405


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score =  107 bits (268), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 103/388 (26%), Positives = 171/388 (44%), Gaps = 67/388 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQC------EPCPPSQCYMQDSPLFDPKMS 140
           +N +  + +++GTPP     V DTGS+L W  C           +   M +S  F P+ S
Sbjct: 59  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGES--FRPRAS 116

Query: 141 STYKSLPCSSSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           +T+ ++PC S+QC+S +     SC G +  C  S+SY DGS S+G LAT+   +G     
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPL 176

Query: 196 AVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
             A     FGC +   +       T G++G+  G +S ++Q  T    +FSYC+      
Sbjct: 177 RSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTR---RFSYCISDRDDA 228

Query: 254 KINFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD- 300
            +    +  +    +  TPL +         +  Y + +  I VG + L     V  PD 
Sbjct: 229 GVLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDH 288

Query: 301 -----IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFN 347
                 ++DSGT  TFL         +  L     ++ A  + DP+     +L+ C+   
Sbjct: 289 TGAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRA--LDDPSFAFQEALDTCFRVP 346

Query: 348 S-----LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP---- 392
           +      +++P VT+ F GA++ ++      KV      ++ + C  F G  + VP    
Sbjct: 347 AGRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF-GNADMVPLTAY 405

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + G+  Q N  V YD+E+  V   P  C
Sbjct: 406 VIGHHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|168000300|ref|XP_001752854.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696017|gb|EDQ82358.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 525

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/351 (28%), Positives = 161/351 (45%), Gaps = 30/351 (8%)

Query: 95  ISIGTPPTERLAVADTGSDLIWT--QCEPCPPSQCYMQD---SPL--FDPKMSSTYKSLP 147
           I IGTP  + L V DTGSDL+W   +CE C P     +D   S L  + P +SST K + 
Sbjct: 115 IDIGTPNVQFLVVLDTGSDLLWIPCECESCAPLSAESKDPRTSQLNPYTPSLSSTAKPVL 174

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT--LGSTTGQAVALPGITFG 205
           CS   C   +        C Y ++Y   + S      E     +  + G  V LP +  G
Sbjct: 175 CSDPLCEMSSTCMAPTDQCPYEINYVSANTSTSGALYEDYMYFMRESGGNPVKLP-VYLG 233

Query: 206 CGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNG 261
           CG    G  L  +   G++GLG  DIS+ +++ +T  +A  FS C+ P  S  + FG  G
Sbjct: 234 CGKVQTGSLLKGAAPNGLMGLGTTDISVPNKLASTGQLADSFSLCISPGGSGTLTFGDEG 293

Query: 262 IVSGPGVVSTPLTKAKT----FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYN 317
             +     +TP+          Y++ ID+I+VGN  L +++   + D+GT+ T+L +   
Sbjct: 294 PAAQ---RTTPIIPKSVSMLDTYIVEIDSITVGNTNLLMAS-HALFDTGTSFTYLSKTVY 349

Query: 318 SNLLSVMSSMIEAQPVADPTGS-LELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 375
              +    + +      DP  S  +LCY + N+  QVP V++   G +  L   +    +
Sbjct: 350 PQFVQAYDAQMSLPKWNDPRFSKWDLCYQTSNTNFQVPVVSLALSGGN-SLDVVSGLKSI 408

Query: 376 SED-----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            +D      VC         + I G    TN+ + Y+  + T+ + P+DC+
Sbjct: 409 VDDNNAMIAVCVTVMDSGAGLSIIGQNFMTNYSITYNRAKMTIGWTPSDCS 459


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 117/441 (26%), Positives = 190/441 (43%), Gaps = 97/441 (21%)

Query: 45  NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTER 104
           N S+   Q+L   ++ SL R +H     +      S          Y I +S GTPP   
Sbjct: 38  NPSQDHLQKLNYLVSTSLARAHHLKNPQTTPVFSHSYG-------GYSISLSFGTPPQTL 90

Query: 105 LAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--- 158
             V DTGS  +W  C     C       + SP F PK SS+ K + C + +C+ ++Q   
Sbjct: 91  SFVMDTGSSFVWFPCTLRYLCNNCSFTSRISP-FLPKHSSSSKIIGCKNPKCSWIHQTDL 149

Query: 159 ---------KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
                    ++CS +   Y + YG G+ + G   +ET+ L       + +P    GC   
Sbjct: 150 RCTDCDNNSRNCSQICPPYLILYGSGT-TGGVALSETLHL-----HGLIVPNFLVGC--- 200

Query: 210 NGGLFNSKT-TGIVGLGGGDISLISQMRTTIAGKFSYCLV--------PVSSTKINFGTN 260
              +F+S+   GI G G G  SL SQ+  T   KFSYCL+          SS  ++  ++
Sbjct: 201 --SVFSSRQPAGIAGFGRGPSSLPSQLGLT---KFSYCLLSHKFDDTQESSSLVLDSQSD 255

Query: 261 GIVSGPGVVSTPLTKA---------KTFYVLTIDAISVGNQRLGVS----TPD------I 301
                  ++ TPL K            +Y +++  IS+G + + +     +PD       
Sbjct: 256 SDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIPYKYLSPDKDGNGGT 315

Query: 302 VIDSGTTLT-------------FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 348
           +IDSGTT T             F+ Q  N     ++ ++   +P  + +G+ EL      
Sbjct: 316 IIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCFNVSGAKEL------ 369

Query: 349 LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVP-------IYGNIMQ 399
             ++P++ +HF+ GADV+L   N+F  + S ++ C  F  +T+          I GN   
Sbjct: 370 --ELPQLRLHFKGGADVELPLENYFAFLGSREVAC--FTVVTDGAEKASGPGMILGNFQM 425

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            NF V YD++ + + FK   C
Sbjct: 426 QNFYVEYDLQNERLGFKKESC 446


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 162/370 (43%), Gaps = 50/370 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P  E     DTGSD++W  C P   CP S     +  LFD   SS+ + LP
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 148 CSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C+   CA++    +Q      +C YS  Y D S ++G   T+++      G+   A +  
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC     G     T    GI G G G+ S+ISQ+  R      FS+CL        
Sbjct: 204 TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL-------- 255

Query: 256 NFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------DI 301
             G NG   +V G    P +V +PL  ++  Y L + +I++  Q     T        + 
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGET 315

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQVPEVTIH 358
           +IDSGTTL +L +     ++SV++S +     A PT   GS     S +     P +  +
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQS--ATPTISRGSQCFRVSMSVADIFPVLRFN 373

Query: 359 FRGADVKLSRSNFFVKVSEDIVCSVFKGI--------TNSVPIYGNIMQTNFLVGYDIEQ 410
           F G    +     +++    + C  F  +         + + I G+++  + ++ YD+ Q
Sbjct: 374 FEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVYDLAQ 433

Query: 411 QTVSFKPTDC 420
           Q + +   DC
Sbjct: 434 QRIGWANYDC 443


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 170/383 (44%), Gaps = 32/383 (8%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 66  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P++SSTY+ + C +  C   + K      C Y   Y + S S G L
Sbjct: 126 --EQCGKHQDPKFQPELSSTYQPVKC-NMDCNCDDDKE----QCVYEREYAEHSSSKGVL 178

Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
             + ++ G+ +   +      FGC T   G L++ +  GI+GLG GD+SL+ Q+  +  I
Sbjct: 179 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 236

Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVST 298
           +  F  C   +     +    G      ++ T     ++ +Y + +  I V  ++L +++
Sbjct: 237 SNSFGLCYGGMDVGGGSMILGGFDYPSDMIFTDSDPDRSPYYNIDLTGIRVAGKKLSLNS 296

Query: 299 ------PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFN 347
                    V+DSGTT  +LP   + +   +VM  +   + +  P  + +    L  + N
Sbjct: 297 RVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSPLKQIDGPDPNFKDTCFLVAASN 356

Query: 348 SLSQV----PEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQ 399
            +S++    P V + F+ G    LS  N+     KV       VF    +   + G I+ 
Sbjct: 357 DVSELSKIFPSVEMIFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVV 416

Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
            N LV YD E   V F  T+C++
Sbjct: 417 RNTLVVYDRENSKVGFWRTNCSE 439


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 163/365 (44%), Gaps = 47/365 (12%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P+ SSTY+ + 
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 138

Query: 148 CS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C+    C S        + C Y   Y + S S+G L  + ++ G+ +   +A     FGC
Sbjct: 139 CTIDCNCDS------DRMQCVYERQYAEMSTSSGVLGEDLISFGNQS--ELAPQRAVFGC 190

Query: 207 -GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIV 263
                G L++    GI+GLG GD+S++ Q+  +  I+  FS C        ++ G   +V
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLCY-----GGMDVGGGAMV 245

Query: 264 SGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRLGVST------PDIVIDSGTTL 309
            G   +S P   A          +Y + +  I V  +RL ++          V+DSGTT 
Sbjct: 246 LGG--ISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFDGKHGTVLDSGTTY 303

Query: 310 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ-VPEVTIHFR- 360
            +LP+  + +   +++  +   + ++ P  +  ++C+S      + LS+  P V + F  
Sbjct: 304 AYLPEAAFLAFKDAIVKELQSLKKISGPDPNYNDICFSGAGIDVSQLSKSFPVVDMVFEN 363

Query: 361 GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           G    LS  N+     KV       VF+   +   + G I+  N LV YD EQ  + F  
Sbjct: 364 GQKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDREQTKIGFWK 423

Query: 418 TDCTK 422
           T+C +
Sbjct: 424 TNCAE 428


>gi|357482031|ref|XP_003611301.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355512636|gb|AES94259.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 481

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 113/415 (27%), Positives = 178/415 (42%), Gaps = 101/415 (24%)

Query: 89  ANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQD---------------- 131
           ++Y +  ++G+ P + + +  DTGSDL+W    PC P +C + +                
Sbjct: 73  SDYTLSFNLGSNPPQLITLYMDTGSDLVWF---PCSPFECILCEGKPQTTKPANITKQTH 129

Query: 132 -----SPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNC-QYSVSYGDGSFSNGNLA 183
                SP      +S   S  C+ S+C    +    CS  +C  +  +YGDGSF   NL 
Sbjct: 130 SVSCQSPACSAAHASMSSSNLCAISRCPLDYIETSDCSSFSCPPFYYAYGDGSFV-ANLY 188

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---TIA 240
            +T++L S     + L   TFGC         ++ TG+ G G G +SL +Q+ T    + 
Sbjct: 189 QQTLSLSS-----LHLQNFTFGCAHTA----LAEPTGVAGFGRGILSLPAQLSTLSPHLG 239

Query: 241 GKFSYCLVPVS---------STKINFGTNGIVSGPG-----------VVSTPLTKAKTFY 280
            +FSYCLV  S         S  I    N  ++G G           ++S P  K   +Y
Sbjct: 240 NRFSYCLVSHSFDGDRLRRPSPLILGRHNDTITGAGDGESVEFVYTSMLSNP--KHPYYY 297

Query: 281 VLTIDAISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLSVMSSMI 328
            + +  ISVG +   V  P+I            V+DSGTT T LP+ + + +++     +
Sbjct: 298 CVGLAGISVGKRT--VPAPEILKRVDEKGNGGMVVDSGTTFTMLPESFYNAVVNEFDKRV 355

Query: 329 -----EAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--ADVKLSRSNFF--------- 372
                 A  +   TG L  CY  N LSQ+P + +HF G  +DV L R N+F         
Sbjct: 356 NRFHKRASEIETKTG-LGPCYYLNGLSQIPVLKLHFVGNNSDVVLPRKNYFYEFMDGGDG 414

Query: 373 VKVSEDIVCSVFKGITNSVPI-------YGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           ++    + C +     +   +        GN  Q  F V YD+E++ V F   +C
Sbjct: 415 IRRKGKVGCMMLMNGEDETELDGGPGATLGNYQQQGFEVVYDLEKERVGFAKKEC 469


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 159/357 (44%), Gaps = 38/357 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++R  +GTPP     V DT +D +W  C  C  S C    S  F+   SSTY ++ CS
Sbjct: 104 NYVVRARLGTPPQLMFMVLDTSNDAVWLPCSGC--SGC-SNASTSFNTNSSSTYSTVSCS 160

Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           ++QC      +C         C ++ SYG  S  + NL  +T+TL         +P  +F
Sbjct: 161 TTQCTQARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTLSPD-----VIPNFSF 215

Query: 205 GCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-GI 262
           GC  +  G  NS    G++GLG G +SL+SQ  +  +G FSYCL    S   +     G+
Sbjct: 216 GCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 273

Query: 263 VSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
           +  P  +  TPL    +  + Y + +  +SVG+ ++ V          S    +IDSGT 
Sbjct: 274 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSGTV 333

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
           +T   Q     +       +         G+ + C+S ++ +  P++T+H    D+KL  
Sbjct: 334 ITRFAQPVYEAIRDEFRKQVNGS--FSTLGAFDTCFSADNENVTPKITLHMTSLDLKLPM 391

Query: 369 SNFFVKVSED-IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N  +  S   + C    GI  +    + +  N+ Q N  + +D+    +   P  C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 176/394 (44%), Gaps = 53/394 (13%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259

Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
           Q+      ++ K FSYCL P   TK  +   G      +    TPL ++  +  Y LT++
Sbjct: 260 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318

Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
            +    QRL  S+ ++++DSG        +T   L +         GY+    +   S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378

Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
                 D +G       F++ S +P + I F  GA + L   N F       +C  F + 
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438

Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 I GN +  +F   +DI+ +   FK   C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 97/367 (26%), Positives = 163/367 (44%), Gaps = 50/367 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP  +     DTGS   W     C+ CP     ++    +DP+ S + K + 
Sbjct: 83  YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
           C  + C S  +  C+  + C Y   Y DG  + G L T+ +      G     P    +T
Sbjct: 143 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  N+      GI+G G  + + +SQ+    AGK    FS+CL   +   I 
Sbjct: 201 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 257

Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
           F    +V  P V +TP+ K  + ++++ + +I+V    L +         T    IDSG+
Sbjct: 258 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-------QVPEVTIHFR 360
           TL +LP+        + S +I A     P  ++   Y+F           + P++T HF 
Sbjct: 317 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 369

Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
             D+ L     ++ ++   +  C  F+  GI     + I G+++ +N +V YD+E+Q + 
Sbjct: 370 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 428

Query: 415 FKPTDCT 421
           +   +C+
Sbjct: 429 WTEHNCS 435


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 168/368 (45%), Gaps = 46/368 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSL 146
           Y  +I +GTPP       DTGSD+ W  C PC       Q   +    +DP  SST  +L
Sbjct: 37  YYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGAL 96

Query: 147 PCSSSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALP 200
            C  S C +    N+ SC+    C YS +YGDGS + G    + +T        Q     
Sbjct: 97  SCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTA 156

Query: 201 GITFGCGTNNGG--LFNSKT-TGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKI 255
            + FGCGT   G  L +S+   G++G G   +S+ SQ+ +   +  +F++CL        
Sbjct: 157 SVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCL---QGDNQ 213

Query: 256 NFGT--NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-----------DIV 302
             GT   G VS P +  TP+  ++  Y + +  I+V  +   V+TP            ++
Sbjct: 214 GGGTIVIGSVSEPNISYTPIV-SRNHYAVGMQNIAVNGRN--VTTPASFDTTSTSAGGVI 270

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RG 361
           +DSGTTL +L     +  ++ +S+  E+   +  +  L+L +  +  +  P V + F  G
Sbjct: 271 MDSGTTLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWC-SLQADFPTVKLFFDAG 328

Query: 362 ADVKLSRSNFF----VKVSEDIVCSVFKGITN-----SVPIYGNIMQTNFLVGYDIEQQT 412
           A + L+  N+     ++  +   C  ++  T      S  I G+I+  + LV YD + + 
Sbjct: 329 AVMNLTPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRV 388

Query: 413 VSFKPTDC 420
           V +K  DC
Sbjct: 389 VGWKSFDC 396


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 111/394 (28%), Positives = 176/394 (44%), Gaps = 53/394 (13%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259

Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
           Q+      ++ K FSYCL P   TK  +   G      +    TPL ++  +  Y LT++
Sbjct: 260 QLAGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318

Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
            +    QRL  S+ ++++DSG        +T   L +         GY+    +   S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378

Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
                 D +G       F++ S +P + I F  GA + L   N F       +C  F + 
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438

Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 I GN +  +F   +DI+ +   FK   C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|219887985|gb|ACL54367.1| unknown [Zea mays]
          Length = 515

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 96/350 (27%), Positives = 149/350 (42%), Gaps = 23/350 (6%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
           Y   + +GTP T  L   DTGSDL W  C+   C P   Y     +D  ++ P  S+T +
Sbjct: 96  YYAWVDVGTPATSFLVALDTGSDLFWVPCDCIQCAPLSGYRGNLDRDLRIYRPAESTTSR 155

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C S+   +     C Y++ Y  + + S+G L  +T+ L            + 
Sbjct: 156 HLPCSHELCQSVPGCTNPKQPCPYNIDYFSENTTSSGLLIEDTLHLNYREDHVPVNASVI 215

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++ LG  DIS+ S +     +   FS C    SS +I FG 
Sbjct: 216 IGCGQKQSGDYLDGIAPDGLLALGMADISVPSFLARAGLVQNSFSMCFKEDSSGRIFFGD 275

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+ S       PL      Y + +D   +G++ L  ++   ++DSGT+ T LP      
Sbjct: 276 QGVPSQQSTPFVPLYGKLQTYAVNVDKSCIGHKCLEGTSFKALVDSGTSFTSLPFDVYKA 335

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
                   + A  V     + + CYS + L    VP +T+ F  AD  L   N  +  ++
Sbjct: 336 FTMEFDKQMNATRVPYEDTTWKYCYSASPLEMPDVPTITLTF-AADKSLQAVNPILPFND 394

Query: 378 D---IVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
               +       + ++ PI   I+  NFLVGY    D E   + +  ++C
Sbjct: 395 KQGALAGFCLAVLPSTEPI--GIIAQNFLVGYHVVFDRESMKLGWYRSEC 442


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/383 (26%), Positives = 169/383 (44%), Gaps = 32/383 (8%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 65  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P+MSSTY+ + C+   C   + +      C Y   Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKCNMD-CNCDDDRE----QCVYEREYAEHSSSKGVL 177

Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTI 239
             + ++ G+ +   +      FGC T   G L++ +  GI+GLG GD+SL+ Q+  +  I
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQLVDKGLI 235

Query: 240 AGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVST 298
           +  F  C   +     +    G      +V T     ++ +Y + +  I V  ++L + +
Sbjct: 236 SNSFGLCYGGMDVGGGSMILGGFDYPSDMVFTDSDPDRSPYYNIDLTGIRVAGKQLSLHS 295

Query: 299 ------PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS----- 345
                    V+DSGTT  +LP   + +   +VM  +   + +  P  +  + C+      
Sbjct: 296 RVFDGEHGAVLDSGTTYAYLPDAAFAAFEEAVMREVSTLKQIDGPDPNFKDTCFQVAASN 355

Query: 346 -FNSLSQV-PEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQ 399
             + LS++ P V + F+ G    LS  N+     KV       VF    +   + G I+ 
Sbjct: 356 YVSELSKIFPSVEMVFKSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVV 415

Query: 400 TNFLVGYDIEQQTVSFKPTDCTK 422
            N LV YD E   V F  T+C++
Sbjct: 416 RNTLVVYDRENSKVGFWRTNCSE 438


>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
          Length = 392

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/190 (36%), Positives = 102/190 (53%), Gaps = 18/190 (9%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK---INFGTNGIVS 264
             +        +GIVGLG    SL++Q   T    FSYCL P  + +   +  G++  ++
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ---TGVAAFSYCLAPHDAGRNSALFLGSSAKLA 217

Query: 265 GPG-VVSTPL 273
           G G   STP 
Sbjct: 218 GGGKAASTPF 227


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 110/427 (25%), Positives = 184/427 (43%), Gaps = 43/427 (10%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-NQNSSISSSKA 79
           P    T    + ++HR+ P +P   +S+ P +R   AL     R+    N+ SS  + +A
Sbjct: 52  PNSPSTSTIRLTILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEA 108

Query: 80  SQADIIPNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           + + +I  N       +Y+ ++ +GTP      + DT S L W  CEPC  + C +   P
Sbjct: 109 TASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPC-INACLI---P 164

Query: 134 LFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATET 186
            F+P  SSTYK + C S+ C     A++ +KSC      C Y  SY D S S G ++++T
Sbjct: 165 TFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDT 224

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF--- 243
           +T G  + + +      FGC     G+   + +GI+G+     SL SQM  T+  ++   
Sbjct: 225 LTYGLGSQKFI------FGCCNLFRGV-GGRYSGILGMSVNKFSLFSQM--TVGHRYRAM 275

Query: 244 SYCL-VPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYV----LTIDAISVGNQRLGVST 298
           SYC   P +   + FG           +        ++V    + ++ +S+  Q  G  T
Sbjct: 276 SYCFPHPRNQGFLQFGRYDEHKSLLRFTPLYIDGNNYFVHVSNVMVETMSLDVQSSGNQT 335

Query: 299 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNSLS---QVPE 354
                D+GT  T LPQ    +L   + +++E    V   TG        N +     +P 
Sbjct: 336 MRCFFDTGTPYTMLPQSLFVSLSDTVGNLVEGYYRVGASTGQTCFQADGNWIEGDLYMPT 395

Query: 355 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           V I F+ GA + L+  +       ++ C  FK       + G+          D+E  T+
Sbjct: 396 VKIEFQNGARITLNSEDLMFMEEPNVFCLAFKMNDGGDIVLGSRHLMGVHTVVDLEMMTM 455

Query: 414 SFKPTDC 420
             +   C
Sbjct: 456 GLRGQGC 462


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 161/357 (45%), Gaps = 37/357 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++R  +GTPP     V DT +D +W  C  C  S C    S  F+   SSTY ++ CS
Sbjct: 103 NYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGC--SGCS-NASTSFNTNSSSTYSTVSCS 159

Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           ++QC      +C   +     C ++ SYG  S  + +L  +T+TL         +P  +F
Sbjct: 160 TAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPDV-----IPNFSF 214

Query: 205 GCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-GI 262
           GC  +  G  NS    G++GLG G +SL+SQ  +  +G FSYCL    S   +     G+
Sbjct: 215 GCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLGL 272

Query: 263 VSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGTT 308
           +  P  +  TPL    +  + Y + +  +SVG+ ++ V          S    +IDSGT 
Sbjct: 273 LGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGTV 332

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
           +T   Q     +       +     +   G+ + C+S ++ +  P++T+H    D+KL  
Sbjct: 333 ITRFAQPVYEAIRDEFRKQVNVSSFST-LGAFDTCFSADNENVAPKITLHMTSLDLKLPM 391

Query: 369 SNFFVKVSE-DIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            N  +  S   + C    GI  +    + +  N+ Q N  + +D+    +   P  C
Sbjct: 392 ENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 448


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score =  107 bits (266), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 90/392 (22%), Positives = 154/392 (39%), Gaps = 60/392 (15%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSP---------- 133
           I +   YL+ + IGTP      V DT +DL W  C       + Y + S           
Sbjct: 119 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSTGQTMSMGGEG 178

Query: 134 -------LFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
                   + P  SS+++ + CS  +CA L   +C       +C Y     DG+ + G  
Sbjct: 179 AKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIY 238

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
             E  T+  + G+   LPG+  GC     G       G++ LG GD+S           +
Sbjct: 239 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKRFGQR 298

Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
           FS+CL+  +S++     + FG N  V GPG + T +      K  Y   +  + VG +RL
Sbjct: 299 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERL 358

Query: 295 GVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 341
            +  PD            +++D+ T++T  +P+ Y + + + +   +   P        E
Sbjct: 359 DI--PDEVWDAERFVGGGVILDTSTSVTSLVPEAY-APVTAALDRHLSHLPRVYELEGFE 415

Query: 342 LCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITN 389
            CY +              +P  T+   G   +L   ++S    +V   + C  F+ +  
Sbjct: 416 YCYKWTFTGDGVDPAHNVTIPSFTVEMAGG-ARLEPEAKSVVMPEVEPGVACLAFRKLLR 474

Query: 390 SVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             P I GN+    ++   D     + F+   C
Sbjct: 475 GGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 506


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 161/358 (44%), Gaps = 37/358 (10%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
            NY++R  +GTPP     V DT +D +W  C  C  S C    S  F+   SSTY ++ C
Sbjct: 28  GNYVVRAKLGTPPQLMFMVLDTSNDAVWLPCSGC--SGC-SNASTSFNTNSSSTYSTVSC 84

Query: 149 SSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S++QC      +C   +     C ++ SYG  S  + +L  +T+TL         +P  +
Sbjct: 85  STAQCTQARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTLAPD-----VIPNFS 139

Query: 204 FGCGTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN-G 261
           FGC  +  G  NS    G++GLG G +SL+SQ  +  +G FSYCL    S   +     G
Sbjct: 140 FGCINSASG--NSLPPQGLMGLGRGPMSLVSQTTSLYSGVFSYCLPSFRSFYFSGSLKLG 197

Query: 262 IVSGPGVVS-TPLT---KAKTFYVLTIDAISVGNQRLGV----------STPDIVIDSGT 307
           ++  P  +  TPL    +  + Y + +  +SVG+ ++ V          S    +IDSGT
Sbjct: 198 LLGQPKSIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSGT 257

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
            +T   Q     +       +     +   G+ + C+S ++ +  P++T+H    D+KL 
Sbjct: 258 VITRFAQPVYEAIRDEFRKQVNVSSFST-LGAFDTCFSADNENVAPKITLHMTSLDLKLP 316

Query: 368 RSNFFVKVSED-IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             N  +  S   + C    GI  +    + +  N+ Q N  + +D+    +   P  C
Sbjct: 317 MENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGIAPEPC 374


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G P  E     DTGSD++W  C P   CP S         F+P  SST   + 
Sbjct: 91  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 150

Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
           CS  +C +  Q     C   N     C Y+ +YGDGS ++G   ++T+   +  G    A
Sbjct: 151 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 210

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
            +   I FGC  +  G     +    GI G G   +S+ISQ+ +  ++ K FS+CL   S
Sbjct: 211 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 269

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
                    G +  PG+V TPL  ++  Y L +++I+V  Q+L +        +T   ++
Sbjct: 270 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 329

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           DSGTTL +L  G     +S +++ +     +    GS     S +  S  P VT++F G 
Sbjct: 330 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 389

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
                +   ++     +  SV   I         + I G+++  + +  YD+    + + 
Sbjct: 390 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 449

Query: 417 PTDCT 421
             DC+
Sbjct: 450 DYDCS 454


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G P  E     DTGSD++W  C P   CP S         F+P  SST   + 
Sbjct: 89  YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 148

Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
           CS  +C +  Q     C   N     C Y+ +YGDGS ++G   ++T+   +  G    A
Sbjct: 149 CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 208

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
            +   I FGC  +  G     +    GI G G   +S+ISQ+ +  ++ K FS+CL   S
Sbjct: 209 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 267

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
                    G +  PG+V TPL  ++  Y L +++I+V  Q+L +        +T   ++
Sbjct: 268 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 327

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           DSGTTL +L  G     +S +++ +     +    GS     S +  S  P VT++F G 
Sbjct: 328 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 387

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
                +   ++     +  SV   I         + I G+++  + +  YD+    + + 
Sbjct: 388 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 447

Query: 417 PTDCT 421
             DC+
Sbjct: 448 DYDCS 452


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/332 (28%), Positives = 136/332 (40%), Gaps = 16/332 (4%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ----DSPLFDPKMSSTYK 144
           Y   + +GTP T  +   DTGSDL W  C+   C P   Y +    D  ++ P  S+T +
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSR 202

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C   +  S     C YS  Y  + + S+G L  + + L S    A     + 
Sbjct: 203 HLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASVV 262

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++GLG  DIS+ S +     +   FS C     S +I FG 
Sbjct: 263 IGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFK-EDSGRIFFGD 321

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+         PL      Y + +D   VG++    ++ + ++DSGT+ T LP      
Sbjct: 322 QGVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPLNVYKA 381

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVS 376
           +       + A  +     S E CYS + L    VP VT+ F      +       +K  
Sbjct: 382 VAVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDG 441

Query: 377 EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 408
           E  V      +  S    G I Q NFL GY I
Sbjct: 442 EGSVAGFCLALQKSPEPIGIIGQ-NFLTGYHI 472


>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
          Length = 299

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 104/215 (48%), Gaps = 44/215 (20%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPCSS     L   S  GV                 LATET T G  +     +  I F
Sbjct: 149 KLPCSS----DLYHSSTQGV-----------------LATETFTFGDAS-----VSKIGF 182

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           GCG +N G   S+  G+          ISQM+  +
Sbjct: 183 GCGEDNRGRAYSQGAGL---------FISQMKLDV 208



 Score = 45.4 bits (106), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 5/74 (6%)

Query: 335 DPTGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN 389
           D +GS  LELC++     S   VP++  HF G D+KL + N+ ++ S   V  +  G ++
Sbjct: 209 DASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKENYIIEDSALRVICLTMGSSS 268

Query: 390 SVPIYGNIMQTNFL 403
            + I+GN  Q N +
Sbjct: 269 GMSIFGNFQQQNIV 282


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 35/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP        DTGSD++W    QC+ CP       +  L++   S + K + 
Sbjct: 80  YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C  ++    SG    ++C Y   YGDGS + G    + V   S  G      A  
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANG 199

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            + FGCG    G  +S       GI+G G  + S+ISQ+ ++  +   F++CL   +   
Sbjct: 200 SVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGG 259

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
           I F    +V  P V  TPL   +  Y + + A+ VG + L +             +IDSG
Sbjct: 260 I-FAIGRVVQ-PKVNMTPLVPNQPHYNVNMTAVQVGQEFLNIPADLFQPGDRKGAIIDSG 317

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
           TTL +LP+     L+  ++S   A  V       + C+ ++       P VT HF  +  
Sbjct: 318 TTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHFENSVF 376

Query: 365 KLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
                + ++   E + C  ++          ++ + G+++ +N LV YD+E Q + +   
Sbjct: 377 LRVYPHDYLFPYEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEY 436

Query: 419 DCT 421
           +C+
Sbjct: 437 NCS 439


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/317 (25%), Positives = 146/317 (46%), Gaps = 29/317 (9%)

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
            C+ QD P+F P  SST+K  PC +  C S+    C+   C Y    G G  + G +AT+
Sbjct: 60  HCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATD 119

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
           T  +G+         G ++   +       +  +G +GLG    SL++QM+ T   +FSY
Sbjct: 120 TFAIGTAAPARPPASGASWRATSTPW----AGPSGFIGLGRTPWSLVAQMKLT---RFSY 172

Query: 246 CLVPVSS---TKINFGTNGIVSG-----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
           CL P  +   +++  G +  ++G     P V ++P      +Y + ++ I  G+  + + 
Sbjct: 173 CLAPHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMP 232

Query: 298 TPDIVIDSGTTLT----FLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLSQV 352
                +   T +      +   Y     +VM+S + A P A P G+  E+C+    +S  
Sbjct: 233 RGRNTVLVQTAVVRVSLLVDSVYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGA 291

Query: 353 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLV 404
           P++   F+ GA + +  +N+   V  D VC     I        + + I G+  Q N  +
Sbjct: 292 PDLVFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHL 351

Query: 405 GYDIEQQTVSFKPTDCT 421
            +D+++  +SF+P DC+
Sbjct: 352 LFDLDKDMLSFEPADCS 368


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 154/360 (42%), Gaps = 44/360 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R  +GTP    L   D  +D  W  C  C  + C    SP F P  SSTY+++PC 
Sbjct: 82  NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 138

Query: 150 SSQCASLNQKSCS---GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           S QCA +   SC    G +C ++++Y   +F    L  +++ L +       +   TFGC
Sbjct: 139 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNV-----VVSYTFGC 192

Query: 207 GTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN---GI 262
                G  NS    G++G G G +S +SQ + T    FSYCL    S+  NF      G 
Sbjct: 193 LRVVSG--NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSS--NFSGTLKLGP 248

Query: 263 VSGPG-VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
           +  P  + +TPL       + Y + +  I VG++ + V    +          +ID+GT 
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DVKLS 367
            T L     + +       +   PVA P G  + CY  N    VP VT  F GA  V L 
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRT-PVAPPLGGFDTCY--NVTVSVPTVTFMFAGAVAVTLP 365

Query: 368 RSNFFVKVSE-DIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N  +  S   + C         G+  ++ +  ++ Q N  V +D+    V F    CT
Sbjct: 366 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 425


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 160/375 (42%), Gaps = 61/375 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
           NN  +L+ I +GTPP   L   DTG+ L + QCEPC   +C+ Q     +FDP  S ++ 
Sbjct: 202 NNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAGEIFDPSKSESFS 260

Query: 145 SLPCSSSQCAS------LNQKSC--SGVNCQYSVSYGD-GSFSNGNLATETVTLGSTTGQ 195
            + CS ++C +      L  K+C     +C YS+++G   S+S G L  + + +G    +
Sbjct: 261 RVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGK-YAK 319

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK-FSYCLVPVSSTK 254
             + P   FGC  +    ++    G+VG      S   Q+   +  K FSYC  P    K
Sbjct: 320 GYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQVAPLVNYKAFSYCF-PSDRRK 376

Query: 255 INFGTNGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL 312
             + + G  +      TP  L + ++ Y L +D + V    L  +  ++++DSG+  T L
Sbjct: 377 TGYLSIGDYTRVNSTYTPLFLARQQSRYALKLDEVLVNGMALVTTPSEMIVDSGSRWTIL 436

Query: 313 -----------------PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS------FNSL 349
                            P GYN N                  GS  +C+       F+  
Sbjct: 437 LSDTFTQLDAAITEAMRPLGYNRNYYR---------------GSDYICFEDAHFQQFSDW 481

Query: 350 SQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVG 405
           + +P V + F  G  + L   + F   ++  +C+ F     + + V + GN M  +  + 
Sbjct: 482 AALPVVELKFDMGVKMVLQPQSSFHFNNDYGLCTYFMRDASLGSGVQLLGNTMTRSVGIT 541

Query: 406 YDIEQQTVSFKPTDC 420
           +DI+     F+  DC
Sbjct: 542 FDIQGGQFGFRKGDC 556


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 177/369 (47%), Gaps = 37/369 (10%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 49  GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K  +   C Y + Y D + S G L T++ +L     +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSL-PLRNK 162

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
           +   P ++FGCG +      G   + T G++GLG G +SL+SQ++     K    +CL  
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--DIVIDSGT 307
                + FG + +V    V   P+ ++ +    +  + ++   R  +ST   ++V DSG+
Sbjct: 223 SGGGFLFFGDD-MVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGS 281

Query: 308 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF 359
           T T+   Q Y + + ++  S+ ++ + V+DP  SL LC+    +F S+S V +   ++ F
Sbjct: 282 TYTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQF 339

Query: 360 ---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 413
              + A +++   N+ +      VC  +  G     S  I G+I   + +V YD E+  +
Sbjct: 340 IFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQL 399

Query: 414 SFKPTDCTK 422
            +    C++
Sbjct: 400 GWIRGSCSR 408


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 164/367 (44%), Gaps = 47/367 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P  E     DTGSD++W  C P   CP S     +  LFD   SS+ + LP
Sbjct: 84  YFTKVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLP 143

Query: 148 CSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C+   CA++    +Q      +C YS  Y D S ++G   T+++      G+   A +  
Sbjct: 144 CTDPICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSA 203

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKI 255
            I FGC     G     T    GI G G G+ S+ISQ+  R      FS+CL        
Sbjct: 204 TIVFGCSIYQYGDLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHCL-------- 255

Query: 256 NFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP-------DI 301
             G NG   +V G    P +V +PL  ++  Y L + +I++  Q     T        + 
Sbjct: 256 KGGENGGGILVLGEILEPSIVYSPLIPSQPHYTLKLQSIALSGQLFPNPTMFPISNAGET 315

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQVPEVTIH 358
           +IDSGTTL +L +     ++SV++S +     A PT   GS     S +     P +  +
Sbjct: 316 IIDSGTTLAYLVEEVYDWIVSVITSAVSQS--ATPTISRGSQCFRVSMSVADIFPVLRFN 373

Query: 359 FRG-ADVKLSRSNFF----VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 413
           F G A + ++   +     +     + C  F+   + + I G+++  + ++ YD+ +Q +
Sbjct: 374 FEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLARQRI 433

Query: 414 SFKPTDC 420
            +   DC
Sbjct: 434 GWANYDC 440


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 155/358 (43%), Gaps = 73/358 (20%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  ++IGTPP    A+     + +WTQC PC   +C+ QD PLF+              
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFN-------------- 71

Query: 151 SQCASLNQKSCSGVNCQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
                           +Y V   +GD S   G   T+T  +G+ T        + FGC  
Sbjct: 72  ----------------RYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCAM 106

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS----TKINFGTNG-IV 263
           ++        +G+VGLG    SL+ QM  T    FSYCL P  +    + +  G +  + 
Sbjct: 107 DSNIKQLLGASGVVGLGRTPWSLVGQMNAT---AFSYCLAPHGAAGKKSALLLGASAKLA 163

Query: 264 SGPGVVSTPLTKAK---TFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLPQGY 316
            G    +TPL       + Y++ ++ I  G+  + +  P     +++D+   ++FL    
Sbjct: 164 GGKSAATTPLVNTSDDSSDYMIHLEGIKFGD--VIIEPPPNGSVVLVDTIFGVSFLVDAA 221

Query: 317 NSNLLSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRGAD-VKLSR 368
              +   ++  + A P+A PT   +LC+         NS   +P+V + F+GA  + +  
Sbjct: 222 FHAIKKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPP 281

Query: 369 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           S +        VC     S    +T  + I G + Q N    +D++++T+SF+P DC+
Sbjct: 282 SKYMYDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 339


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 104/363 (28%), Positives = 165/363 (45%), Gaps = 64/363 (17%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + + IGTP      V DT SDL+WTQC+PC    C  Q   ++DP  + TY +L  SS  
Sbjct: 90  VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSS-- 145

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
                          Y+ +Y   SF++G  ATET  LG+ T     +  ITFGCGT N G
Sbjct: 146 ---------------YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQG 185

Query: 213 LFN--SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------------SSTKINFG 258
            ++  +   G+   G G +SL++Q+      +FSYC                S       
Sbjct: 186 YYDNVAGVFGVGRGGRGGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNA 242

Query: 259 TNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGTTLT 310
           T    +   +V+ P+ K+  F  L    ++VG   + V+           +VIDS + +T
Sbjct: 243 TTTPAASTPMVADPVLKSGYFVKLV--GVTVGATLVDVAGASSAEGGGRALVIDSTSPVT 300

Query: 311 FLPQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP-----EVTIHFRG 361
            L +         L++ ++ + EA   A     L+LC+   +    P      +T+HF G
Sbjct: 301 VLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDG 360

Query: 362 --ADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
             AD+ L  +++  K S   ++C ++    +N VP+ G+    + LV YD+ +  VSF+P
Sbjct: 361 GAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYDLAKNVVSFQP 420

Query: 418 TDC 420
            DC
Sbjct: 421 LDC 423


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 103/360 (28%), Positives = 154/360 (42%), Gaps = 44/360 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R  +GTP    L   D  +D  W  C  C  + C    SP F P  SSTY+++PC 
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 157

Query: 150 SSQCASLNQKSCS---GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           S QCA +   SC    G +C ++++Y   +F    L  +++ L +       +   TFGC
Sbjct: 158 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTF-QAVLGQDSLALENNV-----VVSYTFGC 211

Query: 207 GTNNGGLFNS-KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTN---GI 262
                G  NS    G++G G G +S +SQ + T    FSYCL    S+  NF      G 
Sbjct: 212 LRVVSG--NSVPPQGLIGFGRGPLSFLSQTKDTYGSVFSYCLPNYRSS--NFSGTLKLGP 267

Query: 263 VSGPG-VVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
           +  P  + +TPL       + Y + +  I VG++ + V    +          +ID+GT 
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA-DVKLS 367
            T L     + +       +   PVA P G  + CY  N    VP VT  F GA  V L 
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRT-PVAPPLGGFDTCY--NVTVSVPTVTFMFAGAVAVTLP 384

Query: 368 RSNFFVKVSE-DIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N  +  S   + C         G+  ++ +  ++ Q N  V +D+    V F    CT
Sbjct: 385 EENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSRELCT 444


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  106 bits (264), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 107/401 (26%), Positives = 182/401 (45%), Gaps = 59/401 (14%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP E   + DTGS
Sbjct: 49  RVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGS 101

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SS+YK+L C+   C   ++    G  C Y   Y
Sbjct: 102 TVTYVPCSTC--KQCGKHQDPKFQPELSSSYKALKCNPD-CNCDDE----GKLCVYERRY 154

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISL 231
            + S S+G L+ + ++ G+ +   +      FGC     G LF+ +  GI+GLG G +S+
Sbjct: 155 AEMSSSSGVLSEDLISFGNES--QLTPQRAVFGCENVETGDLFSQRADGIMGLGRGKLSV 212

Query: 232 ISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGP-GVV---STPLTKAKTFYVLT 283
           + Q+  +  I   FS C       ++  G    G +S P G+V   S P      +Y + 
Sbjct: 213 VDQLVDKGVIEDVFSLC---YGGMEVGGGAMVLGKISPPAGMVFSHSDPFRSP--YYNID 267

Query: 284 IDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--- 333
           +  + V  + L ++ P +       V+DSGTT  + P+      +++  ++I+  P    
Sbjct: 268 LKQMHVAGKSLKLN-PKVFNGKHGTVLDSGTTYAYFPK---EAFIAIKDAIIKEIPSLKR 323

Query: 334 ---ADPTGSLELCYS--FNSLSQV----PEVTIHF-RGADVKLSRSNFF---VKVSEDIV 380
               DP    ++C+S     ++++    PE+ + F  G  + LS  N+     KV     
Sbjct: 324 IHGPDPNYD-DVCFSGAGRDVAEIHNFFPEIDMEFGNGQKLILSPENYLFRHTKVRGAYC 382

Query: 381 CSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             +F    +S  + G I+  N LV YD E   + F  T+C+
Sbjct: 383 LGIFPD-RDSTTLLGGIVVRNTLVTYDRENDKLGFLKTNCS 422


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 102/383 (26%), Positives = 170/383 (44%), Gaps = 62/383 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C     +         F+   S +Y+ +
Sbjct: 27  HNISLTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTTTSYPTTFNQTRSISYRPI 83

Query: 147 PCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSSS C +  +      SC S   C  ++SY D S S GNLA++T  +G     A  +P
Sbjct: 84  PCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMG-----ASDIP 138

Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS-STKIN 256
           G+ FGC     ++    +SK TG++G+  G +S +SQM      KFSYC+     S  + 
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQMGFP---KFSYCISGTDFSGMLL 195

Query: 257 FGTNGIVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD---- 300
            G +       +  TPL +  T         Y + ++ I V ++ L     V  PD    
Sbjct: 196 LGESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGA 255

Query: 301 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SF 346
              ++DSGT  TFL         S  L+  +  +    + DP     G+++LCY    S 
Sbjct: 256 GQTMVDSGTQFTFLLGPAYTALRSEFLNQTTGFLRV--LEDPDFVFQGAMDLCYRVPISQ 313

Query: 347 NSLSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNI 397
             L ++P V++ F GA++ ++      +V      ++ + C  F     +     + G+ 
Sbjct: 314 RVLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHH 373

Query: 398 MQTNFLVGYDIEQQTVSFKPTDC 420
            Q N  + +D+E+  +      C
Sbjct: 374 HQQNVWMEFDLERSRIGLAQVRC 396


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 161/365 (44%), Gaps = 35/365 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G P  E     DTGSD++W  C P   CP S         F+P  SST   + 
Sbjct: 5   YFTRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRIT 64

Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
           CS  +C +  Q     C   N     C Y+ +YGDGS ++G   ++T+   +  G    A
Sbjct: 65  CSDDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTA 124

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
            +   I FGC  +  G     +    GI G G   +S+ISQ+ +  ++ K FS+CL   S
Sbjct: 125 NSSASIVFGCSNSQSGDLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL-KGS 183

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVI 303
                    G +  PG+V TPL  ++  Y L +++I+V  Q+L +        +T   ++
Sbjct: 184 DNGGGILVLGEIVEPGLVYTPLVPSQPHYNLNLESIAVNGQKLPIDSSLFTTSNTQGTIV 243

Query: 304 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGA 362
           DSGTTL +L  G     +S +++ +     +    GS     S +  S  P VT++F G 
Sbjct: 244 DSGTTLAYLADGAYDPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVDSSFPTVTLYFMGG 303

Query: 363 DVKLSRSNFFVKVSEDIVCSVFKGI------TNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
                +   ++     +  SV   I         + I G+++  + +  YD+    + + 
Sbjct: 304 VAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKIFVYDLANMRMGWA 363

Query: 417 PTDCT 421
             DC+
Sbjct: 364 DYDCS 368


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 95/363 (26%), Positives = 158/363 (43%), Gaps = 35/363 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP        DTGSD++W    QC+ CP       +  L++   S + K + 
Sbjct: 80  YYAKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVS 139

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C  ++    SG    ++C Y   YGDGS + G    + V   S  G      A  
Sbjct: 140 CDDDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANG 199

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTK 254
            + FGCG    G  +S       GI+G G  + S+ISQ+ ++  +   F++CL   +   
Sbjct: 200 SVIFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCLDGRNGGG 259

Query: 255 INFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVIDSG 306
           I F    +V  P V  TPL   +  Y + + A+ VG + L +             +IDSG
Sbjct: 260 I-FAIGRVVQ-PKVNMTPLVPNQPHYNVNMTAVQVGQEFLTIPADLFQPGDRKGAIIDSG 317

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 364
           TTL +LP+     L+  ++S   A  V       + C+ ++       P VT HF  +  
Sbjct: 318 TTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYK-CFQYSGRVDEGFPNVTFHFENSVF 376

Query: 365 KLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
                + ++   E + C  ++          ++ + G+++ +N LV YD+E Q + +   
Sbjct: 377 LRVYPHDYLFPHEGMWCIGWQNSAMQSRDRRNMTLLGDLVLSNKLVLYDLENQLIGWTEY 436

Query: 419 DCT 421
           +C+
Sbjct: 437 NCS 439


>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 336

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 154/356 (43%), Gaps = 51/356 (14%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
           N  Y   +SIG PP  +L + DT SD++W  C              LFDP  SST+  L 
Sbjct: 6   NKPYWSILSIGQPPIPQLVIMDTSSDILWIMCN---------HVGLLFDPSKSSTFSPLC 56

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C       C  +   +++SY D S ++G   ++TV   +T      +  +  
Sbjct: 57  KTPCGFKGC------KCDPI--PFNISYVDKSSTSGTFGSDTVVFETTDEGHSQIFDVLV 108

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVS 264
            CG N G   +    GI GL  G  SL     T I  KFSYC+  ++    N+    +  
Sbjct: 109 RCGHNIGFNTDPGYNGIRGLNNGPNSL----ATKIGQKFSYCVGNLADPYYNYNQLILCE 164

Query: 265 GPGV--VSTPLTKAKTFYVLTIDAISVGNQRLGVS----------TPDIVIDSGTTLTFL 312
           G  +   STP      FY +T+  I VG +RL ++          T  ++ DSGTT+T+L
Sbjct: 165 GADLEGYSTPFEVHHGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYL 224

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 368
               +  L + + +++            +LC+       L   P VT HF  GAD+ L  
Sbjct: 225 VDSVHKLLYNEVRNLLSWS-------FRQLCHYGIISRDLVGFPVVTFHFADGADLALDT 277

Query: 369 SNFFVKVSEDIVCSV----FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +FF +++  +  +V        T S  +   + Q ++ VGYD+    V F+  DC
Sbjct: 278 GSFFNQLNSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/364 (27%), Positives = 172/364 (47%), Gaps = 35/364 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W   T C  CP +         FDP +SS+   + 
Sbjct: 84  YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143

Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG--- 201
           CS  +C S    +  CS  N C YS  YGDGS ++G   ++ ++  +     +A+     
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAP 203

Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTKIN 256
             FGC     G          GI GLG G +S+ISQ+    +A + FS+CL    S    
Sbjct: 204 FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGG-G 262

Query: 257 FGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRL-------GVSTPD-IVIDSGTT 308
               G +  P  V TPL  ++  Y + + +I+V  Q L        ++T D  +ID+GTT
Sbjct: 263 IMVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTT 322

Query: 309 LTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADV 364
           L +LP    S  +  +++ +    +P+   T     C+   +  +   PEV++ F G   
Sbjct: 323 LAYLPDEAYSPFIQAIANAVSQYGRPI---TYESYQCFEITAGDVDVFPEVSLSFAGGAS 379

Query: 365 KLSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 419
            + R + ++++       I C  F+ +++  + I G+++  + +V YD+ +Q + +   D
Sbjct: 380 MVLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYD 439

Query: 420 CTKQ 423
           C+ +
Sbjct: 440 CSLE 443


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 88/390 (22%), Positives = 154/390 (39%), Gaps = 56/390 (14%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY---------------- 128
           I +   YL+ +  GTP      V DT +DL W  C        +                
Sbjct: 121 IAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAA 180

Query: 129 --MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
              +    + P  SS+++ + CS  +CA L   +C       +C Y     DG+ + G  
Sbjct: 181 KEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIY 240

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
             E  T+  + G+   LPG+  GC     G       G++ LG G++S           +
Sbjct: 241 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQR 300

Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
           FS+CL+  +S++     + FG N  V GPG + T +      K  Y   +  I VG +RL
Sbjct: 301 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERL 360

Query: 295 GVSTP----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
            +              +++D+ T++T  +P+ Y + + S +   +   P        E C
Sbjct: 361 DIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY-AAVTSALDRHLSHLPRVYELDGFEYC 419

Query: 344 YSFN------SLSQ---VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITNSV 391
           Y +        L+    VP +T+   G   +L   ++S    +V   + C  F+ +    
Sbjct: 420 YRWTFAGDGVDLAHNVTVPRLTVEMAGG-ARLEPEAKSVVMPEVVPGVACLAFRKLPRGG 478

Query: 392 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           P I GN++   ++   D  +  + F+   C
Sbjct: 479 PGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  105 bits (263), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 160/361 (44%), Gaps = 45/361 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDPKMSS 141
           N   Y++  S+GTPP     V D  SD +W QC  C  + C         +P F   +SS
Sbjct: 93  NTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYAFLSS 150

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTTGQAV 197
           T + + C++  C  L  ++CS  +  C YS  YG G+ +   G LA +     +     V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-----V 205

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKIN- 256
              G+ FGC     G       G++GLG G++S +SQ++    G+FSY L P  +  +  
Sbjct: 206 RADGVIFGCAVATEG----DIGGVIGLGRGELSPVSQLQI---GRFSYYLAPDDAVDVGS 258

Query: 257 ---FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV--STPDIVID--SG 306
              F  +        VSTPL     +++ Y + +  I V  + L +   T D+  D   G
Sbjct: 259 FILFLDDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGG 318

Query: 307 TTL------TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 358
             L      TFL  G    +   M+S IE +        L+LCY+  SL  ++VP + + 
Sbjct: 319 VVLSITIPVTFLDAGAYKVVRQAMASKIELRAADGSELGLDLCYTSESLATAKVPSMALV 378

Query: 359 FRGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 415
           F G  V +L   N F++  +  + C ++         + G+++Q    + YDI    + F
Sbjct: 379 FAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGSRLVF 438

Query: 416 K 416
           +
Sbjct: 439 E 439


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 124/475 (26%), Positives = 197/475 (41%), Gaps = 65/475 (13%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
           A  L  + +  + CFY  S +  Q  G   E   R+  +S   P Y  +           
Sbjct: 83  ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140

Query: 48  -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
            +   +R+ D   ++ NR+      ++ ++S A    + ++ P+   Y   I IG PP  
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199

Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
                DTGSDL W QC+ PC  + C     PL+ P   +  K +P     C  L  NQ  
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254

Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
           C     C Y + Y D S S G LA + + + +T G    L    FGC  +  G   S   
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313

Query: 217 KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLV-PVSSTKINFGTNGIVSGPGVVSTPL 273
           KT GI+GL    IS  SQ+ +   IA  F +C+          F  +  V   GV  T +
Sbjct: 314 KTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI 373

Query: 274 TKA-KTFYVLTIDAISVGNQRL-----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVM--S 325
                  Y      +  G+Q+L       ST  ++ DSG++ T+LP     NL++ +  +
Sbjct: 374 RSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYA 433

Query: 326 SMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVS-----EDI 379
           S    Q  +D T  L LC+  +  +  + +V   F   ++   +   F+  +     ED 
Sbjct: 434 SPGFVQDTSDRT--LPLCWKADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDY 491

Query: 380 VC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +      +V  G+ N       S  I G++     LV YD +++ + +  +DCTK
Sbjct: 492 LIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDCTK 546


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 102/338 (30%), Positives = 153/338 (45%), Gaps = 48/338 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+I + +GTP   ++   DTGS   W  CE C    C+         + S+T   + C +
Sbjct: 1   YVISVGLGTPSKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 56

Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C         Q S +  +C + VSY DGS S G L  +T+T          +PG TFG
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPGFTFG 112

Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
           C  ++ G        G++G+G G +S++ Q   T  G FSYCL P+  ++  F   T G 
Sbjct: 113 CNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCL-PLQMSERGFFSKTTGY 170

Query: 263 VSGPGVVSTPLTKAK-----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
            S  G ++   T  +             + + + AISV  +RLG+     S   +V DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-R 360
           + L+++P       LSV+S  I     +  A    S   CY   S+ +  +P +++HF  
Sbjct: 231 SELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 286

Query: 361 GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYG 395
           GA   L R   FV+ S   +D+ C  F   T SV I G
Sbjct: 287 GARFDLGRHGVFVERSVQEQDVWCLAF-APTESVSIIG 323


>gi|356539352|ref|XP_003538162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 489

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 177/369 (47%), Gaps = 45/369 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W     C  CP +         FDP  SST   + 
Sbjct: 77  YYTKVKLGTPPRELYVQIDTGSDVLWVSCGSCNGCPQTSGLQIQLNYFDPGSSSTSSLIS 136

Query: 148 CSSSQCASLNQ---KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS------TTGQA 196
           C   +C S  Q    SCSG N  C Y+  YGDGS ++G   ++ +   S      TT  +
Sbjct: 137 CLDRRCRSGVQTSDASCSGRNNQCTYTFQYGDGSGTSGYYVSDLMHFASIFEGTLTTNSS 196

Query: 197 VALPGITFGCGT-NNGGLFNSKTT--GIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVS 251
            +   + FGC     G L  S+    GI G G   +S+ISQ+ +  IA + FS+CL   +
Sbjct: 197 AS---VVFGCSILQTGDLTKSERAVDGIFGFGQQGMSVISQLSSQGIAPRVFSHCLKGDN 253

Query: 252 STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI---------V 302
           S         IV  P +V +PL  ++  Y L + +ISV  Q + ++ P +         +
Sbjct: 254 SGGGVLVLGEIVE-PNIVYSPLVPSQPHYNLNLQSISVNGQIVRIA-PSVFATSNNRGTI 311

Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV---PEVTIH 358
           +DSGTTL +L  + YN  ++++  + +  Q V         CY   + S V   P+V+++
Sbjct: 312 VDSGTTLAYLAEEAYNPFVIAI--AAVIPQSVRSVLSRGNQCYLITTSSNVDIFPQVSLN 369

Query: 359 FR-GADVKLSRSNFFVK---VSE-DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQT 412
           F  GA + L   ++ ++   + E  + C  F+ I+  S+ I G+++  + +  YD+  Q 
Sbjct: 370 FAGGASLVLRPQDYLMQQNFIGEGSVWCIGFQKISGQSITILGDLVLKDKIFVYDLAGQR 429

Query: 413 VSFKPTDCT 421
           + +   DC+
Sbjct: 430 IGWANYDCS 438


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score =  105 bits (262), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 88/390 (22%), Positives = 154/390 (39%), Gaps = 56/390 (14%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY---------------- 128
           I +   YL+ +  GTP      V DT +DL W  C        +                
Sbjct: 121 IAHVGMYLVSVRFGTPALPYNLVLDTANDLTWINCRLRRRKGKHYGRTMSVGAGDDGAAA 180

Query: 129 --MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFSNGNL 182
              +    + P  SS+++ + CS  +CA L   +C       +C Y     DG+ + G  
Sbjct: 181 KEARRKNWYRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIY 240

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK 242
             E  T+  + G+   LPG+  GC     G       G++ LG G++S           +
Sbjct: 241 GKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGEMSFAVHAAKRFGQR 300

Query: 243 FSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRL 294
           FS+CL+  +S++     + FG N  V GPG + T +      K  Y   +  I VG +RL
Sbjct: 301 FSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERL 360

Query: 295 GVSTP----------DIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELC 343
            +              +++D+ T++T  +P+ Y + + S +   +   P        E C
Sbjct: 361 DIPQEIWDAEKVVGGGVILDTSTSVTSLVPEAY-AAVTSALDRHLSHLPRVYELDGFEYC 419

Query: 344 YSFN------SLSQ---VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFKGITNSV 391
           Y +        L+    VP +T+   G   +L   ++S    +V   + C  F+ +    
Sbjct: 420 YRWTFAGDGVDLTHNVTVPRLTVEMAGG-ARLEPEAKSVVMPEVVPGVACLAFRKLPRGG 478

Query: 392 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           P I GN++   ++   D  +  + F+   C
Sbjct: 479 PGILGNVLMQEYIWEIDHGKGKMRFRKDKC 508


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 90/396 (22%), Positives = 154/396 (38%), Gaps = 64/396 (16%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS----------- 132
           I +   YL+ + IGTP      V DT +DL W  C       + Y + S           
Sbjct: 118 IAHVGMYLVSVRIGTPALPYNLVLDTATDLTWINCRLRRRKGKHYGRQSMGQTMSVGGEG 177

Query: 133 ----------PLFDPKMSSTYKSLPCSSSQCASLNQKSC----SGVNCQYSVSYGDGSFS 178
                       + P  SS+++ + CS  +CA L   +C       +C Y     DG+ +
Sbjct: 178 ATAAKKEASKNWYRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVT 237

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G    E  T+  + G+   LPG+  GC     G       G++ LG GD+S        
Sbjct: 238 IGIYGKEKATVTVSDGRMAKLPGLILGCSVLEAGGSVDAHDGVLSLGNGDMSFAVHAAKR 297

Query: 239 IAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVG 290
              +FS+CL+  +S++     + FG N  V GPG + T +      K  Y   +  + VG
Sbjct: 298 FGQRFSFCLLSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVG 357

Query: 291 NQRLGVSTPD------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPT 337
            +RL +  PD            +++D+ T++T  +P+ Y + + + +   +   P     
Sbjct: 358 GERLDI--PDEVWDAERFVGGGVILDTSTSVTSLVPEAY-APVTAALDRHLSHLPRVYEL 414

Query: 338 GSLELCYSFNSLSQ---------VPEVTIHFRGADVKL---SRSNFFVKVSEDIVCSVFK 385
              E CY +              +P  T+   G   +L   ++S    +V   + C  F+
Sbjct: 415 EGFEYCYKWTFTGDGVXPAHNVTIPSFTVEMAGG-ARLEPEAKSVVMPEVEPGVACLAFR 473

Query: 386 GITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            +    P I GN+    ++   D     + F+   C
Sbjct: 474 KLLRGGPGILGNVFMQEYIWEIDHGDGKIRFRKDKC 509


>gi|38605896|emb|CAD41523.2| OSJNBb0020O11.8 [Oryza sativa Japonica Group]
          Length = 519

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 114/411 (27%), Positives = 177/411 (43%), Gaps = 91/411 (22%)

Query: 89  ANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM----------QDSPL- 134
           ++Y + +S+G PP+   +V+   DTGSDL+W    PC P  C +            SPL 
Sbjct: 86  SDYTLSLSVG-PPSTASSVSLFLDTGSDLVWF---PCAPFTCMLCEGKATPGGNHSSPLP 141

Query: 135 ----------FDPKMSSTYKSLP----CSSSQCA--SLNQKSCSGVNCQ-YSVSYGDGSF 177
                       P  S+ + S P    C++++C   ++   SC+   C     +YGDGS 
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSL 201

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
              NL    V L ++    +A+   TF C         ++  G+ G G G +SL +Q+  
Sbjct: 202 V-ANLRRGRVGLAAS----MAVENFTFACAHTA----LAEPVGVAGFGRGPLSLPAQLAP 252

Query: 238 TIAGKFSYCLVP--------VSSTKINFGTNGIVSGPGV-----VSTPL---TKAKTFYV 281
           +++G+FSYCLV         + S+ +  G +   +  G      V TPL    K   FY 
Sbjct: 253 SLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYS 312

Query: 282 LTIDAISVGNQR------LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           + ++A+SVG +R      LG    D    +V+DSGTT T LP    + +    +  + A 
Sbjct: 313 VALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAA 372

Query: 332 PVADPTGS-----LELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSED----IV 380
                 G+     L  CY ++ S   VP V +HFRG A V L R N+F+    +    + 
Sbjct: 373 RFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 381 CSVFKGITNS----------VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           C +   +  +              GN  Q  F V YD++   V F    CT
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCT 483


>gi|125575539|gb|EAZ16823.1| hypothetical protein OsJ_32295 [Oryza sativa Japonica Group]
          Length = 383

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 168/360 (46%), Gaps = 44/360 (12%)

Query: 96  SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP-KMSSTYKSLPCSSSQCA 154
           +IGTPP    A  D G  L+WTQC  C  S C+ Q +P   P ++       PC ++ C 
Sbjct: 29  TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQGAPAVRPDQVVPPTGPEPCGTALCE 88

Query: 155 --SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
               + ++CSG  C Y  S      ++G + T+ V +G+ T  +VA     FGC   ++ 
Sbjct: 89  FFPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDI 143

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVP--------------VSSTKINF 257
            L +   +G VGL    +SL++QM  T    FS+CL P               ++     
Sbjct: 144 KLMDGGPSGFVGLARTPLSLVAQMNVT---AFSHCLAPHDGGGGKNSRLFLGAAAKLAGG 200

Query: 258 GTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD----IVIDSGTTLTFLP 313
           G +  ++ P V S+P      +Y++ ++ I  G++ + ++ P     +++ + + ++FL 
Sbjct: 201 GKSAAMTTPFVKSSPDDIKSLYYLINLEGIKAGDEAI-ITVPQSGRTVLLQTFSPVSFLV 259

Query: 314 QGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRG-ADVKLSR 368
            G   +L   +++ +   P A P    +    LC+    +S  P+V + F+G A + +  
Sbjct: 260 DGVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVVLTFQGAAALTVPP 318

Query: 369 SNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           +N+ + V +D VC                + I G + Q N    YD+E++T+SF+  DC+
Sbjct: 319 TNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDLEKETLSFEAADCS 378


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 158/378 (41%), Gaps = 65/378 (17%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + IGTP   +  V DTGS L W QC P    +     +  FDP +SS++  LPCS  
Sbjct: 82  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L     S   C YS  Y DG+F+ GNL  E  T  ++       P +  G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILG 197

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
           C        ++   GI+G+  G +S ISQ + +   KFSYC +P  S +    + G   +
Sbjct: 198 CAKE-----STDVKGILGMNLGRLSFISQAKIS---KFSYC-IPTRSNRPGLASTGSFYL 248

Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVST----PD----- 300
              P           TF             Y + +  I +G +RL + +    PD     
Sbjct: 249 GENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFRPDAGGSG 308

Query: 301 -IVIDSGTTLTFLPQ-GYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSL----SQ 351
             ++DSG+  T L    Y+     ++ ++ S ++   V   T   ++C+  N        
Sbjct: 309 QTMVDSGSEFTHLVDVAYDKVKEEIVRLVGSRLKKGYVYGSTA--DMCFDGNHQMVIGRL 366

Query: 352 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLV 404
           + ++   F RG ++ + +    V V   I C      S+    +N   I GN+ Q N  V
Sbjct: 367 IGDLVFEFGRGVEILVEKQRLLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNLWV 423

Query: 405 GYDIEQQTVSFKPTDCTK 422
            +D+  + V F   +C++
Sbjct: 424 EFDVANRRVGFSKAECSR 441


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + IV        P+ +  ++ +Y      +  G + LGV   ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
           ++ T+        L+  +   +       P  SL LC+     F S+  V +    V + 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLS 341

Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
           F   + A +++   N+ +       C    GI N        + I G+I   + +V YD 
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398

Query: 409 EQQTVSFKPTDCTK 422
           E+  + +    C +
Sbjct: 399 ERGQIGWIRAPCDR 412


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score =  105 bits (261), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 166/374 (44%), Gaps = 45/374 (12%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTKN 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C      C Y + Y D   S G L  ++  L    
Sbjct: 115 ---KLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLAN 171

Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
           G +V  P + FGCG +    +G +  S T G++GLG G +SL+SQ +     K    +CL
Sbjct: 172 G-SVVRPSLAFGCGYDQQVSSGEM--SPTDGVLGLGTGSVSLLSQFKQHGVTKNVVGHCL 228

Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDS 305
                  + FG + +V    V  TP+ ++  + +Y     ++  G+Q L V   ++V DS
Sbjct: 229 SLRGGGFLFFGDD-LVPYQRVTWTPMVRSPLRNYYSPGSASLYFGDQSLRVKLTEVVFDS 287

Query: 306 GTTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYSFNS-LSQVPEVTIHFRGA 362
           G++ T F  Q Y + + ++   +    + V+DP  SL LC+        V +V   F+  
Sbjct: 288 GSSFTYFAAQPYQALVTALKGDLSRTLKEVSDP--SLPLCWKGKKPFKSVLDVKKEFKSL 345

Query: 363 DVKLSRSN-FFVKVSEDIVCSVFK------GITN-------SVPIYGNIMQTNFLVGYDI 408
            +     N  F+++       V K      GI N        + I G+I   + +V YD 
Sbjct: 346 VLNFGNGNKAFMEIPPQNYLIVTKYGNACLGILNGSEVGLKDLSILGDITMQDQMVIYDN 405

Query: 409 EQQTVSFKPTDCTK 422
           E+  + +    C +
Sbjct: 406 EKGQIGWIRAPCDR 419


>gi|115459640|ref|NP_001053420.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|113564991|dbj|BAF15334.1| Os04g0535200 [Oryza sativa Japonica Group]
 gi|116310090|emb|CAH67110.1| H0502G05.1 [Oryza sativa Indica Group]
 gi|116310464|emb|CAH67468.1| OSIGBa0159I10.13 [Oryza sativa Indica Group]
 gi|215715343|dbj|BAG95094.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765807|dbj|BAG87504.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767550|dbj|BAG99778.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218195278|gb|EEC77705.1| hypothetical protein OsI_16781 [Oryza sativa Indica Group]
          Length = 492

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 114/412 (27%), Positives = 177/412 (42%), Gaps = 91/412 (22%)

Query: 89  ANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM----------QDSPL- 134
           ++Y + +S+G PP+   +V+   DTGSDL+W    PC P  C +            SPL 
Sbjct: 86  SDYTLSLSVG-PPSTASSVSLFLDTGSDLVWF---PCAPFTCMLCEGKATPGGNHSSPLP 141

Query: 135 ----------FDPKMSSTYKSLP----CSSSQCA--SLNQKSCSGVNCQ-YSVSYGDGSF 177
                       P  S+ + S P    C++++C   ++   SC+   C     +YGDGS 
Sbjct: 142 PPIDSRRISCASPLCSAAHSSAPTSDLCAAARCPLDAIETDSCASHACPPLYYAYGDGSL 201

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
              NL    V L ++    +A+   TF C         ++  G+ G G G +SL +Q+  
Sbjct: 202 V-ANLRRGRVGLAAS----MAVENFTFACAHTA----LAEPVGVAGFGRGPLSLPAQLAP 252

Query: 238 TIAGKFSYCLVP--------VSSTKINFGTNGIVSGPGV-----VSTPL---TKAKTFYV 281
           +++G+FSYCLV         + S+ +  G +   +  G      V TPL    K   FY 
Sbjct: 253 SLSGRFSYCLVAHSFRADRLIRSSPLILGRSTDAAAIGASETDFVYTPLLHNPKHPYFYS 312

Query: 282 LTIDAISVGNQR------LGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ 331
           + ++A+SVG +R      LG    D    +V+DSGTT T LP    + +    +  + A 
Sbjct: 313 VALEAVSVGGKRIQAQPELGDVDRDGNGGMVVDSGTTFTMLPSDTFARVADEFARAMAAA 372

Query: 332 PVADPTGS-----LELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKVSED----IV 380
                 G+     L  CY ++ S   VP V +HFRG A V L R N+F+    +    + 
Sbjct: 373 RFTRAEGAEAQTGLAPCYHYSPSDRAVPPVALHFRGNATVALPRRNYFMGFKSEEGRSVG 432

Query: 381 CSVFKGITNS----------VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           C +   +  +              GN  Q  F V YD++   V F    CT 
Sbjct: 433 CLMLMNVGGNNDDGEDGGGPAGTLGNFQQQGFEVVYDVDAGRVGFARRRCTD 484


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 161/365 (44%), Gaps = 45/365 (12%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDP 137
           D   N   Y++  S+GTPP     V D  SD +W QC  C  + C         +P F  
Sbjct: 89  DPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYA 146

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTT 193
            +SST + + C++  C  L  ++CS  +  C YS  YG G+ +   G LA +     +  
Sbjct: 147 FLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-- 204

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSST 253
              V   G+ FGC     G       G++GLG G++SL+SQ++    G+FSY L P  + 
Sbjct: 205 ---VRADGVIFGCAVATEG----DIGGVIGLGRGELSLVSQLQI---GRFSYYLAPDDAV 254

Query: 254 KIN----FGTNGIVSGPGVVSTPLT---KAKTFYVLTIDAISVGNQRLGV--STPDIVID 304
            +     F  +        VSTPL     +++ Y + +  I V  + L +   T D+  D
Sbjct: 255 DVGSFILFLDDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQAD 314

Query: 305 --SGTTL------TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPE 354
              G  L      TFL  G    +   M+S I  +        L+LCY+  SL  ++VP 
Sbjct: 315 GSGGVVLSITIPVTFLDAGAYKVVRQAMASKIGLRAADGSELGLDLCYTSESLATAKVPS 374

Query: 355 VTIHFRGADV-KLSRSN-FFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 411
           + + F G  V +L   N F++  +  + C ++         + G+++Q    + YDI   
Sbjct: 375 MALVFAGGAVMELEMGNYFYMDSTTGLECLTILPSPAGDGSLLGSLIQVGTHMIYDISGS 434

Query: 412 TVSFK 416
            + F+
Sbjct: 435 RLVFE 439


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 43/353 (12%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
           IGTPP E   + DTGS + +  C  C   QC     P F P +S TY  + C+    C +
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59

Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
            N +      C Y   Y + S S+G L  + V+ G+ +   +      FGC     G LF
Sbjct: 60  ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111

Query: 215 NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGPG--V 268
           +    GI+GLG GD+S++ Q+  +  I   FS C       ++  G    G +S P   V
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSDMV 168

Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLL 321
            S        +Y + +  + V  ++L ++ P +       ++DSGTT  +LP+      +
Sbjct: 169 FSHSDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFI 227

Query: 322 SVMSSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNF 371
             ++S +   + +  P  +  ++C+S  + S++PE+   F   D+         LS  N+
Sbjct: 228 QAITSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286

Query: 372 FVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             K S+        VF+   +   + G I+  N LV YD E   V F  T+C+
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + IV        P+ +  ++ +Y      +  G + LGV   ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
           ++ T+        L+  +   +       P  SL LC+     F S+  V +    V + 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLS 341

Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
           F   + A +++   N+ +       C    GI N        + I G+I   + +V YD 
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398

Query: 409 EQQTVSFKPTDCTK 422
           E+  + +    C +
Sbjct: 399 ERGQIGWIRAPCDR 412


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 164/362 (45%), Gaps = 41/362 (11%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C    C     P F P +S TY+ + 
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTC--EHCGRHQDPKFQPDLSETYQPVK 143

Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+   C      +C G    C Y   Y + S S+G L  + V+ G+ +   +A     FG
Sbjct: 144 CTPD-C------NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLS--ELAPQRAVFG 194

Query: 206 CGTNN-GGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCL--VPVSSTKINFGTN 260
           C  +  G L++ +  GI+GLG GD+S++ Q+  +  I+  FS C   + V    +  G  
Sbjct: 195 CENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILG-- 252

Query: 261 GIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFL 312
           GI     +V T     ++ +Y + +  + V  ++L ++ P +       V+DSGTT  +L
Sbjct: 253 GISPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLN-PKVFDGKHGTVLDSGTTYAYL 311

Query: 313 PQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSL--SQV----PEVTIHFR-GAD 363
           P+  + +   ++M      + +  P  +  ++C++   +  SQ+    P V + F  G  
Sbjct: 312 PETAFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHK 371

Query: 364 VKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + LS  N+     KV       VF    +   + G I   N LV YD E   + F  T+C
Sbjct: 372 LSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNC 431

Query: 421 TK 422
           ++
Sbjct: 432 SE 433


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score =  104 bits (260), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 111/398 (27%), Positives = 172/398 (43%), Gaps = 61/398 (15%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259

Query: 234 QMRTTIAG--------KFSYCLVPVSSTKINFGTNGIVSGP----GVVSTPLTKAKTFYV 281
           Q+    AG         FSYCL P   TK  +   G         G  S   +  +  Y 
Sbjct: 260 QL----AGYPDILSYKAFSYCL-PTDETKPGYMILGRYDRAAMDGGYTSLFRSINRPTYS 314

Query: 282 LTIDAISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVM 324
           LT++ +    QRL  S+ ++++DSG        +T   L +         GY+    +  
Sbjct: 315 LTMEMLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQ 374

Query: 325 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSV 383
            S I      D +G       F++ S +P + I F  GA + L   N F       +C  
Sbjct: 375 ESYICYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMT 434

Query: 384 F-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           F +       I GN +  +F   +DI+ +   FK   C
Sbjct: 435 FAQNPALRSQILGNRVTRSFGTTFDIQGKQFGFKYAAC 472


>gi|242094480|ref|XP_002437730.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
 gi|241915953|gb|EER89097.1| hypothetical protein SORBIDRAFT_10g001450 [Sorghum bicolor]
          Length = 507

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 101/330 (30%), Positives = 140/330 (42%), Gaps = 34/330 (10%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSG 163
           V DT SD+ W QC P   S      S  +DP  SSTY +L C+S+ C  L    + +C  
Sbjct: 127 VLDTASDVPWVQCHPLASSATTDSSSSSYDPARSSTYYALACNSAACTELGRLYRGACVN 186

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG-------GLFNS 216
             CQY V       S+ +  T    L   T        ++F  G ++G       G  ++
Sbjct: 187 NQCQYRVPIPSSPASSSSSGTYGSDLLKLTADPADGASMSFKFGCSHGEAKQGGEGSIDN 246

Query: 217 KTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNGIVSGPGVVST 271
            T GI+ LGGG  SL+SQ        FSYC+    S +     +  G   +    G   T
Sbjct: 247 ATAGIMALGGGPESLVSQNAAMYGSAFSYCIPATESRRPGFFVLGGGVGDLSGAGGYAVT 306

Query: 272 PL---TKAKTFYVLTIDAISVGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSV 323
           P+    +  T Y + + AI+V  Q+L V TP +     V+DS T +T LP      L   
Sbjct: 307 PMLRYARVPTLYRVRLLAIAVDGQQLNV-TPSVFASGSVLDSRTAITRLPPTAYQALREA 365

Query: 324 MSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIV 380
             S +     A P G+L+ CY F    L  VP V +   G A V L R            
Sbjct: 366 FRSRMAMYREAPPQGNLDTCYDFAGAFLVMVPRVALLLDGNAVVALDRQGILFH-----D 420

Query: 381 CSVFKGITNS-VP-IYGNIMQTNFLVGYDI 408
           C VF   T+  +P I GN+ Q    V Y++
Sbjct: 421 CLVFTSNTDDRMPGILGNVQQQTMEVLYNV 450


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 159/374 (42%), Gaps = 44/374 (11%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + IV        P+ +  ++ +Y      +  G + LGV   ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 358
           ++ T+        L+  +   +       P  SL LC+     F S+  V +    V + 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLS 341

Query: 359 F---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 408
           F   + A +++   N+ +       C    GI N        + I G+I   + +V YD 
Sbjct: 342 FSNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDN 398

Query: 409 EQQTVSFKPTDCTK 422
           E+  + +    C +
Sbjct: 399 ERGQIGWIRAPCDR 412


>gi|125595873|gb|EAZ35653.1| hypothetical protein OsJ_19940 [Oryza sativa Japonica Group]
          Length = 468

 Score =  104 bits (260), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 92/340 (27%), Positives = 132/340 (38%), Gaps = 66/340 (19%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +          
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGR---------- 216

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
              YG             +       +      +         G F++ T+G + LGGG 
Sbjct: 217 ---YGRWLLQQPVPVLRRLRRRQGQPRGRTCHAVR--------GNFSASTSGTMSLGGGR 265

Query: 229 ISLISQMRTTIAGKFSYCLVPVSSTKI-------------NFGTNGIVSGPGVVSTPLTK 275
            SL+SQ   T    FSYC+   SS+                F    +V  P ++      
Sbjct: 266 QSLLSQTAATFGNAFSYCVPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSII------ 319

Query: 276 AKTFYVLTIDAISVGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEA 330
             T Y++ +  I VG +RL V         V+DS   +T L P  Y +  L+  S+M   
Sbjct: 320 -PTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 378

Query: 331 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT 388
             VA     L+ CY F   +   VP V++ F G  V          V  D +  + +G  
Sbjct: 379 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCL 428

Query: 389 NSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             VP          GN+ Q    V YD+   +V F+   C
Sbjct: 429 AFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 468


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 96/361 (26%), Positives = 160/361 (44%), Gaps = 50/361 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP  +     DTGS   W     C+ CP     ++    +DP+ S + K + 
Sbjct: 59  YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
           C  + C S  +  C+  + C Y   Y DG  + G L T+ +      G     P    +T
Sbjct: 119 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  N+      GI+G G  + + +SQ+    AGK    FS+CL   +   I 
Sbjct: 177 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 233

Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
           F    +V  P V +TP+ K  + ++++ + +I+V    L +         T    IDSG+
Sbjct: 234 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
           TL +LP+        + S +I A     P  ++   Y+F           + P++T HF 
Sbjct: 293 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 345

Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
             D+ L     ++ ++   +  C  F+  GI     + I G+++ +N +V YD+E+Q + 
Sbjct: 346 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 404

Query: 415 F 415
           +
Sbjct: 405 W 405


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 160/375 (42%), Gaps = 62/375 (16%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + IGTPP  +  V DTGS L W QC    P++     S  FDP +SST+ +LPC+  
Sbjct: 98  IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTAS--FDPSLSSTFSTLPCTHP 155

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L         C YS  Y DG+++ GNL  E  T      +++  P +  G
Sbjct: 156 VCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSLFTPPLILG 211

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
           C T      ++   GI+G+  G +S  SQ + T   KFSYC VP   T+  +   G   +
Sbjct: 212 CATE-----STDPRGILGMNRGRLSFASQSKIT---KFSYC-VPTRVTRPGYTPTGSFYL 262

Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVS----------TP 299
              P   +    +  TF             Y + +  I +G ++L +S          + 
Sbjct: 263 GHNPNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRADAGGSG 322

Query: 300 DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLS---QVPE 354
             ++DSG+  T+L  + Y+     V+ ++          G + ++C+  N++     + +
Sbjct: 323 QTMLDSGSEFTYLVNEAYDKVRAEVVRAVGPRMKKGYVYGGVADMCFDGNAIEIGRLIGD 382

Query: 355 VTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------IYGNIMQTNFLVGYD 407
           +   F +G  + + +      V   + C    GI NS        I GN  Q N  V +D
Sbjct: 383 MVFEFEKGVQIVVPKERVLATVEGGVHCI---GIANSDKLGAASNIIGNFHQQNLWVEFD 439

Query: 408 IEQQTVSFKPTDCTK 422
           +  + + F   DC++
Sbjct: 440 LVNRRMGFGTADCSR 454


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 157/353 (44%), Gaps = 43/353 (12%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
           IGTPP E   + DTGS + +  C  C   QC     P F P +S TY  + C+    C +
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59

Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
            N +      C Y   Y + S S+G L  + V+ G+ +   +      FGC     G LF
Sbjct: 60  ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111

Query: 215 NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTN--GIVSGPG--V 268
           +    GI+GLG GD+S++ Q+  +  I   FS C       ++  G    G +S P   V
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCY---GGMEVGGGAMVLGQISPPSDMV 168

Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLL 321
            S        +Y + +  + V  ++L ++ P +       ++DSGTT  +LP+      +
Sbjct: 169 FSHSDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFI 227

Query: 322 SVMSSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNF 371
             ++S +   + +  P  +  ++C+S  + S++PE+   F   D+         LS  N+
Sbjct: 228 QAITSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENY 286

Query: 372 FVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             K S+        VF+   +   + G I+  N LV YD E   V F  T+C+
Sbjct: 287 LFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/365 (27%), Positives = 168/365 (46%), Gaps = 40/365 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP E     DTGSD++W     C  CP +         FD   SST   + 
Sbjct: 66  YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125

Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV----- 197
           CS   C S  Q +   CS     C Y+  YGDGS ++G   ++T+   +  GQ++     
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185

Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSS 252
           AL  I FGC     G     +    GI G G G++S+ISQ+  R      FS+CL    S
Sbjct: 186 AL--IVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGS 243

Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDIVID 304
                   G +  PG+V +PL  ++  Y L + +I+V  Q L +        ++   ++D
Sbjct: 244 GG-GILVLGEILEPGIVYSPLVPSQPHYNLNLLSIAVNGQLLPIDPAAFATSNSQGTIVD 302

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFN-SLSQV-PEVTIHFR- 360
           SGTTL +L        +S +++++   P   P T     CY  + S+SQ+ P  + +F  
Sbjct: 303 SGTTLAYLVAEAYDPFVSAVNAIVS--PSVTPITSKGNQCYLVSTSVSQMFPLASFNFAG 360

Query: 361 GADVKLSRSNFFVKVSED----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 416
           GA + L   ++ +         + C  F+ +   V I G+++  + +  YD+ +Q + + 
Sbjct: 361 GASMVLKPEDYLIPFGSSGGSAMWCIGFQKV-QGVTILGDLVLKDKIFVYDLVRQRIGWA 419

Query: 417 PTDCT 421
             DC+
Sbjct: 420 NYDCS 424


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 164/365 (44%), Gaps = 45/365 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I+IG          D+GSDL W QC+  P + C      L+ P  ++    L C  
Sbjct: 55  YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C SL+      C   +  CQY + Y D   S G L  + V L  T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168

Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
           CG ++       +  T G++GLG G++S ISQ+ +   +     +CL         F  +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL--SDEGGFLFFGD 226

Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYN 317
             V   GV  T ++     ++Y      +  G +  G+    +V DSG++ T+   Q YN
Sbjct: 227 EFVPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYN 286

Query: 318 SNLLSVMSSMIEAQPVAD--PTGSLELCYS----FNSLSQVPE----VTIHF---RGADV 364
           S +L+++ + +  +P+ D     SL +C+     F SL  V +    + + F   + A +
Sbjct: 287 S-ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNAQI 345

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           +L   N+ +      VC    GI N        + I G+I   + +V YD E++ + + P
Sbjct: 346 QLPPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFP 402

Query: 418 TDCTK 422
           T+C K
Sbjct: 403 TNCNK 407


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 169/372 (45%), Gaps = 52/372 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+P  E     DTGSD++W     C  CP S     +   FD   SST   + 
Sbjct: 83  YFTKVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG 201
           C    C+   Q + S  +     C Y+  YGDGS + G   ++T+   +   GQ+V    
Sbjct: 143 CGDPICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANS 202

Query: 202 ---ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSST 253
              I FGC T   G     +    GI G G G +S+ISQ+  R      FS+CL      
Sbjct: 203 SSTIIFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL------ 256

Query: 254 KINFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST-------- 298
               G NG   +V G    P +V +PL  ++  Y L + +I+V  Q L + +        
Sbjct: 257 --KGGENGGGVLVLGEILEPSIVYSPLVPSQPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314

Query: 299 PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF-NSLSQV-PE 354
              ++DSGTTL +L Q  YN  + ++ +++ + ++P+         CY   NS+  + P+
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ---CYLVSNSVGDIFPQ 371

Query: 355 VTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           V+++F  GA + L+  ++ +         + C  F+ +     I G+++  + +  YD+ 
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYDLA 431

Query: 410 QQTVSFKPTDCT 421
            Q + +   DC+
Sbjct: 432 NQRIGWADYDCS 443


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 107/384 (27%), Positives = 166/384 (43%), Gaps = 42/384 (10%)

Query: 60  RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           RS+N   L  ++   S           +  +  +L+ +  G P      + DTGSD  W 
Sbjct: 96  RSINARILGQYSTEESKDGGSPESMHSLNEDGFFLVNVGFGKPQQNLNLIIDTGSDTTWI 155

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
           +C  C    C+ +  P F+P +SS+Y +  C  S                Y+++Y D S+
Sbjct: 156 RCNSCSLGNCHNKKIPTFNPSLSSSYSNRSCIPS------------TKTNYTMNYEDNSY 203

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD-ISLISQMR 236
           S G    + VTL     +    P   FG   ++GG      +G++GL  G+  SLISQ  
Sbjct: 204 SKGVFVCDEVTL-----KPDVFPKFQFG-CGDSGGGDFGSASGVLGLAQGEQYSLISQTA 257

Query: 237 TTIAGKFSYCLVPVSSTK--INFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQ 292
           +    KFSYC     +T+  + FG   I + P +  T L    + + Y + +  ISV  +
Sbjct: 258 SKFKKKFSYCFPHNENTRGSLLFGEKAISASPSLKFTRLLNPSSGSVYFVELIGISVAKK 317

Query: 293 RLGVS-----TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADP--TGSLELCY 344
           RL VS     +P  +IDSGT +T LP   Y +   +    M+    V+ P     L+ CY
Sbjct: 318 RLNVSSSLFASPGTIIDSGTVITHLPTAAYEALRTAFQQEMLHCPSVSPPPQEKPLDTCY 377

Query: 345 SFNSLS----QVPEVTIHFRG-ADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGN 396
           +         ++PE+ +HF G  DV L  S        +++  +    K   + V I GN
Sbjct: 378 NLKGCGGRNIKLPEIVLHFVGEVDVSLHPSGILWANGDLTQACLAFARKSHPSHVTIIGN 437

Query: 397 IMQTNFLVGYDIEQQTVSFKPTDC 420
             Q +  V YDIE   + F   DC
Sbjct: 438 RQQVSLKVVYDIEGGRLGFG-NDC 460


>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
          Length = 278

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 95/191 (49%), Gaps = 37/191 (19%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +   N
Sbjct: 35  FRVSLRHVDS------GGNYTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGN 87

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             +L++++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDPK SS++  LPC
Sbjct: 88  GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPC 145

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           SS    S  Q                     G LATET   G       ++  I FGCG 
Sbjct: 146 SSDLYYSSTQ---------------------GVLATETFAFGD-----ASVSKIGFGCGE 179

Query: 209 NNGGLFNSKTT 219
           +N G  NS TT
Sbjct: 180 DNDG--NSGTT 188


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 169/398 (42%), Gaps = 76/398 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--CPPSQCYMQDSPLFDPKMSSTYK 144
           +N +  + +++GTPP     V DTGS+L W  C     PP       +P F+   SS+Y 
Sbjct: 51  HNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPL------TPAFNASGSSSYG 104

Query: 145 SLPCSSSQCASLNQ--------KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           ++PC S+ C    +         +     C+ S+SY D S ++G LAT+T  L  T G  
Sbjct: 105 AVPCPSTACEWRGRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLL--TGGAP 162

Query: 197 VALPGITFGC--------GTNNGGL---FNSKTTGIVGLGGGDISLISQMRTTIAGKFSY 245
               G  FGC         TN+ G     +   TG++G+  G +S ++Q  T    +F+Y
Sbjct: 163 PVAVGAYFGCITSYSSTTATNSNGTGTDVSEAATGLLGMNRGTLSFVTQTGTR---RFAY 219

Query: 246 CLVPVSSTKI-NFGTNGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL-- 294
           C+ P     +   G +G V+ P +  TPL +         +  Y + ++ I VG   L  
Sbjct: 220 CIAPGEGPGVLLLGDDGGVA-PPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPI 278

Query: 295 --GVSTPD------IVIDSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADP----TGSL 340
              V TPD       ++DSGT  TFL     + L +  +S       P+ +P     G+ 
Sbjct: 279 PKSVLTPDHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQARLLLAPLGEPGFVFQGAF 338

Query: 341 ELCYS------FNSLSQVPEVTIHFRGADVKLSRSNFFVKV---------SEDIVCSVFK 385
           + C+         +   +PEV +  RGA+V +S       V         +E + C  F 
Sbjct: 339 DACFRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFG 398

Query: 386 G---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                  S  + G+  Q N  V YD++   V F P  C
Sbjct: 399 NSDMAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 106/416 (25%), Positives = 170/416 (40%), Gaps = 77/416 (18%)

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIR------------ISIGTPPTERLAVADTG 111
           RL     +SS  +S  S+ +  P ++ Y  R            + IGTP   +  V DTG
Sbjct: 41  RLTPTTNSSSFKTSLLSRRNPSPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTG 100

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGVN 165
           S L W QC P    +     +  FDP +SS++  LPCS   C       +L     S   
Sbjct: 101 SQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL 160

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C YS  Y DG+F+ GNL  E  T  ++       P +  GC        ++   GI+G+ 
Sbjct: 161 CHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILGCAKE-----STDEKGILGMN 211

Query: 226 GGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---IVSGPGVVSTPLTKAKTF--- 279
            G +S ISQ + +   KFSYC +P  S +    + G   +   P           TF   
Sbjct: 212 LGRLSFISQAKIS---KFSYC-IPTRSNRPGLASTGSFYLGDNPNSRGFKYVSLLTFPQS 267

Query: 280 ----------YVLTIDAISVGNQRLGVS----TPD------IVIDSGTTLTFLPQ-GYN- 317
                     Y + +  I +G +RL +      PD       ++DSG+  T L    Y+ 
Sbjct: 268 QRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFRPDAGGSGQTMVDSGSEFTHLVDVAYDK 327

Query: 318 --SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHF-RGADVKLSRSN 370
               ++ ++ S ++   V   T   ++C+  N   +    + ++   F RG ++ + + +
Sbjct: 328 VKEEIVRLVGSRLKKGYVYGSTA--DMCFDGNHSMEIGRLIGDLVFEFGRGVEILVEKQS 385

Query: 371 FFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             V V   I C      S+    +N   I GN+ Q N  V +D+  + V F   +C
Sbjct: 386 LLVNVGGGIHCVGIGRSSMLGAASN---IIGNVHQQNLWVEFDVTNRRVGFSKAEC 438


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 119/441 (26%), Positives = 186/441 (42%), Gaps = 85/441 (19%)

Query: 45  NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTE 103
           +SS  P+  L+ A++ S+ R +H   +     +K+ +  + P     Y I +  GTP   
Sbjct: 42  SSSSHPFHTLKLAVSTSITRAHHLKNHKP---NKSLETPVHPKTYGGYSIDLEFGTPSQT 98

Query: 104 RLAVADTGSDLIWTQCEP---CPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASL--- 156
              V DTGS L+W  C     C  S+C    ++P F PK SS+ K + C++ +CA +   
Sbjct: 99  FPFVLDTGSTLVWLPCSSHYLC--SKCNSFSNTPKFIPKNSSSSKFVGCTNPKCAWVFGP 156

Query: 157 -------NQKSCSGVNCQ-----YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
                   Q   +  NC      Y+V YG GS + G L +E +   +       L     
Sbjct: 157 DVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPTKKYSDFLL----- 210

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCL---------------VP 249
           GC      +   +  GI G G G+ SL SQM  T   +FSYCL               V 
Sbjct: 211 GCSV----VSVYQPAGIAGFGRGEESLPSQMNLT---RFSYCLLSHQFDDSATITSNLVL 263

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAK----TFYVLTIDAISVGNQRLGVS----TPDI 301
            +++  +  TNG+   P  +  P TK       +Y +T+  I VG +R+ V      P++
Sbjct: 264 ETASSRDGKTNGVSYTP-FLKNPTTKKNPAFGAYYYITLKRIVVGEKRVRVPRRLLEPNV 322

Query: 302 ------VIDSGTTLTFLPQ---GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 352
                 ++DSG+T TF+ +      +   +   S   A+      G L  C+     ++ 
Sbjct: 323 DGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVSYTRAREAEKQFG-LSPCFVLAGGAET 381

Query: 353 ---PEVTIHFRG-ADVKLSRSNFFVKVSE-DIVCSVF--------KGITNSVPIYGNIMQ 399
              PE+   FRG A ++L  +N+F  V + D+ C            G      I GN  Q
Sbjct: 382 ASFPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIVSDDVAGSGGTVGPAVILGNYQQ 441

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            NF V YD+E +   F+   C
Sbjct: 442 QNFYVEYDLENERFGFRSQSC 462


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 175/394 (44%), Gaps = 53/394 (13%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 259

Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
           Q+      ++ K  SYCL P   TK  +   G      +    TPL ++  +  Y LT++
Sbjct: 260 QLAGYPDILSYKALSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 318

Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
            +    QRL  S+ ++++DSG        +T   L +         GY+    +   S I
Sbjct: 319 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 378

Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
                 D +G       F++ S +P + I F  GA + L   N F       +C  F + 
Sbjct: 379 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 438

Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 I GN +  +F   +DI+ +   FK   C
Sbjct: 439 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 472


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score =  104 bits (259), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 175/394 (44%), Gaps = 53/394 (13%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 91  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           G ++S G + T+T+ +G +         + FGC  +    ++    GI G G    S   
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMDVK--YSEFEAGIFGFGSSSFSFFE 261

Query: 234 QMR---TTIAGK-FSYCLVPVSSTKINFGTNGIVSGPGVVS--TPLTKA--KTFYVLTID 285
           Q+      ++ K  SYCL P   TK  +   G      +    TPL ++  +  Y LT++
Sbjct: 262 QLAGYPDILSYKALSYCL-PTDETKPGYMILGRYDRAAMDGGYTPLFRSINRPTYSLTME 320

Query: 286 AISVGNQRLGVSTPDIVIDSG--------TTLTFLPQ---------GYNSNLLSVMSSMI 328
            +    QRL  S+ ++++DSG        +T   L +         GY+    +   S I
Sbjct: 321 MLIANGQRLVTSSSEMIVDSGAQRTSLWPSTFALLDKTITQAMSSIGYHRTSRARQESYI 380

Query: 329 EAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF-KG 386
                 D +G       F++ S +P + I F  GA + L   N F       +C  F + 
Sbjct: 381 CYLSEHDYSGWNGTITPFSNWSALPLLEIGFAGGAALALPPRNVFYNDPHRGLCMTFAQN 440

Query: 387 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
                 I GN +  +F   +DI+ +   FK   C
Sbjct: 441 PALRSQILGNRVTRSFGTTFDIQGKQFGFKYAVC 474


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/361 (26%), Positives = 160/361 (44%), Gaps = 50/361 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP  +     DTGS   W     C+ CP     ++    +DP+ S + K + 
Sbjct: 83  YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 142

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
           C  + C S  +  C+  + C Y   Y DG  + G L T+ +      G     P    +T
Sbjct: 143 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 200

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  N+      GI+G G  + + +SQ+    AGK    FS+CL   +   I 
Sbjct: 201 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 257

Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
           F    +V  P V +TP+ K  + ++++ + +I+V    L +         T    IDSG+
Sbjct: 258 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 316

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
           TL +LP+        + S +I A     P  ++   Y+F           + P++T HF 
Sbjct: 317 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 369

Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
             D+ L     ++ ++   +  C  F+  GI     + I G+++ +N +V YD+E+Q + 
Sbjct: 370 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 428

Query: 415 F 415
           +
Sbjct: 429 W 429


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 102/378 (26%), Positives = 164/378 (43%), Gaps = 72/378 (19%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           +I + IGTPP  +  V DTGS L W QC +  PP+         FDP +SST+  LPC+ 
Sbjct: 76  IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-------FDPSLSSTFSILPCTH 128

Query: 151 SQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             C       +L         C YS  Y DG+++ GNL  E  T      ++V+ P +  
Sbjct: 129 PLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSVSTPPLIL 184

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG--- 261
           GC T      ++   GI+G+  G +S   Q + T   KFSYC VP   T+  F   G   
Sbjct: 185 GCATE-----STDPRGILGMNLGRLSFAKQSKIT---KFSYC-VPPRQTRPGFTPTGSFY 235

Query: 262 IVSGP--------GVVSTPLTKAKTF----YVLTIDAISVGNQRLGVS----------TP 299
           + + P        G++++   +   F    Y + +  I +  ++L +S          + 
Sbjct: 236 LGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSG 295

Query: 300 DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQVP---- 353
             +IDSG+  T+L  + Y+     V+ ++          G + ++C  F+S+  V     
Sbjct: 296 QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMC--FDSVKAVEIGRL 353

Query: 354 --EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP------IYGNIMQTNFLV 404
             E+   F RG +V + +      V   + C    GI +S        I GN  Q N  V
Sbjct: 354 IGEMVFEFERGVEVVIPKERVLADVGGGVHCV---GIGSSDKLGAASNIIGNFHQQNLWV 410

Query: 405 GYDIEQQTVSFKPTDCTK 422
            +D+ ++ V F   DC++
Sbjct: 411 EFDLVRRRVGFGKADCSR 428


>gi|356537161|ref|XP_003537098.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 601

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 123/451 (27%), Positives = 188/451 (41%), Gaps = 97/451 (21%)

Query: 46  SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTER 104
           S+  P+  L+ A++ S+ R +H   +++ SS K     + P     Y I +  GTPP   
Sbjct: 173 SNSHPFHTLQLAVSTSITRAHHLKNHNNPSSLKTL---VHPKTYGGYSIDLKFGTPPQTF 229

Query: 105 LAVADTGSDLIWTQCEP---CPPSQCYM---QDSPLFDPKMSSTYKSLPCSSSQCASL-- 156
             V DTGS L+W  C     C  S+C      ++P F PK S + K + C + +CA +  
Sbjct: 230 PFVLDTGSSLVWLPCYSHYLC--SKCNSFSNNNTPKFIPKDSFSSKFVGCRNPKCAWVFG 287

Query: 157 ----------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
                           N  +CS     Y+V YG GS + G L +E +        A  + 
Sbjct: 288 SDVTSHCCKLAKAAFSNNNNCSQTCPAYTVQYGLGS-TAGFLLSENLNF-----PAKNVS 341

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSST 253
               GC      +   +  GI G G G+ SL +QM  T   +FSYCL+       P +S 
Sbjct: 342 DFLVGCSV----VSVYQPGGIAGFGRGEESLPAQMNLT---RFSYCLLSHQFDESPENSD 394

Query: 254 KINFGTN-------GIVSGPGVVSTPLTKAKTF---YVLTIDAISVGNQRLGVS----TP 299
            +   TN         VS    +  P TK   F   Y +T+  I VG +R+ V      P
Sbjct: 395 LVMEATNSGEGKKTNGVSYTAFLKNPSTKKPAFGAYYYITLRKIVVGEKRVRVPRRMLEP 454

Query: 300 DI------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN------ 347
           D+      ++DSG+TLTF+ +     +  +++     Q   + T + EL   F       
Sbjct: 455 DVNGDGGFIVDSGSTLTFMERP----IFDLVAEEFVKQ--VNYTRARELEKQFGLSPCFV 508

Query: 348 -----SLSQVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVF--------KGITNSVP 392
                  +  PE+   FR GA ++L  +N+F +V + D+ C            G      
Sbjct: 509 LAGGAETASFPEMRFEFRGGAKMRLPVANYFSRVGKGDVACLTIVSDDVAGQGGAVGPAV 568

Query: 393 IYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 423
           I GN  Q NF V  D+E +   F+   C K+
Sbjct: 569 ILGNYQQQNFYVECDLENERFGFRSQSCQKR 599


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 163/369 (44%), Gaps = 50/369 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP  +     DTGS   W     C+ CP     ++    +DP+ S + K + 
Sbjct: 59  YYTDIGIGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVK 118

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GIT 203
           C  + C S  +  C+  + C Y   Y DG  + G L T+ +      G     P    +T
Sbjct: 119 CDDTICTS--RPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVT 176

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTTIAGK----FSYCLVPVSSTKIN 256
           FGCG    G  N+      GI+G G  + + +SQ+    AGK    FS+CL   +   I 
Sbjct: 177 FGCGLQQSGSLNNSAVAIDGIIGFGNSNQTALSQLAA--AGKTKKIFSHCLDSTNGGGI- 233

Query: 257 FGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGV--------STPDIVIDSGT 307
           F    +V  P V +TP+ K  + ++++ + +I+V    L +         T    IDSG+
Sbjct: 234 FAIGEVVE-PKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKTKGTFIDSGS 292

Query: 308 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-------SQVPEVTIHFR 360
           TL +LP+        + S +I A     P  ++   Y+F           + P++T HF 
Sbjct: 293 TLVYLPE-------IIYSELILAVFAKHPDITMGAMYNFQCFHFLGSVDDKFPKITFHFE 345

Query: 361 GADVKLSR--SNFFVKVSEDIVCSVFK--GIT--NSVPIYGNIMQTNFLVGYDIEQQTVS 414
             D+ L     ++ ++   +  C  F+  GI     + I G+++ +N +V YD+E+Q + 
Sbjct: 346 N-DLTLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMIILGDMVISNKVVVYDMEKQAIG 404

Query: 415 FKPTDCTKQ 423
           +   +  ++
Sbjct: 405 WTEHNSVEE 413


>gi|218191589|gb|EEC74016.1| hypothetical protein OsI_08957 [Oryza sativa Indica Group]
          Length = 520

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 100/388 (25%), Positives = 158/388 (40%), Gaps = 25/388 (6%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
           +R  L R   R+    Q  S+S   +    I P+  +    Y   + +GTP T  L   D
Sbjct: 65  VRSDLQRQKRRVGGKYQLLSLSQGGS----IFPSGNDLGWLYYTWVDVGTPNTSFLVALD 120

Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
           TGSDL W  C+   C P   Y     +D  ++ P  S+T + LPCS   C+  +  +   
Sbjct: 121 TGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSRHLPCSHELCSPASGCTNPK 180

Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
             C Y++ Y  + + S+G L  + + L S  G A     +  GCG    G  L      G
Sbjct: 181 QPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNASVIIGCGKKQSGSYLEGIAPDG 240

Query: 221 IVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKT 278
           ++GLG  DIS+ S +     +   FS C     S +I FG  G+ +       P+     
Sbjct: 241 LLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGDQGVPTQQSTPFVPMNGKLQ 300

Query: 279 FYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 338
            Y + +D   +G++    +    ++D+GT+ T LP     ++       I A   +    
Sbjct: 301 TYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKSITMEFDKQINASRASSDDY 360

Query: 339 SLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 396
           S E CYS   L    VP +T+ F       + +            +VF       P    
Sbjct: 361 SFEYCYSTGPLEMPDVPTITLTFAENKSFQAVNPILPFNDRQGEFAVFCLAVLPSPEPVG 420

Query: 397 IMQTNFLVGY----DIEQQTVSFKPTDC 420
           I+  NF+VGY    D E   + +  ++C
Sbjct: 421 IIGQNFMVGYHVVFDRENMKLGWYRSEC 448


>gi|115448709|ref|NP_001048134.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|46390211|dbj|BAD15642.1| aspartyl protease-like [Oryza sativa Japonica Group]
 gi|113537665|dbj|BAF10048.1| Os02g0751100 [Oryza sativa Japonica Group]
 gi|222623681|gb|EEE57813.1| hypothetical protein OsJ_08401 [Oryza sativa Japonica Group]
          Length = 520

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 90/350 (25%), Positives = 147/350 (42%), Gaps = 23/350 (6%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYK 144
           Y   + +GTP T  L   DTGSDL W  C+   C P   Y     +D  ++ P  S+T +
Sbjct: 102 YYTWVDVGTPNTSFLVALDTGSDLFWVPCDCIQCAPLSSYHGSLDRDLGIYKPSESTTSR 161

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C+  +  +     C Y++ Y  + + S+G L  + + L S  G A     + 
Sbjct: 162 HLPCSHELCSPASGCTNPKQPCPYNIDYFSENTTSSGLLIEDMLHLDSREGHAPVNASVI 221

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG    G  L      G++GLG  DIS+ S +     +   FS C     S +I FG 
Sbjct: 222 IGCGKKQSGSYLEGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKKDDSGRIFFGD 281

Query: 260 NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 319
            G+ +       P+      Y + +D   +G++    +    ++D+GT+ T LP     +
Sbjct: 282 QGVPTQQSTPFVPMNGKLQTYAVNVDKYCIGHKCTEGAGFQALVDTGTSFTSLPLDAYKS 341

Query: 320 LLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSE 377
           +       I A   +    S E CYS   L    VP +T+ F   +      N  +  ++
Sbjct: 342 ITMEFDKQINASRASSDDYSFEYCYSTGPLEMPDVPTITLTF-AENKSFQAVNPILPFND 400

Query: 378 ---DIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
              +        + +  P+   I+  NF+VGY    D E   + +  ++C
Sbjct: 401 RQGEFAVFCLAVLPSPEPV--GIIGQNFMVGYHVVFDRENMKLGWYRSEC 448


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/381 (24%), Positives = 164/381 (43%), Gaps = 42/381 (11%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A+ + + P + N      Y + I+IG PP       DTGSDL W QC+  P   C   
Sbjct: 37  TRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEA 95

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    N +  +   C Y V Y DG  S G L  + 
Sbjct: 96  PHPLYQP----SNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
            +L  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +   +   
Sbjct: 152 FSLNYTKGLRLT-PRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 210

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
             +CL  +    + FG N +     V  TP+ +  +K +       +  G +  G+    
Sbjct: 211 VGHCLSSLGGGILFFG-NDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLL 269

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
            V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  
Sbjct: 270 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 329

Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
           +F+   +       S++ F +     ++ S    V  GI N   I        G+I   +
Sbjct: 330 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 389

Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
            ++ YD E+Q++ + P DC +
Sbjct: 390 QMIIYDNEKQSIGWIPADCDE 410


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 135/280 (48%), Gaps = 27/280 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ P   +Y + ++IG P        DTGSDL W QC+ PC    C     PL+ P  
Sbjct: 45  QGNVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTA 101

Query: 140 SSTYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           +S    +PC+++ C +L      N K  S   C Y + Y D + S G L  +  +L   +
Sbjct: 102 NSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRS 158

Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCL 247
                 PG+TFGCG +      G   + T G++GLG G +SL+SQ++     K    +CL
Sbjct: 159 SN--IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216

Query: 248 VPVSSTKINFGTNGIVSGPGVVSTPLTK-AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                  + FG + IV    V   P+ K +  +Y      +    + LGV   ++V DSG
Sbjct: 217 STNGGGFLFFGDD-IVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSG 275

Query: 307 TTLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY 344
           +T T F  Q Y + + ++ S + ++ + V+DP  SL LC+
Sbjct: 276 STYTYFTAQPYQAVVSALKSGLSKSLKQVSDP--SLPLCW 313


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 131/462 (28%), Positives = 188/462 (40%), Gaps = 94/462 (20%)

Query: 39  PKSPFYNSSETP---YQRLRDALTRSLNRLNHFNQNSSI-------SSSKASQADII--P 86
           P SPF +S ++P   Y  LR     S+ R +     +SI       SS+  + A ++  P
Sbjct: 22  PLSPFSHSDQSPKDPYLSLRRLAESSIARAHKLKHGTSIKPDEDALSSTTTASATVVKSP 81

Query: 87  NNAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEP------CPPSQCYMQDSPLFD 136
            +A     Y + +S GTP      V DTGS L+W  C        C  S       P F 
Sbjct: 82  LSAKSYGGYSVSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFI 141

Query: 137 PKMSSTYKSLPCSSSQCASL------------NQKSCSGVNC-QYSVSYGDGSFSNGNLA 183
           PK SS+ K + C S +C  L            N ++C+ V C  Y + YG GS + G L 
Sbjct: 142 PKNSSSSKIIGCQSPKCQFLYGPNVQCRGCDPNTRNCT-VGCPPYILQYGLGS-TAGVLI 199

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           TE +     T     +P    GC      +   +  GI G G G +SL SQM      +F
Sbjct: 200 TEKLDFPDLT-----VPDFVVGCSI----ISTRQPAGIAGFGRGPVSLPSQMNLK---RF 247

Query: 244 SYCLVPVSSTKINFGTN-------GIVSG---PGVVSTPLTKAKT--------FYVLTID 285
           S+CLV       N  T+       G  SG   PG+  TP  K           +Y L + 
Sbjct: 248 SHCLVSRRFDDTNVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLR 307

Query: 286 AISVGNQRLGVSTPDI----------VIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQ 331
            I VG + + +    +          ++DSG+T TF+ +           S MS+    +
Sbjct: 308 RIYVGRKHVKIPYKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREK 367

Query: 332 PVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVF--- 384
            +   TG L  C++ +      VPE+   F+ GA ++L  SN+F  V   D VC      
Sbjct: 368 DLEKETG-LGPCFNISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSD 426

Query: 385 -----KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                 G T    I G+  Q N+LV YD+E     F    C+
Sbjct: 427 KTVNPSGGTGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 40  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 99  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +   +   
Sbjct: 155 FSMNYTKGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
             +CL  +    + FG + +     V  TP+++  +K +       +  G +  G+    
Sbjct: 214 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 272

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
            V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  
Sbjct: 273 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 332

Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
           +F+   +       S++ F +     ++ S    V  GI N   I        G+I   +
Sbjct: 333 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 392

Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
            ++ YD E+Q++ + P DC +
Sbjct: 393 QMIIYDNEKQSIGWMPADCDE 413


>gi|356540510|ref|XP_003538731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 417

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 108/426 (25%), Positives = 172/426 (40%), Gaps = 105/426 (24%)

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQD---- 131
           S + +  I    ++Y +  ++G+ P++ + +  DTGSDL+W    PC P +C + +    
Sbjct: 5   SPSRRQPISNRESDYTLSFNLGSHPSQSITLYMDTGSDLVWF---PCAPFECILCEGKFN 61

Query: 132 ----------------SPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQ-YSVSY 172
                           SP      SS      C+ ++C   ++    CS   C  +  +Y
Sbjct: 62  ATKPLNITRSHRVSCQSPACSTAHSSVSSHDLCAIARCPLDNIETSDCSSATCPPFYYAY 121

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           GDGSF   +L  +T+++       + L   TFGC         ++ TG+ G G G +SL 
Sbjct: 122 GDGSFI-AHLHRDTLSMSQ-----LFLKNFTFGCAHTA----LAEPTGVAGFGRGLLSLP 171

Query: 233 SQMRT---TIAGKFSYCLV---------------------PVSSTKINFGTNGIVSGPGV 268
           +Q+ T    +  +FSYCLV                       SS ++ F    ++  P  
Sbjct: 172 AQLATLSPNLGNRFSYCLVSHSFDKERVRKPSPLILGHYDDYSSERVEFVYTSMLRNP-- 229

Query: 269 VSTPLTKAKTFYVLTIDAISVGNQRLGVSTPD------------IVIDSGTTLTFLPQGY 316
                 K   FY + +  ISVG +   +  P+            +V+DSGTT T LP   
Sbjct: 230 ------KHSYFYCVGLTGISVGKRT--ILAPEMLRRVDRRGDGGVVVDSGTTFTMLPASL 281

Query: 317 NSNLLSVMSSMI-----EAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG--ADVKLSRS 369
            +++++     +      A  V + TG L  CY    L +VP VT HF G  ++V L R 
Sbjct: 282 YNSVVAEFDRRVGRVHKRASEVEEKTG-LGPCYFLEGLVEVPTVTWHFLGNNSNVMLPRM 340

Query: 370 NFFVK-------VSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSF 415
           N+F +           + C +     +          I GN  Q  F V YD+E Q V F
Sbjct: 341 NYFYEFLDGEDEARRKVGCLMLMNGGDDTELSGGPGAILGNYQQQGFEVVYDLENQRVGF 400

Query: 416 KPTDCT 421
               C 
Sbjct: 401 AKRQCA 406


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 28  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 86

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 87  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 142

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +   +   
Sbjct: 143 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 201

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
             +CL  +    + FG + +     V  TP+++  +K +       +  G +  G+    
Sbjct: 202 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 260

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
            V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  
Sbjct: 261 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 320

Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
           +F+   +       S++ F +     ++ S    V  GI N   I        G+I   +
Sbjct: 321 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 380

Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
            ++ YD E+Q++ + P DC +
Sbjct: 381 QMIIYDNEKQSIGWMPVDCDE 401


>gi|356513737|ref|XP_003525567.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Glycine
           max]
          Length = 455

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 120/446 (26%), Positives = 195/446 (43%), Gaps = 103/446 (23%)

Query: 62  LNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVA---DTGSDLI 115
            N  +H  +++S  S+K  +  +   +   ++Y +  ++G P  +   +    DTGSDL+
Sbjct: 16  FNNTHHLLKSTSTLSAKRFRRQLSLPLSPGSDYTLSFNLG-PRAQAQPITLYMDTGSDLV 74

Query: 116 WTQCEPCPPSQCYM-QDSPLFDPKMSSTY------KSLPCSSSQ--------CA------ 154
           W    PC P +C + +  P   P +++T       KS  CS++         CA      
Sbjct: 75  WF---PCAPFKCILCEGKPNASPPVNTTRSVAVSCKSPACSAAHNLASPSDLCAAARCPL 131

Query: 155 -SLNQKSCSGVNCQ-YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
            S+    C+   C  +  +YGDGS     L  +T++L S     + L   TFGC      
Sbjct: 132 ESIETSDCANFKCPPFYYAYGDGSLI-ARLYRDTLSLSS-----LFLRNFTFGCAYTT-- 183

Query: 213 LFNSKTTGIVGLGGGDISLISQMRT---TIAGKFSYCLVPVS--STKINFGTNGIVS--- 264
              ++ TG+ G G G +SL +Q+ T    +  +FSYCLV  S  S ++   +  I+    
Sbjct: 184 --LAEPTGVAGFGRGLLSLPAQLATLSPQLGNRFSYCLVSHSFDSERVRKPSPLILGRYE 241

Query: 265 --------GPGV---VSTPLT---KAKTFYVLTIDAISVGNQRLGVSTPD---------- 300
                   G GV   V TP+    K   FY + +  ISVG +R+ V  P+          
Sbjct: 242 EEEEEEKVGGGVAEFVYTPMLENPKHPYFYTVGLIGISVG-KRI-VPAPEMLRRVNNRGD 299

Query: 301 --IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSFNSLSQVP 353
             +V+DSGTT T LP G+ ++++      +      A+ + + TG L  CY  NS+++VP
Sbjct: 300 GGVVVDSGTTFTMLPAGFYNSVVDEFDRGVGRVNERARKIEEKTG-LAPCYYLNSVAEVP 358

Query: 354 EVTIHFRGAD--VKLSRSNFF---------VKVSEDIVCSVFKGITNSVPI-------YG 395
            +T+ F G +  V L R N+F          K    + C +     +   +        G
Sbjct: 359 VLTLRFAGGNSSVVLPRKNYFYEFLDGRDAAKGKRRVGCLMLMNGGDEAELSGGPGATLG 418

Query: 396 NIMQTNFLVGYDIEQQTVSFKPTDCT 421
           N  Q  F V YD+E++ V F    C 
Sbjct: 419 NYQQQGFEVEYDLEEKRVGFARRQCA 444


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 168/372 (45%), Gaps = 41/372 (11%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC    S          
Sbjct: 84  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 141

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           P F P +SSTY  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  
Sbjct: 142 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 196

Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
           +   +      FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+  FS C   
Sbjct: 197 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 254

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIV 302
           +          G+ + P +V +     ++ +Y + +  I V  + L +      S    V
Sbjct: 255 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 314

Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PE 354
           +DSGTT  +LP Q + +   +V + +   + +  P  +  ++C++      + LS+V P+
Sbjct: 315 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 374

Query: 355 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434

Query: 411 QTVSFKPTDCTK 422
           + + F  T+C++
Sbjct: 435 EKIGFWKTNCSE 446


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/372 (26%), Positives = 168/372 (45%), Gaps = 41/372 (11%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC    S          
Sbjct: 83  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 140

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           P F P +SSTY  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  
Sbjct: 141 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 195

Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVP 249
           +   +      FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+  FS C   
Sbjct: 196 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 253

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGV------STPDIV 302
           +          G+ + P +V +     ++ +Y + +  I V  + L +      S    V
Sbjct: 254 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 313

Query: 303 IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PE 354
           +DSGTT  +LP Q + +   +V + +   + +  P  +  ++C++      + LS+V P+
Sbjct: 314 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 373

Query: 355 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 410
           V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433

Query: 411 QTVSFKPTDCTK 422
           + + F  T+C++
Sbjct: 434 EKIGFWKTNCSE 445


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 108/398 (27%), Positives = 170/398 (42%), Gaps = 73/398 (18%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQDS-------------- 132
           YLI ++IGTPP     + DTGSDL W  C      C     Y  +               
Sbjct: 82  YLISLNIGTPPQVIQVLMDTGSDLTWVPCGNLSFDCMECDDYRNNKLMATFSPSYSSSSY 141

Query: 133 ------PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
                 P      SS      C+ + C  ++L + +CS     ++ +YG G    G L  
Sbjct: 142 RASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGILTR 201

Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           +T+ + GS+ G A  +P   FGC     G    +  GI G G G +S++SQ+     G F
Sbjct: 202 DTLRVNGSSPGVAKEIPKFCFGC----VGSAYREPIGIAGFGRGTLSMVSQLGFLQKG-F 256

Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTKA---KTFYVLTIDAISVGN-- 291
           S+C +       P  S+ +  G   + S   +  TP+  +     FY + ++AI+VGN  
Sbjct: 257 SHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAITVGNVS 316

Query: 292 ---------QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSL 340
                    +   +    + IDSGTT T LP+ + S +LS++ S I        +     
Sbjct: 317 ATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGMEMQTGF 376

Query: 341 ELCYSF-----NSLSQ---VPEVTIHF-RGADVKLSRSNFFVKVSED-----IVCSVFK- 385
           +LCY       N+L+    +P +T HF     + L + N F  VS       + C +F+ 
Sbjct: 377 DLCYKVPRPNNNTLTSDDLLPSITFHFLNNVSLVLPQGNHFYPVSAPGNPAVVKCLMFQS 436

Query: 386 ---GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              G      ++G+  Q N  V YD+E++ + F+P DC
Sbjct: 437 TDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDC 474


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 166/381 (43%), Gaps = 42/381 (11%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 40  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 99  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGK 242
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +   +   
Sbjct: 155 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213

Query: 243 FSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPD 300
             +CL  +    + FG + +     V  TP+++  +K +       +  G +  G+    
Sbjct: 214 IGHCLSSLGGGILFFGDD-LYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLL 272

Query: 301 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTI 357
            V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  
Sbjct: 273 TVFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKK 332

Query: 358 HFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTN 401
           +F+   +       S++ F +     ++ S    V  GI N   I        G+I   +
Sbjct: 333 YFKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQD 392

Query: 402 FLVGYDIEQQTVSFKPTDCTK 422
            ++ YD E+Q++ + P DC +
Sbjct: 393 QMIIYDNEKQSIGWMPVDCDE 413


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/376 (26%), Positives = 170/376 (45%), Gaps = 51/376 (13%)

Query: 86  PNNANYLIRISIGTP--PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           P    Y + ++IGT    +    V DT S L W +C  C P Q   Q SP+FDP  SS+Y
Sbjct: 69  PLEYTYGVAVTIGTGRGKSTYFLVLDTASSLPWMRCAHCLPVQ--RQRSPVFDPSDSSSY 126

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + L  +S  C + N    +G  C + +  G+   ++G + T+T+ LG+ T   + +  + 
Sbjct: 127 RPLHPTSPLCRAPNPVLPAGDKCSFHLP-GE---AHGYVGTDTIILGNPT---LPIHSVA 179

Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTTIAGKFSYCLV-----PVSSTKIN 256
           FGC  +  G F++K T  G +G+G    SLI Q++  +  +FSYCL+     P  +  I 
Sbjct: 180 FGCAQSTEG-FDTKGTFAGTLGMGKLPTSLIMQIKDRVGSRFSYCLIGLGHSPGRNGFIR 238

Query: 257 FGT----------NGIVSGPGVVSTPLTKAKTFYVLTIDAISVGN-----------QRLG 295
           FG           + I   P     P   A + Y + +  IS+             +R  
Sbjct: 239 FGADIPDPTLLVHHRIKILPTPPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRS 298

Query: 296 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQV 352
             +    +D+GT +T L     + +   ++ M++    + V DP  SL         S +
Sbjct: 299 DGSGGCFVDAGTQVTHLVPAAYAVVEEAVAHMVQQWGYKRVRDPNFSLCFREHPGIWSHI 358

Query: 353 PEVTIHFRG------ADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 404
           P++T+ F G      A +++   N F+KV ++ +VC  V++    S  + G + Q +   
Sbjct: 359 PKLTLDFEGPASRTVAHLEIVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRF 418

Query: 405 GYDIEQQTVSFKPTDC 420
            +D+   T++F    C
Sbjct: 419 IFDLHANTITFHRESC 434


>gi|125564663|gb|EAZ10043.1| hypothetical protein OsI_32347 [Oryza sativa Indica Group]
          Length = 330

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/347 (29%), Positives = 161/347 (46%), Gaps = 63/347 (18%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
           V DT SDL+WTQC+PC    C  Q   ++DP  + TY +L  S+                
Sbjct: 6   VFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSN---------------- 47

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
            Y+ +Y   SF++G  ATET  LG+ T     +  ITFGCGT N G +++    + G+G 
Sbjct: 48  -YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQGYYDNVAG-VFGVGR 100

Query: 227 GDISLISQMRTTIAGKFSYCLVPV------------SSTKINFGTNGIVSGPGVVSTPLT 274
           G +SL++Q+      +FSYC                S       T    +   +V+ P+ 
Sbjct: 101 GGVSLLNQLGID---RFSYCFSSSGAPGSSAVFLGGSPELATNATTTPAASTPMVADPVL 157

Query: 275 KAKTFYVLTIDAISVGNQRLGVSTPD--------IVIDSGTTLTFLPQG----YNSNLLS 322
           K+  F  L    ++VG  R+ V+           +VIDS + +T L +         L++
Sbjct: 158 KSGYFVKLV--GVTVGATRVDVAGASSAEGGGRALVIDSTSPVTVLDEATYGPVRRALVA 215

Query: 323 VMSSMIEAQPVADPTGSLELCYSFNSLSQVP-----EVTIHFRG--ADVKLSRSNFFVKV 375
            ++ + EA   A     L+LC+   +    P      +T+HF G  AD+ L  +N+  K 
Sbjct: 216 QLAPLKEANANASAGVGLDLCFELAAGGATPTPPNVTMTLHFDGGAADLVLPPANYLAKD 275

Query: 376 SE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           S   ++C ++    +N VP+ G+    + LV YD+ +  VSF+P DC
Sbjct: 276 SAGGLICLTMTPSSSNGVPVLGSSALLDTLVLYDLAKNVVSFQPLDC 322


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 157/376 (41%), Gaps = 64/376 (17%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ + IGTPP  +  + DTGS L W QC    P +     S +FDP +SS++  LPC+  
Sbjct: 78  LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRK--PPPSTVFDPSLSSSFSVLPCNHP 135

Query: 152 QCA----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C          +   +N  C YS  Y DG+ + GNL  E +T   +T Q+   P +  G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF--STSQST--PPLILG 191

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNG---I 262
           C  +      S   GI+G+  G +S  SQ + T   KFSYC VP    +  F   G   +
Sbjct: 192 CAED-----ASDDKGILGMNLGRLSFASQAKIT---KFSYC-VPTRQVRPGFTPTGSFYL 242

Query: 263 VSGPGVVSTPLTKAKTF-------------YVLTIDAISVGNQRLGVSTPDI-------- 301
              P           TF             + + +  I +GN++L +             
Sbjct: 243 GENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRADPSGAG 302

Query: 302 --VIDSGTTLTFLPQ-GYN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QV 352
             +IDSG+  T+L    YN     ++ +    ++   V   +G  ++C+  N++     +
Sbjct: 303 QSMIDSGSEFTYLVDVAYNKVREEVVRLAGPRLKKGYVY--SGVSDMCFDGNAMEIGRLI 360

Query: 353 PEVTIHF-RGADVKLSRSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGY 406
             +   F +G ++ + +      V   + C     S   G  ++  I GN  Q N  V +
Sbjct: 361 GNMVFEFDKGVEIVIEKGRVLADVGGGVHCVGIGRSEMLGAASN--IIGNFHQQNLWVEF 418

Query: 407 DIEQQTVSFKPTDCTK 422
           DI  + V F   DC++
Sbjct: 419 DIANRRVGFGKADCSR 434


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score =  103 bits (257), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 122/447 (27%), Positives = 189/447 (42%), Gaps = 44/447 (9%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDS-------PKSPFYNSSETPYQRL---RDAL 58
            IL  +  +V+   E   G F  E  HR S       P     N   + Y R+   RD L
Sbjct: 14  LILMLVSSWVLDRCEG-LGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL 72

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIW 116
            R   RL   +++ S+ +       I  N   +L    +++GTP    L   DTGSDL W
Sbjct: 73  IRG-RRLA--SEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFW 129

Query: 117 TQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
             C+ C  + C  +         D  ++ P  SST   +PC+S+ C  +++ +    +C 
Sbjct: 130 LPCD-CS-TNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCP 187

Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFN--SKTTGIVG 223
           Y + Y  +G+ S G L  + + L S    +  +   IT GCG    G+F+  +   G+ G
Sbjct: 188 YQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFG 247

Query: 224 LGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLT--KAKTF 279
           LG  DIS+ S +      A  FS C     + +I+FG  G V       TPL   +    
Sbjct: 248 LGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQR---ETPLNIRQPHPT 304

Query: 280 YVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTG 338
           Y +T+  ISVG    G    D V D+GT+ T+L    Y     S  S  ++ +   D   
Sbjct: 305 YNVTVTQISVGGN-TGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSEL 363

Query: 339 SLELCYSF--NSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIY 394
             E CY+   N  S + P+V +  +G           V   ED V      + +  + I 
Sbjct: 364 PFEYCYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISII 423

Query: 395 GNIMQTNFLVGYDIEQQTVSFKPTDCT 421
           G    T + V +D E+  + +K +DC+
Sbjct: 424 GQNFMTGYRVVFDREKLILGWKESDCS 450


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 117/419 (27%), Positives = 182/419 (43%), Gaps = 52/419 (12%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
           + + H +S  SPF  S       L+D A    L+ L    ++S  I+S +A     I  +
Sbjct: 31  LRVFHINSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C
Sbjct: 86  PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141

Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + QC      SC+   +C ++++YG GS     L  +T+TL S       +P  TFGC 
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
            N     +    G++GLG G +SLISQ +      FSYCL   +S   NF +  +  GP 
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGPK 251

Query: 268 -----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
                + +TPL K     + Y + +  I VGN+ + + T  +          + DSGT  
Sbjct: 252 NQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
           T L +      ++V +        A+ T  G  + CYS + +   P VT  F G +V L 
Sbjct: 312 TRLVE---PAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLP 366

Query: 368 RSNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N  +  S  ++ C         + + + +  ++ Q N  V  D+    +      CT
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|125575541|gb|EAZ16825.1| hypothetical protein OsJ_32297 [Oryza sativa Japonica Group]
          Length = 416

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 160/364 (43%), Gaps = 59/364 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D              P+ C         P  SST++  
Sbjct: 67  NVANF----TIGTPPQPASAIIDVAG-----------PAPCSF-------PNASSTFRPE 104

Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           PC +  C S+   +CS   C Y  +++   G  + G +AT+T  +G+ T        + F
Sbjct: 105 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 158

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSS---TKINFGTNG 261
           GC   +G       +G++GLG    SL+SQM  T   KFSYCL P  S   +++  G++ 
Sbjct: 159 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNIT---KFSYCLTPHDSGKNSRLLLGSSA 215

Query: 262 IVSG-------PGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--STPDIVIDSGTTLTFL 312
            ++G       P V ++P      +Y + +D I  G+  + +  S   +++ +   ++FL
Sbjct: 216 KLAGGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFL 275

Query: 313 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSR 368
                  L   ++  + A P A P    +LC+    LS    P++   F+   A + +  
Sbjct: 276 VDSAYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPP 335

Query: 369 SNFFVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 418
             + + V E+   VC             +  ++ I G++ Q N     D+E++T+SF+P 
Sbjct: 336 PKYLIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPA 395

Query: 419 DCTK 422
           DC  
Sbjct: 396 DCAH 399


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 171/372 (45%), Gaps = 52/372 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+P  +     DTGSD++W     C  CP S     +   FD   SST   + 
Sbjct: 83  YFTKVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVS 142

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG 201
           C+   C+   Q + SG +     C Y+  YGDGS + G   ++T+   +   GQ++    
Sbjct: 143 CADPICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANS 202

Query: 202 ---ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSST 253
              I FGC T   G     +    GI G G G +S+ISQ+  R      FS+CL      
Sbjct: 203 SSTIVFGCSTYQSGDLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHCL------ 256

Query: 254 KINFGTNG---IVSG----PGVVSTPLTKAKTFYVLTIDAISVGNQRLGVST-------- 298
               G NG   +V G    P +V +PL  +   Y L + +I+V  Q L + +        
Sbjct: 257 --KGGENGGGVLVLGEILEPSIVYSPLVPSLPHYNLNLQSIAVNGQLLPIDSNVFATTNN 314

Query: 299 PDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF-NSLSQV-PE 354
              ++DSGTTL +L Q  YN  + ++ +++ + ++P+         CY   NS+  + P+
Sbjct: 315 QGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ---CYLVSNSVGDIFPQ 371

Query: 355 VTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 409
           V+++F  GA + L+  ++ +      S  + C  F+ +     I G+++  + +  YD+ 
Sbjct: 372 VSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYDLA 431

Query: 410 QQTVSFKPTDCT 421
            Q + +   +C+
Sbjct: 432 NQRIGWADYNCS 443


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 110/403 (27%), Positives = 171/403 (42%), Gaps = 36/403 (8%)

Query: 45  NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           N   + Y R+   RD L R   RL + +Q+    S       +      +   +++GTP 
Sbjct: 56  NRDSSKYYRVMAHRDRLIRG-RRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPS 114

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQ 152
              +   DTGSDL W    PC  + C  +         D  ++ P  SST   +PC+S+ 
Sbjct: 115 DWFMVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTL 171

Query: 153 CASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNN 210
           C   ++ +    +C Y + Y  +G+ S G L  + + L S    + A+P  +TFGCG   
Sbjct: 172 CTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQ 231

Query: 211 GGLFN--SKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
            G+F+  +   G+ GLG  DIS+ S +      A  FS C     + +I+FG  G V   
Sbjct: 232 TGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQR 291

Query: 267 GVVSTPLT--KAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM 324
               TPL   +    Y +T+  ISVG    G    D V DSGT+ T+L     + +    
Sbjct: 292 ---ETPLNIRQPHPTYNITVTKISVGGN-TGDLEFDAVFDSGTSFTYLTDAAYTLISESF 347

Query: 325 SSMI--EAQPVADPTGSLELCYSF--NSLS-QVPEVTIHFRGADVKLSRSNFFV--KVSE 377
           +S+   +     D     E CY+   N  S Q P V +  +G           V      
Sbjct: 348 NSLALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDT 407

Query: 378 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           D+ C     I + + I G    T + V +D E+  + +K +DC
Sbjct: 408 DVYCLAIMKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 172/369 (46%), Gaps = 46/369 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP +     DTGSD++W  C     CP +         FDP  S T   + 
Sbjct: 81  YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS  +C+   Q S SG +     C Y+  YGDGS ++G   ++ +      G ++   + 
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
             + FGC T+  G     +    GI G G   +S+ISQ+ +  IA + FS+CL      K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL------K 254

Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
              G  GI     +  P +V TPL  ++  Y + + +ISV  Q L ++ P +        
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313

Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
            +ID+GTTL +L +      +  +++ + +Q V         CY    S+  + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVITTSVGDIFPPVSLN 372

Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
           F  GA + L+  ++ ++ +      + C  F+ I N  + I G+++  + +  YD+  Q 
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432

Query: 413 VSFKPTDCT 421
           + +   DC+
Sbjct: 433 IGWANYDCS 441


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 156/380 (41%), Gaps = 55/380 (14%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 98  GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 156

Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C    C    ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 157 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 216

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------- 250
              + FGC    +       +TGI+ LG G  S ++Q+      +FSYC +P        
Sbjct: 217 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 272

Query: 251 -------SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA----------------I 287
                  S++ + FG++  ++G      P  +  + Y + + +                +
Sbjct: 273 DDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 329

Query: 288 SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
            V  +    + P +++DSGTTL +LP      L   +   I      D T     CY  N
Sbjct: 330 YVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGN 388

Query: 348 SLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
               +   VT+ F  GAD++L  ++ F     ++ED VC        +  I G   Q N 
Sbjct: 389 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQRNI 446

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            VGYD+    ++F    C +
Sbjct: 447 NVGYDLSTMEIAFDRDQCDR 466


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 156/380 (41%), Gaps = 55/380 (14%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 119 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 177

Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C    C    ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 178 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 237

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPV------- 250
              + FGC    +       +TGI+ LG G  S ++Q+      +FSYC +P        
Sbjct: 238 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 293

Query: 251 -------SSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDA----------------I 287
                  S++ + FG++  ++G      P  +  + Y + + +                +
Sbjct: 294 DDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPVPV 350

Query: 288 SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 347
            V  +    + P +++DSGTTL +LP      L   +   I      D T     CY  N
Sbjct: 351 YVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYLGN 409

Query: 348 SLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNF 402
               +   VT+ F  GAD++L  ++ F     ++ED VC        +  I G   Q N 
Sbjct: 410 MTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQRNI 467

Query: 403 LVGYDIEQQTVSFKPTDCTK 422
            VGYD+    ++F    C +
Sbjct: 468 NVGYDLSTMEIAFDRDQCDR 487


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYKS 145
           Y   +++GTP    L   DTGSDL W  C+   C       Q   +  ++ P  SST K 
Sbjct: 130 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 189

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-IT 203
           + CSSS C+ L+Q S     C Y VSY  D + S G L  + + L +   Q+  +   IT
Sbjct: 190 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARIT 249

Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG +  G F S     G+ GLG  ++S+ S +     I+  FS C  P    +I FG 
Sbjct: 250 LGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGD 309

Query: 260 NGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPD--IVIDSGTTLTFLPQG 315
            G    PG   TP  L +    Y ++I  I VG     +S  D  ++ DSGT+ T+L   
Sbjct: 310 KG---SPGQNETPFNLGRRHPTYNVSITQIGVGGH---ISDLDVAVIFDSGTSFTYLNDP 363

Query: 316 YNSNLLSVMSSMI-EAQPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GADVKLSRSN 370
             S      +SM+ E Q   +     E CY  +   +    P + +  + G    ++   
Sbjct: 364 AYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINHPI 423

Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             +      +  +    ++S+ I G    T + + +D E+  + +K ++CT
Sbjct: 424 VLISTESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT 474


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 117/419 (27%), Positives = 182/419 (43%), Gaps = 52/419 (12%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
           + + H +S  SPF  S       L+D A    L+ L    ++S  I+S +A     I  +
Sbjct: 31  LRVFHINSLCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C
Sbjct: 86  PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141

Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + QC      SC+   +C ++++YG GS     L  +T+TL S       +P  TFGC 
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGPG 267
            N     +    G++GLG G +SLISQ +      FSYCL   +S   NF +  +  GP 
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGPK 251

Query: 268 -----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTTL 309
                + +TPL K     + Y + +  I VGN+ + + T  +          + DSGT  
Sbjct: 252 NQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311

Query: 310 TFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQVPEVTIHFRGADVKLS 367
           T L +      ++V +        A+ T  G  + CYS + +   P VT  F G +V L 
Sbjct: 312 TRLVE---PAYVAVRNEFRRRVKNANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLP 366

Query: 368 RSNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             N  +  S  ++ C         + + + +  ++ Q N  V  D+    +      CT
Sbjct: 367 PDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|357476865|ref|XP_003608718.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355509773|gb|AES90915.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 482

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 113/411 (27%), Positives = 167/411 (40%), Gaps = 92/411 (22%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP----------- 137
           ++Y +  ++G          DTGSDL+W  C P     C ++     DP           
Sbjct: 73  SDYTLSFNLGPHSQPITLYMDTGSDLVWFPCTPFNCILCELKPKLTSDPSPPTNISHSTP 132

Query: 138 ----------KMSSTYKSLPCSSSQCA--SLNQKSCSGVNC-QYSVSYGDGSFSNGNLAT 184
                       SST  S  C+ + C   S+  K C   +C  +  +YGDGS    +L  
Sbjct: 133 ISCNSHACSVAHSSTPSSDLCTMAHCPLDSIETKDCGSFHCPPFYYAYGDGSLI-ASLYR 191

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---TIAG 241
           +T++L +     + L   TFGC         S+ TG+ G G G +SL +Q+ T    +  
Sbjct: 192 DTLSLST-----LQLTNFTFGCAHTT----FSEPTGVAGFGRGLLSLPAQLATHSPQLGN 242

Query: 242 KFSYCLVPVS--STKI---------NFGTNGIVSGPGVVSTPLT------KAKTFYVLTI 284
           +FSYCLV  S  S +I          +      +G  VV    T      K   FY + +
Sbjct: 243 RFSYCLVSHSFRSERIRKPSPLILGRYNDEKQSNGDEVVEFVYTSMLENPKHSYFYTVGL 302

Query: 285 DAISVGNQRLGVSTPDI------------VIDSGTTLTFLPQGYNSNLLS-----VMSSM 327
             ISVG +   V  P I            V+DSGTT T LP+ + ++++         S 
Sbjct: 303 KGISVGKKT--VPAPKILRRVNKKGDGGVVVDSGTTFTMLPEKFYNSVVEGFDRRARKSN 360

Query: 328 IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD--VKLSRSNFF---------VKVS 376
             A  +   TG L  CY  N+ + VP VT+ F G +  V L R N+F         V+  
Sbjct: 361 RRAPEIEQKTG-LSPCYYLNTAAIVPAVTLRFVGMNSSVVLPRKNYFYEFMDGGDGVRRK 419

Query: 377 EDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           E + C +F    +          + GN  Q  F V YD+E++ V F    C
Sbjct: 420 ERVGCLMFMNGGDEAEMSGGPGGVLGNYQQQGFEVEYDLEKKRVGFARRKC 470


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 99/351 (28%), Positives = 155/351 (44%), Gaps = 26/351 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYKS 145
           Y   +++GTP    L   DTGSDL W  C+   C       Q   +  ++ P  SST K 
Sbjct: 107 YYAEVTVGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKE 166

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-IT 203
           + CSSS C+ L+Q S     C Y VSY  D + S G L  + + L +   Q+  +   IT
Sbjct: 167 VQCSSSLCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARIT 226

Query: 204 FGCGTNNGGLFNSKTT--GIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGT 259
            GCG +  G F S     G+ GLG  ++S+ S +     I+  FS C  P    +I FG 
Sbjct: 227 LGCGKDQSGAFLSSAAPNGLFGLGIENVSVPSILANAGLISNSFSLCFGPARMGRIEFGD 286

Query: 260 NGIVSGPGVVSTP--LTKAKTFYVLTIDAISVGNQRLGVSTPD--IVIDSGTTLTFLPQG 315
            G    PG   TP  L +    Y ++I  I VG     +S  D  ++ DSGT+ T+L   
Sbjct: 287 KG---SPGQNETPFNLGRRHPTYNVSITQIGVGGH---ISDLDVAVIFDSGTSFTYLNDP 340

Query: 316 YNSNLLSVMSSMI-EAQPVADPTGSLELCYSFN---SLSQVPEVTIHFR-GADVKLSRSN 370
             S      +SM+ E Q   +     E CY  +   +    P + +  + G    ++   
Sbjct: 341 AYSLFADKFASMVEEKQFTMNSDIPFENCYELSPNQTTFTYPLMNLTMKGGGHFVINHPI 400

Query: 371 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
             +      +  +    ++S+ I G    T + + +D E+  + +K ++CT
Sbjct: 401 VLISTESKRLFCLAIARSDSINIIGQNFMTGYHIVFDREKMVLGWKESNCT 451


>gi|359492825|ref|XP_002284255.2| PREDICTED: aspartic proteinase-like protein 1-like [Vitis vinifera]
          Length = 531

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 37/429 (8%)

Query: 29  FSVELIHR--DSPKSPFYN------SSETPYQR----LRDALTRSLNR--LNHFNQNSSI 74
           FS +LIHR  D  K+ F +      +   P +R     R  L+  L R  L    +   +
Sbjct: 25  FSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLL 84

Query: 75  SSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE--PCPP-SQCYM 129
             S+ S A  + N   +L    I IGTP    L   D GSDL+W  C+   C P S  Y 
Sbjct: 85  FPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAPLSASYY 144

Query: 130 ----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLAT 184
               +D   + P +SST K L C+   C   +    S   C Y  SY  + + S+G L  
Sbjct: 145 DRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIE 204

Query: 185 ETVTLGSTTGQAV---ALPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTT- 238
           + + L   +  A        +  GCG    G F+  +   G++GLG GD+S+ S +    
Sbjct: 205 DRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAG 264

Query: 239 -IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
            +   FS C     S  I FG  G+V+       PL      Y++ ++   VG+  L  +
Sbjct: 265 LVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTA 324

Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEV 355
               ++DSGT+ TFLP      ++      + A   +      + CY+ +S  L  +P V
Sbjct: 325 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTV 384

Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQ 411
           T+ F      +  +     +SE+   +VF      +     I+  NF+ GY    D E  
Sbjct: 385 TLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENL 444

Query: 412 TVSFKPTDC 420
            + +  ++C
Sbjct: 445 KLGWSTSNC 453


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/338 (29%), Positives = 152/338 (44%), Gaps = 48/338 (14%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+I + +GTP   ++   DTGS   W  CE C    C+         + S+T   + C +
Sbjct: 1   YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 56

Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C         Q S +  +C + VSY DGS S G L  +T+T          +PG TFG
Sbjct: 57  SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPGFTFG 112

Query: 206 CGTNNGGLFN-SKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINF--GTNGI 262
           C  ++ G        G++G+G G +S++ Q   T  G FSYCL P+  ++  F   T G 
Sbjct: 113 CNMDSFGANEFGNVDGLLGMGAGQMSVLKQSSPTFDG-FSYCL-PLQMSERGFFSKTTGY 170

Query: 263 VSGPGVVSTPLTKAK-----------TFYVLTIDAISVGNQRLGV-----STPDIVIDSG 306
            S  G ++   T  +             + + + AISV  +RLG+     S   +V DSG
Sbjct: 171 FSLGGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFSRKGVVFDSG 230

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHF-R 360
           + L+++P       LSV+S  I     +  A    S   CY   S+ +  +P +++HF  
Sbjct: 231 SELSYIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDD 286

Query: 361 GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYG 395
           GA   L     FV+ S   +D+ C  F   T SV I G
Sbjct: 287 GARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIG 323


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 172/369 (46%), Gaps = 46/369 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP +     DTGSD++W  C     CP +         FDP  S T   + 
Sbjct: 81  YYTKLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPIS 140

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS  +C+   Q S SG +     C Y+  YGDGS ++G   ++ +      G ++   + 
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
             + FGC T+  G     +    GI G G   +S+ISQ+ +  IA + FS+CL      K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL------K 254

Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
              G  GI     +  P +V TPL  ++  Y + + +ISV  Q L ++ P +        
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313

Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
            +ID+GTTL +L +      +  +++ + +Q V         CY    S+  + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVITTSVGDIFPPVSLN 372

Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
           F  GA + L+  ++ ++ +      + C  F+ I N  + I G+++  + +  YD+  Q 
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432

Query: 413 VSFKPTDCT 421
           + +   DC+
Sbjct: 433 IGWANYDCS 441


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 111/418 (26%), Positives = 178/418 (42%), Gaps = 50/418 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---ASQADIIPN 87
           + + H +S  SPF  S         D L +   R  + +  + ++ S    AS   I+  
Sbjct: 31  LRVFHINSQCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVTKSSVPIASGRGIV-Q 84

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L 
Sbjct: 85  SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQ 140

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C + QC      SC+   +C ++++YG GS     L  +T+TL +       +P  TFGC
Sbjct: 141 CEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDV-----IPNYTFGC 194

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGTNGIVSGP 266
             N     +    G++GLG G +SLISQ +      FSYCL   +S   NF +  +  GP
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQSQNLYQSTFSYCL--PNSKSSNF-SGSLRLGP 250

Query: 267 G-----VVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI----------VIDSGTT 308
                 + +TPL K     + Y + +  I VGN+ + + T  +          + DSGT 
Sbjct: 251 KNQPIRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTV 310

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 368
            T L +     + +     ++    A   G  + CYS + +   P VT  F G +V L  
Sbjct: 311 YTRLVEPAYVAMRNEFRRRVK-NANATSLGGFDTCYSGSVV--FPSVTFMFAGMNVTLPP 367

Query: 369 SNFFVKVSE-DIVCSVFKG----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
            N  +  S  ++ C         + + + +  ++ Q N  V  D+    +      CT
Sbjct: 368 DNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETCT 425


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 164/382 (42%), Gaps = 62/382 (16%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C      +     +  F P+ S+T+ ++
Sbjct: 57  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCA---TGRAAAAAADSFRPRASATFAAV 113

Query: 147 PCSSSQCASLN---QKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC S++C+S +     SC      C+ S+SY DGS S+G LAT+   +G       A   
Sbjct: 114 PCGSARCSSRDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAVGDAPPLRSA--- 170

Query: 202 ITFGC--GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFGT 259
             FGC     +       T G++G+  G +S ++Q  T    +FSYC+       +    
Sbjct: 171 --FGCMSAAYDSSPDAVATAGLLGMNRGALSFVTQASTR---RFSYCISDRDDAGVLLLG 225

Query: 260 NGIVSGPGVVSTPLTKA--------KTFYVLTIDAISVGNQRL----GVSTPD------I 301
           +  +    +  TPL +         +  Y + +  I VG + L     V  PD       
Sbjct: 226 HSDLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQT 285

Query: 302 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNS----- 348
           ++DSGT  TFL         +  L     ++ A  + DP+     + + C+         
Sbjct: 286 MVDSGTQFTFLLGDAYSAVKAEFLKQTKPLLPA--LEDPSFAFQEAFDTCFRVPKGRPPP 343

Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP----IYGNIM 398
            +++P VT+ F GA + ++      KV      ++ + C  F G  + VP    + G+  
Sbjct: 344 SARLPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTF-GNADMVPLTAYVIGHHH 402

Query: 399 QTNFLVGYDIEQQTVSFKPTDC 420
           Q N  V YD+E+  V   P  C
Sbjct: 403 QMNLWVEYDLERGRVGLAPVKC 424


>gi|302141912|emb|CBI19115.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 179/429 (41%), Gaps = 37/429 (8%)

Query: 29  FSVELIHR--DSPKSPFYN------SSETPYQR----LRDALTRSLNR--LNHFNQNSSI 74
           FS +LIHR  D  K+ F +      +   P +R     R  L+  L R  L    +   +
Sbjct: 15  FSSKLIHRFSDEAKAFFVSRNGNIFADSWPKKRSFDYYRLLLSSDLKRQKLKLGAEYQLL 74

Query: 75  SSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE--PCPP-SQCYM 129
             S+ S A  + N   +L    I IGTP    L   D GSDL+W  C+   C P S  Y 
Sbjct: 75  FPSEGSDALFLGNEFGWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCMQCAPLSASYY 134

Query: 130 ----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLAT 184
               +D   + P +SST K L C+   C   +    S   C Y  SY  + + S+G L  
Sbjct: 135 DRLGRDLNEYSPSLSSTSKPLSCNDQLCELGSDCKSSKDPCPYLASYYSENTSSSGLLIE 194

Query: 185 ETVTLGSTTGQAV---ALPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQMRTT- 238
           + + L   +  A        +  GCG    G F+  +   G++GLG GD+S+ S +    
Sbjct: 195 DRLHLAPFSEHASRSSVWASVIIGCGRKQSGAFSDGAAPDGLMGLGPGDLSVPSLLAKAG 254

Query: 239 -IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVS 297
            +   FS C     S  I FG  G+V+       PL      Y++ ++   VG+  L  +
Sbjct: 255 LVRNTFSICFDDNHSGTILFGDQGLVTQKSTSFVPLEGKFVTYLIEVEGYLVGSSSLKTA 314

Query: 298 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEV 355
               ++DSGT+ TFLP      ++      + A   +      + CY+ +S  L  +P V
Sbjct: 315 GFQALVDSGTSFTFLPYEIYEKIVVEFDKQVNATRSSFKGSPWKYCYNSSSQELLNIPTV 374

Query: 356 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQ 411
           T+ F      +  +     +SE+   +VF      +     I+  NF+ GY    D E  
Sbjct: 375 TLVFAMNQSFIVHNPVIKLISENEEFNVFCLPIQPIHEEFGIIGQNFMWGYRMVFDRENL 434

Query: 412 TVSFKPTDC 420
            + +  ++C
Sbjct: 435 KLGWSTSNC 443


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score =  102 bits (255), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 115/426 (26%), Positives = 169/426 (39%), Gaps = 55/426 (12%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA  R L   +    +  ++S+  +     P+   
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +GTP  + L   DT +D  W+ C PC    C       F P  SS+Y SLPC+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134

Query: 151 SQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
             C     + C            C +S  + D SF   +L ++T+ LG       A+ G 
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS----STKINF 257
            FGC G   G   N    G++GLG G +SL+SQ  +T  G FSYCL        S  +  
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNGVFSYCLPSYRSYYFSGSLRL 248

Query: 258 GTNGIVSGPGVVSTPL---TKAKTFYVLTIDAISVGNQRLGVSTP------------DIV 302
           G  G      V  TPL       + Y + +  +SVG  R  V  P              V
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVG--RTWVKVPAGSFAFDPATGAGTV 304

Query: 303 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 360
           IDSGT +T       + L       + A       G+ + C++ + ++    P VT+H  
Sbjct: 305 IDSGTVITRWTAPVYAALREEFRRQVAAPSGYTSLGAFDTCFNTDEVAAGGAPPVTLHMD 364

Query: 361 GA-DVKLSRSNFFVKVSE-DIVCSVF----KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
           G  D+ L   N  +  S   + C       + +   V +  N+ Q N  V  D+    V 
Sbjct: 365 GGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVDVAGSRVG 424

Query: 415 FKPTDC 420
           F    C
Sbjct: 425 FAREPC 430


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 95/306 (31%), Positives = 144/306 (47%), Gaps = 32/306 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W  C     CP +         FDP  SST   + 
Sbjct: 25  YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84

Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAV 197
           CS  +C +  Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + 
Sbjct: 85  CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 144

Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSS 252
           A   + FGC     G     +    GI G G  ++S+ISQ+ +  IA + FS+CL   SS
Sbjct: 145 AP--VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSS 202

Query: 253 TKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTP--------DIVID 304
                    IV  P +V T L  A+  Y L + +I+V  Q L + +           ++D
Sbjct: 203 GGGILVLGEIVE-PNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVD 261

Query: 305 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFRGA 362
           SGTTL +L +      +S +++ I  Q V         CY   +S+++V P+V+++F G 
Sbjct: 262 SGTTLAYLAEEAYDPFVSAITASI-PQSVHTAVSRGNQCYLITSSVTEVFPQVSLNFAGG 320

Query: 363 DVKLSR 368
              + R
Sbjct: 321 ASMILR 326


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 100 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 158

Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C    C    ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 159 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 218

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
              + FGC    +       +TGI+ LG G  S ++Q+      +FSYC +P S      
Sbjct: 219 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 274

Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
                     ++ + FG++  ++G      P  +  + Y + + ++              
Sbjct: 275 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 331

Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
              V  +    + P +++DSGTTL +LP      L   +   I      D T     CY 
Sbjct: 332 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 390

Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
            N    +   VT+ F  GAD++L  ++ F     ++ED VC        +  I G   Q 
Sbjct: 391 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 448

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           N  VGYD+    ++F    C +
Sbjct: 449 NINVGYDLSTMEIAFDRDQCDR 470


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score =  102 bits (254), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 118 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 176

Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C    C    ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 177 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 236

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
              + FGC    +       +TGI+ LG G  S ++Q+      +FSYC +P S      
Sbjct: 237 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 292

Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
                     ++ + FG++  ++G      P  +  + Y + + ++              
Sbjct: 293 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 349

Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
              V  +    + P +++DSGTTL +LP      L   +   I      D T     CY 
Sbjct: 350 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 408

Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
            N    +   VT+ F  GAD++L  ++ F     ++ED VC        +  I G   Q 
Sbjct: 409 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 466

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           N  VGYD+    ++F    C +
Sbjct: 467 NINVGYDLSTMEIAFDRDQCDR 488


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 156/382 (40%), Gaps = 57/382 (14%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 97  GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 155

Query: 144 KSLPCSSSQCA---SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C    C    ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 156 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 215

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVS------ 251
              + FGC    +       +TGI+ LG G  S ++Q+      +FSYC +P S      
Sbjct: 216 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQLGVD---RFSYC-IPASEITDDD 271

Query: 252 ----------STKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAI-------------- 287
                     ++ + FG++  ++G      P  +  + Y + + ++              
Sbjct: 272 DDDDDDEERSASFLRFGSHARMTGK---RAPFKQDGSGYAVRLKSVVYQHGGRLNQQQPV 328

Query: 288 --SVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 345
              V  +    + P +++DSGTTL +LP      L   +   I      D T     CY 
Sbjct: 329 PVYVAGEEAAAAMP-MLVDSGTTLLWLPGSVFYPLQRRIEEDISLTRRYDLTHPSLYCYL 387

Query: 346 FNSLS-QVPEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQT 400
            N    +   VT+ F  GAD++L  ++ F     ++ED VC        +  I G   Q 
Sbjct: 388 GNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNRA--ILGVYPQR 445

Query: 401 NFLVGYDIEQQTVSFKPTDCTK 422
           N  VGYD+    ++F    C +
Sbjct: 446 NINVGYDLSTMEIAFDRDQCDR 467


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 164/381 (43%), Gaps = 80/381 (20%)

Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
           PP     V DTGS+L W +C     P P +         FDP  SS+Y  +PCSS  C +
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133

Query: 156 LNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
             +      SC S   C  ++SY D S S GNLA E    G++T  +     + FGC  +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189

Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTK-----INFGTNG 261
             G     ++KTTG++G+  G +S ISQM      KFSYC   +S T      +  G + 
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQMGFP---KFSYC---ISGTDDFPGFLLLGDSN 243

Query: 262 IVSGPGVVSTPLTKAKT--------FYVLTIDAISVGNQRL----GVSTPD------IVI 303
                 +  TPL +  T         Y + +  I V  + L     V  PD       ++
Sbjct: 244 FTWLTPLNYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMV 303

Query: 304 DSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCYSFNS------- 348
           DSGT  TFL         S+ L+  + ++      DP     G+++LCY  +        
Sbjct: 304 DSGTQFTFLLGPVYTALRSHFLNRTNGILTV--YEDPDFVFQGTMDLCYRISPVRIRSGI 361

Query: 349 LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNIMQ 399
           L ++P V++ F GA++ +S      +V      ++ + C  F     +     + G+  Q
Sbjct: 362 LHRLPTVSLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQ 421

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            N  + +D+++  +   P +C
Sbjct: 422 QNMWIEFDLQRSRIGLAPVEC 442


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 99/365 (27%), Positives = 163/365 (44%), Gaps = 45/365 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I+IG          D+GSDL W QC+  P + C      L+ P  ++    L C  
Sbjct: 55  YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C SL+      C   +  CQY + Y D   S G L  + V L  T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168

Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
           CG ++       +  T G++GLG G++S ISQ+ +   +     +CL         F  +
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCL--SDEGGFLFFGD 226

Query: 261 GIVSGPGVVSTPLTKAK--TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFL-PQGYN 317
             V   GV  T ++     ++Y      +    +  G+    +V DSG++ T+   Q YN
Sbjct: 227 EFVPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYN 286

Query: 318 SNLLSVMSSMIEAQPVAD--PTGSLELCYS----FNSLSQVPE----VTIHF---RGADV 364
           S +L+++ + +  +P+ D     SL +C+     F SL  V +    + + F   + A +
Sbjct: 287 S-ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNAQI 345

Query: 365 KLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 417
           +L   N+ +      VC    GI N        + I G+I   + +V YD E++ + + P
Sbjct: 346 QLPPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFP 402

Query: 418 TDCTK 422
           T+C K
Sbjct: 403 TNCNK 407


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/305 (29%), Positives = 133/305 (43%), Gaps = 26/305 (8%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ++     K    +CL 
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG + IV        P+ +  ++ +Y      +  G + LGV   ++V DSG
Sbjct: 223 TRGGGFLFFGDD-IVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSG 281

Query: 307 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVK 365
           ++ T+        L+  +   +       P  SL LC+        V +V   FR   V 
Sbjct: 282 SSFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFR--TVV 339

Query: 366 LSRSN 370
           LS SN
Sbjct: 340 LSFSN 344


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 153/360 (42%), Gaps = 34/360 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y   IS+G+PP       DTGS   W QC+  P + C     PL+ P  + T  +LP S 
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRP--ARTADALPASD 217

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
             C     ++ +   C Y +SY DGS S G    +++      G+      I FGCG + 
Sbjct: 218 PLCEGAQHENPN--QCDYEISYADGSSSMGVYVRDSMQFVGEDGERENA-DIVFGCGYDQ 274

Query: 211 GG-LFNS--KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLV--PVSSTKINFGTNGIV 263
            G L N+   T G++GL    +SL +Q+  R  I+  F +C+   P  +    F  +  +
Sbjct: 275 QGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334

Query: 264 SGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTP--DIVIDSGTTLTFLPQGYNSN 319
              G+   P+    A       +  I+ G+Q+L        +V D+G+T T+ P    + 
Sbjct: 335 PRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTR 394

Query: 320 LLSVMSSMIEAQPVADPTG-SLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKV-- 375
           L+S +      + V D +  +L  C   +  +  V +V   F+   ++  +  FF +   
Sbjct: 395 LISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFN 454

Query: 376 -----------SEDIVCSVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 421
                        ++   V  G T   +SV I G++     LV YD ++  V +   DCT
Sbjct: 455 IRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCT 514


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 114/412 (27%), Positives = 183/412 (44%), Gaps = 44/412 (10%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
           + Q  G ++++IH  SP SPF  S    ++    +++   T  L  L+      SI    
Sbjct: 23  DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           AS   II  +  Y++R  IGTPP   L   DT +D  W  C  C    C    S LF P+
Sbjct: 82  ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGC---ASTLFAPE 135

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+T+K++ C++ +C  +    C   +  ++++YG  S +  NL  +T+TL +       
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATD-----P 189

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           +P  TFGC +   G  ++   G++GLG G +SL+SQ +      FSYCL    S  +NF 
Sbjct: 190 VPSYTFGCVSKTTGT-SAPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKS--LNFS 246

Query: 259 TN---GIVSGPGVVS-TPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
            +   G V+ P  +  TPL K     + Y + ++AI VG + + +    +          
Sbjct: 247 GSLRLGPVAQPKRIKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGT 306

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 361
           + DSGT  T L       +       +  +      G  + CY  N    VP +T  F G
Sbjct: 307 IFDSGTVFTRLVAPVYVAVRDEFRRRVGPKLTVTSLGGFDTCY--NVPIVVPTITFIFTG 364

Query: 362 ADVKLSRSNFFVK-VSEDIVCSVFKGITNSV----PIYGNIMQTNFLVGYDI 408
            +V L + N  +   +    C    G  ++V     +  N+ Q N  V YD+
Sbjct: 365 MNVTLPQDNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 159/360 (44%), Gaps = 35/360 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P   C      L+ PK +     +PCS+
Sbjct: 54  YSVILNIGNPPKAFDFDIDTGSDLTWVQCD-APCKGCTKPRDKLYKPKNN----LVPCSN 108

Query: 151 SQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C ++       C   +  C Y + Y D   S G L +++  L  + G  +  P + FG
Sbjct: 109 SLCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQ-PKMAFG 167

Query: 206 CGTNNGGLFNS---KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLVPVSSTKINFGTN 260
           CG +   L       T GI+GLG G +S++SQ+RT         +C        + FG +
Sbjct: 168 CGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDH 227

Query: 261 GIVSGPGVVSTPLTK--AKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNS 318
            +     +  TP+ +  + T Y      +  G +  G+    ++ DSG++ T+       
Sbjct: 228 -LFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQ 286

Query: 319 NLLSVMSSMIEAQPVAD-PTGSLELCYS--------FNSLSQVPEVTIHFRGA---DVKL 366
           ++L+++   +  +P+ D P   L +C+          +  S    +TI F  A    ++L
Sbjct: 287 SILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKNVQLQL 346

Query: 367 SRSNFFVKVSEDIVC-SVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +  ++ +   +  VC  +  G    +    + G+I   + +V YD E+Q + + P +C +
Sbjct: 347 APEDYLIITKDGNVCLGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANCDR 406


>gi|18409320|ref|NP_566948.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|27754243|gb|AAO22575.1| unknown protein [Arabidopsis thaliana]
 gi|332645259|gb|AEE78780.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 529

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 104/356 (29%), Positives = 154/356 (43%), Gaps = 38/356 (10%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM--------QDSP--LFDPKMSSTYK 144
           +S+GTP T  L   DTGSDL W  C  C  S C          Q  P  L+ P  SST  
Sbjct: 106 VSVGTPATWFLVALDTGSDLFWLPCN-C-GSTCIRDLKEVGLSQSRPLNLYSPNTSSTSS 163

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTL-GSTTGQAVALPGI 202
           S+ CS  +C   ++ S    +C Y + Y    +F+ G L  + + L     G       I
Sbjct: 164 SIRCSDDRCFGSSRCSSPASSCPYQIQYLSKDTFTTGTLFEDVLHLVTEDEGLEPVKANI 223

Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDI---SLISQMRTTIAGKFSYCLVPVSST--KI 255
           T GCG N  G   S     G++GLG  D    S++++ + T A  FS C   +     +I
Sbjct: 224 TLGCGKNQTGFLQSSAAVNGLLGLGLKDYSVPSILAKAKIT-ANSFSMCFGNIIDVVGRI 282

Query: 256 NFGTNGIVSGPGVVSTPL--TKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLP 313
           +FG  G       + TPL  T+    Y +++  +SVG   +GV     + D+GT+ T L 
Sbjct: 283 SFGDKGYTDQ---METPLLPTEPSPTYAVSVTEVSVGGDAVGVQLL-ALFDTGTSFTHLL 338

Query: 314 QGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSR 368
           +     +       +  + +P+ DP    E CY      +    P V + F G      R
Sbjct: 339 EPEYGLITKAFDDHVTDKRRPI-DPELPFEFCYDLSPNKTTILFPRVAMTFEGGSQMFLR 397

Query: 369 SNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 420
           +  F+  +ED       GI  SV    NI+  NF+ GY    D E+  + +K +DC
Sbjct: 398 NPLFIVWNEDNSAMYCLGILKSVDFKINIIGQNFMSGYRIVFDRERMILGWKRSDC 453


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 87/286 (30%), Positives = 137/286 (47%), Gaps = 33/286 (11%)

Query: 51  YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           Y  LR    R L R+        +S   +   DI      Y  RIS+GTPP +     DT
Sbjct: 6   YHTLRKHDQRRLRRM----LPEVVSFPISGDNDIFAMGL-YYTRISLGTPPQQFYVDVDT 60

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSLPCSSSQCASLNQK-SCS--G 163
           GS++ W +C PC   + +  D P+    FDP+ S+T  S+ C+ ++C  LN+K  CS   
Sbjct: 61  GSNVAWVKCAPCTGCE-HSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
           ++C YS+ YGDGS + G    +  T     +  + A  G   + FGCG    G ++    
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS--VD 177

Query: 220 GIVGLGGGDISLISQM--RTTIAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAK 277
           G++G G   +SL +Q+  +      F++CL    S + +    G +  P +V TP+   +
Sbjct: 178 GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSL-VIGTIREPDLVYTPMVFGE 236

Query: 278 TFYVLTIDAISVGNQRLGVSTP---------DIVIDSGTTLTFLPQ 314
             Y   +  +++G     V+TP          ++IDSGTTLT+L Q
Sbjct: 237 DHY--NVQLLNIGISGRNVTTPASFDLEYTGGVIIDSGTTLTYLVQ 280


>gi|255545620|ref|XP_002513870.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223546956|gb|EEF48453.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 535

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 114/449 (25%), Positives = 185/449 (41%), Gaps = 39/449 (8%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHR--DSPKSPFYN-----SSET-----PYQRLRDA 57
           +LF +CF  +S   +    FS +LIHR  +  KS   +     SS+T      +Q L+  
Sbjct: 6   LLFVICFCFLSN-HSIGLTFSSKLIHRFSEEAKSLLISGNDNVSSQTWPNKNSFQYLQLL 64

Query: 58  LTRSLNR--LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSD 113
           L   L R  +    QN  +  S  S      N+ ++L    I IGTP    L   D GSD
Sbjct: 65  LDNDLKRQKMKLGAQNQLLFPSLGSHTFFYGNDLDWLHYTWIDIGTPNVSFLVALDAGSD 124

Query: 114 LIWTQCE--PCPPSQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNC 166
           L W  C+   C P    +     +D   + P +S+T + L C+   C   +        C
Sbjct: 125 LSWVPCDCIQCAPLSASLYKPLDRDLSEYRPSLSTTSRHLSCNHQLCELGSHCKNLKDPC 184

Query: 167 QYSVSYGDGSFSNGNLATE------TVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKT 218
            Y   Y D + S+     E      +V+  S + Q      +  GCG    G  L  +  
Sbjct: 185 PYIADYADPNTSSSGFLVEDILHLASVSDDSNSTQKRVQASVILGCGRKQTGGYLDGAAP 244

Query: 219 TGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKA 276
            G++GLG G IS+ S +     I   FS C     S  I FG  G  S       P    
Sbjct: 245 DGVMGLGPGSISVPSLLAKAGLIRKSFSLCFDVNGSGTILFGDQGHTSQKSTPLLPTQGN 304

Query: 277 KTFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
              Y++ +++  VGN  L  S    ++DSG + T+LP    + ++      + AQ ++  
Sbjct: 305 YDAYLIEVESYCVGNSCLKQSGFKALVDSGASFTYLPIDVYNKIVLEFDKQVNAQRISSQ 364

Query: 337 TGSLELCYSFNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSV 391
            G    CY+ +S  L  VP + + F     + +  S ++V  +++  + C   +    + 
Sbjct: 365 GGPWNYCYNTSSKQLDNVPAMRLSFLMNQSLLIHNSTYYVPQNQEFAVFCLTLQPTDLNY 424

Query: 392 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
            I G    T + V +D+E   + +  ++C
Sbjct: 425 GIIGQNYMTGYRVVFDMENLKLGWSSSNC 453


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score =  102 bits (253), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 104/373 (27%), Positives = 164/373 (43%), Gaps = 40/373 (10%)

Query: 74  ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           + S++    D +     Y  R+ IGTPP E   + DTGS + +  C  C  + C     P
Sbjct: 18  LGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSC--THCGNHQDP 75

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            F P +SS+YK L C  S+C++     C G   +Y   Y + S S+G L  + +  G + 
Sbjct: 76  RFSPALSSSYKPLEC-GSECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVI--GFSN 128

Query: 194 GQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM--RTTIAGKFSYCLVPV 250
              +    + FGC T   G L++    GI+GLG G +S+I Q+  +  +   FS C   +
Sbjct: 129 SSDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCYGGM 188

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKAKT-FYVLTIDAISVGNQRLGVSTPDI-------V 302
                     G      +V T     ++ +Y L +  I VG   L +  P++       V
Sbjct: 189 DEGGGAMILGGFQPPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLK-PEVFDGKYGTV 247

Query: 303 IDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ- 351
           +DSGTT  + P    Q + S +   + S+ E   V  P     ++CY+      ++LSQ 
Sbjct: 248 LDSGTTYAYFPGAAFQAFKSAVKEQVGSLKE---VPGPDEKFKDICYAGAGTNVSNLSQF 304

Query: 352 VPEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 407
            P V   F  G  V LS  N+     K+S      VF+   +   + G I+  N LV Y+
Sbjct: 305 FPSVDFVFGDGQSVTLSPENYLFRHTKISGAYCLGVFEN-GDPTTLLGGIIVRNMLVTYN 363

Query: 408 IEQQTVSFKPTDC 420
             + ++ F  T C
Sbjct: 364 RGKASIGFLKTKC 376


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 97/368 (26%), Positives = 170/368 (46%), Gaps = 35/368 (9%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 49  GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K  +   C Y + Y D + S G L  ++ +L     +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSL-PLRNK 162

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLVP 249
           +   P ++FGCG +      G   + T G++GLG G +SL+SQ++     K    +CL  
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222

Query: 250 VSSTKINFGTNGI-VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDIVIDSGTT 308
                + FG + +  S    VS   + +  +Y      +    + L     ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282

Query: 309 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 359
            T+   Q Y + + ++  S+ ++ + V+DP  SL LC+    +F S+S V +   ++ F 
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340

Query: 360 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 414
             + A + +   N+ +      VC  +  G     S  I G+I   + +V YD E+  + 
Sbjct: 341 FGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400

Query: 415 FKPTDCTK 422
           +    C++
Sbjct: 401 WIRGSCSR 408


>gi|356570895|ref|XP_003553619.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 470

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 116/445 (26%), Positives = 183/445 (41%), Gaps = 81/445 (18%)

Query: 44  YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTE 103
           ++S   P+  ++ A + SL R +H    ++ S S A+      +   Y I +++GTPP  
Sbjct: 41  HSSDSDPFHSVKLAASSSLTRAHHLKHRNNNSPSVATTPAYPKSYGGYSIDLNLGTPPQT 100

Query: 104 RLAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCAS 155
              V DTGS L+W  C     C  S C   +      P F PK SST K L C + +C  
Sbjct: 101 SPFVLDTGSSLVWFPCTSHYLC--SHCNFPNIDPTKIPTFIPKNSSTAKLLGCRNPKCGY 158

Query: 156 L---------------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           L                 ++CS     Y + YG G+ + G L  + +     T     +P
Sbjct: 159 LFGPDVESRCPQCKKPGSQNCSLTCPSYIIQYGLGA-TAGFLLLDNLNFPGKT-----VP 212

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSST 253
               GC      L   + +GI G G G  SL SQM      +FSYCLV       P SS 
Sbjct: 213 QFLVGCSI----LSIRQPSGIAGFGRGQESLPSQMNLK---RFSYCLVSHRFDDTPQSSD 265

Query: 254 KI-NFGTNGIVSGPGVVSTPLTKA-------KTFYVLTIDAISVGNQRLGVSTPDI---- 301
            +    + G     G+  TP           + +Y +T+  + VG   + +    +    
Sbjct: 266 LVLQISSTGDTKTNGLSYTPFRSNPSNNSVFREYYYVTLRKLIVGGVDVKIPYKFLEPGS 325

Query: 302 ------VIDSGTTLTFLPQG-YN---SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ 351
                 ++DSG+T TF+ +  YN      L  +      +   +    L  C++ + +  
Sbjct: 326 DGNGGTIVDSGSTFTFMERPVYNLVAQEFLRQLGKKYSREENVEAQSGLSPCFNISGVKT 385

Query: 352 V--PEVTIHFRGADVKLSRS--NFFVKVSE-DIVC-SVFKGITNSVP-------IYGNIM 398
           +  PE T  F+G   K+S+   N+F  V + +++C +V        P       I GN  
Sbjct: 386 ISFPEFTFQFKGG-AKMSQPLLNYFSFVGDAEVLCFTVVSDGGAGQPKTAGPAIILGNYQ 444

Query: 399 QTNFLVGYDIEQQTVSFKPTDCTKQ 423
           Q NF V YD+E +   F P +C ++
Sbjct: 445 QQNFYVEYDLENERFGFGPRNCKRK 469


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 95/326 (29%), Positives = 148/326 (45%), Gaps = 48/326 (14%)

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV--------NCQYSVSYGDG----SFSNG 180
           PL  P  SS+   + C    C  L +  CS V        NC Y  +YG+      ++ G
Sbjct: 13  PLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEG 72

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            L TET T G     A A PGI FGC   + G F + + G+VGLG G +SL++Q+     
Sbjct: 73  ILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTGS-GLVGLGRGKLSLVTQLNVE-- 126

Query: 241 GKFSYCLVPVSS--TKINFGTNGIVSGPG--------VVSTPLTKAKTFYVLTIDAISVG 290
             F Y L    S  + I+FG+   V+G          +++ P+ +   FY + +  ISVG
Sbjct: 127 -AFGYRLSSDLSAPSPISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVG 185

Query: 291 NQRLGV-----------STPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTG 338
            + + +               ++ DSGTTLT LP   Y      ++S M   +P      
Sbjct: 186 GKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAAND 245

Query: 339 SLELCYS-FNSLSQVPEVTIHFR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVP 392
              +C++  +S +  P + +HF  GAD+ LS  N+  ++     E   C      + ++ 
Sbjct: 246 DDLICFTGGSSTTTFPSMVLHFDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALT 305

Query: 393 IYGNIMQTNFLVGYDIE-QQTVSFKP 417
           I GNIMQ +F V +D+     + F+P
Sbjct: 306 IIGNIMQMDFHVVFDLSGNARMLFQP 331


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 125/441 (28%), Positives = 191/441 (43%), Gaps = 87/441 (19%)

Query: 46  SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN-ANYLIRISIGTPPTER 104
           SS+ P+  L    + SL+R +H    S  ++    +  + P +   Y I ++ GTPP   
Sbjct: 39  SSKKPWGSLNHLASLSLSRAHHIK--SPKTNFSLIKTPLFPRSYGGYSISLNFGTPPQTT 96

Query: 105 LAVADTGSDLIWTQCEP---CPPSQCYMQD-----SPLFDPKMSSTYKSLPCSSSQCASL 156
             V DTGS L+W  C     C  S+C   +      P F PK+SS+ K + C + +C+ +
Sbjct: 97  KFVMDTGSSLVWFPCTSRYLC--SECNFPNIKKTGIPTFLPKLSSSSKLIGCKNPRCSMI 154

Query: 157 N----QKSC-----SGVNCQ-----YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
                Q  C     +  NC      Y + YG GS + G L +ET+   +       +P  
Sbjct: 155 FGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDFPNKK----TIPDF 209

Query: 203 TFGCGTNNGGLFNSKT-TGIVGLGGGDISLISQMRTTIAGKFSYCLV-------PVSSTK 254
             GC      +F+ K   GI G G    SL SQ+      KFSYCLV       P SS  
Sbjct: 210 LVGC-----SIFSIKQPEGIAGFGRSPESLPSQLGLK---KFSYCLVSHAFDDTPTSSDL 261

Query: 255 I-NFGT-NGIVSGPGVVSTPLTKAKT-----FYVLTIDAISVGNQRLGVSTPDIV----- 302
           + + G+ +G+    G+  TP  K  T     +Y + +  I +G+  + V    +V     
Sbjct: 262 VLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHVKVPYKFLVPGTDG 321

Query: 303 -----IDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLS 350
                +DSGTT TF+     +         M+    A  + + TG L  CY+     SLS
Sbjct: 322 NGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTG-LRPCYNISGEKSLS 380

Query: 351 QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVP----------IYGNIMQ 399
            VP++   F+ GA + L  SN+F  V   ++C     ++++V           I GN  Q
Sbjct: 381 -VPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTI--VSDNVAGPGLGGGPAIILGNYQQ 437

Query: 400 TNFLVGYDIEQQTVSFKPTDC 420
            NF V +D+E +   FK   C
Sbjct: 438 RNFYVEFDLENEKFGFKQQSC 458


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 173/369 (46%), Gaps = 46/369 (12%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G+PP +     DTGSD++W  C     CP +         FDP  S T   + 
Sbjct: 81  YYTKIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVS 140

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS  +C+   Q S SG +     C Y+  YGDGS ++G   ++ +      G ++   + 
Sbjct: 141 CSDQRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNST 200

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT-IAGK-FSYCLVPVSSTK 254
             + FGC T+  G     +    GI G G   +S+ISQ+ +  +A + FS+CL      K
Sbjct: 201 APVVFGCSTSQTGDLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL------K 254

Query: 255 INFGTNGI-----VSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGVSTPDI-------- 301
              G  GI     +  P +V TPL  ++  Y + + +ISV  Q L ++ P +        
Sbjct: 255 GENGGGGILVLGEIVEPNMVFTPLVPSQPHYNVNLLSISVNGQALPIN-PSVFSTSNGQG 313

Query: 302 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIH 358
            +ID+GTTL +L +      +  +++ + +Q V         CY    S++ + P V+++
Sbjct: 314 TIIDTGTTLAYLSEAAYVPFVEAITNAV-SQSVRPVVSKGNQCYVIATSVADIFPPVSLN 372

Query: 359 FR-GADVKLSRSNFFVKVSE----DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 412
           F  GA + L+  ++ ++ +      + C  F+ I N  + I G+++  + +  YD+  Q 
Sbjct: 373 FAGGASMFLNPQDYLIQQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFVYDLVGQR 432

Query: 413 VSFKPTDCT 421
           + +   DC+
Sbjct: 433 IGWANYDCS 441


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 157/360 (43%), Gaps = 37/360 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           Y ++ +IG PP       DTGSDL W QC+ PC   QC     PL+ P    T   + C 
Sbjct: 67  YHVQFNIGQPPKPYFLDPDTGSDLTWLQCDAPC--IQCTPAPHPLYQP----TNDLVVCK 120

Query: 150 SSQCASL---NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
              CASL   N +      C Y V Y DG  S G L  +   +  T+G   A P +T GC
Sbjct: 121 DPICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMR-ARPRLTIGC 179

Query: 207 GTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT--IAGKFSYCLVPVSSTKINFGTNGIV 263
           G +   G+      G++GLG G  S+++Q+ +   +     +C        + FG + I 
Sbjct: 180 GYDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDD-IY 238

Query: 264 SGPGVVSTPLTKAK-TFYVLTIDAISVGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 322
               V+ TP+++     Y      + +  +  G+    +V DSG++ T+        LLS
Sbjct: 239 DSSKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLS 298

Query: 323 VMSSMIEAQPVADPT--GSLELCYSFNS-LSQVPEVTIHFR------GADVKLSRSNFFV 373
            +   +  +P+ +     +L +C+        + +   +F+      G+  K ++S F +
Sbjct: 299 FIKKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWK-TKSQFEI 357

Query: 374 KVSEDIVC----SVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +    ++     SV  GI N          I G+I     LV YD E+Q + ++P++C +
Sbjct: 358 QQESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDR 417


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 169/400 (42%), Gaps = 77/400 (19%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
           YLI +++GTPP       DTGSDL W  C      C     Y  +               
Sbjct: 29  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 88

Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
                SPL     SS     PC+ + C  ++L + +C      ++ +YG G    G L  
Sbjct: 89  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 148

Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           +T+T  GS+      +P   FGC     G    +  GI G G G +SL SQ+     G F
Sbjct: 149 DTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG-F 203

Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQR 293
           S+C +       P  S+ +  G   I S   +  T L K      +Y + ++AI+VGN  
Sbjct: 204 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 263

Query: 294 LGVSTPD------------IVIDSGTTLTFLPQGYNSNLLSVMSSMI---EAQPVADPTG 338
             +  P             ++IDSGTT T LP  + + LLS++ S+I    AQ     TG
Sbjct: 264 -AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTG 322

Query: 339 SLELCYSFNSLSQV--------PEVTIHF-RGADVKLSRSNFFVKV-----SEDIVCSVF 384
             +LCY     + V        P ++ HF     + L + N F  +     S  + C + 
Sbjct: 323 -FDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLL 381

Query: 385 KGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + + +S      ++G+  Q N  V YD+E++ + F+P DC
Sbjct: 382 QNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 421


>gi|449455475|ref|XP_004145478.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449518962|ref|XP_004166504.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 449

 Score =  101 bits (252), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 119/417 (28%), Positives = 173/417 (41%), Gaps = 93/417 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT-------QCEPCPPSQCYMQDSPL--FDPKMSS 141
           YL+ +SIGTPP       DTGSDL W         C+ C   Q  +    L  F P  SS
Sbjct: 21  YLMSLSIGTPPQVVQVYMDTGSDLTWVPCGNLSFDCQDCEEYQNNISGPRLAAFLPTHSS 80

Query: 142 TYKSLPCSSSQC--------------------ASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
           T     C SS C                    ASL + +C      ++ +YG      G+
Sbjct: 81  TSIRDTCGSSFCMDIHSSDNPFDPCTIAGCSLASLVKGTCPRPCPSFAYTYGASGVVTGS 140

Query: 182 LATETV-TLGSTTGQAVA---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
           L  + + T G+          +P   FGC     G    +  GI G G G +SL  Q+  
Sbjct: 141 LTRDVLFTHGNYNNNNNNNKQIPRFCFGC----VGATYREPIGIAGFGRGLLSLPFQLGF 196

Query: 238 TIAGKFSYCLVPVS-STKINFGTNGIVSGPGVVS-------TPLTKA---KTFYVLTIDA 286
           +  G FS+C +P   S   NF +  I+    + S       TPL K+     +Y + +++
Sbjct: 197 SHKG-FSHCFLPFKFSNNPNFSSPLILGNLAISSKDENLQFTPLLKSPMYPNYYYIGLES 255

Query: 287 ISVGNQ----RLGVS----------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI---E 329
           I++GN     R GVS             ++IDSGTT T LP+   S L+S +  +I    
Sbjct: 256 ITIGNGDNNFRFGVSFKLREIDTKGNGGMLIDSGTTYTHLPEPLYSQLISNLELVIGYPR 315

Query: 330 AQPVADPTGSLELCY---------SFNSLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDI 379
           A+ V   TG  +LCY         SF   +Q+P +T HF     V L + N F  ++  I
Sbjct: 316 AKQVELNTG-FDLCYKVPCKNNNSSFVDDAQLPSITFHFLNNVSVVLPQGNNFYAMAAPI 374

Query: 380 VCSVFKGI----------------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
             +V K +                     I+G+  Q N  V YD+E++ + F+P DC
Sbjct: 375 NSTVVKCLLYQSMDGVGDDNDSDDNGPAGIFGSFQQQNIEVVYDLEKERLGFQPMDC 431


>gi|15242307|ref|NP_199325.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|9758987|dbj|BAB09497.1| chloroplast nucleoid DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|332007824|gb|AED95207.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 491

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 106/409 (25%), Positives = 172/409 (42%), Gaps = 92/409 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCY------MQDSPLFDPKMS 140
           YLI ++IGTPP       DTGSDL W  C      C   +CY      ++   +F P  S
Sbjct: 83  YLITLNIGTPPQAVQVYLDTGSDLTWVPCGNLSFDC--IECYDLKNNDLKSPSVFSPLHS 140

Query: 141 STYKSLPCSSSQCASLNQKS-----CSGVNC---------------QYSVSYGDGSFSNG 180
           ST     C+SS C  ++        C+   C                ++ +YG+G   +G
Sbjct: 141 STSFRDSCASSFCVEIHSSDNPFDPCAVAGCSVSMLLKSTCVRPCPSFAYTYGEGGLISG 200

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            L  + +       +   +P  +FGC T+       +  GI G G G +SL SQ+     
Sbjct: 201 ILTRDIL-----KARTRDVPRFSFGCVTST----YREPIGIAGFGRGLLSLPSQLGFLEK 251

Query: 241 GKFSYCLVPVSSTKINFGTNGIVSGPGVVSTPLTKAKTF------------YVLTIDAIS 288
           G FS+C +P         ++ ++ G   +S  LT +  F            Y + +++I+
Sbjct: 252 G-FSHCFLPFKFVNNPNISSPLILGASALSINLTDSLQFTPMLNTPMYPNSYYIGLESIT 310

Query: 289 VGNQRLGVSTP------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 336
           +G        P             +++DSGTT T LP+ + S LL+ + S I   P A  
Sbjct: 311 IGTNITPTQVPLTLRQFDSQGNGGMLVDSGTTYTHLPEPFYSQLLTTLQSTI-TYPRATE 369

Query: 337 TGS---LELCYSF----NSLSQV--------PEVTIHF-RGADVKLSRSNFFVKVSED-- 378
           T S    +LCY      N+L+ +        P +T HF   A + L + N F  +S    
Sbjct: 370 TESRTGFDLCYKVPCPNNNLTSLENDVMMIFPSITFHFLNNATLLLPQGNSFYAMSAPSD 429

Query: 379 ---IVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
              + C +F+ + +       ++G+  Q N  V YD+E++ + F+  DC
Sbjct: 430 GSVVQCLLFQNMEDGDYGPAGVFGSFQQQNVKVVYDLEKERIGFQAMDC 478


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 111/400 (27%), Positives = 169/400 (42%), Gaps = 77/400 (19%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQD--------------- 131
           YLI +++GTPP       DTGSDL W  C      C     Y  +               
Sbjct: 12  YLISLNLGTPPKVIQVYMDTGSDLTWVPCGNLSFDCMDCNDYRNNKLMSTYSPSYSSSSL 71

Query: 132 -----SPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLAT 184
                SPL     SS     PC+ + C  ++L + +C      ++ +YG G    G L  
Sbjct: 72  RDLCVSPLCSDVHSSDNSYDPCAVAGCSLSTLVKGTCPRPCPSFAYTYGAGGVVIGTLTR 131

Query: 185 ETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKF 243
           +T+T  GS+      +P   FGC     G    +  GI G G G +SL SQ+     G F
Sbjct: 132 DTLTTHGSSPSFTREVPNFCFGC----VGSTYREPIGIAGFGRGVLSLPSQLGFLQKG-F 186

Query: 244 SYCLV-------PVSSTKINFGTNGIVSGPGVVSTPLTK---AKTFYVLTIDAISVGNQR 293
           S+C +       P  S+ +  G   I S   +  T L K      +Y + ++AI+VGN  
Sbjct: 187 SHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTSLLKNPMYPNYYYIGLEAITVGNAT 246

Query: 294 LGVSTPD------------IVIDSGTTLTFLPQGYNSNLLSVMSSMI---EAQPVADPTG 338
             +  P             ++IDSGTT T LP  + + LLS++ S+I    AQ     TG
Sbjct: 247 -AIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYTQLLSMLQSIITYPRAQEQEARTG 305

Query: 339 SLELCYSFNSLSQV--------PEVTIHF-RGADVKLSRSNFFVKV-----SEDIVCSVF 384
             +LCY     + V        P ++ HF     + L + N F  +     S  + C + 
Sbjct: 306 -FDLCYRIPCPNNVVTDHDHLLPSISFHFSNNVSLVLPQGNHFYAMGAPSNSTVVKCLLL 364

Query: 385 KGITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 420
           + + +S      ++G+  Q N  V YD+E++ + F+P DC
Sbjct: 365 QNMDDSDSGPAGVFGSFQQQNVKVVYDLEKERIGFQPMDC 404


>gi|255565531|ref|XP_002523756.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223537060|gb|EEF38696.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 507

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 170/367 (46%), Gaps = 37/367 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP +     DTGSD++W   + C  CP +         FDP  S+T   + 
Sbjct: 84  YFTRVQLGSPPKDFYVQIDTGSDVLWVSCSSCNGCPVTSGLQIPLTFFDPGSSTTAALVS 143

Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGS---TTGQAVAL 199
           CS  +C +  Q S   CS     C Y+  YGDGS ++G    + + L +   ++G+   +
Sbjct: 144 CSDQRCTAGIQSSDSLCSSRTNQCGYTFQYGDGSGTSGYYVADLMHLDTLLLSSGELSQI 203

Query: 200 -----PGITFGCGT-NNGGLFNSKTT--GIVGLGGGDISLISQMRT--TIAGKFSYCLVP 249
                  ++F C T   G L  S     GI G G  ++S+ISQ+ +       FS+CL  
Sbjct: 204 CQTYDSSVSFMCSTLQTGDLTKSDRAVDGIFGFGQQEMSVISQLASQGITPRVFSHCLKG 263

Query: 250 VSSTKINFGTNGIVSGPGVVSTPLTKAKTFYVLTIDAISVGNQRLGV--------STPDI 301
             S         IV  P +V TPL  ++  Y L + +ISV  Q L +        S    
Sbjct: 264 DDSGGGVLVLGEIVE-PNIVYTPLVPSQPHYNLYLQSISVAGQTLAIDPSVFGASSNQGT 322

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFR 360
           ++DSGTTL +L +G     +S ++S++        +   +     +S++ V P+V+++F 
Sbjct: 323 IVDSGTTLAYLAEGAYDPFVSAITSVVSLNARTYLSKGNQCYLVTSSVNDVFPQVSLNFA 382

Query: 361 -GADVKLSRSNFFVKVSE----DIVCSVF-KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 414
            GA + L+  ++ ++ +      + C  F K     + I G+++  + +  YDI  Q V 
Sbjct: 383 GGASLILNPQDYLLQQNSVGGAAVWCVGFQKTPGQQITILGDLVLKDKIFVYDIANQRVG 442

Query: 415 FKPTDCT 421
           +   DC+
Sbjct: 443 WTNYDCS 449


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 120/430 (27%), Positives = 186/430 (43%), Gaps = 48/430 (11%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
           + Q  G ++E+ H  SP SPF  S    +     +L+      L  L       SI    
Sbjct: 27  DTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRSIVP-I 85

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           AS   II  +  Y++R  IGTPP   L   DT +D  W  C  C    C    S LF P+
Sbjct: 86  ASGRQII-QSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTAC--DGC---TSTLFAPE 139

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+T+K++ C S +C  +   SC    C ++++YG  S +  N+  +TVTL +       
Sbjct: 140 KSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD-----P 193

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGKFSYCLVPVSSTKINFG 258
           +PG TFGC     G  ++   G++GLG G +SL+SQ +      FSYCL    S  +NF 
Sbjct: 194 IPGYTFGCVAKTTGP-STPPQGLLGLGRGPLSLLSQTQNLYQSTFSYCLPSFKS--LNFS 250

Query: 259 TN---GIVSGP-GVVSTPLTK---AKTFYVLTIDAISVGNQRLGVSTPDI---------- 301
            +   G V+ P  +  TPL K     + Y + + AI VG + + +    +          
Sbjct: 251 GSLRLGPVAQPIRIKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGT 310

Query: 302 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNSLSQVPEVTI 357
           V DSGT  T L     + +       +     A+ T    G  + CY+   ++  P +T 
Sbjct: 311 VFDSGTVFTRLVAPVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIVA--PTITF 368

Query: 358 HFRGADVKLSRSNFFVKVSED-----IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 412
            F G +V L + N  +  +        + S    + + + +  N+ Q N  V YD+    
Sbjct: 369 MFSGMNVTLPQDNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSR 428

Query: 413 VSFKPTDCTK 422
           +      CTK
Sbjct: 429 LGVARELCTK 438


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score =  101 bits (251), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 104/371 (28%), Positives = 164/371 (44%), Gaps = 41/371 (11%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + ++ P+   Y   I +G PP       DTGSDL W QC+ PC  + C     PL+ P  
Sbjct: 182 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 236

Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +  K +P   S C  L  +Q  C     C Y + Y D S S G LA + + L +T G  
Sbjct: 237 -AKEKIVPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGGR 295

Query: 197 VALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM--RTTIAGKFSYCLV-PV 250
             L    FGC  +  G   S   KT GI+GL    ISL SQ+  +  I+  F +C+    
Sbjct: 296 EKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354

Query: 251 SSTKINFGTNGIVSGPGVVSTPLTKA-KTFYVLTIDAISVGNQRLGV-STPDIVIDSGTT 308
           +     F  +  V   G+   P+       Y      ++ G+Q L   ++  ++ DSG++
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414

Query: 309 LTFLPQGYNSNLLSVMSSMIEAQP--VADPTG-SLELCYS--FNSLSQVPEVTIHFRGAD 363
            T+LP+    NL+  +    E  P  V D +  +L LC+   F+  S    + +HF    
Sbjct: 415 YTYLPEEMYKNLIDAIK---EDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471

Query: 364 VKLSRSNFFVKVSEDIVC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQ 411
             + ++  F  V +D +      +V  G+ N       S  I G++     LV YD E++
Sbjct: 472 FVVPKT--FTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 529

Query: 412 TVSFKPTDCTK 422
            + +  ++CTK
Sbjct: 530 QIGWANSECTK 540


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 168/376 (44%), Gaps = 48/376 (12%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P   
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCNKVPHPLYRP--- 103

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           +  K +PC    C+SL+     +  C      C Y + Y D   S G L T++  +    
Sbjct: 104 TKNKIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAV-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRTTIAGK--FSYCLV 248
             ++  P + FGCG +         + T G++GLG G ISL+SQ++     K    +CL 
Sbjct: 163 NSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLS 222

Query: 249 PVSSTKINFGTNGIVSGPGVVSTPLTKA--KTFYVLTIDAISVGNQRLGVSTPDIVIDSG 306
                 + FG N +V        P+ ++  K +Y     ++  G + LGV   ++V+DSG
Sbjct: 223 IRGGGFLFFGDN-LVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSG 281

Query: 307 TTLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VT 356
           ++ T+   Q Y + + ++ S + +  + V DP  SL LC+     F S+  V +    + 
Sbjct: 282 SSFTYFGAQPYQALVTALKSDLSKTLKEVFDP--SLPLCWKGKKPFKSVLDVKKEFKSLV 339

Query: 357 IHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGY 406
           + F   + A +++   N+ +       C    GI N        + I G+I   + +V Y
Sbjct: 340 LSFSNGKKALMEIPPENYLIVTKFGNAC---LGILNGSEIGLKDLNIVGDITMQDQMVIY 396

Query: 407 DIEQQTVSFKPTDCTK 422
           D E+  + +    C +
Sbjct: 397 DNERGQIGWIRAPCDR 412


>gi|226495677|ref|NP_001146995.1| pepsin A precursor [Zea mays]
 gi|195606284|gb|ACG24972.1| pepsin A [Zea mays]
          Length = 504

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 112/425 (26%), Positives = 171/425 (40%), Gaps = 107/425 (25%)

Query: 88  NANYLIRISIGTPPTERLAVA---DTGSDLIWTQCEPCPPSQCYM--------QDSPLFD 136
            ++Y + +S+G P +    V+   DTGSDL+W    PC P  C +        +  PL  
Sbjct: 87  GSDYTLSLSVG-PASAAAPVSLFLDTGSDLVWF---PCAPFTCMLCEGKPTPGRSGPLPP 142

Query: 137 PKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYS-----------------VSYGD 174
           P  S   + +PC+S  C++ +  +     C+   C                     +YGD
Sbjct: 143 PPDS---RRIPCASPLCSAAHASAPPSDLCAAARCPLEDIETGSCGASHACPPLYYAYGD 199

Query: 175 GSFSNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           GS    +L    V LG+      AVA+   TF C     G    +  G+ G G G +SL 
Sbjct: 200 GSLV-AHLRRGRVALGAGARASVAVAVDNFTFACAHTALG----EPVGVAGFGRGPLSLP 254

Query: 233 SQMRTTIAGKFSYCLV--------------------PVSSTKINFGTNGIVSGPGVVSTP 272
            Q+   ++G+FSYCLV                    P  +      T+G V  P ++  P
Sbjct: 255 GQLSPQLSGRFSYCLVSHSFRADRLIRPSPLILGRSPDDADAAAAETDGFVYTP-LLHNP 313

Query: 273 LTKAKTFYVLTIDAISVGNQRLGVSTPDI-----------VIDSGTTLTFLPQGYNSNLL 321
             K   FY + ++A+SVG  R+  + P++           V+DSGTT T LP    + + 
Sbjct: 314 --KHPYFYSVALEAVSVGAARI-QARPELARVDRAGNGGMVVDSGTTFTMLPNEMYARVA 370

Query: 322 SVMSSMIEAQPV-----ADPTGSLELCYSFNSLSQ-VPEVTIHFRG-ADVKLSRSNFFVK 374
              +  + A        A+    L  CY + +  + VP + +HFRG A V L R N+F+ 
Sbjct: 371 EAFARAMAAAGFARAERAEEQTGLTPCYRYAASDRGVPPLALHFRGNATVALPRRNYFMG 430

Query: 375 V----------SEDIVCSVF------KGITNSVPI--YGNIMQTNFLVGYDIEQQTVSFK 416
                       +D+ C +        G     P    GN  Q  F V YD++   V F 
Sbjct: 431 FKSEDAGAGTRKDDVGCLMLMNGGDASGEEGDGPAGTLGNFQQQGFEVVYDVDAGRVGFA 490

Query: 417 PTDCT 421
              CT
Sbjct: 491 RRRCT 495


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score =  101 bits (251), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 123/475 (25%), Positives = 196/475 (41%), Gaps = 65/475 (13%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
           A  L  + +  + CFY  S +  Q  G   E   R+  +S   P Y  +           
Sbjct: 83  ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140

Query: 48  -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
            +   +R+ D   ++ NR+      ++ ++S A    + ++ P+   Y   I IG PP  
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199

Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
                DTGSDL W QC+ PC  +       PL+ P   +  K +P     C  L  NQ  
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNFAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254

Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
           C     C Y + Y D S S G LA + + + +T G    L    FGC  +  G   S   
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313

Query: 217 KTTGIVGLGGGDISLISQMRT--TIAGKFSYCLV-PVSSTKINFGTNGIVSGPGVVSTPL 273
           KT GI+GL    IS  SQ+ +   IA  F +C+          F  +  V   GV  T +
Sbjct: 314 KTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGGGYMFLGDDYVPRWGVTWTSI 373

Query: 274 TKA-KTFYVLTIDAISVGNQRL-----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVM--S 325
                  Y      +  G+Q+L       ST  ++ DSG++ T+LP     NL++ +  +
Sbjct: 374 RSGPDNLYHTQAHHVKYGDQQLRRPEQAGSTVQVIFDSGSSYTYLPNEIYENLVAAIKYA 433

Query: 326 SMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVS-----EDI 379
           S    Q  +D T  L LC+  +  +  + +V   F   ++   +   F+  +     ED 
Sbjct: 434 SPGFVQDTSDRT--LPLCWKADFPVRYLEDVKQFFEPLNLHFGKKWLFMSKTFTISPEDY 491

Query: 380 VC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 422
           +      +V  G+ N       S  I G++     LV YD +++ + +  +DCTK
Sbjct: 492 LIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNQRKQIGWADSDCTK 546


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.132    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,634,089,967
Number of Sequences: 23463169
Number of extensions: 284146421
Number of successful extensions: 745117
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1191
Number of HSP's successfully gapped in prelim test: 3638
Number of HSP's that attempted gapping in prelim test: 734244
Number of HSP's gapped (non-prelim): 5903
length of query: 423
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 278
effective length of database: 8,957,035,862
effective search space: 2490055969636
effective search space used: 2490055969636
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 78 (34.7 bits)