BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017265
         (374 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  335 bits (860), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 195/443 (44%), Positives = 260/443 (58%), Gaps = 73/443 (16%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA  +S + I+  +    + PI+A   GF+VELI+RDSPKSPFYN  ETP QR+  A+ R
Sbjct: 1   MAASVSLLAIVTLIFSGTLVPIDAAKDGFTVELINRDSPKSPFYNPRETPTQRIVSAVRR 60

Query: 61  SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           S++R++HF+  +NS I +  A Q+++I N   YL++ S+GTP  + LA+ADTGSDLIWTQ
Sbjct: 61  SMSRVHHFSPTKNSDIFTDTA-QSEMISNQGEYLMKFSLGTPAFDILAIADTGSDLIWTQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG---VNCQYSVSYGD 174
           C+PC   QCY QD+PLFDPK SSTY+ + CS+ QC  L +  SCSG     C YS SYGD
Sbjct: 120 CKPC--DQCYEQDAPLFDPKSSSTYRDISCSTKQCDLLKEGASCSGEGNKTCHYSYSYGD 177

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK----------------- 217
            SF++GN+A +T+TLGST+G+ V LP    GCG NNGG F  K                 
Sbjct: 178 RSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHNNGGSFTEKGSGIVGLGGGPISLISQ 237

Query: 218 -----------------------------TTGIVGLGG-GDISLISQ---------MRTT 238
                                        + GIV  GG     LIS+         +   
Sbjct: 238 LGSTIDGKFSYCLVPLSSNATNSSKLNFGSNGIVSGGGVQSTPLISKDPDTFYFLTLEAV 297

Query: 239 IAGNQRL-------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
             G++R+       G S  +I+IDSGTTLT  P+ + S L S +   +   PV DP+G L
Sbjct: 298 SVGSERIKFPGSSFGTSEGNIIIDSGTTLTLFPEDFFSELSSAVQDAVAGTPVEDPSGIL 357

Query: 292 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
            LCYS ++  + P +T HF GADVKL+  N FV+VS+ ++C  F  I NS  I+GN+ Q 
Sbjct: 358 SLCYSIDADLKFPSITAHFDGADVKLNPLNTFVQVSDTVLCFAFNPI-NSGAIFGNLAQM 416

Query: 352 NFLVGYDIEQQTVSFKPTDCTKQ 374
           NFLVGYD+E +TVSFKPTDCT+ 
Sbjct: 417 NFLVGYDLEGKTVSFKPTDCTQD 439


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  316 bits (809), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 196/413 (47%), Positives = 242/413 (58%), Gaps = 70/413 (16%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-ASQADIIP 86
           GF+ +LIHRDSPKSPFYN +ET  QRLR+A+ RS++R+ HF   S   +S  A Q D+  
Sbjct: 30  GFTADLIHRDSPKSPFYNPTETSSQRLRNAIHRSVSRVFHFTDISQKDASDNAPQIDLTS 89

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N+  YL+ IS+GTPP   +A+ADTGSDL+WTQC+PC    CY Q  PLFDPK SSTYK +
Sbjct: 90  NSGEYLMNISLGTPPFPIMAIADTGSDLLWTQCKPC--DDCYTQVDPLFDPKASSTYKDV 147

Query: 147 PCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            CSSSQC +L NQ SCS  +  C YS SYGD S++ GN+A +T+TLGST  + V L  I 
Sbjct: 148 SCSSSQCTALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNII 207

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------NQRLG----- 246
            GCG NN G FN K +GIVGLGGG +SLI+Q+  +I G            N R       
Sbjct: 208 IGCGHNNAGTFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLVPLTSENDRTSKINFG 267

Query: 247 ----------VSTPDIVI-------------------------DSG-----------TTL 260
                     VSTP I                           DSG           TTL
Sbjct: 268 TNAVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSDSGSGEGNIIIDSGTTL 327

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRS 320
           T LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF GADV L  S
Sbjct: 328 TLLPTEFYSELEDAVASSIDAEKKQDPQTGLSLCYSATGDLKVPAITMHFDGADVNLKPS 387

Query: 321 NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           N FV++SED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTDC K
Sbjct: 388 NCFVQISEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTDCAK 439


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  307 bits (786), Expect = 6e-81,   Method: Compositional matrix adjust.
 Identities = 176/438 (40%), Positives = 247/438 (56%), Gaps = 86/438 (19%)

Query: 9   FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            +LF+LC  FY    +EA  GGFSVE+IHRDS +SPF+  +ET +QR+ +A+ RS+NR N
Sbjct: 11  LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFRPTETQFQRVANAVHRSVNRAN 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           HF++     + KA++A I  N+  YLI  S+G PP +   + DTGSD+IW QC+PC   +
Sbjct: 67  HFHK-----AHKAAKATITQNDGEYLISYSVGIPPFQLYGIIDTGSDMIWLQCKPC--EK 119

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLA 183
           CY Q + +FDP  S+TYK LP SS+ C S+   SCS  N   C+Y++ YGDGS+S G+L+
Sbjct: 120 CYNQTTRIFDPSKSNTYKILPFSSTTCQSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLS 179

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG----------------- 226
            ET+TLGST G +V       GCG NN   F  K++GIVGLG                  
Sbjct: 180 VETLTLGSTNGSSVKFRRTVIGCGRNNTVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIG 239

Query: 227 -------------------GDISLIS--------------------QMRTTIAGNQRLGV 247
                              GD +++S                     +     GN R+  
Sbjct: 240 RKFSYCLASMSNISSKLNFGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEF 299

Query: 248 STP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SF 297
           ++         +I+IDSGTTLT LP    S L S ++ ++E   V DP   L LCY  +F
Sbjct: 300 TSSSFRFGEKGNIIIDSGTTLTLLPNDIYSKLESAVADLVELDRVKDPLKQLSLCYRSTF 359

Query: 298 NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVG 356
           + L+  P +  HF GADVKL+  N F++V + + C  F  I++ + PI+GN+ Q NFLVG
Sbjct: 360 DELN-APVIMAHFSGADVKLNAVNTFIEVEQGVTCLAF--ISSKIGPIFGNMAQQNFLVG 416

Query: 357 YDIEQQTVSFKPTDCTKQ 374
           YD++++ VSFKPTDC+KQ
Sbjct: 417 YDLQKKIVSFKPTDCSKQ 434


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  304 bits (778), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 191/429 (44%), Positives = 246/429 (57%), Gaps = 67/429 (15%)

Query: 10  ILFFLCFY---VVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           +L  LC +   ++S + A+   GF+ +LIHRDSPKSPFYN +ETP QR+R+A+ RS NR+
Sbjct: 8   VLLSLCLFSSHILSNVNAKPKLGFTTDLIHRDSPKSPFYNPAETPSQRIRNAIHRSFNRV 67

Query: 66  NHFNQNSSISSSKAS-QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
           +HF   S + +S  S Q DI P    YL+ +S+GTPP+  +AVADTGS+LIWTQC+PC  
Sbjct: 68  SHFTDLSEMDASLNSPQTDITPCGGEYLMNLSLGTPPSPIMAVADTGSNLIWTQCKPC-- 125

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGN 181
             CY Q  PLFDPK SSTYK + CSSSQC +L NQ SCS  +  C Y VSY DGS++ G 
Sbjct: 126 DDCYTQVDPLFDPKASSTYKDVSCSSSQCTALENQASCSTEDKTCSYLVSYADGSYTMGK 185

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            A +T+TLGST  + V L  I  GCG NN   F +K++G+VGLGGG +SLI Q+  +I G
Sbjct: 186 FAVDTLTLGSTDNRPVQLKNIIIGCGQNNAVTFRNKSSGVVGLGGGAVSLIKQLGDSIDG 245

Query: 242 ---------NQRLG---------------VSTPDIV------------------------ 253
                    N +                 VSTP +V                        
Sbjct: 246 KFSYCLVPENDQTSKINFGTNAVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTP 305

Query: 254 ---------IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP 304
                    IDSGTTLT LP  Y   + + ++S+I A    D      LCY+  +   +P
Sbjct: 306 DSNIKGNMVIDSGTTLTLLPVKYYIEIENAVASLINADKSKDERIGSSLCYNATADLNIP 365

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            +T+HF GADVKL   N F KV+ED+VC  F        IYGN+ Q NFLVGYD   +T+
Sbjct: 366 VITMHFEGADVKLYPYNSFFKVTEDLVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTM 425

Query: 365 SFKPTDCTK 373
           SFKPTDC K
Sbjct: 426 SFKPTDCAK 434


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  300 bits (768), Expect = 8e-79,   Method: Compositional matrix adjust.
 Identities = 166/408 (40%), Positives = 242/408 (59%), Gaps = 53/408 (12%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +  F + V + F   F ++    A+ GGFSV+LIHRDSP SPF++ S+T  +RL DA  R
Sbjct: 6   VKIFFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRR 63

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S++R+  F   +   +S   Q+ I+P+   YL+ + IGTPP   +A+ DTGSDL WTQC 
Sbjct: 64  SVSRVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCR 121

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFS 178
           PC  + CY Q  PLFDPK SSTY+   C +S C +L + +SCS    C +  SY DGSF+
Sbjct: 122 PC--THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFT 179

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            GNLA+ET+T+ ST G+ V+ PG  FGCG ++GG+F+  ++GIVGLGGG++SLISQ+++T
Sbjct: 180 GGNLASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKST 239

Query: 239 IAG---------------NQRLG------------VSTP-----------------DIVI 254
           I G               + R+             VSTP                 +I++
Sbjct: 240 INGLFSYCLLPVSTDSSISSRINFGASGRVSGYGTVSTPLRLPYKGYSKKTEVEEGNIIV 299

Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD 314
           DSGTT TFLPQ + S L   +++ I+ + V DP G   LCY+  +    P +T HF+ A+
Sbjct: 300 DSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEINAPIITAHFKDAN 359

Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           V+L   N F+++ ED+VC      T+ + + GN+ Q NFLVG+D+ ++
Sbjct: 360 VELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRKK 406



 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 53/142 (37%), Positives = 86/142 (60%), Gaps = 6/142 (4%)

Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
            +R     +++  V   +I++DSGTT T+LP  +   L   ++  I+ + V DP G   L
Sbjct: 402 DLRKKRGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGISSL 461

Query: 294 CYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQ 350
           CY+  ++ Q+  P +T HF+ A+V+L   N F+++ ED+VC +V    T+ + I GN+ Q
Sbjct: 462 CYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGNLAQ 518

Query: 351 TNFLVGYDIEQQTVSFKPTDCT 372
            NFLVG+D+ ++ VSFK  DCT
Sbjct: 519 VNFLVGFDLRKKRVSFKAADCT 540


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 179/432 (41%), Positives = 237/432 (54%), Gaps = 75/432 (17%)

Query: 9   FILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            +LF+LC  FY    +EA  GGFSVE+IHRDS +SPF++ +ET +QR+ +A+ RS+NR N
Sbjct: 11  LVLFYLCNIFY----LEAFNGGFSVEMIHRDSSRSPFFSPTETQFQRVANAVHRSINRAN 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           H NQ  S  S  + +  +I     YLI  S+GTP  +   + DTGSD+IW QC+PC   +
Sbjct: 67  HLNQ--SFVSPNSPETTVISALGEYLISYSVGTPSLQVFGILDTGSDIIWLQCQPC--KK 122

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATE 185
           CY Q +P+FD   S TYK+LPC S+ C S+    CS   +C YS+ Y DGS S G+L+ E
Sbjct: 123 CYEQTTPIFDSSKSQTYKTLPCPSNTCQSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVE 182

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN--- 242
           T+TLGST G  V  PG   GCG  N      K +GIVGLG G +SLI+Q+  +  G    
Sbjct: 183 TLTLGSTNGSPVQFPGTVIGCGRYNAIGIEEKNSGIVGLGRGPMSLITQLSPSTGGKFSY 242

Query: 243 ----------------------QRLGVSTP------------------------------ 250
                                  R  VSTP                              
Sbjct: 243 CLVPGLSTASSKLNFGNAAVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPG 302

Query: 251 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSL-SQ 302
                +I+IDSGTTLT LP G  S L + ++  +  Q V DP   L LCY    + L + 
Sbjct: 303 SGGKGNIIIDSGTTLTALPNGVYSKLEAAVAKTVILQRVRDPNQVLGLCYKVTPDKLDAS 362

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           VP +T HF GADV L+  N FV+V++D+VC  F+  T +  ++GN+ Q N LVGYD++  
Sbjct: 363 VPVITAHFSGADVTLNAINTFVQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMN 421

Query: 363 TVSFKPTDCTKQ 374
           TVSFK TDCTKQ
Sbjct: 422 TVSFKHTDCTKQ 433


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  297 bits (760), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 172/429 (40%), Positives = 239/429 (55%), Gaps = 72/429 (16%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           LF LCF + S   A + GFSVELIHRDSPKSP+Y  +E  YQ   DA  RS+NR NHF +
Sbjct: 11  LFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARRSINRANHFFK 69

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +S  S+ +++   +IP+   YL+  S+GTPPT+   +ADTGSD++W QCEPC   QCY Q
Sbjct: 70  DSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCEPC--EQCYNQ 124

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTL 189
            +P+F+P  SS+YK++PCSS  C S+   SCS  N CQY +SYGD S S G+L+ +T++L
Sbjct: 125 TTPIFNPSKSSSYKNIPCSSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQGDLSVDTLSL 184

Query: 190 GSTTGQAVALPGITFGC-----------------------------GTNNGG-------- 212
            ST+G  V+ P I  GC                             G++ GG        
Sbjct: 185 ESTSGSPVSFPKIVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLVP 244

Query: 213 LFNSKTTGIVGLGGGDISLIS-------------------QMRTTIAGNQRL-------- 245
           L N ++     L  GD +++S                    ++    GN+R+        
Sbjct: 245 LLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG 304

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVP 304
           G    +I+IDSGTTLT +P    +NL S +  +++   V DP     LCYS  S     P
Sbjct: 305 GDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCYSLKSNEYDFP 364

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            +T+HF+GADV+L   + FV +++ IVC  F+       I+GN+ Q N LVGYD++Q+TV
Sbjct: 365 IITVHFKGADVELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTV 424

Query: 365 SFKPTDCTK 373
           SFKPTDCTK
Sbjct: 425 SFKPTDCTK 433


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  291 bits (744), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 171/439 (38%), Positives = 238/439 (54%), Gaps = 72/439 (16%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M T       LF LCF + S   A + GFSVELIHRDSPKSP+Y  +E  YQ   DA  R
Sbjct: 1   MNTLCFLTLSLFSLCF-IASFSHALSNGFSVELIHRDSPKSPYYKPTENKYQHFVDAARR 59

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR NHF ++S  S+ +++   +IP+   YL+  S+GTPPT+   +ADTGSD++W QCE
Sbjct: 60  SINRANHFFKDSDTSTPEST---VIPDRGGYLMTYSVGTPPTKIYGIADTGSDIVWLQCE 116

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
           PC   QCY Q +P+F+P  SS+YK++PC S  C S+   SCS  N CQY +SYGD S S 
Sbjct: 117 PC--EQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRDTSCSDQNSCQYKISYGDSSHSQ 174

Query: 180 GNLATETVTLGSTTGQAVALPGITFGC-----------------------------GTNN 210
           G+L+ +T++L ST+G  V+ P    GC                             G++ 
Sbjct: 175 GDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAGTFGGASSGIVGLGGGPVSLITQLGSSI 234

Query: 211 GG--------LFNSKTTGIVGLGGGDISLIS-------------------QMRTTIAGNQ 243
           GG        L N ++     L  GD +++S                    ++    GN+
Sbjct: 235 GGKFSYCLVPLLNKESNASSILSFGDAAVVSGDGVVSTPLIKKDPVFYFLTLQAFSVGNK 294

Query: 244 RL--------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
           R+        G    +I+IDSGTTLT +P    +NL S +  +++   V DP     LCY
Sbjct: 295 RVEFGGSSEGGDDEGNIIIDSGTTLTLIPSDVYTNLESAVVDLVKLDRVDDPNQQFSLCY 354

Query: 296 SFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
           S  S     P +T HF+GAD++L   + FV +++ IVC  F+       I+GN+ Q N L
Sbjct: 355 SLKSNEYDFPIITAHFKGADIELHSISTFVPITDGIVCFAFQPSPQLGSIFGNLAQQNLL 414

Query: 355 VGYDIEQQTVSFKPTDCTK 373
           VGYD++Q+TVSFKPTDCTK
Sbjct: 415 VGYDLQQKTVSFKPTDCTK 433


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 169/429 (39%), Positives = 226/429 (52%), Gaps = 71/429 (16%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           L  LC Y +   EA   GFSVE+IHRDS +SPFY ++ET +QR+ +A+ RS+NR NHFNQ
Sbjct: 9   LVLLCLYNICFSEALKSGFSVEIIHRDSSRSPFYRATETQFQRVTNAVRRSMNRANHFNQ 68

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
            S  S++  S   ++ ++ +YL+  S+GTPP     + DT SD+IW QC+ C    CY  
Sbjct: 69  ISVYSNAVESPVTLL-DDGDYLMSYSLGTPPFPVYGIVDTASDIIWVQCQLC--ETCYND 125

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETV 187
            SP+FDP  S TYK+LPCSS+ C S+   SCS      C+++V+Y DGS S G+L  ETV
Sbjct: 126 TSPMFDPSYSKTYKNLPCSSTTCKSVQGTSCSSDERKICEHTVNYKDGSHSQGDLIVETV 185

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG--------------------- 226
           TLGS     V  P    GC  N    F+S   GIVGLGG                     
Sbjct: 186 TLGSYNDPFVHFPRTVIGCIRNTNVSFDS--IGIVGLGGGPVSLVPQLSSSISKKFSYCL 243

Query: 227 ------------GDISLIS--------------------QMRTTIAGNQRLGVSTP---- 250
                       GD +++S                     +     GN R+   +     
Sbjct: 244 APISDRSSKLKFGDAAMVSGDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEFRSSSSRS 303

Query: 251 ----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY-SFNSLSQVPE 305
               +I+IDSGTT T LP    S L S ++ +++ +   DP     LCY S      VP 
Sbjct: 304 SGKGNIIIDSGTTFTVLPDDVYSKLESAVADVVKLERAEDPLKQFSLCYKSTYDKVDVPV 363

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           +T HF GADVKL+  N F+  S  +VC  F   + S  I+GN+ Q NFLVGYD++++ VS
Sbjct: 364 ITAHFSGADVKLNALNTFIVASHRVVCLAFLS-SQSGAIFGNLAQQNFLVGYDLQRKIVS 422

Query: 366 FKPTDCTKQ 374
           FKPTDCTKQ
Sbjct: 423 FKPTDCTKQ 431


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  279 bits (713), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 164/443 (37%), Positives = 243/443 (54%), Gaps = 81/443 (18%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
            +FL+  F  FFLCF  +S  +A + GFS+ELIHRDS KSPFY  ++  YQ + DA+ RS
Sbjct: 4   VSFLTLSF--FFLCF-SISFSQAVSNGFSIELIHRDSSKSPFYKPTQNKYQHVVDAVHRS 60

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
           +NR+NH N+NS  S+ +++   +I    +Y++  S+GTPP +   + DTGSD++W QCEP
Sbjct: 61  INRVNHSNKNSLASTPEST---VISYEGDYIMSYSVGTPPIKSYGIVDTGSDIVWLQCEP 117

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQYSVSYGDGSFSNG 180
           C   QCY Q +P F+P  SS+YK++ CSS  C S+   SC+   NC+YS++YG+ S S G
Sbjct: 118 C--EQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRDTSCNDKKNCEYSINYGNQSHSQG 175

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNN-----------------------------G 211
           +L+ ET+TL STTG+ V+ P    GCGTNN                             G
Sbjct: 176 DLSLETLTLESTTGRPVSFPKTVIGCGTNNIGSFKRVSSGVVGLGGGPASLITQLGPSIG 235

Query: 212 GLF-----------NSKTTGIVGLGGGDISLIS--------------------------- 233
           G F            + + G   L  GD++++S                           
Sbjct: 236 GKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVSGHNVLSTPIVKKDHSFFYYLTIEAFSV 295

Query: 234 -QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
              R   AG+ + GV   +I+IDS T +TF+P    + L S +  ++  + V DP     
Sbjct: 296 GDKRVEFAGSSK-GVEEGNIIIDSSTIVTFVPSDVYTKLNSAIVDLVTLERVDDPNQQFS 354

Query: 293 LCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 350
           LCY+ +S  +   P +T HF+GAD+ L  +N FV+V+ D++C  F   +N   I+G+  Q
Sbjct: 355 LCYNVSSDEEYDFPYMTAHFKGADILLYATNTFVEVARDVLCFAF-APSNGGAIFGSFSQ 413

Query: 351 TNFLVGYDIEQQTVSFKPTDCTK 373
            +F+VGYD++Q+TVSFK  DCT+
Sbjct: 414 QDFMVGYDLQQKTVSFKSVDCTE 436


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  276 bits (707), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 165/433 (38%), Positives = 231/433 (53%), Gaps = 75/433 (17%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +LFF   ++VS   AQ  GFSVELIHRDS KSP Y  ++  YQ   DA  RS+NR NHF 
Sbjct: 9   LLFFSICFIVSFSHAQKNGFSVELIHRDSLKSPLYKPTQNKYQYFVDAARRSINRANHFY 68

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           +    S +   Q+ +IP+   YL+  S+GTPP +   + DTGSD++W QCEPC   +CY 
Sbjct: 69  K---YSLANIPQSTVIPDIGEYLMTYSVGTPPFKLYGIVDTGSDIVWLQCEPC--QECYN 123

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q +P+F+P  SS+YK++PC S  C S+   SC+  N C+YS  YGD S S G+L+ +T+T
Sbjct: 124 QTTPMFNPSKSSSYKNIPCPSKLCQSMEDTSCNDKNYCEYSTYYGDNSHSGGDLSVDTLT 183

Query: 189 LGSTTGQAVALPGITFGCGTNN-----------------------------GGLF----- 214
           L ST G  V+ P I  GCGTNN                             GG F     
Sbjct: 184 LESTNGLTVSFPNIVIGCGTNNILSYEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT 243

Query: 215 ------NSKTTGIVGLGGGDISLIS--------------------QMRTTIAGNQRL--- 245
                 N ++     L  GD + +S                     +     GN+R+   
Sbjct: 244 PLFSVTNIQSNATSKLNFGDAATVSGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIG 303

Query: 246 ----GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
               G +  +I+IDSGTTLT L +   S L S +  +++ + V DPT +L LCYS  +  
Sbjct: 304 GVPNGDNEGNIIIDSGTTLTSLTKDDYSFLESAVVDLVKLERVDDPTQTLNLCYSVKAEG 363

Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
              P +T+HF+GADV L   + FV V++ + C  F+   +   I+GN+ Q N +VGYD++
Sbjct: 364 YDFPIITMHFKGADVDLHPISTFVSVADGVFCLAFESSQDHA-IFGNLAQQNLMVGYDLQ 422

Query: 361 QQTVSFKPTDCTK 373
           Q+ VSFKP+DCTK
Sbjct: 423 QKIVSFKPSDCTK 435


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 165/420 (39%), Positives = 222/420 (52%), Gaps = 82/420 (19%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G F+  LIHRDSP SP YN   T + RL+ +  RS++R N F  NS +S++K  + DIIP
Sbjct: 31  GSFTASLIHRDSPISPLYNPKNTYFDRLQSSFHRSISRANRFTPNS-VSAAKTLEYDIIP 89

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y +RISIGTPP E L +ADTGSDLIW QC+PC   +CY Q SP+F+PK SSTY+ +
Sbjct: 90  GGGEYFMRISIGTPPIEVLVIADTGSDLIWVQCQPC--QECYKQKSPIFNPKQSSTYRRV 147

Query: 147 PCSSSQCASLN--QKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            C +  C +LN   ++CS       C YS SYGD SF+ G LATE   +GST     ++ 
Sbjct: 148 LCETRYCNALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFIIGSTNN---SIQ 204

Query: 201 GITFGCGTNNGGLFNSKTTG-----------IVGLGG----------------------- 226
            + FGCG +NGG F+   +G           I  LG                        
Sbjct: 205 ELAFGCGNSNGGNFDEVGSGIVGLGGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGK 264

Query: 227 ---GDISLISQMRTTIA---------------------GNQRLG---------VSTPDIV 253
              GD S IS   T ++                     GN+RL          V   +I+
Sbjct: 265 IVFGDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVEKGNII 324

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 313
           IDSGTTLTFL     + L  V+   +E + V+DP G   +C+      ++P +T+HF  A
Sbjct: 325 IDSGTTLTFLDSKLYNKLELVLEKAVEGERVSDPNGIFSICFRDKIGIELPIITVHFTDA 384

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           DV+L   N F K  ED++C  F  I +N + I+GN+ Q NFLVGYD+++  VSF PTDC+
Sbjct: 385 DVELKPINTFAKAEEDLLC--FTMIPSNGIAIFGNLAQMNFLVGYDLDKNCVSFMPTDCS 442


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  267 bits (683), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 163/437 (37%), Positives = 226/437 (51%), Gaps = 98/437 (22%)

Query: 8   VFILFF--LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           + ILF+  LCF ++S   A   GFSVELIHRDS KSP Y  ++  YQ + +A  RS+NR 
Sbjct: 6   LLILFYFSLCF-IISLSHALNNGFSVELIHRDSSKSPLYQPTQNKYQHIVNAARRSINRA 64

Query: 66  NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           NHF + +    +   Q+ +IP++  YL+  S+GTPP +   +ADTGSD++W QCEPC   
Sbjct: 65  NHFYKTAL---TNTPQSTVIPDHGEYLMTYSVGTPPFKLYGIADTGSDIVWLQCEPC--K 119

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
           +CY Q +P F P  SSTYK++PCSS  C S  Q                     GNL+ +
Sbjct: 120 ECYNQTTPKFKPSKSSTYKNIPCSSDLCKSGQQ---------------------GNLSVD 158

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG------------------- 226
           T+TL S+TG  ++ P    GCGT+N   F   ++GIVGLGG                   
Sbjct: 159 TLTLESSTGHPISFPKTVIGCGTDNTVSFEGASSGIVGLGGGPASLITQLGSSIDAKFSY 218

Query: 227 -----------------GDISLIS--------------------QMRTTIAGNQRL---- 245
                            GD +++S                     +     GN+R+    
Sbjct: 219 CLLPNPVESNTTSKLNFGDTAVVSGDGVVSTPIVKKDPIVFYYLTLEAFSVGNKRIEFEG 278

Query: 246 ---GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 301
              G    +I+IDSGTTLT +P    +NL S +  +++ + V DPT    LCYS  S   
Sbjct: 279 SSNGGHEGNIIIDSGTTLTVIPTDVYNNLESAVLELVKLKRVNDPTRLFNLCYSVTSDGY 338

Query: 302 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-----IYGNIMQTNFLVG 356
             P +T HF+GADVKL   + FV V++ IVC  F   +  +P     I+GN+ Q N LVG
Sbjct: 339 DFPIITTHFKGADVKLHPISTFVDVADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVG 398

Query: 357 YDIEQQTVSFKPTDCTK 373
           YD++Q+ VSFKPTDC+K
Sbjct: 399 YDLQQKIVSFKPTDCSK 415


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 166/443 (37%), Positives = 234/443 (52%), Gaps = 83/443 (18%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA   S V ++ FL    V  + A TG   GF+VELIHRDSPKSP YN  E  Y R+ D 
Sbjct: 1   MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L RS++       N+ + ++   +A I  N   YL+++S+GTPP   +AVADTGSD+IWT
Sbjct: 59  LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
           QCEPC  + CY QD P+F+P  S+TY+ + CSS  C+   +  SCS   +C YS+SYGD 
Sbjct: 112 QCEPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S S G+ A +T+T+GST+G+ VA P    GCG +N G F++  +GIVGLG G  SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229

Query: 236 RTTIAGN---------------QRL------------GVSTPDIVIDS-----GTTLTFL 263
            + + G                 +L             VSTP  + D         L  +
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 264 PQGYNSNLLSVMSSMI--EAQPVADPTGSLELC--------------------------- 294
             G N+   S  +S++  +A  + D   +L L                            
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349

Query: 295 --YSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 348
             Y F + +    VP + +HF GA+++L R N  ++VS++++C  F G   N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409

Query: 349 MQTNFLVGYDIEQQTVSFKPTDC 371
            Q NFLVGYD+   ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  265 bits (677), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 171/411 (41%), Positives = 224/411 (54%), Gaps = 70/411 (17%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+++LIHRDSPKSPFYNS+ET  QR+R+A+ RS      F+ + +  S  + Q+ I  N
Sbjct: 25  GFTIDLIHRDSPKSPFYNSAETSSQRMRNAIRRSARSTLQFSNDDA--SPNSPQSFITSN 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+ ISIGTPP   LA+ADTGSDLIWTQC PC    CY Q SPLFDPK SSTY+ + 
Sbjct: 83  RGEYLMNISIGTPPVPILAIADTGSDLIWTQCNPC--EDCYQQTSPLFDPKESSTYRKVS 140

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSSSQC +L   SCS     C Y+++YGD S++ G++A +TVT+GS+  + V+L  +  G
Sbjct: 141 CSSSQCRALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIG 200

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN---------------QRLGVSTP 250
           CG  N G F+   +GI+GLGGG  SL+SQ+R +I G                 ++   T 
Sbjct: 201 CGHENTGTFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLVPFTSETGLTSKINFGTN 260

Query: 251 DIVIDSGTTLTFL----PQGY---NSNLLSVMSSMI----------EAQPVADPTGSLEL 293
            IV   G   T +    P  Y   N   +SV S  I          E   V D   +L L
Sbjct: 261 GIVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFGTGEGNIVIDSGTTLTL 320

Query: 294 -------------------------------CYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
                                          CY  +S  +VP++T+HF+G DVKL   N 
Sbjct: 321 LPSNFYYELESVVASTIKAERVQDPDGILSLCYRDSSSFKVPDITVHFKGGDVKLGNLNT 380

Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           FV VSED+ C  F      + I+GN+ Q NFLVGYD    TVSFK TDC++
Sbjct: 381 FVAVSEDVSCFAFAA-NEQLTIFGNLAQMNFLVGYDTVSGTVSFKKTDCSQ 430


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  263 bits (672), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 165/443 (37%), Positives = 233/443 (52%), Gaps = 83/443 (18%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTG---GFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA   S V ++ FL    V  + A TG   GF+VELIHRDSPKSP YN  E  Y R+ D 
Sbjct: 1   MAPIFSLVIVIIFLISTAV--VSAATGPDYGFTVELIHRDSPKSPMYNPLENHYHRVADT 58

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L RS++       N+ + ++   +A I  N   YL+++S+GTPP   +AVADTGSD+IWT
Sbjct: 59  LRRSIS------HNTGLVTNTV-EAPIYNNRGEYLMKLSVGTPPFPIIAVADTGSDIIWT 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDG 175
           QC PC  + CY QD P+F+P  S+TY+ + CSS  C+   +  SCS   +C YS+SYGD 
Sbjct: 112 QCVPC--TNCYQQDLPMFNPSKSTTYRKVSCSSPVCSFTGEDNSCSFKPDCTYSISYGDN 169

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S S G+ A +T+T+GST+G+ VA P    GCG +N G F++  +GIVGLG G  SLI QM
Sbjct: 170 SHSQGDFAVDTLTMGSTSGRVVAFPRTAIGCGHDNAGSFDANVSGIVGLGLGPASLIKQM 229

Query: 236 RTTIAGN---------------QRL------------GVSTPDIVIDS-----GTTLTFL 263
            + + G                 +L             VSTP  + D         L  +
Sbjct: 230 GSAVGGKFSYCLTPIGNDDGGSNKLNFGSNANVSGSGAVSTPIYISDKFKSFYSLKLKAV 289

Query: 264 PQGYNSNLLSVMSSMI--EAQPVADPTGSLELC--------------------------- 294
             G N+   S  +S++  +A  + D   +L L                            
Sbjct: 290 SVGRNNTFYSTANSILGGKANIIIDSGTTLTLLPVDLYHNFAKAISNSINLQRTDDPNQF 349

Query: 295 --YSFNSLSQ---VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNI 348
             Y F + +    VP + +HF GA+++L R N  ++VS++++C  F G   N + IYGNI
Sbjct: 350 LEYCFETTTDDYKVPFIAMHFEGANLRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNI 409

Query: 349 MQTNFLVGYDIEQQTVSFKPTDC 371
            Q NFLVGYD+   ++SFKP +C
Sbjct: 410 AQINFLVGYDVTNMSLSFKPMNC 432


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 162/441 (36%), Positives = 218/441 (49%), Gaps = 75/441 (17%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +    S V  L F+    +S  E + G FS++LIHRDSPKSP YN SETP +RL     R
Sbjct: 7   LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R   F++ S   S    +  +  NN  YL++ISIGTPP +   + DTGSDL+WTQC 
Sbjct: 63  FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
           PC    CY Q +P+FDP  S+++K + C S QC  L+  SCS     C +S  YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+TL S +GQ  ++  I FGCG NN G FN    G+ G GG  +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPXSIXNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238

Query: 239 IAGNQRLG-----------------------------VSTPDIVIDSGT----TL----- 260
           +   ++                               VSTP +  D  T    TL     
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 261 ------------------TFLPQGYNSNLL------SVMSSMIEAQP---VADPTGSLEL 293
                              F+  G    LL       ++  + EA P   V DP    +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358

Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
           CY   +L   P +T HF GADV+L   N F+   E + C   + I     I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418

Query: 354 LVGYDIEQQTVSFKPTDCTKQ 374
           L+G+D++ + VSFK  DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score =  257 bits (657), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 162/441 (36%), Positives = 218/441 (49%), Gaps = 75/441 (17%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +    S V  L F+    +S  E + G FS++LIHRDSPKSP YN SETP +RL     R
Sbjct: 7   LGLLFSIVIALSFVSVAHISAAEVKNGRFSIDLIHRDSPKSPLYNPSETPAERL----DR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R   F++ S   S    +  +  NN  YL++ISIGTPP +   + DTGSDL+WTQC 
Sbjct: 63  FFRRFMSFSEASI--SPNTPEPPVSSNNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCL 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFS 178
           PC    CY Q +P+FDP  S+++K + C S QC  L+  SCS     C +S  YGDGS +
Sbjct: 121 PC--LSCYKQKNPMFDPSKSTSFKEVSCESQQCRLLDTVSCSQPQKLCDFSYGYGDGSLA 178

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+TL S +GQ  ++  I FGCG NN G FN    G+ G GG  +SL SQ+ +T
Sbjct: 179 QGVIATETLTLNSNSGQPTSILNIVFGCGHNNSGTFNENEMGLFGTGGRPLSLTSQIMST 238

Query: 239 IAGNQRLG-----------------------------VSTPDIVIDSGT----TL----- 260
           +   ++                               VSTP +  D  T    TL     
Sbjct: 239 LGSGRKFSQCLVPFRTDPSITSKIIFGPEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISV 298

Query: 261 ------------------TFLPQGYNSNLL------SVMSSMIEAQP---VADPTGSLEL 293
                              F+  G    LL       ++  + EA P   V DP    +L
Sbjct: 299 GDKLFPFSSSSPMATKGNVFIDAGTPPTLLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQL 358

Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
           CY   +L   P +T HF GADV+L   N F+   E + C   + I     I+GN +Q NF
Sbjct: 359 CYRSATLIDGPILTAHFDGADVQLKPLNTFISPKEGVYCFAMQPIDGDTGIFGNFVQMNF 418

Query: 354 LVGYDIEQQTVSFKPTDCTKQ 374
           L+G+D++ + VSFK  DCTKQ
Sbjct: 419 LIGFDLDGKKVSFKAVDCTKQ 439


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score =  256 bits (655), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 160/433 (36%), Positives = 223/433 (51%), Gaps = 86/433 (19%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +FL+ +F   F CF ++S   A   GF++ELIHRDS KSPFY  ++  Y+R+ +A+ RS+
Sbjct: 5   SFLTLLFFTIF-CF-IISLSHALNNGFTLELIHRDSSKSPFYQPTQNKYERIANAVRRSI 62

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           NR+NHF + S  S+    Q+ +  +   YL+  SIGTPP +     DTGSDL+W QCEPC
Sbjct: 63  NRVNHFYKYSLTST---PQSTVNSDKGEYLMSYSIGTPPFKVFGFVDTGSDLVWLQCEPC 119

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QCY Q +P+FDP +SS+Y+++PC S  C S+   SC                  G L
Sbjct: 120 --KQCYPQITPIFDPSLSSSYQNIPCLSDTCHSMRTTSCD---------------VRGYL 162

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
           + ET+TL STTG +V+ P    GCG  N G F+  ++GIVGLG G +SL SQ+ T+I G 
Sbjct: 163 SVETLTLDSTTGYSVSFPKTMIGCGYRNTGTFHGPSSGIVGLGSGPMSLPSQLGTSIGGK 222

Query: 243 -------------QRLGVSTPDIVIDSGTTLTFLPQ-----GYNSNL--LSVMSSMIE-- 280
                         +L      IV   G   T + +     GY   L   SV + +IE  
Sbjct: 223 FSYCLGPWLPNSTSKLNFGDAAIVYGDGAMTTPIVKKDAQSGYYLTLEAFSVGNKLIEFG 282

Query: 281 ---------------------------------------AQPVADPTGSLELCYSFNSLS 301
                                                   + V DP G+ +LCY+     
Sbjct: 283 GPTYGGNEGNILIDSGTTFTFLPYDVYYRFESAVAEYINLEHVEDPNGTFKLCYNVAYHG 342

Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
            + P +T HF+GAD+KL   + F+KVS+ I C  F  I +   I+GN+ Q N LVGY++ 
Sbjct: 343 FEAPLITAHFKGADIKLYYISTFIKVSDGIACLAF--IPSQTAIFGNVAQQNLLVGYNLV 400

Query: 361 QQTVSFKPTDCTK 373
           Q TV+FKP DCTK
Sbjct: 401 QNTVTFKPVDCTK 413


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  249 bits (637), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 124/246 (50%), Positives = 167/246 (67%), Gaps = 6/246 (2%)

Query: 1   MATF---LSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA 57
           MA F   LS    +  LC      I A+  GF+V+LIHRDSP SPFYNS ET  QR+ +A
Sbjct: 1   MAAFRSPLSFALAIALLCVSGFGCIYARKVGFTVDLIHRDSPLSPFYNSEETDLQRINNA 60

Query: 58  LTRSLNRLNHFNQNSSIS-SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
           L RS++R++HF+  ++ S S KA+++D+  N   YL+ +S+GTPP + + +ADTGSDLIW
Sbjct: 61  LRRSISRVHHFDPIAAASVSPKAAESDVTSNRGEYLMSLSLGTPPFKIMGIADTGSDLIW 120

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
           TQC+PC   +CY Q  PLFDPK S TY+   C + QC+ L+Q +CSG  CQY  SYGD S
Sbjct: 121 TQCKPC--ERCYKQVDPLFDPKSSKTYRDFSCDARQCSLLDQSTCSGNICQYQYSYGDRS 178

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           ++ GN+A++T+TL STTG  V+ P    GCG  N G F+ K +GIVGLG G +SLISQM 
Sbjct: 179 YTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGTFSDKGSGIVGLGAGPLSLISQMG 238

Query: 237 TTIAGN 242
           +++ G 
Sbjct: 239 SSVGGK 244



 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 66/134 (49%), Positives = 92/134 (68%)

Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
           G+  LG    +I+IDSGTTLT +P  + SNL + + + +E +   DP+G L +CYS  S 
Sbjct: 307 GDSSLGTGEGNIIIDSGTTLTIVPDDFFSNLSTAVGNQVEGRRAEDPSGFLSVCYSATSD 366

Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
            +VP +T HF GADVKL   N FV+VS+D+VC  F   T+ + IYGN+ Q NFLV Y+I+
Sbjct: 367 LKVPAITAHFTGADVKLKPINTFVQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQ 426

Query: 361 QQTVSFKPTDCTKQ 374
            +++SFKPTDCTK+
Sbjct: 427 GKSLSFKPTDCTKK 440


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 128/249 (51%), Positives = 162/249 (65%), Gaps = 12/249 (4%)

Query: 1   MATFLSCVFILFF-----LCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR 55
           MATF S   +L F     LC      I A   GF+ EL+HRDSPKSP YNS +T  QR  
Sbjct: 1   MATFQS---VLSFASAIALCVASFGCIYAHNAGFTTELVHRDSPKSPLYNSQQTHLQRWN 57

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
            A+ RS++R++HF + ++  S K  +++II N   YL+ +S+GTPP E LA+ADTGSDLI
Sbjct: 58  KAMRRSVSRVHHFQRTAATVSPKEVESEIIANGGEYLMSLSLGTPPFEILAIADTGSDLI 117

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSGVN-CQYSVSYG 173
           WTQC PC   +CY Q +PLFDPK S TY+ L C + QC +L +  SCS    CQYS  YG
Sbjct: 118 WTQCTPC--DKCYKQIAPLFDPKSSKTYRDLSCDTRQCQNLGESSSCSSEQLCQYSYYYG 175

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           D SF+NGNLA +TVTL ST G  V  P    GCG  N G F+ K +GI+GLGGG +SLIS
Sbjct: 176 DRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNNGTFDKKDSGIIGLGGGPMSLIS 235

Query: 234 QMRTTIAGN 242
           QM +++ G 
Sbjct: 236 QMGSSVGGK 244



 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/124 (43%), Positives = 79/124 (63%), Gaps = 2/124 (1%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLL-SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIH 309
           +I+IDSGT+LT  P  + +    +V +++I  +   D +G L  CY      +VP +T H
Sbjct: 317 NIIIDSGTSLTLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPTPDLKVPVITAH 376

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           F GADV L   N F+ +S+D++C  F   T S  I+GN+ Q NFL+GYDI+ ++VSFKPT
Sbjct: 377 FNGADVVLQTLNTFILISDDVLCLAFNS-TQSGAIFGNVAQMNFLIGYDIQGKSVSFKPT 435

Query: 370 DCTK 373
           DCT+
Sbjct: 436 DCTQ 439


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 171/437 (39%), Positives = 226/437 (51%), Gaps = 76/437 (17%)

Query: 9   FILFFLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F+ F L FY VS +   EA     GF+V+LIHRDSP SPFYN S TP QR+ +A  RS++
Sbjct: 4   FVFFCLAFYSVSSLFSTEANESPSGFTVDLIHRDSPLSPFYNPSLTPSQRIINAALRSIS 63

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           RLN  + N    ++K  Q+ +I +N  YL+R  IGTPP ERLA ADTGSDLIW QC PC 
Sbjct: 64  RLNRVS-NLLDQNNKLPQSVLILHNGEYLMRFYIGTPPVERLATADTGSDLIWVQCSPC- 121

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDG-SFS 178
            + C+ Q +PLF P  SST+    C S  C  L   QK C  SG  C Y+  YGD  SFS
Sbjct: 122 -ASCFPQSTPLFQPLKSSTFMPTTCRSQPCTLLLPEQKGCGKSG-ECIYTYKYGDQYSFS 179

Query: 179 NGNLATETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQM 235
            G L+TET+   S  G Q VA P   FGCG  NN  +F S K TGI+GLG G +SL+SQ+
Sbjct: 180 EGLLSTETLRFDSQGGVQTVAFPNSFFGCGLYNNITVFPSYKLTGIMGLGAGPLSLVSQI 239

Query: 236 RTTIA--------------------GNQRL-----GVSTP-------------------- 250
              I                     GN+ +      VSTP                    
Sbjct: 240 GDQIGHKFSYCLLPLGSTSTSKLKFGNESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTV 299

Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
                       +++IDSGT LT+L + +  N  + +   +  + V D    L  C+ + 
Sbjct: 300 AQKTVPTGSTDGNVIIDSGTLLTYLGESFYYNFAASLQESLAVELVQDVLSPLPFCFPYR 359

Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVG 356
                PE+   F GA V L  +N FV   + + VC +    + S + I+G+  Q +F V 
Sbjct: 360 DNFVFPEIAFQFTGARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVE 419

Query: 357 YDIEQQTVSFKPTDCTK 373
           YD+E + VSF+PTDC+K
Sbjct: 420 YDLEGKKVSFQPTDCSK 436


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 157/432 (36%), Positives = 222/432 (51%), Gaps = 71/432 (16%)

Query: 9   FILFFLCFYVVSPIEAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
            IL       +S  EA+ G  GFSV+LIHRDSP SPFYN S TP +R+ +A  RS++RL 
Sbjct: 7   MILALFSLSTLSSREAREGLRGFSVDLIHRDSPSSPFYNPSLTPSERIINAALRSMSRLQ 66

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
             +    +  +K  ++ +IP+   YL+R  IG+PP ERLA+ DTGS LIW QC PC    
Sbjct: 67  RVSH--FLDENKLPESLLIPDKGEYLMRFYIGSPPVERLAMVDTGSSLIWLQCSPC--HN 122

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLA 183
           C+ Q++PLF+P  SSTYK   C S  C  L  +Q+ C  +  C Y + YGD SFS G L 
Sbjct: 123 CFPQETPLFEPLKSSTYKYATCDSQPCTLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILG 182

Query: 184 TETVTLGSTTG-QAVALPGITFGCGT-NNGGLFNS-KTTGIVGLGGGDISLISQMRTTIA 240
           TET++ GST G Q V+ P   FGCG  NN  ++ S K  GI GLG G +SL+SQ+   I 
Sbjct: 183 TETLSFGSTGGAQTVSFPNTIFGCGVDNNFTIYTSNKVMGIAGLGAGPLSLVSQLGAQIG 242

Query: 241 ---------------GNQRLG----------VSTP------------------------- 250
                             + G          VSTP                         
Sbjct: 243 HKFSYCLLPYDSTSTSKLKFGSEAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVV 302

Query: 251 -------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
                  +IVIDSGT LT+L   + +N ++ +   +  + + D    L+ C+   +   +
Sbjct: 303 STGQTDGNIVIDSGTPLTYLENTFYNNFVASLQETLGVKLLQDLPSPLKTCFPNRANLAI 362

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           P++   F GA V L   N  + +++ +I+C +V       + ++G+I Q +F V YD+E 
Sbjct: 363 PDIAFQFTGASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEG 422

Query: 362 QTVSFKPTDCTK 373
           + VSF PTDC K
Sbjct: 423 KKVSFAPTDCAK 434


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  240 bits (613), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 152/411 (36%), Positives = 219/411 (53%), Gaps = 69/411 (16%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD----I 84
           F+++LIH DSP SPFYNSS T  Q +R+A  RS++R N  + + S S ++  ++     I
Sbjct: 30  FTIDLIHHDSPPSPFYNSSMTRSQLIRNAAMRSISRANQLSLSLSHSLNQLKESSPEPII 89

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           IPNN NYL+RI IGTP  ERLA+ADTGSDL W QC PC  ++C+ Q++PL+DP  SST+ 
Sbjct: 90  IPNNGNYLMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFT 149

Query: 145 SLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            LPC S  C  L  +Q  CS   +C Y+ +YGD S+S G L+++++ L     Q      
Sbjct: 150 LLPCDSQPCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRL--MLLQLHYNSK 207

Query: 202 ITFGCGTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIA-------------GNQRLG 246
           I FGCG  N    +   KTTGIVGLG G +SL+SQ+   I               N +L 
Sbjct: 208 ICFGCGFQNKFTADKSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCLLPFSSNSNSKLK 267

Query: 247 ------------VSTP-------------------------------DIVIDSGTTLTFL 263
                       VSTP                               +I+IDSG+TLT+L
Sbjct: 268 FGEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVKTGQTDGNIIIDSGSTLTYL 327

Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKLSRSNF 322
            + + +  +S++   +  +         + C+++   +S  P+V  HF G DV L   N 
Sbjct: 328 EESFYNEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFTGGDVVLKPMNT 387

Query: 323 FVKVSEDIVCS-VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            V + ++++CS V     + + I+GN+ Q +F VGYDI+   VSF PTDC+
Sbjct: 388 LVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTDCS 438


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  240 bits (612), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 158/440 (35%), Positives = 225/440 (51%), Gaps = 76/440 (17%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F S + +LF  CF  VS  + Q  GFSVELIH  S KSPFYN++E+ +QR+ + +  S N
Sbjct: 3   FYSSLLLLF--CFCRVSVSKTQNNGFSVELIHPISSKSPFYNTAESHFQRMSNNMKHSTN 60

Query: 64  RLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           R+++ N   S   +K     + P   + Y+I   IGTPP +   V DT +D IW QC PC
Sbjct: 61  RVHYLNHVFSFPPNKVPNIVVSPFMGDGYIISFLIGTPPFQLYGVMDTANDNIWFQCNPC 120

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSN 179
            P  C+   SP+FDP  SSTYK++PCSS +C ++    CS  +   C+YS +YG  ++S 
Sbjct: 121 KP--CFNTTSPMFDPSKSSTYKTIPCSSPKCKNVENTHCSSDDKKVCEYSFTYGGEAYSQ 178

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G+L+ +T+TL S     ++   I  GCG  N G      +G +GLG G +S ISQ+ ++I
Sbjct: 179 GDLSIDTLTLNSNNDTPISFKNIVIGCGHRNKGPLEGYVSGNIGLGRGPLSFISQLNSSI 238

Query: 240 AG-----------NQRL-------------GVSTPDIVIDSG-----TTLTFLPQG---- 266
            G           N+ +             GV T    I +G     TTL  L  G    
Sbjct: 239 GGKFSYCLVPLFSNEGISGKLHFGDKSVVSGVGTVSTPITAGEIGYSTTLNALSVGDHII 298

Query: 267 -------YNSNL----------LSVM------------SSMIEAQPVADPTGSLELCY-- 295
                   N NL          L+++            +SM++ +    P    +LCY  
Sbjct: 299 KFENSTSKNDNLGNTIIDSGTTLTILPENVYSRLESIVTSMVKLERAKSPNQQFKLCYKA 358

Query: 296 SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNF 353
           +  +L  VP +T HF GADV L+  N F  +  ++VC  F  + N  P  I GNI Q NF
Sbjct: 359 TLKNL-DVPIITAHFNGADVHLNSLNTFYPIDHEVVCFAFVSVGN-FPGTIIGNIAQQNF 416

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           LVG+D+++  +SFKPTDCTK
Sbjct: 417 LVGFDLQKNIISFKPTDCTK 436


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  239 bits (610), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 155/440 (35%), Positives = 228/440 (51%), Gaps = 80/440 (18%)

Query: 1   MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           MA  +S  F  ILF + F   + I    G F+  L HRDS  SP   SS + Y RL +A 
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
            RSL+R       ++ S +   Q+ I P +  YL+ +SIGTPP + L +ADTGSDL W Q
Sbjct: 60  RRSLSRSAALLNRAATSGAVGLQSSIGPGSGEYLMSVSIGTPPVDYLGIADTGSDLTWAQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
           C PC   +CY Q  P+F+P  S+++  +PC++  C +++   C GV   C YS +YGD +
Sbjct: 120 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 176

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +S G+L  E +T+GS++ ++V       GCG  + G F    +G++GLGGG +SL+SQM 
Sbjct: 177 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 229

Query: 237 TTIAGNQRLG---------------------------VSTP------------------- 250
            T   ++R                             VSTP                   
Sbjct: 230 QTSGISRRFSYCLPTLLSHANGKINFGENAVVSGPGVVSTPLISKNTVTYYYITLEAISI 289

Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--S 296
                       +++IDSGTTLT LP+     ++S +  +++A+ V DP GSL+LC+   
Sbjct: 290 GNERHMAFAKQGNVIIDSGTTLTILPKELYDGVVSSLLKVVKAKRVKDPHGSLDLCFDDG 349

Query: 297 FNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQT 351
            N+ +   +P +T HF  GA+V L   N F KV++++ C   K    T    I GN+ Q 
Sbjct: 350 INAAASLGIPVITAHFSGGANVNLLPINTFRKVADNVNCLTLKAASPTTEFGIIGNLAQA 409

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
           NFL+GYD+E + +SFKPT C
Sbjct: 410 NFLIGYDLEAKRLSFKPTVC 429


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 145/439 (33%), Positives = 228/439 (51%), Gaps = 74/439 (16%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M+ F     I F+LC ++     A   G S+E+IHRD  KSP Y+ + T +QR  + + R
Sbjct: 1   MSRFSVLTLIFFYLCCFIYFS-HASKKGLSIEMIHRDFSKSPLYHPTVTKFQRAYNVVHR 59

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR+N+F +  S++ ++   + + P    YLI  S+GTPP +     DTGS+++W QC+
Sbjct: 60  SINRVNYFTKEFSLNKNQPV-STLTPELGEYLISYSVGTPPFKVYGFMDTGSNIVWLQCQ 118

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCS--GVNCQYSVSYGDGS 176
           PC  + C+ Q SP+F+P  SS+YK++PC+SS C   N    SCS  G  C+YS++YG  +
Sbjct: 119 PC--NTCFNQTSPIFNPSKSSSYKNIPCTSSTCKDTNDTHISCSNGGDVCEYSITYGGDA 176

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            S G+L+ +++TL ST+G +V  P I  GCG  N    NS+++G+VG+G G +SLI Q+ 
Sbjct: 177 KSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHINVLQDNSQSSGVVGMGRGPMSLIKQVG 236

Query: 237 TTIAGNQ----------------------------RLGVSTPDIVIDSGTTLTFL----- 263
           ++  G++                             + VSTP + ++      FL     
Sbjct: 237 SSSVGSKFSYCLIPYNSDSNSSSKLIFGEDVVVSGEIVVSTPMVKVNGQENYYFLTLEAF 296

Query: 264 -------------------------------PQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
                                          P  + S L+S ++  ++   +  P   L 
Sbjct: 297 SVGNNRIEYGERSNASTQNILIDSGTPLTMLPNLFLSKLVSYVAQEVKLPRIEPPDHHLS 356

Query: 293 LCYSFNSLS-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
           LCY+       VP++T HF GADVKL+ +  F    + I+C  F   +N + I+GNI Q 
Sbjct: 357 LCYNTTGKQLNVPDITAHFNGADVKLNSNGTFFPFEDGIMCFGFIS-SNGLEIFGNIAQN 415

Query: 352 NFLVGYDIEQQTVSFKPTD 370
           N L+ YD+E++ +SFKPTD
Sbjct: 416 NLLIDYDLEKEIISFKPTD 434


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 151/429 (35%), Positives = 215/429 (50%), Gaps = 86/429 (20%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           F+L   CF  +S  + Q  GF+VELIH  S +SPFYN  ET  QR+   L  S+NR+ + 
Sbjct: 7   FVLLLFCFCRLSLTKTQNHGFNVELIHPISSRSPFYNPKETQIQRISSILNYSINRVRYL 66

Query: 69  NQNSSISSSKASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           N   S S +K     +     A Y++  SIGTPP +  ++ DTG+D IW QC+PC P  C
Sbjct: 67  NHVFSFSPNKIQDVPLSSFMGAGYVMSYSIGTPPFQLYSLIDTGNDNIWFQCKPCKP--C 124

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
             Q SP+F P  SSTYK++PC+S  C +                  DG +    L  +T+
Sbjct: 125 LNQTSPMFHPSKSSTYKTIPCTSPICKN-----------------ADGHY----LGVDTL 163

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN----- 242
           TL S  G  ++   I  GCG  N G      +G +GL  G +S ISQ+ ++I G      
Sbjct: 164 TLNSNNGTPISFKNIVIGCGHRNQGPLEGYVSGNIGLARGPLSFISQLNSSIGGKFSYCL 223

Query: 243 ---------------------QRLG-VSTP-----------------DIVI--------- 254
                                  LG VSTP                 D +I         
Sbjct: 224 VPLFSKENVSSKLHFGDKSTVSGLGTVSTPIKEENGYFVSLEAFSVGDHIIKLENSDNRG 283

Query: 255 ----DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQVPEVT 307
               DSGTT+T LP+   S L SV+  M++ + V DP+    LCY   S   L++V  +T
Sbjct: 284 NSIIDSGTTMTILPKDVYSRLESVVLDMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIIT 343

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            HF G++V L+  N F  ++++++C  F   G  +S+ I+GN++Q NFLVG+D+ ++T+S
Sbjct: 344 AHFSGSEVHLNALNTFYPITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTIS 403

Query: 366 FKPTDCTKQ 374
           FKPTDCTK 
Sbjct: 404 FKPTDCTKH 412


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  234 bits (596), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 151/435 (34%), Positives = 209/435 (48%), Gaps = 105/435 (24%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            +   L  ++   IEA  G F+V+LI R        NSS+  + R+              
Sbjct: 9   LLAILLLVFIFPSIEAHNGRFTVKLIPR--------NSSQVLFNRI-------------- 46

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
                      +Q  +  ++ +YL+ +SIGTPP +  A  DTGSDLIW QC PC  + CY
Sbjct: 47  ----------TAQTPVSVHHYDYLMELSIGTPPVKTYAQVDTGSDLIWLQCIPC--TNCY 94

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATET 186
            Q +P+FDP+ SSTY ++   S  C+ L   SCS    NC Y+ SY D S + G LA ET
Sbjct: 95  KQLNPMFDPQSSSTYSNIAYGSESCSKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQET 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ--- 243
           +TL STTG+ VAL G+ FGCG NN G+FN K  GI+GLG G +SL+SQ+ ++  G     
Sbjct: 155 LTLTSTTGKPVALKGVIFGCGHNNNGVFNDKEMGIIGLGRGPLSLVSQIGSSFGGKMFSQ 214

Query: 244 ----------------------------------------------RLGVSTPDI----- 252
                                                          LG+S  DI     
Sbjct: 215 CLVPFHTNPSITSPMSFGKGSEVLGNGVVSTPLVSKNTHQAFYFVTLLGISVEDINLPFN 274

Query: 253 -------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFN 298
                        VIDSGT  T LP+ +   L+  + + +   P+  DPT   +LCY   
Sbjct: 275 DGSSLEPITKGNMVIDSGTPTTLLPEDFYHRLVEEVRNKVALDPIPIDPTLGYQLCYRTP 334

Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG-ITNSVPIYGNIMQTNFLVGY 357
           +  +   +T HF GADV L+ +  F+ V + I C  F    +N   IYGN  Q+N+L+G+
Sbjct: 335 TNLKGTTLTAHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGF 394

Query: 358 DIEQQTVSFKPTDCT 372
           D+E+Q VSFK TDCT
Sbjct: 395 DLEKQLVSFKATDCT 409


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 154/433 (35%), Positives = 214/433 (49%), Gaps = 79/433 (18%)

Query: 14  LCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           L  Y++S +   E   G  GFS++LIHRDSP SPFY  S TP  R+ +   RS+ +LN  
Sbjct: 9   LALYLLSTVSSREVSEGQRGFSIDLIHRDSPLSPFYKPSLTPSDRIINTALRSIYQLNR- 67

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             +S ++  K  +   IPN+  YL+R  IGTPP ERLA+ADT SDLIW QC PC    C+
Sbjct: 68  ASHSDLNEKKTLERVRIPNHGEYLMRFYIGTPPVERLAIADTASDLIWVQCSPC--ETCF 125

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
            QD+PLF+P  SST+ +L C S  C S N   C  V   C Y+ +YGDGS + G L TE+
Sbjct: 126 PQDTPLFEPHKSSTFANLSCDSQPCTSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTES 185

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQM--------- 235
           +  GS   Q V  P   FGCG+NN  +   ++K TGIVGLG G +SL+SQ+         
Sbjct: 186 IHFGS---QTVTFPKTIFGCGSNNDFMHQISNKVTGIVGLGAGPLSLVSQLGDQIGHKFS 242

Query: 236 ------------------RTTIAGN----------------------------QRLGVST 249
                              TTI GN                            + L V T
Sbjct: 243 YCLLPFTSTSTIKLKFGNDTTITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQVRT 302

Query: 250 PD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQV 303
            D     I+ID GT LT+L   +  N ++++   +      D      + C+   +    
Sbjct: 303 TDHTNGNIIIDLGTVLTYLEVNFYHNFVTLLREALGISETKDDIPYPFDFCFPNQANITF 362

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-DIVC-SVFKGI-TNSVPIYGNIMQTNFLVGYDIE 360
           P++   F GA V LS  N F +  + +++C +V          ++GN+ Q +F V YD +
Sbjct: 363 PKIVFQFTGAKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRK 422

Query: 361 QQTVSFKPTDCTK 373
            + VSF P DC+K
Sbjct: 423 GKKVSFAPADCSK 435


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  231 bits (590), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 117/240 (48%), Positives = 163/240 (67%), Gaps = 8/240 (3%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F + V + F   F ++    A+ GGFSV+LIHRDSP SPF++ S+T  +RL DA  RS++
Sbjct: 9   FFNVVVVGFL--FQLLEVALARGGGFSVDLIHRDSPHSPFFDPSKTQAERLTDAFRRSVS 66

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           R+  F   +   +S   Q+ I+P+   YL+ + IGTPP   +A+ DTGSDL WTQC PC 
Sbjct: 67  RVGRFRPTAM--TSDGIQSRIVPSAGEYLMNLYIGTPPVPVIAIVDTGSDLTWTQCRPC- 123

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCSG-VNCQYSVSYGDGSFSNGN 181
            + CY Q  PLFDPK SSTY+   C +S C +L + +SCS    C +  SY DGSF+ GN
Sbjct: 124 -THCYKQVVPLFDPKNSSTYRDSSCGTSFCLALGKDRSCSKEKKCTFRYSYADGSFTGGN 182

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           LA+ET+T+ ST G+ V+ PG  FGCG ++GG+F+  ++GIVGLGGG++SLISQ+++TI G
Sbjct: 183 LASETLTVDSTAGKPVSFPGFAFGCGHSSGGIFDKSSSGIVGLGGGELSLISQLKSTING 242



 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 52/133 (39%), Positives = 83/133 (62%), Gaps = 1/133 (0%)

Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
           +++  V   +I++DSGTT TFLPQ + S L   +++ I+ + V DP G   LCY+  +  
Sbjct: 308 SKKTEVEEGNIIVDSGTTYTFLPQEFYSKLEKSVANSIKGKRVRDPNGIFSLCYNTTAEI 367

Query: 302 QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
             P +T HF+ A+V+L   N F+++ ED+VC      T+ + + GN+ Q NFLVG+D+ +
Sbjct: 368 NAPIITAHFKDANVELQPLNTFMRMQEDLVCFTV-APTSDIGVLGNLAQVNFLVGFDLRK 426

Query: 362 QTVSFKPTDCTKQ 374
           + VSFK  DCT+ 
Sbjct: 427 KRVSFKAADCTQH 439


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score =  227 bits (578), Expect = 9e-57,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 148/218 (67%), Gaps = 5/218 (2%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M T    + ++   C Y +S ++A  GGFSVE+IHRDS +SP Y  +ETP+QR+ +A+ R
Sbjct: 3   MITRYCSLALVLLWCLYNISFLKANDGGFSVEMIHRDSSRSPLYRPTETPFQRVANAVRR 62

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S+NR NHF +  +  S+ ++++ ++ +   YL+R S+G+PP + L + DTGSD++W QCE
Sbjct: 63  SINRGNHFKK--AFVSTDSAESTVVASQGEYLMRYSVGSPPFQVLGIVDTGSDILWLQCE 120

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSN 179
           PC    CY Q +P+FDP  S TYK+LPCSS+ C SL   +CS  N C+YS+ YGDGS S+
Sbjct: 121 PC--EDCYKQTTPIFDPSKSKTYKTLPCSSNTCESLRNTACSSDNVCEYSIDYGDGSHSD 178

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           G+L+ ET+TLGST G +V  P    GCG NNGG F  +
Sbjct: 179 GDLSVETLTLGSTDGSSVHFPKTVIGCGHNNGGTFQEE 216



 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 61/125 (48%), Positives = 84/125 (67%), Gaps = 4/125 (3%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIH 309
           +I+IDSGTTLT LPQ    NL S +S +I+ +   DP+  L LCY   S    +P +T H
Sbjct: 316 NIIIDSGTTLTLLPQEDYLNLESAVSDVIKLERARDPSKLLSLCYKTTSDELDLPVITAH 375

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F+GADV+L+  + FV V + +VC  F  I++ +  I+GN+ Q N LVGYD+ ++TVSFKP
Sbjct: 376 FKGADVELNPISTFVPVEKGVVCFAF--ISSKIGAIFGNLAQQNLLVGYDLVKKTVSFKP 433

Query: 369 TDCTK 373
           TDCTK
Sbjct: 434 TDCTK 438


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  226 bits (576), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 114/241 (47%), Positives = 159/241 (65%), Gaps = 8/241 (3%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           F + V + F   F+++    A  GGFSV+LIHRDSP SPF++ S+T  +RL DA  RS +
Sbjct: 9   FFNVVVVGFL--FHLLEVGLASGGGFSVDLIHRDSPHSPFFDPSKTRTERLTDAFHRSAS 66

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           R+  F Q  S  +S   Q+ ++P+   Y++ +SIGTPP   +A+ DTGSDL WTQC PC 
Sbjct: 67  RVGRFRQ--SAMTSDGIQSRLVPSAGEYIMNLSIGTPPVPVIAIVDTGSDLTWTQCRPC- 123

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSC-SGVNCQYSVSYGDGSFSNGN 181
            + CY Q  P FDPK SSTY+   C +S C +L N +SC +G  C +  SY DGSF+ GN
Sbjct: 124 -THCYKQVVPFFDPKNSSTYRDSSCGTSFCLALGNDRSCRNGKKCTFMYSYADGSFTGGN 182

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           LA ET+T+ ST G+ V+ PG  FGC   +GG+F+  ++GIVGLG  ++S+ISQ+++TI G
Sbjct: 183 LAVETLTVASTAGKPVSFPGFAFGCVHRSGGIFDEHSSGIVGLGVAELSMISQLKSTING 242

Query: 242 N 242
            
Sbjct: 243 R 243



 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 89/145 (61%), Gaps = 7/145 (4%)

Query: 232 ISQMRTTIAG-NQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
           + + R +  G +++  V   +I++DSGTT T+LP  +   L   ++  I+ + V DP G 
Sbjct: 298 VGKKRLSYKGFSKKAEVEEGNIIVDSGTTYTYLPLEFYVKLEESVAHSIKGKRVRDPNGI 357

Query: 291 LELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGN 347
             LCY+  ++ Q+  P +T HF+ A+V+L   N F+++ ED+VC +V    T+ + I GN
Sbjct: 358 SSLCYN-TTVDQIDAPIITAHFKDANVELQPWNTFLRMQEDLVCFTVLP--TSDIGILGN 414

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDCT 372
           + Q NFLVG+D+ ++ VSFK  DCT
Sbjct: 415 LAQVNFLVGFDLRKKRVSFKAADCT 439


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  223 bits (569), Expect = 9e-56,   Method: Compositional matrix adjust.
 Identities = 121/241 (50%), Positives = 159/241 (65%), Gaps = 12/241 (4%)

Query: 5   LSCVFILFFLC--FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           ++ VF L FL     V S + A+  GF+VELIHRDSPKSP YNSSET + R+ +AL RS 
Sbjct: 1   MAPVFSLLFLISTASVFSAVTARDYGFTVELIHRDSPKSPMYNSSETHFDRIVNALRRSS 60

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R      N+ +  S  ++A I  N   YL+ IS+GTPP   +AVADTGSD+IWTQC+PC
Sbjct: 61  HR------NTVVLESDTAEAPIFNNGGEYLVEISVGTPPFSIVAVADTGSDVIWTQCKPC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-SLNQKSCSG-VNCQYSVSYGDGSFSNG 180
             S CY Q++P+FDP  S+TYK++ CSS  C+ S +  SCS    C YS++YGD S S G
Sbjct: 115 --SNCYQQNAPMFDPSKSTTYKNVACSSPVCSYSGDGSSCSDDSECLYSIAYGDDSHSQG 172

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           NLA +TVT+ ST+G+ VA P    GCG +N G FN+  +GIVGLG G  SL++Q+     
Sbjct: 173 NLAVDTVTMQSTSGRPVAFPRTVIGCGHDNAGTFNANVSGIVGLGRGPASLVTQLGPATG 232

Query: 241 G 241
           G
Sbjct: 233 G 233



 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 57/133 (42%), Positives = 84/133 (63%), Gaps = 3/133 (2%)

Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
           G  +LG  + +I+IDSGTTLT+LP    ++  S +S  +      DP+  L+ C++  + 
Sbjct: 300 GASKLGGES-NIIIDSGTTLTYLPSALLNSFGSAISQSMSLPHAQDPSEFLDYCFATTTD 358

Query: 301 S-QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYD 358
             ++P VT+HF GADV L R N FV++S+D +C  F     +++ IYGNI Q+NFLVGYD
Sbjct: 359 DYEMPPVTMHFEGADVPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYD 418

Query: 359 IEQQTVSFKPTDC 371
           I+   VSF+P  C
Sbjct: 419 IKNLAVSFQPAHC 431


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  222 bits (565), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 146/427 (34%), Positives = 221/427 (51%), Gaps = 68/427 (15%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +LFF   +++S   +    FS ELIHRDS KSP Y  ++  +Q + +A  RS+NR N   
Sbjct: 9   LLFFSLCFIISFSHSLRNSFSFELIHRDSSKSPLYKPAQNKFQHVVNAARRSINRANRLF 68

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           ++S    S   ++ +  N   YL+  S+GTPP     V DTGSD++W QC+PC   QCY 
Sbjct: 69  KDSL---SNTPESTVYVNGGEYLMTYSVGTPPFNVYGVVDTGSDIVWLQCKPC--EQCYK 123

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q +P+F+P  SS+YK++PCSS+ C S+   SC+  N C+Y++++ D S+S G L+ ET+T
Sbjct: 124 QTTPIFNPSKSSSYKNIPCSSNLCQSVRYTSCNKQNSCEYTINFSDQSYSQGELSVETLT 183

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------- 241
           L STTG +V+ P    GCG NN G+F  +T+GIVGLG G +SL +Q++++I G       
Sbjct: 184 LDSTTGHSVSFPKTVIGCGHNNRGMFQGETSGIVGLGIGPVSLTTQLKSSIGGKFSYCLL 243

Query: 242 ------NQRLGVSTPDIVIDSGTTLT---FL---PQGY---NSNLLSVMSSMIEAQPVAD 286
                 N+   ++  D  + SG  +    F+   PQ +        SV +  IE + + D
Sbjct: 244 PLLVDSNKTSKLNFGDAAVVSGDGVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEFEVLDD 303

Query: 287 PTGSLELCYSFNSLSQVP-EVTIHFRGAD---VKLSRS-------NFFVKVSED-----I 330
                 +  S  +L+ +P  V  +   A    VKL R        N    ++ D     I
Sbjct: 304 SEEGNIILDSGTTLTLLPSHVYTNLESAVAQLVKLDRVDDPNQLLNLCYSITSDQYDFPI 363

Query: 331 VCSVFKGIT---NSVPIYGNI---------------------MQTNFLVGYDIEQQTVSF 366
           + + FKG     N +  + ++                      Q N LVGYD++Q  VSF
Sbjct: 364 ITAHFKGADIKLNPISTFAHVADGVVCLAFTSSQTGPIFGNLAQLNLLVGYDLQQNIVSF 423

Query: 367 KPTDCTK 373
           KP+DC K
Sbjct: 424 KPSDCIK 430


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 153/440 (34%), Positives = 219/440 (49%), Gaps = 76/440 (17%)

Query: 3   TFLSCVFILFFLCFYV--VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           T LS    + FL   +   S ++A+   F+ ELIHRDSP SP +N+SET   RL +A+ R
Sbjct: 9   TLLSFALSIIFLTVSMSGFSLVQAEKLSFTTELIHRDSPNSPLFNASETTDIRLANAVER 68

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC- 119
           S +R+N FN   S S + A    I+ +N ++L++ISIG PPTE L    TGSDL+W  C 
Sbjct: 69  SADRVNRFNDLISNSITAAEFPSIL-DNGDFLMKISIGIPPTELLVNVATGSDLVWIPCL 127

Query: 120 --EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS-YGDGS 176
             +PC    C   D   FDP  SSTYK++PC S +C   N  +C   +C YS       S
Sbjct: 128 SFKPC-THNC---DLRFFDPMESSTYKNVPCDSYRCQITNAATCQFSDCFYSCDPRHQDS 183

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
             +G+LA +T+TL STTG++  LP   F CG   GG  +    GI+GLG G +SL++++ 
Sbjct: 184 CPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIGG--DYPGVGILGLGHGSLSLLNRIS 241

Query: 237 TTIAG-----------NQRLGVSTPDIVIDSGTTL------------------------- 260
             I G           NQ   +S  D  + SG+ +                         
Sbjct: 242 HLIDGKFSHCIVPYSSNQTSKLSFGDKAVVSGSAMFSTRLDMTGGPYSYTLSFYGISVGN 301

Query: 261 -------------------------TFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELC 294
                                    T+ P+ + S L   +   I+ +P+  DPT  L LC
Sbjct: 302 KSISAGGIGSDYYMNGLGMDSGTMFTYFPEYFYSQLEYDVRYAIQQEPLYPDPTRRLRLC 361

Query: 295 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIYGNIMQTNF 353
           Y ++     P +T+HF G  V+LS SN F++++EDIVC  F    +    ++G   QTN 
Sbjct: 362 YRYSPDFSPPTITMHFEGGSVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNL 421

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           L+GYD++   +SF  TDCTK
Sbjct: 422 LIGYDLDAGFLSFLKTDCTK 441


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 192/375 (51%), Gaps = 76/375 (20%)

Query: 70  QNSSISSSKAS--QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           +NSS  S K S  Q+ +   +  YL+ +SIGTPP +  A ADTGSDL+W QC PC  ++C
Sbjct: 37  RNSSHDSYKPSTIQSPVSAYDCEYLMELSIGTPPIKIYAEADTGSDLVWFQCIPC--TKC 94

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATE 185
           Y Q +P+FDP+ SS+Y ++ C +  C  L+   CS     C Y+ SY D S + G LA E
Sbjct: 95  YKQQNPMFDPRSSSSYTNITCGTESCNKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQE 154

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----- 240
           T+TL STTG+ VA  GI FGCG NN G FN +  G++GLG G +SLISQ+ +++      
Sbjct: 155 TLTLTSTTGEPVAFQGIIFGCGHNNSG-FNDREMGLIGLGRGPLSLISQIGSSLGAGGNM 213

Query: 241 ----------------------GNQRLG---VSTPDI----------------------- 252
                                 G++ LG   VSTP I                       
Sbjct: 214 FSQCLVPFNTDPSITSQMNFGKGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPF 273

Query: 253 --------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
                         +IDSGTT+T+LP+ +   L+  + + +  +P        ELCY   
Sbjct: 274 SNGSSLGTITKGNILIDSGTTITYLPEEFYHRLIEQVRNKVALEPFR--IDGYELCYQTP 331

Query: 299 SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
           +    P +TIHF G DV L+ +  F+ V +D  C            YGN  Q+N+L+G+D
Sbjct: 332 TNLNGPTLTIHFEGGDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTYGNYAQSNYLIGFD 391

Query: 359 IEQQTVSFKPTDCTK 373
           +E+Q VSFK TDCTK
Sbjct: 392 LERQVVSFKATDCTK 406


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 120/217 (55%), Positives = 152/217 (70%), Gaps = 8/217 (3%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+ +LIHRDSPKSPFYN  ET  QRLR+A+ RS+NR+ HF +  +   +   Q D+  N
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ +SIGTPP   +A+ADTGSDL+WTQC PC    CY Q  PLFDPK SSTYK + 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144

Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSSSQC +L NQ SCS  +  C YS+SYGD S++ GN+A +T+TLGS+  + + L  I  
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           GCG NN G FN K +GIVGLGGG +SLI Q+  +I G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG 241



 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 69/123 (56%), Positives = 85/123 (69%), Gaps = 1/123 (0%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
           +I+IDSGTTLT LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHF 374

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            GADVKL  SN FV+VSED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTD
Sbjct: 375 DGADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 433

Query: 371 CTK 373
           C K
Sbjct: 434 CAK 436


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  219 bits (558), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 120/217 (55%), Positives = 152/217 (70%), Gaps = 8/217 (3%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+ +LIHRDSPKSPFYN  ET  QRLR+A+ RS+NR+ HF +  +   +   Q D+  N
Sbjct: 30  GFTADLIHRDSPKSPFYNPMETSSQRLRNAIHRSVNRVFHFTEKDN---TPQPQIDLTSN 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ +SIGTPP   +A+ADTGSDL+WTQC PC    CY Q  PLFDPK SSTYK + 
Sbjct: 87  SGEYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPC--DDCYTQVDPLFDPKTSSTYKDVS 144

Query: 148 CSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSSSQC +L NQ SCS  +  C YS+SYGD S++ GN+A +T+TLGS+  + + L  I  
Sbjct: 145 CSSSQCTALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIII 204

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           GCG NN G FN K +GIVGLGGG +SLI Q+  +I G
Sbjct: 205 GCGHNNAGTFNKKGSGIVGLGGGPVSLIKQLGDSIDG 241



 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 69/123 (56%), Positives = 85/123 (69%), Gaps = 1/123 (0%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
           +I+IDSGTTLT LP  + S L   ++S I+A+   DP   L LCYS     +VP +T+HF
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATGDLKVPVITMHF 374

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            GADVKL  SN FV+VSED+VC  F+G + S  IYGN+ Q NFLVGYD   +TVSFKPTD
Sbjct: 375 DGADVKLDSSNAFVQVSEDLVCFAFRG-SPSFSIYGNVAQMNFLVGYDTVSKTVSFKPTD 433

Query: 371 CTK 373
           C K
Sbjct: 434 CAK 436


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  218 bits (556), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 141/440 (32%), Positives = 212/440 (48%), Gaps = 87/440 (19%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M  + +   +   +C  ++    + T GFSV LI ++S      ++   P +RL +    
Sbjct: 1   MVVYPTSFHLATIICLMLLPLHISATEGFSVNLIRKNSS-----HAHVLPLRRLMEL--- 52

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
                      S++  +   Q+ I     +YL+ +SIGTPP +   +ADTGSDL WT C 
Sbjct: 53  -----------SAMEKTLTPQSPIYAYLGHYLMELSIGTPPFKIYGIADTGSDLTWTSCV 101

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSN 179
           PC  + CY Q +P+FDP+ S+TY+++ C S  C  L+   CS    C Y+ +Y   + + 
Sbjct: 102 PC--NNCYKQRNPMFDPQKSTTYRNISCDSKLCHKLDTGVCSPQKRCNYTYAYASAAITR 159

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG------------ 227
           G LA ET+TL ST G++V L GI FGCG NN G FN    GI+GLGGG            
Sbjct: 160 GVLAQETITLSSTKGKSVPLKGIVFGCGHNNTGGFNDHEMGIIGLGGGPVSLISQMGSSF 219

Query: 228 --------------DISLISQMR--------------TTIAGNQR--------LGVSTPD 251
                         D+S+ S+M               T +   Q         LG+S  +
Sbjct: 220 GGKRFSQCLVPFHTDVSVSSKMSFGKGSKVSGKGVVSTPLVAKQDKTPYFVTLLGISVEN 279

Query: 252 ----------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELC 294
                           + +DSGT  T LP      +++ + S +  +PV D P    +LC
Sbjct: 280 TYLHFNGSSQNVEKGNMFLDSGTPPTILPTQLYDQVVAQVRSEVAMKPVTDDPDLGPQLC 339

Query: 295 YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
           Y   +  + P +T HF GADVKLS +  F+   + + C  F   ++   +YGN  Q+N+L
Sbjct: 340 YRTKNNLRGPVLTAHFEGADVKLSPTQTFISPKDGVFCLGFTNTSSDGGVYGNFAQSNYL 399

Query: 355 VGYDIEQQTVSFKPTDCTKQ 374
           +G+D+++Q VSFKP DCTK 
Sbjct: 400 IGFDLDRQVVSFKPKDCTKH 419


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 141/430 (32%), Positives = 216/430 (50%), Gaps = 76/430 (17%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            IL  + F   + I    G F+  L HRDS  SP   SS + Y RL +A  RSL+R    
Sbjct: 11  LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
              ++ + +   QA + P +  YL+ +SIGTPP + + +ADTGSDL+W QC PC   +CY
Sbjct: 70  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPC--LKCY 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
            Q  P+FDP  S+++  +PC+S  C +++   C     C YS +YGD +++ G+L  E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG- 246
           T+GS++ ++V       GCG +  G      +G++GLGGG +SL+SQM  T   ++R   
Sbjct: 188 TIGSSSVKSV------IGCG-HESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSY 240

Query: 247 --------------------------VSTP------------------------------ 250
                                     VSTP                              
Sbjct: 241 CLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNERHMASAKQ 300

Query: 251 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY----SFNSLSQVPE 305
            +++IDSGTTL+FLP+     ++S +  +++A+ V DP    +LC+    +  + S +P 
Sbjct: 301 GNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPI 360

Query: 306 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           +T  F  GA+V L   N F KV+ ++ C        T+   I GN+   NFL+GYD+E +
Sbjct: 361 ITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALANFLIGYDLEAK 420

Query: 363 TVSFKPTDCT 372
            +SFKPT CT
Sbjct: 421 RLSFKPTVCT 430


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  216 bits (550), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 121/235 (51%), Positives = 158/235 (67%), Gaps = 7/235 (2%)

Query: 10  ILFFLCFYVVSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           ++FF+ F  +S  EA   GGFS +LI RDSP SPFYN SET + RL+ A  RS++R NHF
Sbjct: 15  VIFFIHFSGLSHTEASNKGGFSTDLISRDSPLSPFYNPSETQFDRLQKAFHRSISRANHF 74

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             N    S+ + Q+ +I NN  YL+ IS+GTPP     +ADTGSDL+W QC+PC    CY
Sbjct: 75  RANGV--STNSIQSPVISNNGEYLMNISLGTPPVSMHGIADTGSDLLWRQCKPC--DSCY 130

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLN-QKSCSGVN-CQYSVSYGDGSFSNGNLATET 186
            Q  P+FDP  S TY+ L C    C++L  Q  CS  N C YS SYGDGS ++G+LA +T
Sbjct: 131 EQIEPIFDPAKSKTYQILSCEGKSCSNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDT 190

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           +T+GSTTG+ V++P + FGCG NNGG F    +G+VGLGGG +S+ISQ+R  I G
Sbjct: 191 LTIGSTTGRPVSVPKVVFGCGHNNGGTFELHGSGLVGLGGGPLSMISQLRPLIGG 245



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/123 (50%), Positives = 83/123 (67%), Gaps = 1/123 (0%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
           +I+IDSGTTLT LPQ +   L S + S I  +PV DP     LCYS  S  ++P +T HF
Sbjct: 324 NIIIDSGTTLTLLPQDFYGTLESNVVSAIGGKPVRDPNNVFSLCYSNLSGLRIPTITAHF 383

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            GAD++L   N FV+V ED+ C     +++ + I+GN+ Q NFLVGYD++ +TVSFKPTD
Sbjct: 384 VGADLELKPLNTFVQVQEDLFCFAMIPVSD-LAIFGNLAQMNFLVGYDLKSRTVSFKPTD 442

Query: 371 CTK 373
           CTK
Sbjct: 443 CTK 445


>gi|296085499|emb|CBI29231.3| unnamed protein product [Vitis vinifera]
          Length = 308

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 136/339 (40%), Positives = 185/339 (54%), Gaps = 70/339 (20%)

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           N  + ++S    Q+++I    +YL+ IS+GTPP   L +ADTGSDLIW QC PC    CY
Sbjct: 7   NTGNQLASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC--DDCY 64

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  PLFDPK S TYK+L                                 G L++ET T
Sbjct: 65  KQVEPLFDPKKSKTYKTL---------------------------------GYLSSETFT 91

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRL--- 245
           +GST G   + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + G       
Sbjct: 92  IGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVGGQFSYCLV 151

Query: 246 -----------------------GVSTP------DIVIDSGTTLTFLPQGYNSNLLSVMS 276
                                  G S+P      +I+IDSGTTLT LP+ + +++ S ++
Sbjct: 152 PLSSDSTASSKINFGKSAVVSGSGTSSPAAAEESNIIIDSGTTLTLLPRDFYTDMESALT 211

Query: 277 SMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFK 336
            +I  Q   DP G+  LCYS     ++P +T HF GADV+L   N FV+  ED+VC  F 
Sbjct: 212 KVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHFIGADVQLPPLNTFVQAQEDLVC--FS 269

Query: 337 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
            I +S + I+GN+ Q NFLVGYD++   VSFKPTDCTKQ
Sbjct: 270 MIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPTDCTKQ 308


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 152/430 (35%), Positives = 199/430 (46%), Gaps = 71/430 (16%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           V  LFFL   ++        GFS++LI R SP SP YNS  T  + ++ A  RS+ R   
Sbjct: 5   VLTLFFLVSTMLVDASKSLMGFSIDLIPRHSPISPLYNSQMTQTELVKSAALRSITRSKR 64

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
            N    IS   +     IP++  YL+R S+GTP  ERLA+ DTGSDL W QC PC    C
Sbjct: 65  VNFIGQISPPLSPIITPIPDHGEYLMRFSLGTPSVERLAIFDTGSDLSWLQCTPC--KTC 122

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC-SGVNCQYSVSYGDGSFSNGNLAT 184
           Y Q++PLFDP  SSTY  +PC S  C     NQ+ C S   C Y   YG  SF+ G L  
Sbjct: 123 YPQEAPLFDPTQSSTYVDVPCESQPCTLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGY 182

Query: 185 ETVTLGST-TGQAVA-LPGITFGCGTNNGGLFN--SKTTGIVGLGGGDISLISQM----- 235
           +T++  ST  GQ  A  P   FGC   +   F   +K  G VGLG G +SL SQ+     
Sbjct: 183 DTISFSSTGMGQGGATFPKSVFGCAFYSNFTFKISTKANGFVGLGPGPLSLASQLGDQIG 242

Query: 236 ----------RTTIAGNQRLG--------VSTP--------------------------- 250
                      +T  G  + G        VSTP                           
Sbjct: 243 HKFSYCMVPFSSTSTGKLKFGSMAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKVLT 302

Query: 251 -----DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE 305
                +I+IDS   LT L QG  ++ +S +   I  +   D     E C    +    PE
Sbjct: 303 GQIGGNIIIDSVPILTHLEQGIYTDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLNFPE 362

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQ 362
              HF GADV L   N F+ +  ++VC      KGI+    I+GN  Q NF V YD+ ++
Sbjct: 363 FVFHFTGADVVLGPKNMFIALDNNLVCMTVVPSKGIS----IFGNWAQVNFQVEYDLGEK 418

Query: 363 TVSFKPTDCT 372
            VSF PT+C+
Sbjct: 419 KVSFAPTNCS 428


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 151/442 (34%), Positives = 212/442 (47%), Gaps = 83/442 (18%)

Query: 8   VFILFFLCFY-VVSPIEAQT--GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
           VF    LC + + S  EA     GFS+ LIHR+SP SPFYN S TP +R+++ + RS  R
Sbjct: 5   VFCFLLLCSHSIASFAEASKTLSGFSINLIHRESPLSPFYNPSLTPSERIKNTVLRSFAR 64

Query: 65  LNH---FNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
                  +QN   S    +    IP+     YL+R  IGTPP ER A+ADTGSDLIW QC
Sbjct: 65  SKRRLRLSQNDDRSPGTIT----IPDEPITEYLMRFYIGTPPVERFAIADTGSDLIWVQC 120

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDG 175
            PC   +C  Q++PLFDP+ SST+K++PC S  C  L  +Q++C G +  C Y   YGD 
Sbjct: 121 APC--EKCVPQNAPLFDPRKSSTFKTVPCDSQPCTLLPPSQRACVGKSGQCYYQYIYGDH 178

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGLFNSK-TTGIVGLGGGDISLIS 233
           +  +G L  E++  GS    A+  P +TFGC  +NN  +  SK   G+VGLG G +SLIS
Sbjct: 179 TLVSGILGFESINFGSKN-NAIKFPKLTFGCTFSNNDTVDESKRNMGLVGLGVGPLSLIS 237

Query: 234 QMRTTIA---------------GNQRLG-----------VSTP----------------- 250
           Q+   I                   R G           VSTP                 
Sbjct: 238 QLGYQIGRKFSYCFPPLSSNSTSKMRFGNDAIVKQIKGVVSTPLIIKSIGPSYYYLNLEG 297

Query: 251 -----------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
                            +I+IDSGT+ T L Q + +  ++++  +   + V  P      
Sbjct: 298 VSIGNKKVKTSESQTDGNILIDSGTSFTILKQSFYNKFVALVKEVYGVEAVKIPPLVYNF 357

Query: 294 CYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP-IYGNIMQT 351
           C+       + P+V   F GA V++  SN F     +++C V    ++    I+GN  Q 
Sbjct: 358 CFENKGKRKRFPDVVFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQI 417

Query: 352 NFLVGYDIEQQTVSFKPTDCTK 373
            + V YD++   VSF P DC K
Sbjct: 418 GYQVEYDLQGGMVSFAPADCAK 439


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 141/422 (33%), Positives = 202/422 (47%), Gaps = 82/422 (19%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSK 78
           +P EA   GFS +LIH++SP SPFY S+   + +         N+L  F Q    S   K
Sbjct: 21  TPTEAYNKGFSFKLIHKNSPNSPFYKSNN--FHK---------NKLRSFYQVPKKSFVQK 69

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           +    +  NN +YL+++++G+PP +   + DTGSDL+W QC PC    CY Q SP+F+P 
Sbjct: 70  SPYTRVTSNNGDYLMKLTLGSPPVDIYGLVDTGSDLVWAQCTPC--GGCYRQKSPMFEPL 127

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S TY  +PC S QC+           C YS SY D S + G LA E +T  ST G  V 
Sbjct: 128 RSKTYSPIPCESEQCSFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVV 187

Query: 199 LPGITFGCGTNN-------------------------GGLFNSK--------------TT 219
           +  I FGCG +N                         G L+ SK              T+
Sbjct: 188 VGDIIFGCGHSNSGTFNENDMGIIGMGGGPLSLVSQIGTLYGSKRFSQCLVPFHTDAHTS 247

Query: 220 GIVGLGG-GDISLISQMRTTIAGNQRL--------GVSTPD---------------IVID 255
           G +  G   D+S    + T +A  +          G+S  D               I+ID
Sbjct: 248 GTINFGEESDVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRFNSSETLSKGNIMID 307

Query: 256 SGTTLTFLPQGYNSNL---LSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           SGT  T++PQ +   L   L V SS++  +   DP    +LCY   +  + P +T HF G
Sbjct: 308 SGTPATYIPQEFYERLVEELKVQSSLLPIE--DDPDLGTQLCYRSETNLEGPILTAHFEG 365

Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ADV+L     F+   + + C    G T+   I+GN  Q+N L+G+D++++T+SFKPTDCT
Sbjct: 366 ADVQLLPIQTFIPPKDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCT 425

Query: 373 KQ 374
            Q
Sbjct: 426 NQ 427


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 120/241 (49%), Positives = 155/241 (64%), Gaps = 7/241 (2%)

Query: 4   FLSCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+ C   I+  + F   S  EA+  GF+ + I RDSP SPFYN SET YQRL+ A  RS+
Sbjct: 8   FVFCTLAIIILIHFSEHSHAEAKIDGFTTDFISRDSPHSPFYNPSETKYQRLQKAFRRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            R NHF   +  +S    Q+D+I     YL+ IS+GTPP   L +ADTGSDLIW QC PC
Sbjct: 68  LRGNHFR--AMRASPNDIQSDVISGGGAYLMNISLGTPPVPMLGIADTGSDLIWRQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
           P   CY Q  PLFDPK S TYK+L C +  C  L Q+ SC   N C YS SYGD S++ G
Sbjct: 126 P--NCYEQVEPLFDPKESETYKTLDCDNEFCQDLGQQGSCDDDNTCTYSYSYGDRSYTRG 183

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           +L+++T+T+GST G   + PGI FGCG +NGG FN K  G++GLGGG +SL+ Q+ + + 
Sbjct: 184 DLSSDTLTIGSTEGDPASFPGIAFGCGHDNGGTFNEKDGGLIGLGGGPLSLVMQLSSEVG 243

Query: 241 G 241
           G
Sbjct: 244 G 244



 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 62/129 (48%), Positives = 85/129 (65%), Gaps = 3/129 (2%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE 305
            V   +I+IDSGTTLT LPQ + +++ S +++ I  Q   DP G   LCYS  +  ++P 
Sbjct: 318 AVEEGNIIIDSGTTLTLLPQDFYTDVESALTNAIGGQTTTDPNGIFSLCYSSVNNLEIPT 377

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 364
           +T HF GADV+L   N FV+V ED+VC  F  I +S + I+GN+ Q NFLVGYD++   V
Sbjct: 378 ITAHFTGADVQLPPLNTFVQVQEDLVC--FSMIPSSNLAIFGNLAQINFLVGYDLKNNKV 435

Query: 365 SFKPTDCTK 373
           SFK TDCT+
Sbjct: 436 SFKQTDCTE 444


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score =  210 bits (535), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 102/212 (48%), Positives = 145/212 (68%), Gaps = 7/212 (3%)

Query: 11  LFFLCFYV-VSPIEA-QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
           +  LC Y+ +S + A   GGFSVE+IHRDS +SP+Y  +ET +QR+ +AL RS+NR NHF
Sbjct: 12  IVLLCLYINISFLNALDGGGFSVEIIHRDSSRSPYYRPTETQFQRVANALRRSINRANHF 71

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           N+ + ++S+  +++ +I +   YL+  S+GTPP + L + DTGSD+IW QC+PC    CY
Sbjct: 72  NKPNLVASTNTAESTVIASQGEYLMSYSVGTPPFQILGIVDTGSDIIWLQCQPC--EDCY 129

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
            Q +P+FDP  S TYK+LPCSS+ C S+ +  SCS  N  C+Y+++YGD S S G+L+ E
Sbjct: 130 NQTTPIFDPSQSKTYKTLPCSSNICQSVQSAASCSSNNDECEYTITYGDNSHSQGDLSVE 189

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
           T+TLGST G +V  P    GCG NN G F  +
Sbjct: 190 TLTLGSTDGSSVQFPKTVIGCGHNNKGTFQRE 221



 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 62/126 (49%), Positives = 86/126 (68%), Gaps = 3/126 (2%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 308
           +I+IDSGTTLT LP+    NL S ++  IE + V DP+  L LCY   S  +  VP +T 
Sbjct: 320 NIIIDSGTTLTILPEDDYLNLESAVADAIELERVEDPSKFLRLCYRTTSSDELNVPVITA 379

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           HF+GADV+L+  + F++V E +VC  F+  +   PI+GN+ Q N LVGYD+ +QTVSFKP
Sbjct: 380 HFKGADVELNPISTFIEVDEGVVCFAFRS-SKIGPIFGNLAQQNLLVGYDLVKQTVSFKP 438

Query: 369 TDCTKQ 374
           TDCT++
Sbjct: 439 TDCTQE 444


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 146/441 (33%), Positives = 217/441 (49%), Gaps = 92/441 (20%)

Query: 1   MATFLSCVF--ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           MA  +S  F  ILF + F   + I    G F+  L HRDS  SP   SS + Y RL +A 
Sbjct: 1   MAATISLFFHLILFLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLANAF 59

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
            RSL+R       ++ S +   Q+ II            GTPP + L +ADTGSDL W Q
Sbjct: 60  RRSLSRSAALLNRAATSGAVGLQSSII------------GTPPVDYLGIADTGSDLTWAQ 107

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGS 176
           C PC   +CY Q  P+F+P  S+++  +PC++  C +++   C GV   C YS +YGD +
Sbjct: 108 CLPCL--KCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVDDGHC-GVQGVCDYSYTYGDRT 164

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +S G+L  E +T+GS++ ++V       GCG  + G F    +G++GLGGG +SL+SQM 
Sbjct: 165 YSKGDLGFEKITIGSSSVKSV------IGCGHASSGGFGF-ASGVIGLGGGQLSLVSQMS 217

Query: 237 TTIAGNQRLG---------------------------VSTP------------------- 250
            T   ++R                             VSTP                   
Sbjct: 218 QTSGISRRFSYCLPTLLSHANGKINFGQNAVVSGPGVVSTPLISKNTVTYYYITLEAISI 277

Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--- 295
                       +++IDSGTTL+FLP+     ++S +  +++A+ V DP    +LC+   
Sbjct: 278 GNERHMAFAKQGNVIIDSGTTLSFLPKELYDGVVSSLLKVVKAKRVKDPGNFWDLCFDDG 337

Query: 296 -SFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQT 351
            +  + S +P +T  F  GA+V L   N F KV+ ++ C        T+   I GN+   
Sbjct: 338 INVATSSGIPIITAQFSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEFGIIGNLALA 397

Query: 352 NFLVGYDIEQQTVSFKPTDCT 372
           NFL+GYD+E + +SFKPT CT
Sbjct: 398 NFLIGYDLEAKRLSFKPTVCT 418


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 142/438 (32%), Positives = 200/438 (45%), Gaps = 106/438 (24%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           +L+ +F+LF +    +S IEAQ  GF+++L  + S                        N
Sbjct: 18  YLAIIFLLFHVLH--LSSIEAQNDGFTIKLFRKTS------------------------N 51

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
            + +             QA I      +L+ I IGTPP +   + DTGSDLIW QC PC 
Sbjct: 52  NIQNI-----------VQAPINAYIGQHLMEIYIGTPPIKITGLVDTGSDLIWIQCAPC- 99

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNL 182
              CY Q  P+FDP  SSTY ++ C S  C  L+   CS    C Y+  YGD S + G L
Sbjct: 100 -LGCYKQIKPMFDPLKSSTYNNISCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVL 158

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
           A +T T  S TG+ V+L    FGCG NN G FN    G++GLGGG  SLISQ+     G 
Sbjct: 159 AQDTATFTSNTGKPVSLSRFLFGCGHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGK 218

Query: 243 Q------------------------------------------------RLGVSTPD--- 251
           +                                                 LG+S  D   
Sbjct: 219 KFSQCLVPFLTDIKISSRMSFGKGSQVLGNGVVTTPLVPREKDTSYFVTLLGISVEDTYF 278

Query: 252 ----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSL 300
                     +++DSGT    LPQ     + + + + +  +P+  DP+   +LCY   + 
Sbjct: 279 PMNSTIGKANMLVDSGTPPILLPQQLYDKVFAEVRNKVALKPITDDPSLGTQLCYRTQTN 338

Query: 301 SQVPEVTIHFRGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVP-IYGNIMQTNFLVG 356
            + P +T HF GA+V L+    F+     ++ I C      TNS P +YGN  Q+N+L+G
Sbjct: 339 LKGPTLTFHFVGANVLLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIG 398

Query: 357 YDIEQQTVSFKPTDCTKQ 374
           +D+++Q VSFKPTDCTKQ
Sbjct: 399 FDLDRQVVSFKPTDCTKQ 416


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  205 bits (522), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 116/241 (48%), Positives = 157/241 (65%), Gaps = 7/241 (2%)

Query: 4   FLSCVFILFFLCFYVV-SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+ C+  + FL ++   S  EA+  GF+ + I RDSP+SPFYN SET YQRL+ A  RS+
Sbjct: 8   FVFCLLAIIFLIYFAKHSQAEAKVDGFTTDFISRDSPRSPFYNPSETKYQRLQKAFRRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            R NHF   +  +S    Q+++I    +YL+ IS+GTPP   L +ADTGSDLIW QC PC
Sbjct: 68  LRGNHFR--AIRASPNDIQSNVISGGGSYLMNISLGTPPVSMLGIADTGSDLIWRQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVN-CQYSVSYGDGSFSNG 180
               CY Q  PLFDPK S TYK+L C++  C  L Q+ SC   N C  S SYGD S++  
Sbjct: 126 --DDCYKQVEPLFDPKKSKTYKTLGCNNDFCQDLGQQGSCGDDNTCTSSYSYGDQSYTRR 183

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           +L++ET T+GST G   + PG+ FGCG +NGG FN K +G++GLGGG +SL+ Q+ + + 
Sbjct: 184 DLSSETFTIGSTEGDPASFPGLAFGCGHSNGGTFNEKDSGLIGLGGGPLSLVMQLSSKVG 243

Query: 241 G 241
           G
Sbjct: 244 G 244



 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 62/125 (49%), Positives = 85/125 (68%), Gaps = 3/125 (2%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
           +I+IDSGTTLT LP+ + +++ S ++ +I  Q   DP G+  LCYS     ++P +T HF
Sbjct: 323 NIIIDSGTTLTLLPRDFYTDMESALTKVIGGQTTTDPRGTFSLCYSGVKKLEIPTITAHF 382

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            GADV+L   N FV+  ED+VC  F  I +S + I+GN+ Q NFLVGYD++   VSFKPT
Sbjct: 383 IGADVQLPPLNTFVQAQEDLVC--FSMIPSSNLAIFGNLSQMNFLVGYDLKNNKVSFKPT 440

Query: 370 DCTKQ 374
           DCTKQ
Sbjct: 441 DCTKQ 445


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 95/190 (50%), Positives = 130/190 (68%), Gaps = 5/190 (2%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GFSVE+IHRDS +SP Y  +ETP+QR+ +A+ RS+NR NHFN+ S ++S+  +++ +  +
Sbjct: 34  GFSVEMIHRDSSRSPLYRHTETPFQRVANAMRRSINRANHFNKKSFVASTNTAESTVKAS 93

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+  S+GTPP E L V DTGS + W QC+ C    CY Q +P+FDP  S TYK+LP
Sbjct: 94  QGEYLMSYSVGTPPFEILGVVDTGSGITWMQCQRC--EDCYEQTTPIFDPSKSKTYKTLP 151

Query: 148 CSSSQCAS-LNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CSS+ C S ++  SCS   + C+Y++ YGDGS S G+L+ ET+TLGST G +V  P    
Sbjct: 152 CSSNMCQSVISTPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLTLGSTNGSSVQFPNTVI 211

Query: 205 GCGTNNGGLF 214
           GCG NN G F
Sbjct: 212 GCGHNNKGTF 221



 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 65/125 (52%), Positives = 86/125 (68%), Gaps = 3/125 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
           I+IDSGTTLT LPQ   SNL S ++  I+A  V+DP+  L LCY      Q  VP +T H
Sbjct: 327 IIIDSGTTLTLLPQEDYSNLESAVADAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAH 386

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           F+GADV+L+  + FV+V+E +VC  F   +  V I+GN+ Q N LVGYD+ +QTVSFKPT
Sbjct: 387 FKGADVELNPISTFVQVAEGVVCFAFHS-SEVVSIFGNLAQLNLLVGYDLMEQTVSFKPT 445

Query: 370 DCTKQ 374
           DCT++
Sbjct: 446 DCTQE 450


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 137/417 (32%), Positives = 194/417 (46%), Gaps = 89/417 (21%)

Query: 24  AQTGGFSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           A   GF+++LI  +SP  SPFY S E    RL                      S     
Sbjct: 3   ADNSGFTIQLIRHNSPNYSPFYKSDELHMHRL---------------------GSNGVFT 41

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +  NN +YL+++++GTPP +   + DTGSDL+W QC PC    CY Q SP+F+P  S+T
Sbjct: 42  RVTSNNGDYLMKLTLGTPPVDVYGLVDTGSDLVWAQCTPC--QGCYRQKSPMFEPLRSNT 99

Query: 143 YKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y  +PC S +C SL   SCS    C YS +Y D S + G LA ETVT  ST G+ V +  
Sbjct: 100 YTPIPCDSEECNSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGD 159

Query: 202 ITFGCGTNN-------------------------GGLFNSK--------------TTGIV 222
           I FGCG +N                         G L+ SK              T G +
Sbjct: 160 IVFGCGHSNSGTFNENDMGIIGLGGGPLSLVSQFGNLYGSKRFSQCLVPFHADPHTLGTI 219

Query: 223 GLG-GGDIS--------LISQMRTT---------------IAGNQRLGVSTPDIVIDSGT 258
             G   D+S        L+S+   T               ++ N    +S  +I+IDSGT
Sbjct: 220 SFGDASDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSSEMLSKGNIMIDSGT 279

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVAD-PTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
             T+LPQ +   L+  +       P+ D P    +LCY   +  + P +  HF GADV+L
Sbjct: 280 PATYLPQEFYDRLVKELKVQSNMLPIDDDPDLGTQLCYRSETNLEGPILIAHFEGADVQL 339

Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
                F+   + + C    G T+   I+GN  Q+N L+G+D++++TVSFK TDC+ Q
Sbjct: 340 MPIQTFIPPKDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSNQ 396


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 115/244 (47%), Positives = 157/244 (64%), Gaps = 14/244 (5%)

Query: 9   FILFFLCFYVVSPI------EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           F+   +CF  +SP        +   GFS+ LIHRDSP SP YN + T + RLR+A +RS+
Sbjct: 8   FVTIVICFISLSPFPLLGAAASPDPGFSLNLIHRDSPLSPLYNPNHTDFDRLRNAFSRSI 67

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R+N F   +   +S   Q D++PN   Y +++SIGTP  E + +ADTGSDL W QC PC
Sbjct: 68  SRVNVFKTKAVDINS--FQNDLVPNGGEYFMKMSIGTPLVEVIVIADTGSDLTWVQCLPC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFS 178
            P  CY Q SPLFDP  SS+Y+ + C S  C +L+  +++C+     C+Y  SYGD S++
Sbjct: 126 DP--CYRQKSPLFDPSRSSSYRHMLCGSRFCNALDVSEQACTMDTNICEYHYSYGDKSYT 183

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           NGNLATE  T+GST+ + V L  I FGCGT NGG F+   +GIVGLGGG +SL+SQ+ + 
Sbjct: 184 NGNLATEKFTIGSTSSRPVHLSPIVFGCGTGNGGTFDELGSGIVGLGGGALSLVSQLSSI 243

Query: 239 IAGN 242
           I G 
Sbjct: 244 IKGK 247



 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 63/145 (43%), Positives = 89/145 (61%), Gaps = 12/145 (8%)

Query: 240 AGNQRL---------GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
            GN+RL          V   +++IDSGTTLTFL   + + L  V+   ++A+ V+DP G 
Sbjct: 301 VGNKRLPYTNGLLNGNVEKGNVIIDSGTTLTFLDSEFFTELERVLEETVKAERVSDPRGL 360

Query: 291 LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIM 349
             +C+       +P + +HF  ADVKL   N FVK  ED++C  F  I +N + I+GN+ 
Sbjct: 361 FSVCFRSAGDIDLPVIAVHFNDADVKLQPLNTFVKADEDLLC--FTMISSNQIGIFGNLA 418

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTKQ 374
           Q +FLVGYD+E++TVSFKPTDCTK 
Sbjct: 419 QMDFLVGYDLEKRTVSFKPTDCTKH 443


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score =  204 bits (518), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 148/449 (32%), Positives = 208/449 (46%), Gaps = 84/449 (18%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +F S + IL  +     + I+A    F+ ELIH DSP SPF+N+SET   RL  AL RS 
Sbjct: 12  SFTSLIIILSTVFLSSFAIIQADKFSFTAELIHIDSPNSPFFNASETTTHRLAKALQRSA 71

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           NR+   N  S+  S +   A I   + NYL+++ IGTPPTE  A  DTGS++IW  C  C
Sbjct: 72  NRVARLNPLSN--SDEGVHASIFSGDGNYLMKLLIGTPPTEIHAAIDTGSNVIWIPCINC 129

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG-SFSNGN 181
               C+ Q S +F+P  SSTY+  PC S QC + +    S   C YS       +  NG 
Sbjct: 130 --KDCFNQSSSIFNPLASSTYQDAPCDSYQCETTSSSCQSDNVCLYSCDEKHQLNCPNGR 187

Query: 182 LATETVTLGSTTGQAVALPGITFGC--------------GTNNGGLFNSKTTGIVGLGGG 227
           +A +T+TL S+ G+   LP   F C              G   G L  S T+ +  L  G
Sbjct: 188 IAVDTMTLTSSDGRPFPLPYSDFVCGNSIYKTFAGVGVIGLGRGAL--SLTSKLYHLSDG 245

Query: 228 DISL-------------------------ISQMRTTIAGNQRLG---------------- 246
             S                          +  + TT+  ++  G                
Sbjct: 246 KFSYCLADYYSKQPSKINFGLQSFISDDDLEVVSTTLGHHRHSGNYYVTLEGISVGEKRQ 305

Query: 247 ----VSTP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS------ 290
               V  P      +++IDSGT  T LP+ +   L S +S  I   P   P  S      
Sbjct: 306 DLYYVDDPFAPPVGNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSM 365

Query: 291 -----LELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPI 344
                L  C+ +    + P++TIHF  ADV+LS  N F++V+ED+VC  F         +
Sbjct: 366 DNTLKLSPCFWYYPELKFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTV 425

Query: 345 YGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           YG+  Q NF++GYD+++ TVSFK TDC+K
Sbjct: 426 YGSWQQMNFILGYDLKRGTVSFKRTDCSK 454


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 142/421 (33%), Positives = 191/421 (45%), Gaps = 101/421 (23%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           IEAQ  GF+V+LI + S                            H + N+        Q
Sbjct: 26  IEAQNDGFTVKLIRKSS----------------------------HLSSNNI---QDIVQ 54

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I      YL+ + IGTPP +     DTGSDLIW QC PC    CY Q +P+FDP  SS
Sbjct: 55  APINAYIGQYLMELYIGTPPIKISGTVDTGSDLIWVQCVPC--LGCYNQINPMFDPLKSS 112

Query: 142 TYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           TY ++ C S  C       CS    C Y+  Y D S + G LA ETVTL S TG+ ++L 
Sbjct: 113 TYTNISCDSPLCYKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQ 172

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------------------ 242
           GI FGCG NN G FN    G++GLGGG  SL+SQ+     G                   
Sbjct: 173 GILFGCGHNNTGNFNDHEMGLIGLGGGPTSLVSQIGPLFGGKKFSQCLVPFLTDITISSQ 232

Query: 243 --------------------QR-----------LGVSTPD-------------IVIDSGT 258
                               QR           LG+S  D             +++DSGT
Sbjct: 233 MSFGKGSEVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYLPMNSTIEKGNMLVDSGT 292

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
               LPQ     +   + + +  +P+  DP+   +LCY   +  + P +T HF GA++ L
Sbjct: 293 PPNILPQQLYDRVYVEVKNKVPLEPITDDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLL 352

Query: 318 SRSNFFVKVSED---IVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           +    F+  + +   + C       NS P IYGN  QTN+L+G+D+++Q VSFKPTDCTK
Sbjct: 353 TPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTDCTK 412

Query: 374 Q 374
           Q
Sbjct: 413 Q 413


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 116/244 (47%), Positives = 155/244 (63%), Gaps = 13/244 (5%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           +FI F       S +EA+  GFS  LIHRDS  SP YN  +T + RLR++  RS++R N 
Sbjct: 11  LFIAFISMVSAFSLVEARNAGFSANLIHRDSSVSPLYNPRDTYFDRLRNSFHRSISRANR 70

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           F  NS IS+    Q+DI+P    YL+RISIG P  E LA+ADTGSDLIW QC+PC    C
Sbjct: 71  FKPNS-ISARALVQSDIVPGGGEYLMRISIGNPQVEILAIADTGSDLIWVQCQPC--EMC 127

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSG----VNCQYSVSYGDGSFSNGN 181
           Y Q+SP+FDP+ SS+Y+++ C +  C  L+   +SC        C Y+ SYGD SFS+G+
Sbjct: 128 YKQNSPIFDPRRSSSYRNVLCGNEFCNKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGH 187

Query: 182 LATETVTLGST---TGQAVA-LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
           LA E   +GST   T  A+A    + FGCGT NGG F+   +GI+GLGGG +SL+SQ+  
Sbjct: 188 LAIERFGIGSTNSNTSAAIAYFQEVAFGCGTKNGGTFDELGSGIIGLGGGSMSLVSQLGP 247

Query: 238 TIAG 241
            ++G
Sbjct: 248 KLSG 251



 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/129 (47%), Positives = 85/129 (65%), Gaps = 3/129 (2%)

Query: 247 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV 306
           V   +I+IDSGTTLTFL   + +NL S +   ++ + V+DP G   +C+      ++P +
Sbjct: 323 VEKGNIIIDSGTTLTFLDSEFFNNLDSAVEEAVKGERVSDPHGLFNICFKDEKAIELPII 382

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           T HF GADV+L   N F KV ED++C  F  I +N + I+GN+ Q NFLVGYD+E++ VS
Sbjct: 383 TAHFTGADVELQPVNTFAKVEEDLLC--FTMIPSNDIAIFGNLAQMNFLVGYDLEKKAVS 440

Query: 366 FKPTDCTKQ 374
           F PTDCTKQ
Sbjct: 441 FLPTDCTKQ 449


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  199 bits (505), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 113/237 (47%), Positives = 149/237 (62%), Gaps = 9/237 (3%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           + + FFL F V          FSVELIHRDSP SP YN   T   RL  A  RS++R   
Sbjct: 5   ILLCFFLFFSVTLSSSGHPKNFSVELIHRDSPLSPIYNPQITVTDRLNAAFLRSVSRSRR 64

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           FN   S +     Q+ +I  +  + + I+IGTPP +  A+ADTGSDL W QC+PC   QC
Sbjct: 65  FNHQLSQTDL---QSGLIGADGEFFMSITIGTPPIKVFAIADTGSDLTWVQCKPC--QQC 119

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLA 183
           Y ++ P+FD K SSTYKS PC S  C +L+  ++ C   N  C+Y  SYGD SFS G++A
Sbjct: 120 YKENGPIFDKKKSSTYKSEPCDSRNCQALSSTERGCDESNNICKYRYSYGDQSFSKGDVA 179

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           TETV++ S +G  V+ PG  FGCG NNGG F+   +GI+GLGGG +SLISQ+ ++I+
Sbjct: 180 TETVSIDSASGSPVSFPGTVFGCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS 236



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 63/130 (48%), Positives = 84/130 (64%), Gaps = 3/130 (2%)

Query: 245 LGVSTPDIVIDSGTTLTFLPQGYNSNLLS-VMSSMIEAQPVADPTGSLELCYSFNSLS-Q 302
           L  ++ +I+IDSGTTLT L  G+     S V  S+  A+ V+DP G L  C+   S    
Sbjct: 316 LSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEIG 375

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           +PE+T+HF GADV+LS  N FVK+SED+VC +    T  V IYGN  Q +FLVGYD+E +
Sbjct: 376 LPEITVHFTGADVRLSPINAFVKLSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETR 434

Query: 363 TVSFKPTDCT 372
           TVSF+  DC+
Sbjct: 435 TVSFQHMDCS 444


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 141/426 (33%), Positives = 210/426 (49%), Gaps = 73/426 (17%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
           VV+PIE+Q  GFSVELIH DS +SPFYN  ET  QR+ + +T S+ R ++ N   S+S +
Sbjct: 16  VVTPIESQNRGFSVELIHPDSSRSPFYNIRETQLQRISNVVTHSIKRAHYLNHVFSLSHN 75

Query: 78  KASQADIIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
              +  IIP   + Y++  SIGTPP +   V DTGSD IW QC+PC P  C  Q SP+F+
Sbjct: 76  DLPKPTIIPYAGSYYVMSYSIGTPPFQLYGVVDTGSDGIWFQCKPCKP--CLNQTSPIFN 133

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SSTYK++ CSS  C    +  CS      C+Y ++Y D S S G+++ +T+TL S  
Sbjct: 134 PSKSSTYKNIRCSSPICKRGEKTRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSND 193

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN---------QR 244
           G  ++ P I  GCG  N        +GI+G G G+ S++SQ+ ++I G           +
Sbjct: 194 GSPISFPKIVIGCGHKNSLTTEGLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLASLFSK 253

Query: 245 LGVSTP----DIVIDSGTTL-------TFLPQGYNSNL--LSVMSSMIEAQP---VADPT 288
             +S+     D+ + SG  +       +F    Y +NL   SV   +I+ +    + D  
Sbjct: 254 ANISSKLYFGDMAVVSGHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDNE 313

Query: 289 GSLELCYSFNSLSQVP-EVTIHFRGA---DVKLSR------------SNFFVKVSEDIVC 332
           G+  +  S ++++Q+P +V      A    VKL R                 K    I+ 
Sbjct: 314 GN-AVIDSGSTITQLPNDVYSQLETAVISMVKLKRVKDPTQQLSLCYKTTLKKYEVPIIT 372

Query: 333 SVFKGITNSVPIYGNIMQTN-------------------------FLVGYDIEQQTVSFK 367
           + F+G    +  +   +Q N                         FLVGYD  +  +SFK
Sbjct: 373 AHFRGADVKLNAFNTFIQMNHEVMCFAFNSSAFPWVVYGNIAQQNFLVGYDTLKNIISFK 432

Query: 368 PTDCTK 373
           PT+CTK
Sbjct: 433 PTNCTK 438


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  191 bits (484), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 146/432 (33%), Positives = 204/432 (47%), Gaps = 89/432 (20%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLR---------DALTRSLNRLNHFNQNSSI 74
           A  GGFSV+ IHRDS +SP+ + + +P+ R           + L RS +  +      S 
Sbjct: 28  AGEGGFSVDFIHRDSARSPYRHPALSPHARALAAARRSLRGEVLGRSYSGASPAAAPVSA 87

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP- 133
           +     ++ II  +  YL+ +++GTPPT+ LA+ADTGSDL+W  C     S   + D+  
Sbjct: 88  ADGGV-ESKIITRSFEYLMYVNVGTPPTQLLAIADTGSDLVWVNCS---SSGGGLADADA 143

Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVT 188
               +F P  SSTY  L C S+ C +L+Q SC     CQY  SYGDGS + G L+TET +
Sbjct: 144 GGNVVFQPTRSSTYSQLSCQSNACQALSQASCDADSECQYQYSYGDGSRTIGVLSTETFS 203

Query: 189 L--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRL- 245
              G   GQ V +P + FGC T + G F S   G+VGLG G  SL+SQ+  T   +++L 
Sbjct: 204 FVDGGGKGQ-VRVPRVNFGCSTASAGTFRSD--GLVGLGAGAFSLVSQLGATTHIDRKLS 260

Query: 246 ---------------------------GVSTPDIVID----------------------- 255
                                        STP +  D                       
Sbjct: 261 YCLIPSYDANSSSTLNFGSRAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVATHD 320

Query: 256 ------SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VP 304
                 SGTTLTFL       L++ +   I+ Q V  P   L+LCY     S+     +P
Sbjct: 321 SRIIVDSGTTLTFLDPALLGPLVTELERRIKLQRVQPPEQLLQLCYDVQGKSETDNFGIP 380

Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQ 361
           +VT+ F  GA V L   N F  + E  +C V   ++ S P  I GNI Q NF VGYD++ 
Sbjct: 381 DVTLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDA 440

Query: 362 QTVSFKPTDCTK 373
           +TV+F   DC +
Sbjct: 441 RTVTFAAADCAR 452


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 144/428 (33%), Positives = 199/428 (46%), Gaps = 81/428 (18%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN--RLNHFNQNSSISSSKASQA 82
           + GGFSV+ IHRDS +SPF   S  P+ R   A  RSL    L  +   +S +     +A
Sbjct: 26  EAGGFSVDFIHRDSARSPFAQPSLPPHARALAAARRSLRGAALGRYVGGASPAPGPVPEA 85

Query: 83  D------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
           D      II  +  YL+ +++GTPP + LA+ADTGSDL+W  C            + +F 
Sbjct: 86  DGGVESKIITRSFEYLMYVNVGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFH 145

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  S+TY  L C S+ C +L+Q SC     CQY  +YGDGS + G L+TET +  +  G 
Sbjct: 146 PSRSTTYSLLSCQSAACQALSQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGG 205

Query: 196 A---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM----------------- 235
               V +P ++FGC T + G F S   G+VGLG G +SL+SQ+                 
Sbjct: 206 GEGQVRVPRVSFGCSTGSAGSFRSD--GLVGLGAGALSLVSQLGAAARIARRFSYCLVPP 263

Query: 236 ------RTTIAGNQRLGVSTPDIV--------IDS------------------------- 256
                  +T++   R  VS P           +DS                         
Sbjct: 264 YAAANSSSTLSFGARAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQDVASANSSRII 323

Query: 257 ---GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-----VPEVTI 308
              GTTLTFL       L++ +   I       P   L+LCY     SQ     +P+VT+
Sbjct: 324 VDSGTTLTFLDPALLRPLVAELERRIRLPRAQPPEQLLQLCYDVQGKSQAEDFGIPDVTL 383

Query: 309 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 365
            F  GA V L   N F  + E  +C V   ++ S P  I GNI Q NF VGYD++ +TV+
Sbjct: 384 RFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDARTVT 443

Query: 366 FKPTDCTK 373
           F   DCT+
Sbjct: 444 FAAVDCTR 451


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 112/242 (46%), Positives = 150/242 (61%), Gaps = 14/242 (5%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           TFL C   L  + F+  S   A     +VELIHRDSP SP YN   T   RL  A  RS+
Sbjct: 5   TFLYCS--LLAISFFFASNSSANRENLTVELIHRDSPHSPLYNPHHTVSDRLNAAFLRSI 62

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   F   + +      Q+ +I N   Y + ISIGTPP++  A+ADTGSDL W QC+PC
Sbjct: 63  SRSRRFTTKTDL------QSGLISNGGEYFMSISIGTPPSKVFAIADTGSDLTWVQCKPC 116

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFS 178
              QCY Q+SPLFD K SSTYK+  C S  C +L  +++ C      C+Y  SYGD SF+
Sbjct: 117 --QQCYKQNSPLFDKKKSSTYKTESCDSKTCQALSEHEEGCDESKDICKYRYSYGDNSFT 174

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G++ATET+++ S++G +V+ PG  FGCG NNGG F    +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGDVATETISIDSSSGSSVSFPGTVFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234

Query: 239 IA 240
           I 
Sbjct: 235 IG 236



 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/133 (45%), Positives = 85/133 (63%), Gaps = 3/133 (2%)

Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYNSNL-LSVMSSMIEAQPVADPTGSLELCY-SFNS 299
           N +    T +I+IDSGTTLT L  G+  +   +V  S+  A+ V+DP G L  C+ S + 
Sbjct: 311 NGKSSKRTGNIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKRVSDPQGLLTHCFKSGDK 370

Query: 300 LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
              +P +T+HF  ADVKLS  N FVK++ED VC +    T  V IYGN++Q +FLVGYD+
Sbjct: 371 EIGLPAITMHFTNADVKLSPINAFVKLNEDTVC-LSMIPTTEVAIYGNMVQMDFLVGYDL 429

Query: 360 EQQTVSFKPTDCT 372
           E +TVSF+  DC+
Sbjct: 430 ETKTVSFQRMDCS 442


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 105/216 (48%), Positives = 139/216 (64%), Gaps = 9/216 (4%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
            SVELIHRDSP SP YN   T   RL  A  RS++R    N    I S    Q+ +I  +
Sbjct: 26  LSVELIHRDSPLSPLYNPKNTVTDRLNAAFLRSISRSRRLNN---ILSQTDLQSGLIGAD 82

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             + + I+IGTPP +  A+ADTGSDL W QC+PC   QCY ++ P+FD K SSTYKS PC
Sbjct: 83  GEFFMSITIGTPPMKVFAIADTGSDLTWVQCKPC--QQCYKENGPIFDKKKSSTYKSEPC 140

Query: 149 SSSQCASLN--QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            S  C +L+  ++ C      C+Y  SYGD SFS G++ATET+++ S +G  V+ PG  F
Sbjct: 141 DSRNCHALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           GCG NNGG F+   +GI+GLGGG +SLISQ+ ++I+
Sbjct: 201 GCGYNNGGTFDETGSGIIGLGGGHLSLISQLGSSIS 236



 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 60/124 (48%), Positives = 81/124 (65%), Gaps = 3/124 (2%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSFNSLS-QVPEVTI 308
           +I+IDSGTTLT L  G+     + +  ++  A+ V+DP G L  C+   S    +PE+T+
Sbjct: 322 NIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKRVSDPQGLLSHCFKSGSAEIGLPEITV 381

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           HF GADV+LS  N FVKVSED+VC +    T  V IYGN  Q +FLVGYD+E +TVSF+ 
Sbjct: 382 HFTGADVRLSPINAFVKVSEDMVC-LSMVPTTEVAIYGNFAQMDFLVGYDLETRTVSFQR 440

Query: 369 TDCT 372
            DC+
Sbjct: 441 MDCS 444


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  187 bits (474), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 133/399 (33%), Positives = 196/399 (49%), Gaps = 84/399 (21%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F L  + +++ +   +   GF+++LIHR S  S                   
Sbjct: 3   LATTMIAIF-LQIITYFLFTTTASSPHGFTIDLIHRRSNAS------------------- 42

Query: 61  SLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
                     +S +S+++A    AD + +   YL+++ IGTPP E  AV DTGS+LIWTQ
Sbjct: 43  ----------SSRVSNTQAGSPYADTVFDTYEYLMKLQIGTPPFEVEAVLDTGSELIWTQ 92

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           C PC    CY Q +P+FDP  SST+K   C++   +           C Y + Y D S++
Sbjct: 93  CLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPDHS-----------CPYKLVYDDKSYT 139

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM-- 235
            G LATETVT+ ST+G    +P    GC  NN G  F   ++GIVGL  G +SLISQM  
Sbjct: 140 QGTLATETVTIHSTSGVPFVMPETIIGCSRNNSGSGFRPSSSGIVGLSRGSLSLISQMGG 199

Query: 236 ---------RTTIAGNQRLG------------------VSTP------DIVIDSGTTLTF 262
                     T  A   + G                  V TP      +IVIDSGT LT+
Sbjct: 200 AYPGDGVVSTTMFAKTAKRGQYYLNLDAVSVGDTRIETVGTPFHALNGNIVIDSGTPLTY 259

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
            P  Y + +   +  ++ A  V DP+ +  LCY  N++   P +T+HF  GAD+ L + N
Sbjct: 260 FPVSYCNLVRKAVERVVTADRVVDPSRNDMLCYYSNTIEIFPVITVHFSGGADLVLDKYN 319

Query: 322 FFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
            +++++   + C ++       V I+GN  Q NFLVGYD
Sbjct: 320 MYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358



 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 166/353 (47%), Gaps = 78/353 (22%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +N+ YL+++ +GTPP E  AV DTGS++ WTQC PC    CY Q++P+FDP  SS
Sbjct: 371 ADTVFDNSVYLMKLQVGTPPFEIEAVIDTGSEITWTQCLPC--VHCYKQNAPIFDPSKSS 428

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C   +C Y V Y D +++ G LAT+TVT+ ST+G+   +  
Sbjct: 429 TFK-------------EKRCHDHSCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAE 475

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------MRTTIAGN--QRLGVSTPD 251
              GCG NN   F     G VGL  G +SLI+Q        M    AGN   ++   T  
Sbjct: 476 TIIGCGRNN-SWFRPSFEGFVGLNWGPLSLITQMGGEYPGLMSYCFAGNGTSKINFGTNA 534

Query: 252 IVIDSGTTLTFL------PQGYNSNLLSV------------------------------- 274
           IV   G   T +      P  Y  NL +V                               
Sbjct: 535 IVGGGGVVSTTMFVTTARPGFYYLNLDAVSVGDTRIETLGTPFHALEGNIVIDSGTTLTY 594

Query: 275 ------------MSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
                       +  ++ A P ADPTG+  LCY  N+    P +T+HF  GAD+ L + N
Sbjct: 595 FPESYCNLVRQAVEHVVPAVPAADPTGNDLLCYYSNTTEIFPVITMHFSGGADLVLDKYN 654

Query: 322 FFVK-VSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            F++  S  + C ++         I+GN  Q NFLVGYD     VSFKPT+C+
Sbjct: 655 MFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  185 bits (469), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 140/426 (32%), Positives = 206/426 (48%), Gaps = 72/426 (16%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F              S  + +  ++A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTT---------------------------IAGNQRLGVS------TP- 250
           G G +SL+SQ+  T                             G+  L +       TP 
Sbjct: 224 GRGPLSLVSQLGVTRPRRSCRARAAARGGGAPTTTSPLEGITVGDTLLPIDPAVFRLTPM 283

Query: 251 ---DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPE 305
               ++IDSGTT T L +     L   ++S +     +     L LC++  S    +VP 
Sbjct: 284 GDGGVIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPR 343

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           + +HF GAD++L R ++ V+     V  +       + + G++ Q N  + YD+E+  +S
Sbjct: 344 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILS 403

Query: 366 FKPTDC 371
           F+P  C
Sbjct: 404 FEPAKC 409


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  184 bits (466), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 107/222 (48%), Positives = 139/222 (62%), Gaps = 20/222 (9%)

Query: 8   VFILF-FLCFYVVSPI---EAQTG--GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           VF++F  L  Y  S I   EA  G  GFS++LIHRDSP SPFY+ S TP +R+ +A  RS
Sbjct: 5   VFMVFMLLALYSPSSISTREAGEGLRGFSIDLIHRDSPLSPFYDPSLTPSERITNAAFRS 64

Query: 62  ---LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              LNR++HF   +++  S      +IP N  YL+ + IGTPP ERLA+ADTGSDLIW Q
Sbjct: 65  SSRLNRVSHFLDENNLPESL-----LIPENGEYLMTLYIGTPPVERLAIADTGSDLIWVQ 119

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGV-NCQYSVSYGDG 175
           C PC    C+ QD+PLF+P  SST+K+  C S  C S+  +Q+ C  V  C YS SYGD 
Sbjct: 120 CSPC--QNCFPQDTPLFEPLKSSTFKAATCDSQPCTSVPPSQRQCGKVGQCIYSYSYGDK 177

Query: 176 SFSNGNLATETVTLGST-TGQAVALPGITFGCGTNNGGLFNS 216
           SF+ G + TET++ GST   Q V+ P   FGCG  N   F++
Sbjct: 178 SFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVYNNFTFHT 219



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 73/125 (58%), Gaps = 3/125 (2%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF 310
           +I+IDSGT LT+L Q + +N ++ +  ++  +   D     + C+ +  ++ +P +   F
Sbjct: 313 NIIIDSGTVLTYLEQTFYNNFVASLQEVLSVESAQDLPFPFKFCFPYRDMT-IPVIAFQF 371

Query: 311 RGADVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
            GA V L   N  +K+ + +++C +V     + + I+GN+ Q +F V YD+E + VSF P
Sbjct: 372 TGASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAP 431

Query: 369 TDCTK 373
           TDCTK
Sbjct: 432 TDCTK 436


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  180 bits (457), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 110/242 (45%), Positives = 146/242 (60%), Gaps = 14/242 (5%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           T L C   L  +  +  S   A     SVELIHRDSP SP YN   T   RL  A     
Sbjct: 5   TLLYCS--LLAITIFFTSTSSAHRKNLSVELIHRDSPHSPLYNPQHTVSDRLNAAF---- 58

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
             L   +++   S+    Q+ +I N   Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 59  --LRSISRSRRFSTKTDLQSGLISNGGEYFMSISIGTPPSKFLAIADTGSDLTWVQCKPC 116

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC--SGVNCQYSVSYGDGSFS 178
              QCY Q++PLFD K SSTYK+  C S  C +L  +++ C  S   C+Y  SYGD SF+
Sbjct: 117 --QQCYKQNTPLFDKKKSSTYKTESCDSITCNALSEHEEGCDESRNACKYRYSYGDESFT 174

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            G +ATET+++ S++G  V+ PG  FGCG NNGG F    +GI+GLGGG +SL+SQ+ ++
Sbjct: 175 KGEVATETISIDSSSGSPVSFPGTAFGCGYNNGGTFEETGSGIIGLGGGPLSLVSQLGSS 234

Query: 239 IA 240
           I 
Sbjct: 235 IG 236



 Score =  111 bits (278), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 70/151 (46%), Positives = 93/151 (61%), Gaps = 16/151 (10%)

Query: 225 GGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMS-SMIEAQP 283
           GGG  SL            R    T +I+IDSGTTLT L  G+  +  +V+  S+  A+ 
Sbjct: 305 GGGGYSL-----------NRKSKKTGNIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKR 353

Query: 284 VADPTGSLELCY-SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNS 341
           V+DP G L  C+ S +    +P +T+HF GADVKLS  N FVK+SEDIVC S+    T  
Sbjct: 354 VSDPQGILTHCFKSGDKEIGLPTITMHFTGADVKLSPINSFVKLSEDIVCLSMIP--TTE 411

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           V IYGN++Q +FLVGYD+E +TVSF+  DC+
Sbjct: 412 VAIYGNMVQMDFLVGYDLETKTVSFQRMDCS 442


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  179 bits (454), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/399 (30%), Positives = 183/399 (45%), Gaps = 68/399 (17%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+   C G      C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---------------- 241
           A+ G+  GCG  N GLF     G++GLG G +SL+ Q+     G                
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292

Query: 242 ----------------NQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVM 275
                            +RL +       T D    +V+D+GT +T LP+   + L    
Sbjct: 293 LASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTRLPREAYAALRGAF 352

Query: 276 SSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC 332
              + A P +     L+ CY  +  +  +VP V+ +F +GA + L   N  V+V   + C
Sbjct: 353 DGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPARNLLVEVGGAVFC 412

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 413 LAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 451


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 127/385 (32%), Positives = 177/385 (45%), Gaps = 65/385 (16%)

Query: 49  TPYQRLRDALTRSLNRLNHF-NQNSSISSSKASQADIIPNN-------ANYLIRISIGTP 100
           T ++ LR    RS  R  H  +        +++ A + P           YL+ ++ GTP
Sbjct: 38  THWELLRRMAQRSKARATHLLSAQDQSGRGRSASAPVNPGAYDDGFPFTEYLVHLAAGTP 97

Query: 101 PTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS 160
           P E     DTGSD+ WTQC+ CP S C+ Q  PLFDP  SS++ SLPCSS  C +     
Sbjct: 98  PQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSPACET--TPP 155

Query: 161 CSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGG 212
           C G N      C YS+SYGDGS S G +  E  T  S TG+  + A+PG+ FGCG  N G
Sbjct: 156 CGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFGCGHANRG 215

Query: 213 LFNSKTTGIVGLGGGDISLISQMR--------TTIAGNQRLGV----------------- 247
           +F S  TGI G G G +SL SQ++        TTI G++   V                 
Sbjct: 216 VFTSNETGIAGFGRGSLSLPSQLKVGNFSHCFTTITGSKTSAVLLGLPGVAPPSASPLGR 275

Query: 248 --------STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-- 297
                   STP    +SGT++T LP      +    ++ ++   V         C+S   
Sbjct: 276 RRGSYRCRSTPR-SSNSGTSITSLPPRTYRAVREEFAAQVKLPVVPGNATDPFTCFSAPL 334

Query: 298 -NSLSQVPEVTIHFRGADVKLSRSNFFVKVSED--------IVCSVFKGITNSVPIYGNI 348
                 VP + +HF GA ++L + N+  +V +D        I+C     I     I GNI
Sbjct: 335 RGPKPDVPTMALHFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAV--IEGGEIILGNI 392

Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
            Q N  V YD++   +SF P  C +
Sbjct: 393 QQQNMHVLYDLQNSKLSFVPAQCDQ 417


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/375 (32%), Positives = 183/375 (48%), Gaps = 80/375 (21%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           N L  ++ +S +    +  AD + + + YL+++ +GTPP E +A  DTGSD+IWTQC PC
Sbjct: 393 NFLVGYDSSSLLLQGASPYADTLYDYSIYLMKLQVGTPPFEIVAEIDTGSDIIWTQCMPC 452

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
           P   CY Q +P+FDP  SST++             ++ C+G +C Y + Y D ++S G L
Sbjct: 453 P--NCYSQFAPIFDPSKSSTFR-------------EQRCNGNSCHYEIIYADKTYSKGIL 497

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQM--- 235
           ATETVT+ ST+G+   +     GCG +N  L    F S ++GIVGL  G +SLISQM   
Sbjct: 498 ATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYSGFASSSSGIVGLNMGPLSLISQMDLP 557

Query: 236 ---------------------RTTIAGNQRL------------------GVSTPDIVI-- 254
                                   +AG+  +                   VS  D +I  
Sbjct: 558 YPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNLIAT 617

Query: 255 --------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL 300
                         DSGTTLT+ P  Y + +   +  ++ A  V D      LCY  +++
Sbjct: 618 LGTPFHAEDGNIFIDSGTTLTYFPMSYCNLVREAVEQVVTAVKVPDMGSDNLLCYYSDTI 677

Query: 301 SQVPEVTIHFR-GADVKLSRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGY 357
              P +T+HF  GAD+ L + N +++ ++  I C        S+P ++GN  Q NFLVGY
Sbjct: 678 DIFPVITMHFSGGADLVLDKYNMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGY 737

Query: 358 DIEQQTVSFKPTDCT 372
           D     +SF PT+C+
Sbjct: 738 DPSSNVISFSPTNCS 752



 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 131/423 (30%), Positives = 188/423 (44%), Gaps = 106/423 (25%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F+    CF   + + +  G F+++LI R S  S F         RL      
Sbjct: 18  LATTMIVLFLQIITCFLFTTTVSSPHG-FTIDLIQRRSNSSSF---------RL------ 61

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S N+L             +  AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC 
Sbjct: 62  SKNQLQ----------GASPYADTLFDYNIYLMKLQVGTPPFEIAAEIDTGSDLIWTQCM 111

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PCP   CY Q  P+FDP  SST+             N++ C G +C Y + Y D ++S G
Sbjct: 112 PCP--DCYSQFDPIFDPSKSSTF-------------NEQRCHGKSCHYEIIYEDNTYSKG 156

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMR 236
            LATETVT+ ST+G+   +   T GCG +N  L    F S ++GIVGL  G  SLISQM 
Sbjct: 157 ILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNSGFASSSSGIVGLNMGPRSLISQMD 216

Query: 237 TTI----------AGNQRLGVSTPDIVIDSGT---------------------------- 258
                         G  ++   T  IV   GT                            
Sbjct: 217 LPYPGLISYCFSGQGTSKINFGTNAIVAGDGTVAADMFIKKDNPFYYLNLDAVSVEDNRI 276

Query: 259 --------------------TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
                               T+T+ P  Y + +   +  ++ A  V DP+G+  LCY   
Sbjct: 277 ETLGTPFHAEDGNIVIDSGSTVTYFPVSYCNLVRKAVEQVVTAVRVPDPSGNDMLCYFSE 336

Query: 299 SLSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVC-SVFKGITNSVPIYGNIMQTNFLV 355
           ++   P +T+HF  GAD+ L + N +++  S  + C ++         I+GN  Q NFLV
Sbjct: 337 TIDIFPVITMHFSGGADLVLDKYNMYMESNSGGLFCLAIICNSPTQEAIFGNRAQNNFLV 396

Query: 356 GYD 358
           GYD
Sbjct: 397 GYD 399


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score =  178 bits (452), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 125/370 (33%), Positives = 177/370 (47%), Gaps = 80/370 (21%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GFS++LIHRDSP SPFYN S TP +R+ DA   S       N+N      K  ++ +IPN
Sbjct: 28  GFSIDLIHRDSPLSPFYNPSLTPSERITDAALSS-------NEN------KLPESILIPN 74

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  YL+R+ IGTPP ERL +ADTGSD IW QC                           P
Sbjct: 75  NGEYLMRLYIGTPPVERLVIADTGSDFIWVQCS--------------------------P 108

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG-QAVALPGITFGC 206
           C + QC  LN              Y + SF+   + TET++  ST G Q V+ P   FGC
Sbjct: 109 CQNCQCVYLN-------------IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGC 155

Query: 207 GTNNGGLFNS--KTTGIVGLGGGDISLISQMRTTIA--------GNQRL----GVSTPDI 252
           G NN   F S  K TG+VGL  G +SL+SQ+   I         G++ +    GV +  +
Sbjct: 156 GANNNLTFRSSDKATGLVGLVAGQLSLVSQLGAQIGYKFSYLKFGSEAIITTNGVVSTPL 215

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-----AQPVADPTGSLELCYSFNSLSQVPEVT 307
           +I     L FL    N  ++++   ++       + V D     + C+ +     VP + 
Sbjct: 216 IIKPSLPLYFL----NLEVVTIGQKVVPTETLGVESVQDLPFPFKFCFPYRDNMTVPAIA 271

Query: 308 IHFRGADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQT 363
             F GA V L   N  +K+ +   +  +V    +  + + I+G I Q +F V YD++ + 
Sbjct: 272 FQFTGASVALRPKNLLIKLQDRNMLXLAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKK 331

Query: 364 VSFKPTDCTK 373
           VS  PTDCTK
Sbjct: 332 VSVAPTDCTK 341


>gi|255566002|ref|XP_002523989.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536716|gb|EEF38357.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  176 bits (447), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 146/432 (33%), Positives = 199/432 (46%), Gaps = 96/432 (22%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISS 76
           ++A   GF+ ELI RDSP SPFYN+       L  A TRS N   H++      N    S
Sbjct: 30  VKADNFGFTAELIRRDSPNSPFYNA-------LEAAATRSTNASQHYDAQIGRFNLMSDS 82

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             ASQ+++  +  NYLI+IS+GTPP E LA+AD   DL W  C+ C   Q   +D   F 
Sbjct: 83  YYASQSELNFSKGNYLIKISVGTPPAEILALADITGDLTWLPCKTC---QDCTKDGFTFF 139

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY---SVSYGDGSFSN-GNLATETVTLGST 192
           P  SSTY S  C S QC   N   C    C Y    +     S +N G +A +T++  S+
Sbjct: 140 PSESSTYTSAACESYQCQITNGAVCQTKMCIYLCGPLPQQRSSCTNKGLVAMDTISFHSS 199

Query: 193 TGQAVALPGITFGCGT--NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG--NQRL--- 245
           +GQA++ P   F CGT  +N     +   GIVGLG G  S+ SQM+  I G  +Q L   
Sbjct: 200 SGQALSYPNTNFICGTFIDNWHYIGA---GIVGLGRGLFSMTSQMKHLINGTFSQCLVPY 256

Query: 246 -------------GVSTPDIVI--------DSGTTLTFL--------------------- 263
                        GV + + V+        +SG    FL                     
Sbjct: 257 SSKQSSKINFGLKGVVSGEGVVSTPIADDGESGAYFLFLEAMSVGGNRVANNFYSAPKSN 316

Query: 264 ------------PQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLS--QVPEVTI 308
                       P  +  N+ + +   I   P+  +    L LCY   S      P +T+
Sbjct: 317 IYIDWRTTFTSLPHDFYENVEAEVRKAINLTPINYNNERKLSLCYKSESDHDFDAPPITM 376

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIE 360
           HF  ADV+LS  N FV++  ++VC  F        K IT++V  YG+  Q NF+VGYD++
Sbjct: 377 HFTNADVQLSPLNTFVRMDWNVVCFAFLDGTFNATKRITHAV--YGSWQQMNFIVGYDLK 434

Query: 361 QQTVSFKPTDCT 372
             TVSFK  DCT
Sbjct: 435 SSTVSFKQADCT 446


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 102/243 (41%), Positives = 142/243 (58%), Gaps = 37/243 (15%)

Query: 3   TFLSCVFILFFLCFYVVSP-IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           T+   + ++  L F  + P IEA  GGF+ +LI R+S K                     
Sbjct: 2   TYPRKIHLISILLFVFIFPHIEAHNGGFTGKLIPRNSSK--------------------- 40

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               + FN+N+        Q+ +  N+ +YL+ +SIGTPP +  A ADTGSDLIW QC P
Sbjct: 41  ----DFFNRNTI-------QSPVSANHYDYLMELSIGTPPVKIYAQADTGSDLIWLQCIP 89

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSN 179
           C  + CY Q +P+FD + SST+ ++ C S  C+ L   SCS   +NC+Y+ SY DGS + 
Sbjct: 90  C--TNCYKQLNPMFDSQSSSTFSNIACGSESCSKLYSTSCSPDQINCKYNYSYVDGSETQ 147

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G LA ET+TL STTG+ VA  G+ FGCG NN G FN K  GI+GLG G +SL+SQ+ +++
Sbjct: 148 GVLAQETLTLTSTTGEPVAFKGVIFGCGHNNNGAFNDKEMGIIGLGRGPLSLVSQIGSSL 207

Query: 240 AGN 242
            GN
Sbjct: 208 GGN 210


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  176 bits (445), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 140/436 (32%), Positives = 200/436 (45%), Gaps = 111/436 (25%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F L  + +++++   +   GF+++LIHR S  S                 +R
Sbjct: 3   LATTMIAIF-LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SR 45

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
             N           +   +  AD + +   YL+++ IGTPP E  AV DTGS+ IWTQC 
Sbjct: 46  VFN-----------TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCL 94

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC    CY Q +P+FDP  SST+K + C +   +           C Y + YG  S++ G
Sbjct: 95  PC--VHCYNQTAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKG 141

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM----- 235
            L TETVT+ ST+GQ   +P    GCG NN G F     G+VGL  G  SLI+QM     
Sbjct: 142 TLVTETVTIHSTSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYP 200

Query: 236 -----------------------------RTTI------------------AGNQRLG-V 247
                                         TT+                   GN R+  V
Sbjct: 201 GLMSYCFAGKGTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETV 260

Query: 248 STP------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS 301
            TP      +IVIDSG+TLT+ P+ Y + +   +  ++ A  V  P   + LCY   ++ 
Sbjct: 261 GTPFHALKGNIVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTID 317

Query: 302 QVPEVTIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVG 356
             P +T+HF  GAD+ L + N +V  +   + C     I NS     I+GN  Q NFLVG
Sbjct: 318 IFPVITMHFSGGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVG 375

Query: 357 YDIEQQTVSFKPTDCT 372
           YD     VSFKPT+C+
Sbjct: 376 YDSSSLLVSFKPTNCS 391


>gi|222628951|gb|EEE61083.1| hypothetical protein OsJ_14969 [Oryza sativa Japonica Group]
          Length = 367

 Score =  174 bits (441), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 114/339 (33%), Positives = 177/339 (52%), Gaps = 34/339 (10%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
           +VGLG G +SL+SQ          L V    ++ID  +T+TFL       L++ +   I 
Sbjct: 216 VVGLGRGPLSLVSQ----------LSVRRYGMIIDIASTITFLEASLYDELVNDLEVEIR 265

Query: 281 AQPVADPTGSLELCY------SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSED-IVC- 332
                  +  L+LC+      +F+ +  VP V + F G  ++L ++  F +  E  ++C 
Sbjct: 266 LPRGTGSSLGLDLCFILPDGVAFDRV-YVPAVALAFDGRWLRLDKARLFAEDRESGMMCL 324

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            V +    SV I GN  Q N  V Y++ +  V+F  + C
Sbjct: 325 MVGRAEAGSVSILGNFQQQNMQVLYNLRRGRVTFVQSPC 363


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 137/426 (32%), Positives = 194/426 (45%), Gaps = 110/426 (25%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ 70
           L  + +++++   +   GF+++LIHR S  S                 +R  N       
Sbjct: 6   LQIITYFLITTTASSPQGFTIDLIHRRSNASS----------------SRVFN------- 42

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
               +   +  AD + +   YL+++ IGTPP E  AV DTGS+ IWTQC PC    CY Q
Sbjct: 43  ----TQLGSPYADTVFDTYEYLMKLQIGTPPFEIEAVLDTGSEHIWTQCLPC--VHCYNQ 96

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
            +P+FDP  SST+K + C +   +           C Y + YG  S++ G L TETVT+ 
Sbjct: 97  TAPIFDPSKSSTFKEIRCDTHDHS-----------CPYELVYGGKSYTKGTLVTETVTIH 145

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------- 235
           ST+GQ   +P    GCG NN G F     G+VGL  G  SLI+QM               
Sbjct: 146 STSGQPFVMPETIIGCGRNNSG-FKPGFAGVVGLDRGPKSLITQMGGEYPGLMSYCFAGK 204

Query: 236 -------------------RTTI------------------AGNQRLG-VSTP------D 251
                               TT+                   GN R+  V TP      +
Sbjct: 205 GTSKINFGANAIVAGDGVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFHALKGN 264

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 311
           IVIDSG+TLT+ P+ Y + +   +  ++ A  V  P   + LCY   ++   P +T+HF 
Sbjct: 265 IVIDSGSTLTYFPESYCNLVRKAVEQVVTA--VRFPRSDI-LCYYSKTIDIFPVITMHFS 321

Query: 312 -GADVKLSRSNFFVKVSE-DIVCSVFKGITNS---VPIYGNIMQTNFLVGYDIEQQTVSF 366
            GAD+ L + N +V  +   + C     I NS     I+GN  Q NFLVGYD     VSF
Sbjct: 322 GGADLVLDKYNMYVASNTGGVFCLAI--ICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSF 379

Query: 367 KPTDCT 372
           KPT+C+
Sbjct: 380 KPTNCS 385


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  172 bits (437), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 134/221 (60%), Gaps = 14/221 (6%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           P +  + GF V L H D  K+       T ++RLR  + R  NRL+  N    ++++ A+
Sbjct: 298 PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNA-MVLAAANAT 350

Query: 81  QAD-----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             D     ++  N  +L++++IG+PP    A+ DTGSDLIWTQC+PC   QC+ Q +P+F
Sbjct: 351 VGDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIF 408

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           DPK SS++  + CSS  C +L   +CS   C+Y  +YGD S + G LA ET T G +T  
Sbjct: 409 DPKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTED 468

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            +++PG+ FGCG +N G   S+  G+VGLG G +SL+SQ++
Sbjct: 469 QISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK 509



 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 78/126 (61%), Gaps = 9/126 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSF---NSLSQVPE 305
           ++IDSGTT+T++    NS   S+ +  I     PV D  TG L+LC++     +  +VP+
Sbjct: 591 VIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPK 647

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           +T HF+GAD++L   N+ +  S+  +  +  G +  + I+GN+ Q NF+V +D++++T+S
Sbjct: 648 LTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLS 707

Query: 366 FKPTDC 371
           F PT C
Sbjct: 708 FLPTQC 713


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 120/355 (33%), Positives = 172/355 (48%), Gaps = 82/355 (23%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC PC  + CY Q +P+FDP  SS
Sbjct: 52  ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C+G +C Y + Y D ++S G LATETVT+ ST+G+   +P 
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM------------------------RT 237
            T GCG +N   F    +G+VGL  G  SLI+QM                          
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215

Query: 238 TIAGNQRL-------------------GVSTPDIVIDS-GTTL---------------TF 262
            +AG+  +                    VS  D  +++ GTT                T+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
            P  Y + +   +   + A   ADPTG+  LCY  +++   P +T+HF  GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 322 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            +++ ++    C     I N+ P   I+GN  Q NFLVGYD     VSF PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  172 bits (435), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 88/220 (40%), Positives = 132/220 (60%), Gaps = 12/220 (5%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISS 76
           P +  + GF V L H D  K+       T ++RLR  + R  NRL+  N      ++ + 
Sbjct: 43  PNKLPSHGFRVRLKHVDHVKN------LTRFERLRRGVARGKNRLHRLNAMVLAAANATV 96

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
               +A ++  N  +L++++IG+PP    A+ DTGSDLIWTQC+PC   QC+ Q +P+FD
Sbjct: 97  GDQVKAPVVAGNGEFLMKLAIGSPPRSFSAIMDTGSDLIWTQCKPC--QQCFDQSTPIFD 154

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           PK SS++  + CSS  C +L   +CS   C+Y  +YGD S + G LA ET T G +T   
Sbjct: 155 PKQSSSFYKISCSSELCGALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQ 214

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +++PG+ FGCG +N G   S+  G+VGLG G +SL+SQ++
Sbjct: 215 ISIPGLGFGCGNDNNGDGFSQGAGLVGLGRGPLSLVSQLK 254



 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 79/126 (62%), Gaps = 9/126 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADP-TGSLELCYSFNSLS---QVPE 305
           ++IDSGTT+T++    NS   S+ +  I     PV D  TG L+LC++  + +   +VP+
Sbjct: 336 VIIDSGTTITYVE---NSAFTSLKNEFIAQMNLPVDDSGTGGLDLCFNLPAGTNQVEVPK 392

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           +T HF+GAD++L   N+ +  S+  +  +  G +  + I+GN+ Q NF+V +D++++T+S
Sbjct: 393 LTFHFKGADLELPGENYMIGDSKAGLLCLAIGSSRGMSIFGNLQQQNFMVVHDLQEETLS 452

Query: 366 FKPTDC 371
           F PT C
Sbjct: 453 FLPTQC 458


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  170 bits (431), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 123/406 (30%), Positives = 185/406 (45%), Gaps = 73/406 (17%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  V+L   DS K+       T Y+ ++ A+ R   R+   N  + + SS   +  +   
Sbjct: 41  GLRVDLEQVDSGKN------LTKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAG 92

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  YL+ ++IGTP +   A+ DTGSDLIWTQCEPC  +QC+ Q +P+F+P+ SS++ +LP
Sbjct: 93  DGEYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLP 150

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-----------------G 190
           C S  C  L  ++C+   CQY+  YGDGS + G +ATET T                  G
Sbjct: 151 CESQYCQDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTFETSSVPNIAFGCGEDNQG 210

Query: 191 STTGQAVALPGITFG---------------CGTNNGG------LFNSKTTGIVGLGGGDI 229
              G    L G+ +G               C T+ G          S  +G V  G    
Sbjct: 211 FGQGNGAGLIGMGWGPLSLPSQLGVGQFSYCMTSYGSSSPSTLALGSAASG-VPEGSPST 269

Query: 230 SLISQ----------MRTTIAGNQRLGV----------STPDIVIDSGTTLTFLPQGYNS 269
           +LI            ++    G   LG+           T  ++IDSGTTLT+LPQ   +
Sbjct: 270 TLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYLPQDAYN 329

Query: 270 NLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPEVTIHFRGADVKLSRSNFFVKV 326
            +    +  I    V + +  L  C+   S  S  QVPE+++ F G  + L   N  +  
Sbjct: 330 AVAQAFTDQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVLNLGEQNILISP 389

Query: 327 SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +E ++C      +   + I+GNI Q    V YD++   VSF PT C
Sbjct: 390 AEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  170 bits (431), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 122/357 (34%), Positives = 173/357 (48%), Gaps = 81/357 (22%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + + + YL+R+ +GTPP E +A  DTGSDLIWTQC PCP   CY Q +P+FDP  SS
Sbjct: 52  ADTVFDYSIYLMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCP--NCYTQFAPIFDPSKSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C G +C Y + Y D S+S G LATETVT+ ST+G+   +  
Sbjct: 110 TFK-------------EKRCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAE 156

Query: 202 ITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQMRTTI----------AGNQRLGV 247
            + GCG NN  L    + + ++GIVGL  G  SLISQM   I           G  ++  
Sbjct: 157 TSIGCGLNNSNLMTPGYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCFSSQGTSKINF 216

Query: 248 STPDIVIDSGT------------------------------------------------T 259
            T  +V   GT                                                T
Sbjct: 217 GTNAVVAGDGTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFHAQDGNIFIDSGTT 276

Query: 260 LTFLPQGY-NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKL 317
            T+LP  Y N    +V +S++ A  V DP+    LCY+++++   P +T+HF  GAD+ L
Sbjct: 277 YTYLPTSYCNLVREAVAASVVAANQVPDPSSENLLCYNWDTMEIFPVITLHFAGGADLVL 336

Query: 318 SRSNFFVK-VSEDIVCSVFKGITNSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            + N +V+ ++    C     +  S+P I+GN    N LVGYD     +SF PT+C+
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score =  169 bits (429), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/355 (33%), Positives = 171/355 (48%), Gaps = 82/355 (23%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           AD + +   YL+++ +GTPP E  A  DTGSDLIWTQC PC  + CY Q +P+FDP  SS
Sbjct: 52  ADTLFDYNIYLMKLQVGTPPFEIEAEIDTGSDLIWTQCMPC--TNCYSQYAPIFDPSNSS 109

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           T+K             +K C+G +C Y + Y D ++S G LATETVT+ ST+G+   +P 
Sbjct: 110 TFK-------------EKRCNGNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPE 156

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM------------------------RT 237
            T GCG +N   F    +G+VGL  G  SLI+QM                          
Sbjct: 157 TTIGCG-HNSSWFKPTFSGMVGLSWGPSSLITQMGGEYPGLMSYCFASQGTSKINFGTNA 215

Query: 238 TIAGNQRL-------------------GVSTPDIVIDS-GTTL---------------TF 262
            +AG+  +                    VS  D  +++ GTT                T+
Sbjct: 216 IVAGDGVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFHALEGNIIIDSGTTLTY 275

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSN 321
            P  Y + +   +   + A   ADPTG+  LCY  +++   P +T+HF  GAD+ L + N
Sbjct: 276 FPVSYCNLVREAVDHYVTAVRTADPTGNDMLCYYTDTIDIFPVITMHFSGGADLVLDKYN 335

Query: 322 FFVK-VSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            +++ ++    C     I N+ P   I+GN  Q NFLVGYD     V F PT+C+
Sbjct: 336 MYIETITRGTFCLAI--ICNNPPQDAIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/348 (33%), Positives = 160/348 (45%), Gaps = 70/348 (20%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +  NY++ + +GTP ++   V DTGSD  W QC PC   +CY Q  PLFDP  SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKEPLFDPAKSSTYANV 217

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+ S CA L+   C+G +C Y+V YGDGS++ G  A +T+T+        A+ G  FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
           G  N GLF  KT G++GLG G  SL  Q      G                         
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331

Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
           N RL                 G+              ST   ++DSGT +T LP    + 
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391

Query: 271 LLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
           L S    ++ A+      G   L+ CY F  LS V  P V++ F+ GA + +  S     
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451

Query: 326 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +SE  VC  F   G   SV I GN  Q  + V YD+ ++TV F P  C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/348 (33%), Positives = 160/348 (45%), Gaps = 70/348 (20%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +  NY++ + +GTP ++   V DTGSD  W QC PC   +CY Q  PLFDP  SSTY ++
Sbjct: 159 STGNYVVTVGLGTPASKYTVVFDTGSDTTWVQCRPCV-VKCYKQKGPLFDPAKSSTYANV 217

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+ S CA L+   C+G +C Y+V YGDGS++ G  A +T+T+        A+ G  FGC
Sbjct: 218 SCTDSACADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIAHD-----AIKGFRFGC 272

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
           G  N GLF  KT G++GLG G  SL  Q      G                         
Sbjct: 273 GEKNNGLFG-KTAGLMGLGRGKTSLTVQAYNKYGGAFAYCLPALTTGTGYLDFGPGSAGN 331

Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
           N RL                 G+              ST   ++DSGT +T LP    + 
Sbjct: 332 NARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTAGTLVDSGTVITRLPATAYTA 391

Query: 271 LLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
           L S    ++ A+      G   L+ CY F  LS V  P V++ F+ GA + +  S     
Sbjct: 392 LSSAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGGACLDVDVSGIVYA 451

Query: 326 VSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +SE  VC  F   G   SV I GN  Q  + V YD+ ++TV F P  C
Sbjct: 452 ISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGSC 499


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 133/409 (32%), Positives = 192/409 (46%), Gaps = 73/409 (17%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A  GGFSVE IHRDSP+SPF++ + T + R   A  RS+ R      ++S S+S    AD
Sbjct: 29  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
                ++  +  YL+ +++G+PP   LA+ADTGSDL+W +C+           P +Q   
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  SSTY  + C +  C +L + +C  G NC Y  +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLF---------NSKTTGIVGLGG--------- 226
                   + + V + G+ FGC T   G F             + +  LGG         
Sbjct: 201 FDDGGAGRSPRQVRIGGVKFGCSTATAGSFPADGLVGLGGGAVSLVTQLGGATSLGRRFS 260

Query: 227 -------------------GDISLISQMRTTIAGNQRLG-VSTPDIVIDSGTTLTFLPQG 266
                               D++      T + GN+ +   ++  I++DSGTTLTFL   
Sbjct: 261 YCLVPHSVNASSALNFGALADVTEPGAASTPLVGNKTVASAASSRIIVDSGTTLTFLDPS 320

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEVTIHF-RGADVKLSRS 320
               ++  +S  I   PV  P G L+LCY+       +   +P++T+ F  GA V L   
Sbjct: 321 LLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDLTLEFGGGAAVALKPE 380

Query: 321 NFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFK 367
           N FV V E  +C      T   P  I GN+ Q N  VGYD++  TV  K
Sbjct: 381 NAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGTVGNK 429



 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 66/128 (51%), Gaps = 8/128 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEV 306
           I++DSGTTLTFL       ++  +S  I   PV  P G L+LCY+       +   +P++
Sbjct: 439 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 498

Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQT 363
           T+ F  GA V L   N FV V E  +C      T   P  I GN+ Q N  VGYD++  T
Sbjct: 499 TLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGT 558

Query: 364 VSFKPTDC 371
           V+F   DC
Sbjct: 559 VTFAVADC 566


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  169 bits (427), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 131/434 (30%), Positives = 197/434 (45%), Gaps = 106/434 (24%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +AT +  +F+   LCF + +   +   GF+++LIHR                        
Sbjct: 3   LATTIIVLFLQISLCF-LFTTTASPPHGFTMDLIHR------------------------ 37

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
              R N  ++ S+  S  +  A+ + +N+ YL+++ +GTPP E  A+ DTGS++ WTQC 
Sbjct: 38  ---RSNASSRVSNTQSGSSPYANTVFDNSVYLMKLQVGTPPFEIQAIIDTGSEITWTQCL 94

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC    CY Q++P+FDP  SST+K             +K C G +C Y V Y D +++ G
Sbjct: 95  PC--VHCYEQNAPIFDPSKSSTFK-------------EKRCDGHSCPYEVDYFDHTYTMG 139

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI- 239
            LATET+TL ST+G+   +P    GCG NN   F    +G+VGL  G  SLI+QM     
Sbjct: 140 TLATETITLHSTSGEPFVMPETIIGCGHNN-SWFKPSFSGMVGLNWGPSSLITQMGGEYP 198

Query: 240 ---------AGNQRLGVSTPDIVIDSGTTLTFL------PQGYNSNL--LSVMSSMIE-- 280
                     G  ++      IV   G   T +      P  Y  NL  +SV ++ IE  
Sbjct: 199 GLMSYCFSGQGTSKINFGANAIVAGDGVVSTTMFMTTAKPGFYYLNLDAVSVGNTRIETM 258

Query: 281 ---------------------------------------AQPVADPTGSLELCYSFNSLS 301
                                                  A   ADPTG+  LCY+ +++ 
Sbjct: 259 GTTFHALEGNIVIDSGTTLTYFPVSYCNLVRQAVEHVVTAVRAADPTGNDMLCYNSDTID 318

Query: 302 QVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
             P +T+HF G  D+ L + N +++ +   + C ++         I+GN  Q NFLVGYD
Sbjct: 319 IFPVITMHFSGGVDLVLDKYNMYMESNNGGVFCLAIICNSPTQEAIFGNRAQNNFLVGYD 378

Query: 359 IEQQTVSFKPTDCT 372
                VSF PT+C+
Sbjct: 379 SSSLLVSFSPTNCS 392


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score =  168 bits (425), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 161/352 (45%), Gaps = 80/352 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           NN +YL+++++GTPP +   + DT SDL+W QC PC    CY Q +P+FDP         
Sbjct: 27  NNGDYLMKLTLGTPPVDVYGLVDTDSDLVWAQCTPC--QGCYKQKNPMFDPL-------- 76

Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
                +C S    SCS    C Y  +Y D S + G LA E  T  ST G+ + +  I FG
Sbjct: 77  ----KECNSFFDHSCSPEKACDYVYAYADDSATKGMLAKEIATFSSTDGKPI-VESIIFG 131

Query: 206 CGTNNGGLFNS---------------------------------------KTTGIVGLG- 225
           CG NN G+FN                                         T+G + LG 
Sbjct: 132 CGHNNTGVFNENDMGLIGLGGGPLSLVSQMGNLYGSKRFSQCLVPFHADPHTSGTISLGE 191

Query: 226 GGDIS--------LISQMRTT---------------IAGNQRLGVSTPDIVIDSGTTLTF 262
             D+S        L+S+   T               +  N    +S  +I+IDSGT  T+
Sbjct: 192 ASDVSGEGVVTTPLVSEEGQTPYLVTLEGISVGDTFVPFNSSEMLSKGNIMIDSGTPETY 251

Query: 263 LPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 321
           LPQ +   L+  +   I   P+  DP    +LCY   +  + P +T HF GADVKL    
Sbjct: 252 LPQEFYDRLVEELKVQINLPPIHVDPDLGTQLCYKSETNLEGPILTAHFEGADVKLLPLQ 311

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
            F+   + + C    G T+ + I+GN  Q+N L+G+D++++ V FKPTD TK
Sbjct: 312 TFIPPKDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFFKPTDFTK 363


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  167 bits (424), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 122/412 (29%), Positives = 182/412 (44%), Gaps = 81/412 (19%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+   C G      C YSV+YGDGS++ G LA ET+TLG T     
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGT----- 233

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLG---------------------------GGDIS 230
           A+ G+  GCG  N GLF     G++GLG                           GG  S
Sbjct: 234 AVQGVAIGCGHRNSGLF-VGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGS 292

Query: 231 LISQMRTTI------------------AGNQRLGVS------TPD----IVIDSGTTLTF 262
           L+      +                   G +RL +       T D    +V+D+GT +T 
Sbjct: 293 LVLGRTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAVTR 352

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSR 319
           LP+   + L       + A P +     L+ CY  +  +  +VP V+ +F +GA + L  
Sbjct: 353 LPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLTLPA 412

Query: 320 SNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            N  V+V   + C  F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 413 RNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 464


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  167 bits (422), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 90/216 (41%), Positives = 128/216 (59%), Gaps = 15/216 (6%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           EA+  GF + L H DS K+       T +Q L  A+ R   RL      + ++     + 
Sbjct: 35  EAKVTGFQIMLEHVDSGKN------LTKFQLLERAIERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  SVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L+  +CS   CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           TFGCG NN G       G+VG+G G +SL SQ+  T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235



 Score = 71.2 bits (173), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 42/126 (33%), Positives = 60/126 (47%), Gaps = 3/126 (2%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
           T  I+IDSGTTLT+       ++     S I    V   +   +LC+   S  S  Q+P 
Sbjct: 309 TGGIIIDSGTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPSDPSNLQIPT 368

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
             +HF G D++L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VS
Sbjct: 369 FVMHFDGGDLELPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVS 428

Query: 366 FKPTDC 371
           F    C
Sbjct: 429 FASAQC 434


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  166 bits (419), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 125/419 (29%), Positives = 183/419 (43%), Gaps = 84/419 (20%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-------NSSISSSKASQADII 85
           ++HR  P SP       P     + L R  +R++  ++       +++   S AS+   +
Sbjct: 68  VVHRHGPCSPLQARGGEPSHA--EILDRDQDRVDSIHRLAAARPSSTADDPSSASKGVSL 125

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P         ANY++ + +GTP  + L V DTGSDL W QC+PC    CY Q  PLFDP 
Sbjct: 126 PARRGVPLGTANYIVSVGLGTPKRDLLVVFDTGSDLSWVQCKPC--DGCYQQHDPLFDPS 183

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+TY ++PC + +C  L+  SCS   C+Y V YGD S ++GNLA +T+TLG ++  + +
Sbjct: 184 QSTTYSAVPCGAQECRRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSS 243

Query: 199 --LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR--------------TTIAGN 242
             L    FGCG ++ GLF  K  G+ GLG   +SL SQ                +T  G 
Sbjct: 244 DQLQEFVFGCGDDDTGLFG-KADGLFGLGRDRVSLASQAAAKYGAGFSYCLPSSSTAEGY 302

Query: 243 QRLGVSTP------------------------------------------DIVIDSGTTL 260
             LG + P                                            VIDSGT +
Sbjct: 303 LSLGSAAPPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTPGTVIDSGTVI 362

Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADV 315
           T LP    + L S  + ++   +   A     L+ CY F   +  Q+P V + F  GA +
Sbjct: 363 TRLPSRAYAALRSSFAGLMRRYSYKRAPALSILDTCYDFTGRNKVQIPSVALLFDGGATL 422

Query: 316 KLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            L         ++   C  F   G   S+ I GN+ Q  F V YD+  Q + F    C+
Sbjct: 423 NLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGCS 481


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/216 (40%), Positives = 127/216 (58%), Gaps = 15/216 (6%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           E +  GF + L H DS K+       T ++ L  A+ R   RL      + ++     + 
Sbjct: 35  EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L   +CS  +CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           TFGCG NN G       G+VG+G G +SL SQ+  T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235



 Score = 69.3 bits (168), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 3/126 (2%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
           T  I+IDSGTTLT+        +     S +    V   +   +LC+   S  S  Q+P 
Sbjct: 309 TGGIIIDSGTTLTYFADNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPT 368

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
             +HF G D+ L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VS
Sbjct: 369 FVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428

Query: 366 FKPTDC 371
           F    C
Sbjct: 429 FLFAQC 434


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  165 bits (417), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 88/216 (40%), Positives = 127/216 (58%), Gaps = 15/216 (6%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           E +  GF + L H DS K+       T ++ L  A+ R   RL      + ++     + 
Sbjct: 35  EPKVAGFQIMLEHVDSGKN------LTKFELLERAVERGSRRLQRLE--AMLNGPSGVET 86

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  YL+ +SIGTP     A+ DTGSDLIWTQC+PC  +QC+ Q +P+F+P+ SS+
Sbjct: 87  PVYAGDGEYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPC--TQCFNQSTPIFNPQGSSS 144

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           + +LPCSS  C +L   +CS  +CQY+  YGDGS + G++ TET+T GS     V++P I
Sbjct: 145 FSTLPCSSQLCQALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTFGS-----VSIPNI 199

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           TFGCG NN G       G+VG+G G +SL SQ+  T
Sbjct: 200 TFGCGENNQGFGQGNGAGLVGMGRGPLSLPSQLDVT 235



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 41/126 (32%), Positives = 58/126 (46%), Gaps = 3/126 (2%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVPE 305
           T  I+IDSGTTLT+        +     S +    V   +   +LC+   S  S  Q+P 
Sbjct: 309 TGGIIIDSGTTLTYFVDNAYQAVRQAFISQMNLSVVNGSSSGFDLCFQMPSDQSNLQIPT 368

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
             +HF G D+ L   N+F+  S  ++C      +  + I+GNI Q N LV YD     VS
Sbjct: 369 FVMHFDGGDLVLPSENYFISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVS 428

Query: 366 FKPTDC 371
           F    C
Sbjct: 429 FLSAQC 434


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/427 (30%), Positives = 203/427 (47%), Gaps = 91/427 (21%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS-SSKASQADIIP 86
           GF + L H DS K+       T  Q+++  + R  +RLN     + ++ +SK    + I 
Sbjct: 44  GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAVASKPDDTNNIK 97

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  +L+ +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ SS
Sbjct: 98  APTHGGSGEFLMELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSS 155

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-------GST 192
           +Y  + CSS  C +L + +C+     C+Y  +YGD S + G LATET T        G  
Sbjct: 156 SYSKVGCSSGLCNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIG 215

Query: 193 TGQAVALPGITFGCGTNNGGL---------------------------------FNSKTT 219
            G  V   G  F  G+   GL                                   S  +
Sbjct: 216 FGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLAS 275

Query: 220 GIVGLGGGDI--------SLIS----------QMRTTIAGNQRLGVS----------TPD 251
           GIV   G  +        SL+           +++    G +RL V           T  
Sbjct: 276 GIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGG 335

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEV 306
           ++IDSGTT+T+L +     L    +S + + PV D +GS  L+LC+     ++   VP++
Sbjct: 336 MIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKM 393

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
             HF+GAD++L   N+ V  S   V  +  G +N + I+GN+ Q NF V +D+E++TVSF
Sbjct: 394 IFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSF 453

Query: 367 KPTDCTK 373
            PT+C K
Sbjct: 454 VPTECGK 460


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  163 bits (413), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 133/425 (31%), Positives = 189/425 (44%), Gaps = 84/425 (19%)

Query: 22  IEAQTGGFSVELIHRDSPKSPF-YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           +++ TG  +V L HR  P SP       T  +RL     R+      F+      S   +
Sbjct: 51  VKSSTGAATVPLHHRHGPCSPLPTKKMPTLEERLHRDQLRAAYIQRKFSGGGVNGSRGGA 110

Query: 81  QADIIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             D+  ++A              YLI + +G+P   +  + DTGSD+ W QC+PC  SQC
Sbjct: 111 -GDVQQSHATVPTTLGTSLDTLEYLITVRLGSPGKSQTMLIDTGSDVSWVQCKPC--SQC 167

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATE 185
           + Q  PLFDP  SSTY    CSS+ CA L Q+   CS   CQY+V+YGDGS + G  +++
Sbjct: 168 HSQADPLFDPSSSSTYSPFSCSSAACAQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSD 227

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----- 240
           T+ LGS      A+    FGC     G FN +T G++GLGGG  SL+SQ   T       
Sbjct: 228 TLALGSN-----AVRKFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTFGAAFSY 281

Query: 241 ---------------------------------------------GNQRLGVST----PD 251
                                                        G ++L + T      
Sbjct: 282 CLPATSSSSGFLTLGAGTSGFVKTPMLRSSQVPTFYGVRIQAIRVGGRQLSIPTSVFSAG 341

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
            ++DSGT LT LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + 
Sbjct: 342 TIMDSGTVLTRLPPTAYSALSSAFKAGMKQYPSAPPSGILDTCFDFSGQSSVSIPTVALV 401

Query: 310 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
           F  GA V ++     ++ S  I+C  F   ++  S+ I GN+ Q  F V YD+    V F
Sbjct: 402 FSGGAVVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 461

Query: 367 KPTDC 371
           K   C
Sbjct: 462 KAGAC 466


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 111/359 (30%), Positives = 162/359 (45%), Gaps = 78/359 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+R+++GTP        DTGSDL+WTQC PC    C+ QD P+ DP  SSTY +LPC 
Sbjct: 83  EYLVRLAVGTPRRPVALTLDTGSDLVWTQCAPC--RDCFDQDLPVLDPAASSTYAALPCG 140

Query: 150 SSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALP 200
           +++C +L   SC GV        C Y+  YGD S + G +AT+  T G +  +G+++   
Sbjct: 141 AARCRALPFTSC-GVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTR 199

Query: 201 GITFGCGTNNGGLFNSKTTGIVG---------------------------------LGGG 227
            +TFGCG  N G+F S  TGI G                                 LGG 
Sbjct: 200 RLTFGCGHLNKGVFQSNETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFESKSSLVTLGGS 259

Query: 228 DISLIS-----QMRTT-------------------IAGNQRLGVSTPDI---VIDSGTTL 260
             +L S     ++RTT                     G  RL V        +IDSG ++
Sbjct: 260 PAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRSTIIDSGASI 319

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFRGADV 315
           T LP+     + +  ++ +   P      +L+LC++    +      VP +T+H  GAD 
Sbjct: 320 TTLPEEVYEAVKAEFAAQVGLPPSGVEGSALDLCFALPVTALWRRPAVPSLTLHLEGADW 379

Query: 316 KLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           +L RSN+ F  +   ++C V         + GN  Q N  V YD+E   +SF P  C +
Sbjct: 380 ELPRSNYVFEDLGARVMCIVLDAAPGEQTVIGNFQQQNTHVVYDLENDRLSFAPARCDR 438


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 132/429 (30%), Positives = 205/429 (47%), Gaps = 95/429 (22%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF + L H DS K+       T  Q+++  + R  +RLN     + ++   AS  D   N
Sbjct: 45  GFRLSLRHVDSGKNL------TKIQKIQRGINRGFHRLNRLGAVAVLAV--ASNPDDTNN 96

Query: 88  --------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
                   +  +L+ +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ 
Sbjct: 97  IKAPTHGGSGEFLMELSIGNPAVKYAAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEK 154

Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTL-------G 190
           SS+Y  + CSS  C +L + +C+    +C+Y  +YGD S + G LATET T        G
Sbjct: 155 SSSYSKVGCSSGLCNALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTFEDENSISG 214

Query: 191 STTGQAVALPGITFGCGTNNGGL---------------------------------FNSK 217
              G  V   G  F  G+   GL                                   S 
Sbjct: 215 IGFGCGVENEGDGFSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSL 274

Query: 218 TTGIVGLGGGDI--------SLIS----------QMRTTIAGNQRLGVS----------T 249
            +GIV   G ++        SL+           +++    G +RL V           T
Sbjct: 275 ASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGT 334

Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VP 304
             ++IDSGTT+T+L +     L    +S + + PV D +GS  L+LC+   + ++   VP
Sbjct: 335 GGMIIDSGTTITYLEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPNAAKNIAVP 392

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           ++  HF+GAD++L   N+ V  S   V  +  G +N + I+GN+ Q NF V +D+E++TV
Sbjct: 393 KLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETV 452

Query: 365 SFKPTDCTK 373
           +F PT+C K
Sbjct: 453 TFVPTECGK 461


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/342 (32%), Positives = 156/342 (45%), Gaps = 65/342 (19%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           ANY+I +  GTP   +  + DTGS++ W QC+PC  S CY Q  PLFDP +SSTY+++ C
Sbjct: 14  ANYVITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVS-CYPQQEPLFDPTLSSTYRNISC 72

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           +S+ C  L+ + CSG  C Y V+YGDGS + G LATET TL +            FGCG 
Sbjct: 73  TSAACTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTLAAGN----VFNNFIFGCGQ 128

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIA---------------------------- 240
           NN GLF +   G++GLG    SL SQ+ T++                             
Sbjct: 129 NNQGLF-TGAAGLIGLGRSPYSLNSQLATSLGNIFSYCLPSTSSATGYLNIGNPLRTPGY 187

Query: 241 ----------------------GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNLLS 273
                                 G  RL +S+        +IDSGT +T LP      L +
Sbjct: 188 TAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSVGTIIDSGTVITRLPPTAYGALRT 247

Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIV 331
              + +     A     L+ CY F+  + V  P + +H+ G DV +  +  F  +S   V
Sbjct: 248 AFRAAMTQYTRAAAASILDTCYDFSRTTTVTFPTIKLHYTGLDVTIPGAGVFYVISSSQV 307

Query: 332 CSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           C  F G ++S  + I GN+ Q    V YD   + + F    C
Sbjct: 308 CLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  161 bits (407), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 114/348 (32%), Positives = 155/348 (44%), Gaps = 69/348 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+ + CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
           G  N GLF  +  G++GLG G  SL  Q               R+T  G    G  +P  
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAA 348

Query: 251 ----------------------------------------DIVIDSGTTLTFLPQGYNSN 270
                                                     ++DSGT +T LP    S+
Sbjct: 349 RLTTTPMLVDNGPTFYYVGLTGIRVGGRLLYIPQSVFATAGTIVDSGTVITRLPPAAYSS 408

Query: 271 LLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVK 325
           L S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     
Sbjct: 409 LRSAFAAAMSARGYKKAPAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLDVDASGIMYA 468

Query: 326 VSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            S   VC  F    +   V I GN     F V YDI ++ VSF P  C
Sbjct: 469 ASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  160 bits (406), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 126/429 (29%), Positives = 186/429 (43%), Gaps = 89/429 (20%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
           I +   G +V L HR  P SP  +S + P +   + L R   R  H              
Sbjct: 45  ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102

Query: 70  ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
              Q S +SSS  ++     +   Y+I + +GTP   +    DTGSD+ W QC PCP   
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
           CY Q   LFDP  SSTY+++ C++++CA L Q+   C   N  CQY V YGDGS +NG  
Sbjct: 163 CYAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
           + +T+TL   +G + A+ G  FGC     G F+ +T G++GLGGG  SL+SQ        
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHVESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278

Query: 235 --------------------------------------------MRTTIAGNQRLGVSTP 250
                                                       ++    G ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGVSGFVTTRMLRSRQIPTFYGARLQDIAVGGKQLGLS-P 337

Query: 251 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 303
            +     V+DSGT +T LP    S L S   + ++    A     L+ C+ F   +Q  +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 304 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           P V + F  GA + L  +        + +     G   +  I GN+ Q  F V YD+   
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454

Query: 363 TVSFKPTDC 371
           T+ F+   C
Sbjct: 455 TLGFRSGAC 463


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  160 bits (405), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 97/225 (43%), Positives = 132/225 (58%), Gaps = 9/225 (4%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           GF+  L  RDSP SP +N S + Y  L DA  RS +R      + +  S+   ++ IIP+
Sbjct: 27  GFTTSLFRRDSPLSPLHNPSLSRYDSLIDAFRRSFSRSATLLTHLTSVSTACIRSPIIPD 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  +L+ I IGTPP   +A+ADTGSDL WTQC PC   +C+ Q  P+F+P+ SS+Y+ + 
Sbjct: 87  SGEFLMSIFIGTPPVNVIAIADTGSDLTWTQCLPC--RECFNQSQPIFNPRRSSSYRKVS 144

Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+S  C SL    C     +C Y  SYGD SF+ G+LA++ +T+GS       LP    G
Sbjct: 145 CASDTCRSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITIGS-----FKLPKTVIG 199

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTP 250
           CG  NGG F   T+GI+GLGGG +SL+SQMRT      R     P
Sbjct: 200 CGHQNGGTFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLP 244



 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/124 (44%), Positives = 78/124 (62%), Gaps = 4/124 (3%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTI 308
           +I+IDSGTTLT LP+     + S ++ +I+A+ V DP+G LELCYS   +    +P +T 
Sbjct: 311 NIIIDSGTTLTLLPRSLYYGVFSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITA 370

Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF  GADVKL   N F  V++++ C  F   T  V I+GN+ Q NF VGYD+  + +SF+
Sbjct: 371 HFAGGADVKLLPVNTFAPVADNVTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFE 429

Query: 368 PTDC 371
           P  C
Sbjct: 430 PKLC 433


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score =  159 bits (402), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 128/399 (32%), Positives = 173/399 (43%), Gaps = 80/399 (20%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+I RD  +       E+ Y +L      S N  N  ++  + S+   +++ I   + NY
Sbjct: 87  EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ I IGTP  +   V DTGSDL WTQCEPC  S CY Q  P F+P  SSTY+++ CSS 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
            C   + +SCS  NC YS+ YGD SF+ G LA E  TL ++      L  + FGCG NN 
Sbjct: 192 MCE--DAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245

Query: 212 GLFN-----------------------------------SKTTGIVGLGGGDISL----- 231
           GLF+                                   S +TG +  G   IS      
Sbjct: 246 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 305

Query: 232 -ISQMRTTI----------AGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
            IS   +             G++ L +     ST   +IDSGT  T LP    + L SV 
Sbjct: 306 PISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVF 365

Query: 276 SSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVSEDIVC 332
              + +       G  + CY F  L  V   TI F    G  V+L  S   + +    VC
Sbjct: 366 KEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVELDGSGISLPIKISQVC 425

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F G  +   I+GN+ QT   V YD+    V F P  C
Sbjct: 426 LAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 125/429 (29%), Positives = 186/429 (43%), Gaps = 89/429 (20%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN------------ 69
           I +   G +V L HR  P SP  +S + P +   + L R   R  H              
Sbjct: 45  ISSSLSGTTVALNHRHGPCSPVPSSKKRPTEE--ELLKRDQLRAEHIQRKFAMNAAVDGA 102

Query: 70  ---QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
              Q S +SSS  ++     +   Y+I + +GTP   +    DTGSD+ W QC PCP   
Sbjct: 103 GDLQQSKVSSSVPTKLGSSLDTLEYVISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPP 162

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVN--CQYSVSYGDGSFSNGNL 182
           C+ Q   LFDP  SSTY+++ C++++CA L Q+   C   N  CQY V YGDGS +NG  
Sbjct: 163 CHAQTGALFDPAKSSTYRAVSCAAAECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTY 222

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
           + +T+TL   +G + A+ G  FGC     G F+ +T G++GLGGG  SL+SQ        
Sbjct: 223 SRDTLTL---SGASDAVKGFQFGCSHLESG-FSDQTDGLMGLGGGAQSLVSQTAAAYGNS 278

Query: 235 --------------------------------------------MRTTIAGNQRLGVSTP 250
                                                       ++    G ++LG+S P
Sbjct: 279 FSYCLPPTSGSSGFLTLGGGGGASGFVTTRMLRSKQIPTFYGARLQDIAVGGKQLGLS-P 337

Query: 251 DI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--V 303
            +     V+DSGT +T LP    S L S   + ++    A     L+ C+ F   +Q  +
Sbjct: 338 SVFAAGSVVDSGTIITRLPPTAYSALSSAFKAGMKQYRSAPARSILDTCFDFAGQTQISI 397

Query: 304 PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           P V + F  GA + L  +        + +     G   +  I GN+ Q  F V YD+   
Sbjct: 398 PTVALVFSGGAAIDLDPNGIMYG---NCLAFAATGDDGTTGIIGNVQQRTFEVLYDVGSS 454

Query: 363 TVSFKPTDC 371
           T+ F+   C
Sbjct: 455 TLGFRSGAC 463


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  158 bits (400), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 131/420 (31%), Positives = 186/420 (44%), Gaps = 85/420 (20%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQA 82
           +GG +V L HR  P SP   S++ P   L + L R   R  +  +  S +     + S A
Sbjct: 58  SGGITVPLHHRHGPCSPV-PSNKMP-ASLEERLQRDQLRAAYIKRKFSGAKGGDVEQSDA 115

Query: 83  DIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             +P       +   Y+I + IG+P   +    DTGSD+ W QC+PC  SQC+ +   LF
Sbjct: 116 ATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLF 173

Query: 136 DPKMSSTYKSLPCSSSQCASLNQK----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
           DP  SSTY    CSS+ C  L+Q      CS   CQY VSY DGS + G  +++T+TLGS
Sbjct: 174 DPSASSTYSPFSCSSAACVQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTLGS 233

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------- 240
                 A+ G  FGC  +  G F+ +T G++GLGG   SL+SQ   T             
Sbjct: 234 N-----AIKGFQFGCSQSESGGFSDQTDGLMGLGGDAQSLVSQTAGTFGKAFSYCLPPTP 288

Query: 241 ----------------------------------------GNQRLGVST----PDIVIDS 256
                                                   G Q+L + T       V+DS
Sbjct: 289 GSSGFLTLGAASRSGFVKTPMLRSTQIPTYYGVLLEAIRVGGQQLNIPTSVFSAGSVMDS 348

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GA 313
           GT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F  GA
Sbjct: 349 GTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGILDTCFDFSGQSSVSIPSVALVFSGGA 408

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            V L  +   +++  D  C  F   ++  S+   GN+ Q  F V YD+    V F+   C
Sbjct: 409 VVNLDFNGIMLEL--DNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGAC 466


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 129/418 (30%), Positives = 181/418 (43%), Gaps = 82/418 (19%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTR--SLNRLNHFNQNSSISSSKASQADII 85
           ++ ++HR  P SP       P   + L D   R  S++R      +  +  ++  +   +
Sbjct: 74  ALNVVHRQGPCSPLQARGAPPPHAELLNDDQARVDSIHRKIAAAASPVLDQARGKKGVTL 133

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P          NY++ + +GTP  +   V DTGSDL W QC PC  S CY Q  PLFDP 
Sbjct: 134 PAQRGISLGTGNYVVSMGLGTPARDMTVVFDTGSDLSWVQCTPC--SDCYEQKDPLFDPA 191

Query: 139 MSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            SSTY ++PC+S +C  L+ +SCS    C+Y V YGD S ++G LA +T+TL     Q+ 
Sbjct: 192 RSSTYSAVPCASPECQGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTL----TQSD 247

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT--------------TIAGNQ 243
            LPG  FGCG  + GLF  +  G+VGLG   +SL SQ  +              + AG  
Sbjct: 248 VLPGFVFGCGEQDTGLFG-RADGLVGLGREKVSLSSQAASKYGAGFSYCLPSSPSAAGYL 306

Query: 244 RLGVSTP------------------------------------------DIVIDSGTTLT 261
            LG   P                                            VIDSGT +T
Sbjct: 307 SLGGPAPANARFTAMETRHDSPSFYYVRLVGVKVAGRTVRVSPIVFSAAGTVIDSGTVIT 366

Query: 262 FLPQGYNSNLLSVMS-SMIEAQPVADPTGS-LELCYSF--NSLSQVPEVTIHFR-GADVK 316
            LP    + L S  + SM        P  S L+ CY F  ++  ++P V + F  GA V 
Sbjct: 367 RLPPRVYAALRSAFARSMGRYGYKRAPALSILDTCYDFTGHTTVRIPSVALVFAGGAAVG 426

Query: 317 LSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           L  S   +  KVS+  +     G      I GN  Q    V YD+ +Q + F    C+
Sbjct: 427 LDFSGVLYVAKVSQACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score =  158 bits (399), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 127/399 (31%), Positives = 174/399 (43%), Gaps = 80/399 (20%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+I RD  +       E+ Y +L      S N  N  ++  + S+   +++ I   + NY
Sbjct: 87  EIIRRDQARV------ESIYSKL------SKNSANEVSE--AKSTELPAKSGITLGSGNY 132

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ I IGTP  +   V DTGSDL WTQCEPC  S CY Q  P F+P  SSTY+++ CSS 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSTYQNVSCSSP 191

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
            C   + +SCS  NC YS+ YGD SF+ G LA E  TL ++      L  + FGCG NN 
Sbjct: 192 MCE--DAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQ 245

Query: 212 GLFN-----------------------------------SKTTGIVGLGGGDISL----- 231
           GLF+                                   S +TG +  G   IS      
Sbjct: 246 GLFDGVAGLLGLGPGKLSLPAQTTTTYNNIFSYCLPSFTSNSTGHLTFGSAGISESVKFT 305

Query: 232 -ISQMRTTI----------AGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
            IS   +             G++ L +     ST   +IDSGT  T LP    + L SV 
Sbjct: 306 PISSFPSAFNYGIDIIGISVGDKELAITPNSFSTEGAIIDSGTVFTRLPTKVYAELRSVF 365

Query: 276 SSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGAD-VKLSRSNFFVKVSEDIVC 332
              + +       G  + CY F  L  V  P +   F G+  V+L  S   + +    VC
Sbjct: 366 KEKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVELDGSGISLPIKISQVC 425

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F G  +   I+GN+ QT   V YD+    V F P  C
Sbjct: 426 LAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score =  158 bits (399), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 134/426 (31%), Positives = 185/426 (43%), Gaps = 90/426 (21%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL---RDALTRSLN-RLNHFNQNSSISSSKASQAD 83
           G  +EL H  SP SP    ++ P+  +    DA   SL  RL       + S    + A 
Sbjct: 42  GLHLELHHPRSPCSPAPVPADLPFTAVLTHDDARISSLAARLAKTPSARATSLDADADAG 101

Query: 84  IIPNNA-------------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +  + A             NY+ R+ +GTP T+ + V DTGS L W QC PC  S C+ Q
Sbjct: 102 LAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQ 160

Query: 131 DSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
             P+F+PK SSTY S+ CS+ QC     A+LN  +CS  N C Y  SYGD SFS G L+ 
Sbjct: 161 SGPVFNPKSSSTYASVGCSAQQCSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSK 220

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-------- 236
           +TV+ GST+     LP   +GCG +N GLF  ++ G++GL    +SL+ Q+         
Sbjct: 221 DTVSFGSTS-----LPNFYYGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFT 274

Query: 237 -------------------------------------------TTIAGN-----QRLGVS 248
                                                       T+AGN          S
Sbjct: 275 YCLPSSSSSGYLSLGSYNPGQYSYTPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSS 334

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVT 307
            P I+ DSGT +T LP    S L   +++ ++    A     L+ C+    S    P VT
Sbjct: 335 LPTII-DSGTVITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVT 393

Query: 308 IHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           + F  GA +KLS  N  V V +   C  F     S  I GN  Q  F V YD++   + F
Sbjct: 394 MSFAGGAALKLSAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGF 452

Query: 367 KPTDCT 372
               C+
Sbjct: 453 AAGGCS 458


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 76/173 (43%), Positives = 112/173 (64%), Gaps = 3/173 (1%)

Query: 72  SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           S++  + + Q+ I     +YL+ +SIGTPP +   +ADTGSDL WT C PC  ++CY Q 
Sbjct: 6   SAMEKTVSPQSPIYAYLGHYLMEVSIGTPPFKIYGIADTGSDLTWTSCVPC--NKCYKQR 63

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLG 190
           +P+FDP+ S++Y+++ C S  C  L+   CS   +C Y+ +Y   + + G LA ET+TL 
Sbjct: 64  NPIFDPQKSTSYRNISCDSKLCHKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLS 123

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
           ST G++V L GI FGCG NN G FN +  GI+GLGGG +S ISQ+ ++  G +
Sbjct: 124 STKGESVPLKGIVFGCGHNNTGGFNDREMGIIGLGGGPVSFISQIGSSFGGKR 176



 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 74/135 (54%), Gaps = 1/135 (0%)

Query: 241 GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNS 299
           G+    V   ++ +DSGT  T LP      L++ + S +  +PV  D     +LCY   +
Sbjct: 239 GSSSQSVEKGNVFLDSGTPPTILPTQLYDRLVAQVRSEVAMKPVTNDLDLGPQLCYRTKN 298

Query: 300 LSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
             + P +T HF G DVKL  +  FV   + + C  F   ++   +YGN  Q+N+L+G+D+
Sbjct: 299 NLRGPVLTAHFEGGDVKLLPTQTFVSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDL 358

Query: 360 EQQTVSFKPTDCTKQ 374
           ++Q VSFKP DCTK 
Sbjct: 359 DRQVVSFKPMDCTKH 373


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  157 bits (398), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 92/213 (43%), Positives = 139/213 (65%), Gaps = 17/213 (7%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ++LIHRDSP SP +  + T   RL+ +  R+++R     Q+  +      Q D++P+   
Sbjct: 29  LDLIHRDSPLSPLHTPNLTFSDRLQASFLRAISR-----QSRHVDF----QTDLLPSGGE 79

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++ +SIGTPP   LA+ADTGSDL W Q +PC   QCY Q  P+FDP  S+T+  LPC++
Sbjct: 80  YMMNLSIGTPPFPILAIADTGSDLTWLQSKPC--DQCYPQKGPIFDPSNSTTFHKLPCTT 137

Query: 151 SQCASLNQ--KSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           + C +L++  +SC+    C Y+ SYGD S++ G LA++TVT+G+    +V +  + FGCG
Sbjct: 138 APCNALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTVGNA---SVQIRNVAFGCG 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           T NGG F+ + +GIVGLGGG++S +SQ+  TI 
Sbjct: 195 TRNGGNFDEQGSGIVGLGGGNLSFVSQLGDTIG 227



 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 58/131 (44%), Positives = 81/131 (61%), Gaps = 4/131 (3%)

Query: 247 VSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY-SFNSLSQVP 304
           V   +I+IDSGTTLTFL + +   L + +   I+ + V D   S+  LC+ S     ++P
Sbjct: 320 VEEGNIIIDSGTTLTFLEEEFYGALEAALVEEIKMERVNDVKNSMFSLCFKSGKEEVELP 379

Query: 305 EVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
            + +HFRG ADV+L   N FV+  E +VC      TN V IYGN+ Q NF+VGYD+ ++T
Sbjct: 380 LMKVHFRGGADVELKPVNTFVRAEEGLVCFTMLP-TNDVGIYGNLAQMNFVVGYDLGKRT 438

Query: 364 VSFKPTDCTKQ 374
           VSF P DC+KQ
Sbjct: 439 VSFLPADCSKQ 449


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score =  157 bits (398), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
           G  N GLF  +  G++GLG G  SL  Q               R+T  G    G  +P  
Sbjct: 290 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA 348

Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
                                                    ++DSGT +T LP    S+L
Sbjct: 349 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 408

Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
            S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     V
Sbjct: 409 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 468

Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           S   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 515


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F        + 
Sbjct: 37  KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 76

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 77  EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 134

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 135 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 194

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++ +  +               
Sbjct: 195 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 254

Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
                G +  G STP                                      +V+DSGT
Sbjct: 255 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 314

Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
           T TFL +     L + +  ++    Q V   T    LCY       L   PE+  HF  G
Sbjct: 315 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 374

Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           AD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + V F+ TD
Sbjct: 375 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 434

Query: 371 C 371
           C
Sbjct: 435 C 435


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score =  157 bits (397), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 179 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 237

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 238 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 293

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTP-- 250
           G  N GLF  +  G++GLG G  SL  Q               R+T  G    G  +P  
Sbjct: 294 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPARSTGTGYLDFGAGSPPA 352

Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
                                                    ++DSGT +T LP    S+L
Sbjct: 353 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 412

Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
            S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     V
Sbjct: 413 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 472

Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           S   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 473 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score =  157 bits (397), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 151/353 (42%), Gaps = 104/353 (29%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           NN  YL++ISIGTPP +   + DTGSDL+WTQC PC    CY Q +P+FDP  S+++K +
Sbjct: 20  NNGEYLMKISIGTPPFDVYGIYDTGSDLMWTQCLPC--LSCYKQKNPMFDPSKSTSFKEV 77

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C S QC  L+                          T T  L            I FGC
Sbjct: 78  SCESQQCRLLD--------------------------TPTSILN-----------IVFGC 100

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG-------------------- 246
           G NN G FN    G+ G GG  +SL SQ+ +T+   ++                      
Sbjct: 101 GHNNSGTFNENEMGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLVPFRTDPSITSKIIFGP 160

Query: 247 ---------VSTPDIVIDSGT----TL-----------------------TFLPQGYNSN 270
                    VSTP +  D  T    TL                        F+  G    
Sbjct: 161 EAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPFSSSSPMATKGNVFIDAGTPPT 220

Query: 271 LL------SVMSSMIEAQP---VADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSN 321
           LL       ++  + EA P   V DP    +LCY   +L   P +T HF GADV+L   N
Sbjct: 221 LLPRDFYNRLVQGVKEAIPMEPVQDPDLQPQLCYRSATLIDGPILTAHFDGADVQLKPLN 280

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
            F+   E + C   + I     I+GN +Q NFL+G+D++ + VSFK  DCTKQ
Sbjct: 281 TFISPKEGVYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCTKQ 333


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 115/347 (33%), Positives = 156/347 (44%), Gaps = 68/347 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC  + CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVA-CYEQREKLFDPASSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLGVSTP-- 250
           G  N GLF  +  G++GLG G  SL  Q               R+T  G    G  +P  
Sbjct: 291 GERNDGLFG-EAAGLLGLGRGKTSLPVQTYGKYGGVFAHCLPPRSTGTGYLDFGAGSPPA 349

Query: 251 ---------------------------------------DIVIDSGTTLTFLPQGYNSNL 271
                                                    ++DSGT +T LP    S+L
Sbjct: 350 TTTTPMLTGNGPTFYYVGMTGIRVGGRLLPIAPSVFAAAGTIVDSGTVITRLPPAAYSSL 409

Query: 272 LSVMSSMIEAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKV 326
            S  ++ + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S     V
Sbjct: 410 RSAFAAAMAARGYRKAAAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAALDVDASGIMYTV 469

Query: 327 SEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           S   VC  F G  +   V I GN     F V YDI ++ V F P  C
Sbjct: 470 SASQVCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 516


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 135/424 (31%), Positives = 195/424 (45%), Gaps = 85/424 (20%)

Query: 21  PIEAQTGGFSVELIHRDSPKSP-----FYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
           P  A++ GFS  +I R    +      F  ++   ++RL    +RS ++++   Q+SS S
Sbjct: 22  PAHAESRGFSGTMIRRGRTDTTTAAINFTQAALESHRRLSFLASRS-SQVDK-PQSSSAS 79

Query: 76  SSKASQADIIP-----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
               +  D +P         Y +  SIGTPP +  A+ADTGSDLIWT+C+          
Sbjct: 80  QLSNNDTDTVPLRMDGGGGAYDMEFSIGTPPQKLTALADTGSDLIWTKCDAG--GGAAWG 137

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-----GVNCQYSVSYG---DGSFSNGNL 182
            S  + P  SST+  LPCS   CA+L   S +     G  C Y  +YG   D  F+ G L
Sbjct: 138 GSSSYHPNASSTFTRLPCSDRLCAALRSYSLARCAAGGAECDYKYAYGLGDDPDFTQGFL 197

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ-------- 234
            +ET TLG       A+PG+ FGC T   G +  +  G+VGLG G +SL+SQ        
Sbjct: 198 GSETFTLGGD-----AVPGVGFGCTTALEGDYG-EGAGLVGLGRGPLSLVSQLDAGTFMY 251

Query: 235 -------------------MRTTIAGNQRLG---------VSTPDIVI------------ 254
                              M    AG Q  G         V+   I I            
Sbjct: 252 CLTADASKASPLLFGALATMTGAGAGVQSTGLLASTTFYAVNLRSITIGSATTAGVGGPG 311

Query: 255 ----DSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQVPEVTI 308
               DSGTTLT+L +  Y     + +S      PV    G  E CY   +S   +P + +
Sbjct: 312 GVVFDSGTTLTYLAEPAYTEAKAAFLSQTTSLTPVEGRYG-FEACYEKPDSARLIPAMVL 370

Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF  GAD+ L  +N+ V+V + +VC V +  + S+ I GNIMQ N+LV +D+ +  +SF+
Sbjct: 371 HFDGGADMALPVANYVVEVDDGVVCWVVQ-RSPSLSIIGNIMQMNYLVLHDVRKSVLSFQ 429

Query: 368 PTDC 371
           P +C
Sbjct: 430 PANC 433


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F        + 
Sbjct: 5   KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAF-------ITD 44

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 45  EIQANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++ +  +               
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 222

Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
                G +  G STP                                      +V+DSGT
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
           T TFL +     L + +  ++    Q V   T    LCY       L   PE+  HF  G
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 342

Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           AD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + V F+ TD
Sbjct: 343 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 402

Query: 371 C 371
           C
Sbjct: 403 C 403


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  157 bits (396), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/210 (43%), Positives = 131/210 (62%), Gaps = 15/210 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF V L H DS K+       T  +R+R  + R  NRL      + ++SS +  +A ++P
Sbjct: 39  GFRVRLKHVDSGKN------LTKLERIRHGVKRGRNRLQRLQAMALVASSSSEIEAPVLP 92

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L++++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q +P+FDPK SS++  L
Sbjct: 93  GNGEFLMKLAIGTPPETYSAILDTGSDLIWTQCKPC--TQCFHQSTPIFDPKKSSSFSKL 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q SC+   C+Y  SYGD S + G LA+ET+T G       ++P + FGC
Sbjct: 151 SCSSQLCEALPQSSCNN-GCEYLYSYGDYSSTQGILASETLTFGK-----ASVPNVAFGC 204

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           G +N G   S+  G+VGLG G +SL+SQ++
Sbjct: 205 GADNEGSGFSQGAGLVGLGRGPLSLVSQLK 234



 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 45/125 (36%), Positives = 77/125 (61%), Gaps = 7/125 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
           ++IDSGTT+T+L +   + +    ++ I   PV D +GS  L++C++  S S   +VP++
Sbjct: 314 LIIDSGTTITYLEESAFNLVAKEFTAKINL-PV-DSSGSTGLDVCFTLPSGSTNIEVPKL 371

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
             HF GAD++L   N+ +  S   V  +  G ++ + I+GN+ Q N LV +D+E++T+SF
Sbjct: 372 VFHFDGADLELPAENYMIGDSSMGVACLAMGSSSGMSIFGNVQQQNMLVLHDLEKETLSF 431

Query: 367 KPTDC 371
            PT C
Sbjct: 432 LPTQC 436


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 94/216 (43%), Positives = 131/216 (60%), Gaps = 10/216 (4%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  ++L+  DSP SPF   + +  +R + A+ RS +RL       S+   KA +A +   
Sbjct: 54  GLRIDLVRTDSPLSPFSPGNISSTERFKRAIKRSQDRLEKLQM--SVDEVKAVEAPVYAG 111

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  +L++++IGTP     A+ DTGSDL WTQC+PC  + CY Q +P++DP  SSTY  +P
Sbjct: 112 NGEFLMKMAIGTPSLSFSAILDTGSDLTWTQCKPC--TDCYPQPTPIYDPSQSSTYSKVP 169

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CSSS C +L   SCSG NC+Y  SYGD S + G L+ E+ TL S      +LP I FGCG
Sbjct: 170 CSSSMCQALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ-----SLPHIAFGCG 224

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
             N G   S+  G+VG G G +SLISQ+  ++ GN+
Sbjct: 225 QENEGGGFSQGGGLVGFGRGPLSLISQLGQSL-GNK 259



 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 50/127 (39%), Positives = 71/127 (55%), Gaps = 6/127 (4%)

Query: 249 TPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY---SFNSLSQVP 304
           T  ++IDSGTT+T+L Q GY+    +V+SS+   Q      G L+LC+   S +S S  P
Sbjct: 333 TGGVIIDSGTTVTYLEQSGYDVVKKAVISSINLPQVDGSNIG-LDLCFEPQSGSSTSHFP 391

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            +T HF GAD  L + N+    S  I C      +N + I+GNI Q N+ + YD E+  +
Sbjct: 392 TITFHFEGADFNLPKENYIYTDSSGIACLAMLP-SNGMSIFGNIQQQNYQILYDNERNVL 450

Query: 365 SFKPTDC 371
           SF PT C
Sbjct: 451 SFAPTVC 457


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  156 bits (395), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 127/421 (30%), Positives = 193/421 (45%), Gaps = 91/421 (21%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSK 78
            P+   TG     LIH+DS  S         YQ L R+ + R   R   F  +       
Sbjct: 5   KPLRLVTG-----LIHQDSILSS--------YQSLDRNNVERRRTRRAAFIXDEI----- 46

Query: 79  ASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             QA+++ ++    +L+  S+G PP  +L   DTGSDL+W QC PC  + C+ Q +P+FD
Sbjct: 47  --QANMVADDRGQAFLVNFSVGRPPVPQLVGIDTGSDLLWVQCRPC--ADCFRQSTPIFD 102

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           P  SSTY  L   S  C +  QK  + +N C Y+ SY DGS S+GNLATE +   ++   
Sbjct: 103 PSKSSTYVDLSYDSPICPNSPQKKYNHLNQCIYNASYADGSTSSGNLATEDIVFETSDQG 162

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------- 240
            V +  + FGCG +N G F+ + +GI+GL  GD S++S++ +  +               
Sbjct: 163 TVTVSSVVFGCGHSNRGRFDGQQSGILGLSAGDQSIVSRLGSRFSYCIGDLFDPHYTHNQ 222

Query: 241 -----GNQRLGVSTP-------------------------------------DIVIDSGT 258
                G +  G STP                                      +V+DSGT
Sbjct: 223 LVLGDGVKMEGSSTPFHTFNGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGT 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSF---NSLSQVPEVTIHF-RG 312
           T TFL +     L + +  ++    Q V   T    LCY       L   PE+  HF  G
Sbjct: 283 TATFLAKDGFDPLSNEIQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEG 342

Query: 313 ADVKLSRSNFFVKVSEDIVC-SVFK-GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           AD+ L  ++ FV+ ++D+ C +V +  + N   + G + Q ++ V YD+  + V F+ TD
Sbjct: 343 ADLVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTD 402

Query: 371 C 371
           C
Sbjct: 403 C 403


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  156 bits (394), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 159/366 (43%), Gaps = 86/366 (23%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ +++GTPP       DTGSDL+WTQC PC    C+ Q  PL DP  SSTY +LPC 
Sbjct: 91  EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFHQGLPLLDPAASSTYAALPCG 148

Query: 150 SSQCASLNQKSCSG----------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA- 198
           + +C +L   SC G           +C Y   YGD S + G +AT+  T G   G   + 
Sbjct: 149 APRCRALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSR 208

Query: 199 LP--GITFGCGTNNGGLFNSKTTGIVG--------------------------------- 223
           LP   +TFGCG  N G+F S  TGI G                                 
Sbjct: 209 LPTRRLTFGCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNVTTFSYCFTSMFESKSSLVT 268

Query: 224 LGGGDISLI---------SQMRTTI-------------------AGNQRLGVSTPDI--- 252
           LGG   + +          ++RTT                     G  RL V    +   
Sbjct: 269 LGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPEAKLRST 328

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF--NSLSQ---VPEV 306
           +IDSG ++T LP+     + +  ++ +   P     GS L+LC++    +L +   VP +
Sbjct: 329 IIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEGSALDLCFALPVTALWRRPPVPSL 388

Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           T+H  GAD +L R N+ F  ++  ++C V         + GN  Q N  V YD+E   +S
Sbjct: 389 TLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTVIGNFQQQNTHVVYDLENDWLS 448

Query: 366 FKPTDC 371
           F P  C
Sbjct: 449 FAPARC 454


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 184/415 (44%), Gaps = 78/415 (18%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP--- 86
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+   C G      C YSV+YGDGS++ G LA ET+TLG T  Q V
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGV 238

Query: 198 A----------------LPGITFGC-------GTNNGGLFN----SKTTGIVG------- 223
           A                L G+ +G        G   GG+F+    S+  G  G       
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLIGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298

Query: 224 ----LGGGDISLISQMRTTI----------AGNQRLGVS------TPD----IVIDSGTT 259
               +G   + L+   + +            G +RL +       T D    +V+D+GT 
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTA 358

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVK 316
           +T LP+   + L       + A P +     L+ CY  +  +  +VP V+ +F +GA + 
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLT 418

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 419 LPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/415 (28%), Positives = 184/415 (44%), Gaps = 78/415 (18%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN-- 87
           S+ L+HRD+     Y S      ++   + R   R+ H  +    S+S     D++    
Sbjct: 64  SLSLVHRDAISGATYPSRR---HQVVGLVARDNARVEHLEKRLVASTSPYLPEDLVSEVV 120

Query: 88  ------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +R+ +G+PPT++  V D+GSD+IW QC PC   QCY Q  PLFDP  SS
Sbjct: 121 PGVDDGSGEYFVRVGVGSPPTDQYLVVDSGSDVIWVQCRPC--EQCYAQTDPLFDPAASS 178

Query: 142 TYKSLPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++  + C S+ C +L+   C G      C YSV+YGDGS++ G LA ET+TLG T  Q V
Sbjct: 179 SFSGVSCGSAICRTLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTLGGTAVQGV 238

Query: 198 A----------------LPGITFGC-------GTNNGGLFN----SKTTGIVG------- 223
           A                L G+ +G        G   GG+F+    S+  G  G       
Sbjct: 239 AIGCGHRNSGLFVGAAGLLGLGWGAMSLVGQLGGAAGGVFSYCLASRGAGGAGSLVLGRT 298

Query: 224 ----LGGGDISLISQMRTTI----------AGNQRLGVS------TPD----IVIDSGTT 259
               +G   + L+   + +            G +RL +       T D    +V+D+GT 
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTA 358

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVK 316
           +T LP+   + L       + A P +     L+ CY  +  +  +VP V+ +F +GA + 
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDTCYDLSGYASVRVPTVSFYFDQGAVLT 418

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   N  V+V   + C  F   ++ + I GNI Q    +  D     V F P  C
Sbjct: 419 LPARNLLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNTC 473


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 113/351 (32%), Positives = 163/351 (46%), Gaps = 76/351 (21%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTP  E   V DTGSD+ W QC PC  S+CY Q  P+FDP  SST+KSL 
Sbjct: 161 SGEYFSRIGVGTPAKEMYVVLDTGSDVNWIQCLPC--SECYQQSDPIFDPTSSSTFKSLT 218

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CS  +CASL+  +C    C Y VSYGDGSF+ GN AT+TVT     G++  +  +  GCG
Sbjct: 219 CSDPKCASLDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF----GESGKVNDVALGCG 274

Query: 208 TNNGGLF---------------------------------NSKTTGI----VGLGGGDIS 230
            +N GLF                                 ++K++ +    V +G GD +
Sbjct: 275 HDNEGLFTGAAGLLGLGGGALSMTNQIKAKSFSYCLVDRDSAKSSSLDFNSVQIGAGDAT 334

Query: 231 --LI--SQMRT---------TIAGNQRLGVSTPD------------IVIDSGTTLTFL-P 264
             L+  S+M T         ++ G Q   VS P             +++D GT +T L  
Sbjct: 335 APLLRNSKMDTFYYVGLSGFSVGGQQ---VSIPSSLFEVDASGAGGVILDCGTAVTRLQT 391

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSN 321
           Q YNS   + +    + +    P    + CY F+SLS  +VP VT HF G   + L   N
Sbjct: 392 QAYNSLRDAFVKLTTDFKKGTSPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLNLPAKN 451

Query: 322 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + + + +    C  F   ++S+ I GN+ Q    + YD+    +      C
Sbjct: 452 YLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 92/215 (42%), Positives = 129/215 (60%), Gaps = 14/215 (6%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----NSSISSSKASQ 81
           T GF V L H DS K+       T  +R++  + R  +RL   N      S++ S    +
Sbjct: 45  TKGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQRLNAMVLAASTLDSEDQLE 98

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I   N  YL+ ++IGTPP    AV DTGSDLIWTQC+PC  +QCY Q +P+FDPK SS
Sbjct: 99  APIHAGNGEYLMELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TQCYKQPTPIFDPKKSS 156

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           ++  + C SS C+++   +CS   C+Y  SYGD S + G LATET T G +  + V++  
Sbjct: 157 SFSKVSCGSSLCSAVPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVHN 214

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           I FGCG +N G    + +G+VGLG G +SL+SQ++
Sbjct: 215 IGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK 249



 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 48/156 (30%), Positives = 82/156 (52%), Gaps = 7/156 (4%)

Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
           + G+  GD  L  +  T   G+   G     ++IDSGTT+T++ Q     L     S  +
Sbjct: 301 LEGISVGDTRLSIEKSTFEVGDDGNG----GVIIDSGTTITYIEQKAFEALKKEFISQTK 356

Query: 281 AQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKG 337
                  +  L+LC+S  S S   ++P++  HF+G D++L   N+ +  S   V  +  G
Sbjct: 357 LPLDKTSSTGLDLCFSLPSGSTQVEIPKIVFHFKGGDLELPAENYMIGDSNLGVACLAMG 416

Query: 338 ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
            ++ + I+GN+ Q N LV +D+E++T+SF PT C +
Sbjct: 417 ASSGMSIFGNVQQQNILVNHDLEKETISFVPTSCDQ 452


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 115/411 (27%), Positives = 178/411 (43%), Gaps = 76/411 (18%)

Query: 23  EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           + + GG  + ++++HRD      + +S+    RL   L R   R+    +  S     + 
Sbjct: 125 DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 181

Query: 81  QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           + D         +   +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q 
Sbjct: 182 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 239

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
            P+FDP  S+++  + CSSS C  L    C    C+Y VSYGDGS++ G LA ET+T G 
Sbjct: 240 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 299

Query: 192 TTGQAVALPGITFGCGTNNGGLF---------NSKTTGIVGLGGGDIS------LISQMR 236
           T  ++VA+     GCG  N G+F            +   VG  GG         L+S   
Sbjct: 300 TMVRSVAI-----GCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSAAW 354

Query: 237 TTIAGNQRL--------------GVSTP--------------DIVIDSGTTLTFLP---- 264
             +  N R               G+  P               +V+D+GT +T LP    
Sbjct: 355 VPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTRLPTLAY 414

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV-KLSRSN 321
           Q +    L+  +++  A  VA      + CY        +VP V+ +F G  +  L   N
Sbjct: 415 QAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSVRVPTVSFYFSGGPILTLPARN 470

Query: 322 FFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           F + + +    C  F   T+ + I GNI Q    + +D     V F P  C
Sbjct: 471 FLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPNIC 521


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  155 bits (392), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 134/451 (29%), Positives = 200/451 (44%), Gaps = 94/451 (20%)

Query: 1   MATFLSCVFILFFLCFYV----VSPIEAQTGG---------FSVELIHRDSPKSPFYNSS 47
           MA+  S + I+  L   V    VSP  + + G         F V L H DS        +
Sbjct: 1   MASSGSHMIIVILLALAVSSALVSPAASTSRGLDRRPEKTWFRVSLRHVDS------GGN 54

Query: 48  ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
            T ++RL+ A+ R   RL   +  ++ S   + +A +   N  +L++++IGTP     A+
Sbjct: 55  YTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGNGEFLMKLAIGTPAETYSAI 113

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
            DTGSDLIWTQC+PC    C+ Q +P+FDPK SS++  LPCSS  CA+L   SCS   C+
Sbjct: 114 MDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPCSSDLCAALPISSCSD-GCE 170

Query: 168 YSVSY-----------------GDGSFSNGNLATETVTLGSTTGQAVALPGITFG----- 205
           Y  SY                 GD S S           GS   Q   L G+  G     
Sbjct: 171 YLYSYGDYSSTQGVLATETFAFGDASVSKIGFGCGEDNDGSGFSQGAGLVGLGRGPLSLI 230

Query: 206 ----------CGTN--------------NGGLFNSKTTGIV--------------GLGGG 227
                     C T+                 + N+ TT ++              G+  G
Sbjct: 231 SQLGEPKFSYCLTSMDDSKGISSLLVGSEATMKNAITTPLIQNPSQPSFYYLSLEGISVG 290

Query: 228 DISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
           D  L  +  T    N   G     ++IDSGTT+T+L     + L     S ++     D 
Sbjct: 291 DTLLPIEKSTFSIQNDGSG----GLIIDSGTTITYLEDSAFAALKKEFISQLKLD--VDE 344

Query: 288 TGS--LELCYSF---NSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSV 342
           +GS  L+LC++     S   VP++  HF GAD+KL   N+ +  S   V  +  G ++ +
Sbjct: 345 SGSTGLDLCFTLPPDASTVDVPQLVFHFEGADLKLPAENYIIADSGLGVICLTMGSSSGM 404

Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
            I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 405 SIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 130/434 (29%), Positives = 181/434 (41%), Gaps = 88/434 (20%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLNR-------L 65
           V+SP  A T   S+ + HR    S   N   T        RL  A   S++         
Sbjct: 51  VLSP-RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLTT 109

Query: 66  NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           NH +Q+ S        + +   + NY++ + +GTP  +   + DTGSDL WTQC+PC  +
Sbjct: 110 NHVSQSQSTDLPAKDGSTL--GSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT 167

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNG 180
            CY Q  P+F+P  S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G
Sbjct: 168 -CYDQKEPIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVG 226

Query: 181 NLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT-- 238
            LA +  TL S+        G+ FGCG NN GLF +   G++GLG   +S  SQ  T   
Sbjct: 227 FLAKDKFTLTSSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYN 281

Query: 239 --------------------------------------------------IAGNQRLGV- 247
                                                               G Q+L + 
Sbjct: 282 KIFSYCLPSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIP 341

Query: 248 ----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
               STP  +IDSGT +T LP    + L S   + +   P       L+ C+  +    V
Sbjct: 342 STVFSTPGALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTV 401

Query: 304 --PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 358
             P+V   F  GA V+L     F       VC  F G ++  +  I+GN+ Q    V YD
Sbjct: 402 TIPKVAFSFSGGAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYD 461

Query: 359 IEQQTVSFKPTDCT 372
                V F P  C+
Sbjct: 462 GAGGRVGFAPNGCS 475


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 187/412 (45%), Gaps = 77/412 (18%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVS-----------------YGDGSFSNGNLATETV 187
            LPCSS  C +L   SCS   C+Y  S                 +GD S S         
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGED 207

Query: 188 TLGSTTGQAVALPGITFG---------------CGTNNGGLFNSKTTGIVGLGGGDISLI 232
             G    Q   L G+  G               C T+   + +SK    + L G + ++ 
Sbjct: 208 NRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS---IDDSKGISTL-LVGSEATVK 263

Query: 233 SQMRTTIAGNQRL---------GVSTPD-------------------IVIDSGTTLTFLP 264
           S + T +  N            G+S  D                   ++IDSGTT+T+L 
Sbjct: 264 SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLK 323

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
               + L     S ++    A  +  LELC++     S  +VP++  HF G D+KL + N
Sbjct: 324 DNAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLKLPKEN 383

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           + ++ S   V  +  G ++ + I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 384 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score =  154 bits (390), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 128/445 (28%), Positives = 183/445 (41%), Gaps = 103/445 (23%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A + G  + ++HR  P SP  ++   P     + L    NR+   +   S +++   +  
Sbjct: 83  ASSSGTRMTIVHRHGPCSPLADAHGKPPSH-DEILAADQNRVESIHHRVSTTATVRGKPK 141

Query: 84  IIPN---------------------------------NANYLIRISIGTPPTERLAVADT 110
             P+                                   NY++ I +GTP +    V DT
Sbjct: 142 RRPSPSRRQQQPSAPAPAASLSSSTASLPASSGRALGTGNYVVTIGLGTPASRYTVVFDT 201

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSV 170
           GSD  W QC+PC    CY Q   LFDP  SSTY ++ C++  C+ L  + CSG +C YSV
Sbjct: 202 GSDTTWVQCQPCV-VVCYKQQEKLFDPARSSTYANVSCAAPACSDLYTRGCSGGHCLYSV 260

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------------- 214
            YGDGS+S G  A +T+TL S      A+ G  FGCG  N GLF                
Sbjct: 261 QYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGCGERNEGLFGEAAGLLGLGRGKTSL 316

Query: 215 ------------------NSKTTGIVGLGGGDISLISQMRTT------------------ 238
                              S  TG +  G G  + +   +TT                  
Sbjct: 317 PVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAAVGARQTTPMLTDNGPTFYYVGMTGI 376

Query: 239 IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGSL 291
             G Q L +     ST   ++DSGT +T LP    S+L S  +S + A+    A     L
Sbjct: 377 RVGGQLLSIPQSVFSTAGTIVDSGTVITRLPPAAYSSLRSAFASAMAARGYKKAPALSLL 436

Query: 292 ELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKG--ITNSVPIYG 346
           + CY F  +S+V  P+V++ F+ GA + ++ S      S   VC  F      + V I G
Sbjct: 437 DTCYDFTGMSEVAIPKVSLLFQGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVG 496

Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
           N     F V YDI ++TV F P  C
Sbjct: 497 NTQLKTFGVVYDIGKKTVGFSPGAC 521


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  154 bits (390), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 133/210 (63%), Gaps = 15/210 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF  +L H DS K+       T ++R++  + R  +RL  F   + ++SS +   A ++P
Sbjct: 39  GFRAKLKHVDSGKNL------TKFERIQHGVKRGRHRLQRFKAMALVASSNSEIDAPVLP 92

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L++++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q +P+FDPK SS++  L
Sbjct: 93  GNGEFLMKLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPTPIFDPKKSSSFSKL 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q +CS   C+Y   YGD S + G LA+ET+T G      V++P + FGC
Sbjct: 151 SCSSKLCEALPQSTCSD-GCEYLYGYGDYSSTQGMLASETLTFGK-----VSVPEVAFGC 204

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           G +N G   S+ +G+VGLG G +SL+SQ++
Sbjct: 205 GEDNEGSGFSQGSGLVGLGRGPLSLVSQLK 234



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/127 (37%), Positives = 77/127 (60%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
           ++IDSGTT+T+L Q     +    +S I   PV D +GS  LE+C++  S S   +VP++
Sbjct: 314 LIIDSGTTITYLEQSAFDLVAKEFTSQINL-PV-DNSGSTGLEVCFTLPSGSTDIEVPKL 371

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
             HF GAD++L   N+ +  +   V  +  G ++ + I+GNI Q N LV +D+E++T+SF
Sbjct: 372 VFHFDGADLELPAENYMIADASMGVACLAMGSSSGMSIFGNIQQQNMLVLHDLEKETLSF 431

Query: 367 KPTDCTK 373
            PT C +
Sbjct: 432 LPTQCDE 438


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 93/216 (43%), Positives = 129/216 (59%), Gaps = 15/216 (6%)

Query: 26  TGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-----NSSISSSKAS 80
           T GF V L H DS K+       T  +R++  + R  +RL   N      +S+  S    
Sbjct: 44  TNGFRVMLRHVDSGKN------LTKLERVQHGIKRGKSRLQKLNAMVLAASSTPDSEDQL 97

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +A I   N  YLI ++IGTPP    AV DTGSDLIWTQC+PC  ++CY Q +P+FDPK S
Sbjct: 98  EAPIHAGNGEYLIELAIGTPPVSYPAVLDTGSDLIWTQCKPC--TRCYKQPTPIFDPKKS 155

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S++  + C SS C++L   +CS   C+Y  SYGD S + G LATET T G +  + V++ 
Sbjct: 156 SSFSKVSCGSSLCSALPSSTCSD-GCEYVYSYGDYSMTQGVLATETFTFGKSKNK-VSVH 213

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            I FGCG +N G    + +G+VGLG G +SL+SQ++
Sbjct: 214 NIGFGCGEDNEGDGFEQASGLVGLGRGPLSLVSQLK 249



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/126 (34%), Positives = 74/126 (58%), Gaps = 5/126 (3%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVT 307
           ++IDSGTT+T++ Q  Y +     +S    A      TG L+LC+S  S S   ++P++ 
Sbjct: 328 VIIDSGTTITYVQQKAYEALKKEFISQTKLALDKTSSTG-LDLCFSLPSGSTQVEIPKLV 386

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            HF+G D++L   N+ +  S   V  +  G ++ + I+GN+ Q N LV +D+E++T+SF 
Sbjct: 387 FHFKGGDLELPAENYMIGDSNLGVACLAMGASSGMSIFGNVQQQNILVNHDLEKETISFV 446

Query: 368 PTDCTK 373
           PT C +
Sbjct: 447 PTSCDQ 452


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  154 bits (389), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 116/388 (29%), Positives = 171/388 (44%), Gaps = 77/388 (19%)

Query: 56  DALTRSLNRLNHFNQNSSISS--SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
           +A+ RS  R+  +    S  +  S+  Q+ +   N  YL+ +++G+PP     + DTGSD
Sbjct: 2   EAVQRSHERVAFYTLKLSPDAFGSQEFQSPVKAGNGEYLMTLTLGSPPQSFDVIVDTGSD 61

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC--ASLNQKSCSGVNCQYSVS 171
           L W QC PC    CY Q  P FDP  S +++   C+ + C  ++L  K+C+   CQY  +
Sbjct: 62  LNWVQCLPC--RVCYQQPGPKFDPSKSRSFRKAACTDNLCNVSALPLKACAANVCQYQYT 119

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           YGD S +NG+LA ET++L +  G   ++P   FGCGT N G F +   G+VGLG G +SL
Sbjct: 120 YGDQSNTNGDLAFETISLNNGAGTQ-SVPNFAFGCGTQNLGTF-AGAAGLVGLGQGPLSL 177

Query: 232 ISQMRTTIA--------------------------------------------------- 240
            SQ+  T A                                                   
Sbjct: 178 NSQLSHTFANKFSYCLVSLNSLSASPLTFGSIAAAANIQYTSIVVNARHPTYYYVQLNSI 237

Query: 241 --GNQRLGVSTPDIVIDSGT-----------TLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
             G Q L ++     ID  T           T+T L     S +L    S +    +   
Sbjct: 238 EVGGQPLNLAPSVFAIDQSTGRGGTIIDSGTTITMLTLPAYSAVLRAYESFVNYPRLDGS 297

Query: 288 TGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP 343
              L+LC++   +S   VP++   F+GAD ++   N FV V  S   +C    G +    
Sbjct: 298 AYGLDLCFNIAGVSNPSVPDMVFKFQGADFQMRGENLFVLVDTSATTLCLAMGG-SQGFS 356

Query: 344 IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           I GNI Q N LV YD+E + + F   DC
Sbjct: 357 IIGNIQQQNHLVVYDLEAKKIGFATADC 384


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  154 bits (389), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 122/221 (55%), Gaps = 18/221 (8%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL----NHFNQNSSISSSKASQADI 84
           + ++L+HRD  K P +N+S     R    + R   R+     H        + +A  +D+
Sbjct: 66  YKLKLVHRD--KVPTFNTSHDHRTRFNARMQRDTKRVAALRRHLAAGKPTYAEEAFGSDV 123

Query: 85  IPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +RI +G+PP  +  V D+GSD+IW QCEPC  +QCY Q  P+F+P  S
Sbjct: 124 VSGMEQGSGEYFVRIGVGSPPRNQYVVIDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 181

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y  + C+S+ C+ ++   C    C+Y VSYGDGS++ G LA ET+T G T  + VA+ 
Sbjct: 182 SSYAGVSCASTVCSHVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTFGRTLIRNVAI- 240

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
               GCG +N G+F     G++GLG G +S + Q+     G
Sbjct: 241 ----GCGHHNQGMF-VGAAGLLGLGSGPMSFVGQLGGQAGG 276


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  154 bits (388), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 119/412 (28%), Positives = 186/412 (45%), Gaps = 77/412 (18%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVS-----------------YGDGSFSNGNLATETV 187
            LPCSS  C +L   SCS   C+Y  S                 +GD S S         
Sbjct: 149 KLPCSSDLCVALPISSCSD-GCEYRYSYGDHSSTQGVLATETFTFGDASVSKIGFGCGED 207

Query: 188 TLGSTTGQAVALPGITFG---------------CGTNNGGLFNSKTTGIVGLGGGDISLI 232
             G    Q   L G+  G               C T+   + +SK    + L G + ++ 
Sbjct: 208 NRGRAYSQGAGLVGLGRGPLSLISQLGVPKFSYCLTS---IDDSKGISTL-LVGSEATVK 263

Query: 233 SQMRTTIAGNQRL---------GVSTPD-------------------IVIDSGTTLTFLP 264
           S + T +  N            G+S  D                   ++IDSGTT+T+L 
Sbjct: 264 SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITYLK 323

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
               + L     S ++    A  +  LELC++     S   VP++  HF G D+KL + N
Sbjct: 324 DSAFAALKKEFISQMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 383

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           + ++ S   V  +  G ++ + I+GN  Q N +V +D+E++T+SF P  C +
Sbjct: 384 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIVVLHDLEKETISFAPAQCNQ 435


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score =  154 bits (388), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 127/416 (30%), Positives = 174/416 (41%), Gaps = 83/416 (19%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN----SSISSSK---------A 79
           ++HR  P SP  ++ +       + L    NR     +     +++S  K         A
Sbjct: 91  IVHRHGPCSPLADAHDGKLPSHEEILAADQNRAKSIQRRVSTTTTVSRGKPKRNRPSLPA 150

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           S    +    NY++ I +GTP      V DTGSD  W QCEPC    CY Q   LFDP  
Sbjct: 151 SSGSAL-GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYKQQEKLFDPAR 208

Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           SSTY ++ C++  C+ L  K CSG +C Y V YGDGS+S G  A +T+TL S      A+
Sbjct: 209 SSTYANISCAAPACSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AI 264

Query: 200 PGITFGCGTNNGGLF----------------------------------NSKTTGIVGLG 225
            G  FGCG  N GL+                                   S  TG +  G
Sbjct: 265 KGFRFGCGERNEGLYGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFG 324

Query: 226 GGDISLISQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTF 262
            G +  +S   TT                    G + L +     +T   ++DSGT +T 
Sbjct: 325 PGSLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTSGTIVDSGTVITR 384

Query: 263 LPQGYNSNLLSVM-SSMIEAQPVADPTGS-LELCYSFNSLSQV--PEVTIHFR-GADVKL 317
           LP    S+L S   S+M E      P  S L+ CY F  +S+V  P V++ F+ GA + +
Sbjct: 385 LPPAAYSSLRSAFASAMAERGYKKAPALSLLDTCYDFTGMSEVAIPTVSLLFQGGASLDV 444

Query: 318 SRSNFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             S      S    C  F G    + V I GN     F V YDI ++ V F P  C
Sbjct: 445 HASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGAC 500


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 110/344 (31%), Positives = 153/344 (44%), Gaps = 66/344 (19%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY+I +  GTP   +  V DTGSD+ W QC+PC   +CY Q  PLFDP +SSTY+++ 
Sbjct: 13  SGNYVITVGFGTPTRTQTVVFDTGSDVNWLQCKPCA-VRCYAQQEPLFDPSLSSTYRNVS 71

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C+   C  L+ + CS   C Y V YGDGS + G LA +T  L      A       FGCG
Sbjct: 72  CTEPACVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFML----TPAQKFKNFIFGCG 127

Query: 208 TNNGGLFNSKTTGIVGLG-GGDISLISQMRTTIA-------------------------- 240
            NN GLF   T G+VGLG     SL SQ+  ++                           
Sbjct: 128 QNNTGLFQG-TAGLVGLGRSSTYSLNSQVAPSLGNVFSYCLPSTSSATGYLNIGNPQNTP 186

Query: 241 ------------------------GNQRLGVSTP-----DIVIDSGTTLTFLPQGYNSNL 271
                                   G  RL +S+        +IDSGT +T LP    S L
Sbjct: 187 GYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQSVGTIIDSGTVITRLPPTAYSAL 246

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED 329
            + + + +    +A     L+ CY F+  + V  P + +HF G DV++  +  F   +  
Sbjct: 247 KTAVRAAMTQYTLAPAVTILDTCYDFSRTTSVVYPVIVLHFAGLDVRIPATGVFFVFNSS 306

Query: 330 IVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            VC  F G T+S  + I GN+ Q    V YD E + + F    C
Sbjct: 307 QVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 127/407 (31%), Positives = 175/407 (42%), Gaps = 92/407 (22%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F+VE + R   K P YN  +T YQ   + LT  +              S ASQ      +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  RI +GTP  E   V DTGSD+ W QCEPC  + CY Q  P+F+P  SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG 
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273

Query: 209 NNGGLFNS------------------KTTGI-------------------VGLGGGDISL 231
           +N GLF                    K T                     V LGGGD + 
Sbjct: 274 DNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATA 333

Query: 232 ISQMRTTIAGNQRLGVS----------TPD------------IVIDSGTTLTFL-PQGYN 268
                  I     +G+S           PD            +++D GT +T L  Q YN
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 325
           S   + +   +  +  +      + CY F+SLS  +VP V  HF G   + L   N+ + 
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIP 453

Query: 326 VSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           V +    C  F   ++S+ I GN+ Q    + YD+ +  +      C
Sbjct: 454 VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 119/382 (31%), Positives = 174/382 (45%), Gaps = 78/382 (20%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDL 114
           T ++  L   N ++++  S AS   + P  +    NY+ R+ +GTP    + V DTGS L
Sbjct: 102 TVTVASLYRANDDAAVDGSLAS-VPLTPGTSYGVGNYVTRMGLGTPAKPYIMVVDTGSSL 160

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQY 168
            W QC PC  S C+ Q  P+FDPK SS+Y ++ CS+ QC     A+LN  +CS  + C Y
Sbjct: 161 TWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCSTPQCNDLSTATLNPAACSSSDVCIY 219

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
             SYGD SFS G L+ +TV+ GS +     +P   +GCG +N GLF  ++ G++GL    
Sbjct: 220 QASYGDSSFSVGYLSKDTVSFGSNS-----VPNFYYGCGQDNEGLFG-RSAGLMGLARNK 273

Query: 229 ISLISQMR---------------------------------------------------T 237
           +SL+ Q+                                                     
Sbjct: 274 LSLLYQLAPTLGYSFSYCLPSSSSSGYLSIGSYNPGQYSYTPMVSSTLDDSLYFIKLSGM 333

Query: 238 TIAGNQRLGVSTPD-----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
           T+AG + L VS+ +      +IDSGT +T LP      L   ++  ++    AD    L+
Sbjct: 334 TVAG-KPLAVSSSEYSSLPTIIDSGTVITRLPTTVYDALSKAVAGAMKGTKRADAYSILD 392

Query: 293 LCYSFNSLS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQ 350
            C+   + S +VP V++ F G A +KLS  N  V V     C  F     S  I GN  Q
Sbjct: 393 TCFVGQASSLRVPAVSMAFSGGAALKLSAQNLLVDVDSSTTCLAF-APARSAAIIGNTQQ 451

Query: 351 TNFLVGYDIEQQTVSFKPTDCT 372
             F V YD++   + F    CT
Sbjct: 452 QTFSVVYDVKSNRIGFAAGGCT 473


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 127/427 (29%), Positives = 179/427 (41%), Gaps = 83/427 (19%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISS 76
            A T   S+ + HR    S   N   T        RL  A   S++ +L+       +S 
Sbjct: 54  RASTTKSSLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSE 113

Query: 77  SKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           SK+    ++      + NY++ + +GTP  +   + DTGSDL WTQC+PC  + CY Q  
Sbjct: 114 SKSTDLPAKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKE 172

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           P+F+P  S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G LA E  
Sbjct: 173 PIFNPSKSTSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKF 232

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT--------- 238
           TL ++        G+ FGCG NN GLF +   G++GLG   +S  SQ  T          
Sbjct: 233 TLTNSD----VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCL 287

Query: 239 -------------------------------------------IAGNQRLGV-----STP 250
                                                        G Q+L +     STP
Sbjct: 288 PSSASYTGHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP 347

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTI 308
             +IDSGT +T LP    + L S   + +   P       L+ C+  +    V  P+V  
Sbjct: 348 GALIDSGTVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAF 407

Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
            F  GA V+L     F       VC  F G ++  +  I+GN+ Q    V YD     V 
Sbjct: 408 SFSGGAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVG 467

Query: 366 FKPTDCT 372
           F P  C+
Sbjct: 468 FAPNGCS 474


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  152 bits (385), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 125/214 (58%), Gaps = 16/214 (7%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-I 85
           GG  V L H D+      + + +  Q L+ A  RS +R++     ++   + A   D+ +
Sbjct: 38  GGLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGVKAVAGGGDLQV 91

Query: 86  P---NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           P    N  +L+ ++IGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SST
Sbjct: 92  PVHAGNGEFLMDVAIGTPALSYAAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSST 149

Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y ++PCSS+ C+ L   +C S   C Y+ +YGD S + G LA+ET TLG    +   LPG
Sbjct: 150 YATVPCSSALCSDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTLGK---EKKKLPG 206

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + FGCG  N G   ++  G+VGLG G +SL+SQ+
Sbjct: 207 VAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 240



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 67/132 (50%), Gaps = 9/132 (6%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   + ++ M  A P  D +   L+LC+   +      Q
Sbjct: 322 TGGVIVDSGTSITYLELQGYRALKKAFVAQM--ALPTVDGSEIGLDLCFQGPAKGVDEVQ 379

Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           VP++ +HF G AD+ L   N+ V  S      +    +  + I GN  Q NF   YD+  
Sbjct: 380 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGLSIIGNFQQQNFQFVYDVAG 439

Query: 362 QTVSFKPTDCTK 373
            T+SF P  C K
Sbjct: 440 DTLSFAPVQCNK 451


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 90/239 (37%), Positives = 138/239 (57%), Gaps = 28/239 (11%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           +   +SC+ +L  L         + + G+ + L H DS              ++    T 
Sbjct: 8   LQALMSCLVLLTSLAV-------SASSGYRLALTHVDS--------------KIGLTKTE 46

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
            + R  H ++  ++S   A+   +      YL+ ++IGTPP   +A+ADTGSDL WTQC+
Sbjct: 47  LMRRAAHRSRLRALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQCQ 106

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS-LNQKSCSGVN--CQYSVSYGDGSF 177
           PC    C+ QD+P++DP  SST+  +PCSS+ C   L  ++CS  +  C+Y  SY DG++
Sbjct: 107 PC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPVLRSRNCSTPSSLCRYGYSYSDGAY 164

Query: 178 SNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S G L TET+TLGS+  GQAV++  + FGCGT+NGG  +  +TG VGLG G +SL++Q+
Sbjct: 165 SAGILGTETLTLGSSVPGQAVSVSDVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQL 222


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 125/420 (29%), Positives = 177/420 (42%), Gaps = 83/420 (19%)

Query: 30  SVELIHRDSPKSPFYNSSET-----PYQRLRDALTRSLN-RLNHFNQNSSISSSKA---- 79
           S+ + HR    S   N   T        RL  A   S++ +L+       +S SK+    
Sbjct: 33  SLHVTHRHGTCSRLNNGKATSPDHVEILRLDQARVNSIHSKLSKKLATDHVSESKSTDLP 92

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           ++      + NY++ + +GTP  +   + DTGSDL WTQC+PC  + CY Q  P+F+P  
Sbjct: 93  AKDGSTLGSGNYIVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRT-CYDQKEPIFNPSK 151

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S++Y ++ CSS+ C SL     N  SCS  NC Y + YGD SFS G LA E  TL ++  
Sbjct: 152 STSYYNVSCSSAACGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNSD- 210

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT---------------- 238
                 G+ FGCG NN GLF +   G++GLG   +S  SQ  T                 
Sbjct: 211 ---VFDGVYFGCGENNQGLF-TGVAGLLGLGRDKLSFPSQTATAYNKIFSYCLPSSASYT 266

Query: 239 ------------------------------------IAGNQRLGV-----STPDIVIDSG 257
                                                 G Q+L +     STP  +IDSG
Sbjct: 267 GHLTFGSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTPGALIDSG 326

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GAD 314
           T +T LP    + L S   + +   P       L+ C+  +    V  P+V   F  GA 
Sbjct: 327 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVTIPKVAFSFSGGAV 386

Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           V+L     F       VC  F G ++  +  I+GN+ Q    V YD     V F P  C+
Sbjct: 387 VELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPNGCS 446


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  152 bits (383), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 126/407 (30%), Positives = 175/407 (42%), Gaps = 92/407 (22%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F+VE + R   K P YN  +T YQ   + LT  +              S ASQ      +
Sbjct: 122 FAVEGVDRSDLK-PVYNE-DTRYQT--EDLTTPV-------------VSGASQG-----S 159

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  RI +GTP  +   V DTGSD+ W QCEPC  + CY Q  P+F+P  SSTYKSL C
Sbjct: 160 GEYFSRIGVGTPAKDMYLVLDTGSDVNWIQCEPC--ADCYQQSDPVFNPTSSSTYKSLTC 217

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG 
Sbjct: 218 SAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINNVALGCGH 273

Query: 209 NNGGLFNS------------------KTTGI-------------------VGLGGGDISL 231
           +N GLF                    K T                     V LGGGD + 
Sbjct: 274 DNEGLFTGAAGLLGLGGGVLSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGGGDATA 333

Query: 232 ISQMRTTIAGNQRLGVS----------TPD------------IVIDSGTTLTFL-PQGYN 268
                  I     +G+S           PD            +++D GT +T L  Q YN
Sbjct: 334 PLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQTQAYN 393

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVK 325
           S   + +   +  +  +      + CY F+SLS  +VP V  HF G   + L   N+ + 
Sbjct: 394 SLRDAFLKLTVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPAKNYLIP 453

Query: 326 VSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           V +    C  F   ++S+ I GN+ Q    + YD+ +  +      C
Sbjct: 454 VDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 126/425 (29%), Positives = 184/425 (43%), Gaps = 91/425 (21%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-----NQNSSISS--SKASQADII 85
           ++HR  P SP     + P     D L +   R++       N+ S++    S  ++  I 
Sbjct: 91  VMHRHGPCSPLQTPGDAPSDA--DLLDQDQARVDSILGMITNETSAVGPGVSLPAERGIS 148

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
               NY++ + +GTP  +   V DTGSDL W QC PC    CY Q  PLF P  SST+ +
Sbjct: 149 VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSA 208

Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVA-- 198
           + C + +C +  ++SC G      C Y V YGD S + G+L  +T+TLG+     A A  
Sbjct: 209 VRCGARECRA--RQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAEN 266

Query: 199 ---LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM-------------------- 235
              LPG  FGCG NN GLF  +  G+ GLG G +SL SQ                     
Sbjct: 267 DNKLPGFVFGCGENNTGLFG-QADGLFGLGRGKVSLSSQAAGKFGEGFSYCLPSSSSSAP 325

Query: 236 ----------------------RTT-------------IAGNQRLGVSTPDI----VIDS 256
                                 RTT             +AG + + VS+P +    ++DS
Sbjct: 326 GYLSLGTPVPAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAG-RAIRVSSPRVALPLIVDS 384

Query: 257 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHF 310
           GT +T L P+ Y +   + +S+M +      P  S L+ CY F    N+   +P V + F
Sbjct: 385 GTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVF 444

Query: 311 RGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G     V  S   +  KV++  +     G   S  I GN  Q    V YD+ +Q + F 
Sbjct: 445 AGGATISVDFSGVLYVAKVAQACLAFAPNGDGRSAGILGNTQQRTLAVVYDVARQKIGFA 504

Query: 368 PTDCT 372
              C+
Sbjct: 505 AKGCS 509


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  152 bits (383), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 115/348 (33%), Positives = 162/348 (46%), Gaps = 75/348 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +GTP T    V DTGS L W QC PC  S C+ Q  PL+DP+ SSTY ++PCS
Sbjct: 133 NYVTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLYDPRASSTYATVPCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G L+ +TV+ GS +      P   
Sbjct: 192 ASQCDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSFGSGS-----YPNFY 246

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------------------- 240
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++                        
Sbjct: 247 YGCGQDNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTGYLSIGPYTS 305

Query: 241 ---------------------------GNQRLGVSTPD-----IVIDSGTTLTFLPQG-Y 267
                                      G   L VS  +      +IDSGT +T LP   Y
Sbjct: 306 GHYSYTPMASSSLDASLYFVTLSGMSVGGSPLAVSPAEYSSLPTIIDSGTVITRLPTAVY 365

Query: 268 NSNLLSVMSSMIEAQPVADPTGS-LELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFV 324
            +   +V ++M+  Q  + P  S L+ C+    S  +VP V + F  GA +KL+  N  +
Sbjct: 366 TALSKAVAAAMVGVQ--SAPAFSILDTCFQGQASQLRVPAVAMAFAGGATLKLATQNVLI 423

Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            V +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 424 DVDDSTTCLAFA-PTDSTTIIGNTQQQTFSVVYDVAQSRIGFAAGGCS 470


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  151 bits (381), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 119/425 (28%), Positives = 190/425 (44%), Gaps = 75/425 (17%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH-FNQNSSISS 76
           +V    AQ      +LIH  S  SP++N + +  +R    +  S  R+ + + Q      
Sbjct: 23  IVEAYNAQPKQLVTKLIHWGSILSPYFNPNASVAERAERIVKTSATRIAYLYAQIKGDIH 82

Query: 77  SKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
               + +++P+     +L+  S+G P T +LA+ DTGS+++W +C PC   +C  Q+ PL
Sbjct: 83  MNDFELNLLPSTYEPLFLVNFSMGQPATPQLAIMDTGSNILWVRCAPC--KRCTQQNGPL 140

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            DP  SSTY SLPC+++ C       C+ +N C Y++SY  G  S G LATE +   S+ 
Sbjct: 141 LDPSKSSTYASLPCTNTMCHYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSD 200

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA-----------GN 242
               A+P + FGC   NG   + + TG+ GLG G  S +++M +  +           G 
Sbjct: 201 EGVNAVPSVVFGCSHENGDYKDRRFTGVFGLGKGITSFVTRMGSKFSYCLGNIADPHYGY 260

Query: 243 QRL---------GVSTP----------------------DI--------------VIDSG 257
            +L         G STP                      DI              +IDSG
Sbjct: 261 NQLVFGEKANFEGYSTPLKVVNGHYYVTLEGISVGEKRLDIDSTAFSMKGNEKSALIDSG 320

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFR-G 312
           T LT+L +     L + +  +++   +    GS   CY   ++SQ     P VT HF  G
Sbjct: 321 TALTWLAESAFRALDNEVRQLLDGVLMPFWRGSFA-CYK-GTVSQDLIGFPVVTFHFSGG 378

Query: 313 ADVKLSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           AD+ L   + F + + DI+C      S +     S  + G + Q  + + YD+    + F
Sbjct: 379 ADLDLDTESMFYQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFF 438

Query: 367 KPTDC 371
           +  DC
Sbjct: 439 QRIDC 443


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score =  150 bits (380), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 116/378 (30%), Positives = 173/378 (45%), Gaps = 65/378 (17%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAV 107
           P   L  A  +S  RL+        ++S ++Q  +  ++    Y +  SIGTPP E  A+
Sbjct: 39  PAINLTRAAHKSHQRLSMLAARLDDAASGSAQTPLQLDSGGGAYDMTFSIGTPPQELSAL 98

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVN 165
           ADTGSDLIW +C  C  ++C  Q SP + P  SS++  LPCS S C+ L    CS  G  
Sbjct: 99  ADTGSDLIWAKCGAC--TRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPSSQCSAGGAE 156

Query: 166 CQYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTN------------ 209
           C Y  SYG  S    ++ G L +ET TLGS      A+PGI FGC T             
Sbjct: 157 CDYKYSYGLASDPHHYTQGYLGSETFTLGSD-----AVPGIGFGCTTMSEGGYGSGSGLV 211

Query: 210 -------------NGGLFN-------SKTTGIV----GLGGGDISLISQMRTT------- 238
                        N G F+       +KT+ ++     L G  +     +RT+       
Sbjct: 212 GLGRGPLSLVSQLNVGAFSYCLTSDAAKTSPLLFGSGALTGAGVQSTPLLRTSTYYYTVN 271

Query: 239 -----IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
                I      G  +  I+ DSGTT+ FL +   +     + S      +A      E+
Sbjct: 272 LESISIGAATTAGTGSSGIIFDSGTTVAFLAEPAYTLAKEAVLSQTTNLTMASGRDGYEV 331

Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
           C+   S +  P + +HF G D+ L   N+F  V + + C + +  + S+ I GNIMQ N+
Sbjct: 332 CFQ-TSGAVFPSMVLHFDGGDMDLPTENYFGAVDDSVSCWIVQK-SPSLSIVGNIMQMNY 389

Query: 354 LVGYDIEQQTVSFKPTDC 371
            + YD+E+  +SF+P +C
Sbjct: 390 HIRYDVEKSMLSFQPANC 407


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  150 bits (380), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 154/350 (44%), Gaps = 74/350 (21%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTP  E   V DTGSD+ W QCEPC  S CY Q  P+F+P  SSTYKSL 
Sbjct: 159 SGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPC--SDCYQQSDPVFNPTSSSTYKSLT 216

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CS+ QC+ L   +C    C Y VSYGDGSF+ G LAT+TVT G++      +  +  GCG
Sbjct: 217 CSAPQCSLLETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSG----KINDVALGCG 272

Query: 208 TNNGGLFNS------------------KTTGI-------------------VGLGGGDIS 230
            +N GLF                    K T                     V LG GD +
Sbjct: 273 HDNEGLFTGAAGLLGLGGGALSITNQMKATSFSYCLVDRDSGKSSSLDFNSVQLGSGDAT 332

Query: 231 ---LISQMRTTI---------AGNQRLGVSTPD------------IVIDSGTTLTFL-PQ 265
              L +Q   T           G Q+  V  PD            +++D GT +T L  Q
Sbjct: 333 APLLRNQKIDTFYYVGLSGFSVGGQK--VMMPDAIFDVDASGSGGVILDCGTAVTRLQTQ 390

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNF 322
            YNS   + +      +         + CY F+SLS  +VP V  HF G   + L   N+
Sbjct: 391 AYNSLRDAFLKLTTNLKKGTSSISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLDLPAKNY 450

Query: 323 FVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            + V ++   C  F   ++S+ I GN+ Q    + YD+  + +      C
Sbjct: 451 LIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 118/386 (30%), Positives = 170/386 (44%), Gaps = 77/386 (19%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVADTG 111
           RD L     R  H + NSS +         +P       Y + + +GTP  +   + DTG
Sbjct: 94  RDQLRVKSIRAKH-SMNSSTTGVFNEMKTRVPTTHFGGGYAVTVGLGTPKKDFSLLFDTG 152

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV----NCQ 167
           SDL WTQCEPC    C+ Q+   FDP  S++YK+L CSS  C S+ ++S  G     +C 
Sbjct: 153 SDLTWTQCEPC-SGGCFPQNDEKFDPTKSTSYKNLSCSSEPCKSIGKESAQGCSSSNSCL 211

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y V YG G ++ G LATET+T+  +            GCG  NGG F S T G++GLG  
Sbjct: 212 YGVKYGTG-YTVGFLATETLTITPSD----VFENFVIGCGERNGGRF-SGTAGLLGLGRS 265

Query: 228 DISLISQMRTTI------------------------------------------------ 239
            ++L SQ  +T                                                 
Sbjct: 266 PVALPSQTSSTYKNLFSYCLPASSSSTGHLSFGGGVSQAAKFTPITSKIPELYGLDVSGI 325

Query: 240 -AGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
             G ++L +      T   +IDSGTTLT+LP   +S L S    M+    +   T  L+ 
Sbjct: 326 SVGGRKLPIDPSVFRTAGTIIDSGTTLTYLPSTAHSALSSAFQEMMTNYTLTKGTSGLQP 385

Query: 294 CYSFNSLSQ----VPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK--GITNSVPIY 345
           CY F+  +     +P+++I F G  +V +  S  F+  +  + VC  FK  G    V I+
Sbjct: 386 CYDFSKHANDNITIPQISIFFEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIF 445

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           GN+ Q  + V YD+ +  V F P  C
Sbjct: 446 GNVQQKTYEVVYDVAKGMVGFAPGGC 471


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  150 bits (379), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 125/423 (29%), Positives = 172/423 (40%), Gaps = 84/423 (19%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           EA   G  + L H     SP    + + +   +  +  R  +RLN     ++ + S  S 
Sbjct: 65  EALKPGVKIRLDHIHGACSPLRPINSSSWIDMVSQSFDRDNDRLNTIWSKNNGTYSTMSN 124

Query: 82  ADIIPNN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
             + P +     NY++    GTP    L + DTGSD+ W QC+PC  S CY Q  P+F+P
Sbjct: 125 LPLQPGSKVGTGNYIVTAGFGTPAKNSLLIIDTGSDVTWIQCKPC--SDCYSQVDPIFEP 182

Query: 138 KMSSTYKSLPCSSSQCASL-NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           + SS+YK L C SS C  L     C    C Y ++YGDGS S G+ + ET+TLGS +   
Sbjct: 183 QQSSSYKHLSCLSSACTELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTLGSDS--- 239

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDS 256
              P   FGCG  N GLF   + G++GLG   +S  SQ ++   G  +     PD V  +
Sbjct: 240 --FPSFAFGCGHTNTGLFKG-SAGLLGLGRTALSFPSQTKSKYGG--QFSYCLPDFVSST 294

Query: 257 GT------------TLTFLPQGYNSN---------------------------------- 270
            T            T TF+P   NSN                                  
Sbjct: 295 STGSFSVGQGSIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPPAVLGRGGTIVD 354

Query: 271 ---------------LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-G 312
                          L +   S     P A P   L+ CY  +S SQV  P +T HF+  
Sbjct: 355 SGTVITRLVPQAYDALKTSFRSKTRNLPSAKPFSILDTCYDLSSYSQVRIPTITFHFQNN 414

Query: 313 ADVKLSRSNFFVKVSED--IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKP 368
           ADV +S       +  D   VC  F   + S+   I GN  Q    V +D     + F P
Sbjct: 415 ADVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAP 474

Query: 369 TDC 371
             C
Sbjct: 475 GSC 477


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 105/354 (29%), Positives = 154/354 (43%), Gaps = 78/354 (22%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLF-------------------------------------NSKTTGIVGLGGGD 228
           CG +N GLF                                     +SK + +V  G   
Sbjct: 252 CGHDNEGLFVGAAGLLGLGKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSSVV-FGNAA 310

Query: 229 ISLISQ------------------MRTTIAGNQRLGVSTPDIVID----------SGTTL 260
           +S I++                  +  ++ G +  GV+     +D          SGT++
Sbjct: 311 VSRIARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTSV 370

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLS 318
           T L +     +        +A   A      + C+  +++++  VP V +HFRGADV L 
Sbjct: 371 TRLIRPAYIAMRDAFRVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRGADVSLP 430

Query: 319 RSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P  C
Sbjct: 431 ATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score =  150 bits (379), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/350 (32%), Positives = 154/350 (44%), Gaps = 71/350 (20%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANI 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L+ + CSG NC Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD- 251
           G  N GLF  +  G++GLG G  SL  Q               R++  G    G  +P  
Sbjct: 291 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 349

Query: 252 --------IVIDSG-----------------------------------TTLTFLPQGYN 268
                   ++ D+G                                   T +T LP    
Sbjct: 350 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFTTAGTIVDSGTVITRLPPAAY 409

Query: 269 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFF 323
           S+L S  +S + A+    A     L+ CY F  +SQV  P V++ F+ GA + +  S   
Sbjct: 410 SSLRSAFASAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIM 469

Query: 324 VKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              S   VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 470 YAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 116/414 (28%), Positives = 176/414 (42%), Gaps = 87/414 (21%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADIIPNN 88
           S++++H+  P S   N        L + L    +R++  +   S  S  K + A  +P  
Sbjct: 66  SLKVVHKHGPCSQL-NQQNGNAPNLVEILLEDQSRVDSIHAKLSDHSGVKETDAAKLPTK 124

Query: 89  A-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           +       NY++ I +G+P  + + + DTGSDL W +C            +  FDP  S+
Sbjct: 125 SGMSLGTGNYIVSIGLGSPKKDLMLIFDTGSDLTWARCSA----------AETFDPTKST 174

Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +Y ++ CS+  C+S+     N   C+   C Y + YGDGS+S G L  E +T+GST    
Sbjct: 175 SYANVSCSTPLCSSVISATGNPSRCAASTCVYGIQYGDGSYSIGFLGKERLTIGSTD--- 231

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ---------------------- 234
                  FGCG +  GLF  K  G++GLG   +S++SQ                      
Sbjct: 232 -IFNNFYFGCGQDVDGLFG-KAAGLLGLGRDKLSVVSQTAPKYNQLFSYCLPSSSSTGFL 289

Query: 235 --------------------------MRTTIAGNQRLGV-----STPDIVIDSGTTLTFL 263
                                     +     G Q+L +     ST   +IDSGT +T L
Sbjct: 290 SFGSSQSKSAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTAGTIIDSGTVVTRL 349

Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKLSRS 320
           P    S L S     + + P+  P   L+ CY F+     +VP++ I F G  DV + ++
Sbjct: 350 PPAAYSALRSAFRKAMASYPMGKPLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDVDQA 409

Query: 321 NFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             FV      VC  F G T +    I+GN  Q NF V YD+    V F P  C+
Sbjct: 410 GIFVANGLKQVCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 114/420 (27%), Positives = 175/420 (41%), Gaps = 82/420 (19%)

Query: 27  GGFSVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-- 81
           G + ++L+HRD   +     Y+ S   + R++    R    +   +   + SS    +  
Sbjct: 69  GKWKLKLVHRDKITAFNKSSYDHSHNFHARIQRDKKRVATLIRRLSPRDATSSYSVEEFG 128

Query: 82  ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           A+++      +  Y IRI +G+PP E+  V D+GSD++W QC+PC  +QCY Q  P+FDP
Sbjct: 129 AEVVSGMNQGSGEYFIRIGVGSPPREQYVVIDSGSDIVWVQCQPC--TQCYHQTDPVFDP 186

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             S+++  +PCSSS C  +    C    C+Y V YGDGS++ G LA ET+T G T  + V
Sbjct: 187 ADSASFMGVPCSSSVCERIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTFGRTVVRNV 246

Query: 198 ALPGITFGCGTNN----------------------------GGLFN----SKTT------ 219
           A+     GCG  N                            GG F+    S+ T      
Sbjct: 247 AI-----GCGHRNRGMFVGAAGLLGLGGGSMSLVGQLGGQTGGAFSYCLVSRGTDSAGSL 301

Query: 220 ----GIVGLGGGDISLISQMRT------TIAGNQRLGVSTP--------------DIVID 255
               G + +G   I LI   R        ++G    G+  P               +V+D
Sbjct: 302 EFGRGAMPVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMD 361

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA 313
           +GT +T +P                  P A      + CY+ N     +VP V+ +F G 
Sbjct: 362 TGTAVTRIPTVAYVAFRDAFIGQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAGG 421

Query: 314 DV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            +  L   NF + V +    C  F    + + I GNI Q    + +D     V F P  C
Sbjct: 422 PILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNVC 481


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 130/434 (29%), Positives = 195/434 (44%), Gaps = 99/434 (22%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN---SSISSSKASQADI 84
           GFSVE IHRDS +SPF++ S T   R+ +A  RS  R    +++       S+    +++
Sbjct: 34  GFSVEFIHRDSARSPFHDPSLTAPARVLEAARRSTVRAAALSRSYVRVDAPSADGFVSEL 93

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP----------- 133
                 YL+ ++IGTPPT  +A+ADTGSDLIW  C        Y  D P           
Sbjct: 94  TSTPFEYLMAVNIGTPPTRMVAIADTGSDLIWLNCS-------YGGDGPGLAAARDADAQ 146

Query: 134 ----LFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  S+T++ + C S  C+ L + SC +   C+YS SYGDGS ++G L+TET T
Sbjct: 147 PPGVQFDPSKSTTFRLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFT 206

Query: 189 LGSTTGQ-----AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
                G         +  + FGC T   G  +S   G+VGLGGGD+SL+SQ+    +  +
Sbjct: 207 FADAPGARGDGTTTRVANVNFGCSTTFVG--SSVGDGLVGLGGGDLSLVSQLGADTSLGR 264

Query: 244 RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--------------------- 282
           R         + + + L F P+   ++  +V + +I +Q                     
Sbjct: 265 RFSYCLVPYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTFEA 324

Query: 283 -------------------PVADP-----TGSLEL------------CYSFNSLSQ---- 302
                               + DP     TG ++L            C+  + + +    
Sbjct: 325 PDRSPLIVDSGTTLTFLPEALVDPLVKELTGRIKLPPAQSPERLLPLCFDVSGVREGQVA 384

Query: 303 --VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGY 357
             +P+VT+    GA V L   N FV+V E  +C     ++   P  I GNI Q N  VGY
Sbjct: 385 AMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQQNMHVGY 444

Query: 358 DIEQQTVSFKPTDC 371
           D+++ TV+F P  C
Sbjct: 445 DLDKGTVTFAPAAC 458


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 131/210 (62%), Gaps = 15/210 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS-QADIIP 86
           GF + L H DS K+       T +QR++  + R+ +RL   N     +SS A   + ++ 
Sbjct: 42  GFRITLKHVDSDKN------LTKFQRIQHGIKRANHRLERLNAMVLAASSNAEINSPVLS 95

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L+ ++IGTPP    A+ DTGSDLIWTQC+PC  +QC+ Q SP+FDPK SS++  L
Sbjct: 96  GNGEFLMNLAIGTPPETYSAIMDTGSDLIWTQCKPC--TQCFDQPSPIFDPKKSSSFSKL 153

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSS  C +L Q SCS  +C+Y  +YGD S + G +ATET T G      V++P + FGC
Sbjct: 154 SCSSQLCKALPQSSCSD-SCEYLYTYGDYSSTQGTMATETFTFGK-----VSIPNVGFGC 207

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           G +N G   ++ +G+VGLG G +SL+SQ++
Sbjct: 208 GEDNEGDGFTQGSGLVGLGRGPLSLVSQLK 237



 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 46/128 (35%), Positives = 77/128 (60%), Gaps = 7/128 (5%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QV 303
           T  ++IDSGTT+T+L +     +    +S +   PV D +G+  LELCY+  S +   +V
Sbjct: 314 TGGLIIDSGTTITYLEESAFDLVKKEFTSQM-GLPV-DNSGATGLELCYNLPSDTSELEV 371

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           P++ +HF GAD++L   N+ +  S   V  +  G +  + I+GN+ Q N  V +D+E++T
Sbjct: 372 PKLVLHFTGADLELPGENYMIADSSMGVICLAMGSSGGMSIFGNVQQQNMFVSHDLEKET 431

Query: 364 VSFKPTDC 371
           +SF PT+C
Sbjct: 432 LSFLPTNC 439


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/416 (29%), Positives = 177/416 (42%), Gaps = 90/416 (21%)

Query: 30  SVELIHRDSPKSPFYNSSETPY--QRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
           SV L+HR  P +P   SS+ P   +RLR +  RS   ++  ++ N SI +      D + 
Sbjct: 60  SVPLVHRHGPCAPSTRSSDEPSLSERLRRSRARSKYIMSRASKSNVSIPTHLGGSVDSL- 118

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y++ + +GTP   ++ + DTGSDL W QC PC  + CY Q  PLFDP  SSTY  +
Sbjct: 119 ---EYVVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPI 175

Query: 147 PCSSSQCASLNQK---------SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           PC++  C  L +          S  G  C Y+++YGDGS + G  + ET+T+       V
Sbjct: 176 PCNTDACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLTMAP----GV 231

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG---------NQRLG-- 246
            +    FGCG +  G  N K  G++GLGG   SL+ Q  +   G         N + G  
Sbjct: 232 TVKDFHFGCGHDQDGP-NDKYDGLLGLGGAPESLVVQTSSVYGGAFSYCLPAANDQAGFL 290

Query: 247 ------------VSTPDI--------------------------------VIDSGTTLTF 262
                       V TP +                                +IDSGT +T 
Sbjct: 291 ALGAPVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAFSGGMIIDSGTVVTE 350

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRS 320
           L     + L +     + A P+  P G L+ CY+F   S   VP V + F G       +
Sbjct: 351 LQHTAYAALQAAFRKAMAAYPLL-PNGELDTCYNFTGHSNVTVPRVALTFSGG------A 403

Query: 321 NFFVKVSEDIV---CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              + V + I+   C  F+  G  N   I GN+ Q    V YD+    V F    C
Sbjct: 404 TVDLDVPDGILLDNCLAFQEAGPDNQPGILGNVNQRTLEVLYDVGHGRVGFGADAC 459


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 157/355 (44%), Gaps = 80/355 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            + NY +++ +G+P      + DTGS L W QC+PC    C++Q  PLFDP  S TYKSL
Sbjct: 9   GSGNYYVKVGLGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSL 67

Query: 147 PCSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
            C+SSQC     A+LN   C  S   C Y+ SYGD S+S G L+ + +TL  +      L
Sbjct: 68  SCTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TL 123

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA------------------- 240
           PG  +GCG ++ GLF  +  GI+GLG   +S++ Q+ +                      
Sbjct: 124 PGFVYGCGQDSEGLFG-RAAGILGLGRNKLSMLGQVSSKFGYAFSYCLPTRGGGGFLSIG 182

Query: 241 ---------------------------------GNQRLGVSTPDI----VIDSGTTLTFL 263
                                            G + LGV+        +IDSGT +T L
Sbjct: 183 KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPTIIDSGTVITRL 242

Query: 264 PQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEVTIHFR-GADVK 316
           P      +    + +MSS     P       L+ C+  N   +  VPEV + F+ GAD+ 
Sbjct: 243 PMSVYTPFQQAFVKIMSSKYARAPGFS---ILDTCFKGNLKDMQSVPEVRLIFQGGADLN 299

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   N  ++V E + C  F G  N V I GN  Q  F V +DI    + F    C
Sbjct: 300 LRPVNVLLQVDEGLTCLAFAG-NNGVAIIGNHQQQTFKVAHDISTARIGFATGGC 353


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 19/209 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G+ + L H DS          T  + +R A+ RS  R        ++S   A+   +   
Sbjct: 22  GYRLVLTHVDS------KGGYTKTELMRRAVHRSRLR--------ALSGYDATSPRLHSV 67

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              YL+ ++IG PP   +A+ADTGSDL WTQC+PC    C+ QD+P++DP  SST+  LP
Sbjct: 68  QVEYLMELAIGKPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPVYDPSASSTFSPLP 125

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           CSS+ C  +  ++C+  + C+Y  +YGDG++S G L TET+TLG ++   V++ G+ FGC
Sbjct: 126 CSSATCLPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLTLGPSSA-PVSVGGVAFGC 184

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           GT+NGG  +  +TG VGLG G +SL++Q+
Sbjct: 185 GTDNGG-DSLNSTGTVGLGRGTLSLLAQL 212



 Score = 52.4 bits (124), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/133 (29%), Positives = 64/133 (48%), Gaps = 6/133 (4%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQV 303
           G  T  +++DSGTT T L +     ++  ++ ++  QP  + +     C+         +
Sbjct: 288 GDGTGGMIVDSGTTFTILAESGFREVVGRVARVL-GQPPVNASSLDAPCFPAPAGEPPYM 346

Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSEDI-VCSVFKGIT-NSVPIYGNIMQTNFLVGYDIE 360
           P++ +HF G AD++L R N+     ED   C    G T  S  + GN  Q N  + +D  
Sbjct: 347 PDLVLHFAGGADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTT 406

Query: 361 QQTVSFKPTDCTK 373
              +SF PTDC+K
Sbjct: 407 VGQLSFLPTDCSK 419


>gi|449467979|ref|XP_004151699.1| PREDICTED: probable aspartic protease At2g35615-like, partial
           [Cucumis sativus]
          Length = 209

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 118/207 (57%), Gaps = 10/207 (4%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF 68
            IL  + F   + I    G F+  L HRDS  SP   SS + Y RL +A  RSL+R    
Sbjct: 11  LILLLISFSQTTIINGDNG-FTTSLFHRDSLLSPLEFSSLSHYDRLTNAFRRSLSRSATL 69

Query: 69  NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
              ++ + +   QA + P +  YL+ +SIGTPP + + +ADTGSDL+W QC PC   +CY
Sbjct: 70  LNRAATNGALDLQAPLTPGSGEYLMSVSIGTPPVDYIGMADTGSDLMWAQCLPCL--KCY 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETV 187
            Q  P+FDP  S+++  +PC+S  C +++   C     C YS +YGD +++ G+L  E +
Sbjct: 128 KQSRPIFDPLKSTSFSHVPCNSQNCKAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKI 187

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T+GS++ ++V       GCG  +GG F
Sbjct: 188 TIGSSSVKSV------IGCGHESGGGF 208


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 85/214 (39%), Positives = 125/214 (58%), Gaps = 16/214 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK------ASQ 81
           GF ++L H D+       +S T  Q L  A+ RS  R+    Q++++S +       A++
Sbjct: 27  GFQLKLTHVDA------GTSYTKPQLLSRAIARSKARVAAL-QSAAVSPAPVADPITAAR 79

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
             +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+
Sbjct: 80  VLVTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCAAQPTPYFDVKRSA 137

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY++LPC SS+CA+L+  SC    C Y   YGD + + G LA ET T G+ +   V    
Sbjct: 138 TYRALPCRSSRCAALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAAN 197

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           I+FGCG+ N G   + ++G+VG G G +SL+SQ+
Sbjct: 198 ISFGCGSLNAGEL-ANSSGMVGFGRGPLSLVSQL 230



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 57/124 (45%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
           ++IDSGT++T+L Q     +   ++S I    + D    L+ C+ +    N    VP+  
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLASTIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFV 374

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            HF GA++ L   N+ +  S      +    T+   I GN  Q N  + YDI    +SF 
Sbjct: 375 FHFDGANMTLPPENYMLIASTTGYLCLAMAPTSVGTIIGNYQQQNLHLLYDIANSFLSFV 434

Query: 368 PTDC 371
           P  C
Sbjct: 435 PAPC 438


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score =  148 bits (373), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 113/342 (33%), Positives = 154/342 (45%), Gaps = 73/342 (21%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC- 153
           + +GTP T+ + V DTGS L W QC PC  S C+ Q  P+F+PK SSTY S+ CS+ QC 
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-CHRQSGPVFNPKSSSTYASVGCSAQQCS 59

Query: 154 ----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
               A+LN  +CS  N C Y  SYGD SFS G L+ +TV+ GST+     LP   +GCG 
Sbjct: 60  DLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----LPNFYYGCGQ 114

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------------- 236
           +N GLF  ++ G++GL    +SL+ Q+                                 
Sbjct: 115 DNEGLFG-RSAGLIGLARNKLSLLYQLAPSLGYSFTYCLPSSSSSGYLSLGSYNPGQYSY 173

Query: 237 -------------------TTIAGN-----QRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 272
                               T+AGN          S P I+ DSGT +T LP    S L 
Sbjct: 174 TPMVSSSLDDSLYFIKLSGMTVAGNPLSVSSSAYSSLPTII-DSGTVITRLPTSVYSALS 232

Query: 273 SVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDI 330
             +++ ++    A     L+ C+    S    P VT+ F  GA +KLS  N  V V +  
Sbjct: 233 KAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKLSAQNLLVDVDDST 292

Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 293 TCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 75/187 (40%), Positives = 113/187 (60%), Gaps = 10/187 (5%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
           T Y+ ++ A+ R   R+   N  + + SS   +  +   +  YL+ ++IGTP +   A+ 
Sbjct: 56  TKYELIKRAIKRGERRMRSIN--AMLQSSSGIETPVYAGSGEYLMNVAIGTPASSLSAIM 113

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDLIWTQCEPC  +QC+ Q +P+F+P+ SS++ +LPC S  C  L  +SC   +CQY
Sbjct: 114 DTGSDLIWTQCEPC--TQCFSQPTPIFNPQDSSSFSTLPCESQYCQDLPSESCYN-DCQY 170

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
           +  YGDGS + G +ATET T      +  ++P I FGCG +N G       G++G+G G 
Sbjct: 171 TYGYGDGSSTQGYMATETFTF-----ETSSVPNIAFGCGEDNQGFGQGNGAGLIGMGWGP 225

Query: 229 ISLISQM 235
           +SL SQ+
Sbjct: 226 LSLPSQL 232



 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 42/127 (33%), Positives = 65/127 (51%), Gaps = 4/127 (3%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPE 305
           T  ++IDSGTTLT+LPQ   + +    +  I   PV + +  L  C+      S  QVPE
Sbjct: 308 TGGMIIDSGTTLTYLPQDAYNAVAQAFTDQINLSPVDESSSGLSTCFQLPSDGSTVQVPE 367

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           +++ F G  + L   N  +  +E ++C ++       + I+GNI Q    V YD++   V
Sbjct: 368 ISMQFDGGVLNLGEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAV 427

Query: 365 SFKPTDC 371
           SF PT C
Sbjct: 428 SFVPTQC 434


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score =  147 bits (372), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 132/412 (32%), Positives = 184/412 (44%), Gaps = 75/412 (18%)

Query: 28  GFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G +V L HR  P SP  +    T  +RLR    R+      F+    I  S A+      
Sbjct: 54  GVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDIEQSDAATVPTTL 113

Query: 87  NNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
             +     Y+I + IG+P   +    DTGSD+ W QC+PC  SQC+ +   LFDP  SST
Sbjct: 114 GTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSEVDSLFDPSSSST 171

Query: 143 YKSLPCSSSQCASLNQK----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           Y    CSS+ CA L+Q      C    CQY V+YGD S + G  +++T+TLGS+     A
Sbjct: 172 YSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTLGSS-----A 226

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA------------------ 240
           +    FGC  +  G FN +T G++GLGGG  SL SQ   T                    
Sbjct: 227 MTDFQFGCSQSESGGFNDQTDGLMGLGGGAQSLASQTAGTFGTAFSYCLPPTSGSSGFLT 286

Query: 241 --------------------------------GNQRLGVST----PDIVIDSGTTLTFLP 264
                                           G+Q+L + T       ++DSGT +T LP
Sbjct: 287 LGTGSSGFVKTPMLRSTQIPTYYVVLLESIKVGSQQLNLPTSVFSAGSLMDSGTIITRLP 346

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSN 321
               S L S   + ++  P A P+G L+ C+ F+  S   +P VT+ F  GA V L+   
Sbjct: 347 PTAYSALSSAFKAGMQQYPPATPSGILDTCFDFSGQSSISIPTVTLVFSGGAAVDLAFDG 406

Query: 322 FFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             +++S  I C  F   G  +S+ I GN+ Q  F V YD+    V FK   C
Sbjct: 407 IMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 458


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 172/356 (48%), Gaps = 79/356 (22%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + +SIG P  +  A+ DTGSDLIWTQC+PC  ++C+ Q +P+FDP+ SS+Y  + CSS  
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPC--TECFDQPTPIFDPEKSSSYSKVGCSSGL 58

Query: 153 CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-------GSTTGQAVALPGIT 203
           C +L + +C+     C+Y  +YGD S + G LATET T        G   G  V   G  
Sbjct: 59  CNALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTFEDENSISGIGFGCGVENEGDG 118

Query: 204 FGCGTNNGGL---------------------------------FNSKTTGIVGLGGGDI- 229
           F  G+   GL                                   S  +GIV   G  + 
Sbjct: 119 FSQGSGLVGLGRGPLSLISQLKETKFSYCLTSIEDSEASSSLFIGSLASGIVNKTGASLD 178

Query: 230 -------SLIS----------QMRTTIAGNQRLGVS----------TPDIVIDSGTTLTF 262
                  SL+           +++    G +RL V           T  ++IDSGTT+T+
Sbjct: 179 GEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTGGMIIDSGTTITY 238

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---VPEVTIHFRGADVKL 317
           L +     L    +S + + PV D +GS  L+LC+     ++   VP++  HF+GAD++L
Sbjct: 239 LEETAFKVLKEEFTSRM-SLPV-DDSGSTGLDLCFKLPDAAKNIAVPKMIFHFKGADLEL 296

Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
              N+ V  S   V  +  G +N + I+GN+ Q NF V +D+E++TVSF PT+C K
Sbjct: 297 PGENYMVADSSTGVLCLAMGSSNGMSIFGNVQQQNFNVLHDLEKETVSFVPTECGK 352


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  147 bits (371), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 121/427 (28%), Positives = 177/427 (41%), Gaps = 93/427 (21%)

Query: 30  SVELIHRDSP---------------KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI 74
           S+E++H+  P                +   N      + ++  L+++L R N   +  S 
Sbjct: 62  SLEVVHKHGPCSQLNHNGKAKTTISHTDIMNLDNERVKYIQSRLSKNLGRENSVKELDST 121

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +    S + I   +ANY + + +GTP  +   V DTGSDL WTQCEPC  S CY Q   +
Sbjct: 122 TLPAKSGSLI--GSANYFVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGS-CYKQQDAI 178

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK------SCSGVNCQYSVSYGDGSFSNGNLATETVT 188
           FDP  SS+Y ++ C+SS C  L         S S   C Y + YGD S S G L+ E +T
Sbjct: 179 FDPSKSSSYINITCTSSLCTQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLT 238

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------ 236
           + +T      +    FGCG +N GLF S + G++GLG   IS + Q              
Sbjct: 239 ITATD----IVDDFLFGCGQDNEGLF-SGSAGLIGLGRHPISFVQQTSSIYNKIFSYCLP 293

Query: 237 --------------------------TTIAGNQRL-----------GVSTPDI------- 252
                                     +TI+G+              G   P +       
Sbjct: 294 STSSSLGHLTFGASAATNANLKYTPLSTISGDNTFYGLDIVGISVGGTKLPAVSSSTFSA 353

Query: 253 ---VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVT 307
              +IDSGT +T L     + L S     +E  PVA+  G  + CY F+   +  VP++ 
Sbjct: 354 GGSIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGLFDTCYDFSGYKEISVPKID 413

Query: 308 IHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
             F G   V+L      +  S   VC  F   G  N + I+GN+ Q    V YD+E   +
Sbjct: 414 FEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGRI 473

Query: 365 SFKPTDC 371
            F    C
Sbjct: 474 GFGAAGC 480


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/429 (26%), Positives = 168/429 (39%), Gaps = 100/429 (23%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP----- 86
            ++HRD+     +  + T  + L+  L R   R    ++ +        +    P     
Sbjct: 68  RVVHRDT-----FAVNATAGELLKHRLQRDKRRAARISEAAGAGGGNGRKGVAAPVVSGL 122

Query: 87  --NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              +  Y  +I +GTP T+ L V DTGSD++W QC PC   +CY Q  P+FDP+ SS+Y 
Sbjct: 123 AQGSGEYFTKIGVGTPATQALMVLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYG 180

Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           ++ C ++ C  L+   C      C Y V+YGDGS + G+  TET+T     G  VA   +
Sbjct: 181 AVGCGAALCRRLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG--GARVAR--V 236

Query: 203 TFGCGTNNGGLF--------------------------------------------NSKT 218
             GCG +N GLF                                             S  
Sbjct: 237 ALGCGHDNEGLFVAAAGLLGLGRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHR 296

Query: 219 TGIVGLGGGDISLISQMRTTIAGNQRL-------------------GVSTPDIVID---- 255
           +  V  G G +   S   T +  N R+                   GV+  D+ +D    
Sbjct: 297 SSTVSFGAGSVGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTG 356

Query: 256 -------SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVP 304
                  SGT++T L +   S L     +         P G    + CY      + +VP
Sbjct: 357 RGGVIVDSGTSVTRLARASYSALRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVP 416

Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
            V++HF  GA+  L   N+ + V S    C  F G    V I GNI Q  F V +D + Q
Sbjct: 417 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 476

Query: 363 TVSFKPTDC 371
            V F P  C
Sbjct: 477 RVGFAPKGC 485


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/349 (31%), Positives = 146/349 (41%), Gaps = 69/349 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 290

Query: 207 GTNNGGLF----------------------------------NSKTTGIVGLGGGDISLI 232
           G  N GLF                                   S  TG +  G G ++  
Sbjct: 291 GERNEGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAA 350

Query: 233 SQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNS 269
               TT                    G Q L +     +T   ++DSGT +T LP    S
Sbjct: 351 RARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYS 410

Query: 270 NL--LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFV 324
           +L      +        A     L+ CY F  +SQV  P V++ F+G A + +  S    
Sbjct: 411 SLRYAFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMY 470

Query: 325 KVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             S   VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 471 AASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 83/217 (38%), Positives = 120/217 (55%), Gaps = 23/217 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK-----ASQADI 84
           S  L+ RD+     Y S       + D + R   R  +     S ++ +      S++ +
Sbjct: 60  SFALVRRDAVTGSTYPSRR---HAVLDLVARDNARAEYLASRLSPAAYQPTGFSGSESKV 116

Query: 85  I----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +R+ IG+PPTE+  V D+GSD+IW QC+PC   +CY Q  PLFDP  S
Sbjct: 117 VSGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPATS 174

Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           +T+ ++PC S+ C +L    C  SG  C Y VSYGDGS++ G LA ET+TLG T     A
Sbjct: 175 ATFSAVPCGSAVCRTLRTSGCGDSG-GCDYEVSYGDGSYTKGALALETLTLGGT-----A 228

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + G+  GCG  N GLF     G++GLG G +SL+ Q+
Sbjct: 229 VEGVAIGCGHRNRGLFVG-AAGLLGLGWGPMSLVGQL 264


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 115/355 (32%), Positives = 159/355 (44%), Gaps = 83/355 (23%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +G+PP     + DTGS L W QC+PC    C+ Q  PLF+P  S+TY+ L 
Sbjct: 117 SGNYYLKLGLGSPPKYYTMILDTGSSLSWLQCKPCV-VYCHSQVDPLFEPSASNTYRPLY 175

Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CSSS+C     A+LN   C  SGV C Y+ SYGD S+S G L+ + +TL  T  Q   LP
Sbjct: 176 CSSSECSLLKAATLNDPLCTASGV-CVYTASYGDASYSMGYLSRDLLTL--TPSQT--LP 230

Query: 201 GITFGCGTNNGGLFNSKTTGIVGL-----------------------------GGGDISL 231
             T+GCG +N GLF  K  GIVGL                             GGG +S+
Sbjct: 231 SFTYGCGQDNEGLFG-KAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSGGGFLSI 289

Query: 232 ------------------------ISQMRTTIAGN----QRLGVSTPDIVIDSGTTLTFL 263
                                   +     T+AG        G   P I IDSGT +T L
Sbjct: 290 GKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPTI-IDSGTVVTRL 348

Query: 264 P----QGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFRG-ADVK 316
           P           + +MS   E  P       L+ C+  S  S+S  PE+ + F+G AD+ 
Sbjct: 349 PISIYAALREAFVKIMSRRYEQAPAYS---ILDTCFKGSLKSMSGAPEIRMIFQGGADLS 405

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   N  ++  + I C  F   +N + I GN  Q  + + YD+    + F P  C
Sbjct: 406 LRAPNILIEADKGIACLAFAS-SNQIAIIGNHQQQTYNIAYDVSASKIGFAPGGC 459


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 90/222 (40%), Positives = 125/222 (56%), Gaps = 20/222 (9%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           P      G  V L H D+      + + T  Q LR A  RS +R++     ++  S KA+
Sbjct: 49  PAAGLLDGLRVPLTHVDA------HGNYTKLQLLRRAARRSHHRMSRLVARTATGSVKAA 102

Query: 81  -----QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
                Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC   +C+ Q +P+F
Sbjct: 103 AAPDLQVPVHAGNGEFLMDMSIGTPALAYAAIVDTGSDLVWTQCKPC--VECFNQSTPVF 160

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           DP  SSTY +LPCSSS C+ L   +C+    +C Y+ +YGD S + G LA ET TL  T 
Sbjct: 161 DPSSSSTYSTLPCSSSLCSDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTLAKTK 220

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                LPG+ FGCG  N G   ++  G+VGLG G +SL+SQ+
Sbjct: 221 -----LPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 257



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 44/133 (33%), Positives = 69/133 (51%), Gaps = 11/133 (8%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY     +  + M    PVAD +   L+LC+   +      +
Sbjct: 338 TGGVIVDSGTSITYLELQGYRPLKKAFAAQM--KLPVADGSAVGLDLCFKAPASGVDDVE 395

Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           VP++ +HF G AD+ L   N+ V  S    +C    G +  + I GN  Q N    YD++
Sbjct: 396 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVD 454

Query: 361 QQTVSFKPTDCTK 373
           + T+SF P  C K
Sbjct: 455 KDTLSFAPVQCAK 467


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/254 (39%), Positives = 138/254 (54%), Gaps = 33/254 (12%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHF-----------NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F              S  + +  ++A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGSDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTT 238
           G G +SL+SQ+  T
Sbjct: 224 GRGPLSLVSQLGVT 237



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 2/122 (1%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIH 309
           ++IDSGTT T L +     L   ++S +     +     L LC++  S    +VP + +H
Sbjct: 317 VIIDSGTTFTALEERAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLH 376

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           F GAD++L R ++ V+     V  +       + + G++ Q N  + YD+E+  +SF+P 
Sbjct: 377 FDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPA 436

Query: 370 DC 371
            C
Sbjct: 437 KC 438


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/212 (39%), Positives = 118/212 (55%), Gaps = 13/212 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
           GF ++L H D+       +S T  Q L  A+ RS  R+      +     +    A++  
Sbjct: 28  GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+TY
Sbjct: 82  VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           ++LPC SS+CASL+  SC    C Y   YGD + + G LA ET T G+     V    I 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           FGCG+ N G   + ++G+VG G G +SL+SQ+
Sbjct: 200 FGCGSLNAGDL-ANSSGMVGFGRGPLSLVSQL 230



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
           ++IDSGT++T+L Q     +   + S I    + D    L+ C+ +    N    VP++ 
Sbjct: 315 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLV 374

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            HF  A++ L   N+ +  S      +    T    I GN  Q N  + YDI    +SF 
Sbjct: 375 FHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFV 434

Query: 368 PTDC 371
           P  C
Sbjct: 435 PAPC 438


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/215 (40%), Positives = 119/215 (55%), Gaps = 20/215 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDA-------LTRSLNRLNHFNQNSSISSSKAS 80
           G  V L H D+      + + T  Q LR A       ++R + R       SS + + A 
Sbjct: 38  GLRVALTHVDA------HGNYTKLQLLRRAARRSRHRMSRLVARTTGVPVMSSKAVAPAL 91

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC   +C+ Q +P+FDP  S
Sbjct: 92  QVPVHAGNGEFLMDMSIGTPAVAYAAIIDTGSDLVWTQCKPC--VECFNQSTPVFDPSSS 149

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           STY +LPCSS+ C+ L    C+   C Y+ +YGD S + G LA ET TL  T      LP
Sbjct: 150 STYAALPCSSTLCSDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTLAKTK-----LP 204

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            + FGCG  N G   ++  G+VGLG G +SL+SQ+
Sbjct: 205 DVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 239



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 42/132 (31%), Positives = 66/132 (50%), Gaps = 10/132 (7%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   +  + M    P AD +G  L+ C+   +      +
Sbjct: 320 TGGVIVDSGTSITYLELQGYRALKKAFAAQM--KLPAADGSGIGLDTCFEAPASGVDQVE 377

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           VP++  H  GAD+ L   N+ V  S    +C    G +  + I GN  Q N    YD+ +
Sbjct: 378 VPKLVFHLDGADLDLPAENYMVLDSGSGALCLTVMG-SRGLSIIGNFQQQNIQFVYDVGE 436

Query: 362 QTVSFKPTDCTK 373
            T+SF P  C K
Sbjct: 437 NTLSFAPVQCAK 448


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 103/254 (40%), Positives = 138/254 (54%), Gaps = 33/254 (12%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           MA FL  V+IL  L +  +S   +   G  +EL H D      Y  +E    R+R A  R
Sbjct: 1   MAAFL--VWILLLLPYVAISSTASH--GVRLELTHADDRGG--YVGAE----RVRRAADR 50

Query: 61  SLNRLNHFNQNSSISSSKA-----------SQADIIPNNANYLIRISIGTPPTERLAVAD 109
           S  R+N F       SS A           ++A +  + A YL+ I+IGTPP    AV D
Sbjct: 51  SHRRVNGFLGAIEGPSSTARLGIDGAGAGGAEASVHASTATYLVDIAIGTPPLPLTAVLD 110

Query: 110 TGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS--GV 164
           TGSDLIWTQC+ PC   +C+ Q +PL+ P  S+TY ++ C S  C +L      CS    
Sbjct: 111 TGSDLIWTQCDAPC--RRCFPQPAPLYAPARSATYANVSCRSPMCQALQSPWSRCSPPDT 168

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y  SYGDG+ ++G LATET TLGS T    A+ G+ FGCGT N G     ++G+VG+
Sbjct: 169 GCAYYFSYGDGTSTDGVLATETFTLGSDT----AVRGVAFGCGTENLG-STDNSSGLVGM 223

Query: 225 GGGDISLISQMRTT 238
           G G +SL+SQ+  T
Sbjct: 224 GRGPLSLVSQLGVT 237



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 2/122 (1%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS--LSQVPEVTIH 309
           ++IDSGTT T L +     L   ++S +     +     L LC++  S    +VP + +H
Sbjct: 317 VIIDSGTTFTALEESAFVALARALASRVRLPLASGAHLGLSLCFAAASPEAVEVPRLVLH 376

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           F GAD++L R ++ V+     V  +       + + G++ Q N  + YD+E+  +SF+P 
Sbjct: 377 FDGADMELRRESYVVEDRSAGVACLGMVSARGMSVLGSMQQQNTHILYDLERGILSFEPA 436

Query: 370 DC 371
            C
Sbjct: 437 KC 438


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/414 (29%), Positives = 179/414 (43%), Gaps = 79/414 (19%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ---NSSISSSKASQAD 83
           G  S++L+HR  P +P + +S  P     + L R   R++   Q   + +++SS      
Sbjct: 59  GSSSLKLVHRFGPCNP-HRTSTAPASSFNEILRRDKLRVDSIIQARRSMNLTSSVEHMKS 117

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
            +P         ++Y++ + IGTP  E   + DTGS LIWTQC+PC    CY +  P+FD
Sbjct: 118 SVPFYGLSKITASDYIVNVGIGTPKKEMPLIFDTGSGLIWTQCKPC--KACYPK-VPVFD 174

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  S+++K LPCSS  C S+ Q  CS   C Y  +Y D S S G LATET++    +   
Sbjct: 175 PTKSASFKGLPCSSKLCQSIRQ-GCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLK 230

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--------------AGN 242
                I  GC     G  +   +GI+GL    ISL SQ                    G+
Sbjct: 231 YDFKNILIGCSDQVSGE-SLGESGIMGLNRSPISLASQTANIYDKLFSYCIPSTPGSTGH 289

Query: 243 QRLGVSTPDIV----------------------------------------IDSGTTLTF 262
              G   P+ V                                        IDSG  LT 
Sbjct: 290 LTFGGKVPNDVRFSPVSKTAPSSDYDIKMTGISVGGRKLLIDASAFKIASTIDSGAVLTR 349

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKL 317
           LP    S L SV   M++  P+ D    L+ CY F++ S V  P +++ F G    D+ +
Sbjct: 350 LPPKAYSALRSVFREMMKGYPLLDQDDFLDTCYDFSNYSTVAIPSISVFFEGGVEMDIDV 409

Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           S   + V  S+ + C  F  + + V I+GN  Q  + V +D  ++ + F P  C
Sbjct: 410 SGIMWQVPGSK-VYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  146 bits (368), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 117/216 (54%), Gaps = 22/216 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--------ISSSKASQ 81
           S  L+ RD+     Y S   P   + D ++R   R  +     S          S     
Sbjct: 59  SFALVRRDAVTGATYPS---PRHAVLDLVSRDNARAEYLASRLSPAYQPTDFFGSESKVV 115

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           + +   +  Y +R+ IG+PPTE+  V D+GSD+IW QC+PC   +CY Q  PLFDP  S+
Sbjct: 116 SGLDEGSGEYFVRVGIGSPPTEQYLVVDSGSDVIWVQCKPC--LECYAQADPLFDPASSA 173

Query: 142 TYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           T+ ++ C S+ C +L    C  SG  C+Y VSYGDGS++ G LA ET+TLG T     A+
Sbjct: 174 TFSAVSCGSAICRTLRTSGCGDSG-GCEYEVSYGDGSYTKGTLALETLTLGGT-----AV 227

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            G+  GCG  N GLF     G++GLG G +SL+ Q+
Sbjct: 228 EGVAIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQL 262



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 56/123 (45%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +V+D+GT +T LPQ   + L       + A P A     L+ CY  +  +  +VP V+ +
Sbjct: 349 VVMDTGTAVTRLPQEAYAALRDAFVGAVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFY 408

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G A + L   N  ++V   I C  F   ++ + I GNI Q    +  D     + F P
Sbjct: 409 FDGAATLTLPARNLLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGP 468

Query: 369 TDC 371
             C
Sbjct: 469 ATC 471


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 149/349 (42%), Gaps = 69/349 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ I +GTP      V DTGSD  W QCEPC    CY Q   LFDP  SST  ++
Sbjct: 182 GTGNYVVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCV-VVCYEQQEKLFDPARSSTDANI 240

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C+ L  K CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 241 SCAAPACSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AIKGFRFGC 296

Query: 207 GTNNGGLF----------------------------------NSKTTGIVGLGGGDISLI 232
           G  N GLF                                   S  TG +  G G    +
Sbjct: 297 GERNEGLFGEAAGLLGLGRGKTSLPVQAYDKYGGVFAHCFPARSSGTGYLDFGPGSSPAV 356

Query: 233 SQMRTT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNS 269
           S   TT                    G + L +     +T   ++DSGT +T LP    S
Sbjct: 357 STKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTAGTIVDSGTVITRLPPAAYS 416

Query: 270 NLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNF 322
           +L S  +S I A+    A     L+ CY F  +SQV  P V++ F+G    DV  S   +
Sbjct: 417 SLRSAFASAIAARGYKKAPALSLLDTCYDFTGMSQVAIPTVSLLFQGGASLDVDASGIIY 476

Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              VS+  +        + V I GN     F V YDI ++ V F P  C
Sbjct: 477 AASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSPGAC 525


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/213 (40%), Positives = 119/213 (55%), Gaps = 15/213 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQAD 83
           GF ++L H D+       +S T  Q L  A+ RS  R+      +     +    A++  
Sbjct: 28  GFQLKLTHVDA------GTSYTKLQLLSRAIARSKARVAALQSAAVLPPVVDPITAARVL 81

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  ++  YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FD K S+TY
Sbjct: 82  VTASSGEYLVDLAIGTPPLYYTAIMDTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATY 139

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           ++LPC SS+CASL+  SC    C Y   YGD + + G LA ET T G+     V    I 
Sbjct: 140 RALPCRSSRCASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIA 199

Query: 204 FGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
           FGCG+ N G L NS  +G+VG G G +SL+SQ+
Sbjct: 200 FGCGSLNAGDLANS--SGMVGFGRGPLSLVSQL 230


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 78/181 (43%), Positives = 118/181 (65%), Gaps = 7/181 (3%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           T  + R  H ++  ++S   A+   +      YL+ ++IGTPP   +A+ADTGSDL WTQ
Sbjct: 34  TELMRRAAHRSRLQALSGYDANSPRLHSVQVEYLMELAIGTPPVPFVALADTGSDLTWTQ 93

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQKSCSGVN--CQYSVSYGDG 175
           C+PC    C+ QD+P++DP  SST+  +PCSS+ C  +   ++CS  +  C+Y  SY DG
Sbjct: 94  CQPC--KLCFPQDTPVYDPSASSTFSPVPCSSATCLPTWRSRNCSNPSSPCRYIYSYSDG 151

Query: 176 SFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           ++S G L TET+T+GS+  GQ V++  + FGCGT+NGG  +  +TG VGLG G +SL++Q
Sbjct: 152 AYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTDNGG-DSLNSTGTVGLGRGTLSLLAQ 210

Query: 235 M 235
           +
Sbjct: 211 L 211


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 121/444 (27%), Positives = 187/444 (42%), Gaps = 105/444 (23%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSK 78
           +A  G   + L H D+ K        +  + +R A+ RS  R    +   S    +    
Sbjct: 28  DAFAGDVRLHLTHVDAGKQ------MSRRELIRRAMQRSKARAAALSVARSGSGRVPGKS 81

Query: 79  ASQAD--------IIPN-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           A Q +        + P+ +  YLI ++IGTPP    A+ DTGSDLIWTQC PC  + C  
Sbjct: 82  AQQGEQHQQPGVPVRPSGDLEYLIDLAIGTPPQPVSALLDTGSDLIWTQCAPC--ASCLA 139

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           Q  PLF P  SS+Y  + CS   C  +   SC   + C Y  +YGDG+ + G  ATE  T
Sbjct: 140 QPDPLFAPAASSSYVPMRCSGQLCNDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFT 199

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG---------------------- 226
             S++G+ +++P + FGCGT N G  N+ + GIVG G                       
Sbjct: 200 FASSSGEKLSVP-LGFGCGTMNVGSLNNGS-GIVGFGRDPLSLVSQLSIRRFSYCLTPYT 257

Query: 227 ------------------GDISLISQMRTT-------------------IAGNQRLGVS- 248
                             GD +   Q++TT                     G +RL +  
Sbjct: 258 STRKSTLMFGSLSDGVFEGDDAATGQVQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPL 317

Query: 249 -----TPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--- 296
                 PD    +++DSGT LT  P    + +L    + +     +  +    +C++   
Sbjct: 318 SAFALRPDGSGGVIVDSGTALTLFPAAVLTEVLRAFRAQLRLPFTSSSSPDDGVCFATPM 377

Query: 297 --------FNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGN 347
                     ++  VP +  HF+GAD++L R N+ +       +C +     +S    GN
Sbjct: 378 AAGGRRASAATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGN 437

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
            +Q +  V YD+E +T+SF P  C
Sbjct: 438 FVQQDMRVLYDLEAETLSFAPAQC 461


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 122/415 (29%), Positives = 181/415 (43%), Gaps = 86/415 (20%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           ++I +D  +  F +S  T  + +R++ T      +      S+ S+   ++ +   + NY
Sbjct: 59  DMITKDEERVRFLHSRLTNKESVRNSATT-----DKLRGGPSLVSTTPLKSGLSIGSGNY 113

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC--- 148
            ++I +GTP      + DTGS L W QC+PC    C++Q  P+F P  S TYK+LPC   
Sbjct: 114 YVKIGLGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSTSKTYKALPCSSS 172

Query: 149 --SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             SS + ++LN   CS     C Y  SYGD SFS G L+ + +TL   T       G  +
Sbjct: 173 QCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSEAPSSGFVY 229

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMR---------------------------- 236
           GCG +N GLF  +++GI+GL    IS++ Q+                             
Sbjct: 230 GCGQDNQGLFG-RSSGIIGLANDKISMLGQLSKKYGNAFSYCLPSSFSAPNSSSLSGFLS 288

Query: 237 -------------TTIAGNQR------------------LGVST-----PDIVIDSGTTL 260
                        T +  NQ+                  LGVS      P I IDSGT +
Sbjct: 289 IGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPTI-IDSGTVI 347

Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFR-GADVK 316
           T LP   YN+   S +  M +    A     L+ C+  S   +S VPE+ I FR GA ++
Sbjct: 348 TRLPVAVYNALKKSFVLIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIQIIFRGGAGLE 407

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   N  V++ +   C      +N + I GN  Q  F V YD+    + F P  C
Sbjct: 408 LKAHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDVANFKIGFAPGGC 462


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 175/416 (42%), Gaps = 83/416 (19%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ---ADII 85
           + ++L HRD  K P     + P +R ++ ++R   R++   +  S  S +      +D++
Sbjct: 71  WKLKLFHRD--KLPLNFDPDHP-RRFKERISRDSKRVSSLLRLLSSGSDEQVTDFGSDVV 127

Query: 86  PN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 +  Y +RI +G+PP  +  V D+GSD++W QC+PC  S+CY Q  P+FDP  S+
Sbjct: 128 SGTEQGSGEYFVRIGVGSPPRSQYVVIDSGSDIVWVQCQPC--SECYQQSDPVFDPAGSA 185

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY  + C SS C  L+   C+   C+Y VSYGDGS++ G LA ET+T G      V +  
Sbjct: 186 TYAGISCDSSVCDRLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTFGR-----VLIRN 240

Query: 202 ITFGC----------------------------GTNNGGLFN-------SKTTGIVGLGG 226
           I  GC                            G   GG F+       +++TG +  G 
Sbjct: 241 IAIGCGHMNRGMFIGAAGLLGLGGGAMSFVGQLGGQTGGAFSYCLVSRGTESTGTLEFGR 300

Query: 227 GD-------ISLISQMRT--------TIAGNQRLGVSTPD------------IVIDSGTT 259
           G        + LI   R         +  G   + V  P+            +V+D+GT 
Sbjct: 301 GAMPVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTA 360

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGADV-K 316
           +T LP                  P +D     + CY+ N     +VP V+ +F G  +  
Sbjct: 361 VTRLPAPAYEAFRDTFIGQTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPILT 420

Query: 317 LSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           L   NF + V  E   C  F    + + I GNI Q    +  D     V F PT C
Sbjct: 421 LPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  145 bits (365), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 82/213 (38%), Positives = 121/213 (56%), Gaps = 21/213 (9%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS----ISSSKASQADIIPNN 88
           ++HR  P SP       P     + L R  +R++  ++ ++       S AS+   +P +
Sbjct: 121 VVHRHGPCSPLLARGGEPSHA--EILDRDQDRVDSIHRMTAGPWTAGQSSASKGVSLPAH 178

Query: 89  -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                  ANY++ + +GTP  + L V DTGSDL W QC+PC  + CY Q  PLFDP  S+
Sbjct: 179 RGLRLGTANYIVSVGLGTPRRDLLVVFDTGSDLSWVQCKPC--NNCYKQHDPLFDPSQST 236

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           TY ++PC + +C  L+  +CS   C+Y V YGD S ++GNLA +T+TLG ++ Q   L G
Sbjct: 237 TYSAVPCGAQEC--LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSDQ---LQG 291

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
             FGCG ++ GLF  +  G+ GLG   +SL SQ
Sbjct: 292 FVFGCGDDDTGLFG-RADGLFGLGRDRVSLASQ 323



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 54/128 (42%), Gaps = 5/128 (3%)

Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 307
           P  VIDSGT +T LP    S L S  +  +     A     L+ CY F   +  Q+P V 
Sbjct: 396 PGTVIDSGTVITRLPSRAYSALRSSFAGFMRRYKRAPALSILDTCYDFTGRTKVQIPSVA 455

Query: 308 IHFR-GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           + F  GA + L         +    C  F   G   SV I GN+ Q  F V YD+  Q +
Sbjct: 456 LLFDGGATLNLGFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKI 515

Query: 365 SFKPTDCT 372
            F    C+
Sbjct: 516 GFGAKGCS 523


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  144 bits (364), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 89/214 (41%), Positives = 125/214 (58%), Gaps = 19/214 (8%)

Query: 31  VEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN--HFNQNSSISSSKASQADIIP 86
           VEL  IH D         S T  Q +RDAL R ++R N      +SS  ++ ++   I P
Sbjct: 30  VELTRIHADP--------SVTASQFVRDALRRDMHRHNARQLAASSSNGTTVSAPTQISP 81

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+  L
Sbjct: 82  TAGEYLMTLAIGTPPVSYQAIADTGSDLIWTQCAPC-SSQCFQQPTPLYNPSSSTTFAVL 140

Query: 147 PCSS--SQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPG 201
           PC+S  S CA+    +    G  C Y+++YG G +++    +ET T GS+T      +PG
Sbjct: 141 PCNSSLSMCAAALAGTTPPPGCTCMYNMTYGSG-WTSVYQGSETFTFGSSTPANQTGVPG 199

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           I FGC   +GG   S  +G+VGLG G +SL+SQ+
Sbjct: 200 IAFGCSNASGGFNTSSASGLVGLGRGSLSLVSQL 233



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 68/129 (52%), Gaps = 12/129 (9%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQ----VP 304
           +IDSGTT+T L       + + + S++   P  D     TG L+LC+   S +     +P
Sbjct: 318 IIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGGSAATG-LDLCFELPSSTSAPPTMP 375

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
            +T+HF GAD+ L   ++ + +  ++ C   +  T+  V I GN  Q N  + YD+ Q+T
Sbjct: 376 SMTLHFDGADMVLPADSYMM-LDSNLWCLAMQNQTDGGVSILGNYQQQNMHILYDVGQET 434

Query: 364 VSFKPTDCT 372
           ++F P  C+
Sbjct: 435 LTFAPAKCS 443


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  144 bits (363), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 118/379 (31%), Positives = 177/379 (46%), Gaps = 48/379 (12%)

Query: 30  SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
           S+E++H+  P S   P   +S +  Q L    +R  +  +   +N +  S+ KAS+A + 
Sbjct: 18  SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 77

Query: 86  PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             +A      NY++ + +G+P  +   + DTGSDL WTQCEPC    CY Q   +FDP  
Sbjct: 78  SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPC-VGYCYQQREHIFDPST 136

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S +Y ++ C S  C  L     N   CS   C Y + YGDGS+S G  A E ++L ST  
Sbjct: 137 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 195

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
                    FGCG NN GLF   T G++GL    +SL+SQ  T     +      P    
Sbjct: 196 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQ--TAQKYGKVFSYCLPSSSS 249

Query: 255 DSGTTLTF---------------LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 299
            +G  L+F               LP    S++  V   ++   P       L+ CY  + 
Sbjct: 250 STG-YLSFGSGDGDSKAVKFTPRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSK 308

Query: 300 LS--QVPEVTIHFR-GADVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTN 352
               +VP++ ++F  GA++ L+     + +KVS+  VC  F G +  + V I GN+ Q  
Sbjct: 309 YKTVKVPKIILYFSGGAEMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKT 366

Query: 353 FLVGYDIEQQTVSFKPTDC 371
             V YD  +  V F P+ C
Sbjct: 367 IHVVYDDAEGRVGFAPSGC 385


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 119/429 (27%), Positives = 178/429 (41%), Gaps = 84/429 (19%)

Query: 19  VSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS--LNRLNHFNQNSSISS 76
           VS  ++    F + L+HRD       +      +  RDA+  +  + RL+H    +++  
Sbjct: 62  VSGYKSDNNTFKLNLLHRDKLSHVHGHRRGFNDRMKRDAIRVATLVRRLSH-GAPAAVKD 120

Query: 77  SKASQA----DII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           S+   A    D+I      +  Y +RI +G+PP  +  V D+GSD++W QC+PC  S+CY
Sbjct: 121 SRYKVANFATDVISGMEAGSGEYFVRIGVGSPPRNQYMVIDSGSDIVWVQCKPC--SRCY 178

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDP  SS++  + C S  C  L    C+   C+Y VSYGDGS++ G LA ET+T
Sbjct: 179 QQSDPVFDPADSSSFAGVSCGSDVCDRLENTGCNAGRCRYEVSYGDGSYTKGTLALETLT 238

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF---------------------------------- 214
           +G    + VA+     GCG  N G+F                                  
Sbjct: 239 VGQVMIRDVAI-----GCGHTNQGMFIGAAGLLGLGGGSMSFIGQLGGQTGGAFSYCLVS 293

Query: 215 -NSKTTGIVGLGGGD-------ISLISQMRT------TIAGNQRLG--VSTPD------- 251
             + +TG +  G G        ISLI   R        +AG    G  VS P+       
Sbjct: 294 RGTGSTGALEFGRGALPVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTE 353

Query: 252 -----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVP 304
                +V+D+GT +T  P           ++     P A      + CY  N     +VP
Sbjct: 354 YGTNGVVMDTGTAVTRFPTAAYVAFRDSFTAQTSNLPRAPGVSIFDTCYDLNGFESVRVP 413

Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
            V+ +F  G  + L   NF + V      C  F    + + I GNI Q    + +D    
Sbjct: 414 TVSFYFSDGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANG 473

Query: 363 TVSFKPTDC 371
            V F P  C
Sbjct: 474 FVGFGPNIC 482


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 87/226 (38%), Positives = 124/226 (54%), Gaps = 24/226 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNRLNHFNQNSSISSSKA 79
           G    L H  SP SP   SS+ P+         R+    +R   +   +   SS+  +  
Sbjct: 41  GLHQTLHHPQSPCSPAPLSSDLPFSAFITHDAARIAGLASRLATKDKDWVAASSVPLASG 100

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +   +     NY+ R+ +GTP T  + V D+GS L W QC PC  S C+ Q  PL+DP+ 
Sbjct: 101 ASVGV----GNYITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVS-CHPQAGPLYDPRA 155

Query: 140 SSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
           SSTY ++PCS+ QC     A+LN  SCSG   CQY  SYGDGSFS G L+ +TV+L S+ 
Sbjct: 156 SSTYAAVPCSAPQCAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSG 215

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
               + PG  +GCG +N GLF  +  G++GL    +SL+SQ+  ++
Sbjct: 216 ----SFPGFYYGCGQDNVGLFG-RAAGLIGLARNKLSLLSQLAPSV 256



 Score = 45.4 bits (106), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 37/122 (30%), Positives = 56/122 (45%), Gaps = 4/122 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
           +IDSGT +T LP    + L   + + + A      +  L+ C+        VP V + F 
Sbjct: 331 IIDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYS-ILQTCFKGQVAKLPVPAVNMAFA 389

Query: 312 G-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G A ++L+  N  V V+E   C  F   T+S  I GN  Q  F V YD++   + F    
Sbjct: 390 GGATLRLTPGNVLVDVNETTTCLAFA-PTDSTAIIGNTQQQTFSVVYDVKGSRIGFAAGG 448

Query: 371 CT 372
           C+
Sbjct: 449 CS 450


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 116/345 (33%), Positives = 161/345 (46%), Gaps = 72/345 (20%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y +  S+GTPP +  A+ADTGSDLIW +C     + C  Q SP + P  SST+  LPCS 
Sbjct: 91  YDMEFSMGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSD 150

Query: 151 SQCASLNQKS-----CSGVNCQYSVSYG----DGSFSNGNLATETVTLGSTTGQAVALPG 201
             C+ L   S      +G  C Y  SYG    D  ++ G LA ET TLG     A A+P 
Sbjct: 151 RLCSLLRSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTLG-----ADAVPS 205

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------------------- 236
           + FGC T          +G+VGLG G +SL+SQ+                          
Sbjct: 206 VRFGC-TTASEGGYGSGSGLVGLGRGPLSLVSQLNASTFMYCLTSDASKASPLLFGSLAS 264

Query: 237 ---------------TTIAGNQRL---------GVSTPD-IVIDSGTTLTFLPQGYNSNL 271
                          T  A N R          GV  P+ +V DSGTTLT+L +   S  
Sbjct: 265 LTGAQVQSTGLLASTTFYAVNLRSISIGSATTPGVGEPEGVVFDSGTTLTYLAEPAYSEA 324

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEVTIHFRGADVKLSRSNFFVKV 326
            +   S      V D  G  E C+        S + VP + +HF GAD+ L  +N+ V+V
Sbjct: 325 KAAFLSQTSLDQVEDTDG-FEACFQKPANGRLSNAAVPTMVLHFDGADMALPVANYVVEV 383

Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            + +VC + +  + S+ I GNIMQ N+LV +D+ +  +SF+P +C
Sbjct: 384 EDGVVCWIVQR-SPSLSIIGNIMQVNYLVLHDVHRSVLSFQPANC 427


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score =  144 bits (362), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 154/352 (43%), Gaps = 75/352 (21%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 175 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPARSSTYANV 233

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C++  C  L+ + CSG +C Y V YGDGS+S G  A +T+TL S      A+ G  FGC
Sbjct: 234 SCAAPACFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYD----AVKGFRFGC 289

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD- 251
           G  N GLF  +  G++GLG G  SL  Q               R++  G    G  +P  
Sbjct: 290 GERNEGLFG-EAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSSGTGYLDFGPGSPAA 348

Query: 252 --------IVIDSGTT-----------------------------------LTFLPQGYN 268
                   ++ D+G T                                   +T LP    
Sbjct: 349 AGARLTTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAY 408

Query: 269 SNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSN 321
           S+L S   S + A+    A     L+ CY F  +SQV  P V++ F+G    DV  S   
Sbjct: 409 SSLRSAFVSAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAILDVDASGIM 468

Query: 322 FFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +   VS+  VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 469 YAASVSQ--VCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 518


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  144 bits (362), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 74/155 (47%), Positives = 101/155 (65%), Gaps = 4/155 (2%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +     +Y+  IS+GTP      +ADTGSDLIW QC+PC    C+ Q  P+FDP+ S
Sbjct: 30  ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y ++ C  + C SL +KSCS  NC YS  YGDGS + G L++ETVTL ST G+ +A  
Sbjct: 88  SSYTTMSCGDTLCDSLPRKSCS-PNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            I FGCG  N G FN   +G+VGLG G++S +SQ+
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQL 180



 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 48/155 (30%), Positives = 74/155 (47%), Gaps = 14/155 (9%)

Query: 228 DISLISQMRTTIAGNQRLGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
           DIS+  +     AG+  +    PD    ++ DSGTTLT LP      +L  + S +    
Sbjct: 243 DISIAGRALRIPAGSFDI---KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKVSFPE 299

Query: 284 VADPTGSLELCYSFNS-----LSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFK 336
           +   +  L+LCY  +        ++P +  HF GAD +L   N+F+  ++   IVC    
Sbjct: 300 IDGSSAGLDLCYDVSGSKASYKKKIPAMVFHFEGADHQLPVENYFIAANDAGTIVCLAMV 359

Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                + IYGN+MQ NF V YDI    + + P+ C
Sbjct: 360 SSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/391 (29%), Positives = 169/391 (43%), Gaps = 79/391 (20%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTG 111
           + ++  L+++L R N      S  ++  +++  +  +ANY++ + +GTP  +   V DTG
Sbjct: 9   KYIQSRLSKNLGRENTVKDLDS--TTLPAESGSLIGSANYVVVVGLGTPKRDLSLVFDTG 66

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN----QKSCSG---V 164
           SDL WTQCEPC  S CY Q   +FDP  SS+Y ++ C+SS C  L     +  CS     
Sbjct: 67  SDLTWTQCEPCAGS-CYKQQDAIFDPSKSSSYTNITCTSSLCTQLTSDGIKSECSSSTDA 125

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
           +C Y   YGD S S G L+ E +T+ +T      +    FGCG +N GLFN  + G++GL
Sbjct: 126 SCIYDAKYGDNSTSVGFLSQERLTITATD----IVDDFLFGCGQDNEGLFNG-SAGLMGL 180

Query: 225 GGGDIS--------------------------------------LISQMRTTIAGNQRL- 245
           G   IS                                      LI    +TI+G+    
Sbjct: 181 GRHPISIVQQTSSNYNKIFSYCLPATSSSLGHLTFGASAATNASLIYTPLSTISGDNSFY 240

Query: 246 ----------GVSTPDI----------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
                     G   P +          +IDSGT +T L     + L S     +E  PVA
Sbjct: 241 GLDIVSISVGGTKLPAVSSSTFSAGGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVA 300

Query: 286 DPTGSLELCYSFNSLSQ--VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITN 340
           +  G L+ CY  +   +  VP +   F G   V+L         SE  VC  F   G  N
Sbjct: 301 NEAGLLDTCYDLSGYKEISVPRIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDN 360

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            + ++GN+ Q    V YD++   + F    C
Sbjct: 361 DITVFGNVQQKTLEVVYDVKGGRIGFGAAGC 391


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 124/217 (57%), Gaps = 22/217 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
           G  V L H D+      + + + +Q LR A  RS   ++RL        ++SSKA+    
Sbjct: 40  GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 93

Query: 81  -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  
Sbjct: 94  LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 151

Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SSTY ++PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      
Sbjct: 152 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 206

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           LPG+ FGCG  N G   S+  G+VGLG G +SL+SQ+
Sbjct: 207 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 243



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 324 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 381

Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           VP +  HF G AD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 382 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 440

Query: 361 QQTVSFKPTDCTK 373
             T+SF P  C K
Sbjct: 441 HDTLSFAPVQCNK 453


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  143 bits (361), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 118/422 (27%), Positives = 171/422 (40%), Gaps = 88/422 (20%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN--------QNSSISSSKA-- 79
           S+E+IH+  P S           R +  L +  +R+N               +  SK   
Sbjct: 67  SLEVIHKHGPCSKLSQDKGRSPSRTQ-MLDQDESRVNSIRSRLAKNPADGGKLKGSKVTL 125

Query: 80  -SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
            S++       NY++ + +GTP  +   + DTGSDL WTQCEPC    CY Q  P+F+P 
Sbjct: 126 PSKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPS 184

Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            S++Y ++ CSS  C  L     N  SCS   C Y + YGD S+S G  A + + L ST 
Sbjct: 185 KSTSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI-------------- 239
                     FGCG NN GLF     G++GLG   +SL+SQ                   
Sbjct: 245 ----VFNNFLFGCGQNNRGLFVG-VAGLIGLGRNALSLVSQTAQKYGKLFSYCLPSTSSS 299

Query: 240 ---------AGNQRLGVSTPDIVIDSGTTLTFL--------------------------- 263
                     G  +    TP +V   G +  FL                           
Sbjct: 300 TGYLTFGSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFSTAGTIID 359

Query: 264 ---------PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR- 311
                    P  Y S+L +     +   P A P   L+ CY F+      VP++ ++F  
Sbjct: 360 SGTVISRLPPTAY-SDLRASFQQQMSKYPKAAPASILDTCYDFSQYDTVDVPKINLYFSD 418

Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           GA++ L  S  F  ++   VC  F G +++  + I GN+ Q  F V YD+    + F P 
Sbjct: 419 GAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPG 478

Query: 370 DC 371
            C
Sbjct: 479 GC 480


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 90/217 (41%), Positives = 124/217 (57%), Gaps = 22/217 (10%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS---LNRLNHFNQNSSISSSKAS---- 80
           G  V L H D+      + + + +Q LR A  RS   ++RL        ++SSKA+    
Sbjct: 30  GLRVHLTHVDA------HGNYSRHQLLRRAARRSHHRMSRLVARATGVPMTSSKAAGGGD 83

Query: 81  -QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            Q  +   N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  
Sbjct: 84  LQVPVHAGNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSS 141

Query: 140 SSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SSTY ++PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      
Sbjct: 142 SSTYATVPCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK----- 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           LPG+ FGCG  N G   S+  G+VGLG G +SL+SQ+
Sbjct: 197 LPGVVFGCGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 233



 Score = 64.3 bits (155), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 314 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 371

Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           VP +  HF G AD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 372 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 430

Query: 361 QQTVSFKPTDCTK 373
             T+SF P  C K
Sbjct: 431 HDTLSFAPVQCNK 443


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 126/436 (28%), Positives = 193/436 (44%), Gaps = 93/436 (21%)

Query: 28  GFSVELIHR------DSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           G +++++HR      D    P ++      +R R  + RS+ R     + ++ +++  ++
Sbjct: 54  GSTLQIVHRACLQTGDDIAVPDHHHYTGILRRDRHRV-RSIYRRLTAAETTTTTTTIPAR 112

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
             +   +  Y++ I IGTPP     + DTGSDL W QC PCP S CY Q  PLFDP  SS
Sbjct: 113 LGLAFQSLEYVVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSS 172

Query: 142 TYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           TY  +PCS+ +C    + Q  C   +C+YSV YGD S ++G+LA ET TL   +  A A 
Sbjct: 173 TYVDVPCSAPECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAA 232

Query: 200 PGITFGCGTNNGGLFNSK---TTGIVGLGGGDISLISQMRTTIAGNQRL--------GVS 248
            G+ FGC      +FN       G++GLG GD S++SQ R +I     +        G S
Sbjct: 233 TGVVFGCSHEYISVFNDTGMGVAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLPPRGSS 292

Query: 249 TPDIVIDSG-----------------TTLTFLPQGYNSNLLSV---------------MS 276
           T  + I  G                 TT++ L   Y  NL  V               + 
Sbjct: 293 TGYLTIGGGAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSLG 352

Query: 277 SMIEA------------QPVAD------------PTGSLEL---CYSFNSLSQV--PEVT 307
           ++I++             P+ D            P GS++L   CY       V  P V 
Sbjct: 353 AVIDSGTVVTHMPAAAYYPLRDEFRLHMGSYKMLPEGSMKLLDTCYDVTGQDVVTAPRVA 412

Query: 308 IHF-RGADVKLSRSNFFVKV-SED-------IVCSVFKGITNS--VPIYGNIMQTNFLVG 356
           + F  GA + +  S   + + +ED       + C  F   TNS  + I GN+ Q  + V 
Sbjct: 413 LEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFL-PTNSAGLVIVGNMQQRAYNVV 471

Query: 357 YDIEQQTVSFKPTDCT 372
           +D++   + F P  C+
Sbjct: 472 FDVDGGRIGFGPNGCS 487


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/213 (38%), Positives = 117/213 (54%), Gaps = 16/213 (7%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRL--RDALTRSL--NRLNHFNQNSSISSSKASQADII 85
           S+ L+HRD+     Y S+      L  RD         RL+     + + S   S   I 
Sbjct: 70  SLALLHRDAVSGRTYPSTRHAMLGLAARDGARVEYLQRRLSPTTMTTEVGSEVVS--GIS 127

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
             +  Y +R+ +G+PPTE+  V D+GSD+IW QC PC  ++CY Q  PLFDP  S+++ +
Sbjct: 128 EGSGEYFVRVGVGSPPTEQYLVVDSGSDVIWIQCRPC--AECYQQADPLFDPAASASFTA 185

Query: 146 LPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           +PC S  C +L   S    +   C+Y VSYGDGS++ G LA ET+T G +T     + G+
Sbjct: 186 VPCDSGVCRTLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTFGDST----PVQGV 241

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             GCG  N GLF     G++GLG G +SL+ Q+
Sbjct: 242 AIGCGHRNRGLF-VGAAGLLGLGWGPMSLVGQL 273



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 56/125 (44%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTI 308
           +V+D+GT +T LP    + L    +S I    P A     L+ CY  +  +  +VP V +
Sbjct: 355 VVMDTGTAVTRLPPDAYAALRDAFASTIGGDLPRAPGVSLLDTCYDLSGYASVRVPTVAL 414

Query: 309 HF--RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           +F   GA + L   N  V++   + C  F    + + I GNI Q    +  D     V F
Sbjct: 415 YFGRDGAALTLPARNLLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGF 474

Query: 367 KPTDC 371
            P+ C
Sbjct: 475 GPSTC 479


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 75/165 (45%), Positives = 111/165 (67%), Gaps = 10/165 (6%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +SS A  A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC    C+ QD+P+
Sbjct: 77  TSSDAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCQPC--KLCFPQDTPI 134

Query: 135 FDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGS 191
           +D  +SS++  +PC+S+ C  + + ++C+  +  C+Y  +YGDG++S G L TET+T   
Sbjct: 135 YDTAVSSSFSPVPCASATCLPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPG 194

Query: 192 TTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
             G +V   GI FGCG +NGGL +NS  TG VGLG G +SL++Q+
Sbjct: 195 APGVSVG--GIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 235



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 66/129 (51%), Gaps = 8/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 307
           +++DSGTT TFL +     ++  ++ ++  QPV + +     C+   +    L  +P++ 
Sbjct: 319 MIVDSGTTFTFLVESAFRVVVDHVAGVLR-QPVVNASSLDSPCFPAATGEQQLPAMPDMV 377

Query: 308 IHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           +HF G AD++L R N+  F +       ++    +  V I GN  Q N  + +DI    +
Sbjct: 378 LHFAGGADMRLHRDNYMSFNQEESSFCLNIAGSPSADVSILGNFQQQNIQMLFDITVGQL 437

Query: 365 SFKPTDCTK 373
           SF PTDC K
Sbjct: 438 SFMPTDCGK 446


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 123/418 (29%), Positives = 171/418 (40%), Gaps = 90/418 (21%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  H  +  S      +   KA+ A +
Sbjct: 66  LRLTHRHGPCAPLRASSLAA-PSVADTLRADQRRAEHILRRVSGRGAPQLWDYKAAAATV 124

Query: 85  IPN------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
             N       +NY++  S+GTP   +    DTGSDL W QC+PC    CY Q  PLFDP 
Sbjct: 125 PANWGYDIGTSNYVVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPA 184

Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            SS+Y ++PC  S CA L     +CS   C Y VSYGDGS + G  +++T+TL +     
Sbjct: 185 QSSSYAAVPCGRSACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLAAN---- 240

Query: 197 VALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAG 241
             + G  FGCG   +GGLF +   G++G G    SL+ Q               +++  G
Sbjct: 241 ATVQGFLFGCGHAQSGGLF-TGIDGLLGFGREQPSLVQQTAGAYGGVFSYCLPTKSSTTG 299

Query: 242 NQRLG-------------------------------------VSTPDIVIDSGT------ 258
              LG                                     +S P     +GT      
Sbjct: 300 YLTLGGPSGVAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAFAAGTVVDTGT 359

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADV 315
            +T LP    + L S   S + + P A P G L+ CYSF     V    V + F  GA +
Sbjct: 360 VITRLPPAAYAALRSAFRSGMASYPSAPPIGILDTCYSFAGYGTVNLTSVALTFSSGATM 419

Query: 316 KLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            L              C  F   G   S+ I GN+ Q +F V   I+  +V F+P+ C
Sbjct: 420 TLGADGIM-----SFGCLAFASSGSDGSMAILGNVQQRSFEV--RIDGSSVGFRPSSC 470


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  142 bits (358), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 122/422 (28%), Positives = 183/422 (43%), Gaps = 88/422 (20%)

Query: 30  SVELIHRDSPKS---PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS-KASQADII 85
           S+E++H+  P S   P   +S +  Q L    +R  +  +   +N +  S+ KAS+A + 
Sbjct: 76  SLEVVHKHGPCSKLRPHKANSPSHTQILAQDESRVASIQSRLAKNLAGGSNLKASKATLP 135

Query: 86  PNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             +A      NY++ + +G+P  +   + DTGSDL WTQCEPC    CY Q   +FDP  
Sbjct: 136 SKSASTLGSGNYVVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCV-GYCYQQREHIFDPST 194

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S +Y ++ C S  C  L     N   CS   C Y + YGDGS+S G  A E ++L ST  
Sbjct: 195 SLSYSNVSCDSPSCEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD- 253

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGL------------------------------ 224
                    FGCG NN GLF   T G++GL                              
Sbjct: 254 ---VFNNFQFGCGQNNRGLFGG-TAGLLGLARNPLSLVSQTAQKYGKVFSYCLPSSSSST 309

Query: 225 -------GGGDISLIS----------------QMRTTIAGNQRLGV-----STPDIVIDS 256
                  G GD   +                  M     G ++L +     ST   +IDS
Sbjct: 310 GYLSFGSGDGDSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFSTAGTIIDS 369

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GA 313
           GT ++ LP    S++  V   ++   P       L+ CY  +     +VP++ ++F  GA
Sbjct: 370 GTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILDTCYDLSKYKTVKVPKIILYFSGGA 429

Query: 314 DVKLSRSN--FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           ++ L+     + +KVS+  VC  F G +  + V I GN+ Q    V YD  +  V F P+
Sbjct: 430 EMDLAPEGIIYVLKVSQ--VCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPS 487

Query: 370 DC 371
            C
Sbjct: 488 GC 489


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 101/155 (65%), Gaps = 4/155 (2%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +     +Y+  IS+GTP      +ADTGSDLIW QC+PC    C+ Q  P+FDP+ S
Sbjct: 30  ESPVASGGGDYVTTISLGTPAKVFSVIADTGSDLIWIQCKPC--QACFNQKDPIFDPEGS 87

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S+Y ++ C  + C SL +KSCS  +C YS  YGDGS + G L++ETVTL ST G+ +A  
Sbjct: 88  SSYTTMSCGDTLCDSLPRKSCS-PDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAK 146

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            I FGCG  N G FN   +G+VGLG G++S +SQ+
Sbjct: 147 NIAFGCGHLNRGSFN-DASGLVGLGRGNLSFVSQL 180



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/155 (31%), Positives = 74/155 (47%), Gaps = 14/155 (9%)

Query: 228 DISLISQMRTTIAGNQRLGVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
           DIS+  +     AG+  +    PD    ++ DSGTTLT LP      +L  + S I    
Sbjct: 243 DISIAGRALRIPAGSFDI---KPDGSGGMIFDSGTTLTLLPDAPYQIVLRALRSKISFPK 299

Query: 284 VADPTGSLELCYSFNSLS-----QVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFK 336
           +   +  L+LCY  +        ++P +  HF GAD +L   N+F+  ++   IVC    
Sbjct: 300 IDGSSAGLDLCYDVSGSKASYKMKIPAMVFHFEGADYQLPVENYFIAANDAGTIVCLAMV 359

Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                + IYGN+MQ NF V YDI    + + P+ C
Sbjct: 360 SSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAPSQC 394


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 79/416 (18%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
            S+E++HR  P     N  +        + L +  +R++  +   S       +   +P 
Sbjct: 63  LSLEVVHRSGPCIQVLNQEKAANAPSNMEILLQDRHRVDSIHARLSSHGVFQEKQATLPV 122

Query: 87  ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
                  + +Y + + +GTP  E   + DTGSDL WTQCEPC  + CY Q  P  DP  S
Sbjct: 123 QSGASIGSGDYAVTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKT-CYKQKEPRLDPTKS 181

Query: 141 STYKSLPCSSSQCASLNQ---KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++YK++ CSS+ C  L+    +SCS   C Y V YGDGS+S G  ATET+TL S+     
Sbjct: 182 TSYKNISCSSAFCKLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTLSSSN---- 237

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA----------------- 240
                 FGCG  N GLF     G++GLG   +SL SQ                       
Sbjct: 238 VFKNFLFGCGQQNSGLFRG-AAGLLGLGRTKLSLPSQTAQKYKKLFSYCLPASSSSKGYL 296

Query: 241 --GNQ-------------------------RLGVSTPDIVID------SGTTL------T 261
             G Q                          L V    + ID      SGT +      T
Sbjct: 297 SFGGQVSKTVKFTPLSEDFKSTPFYGLDITELSVGGNKLSIDASIFSTSGTVIDSGTVIT 356

Query: 262 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGA-DVKLS 318
            LP    S L S    ++   P  D     + CY F  N   ++P+V + F+G  ++ + 
Sbjct: 357 RLPSTAYSALSSAFQKLMTDYPSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGVEMDID 416

Query: 319 RSNFFVKVSE-DIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            S     V+    VC  F G  + V   I+GN  Q  + V YD  +  V F P+ C
Sbjct: 417 VSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSGC 472


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 106/354 (29%), Positives = 148/354 (41%), Gaps = 78/354 (22%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP  S ++ ++P
Sbjct: 142 SGEYFTRLGVGTPARYVYMVLDTGSDIVWIQCAPC--IKCYSQTDPVFDPTKSRSFANIP 199

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITF 204
           C S  C  L+   CS     C Y VSYGDGSF+ G  +TET+T  G+  G+ V       
Sbjct: 200 CGSPLCRRLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVV------L 253

Query: 205 GCGTNNGGL------------------------FNSKTTGIVG------------LGGGD 228
           GCG +N GL                        FNSK +  +G             G   
Sbjct: 254 GCGHDNEGLFVGAAGLLGLGRGRLSFPSQIGRRFNSKFSYCLGDRSASSRPSSIVFGDSA 313

Query: 229 IS------------------LISQMRTTIAGNQRLGVSTP----------DIVIDSGTTL 260
           IS                   +  +  ++ G +  G+S             ++IDSGT++
Sbjct: 314 ISRTTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIIDSGTSV 373

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLS 318
           T L +     L             A      + C+  +  ++V  P V +HFRGADV L 
Sbjct: 374 TRLTRAAYVALRDAFLVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRGADVPLP 433

Query: 319 RSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            SN+ + V      C  F G  + + I GNI Q  F V YD+    V F P  C
Sbjct: 434 ASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFAPRGC 487


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  141 bits (356), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 115/420 (27%), Positives = 167/420 (39%), Gaps = 96/420 (22%)

Query: 39  PKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP----------- 86
           P+   Y      Y+ L    L R   R N       ++    S++D+ P           
Sbjct: 88  PRETIYKIHHKDYKSLVLSRLHRDTVRFNSLTARLQLALEDISKSDLKPLETEIKPEDLS 147

Query: 87  ---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
                     +  Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP
Sbjct: 148 TPVTSGTSQGSGEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDP 205

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             SSTY  + C S QC+SL   SC    C Y V+YGDGS++ G+ ATE+V+ G++     
Sbjct: 206 TASSTYAPVTCQSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG---- 261

Query: 198 ALPGITFGCGTNNGGL------------------------------------------FN 215
           ++  +  GCG +N GL                                          FN
Sbjct: 262 SVKNVALGCGHDNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFN 321

Query: 216 SKTTGI------------------VGLGGGDISLISQMRTTIAGNQRLGVS-TPDIVIDS 256
           S   G+                  VGL G  +S+  QM +      RL  S    I++D 
Sbjct: 322 SAQLGVDSVTAPLMKNRKIDTFYYVGLSG--MSVGGQMVSIPESTFRLDESGNGGIIVDC 379

Query: 257 GTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-G 312
           GT +T L  Q YN  L      M +   +       + CY  +  +  +VP V+ HF  G
Sbjct: 380 GTAITRLQTQAYNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADG 438

Query: 313 ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
               L  +N+ + V S    C  F   T+S+ I GN+ Q    V +D+    + F P  C
Sbjct: 439 KSWNLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score =  141 bits (355), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 129/426 (30%), Positives = 177/426 (41%), Gaps = 91/426 (21%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           + + +G  +V L HR  P SP   + + P   L D L R   R  +  +  S    K  Q
Sbjct: 50  VRSSSGATTVPLHHRHGPCSPL-PTKKMP--SLEDRLHRDQLRAAYIKRKFSGDVKKDGQ 106

Query: 82  AD--------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                      +P       N   YLI + +G+P   +  + D+GSD+ W QC+PC   Q
Sbjct: 107 GAGGVEQSHVTVPTTLGTSLNTLEYLITVRLGSPAKTQTVLIDSGSDVSWVQCKPC--LQ 164

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLA 183
           C+ Q  PLFDP +SSTY    CSS+ CA L Q      S   CQY V Y DGS + G  +
Sbjct: 165 CHSQVDPLFDPSLSSTYSPFSCSSAACAQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYS 224

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI---- 239
           ++T+ LGS T     +    FGC     G FN  T G++GLGGG  SL SQ   T     
Sbjct: 225 SDTLALGSNT-----ISNFQFGCSHVESG-FNDLTDGLMGLGGGAPSLASQTAGTFGTAF 278

Query: 240 ----------AGNQRLG------VSTP--------------------------------- 250
                     +G   LG      V TP                                 
Sbjct: 279 SYCLPPTPSSSGFLTLGAGTSGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVFS 338

Query: 251 -DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVT 307
             +V+DSGT +T LP+   S L S   + ++    A P   ++ C+ F+  S V  P V 
Sbjct: 339 AGMVMDSGTIITRLPRTAYSALSSAFKAGMKQYRPAPPRSIMDTCFDFSGQSSVRLPSVA 398

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
           + F G  V    +N  +  +    C  F   ++  S  I GN+ Q  F V YD+    V 
Sbjct: 399 LVFSGGAVVNLDANGIILGN----CLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVG 454

Query: 366 FKPTDC 371
           FK   C
Sbjct: 455 FKAGAC 460


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 147/350 (42%), Gaps = 72/350 (20%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP      V DTGSD  W QC+PC  + CY Q  PLFDP  S+TY ++
Sbjct: 92  GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 150

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSSS C+ L    CSG +C Y + YGDGS++ G  A +T+TL   T     +    FGC
Sbjct: 151 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 205

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
           G  N GLF  +  G++GLG G  SL  Q      G                         
Sbjct: 206 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAA 264

Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
           N RL                 G+              ST   ++DSGT +T LP    + 
Sbjct: 265 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAP 324

Query: 271 LLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVKLSRSN 321
           L S  S  ++     A P  S L+ CY           +P V++ F+G    DV  S   
Sbjct: 325 LRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL 384

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +   VS+  +          V I GN  Q    V YDI ++ V F P  C
Sbjct: 385 YVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 434


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230



 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 9/128 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVPE 305
           ++ID  +T+TFL       L++ +   I        +  L+LC+      +F+ +  VP 
Sbjct: 334 MIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVPA 392

Query: 306 VTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V + F G  ++L ++  F +  E  ++C  V +    SV I GN  Q N  V Y++ +  
Sbjct: 393 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGR 452

Query: 364 VSFKPTDC 371
           V+F  + C
Sbjct: 453 VTFVQSPC 460


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 147/350 (42%), Gaps = 72/350 (20%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP      V DTGSD  W QC+PC  + CY Q  PLFDP  S+TY ++
Sbjct: 157 GTGNYVVPVRLGTPAERFTVVFDTGSDTTWVQCQPCV-AYCYRQKEPLFDPTKSATYANI 215

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            CSSS C+ L    CSG +C Y + YGDGS++ G  A +T+TL   T     +    FGC
Sbjct: 216 SCSSSYCSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLAYDT-----IKNFRFGC 270

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG------------------------- 241
           G  N GLF  +  G++GLG G  SL  Q      G                         
Sbjct: 271 GEKNRGLFG-RAAGLLGLGRGKTSLPVQAYDKYGGVFAYCLPATSAGTGFLDLGPGAPAA 329

Query: 242 NQRL-----------------GV--------------STPDIVIDSGTTLTFLPQGYNSN 270
           N RL                 G+              ST   ++DSGT +T LP    + 
Sbjct: 330 NARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTAGTLVDSGTVITRLPPSAYAP 389

Query: 271 LLSVMSSMIEAQPV-ADPTGS-LELCYSFNSLS----QVPEVTIHFRGA---DVKLSRSN 321
           L S  S  ++     A P  S L+ CY           +P V++ F+G    DV  S   
Sbjct: 390 LRSAFSKAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVFQGGACLDVDASGIL 449

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +   VS+  +          V I GN  Q    V YDI ++ V F P  C
Sbjct: 450 YVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFAPGAC 499


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 124/436 (28%), Positives = 180/436 (41%), Gaps = 100/436 (22%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--NQNSSISSSKA 79
           +E  +   S+ L+HR  P +P    S  P   + + L RS  R N+     + S+    A
Sbjct: 48  LEPSSATVSMSLVHRYGPCAP-SQYSNVPTPSISETLRRSRARTNYIMSQASKSMGMGMA 106

Query: 80  SQAD------IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
           S  D       IP       ++  Y++ +  GTP   ++ + DTGSD+ W QC PC  ++
Sbjct: 107 STPDDDDAAVTIPTRLGGFVDSLEYVVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTK 166

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGN 181
           CY Q  PLFDP  SSTY  + C++  C  L     N  +  G  C YSV Y DGS S G 
Sbjct: 167 CYPQKDPLFDPSKSSTYAPIACNTDACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGV 226

Query: 182 LATETVTLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            + ET+TL          PGIT     FGCG +  G  + K  G++GLGG  +SL+ Q  
Sbjct: 227 YSNETLTLA---------PGITVEDFHFGCGRDQRGP-SDKYDGLLGLGGAPVSLVVQTS 276

Query: 237 TTI------------------------AGNQRLGVSTP---------------------- 250
           +                          +GN+   V TP                      
Sbjct: 277 SVYGGAFSYCLPALNSEAGFLVLGSPPSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGG 336

Query: 251 ------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN 298
                        ++IDSGT  T LP+   + L + +   ++A P+  P+   + CY+F 
Sbjct: 337 KPLHIPQSAFRGGMIIDSGTVDTELPETAYNALEAALRKALKAYPLV-PSDDFDTCYNFT 395

Query: 299 SLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLV 355
             S   VP V   F  GA + L   N  +    D +     G  + + I GN+ Q    V
Sbjct: 396 GYSNITVPRVAFTFSGGATIDLDVPNGILV--NDCLAFQESGPDDGLGIIGNVNQRTLEV 453

Query: 356 GYDIEQQTVSFKPTDC 371
            YD  +  V F+   C
Sbjct: 454 LYDAGRGNVGFRAGAC 469


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  140 bits (354), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 61/128 (47%), Gaps = 9/128 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY------SFNSLSQVPE 305
           ++ID  +T+TFL       L++ +   I        +  L+LC+      +F+ +  VP 
Sbjct: 334 MIIDIASTITFLEASLYDELVNDLEVEIRLPRGTGSSLGLDLCFILPDGVAFDRV-YVPA 392

Query: 306 VTIHFRGADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V + F G  ++L ++  F +  E  ++C  V +    SV I GN  Q N  V Y++ +  
Sbjct: 393 VALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSVSILGNFQQQNMQVLYNLRRGR 452

Query: 364 VSFKPTDC 371
           V+F  + C
Sbjct: 453 VTFVQSPC 460


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  140 bits (353), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 30/227 (13%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ-------- 81
           S+ L+ RD      Y S       LR A+   + R N   +  +   S A Q        
Sbjct: 105 SLALVRRDEVTGSTYPS-------LRHAVLDLVARDNARAEYLATRLSPAYQPPGFSGSE 157

Query: 82  ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + +   +  YL+R+S+G+PPTE+  V D+GSD++W QC+PC   +CY+Q  PLFDP
Sbjct: 158 SKVVSGLDEGSGEYLVRVSVGSPPTEQYLVVDSGSDVMWVQCKPC--LECYVQADPLFDP 215

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
             S+T+  + C S+ C  L   +C       C+Y VSY DGS++ G LA ET+TLG T  
Sbjct: 216 ATSATFSGVSCGSAICRILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTLGGT-- 273

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
              A+ G+  GCG  N GLF     G++GLG G +SL+ Q+   + G
Sbjct: 274 ---AVEGVVIGCGHRNRGLFVG-AAGLMGLGWGPMSLVGQLGGEVGG 316


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  140 bits (352), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 156/346 (45%), Gaps = 70/346 (20%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ ++ +GTP T    V DTGS L W QC PC  S C+ Q  PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYASVRCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G+L+T+TV+ GST       P   
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSFGSTR-----YPSFY 246

Query: 204 FGCGT-NNG---------GLFNSK-----------------------TTGIVGLG----G 226
           +GCG  N G         GL  +K                       +TG + +G    G
Sbjct: 247 YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIGPYNTG 306

Query: 227 GDISLISQMRTTI-------------AGNQRLGVSTPD-----IVIDSGTTLTFLPQGYN 268
              S      +++              G   L VS  +      +IDSGT +T LP   +
Sbjct: 307 HYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVH 366

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKV 326
           + L   ++  +     A     L+ C+    S  +VP V + F G A +KL+  N  + V
Sbjct: 367 TALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAFAGGASMKLTTRNVLIDV 426

Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 427 DDSTTCLAFA-PTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 69/194 (35%), Positives = 109/194 (56%), Gaps = 17/194 (8%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR----LNHFNQNSSISSSKASQADI 84
           + ++L+HRD  K P +N+      R    + R   R    L          +++A  +D+
Sbjct: 68  YKLKLVHRD--KVPTFNTYHDHRTRFNARMQRDTKRAASLLRRLAAGKPTYAAEAFGSDV 125

Query: 85  IPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           +      +  Y +RI +G+PP  +  V D+GSD+IW QCEPC  +QCY Q  P+F+P  S
Sbjct: 126 VSGMEQGSGEYFVRIGVGSPPRNQYVVMDSGSDIIWVQCEPC--TQCYHQSDPVFNPADS 183

Query: 141 STYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           S++  + C+S+ C+ ++  +C    C+Y VSYGDGS++ G LA ET+T G T  + VA+ 
Sbjct: 184 SSFSGVSCASTVCSHVDNAACHEGRCRYEVSYGDGSYTKGTLALETITFGRTLIRNVAI- 242

Query: 201 GITFGCGTNNGGLF 214
               GCG +N G+F
Sbjct: 243 ----GCGHHNQGMF 252



 Score = 40.4 bits (93), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 12/128 (9%)

Query: 252 IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPE 305
           +V+D+GT +T LP    + +    ++  +++    P A      + CY  F  +S +VP 
Sbjct: 352 VVMDTGTAVTRLPTVAYEAFRDGFIAQTTNL----PRASGVSIFDTCYDLFGFVSVRVPT 407

Query: 306 VTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V+ +F G  +  L   NF + V +    C  F   ++ + I GNI Q    +  D     
Sbjct: 408 VSFYFSGGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGF 467

Query: 364 VSFKPTDC 371
           V F P  C
Sbjct: 468 VGFGPNVC 475


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 85/211 (40%), Positives = 115/211 (54%), Gaps = 13/211 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
           GF   L H D+      ++  T  Q L  AL RS  R+      ++++   A  A    +
Sbjct: 30  GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP  S+TY+
Sbjct: 84  LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           SL C+S  C +L    C    C Y   YGD + + G LA ET T G T    V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           GCG  N GL  +  +G+VG G G +SL+SQ+
Sbjct: 201 GCGNLNAGLL-ANGSGMVGFGRGSLSLVSQL 230



 Score = 42.7 bits (99), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 306
           +IDSGTT+T+L +     + +  +S I   P+ + T +  L+ C+ +    +    +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374

Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            +HF GAD +L   N+  V  S      +    ++   I G+    NF V YD+E   +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434

Query: 366 FKPTDC 371
           F P  C
Sbjct: 435 FVPAPC 440


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  139 bits (351), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 72/150 (48%), Positives = 95/150 (63%), Gaps = 8/150 (5%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            N  +L+ +SIGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SSTY ++
Sbjct: 70  GNGEFLMDVSIGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATV 127

Query: 147 PCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           PCSS+ C+ L    C S   C Y+ +YGD S + G LATET TL  +      LPG+ FG
Sbjct: 128 PCSSASCSDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFG 182

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           CG  N G   S+  G+VGLG G +SL+SQ+
Sbjct: 183 CGDTNEGDGFSQGAGLVGLGRGPLSLVSQL 212



 Score = 64.3 bits (155), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 293 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 350

Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           VP +  HF G AD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 351 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 409

Query: 361 QQTVSFKPTDCTK 373
             T+SF P  C K
Sbjct: 410 HDTLSFAPVQCNK 422


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 125/409 (30%), Positives = 182/409 (44%), Gaps = 82/409 (20%)

Query: 32  ELIHRDSPKSPFY-NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ELIHR+ P SP   N+S+T  +    A+ R   R    +++  ++  +     +   N  
Sbjct: 21  ELIHREHPSSPLRSNTSKTTTEIFLAAVKRGAERRAQLSKHI-LAEGRLFSTPVASGNGE 79

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           YLI IS G+PP +   + DTGSDLIWTQC PC    C    S +FDP  SSTY ++ C+S
Sbjct: 80  YLIDISFGSPPQKASVIVDTGSDLIWTQCLPC--ETCNAAASVIFDPVKSSTYDTVSCAS 137

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           + C+SL  +SC+  +C+Y   YGDGS ++G L+TETVT+         +P + FGCG  N
Sbjct: 138 NFCSSLPFQSCT-TSCKYDYMYGDGSSTSGALSTETVTV-----GTGTIPNVAFGCGHTN 191

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQR-----LG--VSTPDIVIDS----GTT 259
            G F +   GIVGLG G +SLISQ  +  +         LG   ++P ++ DS    G  
Sbjct: 192 LGSF-AGAAGIVGLGQGPLSLISQASSITSKKFSYCLVPLGSTKTSPMLIGDSAAAGGVA 250

Query: 260 LTFL------PQGYNSNLLSVMSSMIEAQPVADPTGSLE----------------LCY-- 295
            T L      P  Y ++L  +    +  + V  P G+                  L Y  
Sbjct: 251 YTALLTNTANPTFYYADLTGI---SVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYLE 307

Query: 296 --SFNSLSQVPEVTIHFRGAD-------------------------------VKLSRSNF 322
             +FN+L    +  + F  AD                                +L   N 
Sbjct: 308 TGAFNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHFKGADYELPPENV 367

Query: 323 FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           FV +       +    +    I GNI Q N L+ +D+  Q V FK  +C
Sbjct: 368 FVALDTGGSICLAMAASTGFSIMGNIQQQNHLIVHDLVNQRVGFKEANC 416


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 156/346 (45%), Gaps = 70/346 (20%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ ++ +GTP T    V DTGS L W QC PC  S C+ Q  PLFDP+ SSTY S+ CS
Sbjct: 133 NYVTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVS-CHRQVGPLFDPRASSTYTSVRCS 191

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           +SQC     A+LN  +CS  N C Y  SYGD SFS G L+T+TV+ GST+      P   
Sbjct: 192 ASQCDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSFGSTS-----YPSFY 246

Query: 204 FGCGT-NNG---------GLFNSK-----------------------TTGIVGLG----G 226
           +GCG  N G         GL  +K                       +TG + +G    G
Sbjct: 247 YGCGQDNEGLFGRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTGYLSIGPYNTG 306

Query: 227 GDISLISQMRTTI-------------AGNQRLGVSTPD-----IVIDSGTTLTFLPQGYN 268
              S      +++              G   L VS  +      +IDSGT +T LP   +
Sbjct: 307 HYYSYTPMASSSLDASLYFITLSGMSVGGSPLAVSPSEYSSLPTIIDSGTVITRLPTAVH 366

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKLSRSNFFVKV 326
           + L   ++  +     A     L+ C+    S  +VP V + F G A +KL+  N  + V
Sbjct: 367 TALSKAVAQAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAFAGGASMKLTTRNVLIDV 426

Query: 327 SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            +   C  F   T+S  I GN  Q  F V YD+ Q  + F    C+
Sbjct: 427 DDSTTCLAFA-PTDSTAIIGNTQQQTFSVIYDVAQSRIGFSAGGCS 471


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 78/195 (40%), Positives = 115/195 (58%), Gaps = 15/195 (7%)

Query: 49  TPYQRLRDALTRSLNRLNHFN--QNSSISSSKASQAD--IIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS  RL      +  + S+ KA  A+  I+P    YL+++ IGTPP + 
Sbjct: 43  TEHELLRRAIQRSRYRLAGIGMARGEAASARKAVVAETPIMPAGGEYLVKLGIGTPPYKF 102

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC  + CY Q  P+F+P++SSTY +LPCSS  C  L+   C   
Sbjct: 103 TAAIDTASDLIWTQCQPC--TGCYHQVDPMFNPRVSSTYAALPCSSDTCDELDVHRCGHD 160

Query: 165 N---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTG 220
           +   CQY+ +Y   + + G LA + + +G       A  G+ FGC T++ GG    + +G
Sbjct: 161 DDESCQYTYTYSGNATTEGTLAVDKLVIGED-----AFRGVAFGCSTSSTGGAPPPQASG 215

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL+SQ+
Sbjct: 216 VVGLGRGPLSLVSQL 230


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 155/362 (42%), Gaps = 73/362 (20%)

Query: 30  SVELIHRDSPKSPF---YNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           S+E+IH+  P S        S +  Q L    +R  +  +   +N +           +P
Sbjct: 67  SLEVIHKHGPCSKLSQDKGRSPSRTQMLDQDESRVNSIRSRLAKNPADGGKLKGSKVTLP 126

Query: 87  NNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           + +       NY++ + +GTP  +   + DTGSDL WTQCEPC    CY Q  P+F+P  
Sbjct: 127 SKSGSTIGTGNYVVTVGLGTPKRDLTFIFDTGSDLTWTQCEPC-ARYCYHQQEPIFNPSK 185

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S++Y ++ CSS  C  L     N  SCS   C Y + YGD S+S G  A + + L ST  
Sbjct: 186 STSYTNISCSSPTCDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD- 244

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
                    FGCG NN GLF     G++GLG   +SL+S+                    
Sbjct: 245 ---VFNNFLFGCGQNNRGLF-VGVAGLIGLGRNALSLMSKY------------------- 281

Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR- 311
                                       P A P   L+ CY F+      VP++ ++F  
Sbjct: 282 ----------------------------PKAAPASILDTCYDFSQYDTVDVPKINLYFSD 313

Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           GA++ L  S  F  ++   VC  F G +++  + I GN+ Q  F V YD+    + F P 
Sbjct: 314 GAEMDLDPSGIFYILNISQVCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFAPG 373

Query: 370 DC 371
            C
Sbjct: 374 GC 375


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 148/349 (42%), Gaps = 75/349 (21%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP  SSTY  + C
Sbjct: 18  GEYFTRVGVGNPARQFYMVLDTGSDINWLQCQPC--TDCYQQTDPIFDPTASSTYAPVTC 75

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
            S QC+SL   SC    C Y V+YGDGS++ G+ ATE+V+ G++     ++  +  GCG 
Sbjct: 76  QSQQCSSLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSG----SVKNVALGCGH 131

Query: 209 NNGGL------------------------------------------FNSKTTGI----- 221
           +N GL                                          FNS   G+     
Sbjct: 132 DNEGLFVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVDSVTA 191

Query: 222 -------------VGLGGGDISLISQMRTTIAGNQRLGVS-TPDIVIDSGTTLTFLP-QG 266
                        VGL G  +S+  QM +      RL  S    I++D GT +T L  Q 
Sbjct: 192 PLMKNRKIDTFYYVGLSG--MSVGGQMVSIPESTFRLDESGNGGIIVDCGTAITRLQTQA 249

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFF 323
           YN  L      M +   +       + CY  +  +  +VP V+ HF  G    L  +N+ 
Sbjct: 250 YNP-LRDAFVRMTQNLKLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSWNLPAANYL 308

Query: 324 VKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + V S    C  F   T+S+ I GN+ Q    V +D+    + F P  C
Sbjct: 309 IPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 107/411 (26%), Positives = 175/411 (42%), Gaps = 96/411 (23%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------------------NYLIRISI 97
           D+  +   R++  ++ +++S S A++ D  P  A                   YL+ + +
Sbjct: 96  DSAEKDAVRIDTMHRRAALSGSAAARRDSAPRRALSERVVATVESGVPVGSGEYLVDVYL 155

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---- 153
           GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S +Y+++ C   +C    
Sbjct: 156 GTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQSGPIFDPAASISYRNVTCGDDRCRLVS 213

Query: 154 --ASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             A    + C       C Y   YGD S + G+LA E  T+  T      + G+ FGCG 
Sbjct: 214 PPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAFGCGH 273

Query: 209 NNGGLFN----------------SKTTGIVG-------------------LGGGDISLIS 233
            N GLF+                S+  G+ G                   + G D +L++
Sbjct: 274 RNRGLFHGAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLVEHGSAAGSKIIFGHDDALLA 333

Query: 234 ---------------------QMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQ-G 266
                                Q+++ + G + + +S+  +     +IDSGTTL++ P+  
Sbjct: 334 HPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLSAGGTIIDSGTTLSYFPEPA 393

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFF 323
           Y +   + +  M  + P+      L  CY+ +     +VPE+++ F  GA  +    N+F
Sbjct: 394 YQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEVPELSLVFADGAAWEFPAENYF 453

Query: 324 VKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           +++  E I+C    G   S + I GN  Q NF V YD+E   + F P  C 
Sbjct: 454 IRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNFHVLYDLEHNRLGFAPRRCA 504


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 109/354 (30%), Positives = 156/354 (44%), Gaps = 71/354 (20%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A I+P    Y++ + +GTP  +     DTGSDL WTQCEPC    C+ Q+ P FDP  S+
Sbjct: 131 ASIVPTGGAYVVTVGLGTPKKDFTLSFDTGSDLTWTQCEPC-LGGCFPQNQPKFDPTTST 189

Query: 142 TYKSLPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +YK++ CSS  C  + +     + C    C Y + YG G ++ G LATET+ + S+    
Sbjct: 190 SYKNVSCSSEFCKLIAEGNYPAQDCISNTCLYGIQYGSG-YTIGFLATETLAIASSD--- 245

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--------------AGN 242
                  FGC   + G FN  TTG++GLG   I+L SQ                    G+
Sbjct: 246 -VFKNFLFGCSEESRGTFNG-TTGLLGLGRSPIALPSQTTNKYKNLFSYCLPASPSSTGH 303

Query: 243 QRLGV-------STP------------------------------DIVIDSGTTLTFLPQ 265
              GV       STP                                +IDSGTT TFLP 
Sbjct: 304 LSFGVEVSQAAKSTPISPKLKQLYGLNTVGISVRGRELPINGSISRTIIDSGTTFTFLPS 363

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFRGA-DVKLSRS 320
              S L S    M+    + + T S + CY F+++      +P ++I F G  +V++  S
Sbjct: 364 PTYSALGSAFREMMANYTLTNGTSSFQPCYDFSNIGNGTLTIPGISIFFEGGVEVEIDVS 423

Query: 321 NFFVKVSE-DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              + V+    VC  F   G  +   I+GN  Q  + V YD+ +  V F P  C
Sbjct: 424 GIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVGFAPKGC 477


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 153/363 (42%), Gaps = 81/363 (22%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
            YL+ +S+GTPP       DTGSDL+WTQC PC    C+ Q + P+ DP  SST+ +LPC
Sbjct: 89  EYLMHVSVGTPPRPVALTLDTGSDLVWTQCAPC--LDCFEQGAAPVLDPAASSTHAALPC 146

Query: 149 SSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPGI 202
            +  C +L   SC G +     C Y   YGD S + G LAT++ T G       +A   +
Sbjct: 147 DAPLCRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRV 206

Query: 203 TFGCGTNNGG----------------------------------LFNSKTTGIVGLGGGD 228
           TFGCG  N G                                  +F++K++ +V LG   
Sbjct: 207 TFGCGHINKGIFQANETGIAGFGRGRWSLPSQLNVTSFSYCFTSMFDTKSSSVVTLGAAA 266

Query: 229 ISLISQMRTTIAGNQR-------------------------LGVSTPD------IVIDSG 257
             L+        G+ R                           V+ P+       +IDSG
Sbjct: 267 AELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSSTIIDSG 326

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-----QVPEVTIHFRG 312
            ++T LP+     + +   S +     A  + +L+LC++    +      VP +T+H  G
Sbjct: 327 ASITTLPEDVYEAVKAEFVSQVGLPAAAAGSAALDLCFALPVAALWRRPAVPALTLHLDG 386

Query: 313 -ADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            AD +L R N+ F   +  ++C V         + GN  Q N  V YD+E   +SF P  
Sbjct: 387 GADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVVYDLENDVLSFAPAR 446

Query: 371 CTK 373
           C K
Sbjct: 447 CDK 449


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  138 bits (348), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 110/348 (31%), Positives = 157/348 (45%), Gaps = 73/348 (20%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+FDPK SS+Y ++ CS
Sbjct: 116 NYVTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVS-CHRQSGPVFDPKTSSSYAAVSCS 174

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S QC     A+LN   CS  N C Y  SYGD SFS G L+ +TV+ G     A ++P   
Sbjct: 175 SPQCDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSFG-----ANSVPNFY 229

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI-------------AGNQRLGVSTP 250
           +GCG +N GLF  ++ G++GL    +SL+ Q+  T+             +G   +G   P
Sbjct: 230 YGCGQDNEGLFG-RSAGLMGLARNKLSLLYQLAPTLGYSFSYCLPSTSSSGYLSIGSYNP 288

Query: 251 D-----------------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--------- 284
                              +  SG T+   P   +S+  + + ++I++  V         
Sbjct: 289 GGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPTIIDSGTVITRLPTSVY 348

Query: 285 --------ADPTGS---------LELCYS--FNSLSQVPEVTIHFR-GADVKLSRSNFFV 324
                   A   GS         L+ C+    + L  VP V++ F  GA +KLS  N  V
Sbjct: 349 TALSKAVAAAMKGSTKRAAAYSILDTCFEGQASKLRAVPAVSMAFSGGATLKLSAGNLLV 408

Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            V     C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 409 DVDGATTCLAF-APARSAAIIGNTQQQTFSVVYDVKSNRIGFAAAGCS 455


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 116/412 (28%), Positives = 175/412 (42%), Gaps = 88/412 (21%)

Query: 30  SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLN-----HFNQNSSISSSKA 79
           S+E++H+  P S   N      S+TP+  + +     +  +N     +  Q+SS+S   +
Sbjct: 70  SLEVVHKHGPCSQLNNHDGKAKSKTPHSEILNQDKERVKYINSRISKNLGQDSSVSELDS 129

Query: 80  ----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
               +++  +  + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q   +F
Sbjct: 130 VTLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQDAIF 188

Query: 136 DPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVT 188
           DP  S++Y ++ C+S+ C  L     N+  CS     C Y + YGD SFS G  + E ++
Sbjct: 189 DPSKSTSYSNITCTSTLCTQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLS 248

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR------------ 236
           + +T      +    FGCG NN GLF   + G++GLG   IS + Q              
Sbjct: 249 VTATD----IVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAVYRKIFSYCLP 303

Query: 237 ------------------------TTIA---------------GNQRLGVS-----TPDI 252
                                   +TI+               G  +L VS     T   
Sbjct: 304 ATSSSTGRLSFGTTTTSYVKYTPFSTISRGSSFYGLDITGISVGGAKLPVSSSTFSTGGA 363

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGT +T LP    + L S     +   P A     L+ CY  +      +P++   F
Sbjct: 364 IIDSGTVITRLPPTAYTALRSAFRQGMSKYPSAGELSILDTCYDLSGYEVFSIPKIDFSF 423

Query: 311 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 359
            G   V+L         S   VC  F   G  + V IYGN+ Q    V YD+
Sbjct: 424 AGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 122/446 (27%), Positives = 171/446 (38%), Gaps = 102/446 (22%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS--- 77
           PI A +    V ++HR  P SP   +         + L    NR+   +   S +++   
Sbjct: 65  PITATSSAARVPIVHRHGPCSPLAGAHAGKPPSHAEILAADQNRVESLHHRVSSTTTGLG 124

Query: 78  -KASQADIIPNN------------------------ANYLIRISIGTPPTERLAVADTGS 112
            K       P +                        ANY++ I +GTPP+    V DTGS
Sbjct: 125 GKPRTKKKTPGHSSVPASSSSSSSSVPASSGLSLGTANYVVPIGLGTPPSRFTVVFDTGS 184

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
           D  W QC PC  S CY Q   LFDP  SSTY ++ C+   CA L+   C+  +C Y + Y
Sbjct: 185 DTTWVQCRPCVVS-CYKQKDRLFDPAKSSTYANVSCADPACADLDASGCNAGHCLYGIQY 243

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN---------------------- 210
           GDGS++ G  A +T+ +        A+ G  FGCG  N                      
Sbjct: 244 GDGSYTVGFFAKDTLAVAQD-----AIKGFKFGCGEKNRGLFGQTAGLLGLGRGPTSITV 298

Query: 211 ------GGLF--------------------------NSKTTGIVGLGGGDISLISQMRTT 238
                 GG F                          N+KTT ++   G     +      
Sbjct: 299 QAYEKYGGSFSYCLPASSAATGYLEFGPLSPSSSGSNAKTTPMLTDKGPTFYYVGLTGIR 358

Query: 239 IAGNQRLGV------STPDIVIDSGTTLTFLPQ--GYNSNLLSVMSSMIEAQPVADPTGS 290
           + G Q LG       S    ++DSGT +T LP       +     +        A     
Sbjct: 359 VGGKQ-LGAIPESVFSNSGTLVDSGTVITRLPDTAYAALSSAFAAAMAASGYKKAAAYSI 417

Query: 291 LELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIY 345
           L+ CY F  LSQV  P V++ F+G A + L  S     +S+  VC  F   G   SV I 
Sbjct: 418 LDTCYDFTGLSQVSLPTVSLVFQGGACLDLDASGIVYAISQSQVCLGFASNGDDESVGIV 477

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           GN  Q  + V YD+ ++ V F P  C
Sbjct: 478 GNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 128/423 (30%), Positives = 174/423 (41%), Gaps = 97/423 (22%)

Query: 31  VELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQAD 83
           + L H+  P +P   SS  TP   + D L     R  +  +  S      +  SKA  A 
Sbjct: 67  LRLTHKHGPCAPSRASSLATP--SVADTLRADQRRAEYILRRVSGRGTPQLWDSKAEAAT 124

Query: 84  I-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             +P N        NY++ +S+GTP   +    DTGSDL W QC PC    CY Q  PLF
Sbjct: 125 ATVPANWGFNIGTLNYVVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLF 184

Query: 136 DPKMSSTYKSLPCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           DP  SS+Y ++PC    C  L     SCS   C Y VSYGDGS + G  +++T+TL    
Sbjct: 185 DPAQSSSYAAVPCGGPVCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPND 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------- 234
               A+ G  FGCG    G   +   G++GLG  + SL+ Q                   
Sbjct: 245 ----AVRGFFFGCGHAQSGFTGND--GLLGLGREEASLVEQTAGTYGGVFSYCLPTRPST 298

Query: 235 ----------------------------------MRTTIA-GNQRLGVSTP----DIVID 255
                                             M T I+ G Q+L V +       V+D
Sbjct: 299 TGYLTLGGPSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVFAGGTVVD 358

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHFR 311
           +GT +T LP    + L S   S + +   P A  TG L+ CY+F+    V  P V + F 
Sbjct: 359 TGTVITRLPPTAYAALRSAFRSGMASYGYPSAPATGILDTCYNFSGYGTVTLPNVALTFS 418

Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
            GA V L              C  F   G    + I GN+ Q +F V   I+  +V FKP
Sbjct: 419 GGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKP 471

Query: 369 TDC 371
           + C
Sbjct: 472 SSC 474


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  137 bits (346), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 93/244 (38%), Positives = 129/244 (52%), Gaps = 20/244 (8%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVEL--IHRDSPKSPFYNSSETPYQRLRDALT 59
           A   S   ++  L F  ++       G  VEL  +H D         S T  Q +R AL 
Sbjct: 7   AQMASLAVLIISLVFAALASDSDAAAGVRVELTRVHADP--------SVTASQFVRGALR 58

Query: 60  RSLNRLNHFNQNSSISSSKASQADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           R ++R N      + SS     A     P    YL+ ++IGTPP    A+ADTGSDLIWT
Sbjct: 59  RDMHRHNARKLALAASSGATVSAPTQDSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWT 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCS----GVNCQYSVS 171
           QC PC  SQC+ Q +PL++P  S+T+  LPC+SS   CA+    + +    G  C Y+V+
Sbjct: 119 QCAPC-TSQCFRQPTPLYNPSSSTTFAVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVT 177

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           YG G +++    +ET T GST      +PGI FGC T + G   S  +G+VGLG G +SL
Sbjct: 178 YGSG-WTSVFQGSETFTFGSTPAGHARVPGIAFGCSTASSGFNASSASGLVGLGRGRLSL 236

Query: 232 ISQM 235
           +SQ+
Sbjct: 237 VSQL 240



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 8/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S++   P  D +    L+LC+   S +     +P 
Sbjct: 324 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPPAMPS 382

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
           +T+HF GAD+ L   ++ +     + C   +  T+  V I GN  Q N  + YDI Q+T+
Sbjct: 383 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 442

Query: 365 SFKPTDCTK 373
           SF P  C+ 
Sbjct: 443 SFAPAKCSA 451


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 113/432 (26%), Positives = 179/432 (41%), Gaps = 99/432 (22%)

Query: 23  EAQTGG--FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           + + GG  + ++++HRD      + +S+    RL   L R   R+    +  S     + 
Sbjct: 64  DHEEGGEKWMMKVVHRDQLS---FGNSDDHRHRLDGRLKRDAKRVASLIRRLSSGGGGSY 120

Query: 81  QAD---------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           + D         +   +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q 
Sbjct: 121 RVDDFGTDVISGMEQGSGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCQPC--TQCYHQS 178

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT--- 188
            P+FDP  S+++  + CSSS C  L    C    C+Y VSYGDGS++ G LA ET+T   
Sbjct: 179 DPVFDPADSASFTGVSCSSSVCDRLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTFGR 238

Query: 189 -------------------------------------LGSTTGQAVALPGITFGCGTNNG 211
                                                LG  TG A +   ++ G  ++  
Sbjct: 239 TMVRSVAIGCGHRNRGMFVGAAGLLGLGGGSMSFVGQLGGQTGGAFSYCLVSRGTDSSGS 298

Query: 212 GLFNSKT---------------------TGIVGLGGGDISL-ISQ--MRTTIAGNQRLGV 247
            +F  +                       G+ GLG G I + IS+   R T  G+     
Sbjct: 299 LVFGREALPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDG---- 354

Query: 248 STPDIVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-- 301
               +V+D+GT +T LP    Q +    L+  +++  A  VA      + CY        
Sbjct: 355 ---GVVMDTGTAVTRLPTLAYQAFRDAFLAQTANLPRATGVA----IFDTCYDLLGFVSV 407

Query: 302 QVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
           +VP V+ +F G  +  L   NF + + +    C  F   T+ + I GNI Q    + +D 
Sbjct: 408 RVPTVSFYFSGGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDG 467

Query: 360 EQQTVSFKPTDC 371
               V F P  C
Sbjct: 468 ANGYVGFGPNIC 479


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 72/171 (42%), Positives = 111/171 (64%), Gaps = 14/171 (8%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           +SS A  A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC    C+ QD+P+
Sbjct: 79  TSSNAGPARLRSGQAEYLMELAIGTPPVPFVALADTGSDLTWTQCKPC--KLCFPQDTPI 136

Query: 135 FDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTL 189
           +D   S+++  +PC+S+ C  +  + ++C+      C+Y  +Y DG++S G L TET+T 
Sbjct: 137 YDTAASASFSPVPCASATCLPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTF 196

Query: 190 GSTT----GQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
             ++    G  V++ G+ FGCG +NGGL +NS  TG VGLG G +SL++Q+
Sbjct: 197 AGSSPGAPGPGVSVGGVAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 245



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 36/129 (27%), Positives = 66/129 (51%), Gaps = 8/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVT 307
           +++DSGT  T L +     +++ ++ ++  QPV + +     C+   +    L  +P++ 
Sbjct: 331 MIVDSGTIFTVLVESAFRVVVNHVAGVLN-QPVVNASSLDSPCFPATAGEQQLPDMPDML 389

Query: 308 IHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           +HF G AD++L R N+  F + S     ++    +    I GN  Q N  + +DI    +
Sbjct: 390 LHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDITVGQL 449

Query: 365 SFKPTDCTK 373
           SF PTDC+K
Sbjct: 450 SFVPTDCSK 458


>gi|222618833|gb|EEE54965.1| hypothetical protein OsJ_02555 [Oryza sativa Japonica Group]
          Length = 393

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 114/373 (30%), Positives = 163/373 (43%), Gaps = 68/373 (18%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 32  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 91

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 92  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 151

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G          
Sbjct: 152 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGT--------- 202

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ--------------- 234
                     G  FGC     G   + KT G++GLGG   SL+SQ               
Sbjct: 203 ----------GFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQTAARSKKVPTYYFAA 252

Query: 235 MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
           +     G ++LG+S P +     ++DSGT +T LP    + L S   + +     A+P G
Sbjct: 253 LEDIAVGGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLG 311

Query: 290 SLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIY 345
            L+ C++F  L +V  P V + F G  V    ++  V       C  F    +  +    
Sbjct: 312 ILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTI 367

Query: 346 GNIMQTNFLVGYD 358
           GN+ Q  F V YD
Sbjct: 368 GNVQQRTFEVLYD 380


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  137 bits (345), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 117/416 (28%), Positives = 181/416 (43%), Gaps = 80/416 (19%)

Query: 31  VELIHRDSPKSPFY--NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
           ++++H+  P S     + +E  Y  L+D  +R  +  +  +++S +S  KA+ A  +P  
Sbjct: 85  LKVVHKHGPCSDLRQGHKAEAQYILLQDQ-SRVDSIHSKLSKDSGLSDVKATAATTLPAK 143

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                 + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q   +F+P  S+
Sbjct: 144 DGSIIGSGNYFVTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKS-CYNQKEAIFNPSQST 202

Query: 142 TYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           +Y ++ C S+ C SL     N  +C+   C Y + YGD SFS G    E ++L +T    
Sbjct: 203 SYANISCGSTLCDSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD--- 259

Query: 197 VALPGITFGCGTNN----------------------------GGLFN------SKTTGIV 222
                  FGCG NN                              +F+      S +TG +
Sbjct: 260 -VFNDFYFGCGQNNKGLFGGAAGLLGLGRDKLSLVSQTAQRYNKIFSYCLPSSSSSTGFL 318

Query: 223 GLGG---------------GDISLISQMRTTIA-GNQRLGV-----STPDIVIDSGTTLT 261
             GG               G  S      T I+ G ++L +     ST   +IDSGT +T
Sbjct: 319 TFGGSTSKSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFSTAGTIIDSGTVIT 378

Query: 262 FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGA-DVKLS 318
            LP    S L S    ++   P A     L+ C+ F++     VP++ + F G   V + 
Sbjct: 379 RLPPAAYSALSSTFRKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFSGGVVVDID 438

Query: 319 RSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++  F       VC  F G +++  V I+GN+ Q    V YD     V F P  C+
Sbjct: 439 KTGIFYVNDLTQVCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFAPAGCS 494


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 121/416 (29%), Positives = 172/416 (41%), Gaps = 95/416 (22%)

Query: 30  SVELIHRDSPKSPFYN-----SSETPYQRLRDALTRSLNRLNHFN--------QNSSI-- 74
           S+E++H+  P S   +      S TP+    D L +   R+ + N        Q+SS+  
Sbjct: 71  SLEVVHKHGPCSQLNDHDGKAKSTTPHS---DILNQDKERVKYINSRLSKNLGQDSSVEE 127

Query: 75  --SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             S++  +++  +  + NY + + +GTP  +   + DTGSDL WTQCEPC  S CY Q  
Sbjct: 128 LDSATLPAKSGSLIGSGNYFVVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARS-CYKQQD 186

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN--CQYSVSYGDGSFSNGNLATE 185
            +FDP  S++Y ++ C+S+ C  L     N   CS     C Y + YGD SFS G  + E
Sbjct: 187 VIFDPSKSTSYSNITCTSALCTQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRE 246

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR--------- 236
            +T+ +T      +    FGCG NN GLF   + G++GLG   IS + Q           
Sbjct: 247 RLTVTATD----VVDNFLFGCGQNNQGLFGG-SAGLIGLGRHPISFVQQTAAKYRKIFSY 301

Query: 237 ------------------------------------------TTIA-GNQRLGVS----- 248
                                                     T IA G  +L VS     
Sbjct: 302 CLPSTSSSTGHLSFGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFS 361

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEV 306
           T   +IDSGT +T LP      L S     +   P A     L+ CY  +      +P +
Sbjct: 362 TGGAIIDSGTVITRLPPTAYGALRSAFRQGMSKYPSAGELSILDTCYDLSGYKVFSIPTI 421

Query: 307 TIHFRGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDI 359
              F G   VKL         S   VC  F   G  + V IYGN+ Q    V YD+
Sbjct: 422 EFSFAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/240 (37%), Positives = 128/240 (53%), Gaps = 27/240 (11%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRL- 65
           L  L F VV    A +G  SV +    IH D    P   + E     +RDAL R ++R  
Sbjct: 11  LAVLVFLVVCATLA-SGAASVRVGLTRIHSD----PDITAPE----FVRDALRRDMHRQQ 61

Query: 66  ------NHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
                     ++   + S  ++ D+ PN   YL+ +SIGTPP    A+ADTGSDLIWTQC
Sbjct: 62  SRSLFGRELAESDGTTVSARTRKDL-PNGGEYLMTLSIGTPPLSYPAIADTGSDLIWTQC 120

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASL--NQKSCSGVNCQYSVSYGDG 175
            PC   QC+ Q +PL++P  S+T+  LPC+S  S CA +   +    G  C Y+ +YG G
Sbjct: 121 APCSGDQCFAQPAPLYNPASSTTFGVLPCNSSLSMCAGVLAGKAPPPGCACMYNQTYGTG 180

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            ++ G   +ET T GS       +PGI FGC   +   +N  + G+VGLG G +SL+SQ+
Sbjct: 181 -WTAGVQGSETFTFGSAAADQARVPGIAFGCSNASSSDWNG-SAGLVGLGRGSLSLVSQL 238



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 38/131 (29%), Positives = 70/131 (53%), Gaps = 9/131 (6%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSF----NSLSQ 302
           T  ++IDSGTT+T L       + + + S++    +  +D TG L+LCY+     ++   
Sbjct: 318 TGGLIIDSGTTITSLVNAAYQQVRAAVQSLVTLPAIDGSDSTG-LDLCYALPTPTSAPPA 376

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQ 361
           +P +T+HF GAD+ L   ++ +  S  + C   +  T+ ++  +GN  Q N  + YD+  
Sbjct: 377 MPSMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRN 435

Query: 362 QTVSFKPTDCT 372
           + +SF P  C+
Sbjct: 436 EMLSFAPAKCS 446


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/211 (39%), Positives = 114/211 (54%), Gaps = 13/211 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---I 84
           GF   L H D+      ++  T  Q L  AL RS  R+      ++++   A  A    +
Sbjct: 30  GFKATLRHVDA------DAGYTEEQLLSRALRRSSARVATLQSLAALAPGDAITAARILV 83

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP  S+TY+
Sbjct: 84  LASDGEYLMEMGIGTPTRYYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPARSATYR 141

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           SL C+S  C +L    C    C Y   YGD + + G LA ET T G T    V+LPGI+F
Sbjct: 142 SLGCASPACNALYYPLCYQKVCVYQYFYGDSASTAGVLANETFTFG-TNETRVSLPGISF 200

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           GCG  N G   +  +G+VG G G +SL+SQ+
Sbjct: 201 GCGNLNAGSL-ANGSGMVGFGRGSLSLVSQL 230



 Score = 42.7 bits (99), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 62/126 (49%), Gaps = 8/126 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPEV 306
           +IDSGTT+T+L +     + +  +S I   P+ + T +  L+ C+ +    +    +P++
Sbjct: 316 IIDSGTTITYLAEPAYDAVRAAFASQITL-PLLNVTDASVLDTCFQWPPPPRQSVTLPQL 374

Query: 307 TIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            +HF GAD +L   N+  V  S      +    ++   I G+    NF V YD+E   +S
Sbjct: 375 VLHFDGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMS 434

Query: 366 FKPTDC 371
           F P  C
Sbjct: 435 FVPAPC 440


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score =  136 bits (343), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/411 (27%), Positives = 174/411 (42%), Gaps = 80/411 (19%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNR----LNHFNQNSSISSSKASQ-- 81
           +  +L HRD+      N  +T ++ R    + R + R    LN  N+N+    +  +   
Sbjct: 58  WKTKLFHRDN-----INLKKTTHKTRFISRINRDIKRVTFLLNRLNKNTQEQQTTTATEA 112

Query: 82  ---ADII----PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              +D++      +  Y +RI IG+P   +  V D+GSD++W QCEPC   QCY Q  P+
Sbjct: 113 SFGSDVVSGTEEGSGEYFVRIGIGSPAIYQYMVIDSGSDIVWIQCEPC--DQCYNQTDPI 170

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           F+P  S+++  + CSS+ C  L+   +C    C Y V+YGDGS++ G LA ET+T+G T 
Sbjct: 171 FNPATSASFIGVACSSNVCNQLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITIGRTV 230

Query: 194 GQAVALPGITFGCGTNNGGLF--------------------NSKTTGIVG---------L 224
            Q  A+     GCG  N G+F                     ++T G  G         +
Sbjct: 231 IQDTAI-----GCGHWNEGMFVGAAGLLGLGGGPMSFVGQLGAQTGGAFGYCLVSRAMPV 285

Query: 225 GGGDISLISQ------------------MRTTIAGN--QRLGVSTPDIVIDSGTTLTFLP 264
           G   + LI                    +R  I+    Q   + T  +V+D+GT +T LP
Sbjct: 286 GAMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAITRLP 345

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKLSRSNF 322
               +       +     P A      + CY  N     +VP V+ +F G  +    +  
Sbjct: 346 TVAYNAFRDAFIAQTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQILTFPARN 405

Query: 323 FVKVSEDI--VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           F+  ++D+   C  F    + + I GNI Q    V  D     V F P  C
Sbjct: 406 FLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 116/207 (56%), Gaps = 28/207 (13%)

Query: 29  FSVELIHRDSP-KSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--------NSSISSSKA 79
           +SV+++HRDS       N++ +  +RL + L R   R+    Q        N   + S  
Sbjct: 114 WSVQVVHRDSLLVKDAANATASYERRLEETLRRDARRVRGLEQRIEKRLRLNKDPAGSHE 173

Query: 80  SQADIIPN------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
           + A++               +  Y  RI +GTP  E+  V DTGSD++W QCEPC  S+C
Sbjct: 174 NVAEVAAEFGGEVVSGMAQGSGEYFTRIGVGTPMREQYMVLDTGSDVVWIQCEPC--SKC 231

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P +S+++ +L C+S+ C+ L+  +C G  C Y VSYGDGS++ G+ ATE +
Sbjct: 232 YSQVDPIFNPSLSASFSTLGCNSAVCSYLDAYNCHGGGCLYKVSYGDGSYTIGSFATEML 291

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G+T+ + VA+     GCG +N GLF
Sbjct: 292 TFGTTSVRNVAI-----GCGHDNAGLF 313


>gi|125602787|gb|EAZ42112.1| hypothetical protein OsJ_26672 [Oryza sativa Japonica Group]
          Length = 477

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 104/305 (34%), Positives = 144/305 (47%), Gaps = 47/305 (15%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291

Query: 211 GGLFNSKTTGIVGLG--GGDISL------------ISQMRTTIAGNQRLGVSTPDIVIDS 256
            GLF   T G++GLG  G    L            ++      A     G+   ++++DS
Sbjct: 292 RGLFGG-TAGLMGLGPDGALAGLPDGAPPPFYFMNVTGASVGGAAVAAAGLGAANVLLDS 350

Query: 257 GTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR- 311
           GT +T L P  Y +           E  P A P   L+ CY+     +  VP +T+    
Sbjct: 351 GTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLLTLRLEG 410

Query: 312 GADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           GAD+ +  +       +D   VC     ++  +  PI GN  Q N  V YD     + F 
Sbjct: 411 GADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVGSRLGFA 470

Query: 368 PTDCT 372
             DC+
Sbjct: 471 DEDCS 475


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 76/166 (45%), Positives = 96/166 (57%), Gaps = 20/166 (12%)

Query: 90  NYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           NY+  I++G    + L V  DTGSDL W QCEPCP S CY Q  PLFDP  S T+ ++PC
Sbjct: 179 NYVTTIALGGGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPC 238

Query: 149 SSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            S  CA+ + K  +G               C Y++SYGDGSFS G LA +T+ LG+TT  
Sbjct: 239 GSPACAA-SLKDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTLGLGTTT-- 295

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
              L G  FGCG +N GLF   T G++GLG  D+SL+SQ      G
Sbjct: 296 --KLDGFVFGCGLSNRGLFGG-TAGLMGLGRTDLSLVSQTAARFGG 338


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 89/218 (40%), Positives = 121/218 (55%), Gaps = 20/218 (9%)

Query: 28  GFSVEL--IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI- 84
           G  VEL  +H D         S T  Q +R AL R ++R N      + SS     A   
Sbjct: 31  GVRVELTRVHADP--------SVTASQFVRGALRRDMHRHNARKLALAASSGATVSAPTQ 82

Query: 85  -IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             P    YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+
Sbjct: 83  NSPTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTF 141

Query: 144 KSLPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             LPC+SS   CA+    + +    G  C Y+V+YG G +++    +ET T GST     
Sbjct: 142 AVLPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGQS 200

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            +PGI FGC T + G   S  +G+VGLG G +SL+SQ+
Sbjct: 201 RVPGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQL 238



 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 67/129 (51%), Gaps = 8/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S++   P  D + +  L+LC+   S +     +P 
Sbjct: 322 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSAATGLDLCFMLPSSTSAPPAMPS 380

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
           +T+HF GAD+ L   ++ +     + C   +  T+  V I GN  Q N  + YDI Q+T+
Sbjct: 381 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 440

Query: 365 SFKPTDCTK 373
           SF P  C+ 
Sbjct: 441 SFAPAKCSA 449


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 115/416 (27%), Positives = 172/416 (41%), Gaps = 80/416 (19%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADII 85
           G ++ L HR  P SP  +  +  ++    RD L  +  +    ++ ++++      A  I
Sbjct: 57  GSTLALSHRHGPCSPVISKEKPSHEETLRRDQLRAAYIQAKVSSRYNNVAKELQQSAVTI 116

Query: 86  PNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P ++        Y+I ++IGTP   ++   DTGSD+ W QC PC    C  Q   LFDP 
Sbjct: 117 PTSSGYSLGTTEYVITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPA 176

Query: 139 MSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           MS+TY +  C S+QCA L  +   C    CQY V YGDGS + G   ++T++L S+    
Sbjct: 177 MSATYSAFSCGSAQCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSD--- 233

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI---------------AG 241
            A+    FGC     G F  +  G++GLGG   SL+SQ   T                 G
Sbjct: 234 -AVKSFQFGCSHRAAG-FVGELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSGGG 291

Query: 242 NQRLGVS----------TPDIVIDSGTTLTFLPQGYN--SNLLSVMSSMIEAQPVAD--- 286
              LG +          TP +     T      QG      +L+V +S+     V D   
Sbjct: 292 FLTLGAAGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVFSGASVVDSGT 351

Query: 287 ----------------------------PTGSLELCYSFNSLS--QVPEVTIHF-RGADV 315
                                       P GSL+ C+ F+  +   VP VT+ F RGA +
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGSLDTCFDFSGFNTITVPTVTLTFSRGAAM 411

Query: 316 KLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            L  S            +   G T    I GN+ Q  F + +D+  +T+ F+   C
Sbjct: 412 DLDISGILYAGCLAFTATAHDGDTG---ILGNVQQRTFEMLFDVGGRTIGFRSGAC 464


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 13/210 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           GF   L H D+      N+  T  Q L  A+ RS  R+      ++ + +  +   ++  
Sbjct: 30  GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 83

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   YL+ + IG+PP    A+ DTGSDLIWTQC PC    C  Q +P F+P  S++Y SL
Sbjct: 84  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 141

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PCSS+ C +L    C    C Y   YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 142 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 200

Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQM 235
           G  N G LFN   +G+VG G G +SL+SQ+
Sbjct: 201 GNMNAGTLFNG--SGMVGFGRGALSLVSQL 228



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 5/128 (3%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQV 303
           T  ++IDSGTT+TFL Q   + +     + +   +  A P+ + + C+ +      +  +
Sbjct: 311 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTL 370

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           PE+ +HF GAD++L   N+ V         +    ++   I G+    NF + YD+E   
Sbjct: 371 PEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSL 430

Query: 364 VSFKPTDC 371
           +SF P  C
Sbjct: 431 LSFVPAPC 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 13/210 (6%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           GF   L H D+      N+  T  Q L  A+ RS  R+      ++ + +  +   ++  
Sbjct: 27  GFKATLTHVDA------NAGYTKAQLLSRAVARSRARVAALQSLATAADAITAARILLRF 80

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   YL+ + IG+PP    A+ DTGSDLIWTQC PC    C  Q +P F+P  S++Y SL
Sbjct: 81  SEGEYLMDVGIGSPPRYFSAMIDTGSDLIWTQCAPC--LLCVEQPTPYFEPAKSTSYASL 138

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PCSS+ C +L    C    C Y   YGD + S G LA ET T G+ + + VA+P ++FGC
Sbjct: 139 PCSSAMCNALYSPLCFQNACVYQAFYGDSASSAGVLANETFTFGTNSTR-VAVPRVSFGC 197

Query: 207 GTNNGG-LFNSKTTGIVGLGGGDISLISQM 235
           G  N G LFN   +G+VG G G +SL+SQ+
Sbjct: 198 GNMNAGTLFNG--SGMVGFGRGALSLVSQL 225



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 5/128 (3%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF----NSLSQV 303
           T  ++IDSGTT+TFL Q   + +     + +   +  A P+ + + C+ +      +  +
Sbjct: 308 TGGVIIDSGTTVTFLAQPAYAMVQGAFVAWVGLPRANATPSDTFDTCFKWPPPPRRMVTL 367

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           PE+ +HF GAD++L   N+ V         +    ++   I G+    NF + YD+E   
Sbjct: 368 PEMVLHFDGADMELPLENYMVMDGGTGNLCLAMLPSDDGSIIGSFQHQNFHMLYDLENSL 427

Query: 364 VSFKPTDC 371
           +SF P  C
Sbjct: 428 LSFVPAPC 435


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score =  134 bits (338), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)

Query: 31  VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
           + L HR  P +P   +S   +P   L D L     R  +  +  S +++ A       S+
Sbjct: 67  LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 125

Query: 82  ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           A  +P N         Y++ +S+GTP   +    DTGSD+ W QC+PCP   CY Q  PL
Sbjct: 126 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 185

Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           FDP  SS+Y ++PC+++ C+  +L    CSG  C Y VSYGDGS + G  +++T+TL  +
Sbjct: 186 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 245

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                AL G  FGCG    GLF +   G++GLG    SL+SQ  +T  G
Sbjct: 246 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGG 289


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 121/229 (52%), Gaps = 24/229 (10%)

Query: 31  VELIHRDSPKSPFYNSSE--TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-------SQ 81
           + L HR  P +P   +S   +P   L D L     R  +  +  S +++ A       S+
Sbjct: 56  LRLTHRHGPCAPAGKASALGSPPSFL-DTLRADQRRAEYIQRRVSGAAAAAPGMQLAGSK 114

Query: 82  ADIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           A  +P N         Y++ +S+GTP   +    DTGSD+ W QC+PCP   CY Q  PL
Sbjct: 115 AATVPANLGFSIGTLQYVVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPL 174

Query: 135 FDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           FDP  SS+Y ++PC+++ C+  +L    CSG  C Y VSYGDGS + G  +++T+TL  +
Sbjct: 175 FDPTRSSSYSAVPCAAASCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGS 234

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                AL G  FGCG    GLF +   G++GLG    SL+SQ  +T  G
Sbjct: 235 N----ALKGFLFGCGHAQQGLF-AGVDGLLGLGRQGQSLVSQASSTYGG 278


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 119/444 (26%), Positives = 173/444 (38%), Gaps = 89/444 (20%)

Query: 6   SCVF-ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR 64
           SC+   LFFL      P+ + T      L H D  +        T  + LR  + RS  R
Sbjct: 11  SCMLPYLFFLAILFAWPVTSAT--LRAHLSHVDDGRG------FTKRELLRRMVVRSRAR 62

Query: 65  LNHFNQNSSISSSKASQADIIPN---NANYLIRISIGTPPTERLAVA-DTGSDLIWTQCE 120
             +    S  ++  A+      N   N+ YLI +SIG P ++ + +  DTGSD++WTQCE
Sbjct: 63  AANLCPYSGATARPATAPVGRANTDVNSEYLIHLSIGAPRSQPVVLTLDTGSDVVWTQCE 122

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNG 180
           PC  ++C+ Q  P FD   S+T +S+ CS   C + ++  C    C Y   YGDGS S G
Sbjct: 123 PC--AECFTQPLPRFDTAASNTVRSVACSDPLCNAHSEHGCFLHGCTYVSGYGDGSLSFG 180

Query: 181 NLATETVTLGSTTGQA-VALPGITFGCGTNNGGLFNSKTTGIVGL--------------- 224
           +   ++ T     G   V +P I FGCG  N G F    TGI G                
Sbjct: 181 HFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNAGRFLQTETGIAGFGRGPLSLPSQLKVRQ 240

Query: 225 -------------------GGGDIS------------------------LISQMRTTIAG 241
                              G GD+                          +   +    G
Sbjct: 241 FSYCFTTRFEAKSSPVFLGGAGDLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVG 300

Query: 242 NQRLGVSTPDI--------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
             RL V  P+I         IDSGT +T  P      L S   +   A PV       ++
Sbjct: 301 KTRLPV--PEIKADGSGATFIDSGTDITTFPDAVFRQLKSAFIAQ-AALPVNKTADEDDI 357

Query: 294 CYSFN--SLSQVPEVTIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIM 349
           C+S++    + +P++  H  GAD  L R N+  +  E   +  +V         + GN  
Sbjct: 358 CFSWDGKKTAAMPKLVFHLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQ 417

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
           Q N  + YD+    +   P  C K
Sbjct: 418 QQNTHIVYDLAAGKLLLVPAQCDK 441


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 96/331 (29%), Positives = 144/331 (43%), Gaps = 52/331 (15%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y   + +GTPPT  L V DTGSD++W QC PC   QCY Q   +FDP+ S +Y ++
Sbjct: 138 GSGEYFASVGVGTPPTPALLVLDTGSDVVWLQCAPC--RQCYAQSGRVFDPRRSRSYAAV 195

Query: 147 PCSSSQC-----ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            C +  C                 C Y V+YGDGS + G+LATET+       +   +P 
Sbjct: 196 RCGAPPCRGLDAGGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF----ARGARVPR 251

Query: 202 ITFGCGTNNGGLF----------------NSKTTGIVG------LGGGDISLISQMRTT- 238
           +  GCG +N GLF                 ++T    G        G D+   + +RT  
Sbjct: 252 VAVGCGHDNEGLFVAAAGLLGLGRGRLSLPTQTARRYGRRFSYCFQGSDLDHRTIIRTVH 311

Query: 239 --IAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
             + G +  GV               +++DSGT++T L +     +     +      +A
Sbjct: 312 QHVGGARVRGVGERSLRLDPSTGRGGVILDSGTSVTRLARPVYVAVREAFRAAAGGLRLA 371

Query: 286 DPTGSL-ELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITN 340
               SL + CY      + +VP V++H   GA+V L   N+ + V +    C    G   
Sbjct: 372 PGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAEVALPPENYLIPVDTRGTFCLALAGTDG 431

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            V I GNI Q  F V +D ++Q V+  P  C
Sbjct: 432 GVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score =  134 bits (337), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 110/380 (28%), Positives = 172/380 (45%), Gaps = 74/380 (19%)

Query: 57  ALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--YLIRISIGTPPTERLAVADTGSDL 114
           A  RS  RL+        +S+ ++Q+ +  ++    Y +  S+GTPP    A+ADTGSDL
Sbjct: 45  AAHRSRERLSILATRLGAASAGSAQSPLQMDSGGGAYDMTFSMGTPPQTLSALADTGSDL 104

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVN-----C 166
           IW +C  C   +C  + S  + P  SS++  LPCSS+ C +L  +S   C G       C
Sbjct: 105 IWAKCGAC--KRCAPRGSASYYPTKSSSFSKLPCSSALCRTLESQSLATCGGTRARGAVC 162

Query: 167 QYSVSYGDGS----FSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
            Y  SYG  S    ++ G + +ET TLGS      A+ GI FGC T          +G+V
Sbjct: 163 SYRYSYGLSSNPHHYTQGYMGSETFTLGSD-----AVQGIGFGC-TTMSEGGYGSGSGLV 216

Query: 223 GLGGGDISLISQMRT--------------------------------------------- 237
           GLG G +SL+ Q++                                              
Sbjct: 217 GLGRGKLSLVRQLKVGAFSYCLTSDPSTSSPLLFGAGALTGPGVQSTPLVNLKTSTFYTV 276

Query: 238 -----TIAGNQRLGVSTPDIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL 291
                +I   +  G     I+ DSGTTLTFL +  Y      ++S       V    G  
Sbjct: 277 NLDSISIGAAKTPGTGRHGIIFDSGTTLTFLAEPAYTLAEAGLLSQTTNLTRVPGTDG-Y 335

Query: 292 ELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQT 351
           E+C+  +  +  P + +HF G D+ L   N+F  V++ + C + +   + + I GNIMQ 
Sbjct: 336 EVCFQTSGGAVFPSMVLHFDGGDMALKTENYFGAVNDSVSCWLVQKSPSEMSIVGNIMQM 395

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
           ++ + YD+++  +SF+PT+C
Sbjct: 396 DYHIRYDLDKSVLSFQPTNC 415


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  134 bits (336), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 159/359 (44%), Gaps = 83/359 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + +S+GTPP    A+ DTGSDL WTQC PC  + C+ Q +PL+DP  SST+  LPC+S
Sbjct: 96  YHMILSVGTPPLAFPAIIDTGSDLTWTQCAPC-TTACFAQPTPLYDPARSSTFSKLPCAS 154

Query: 151 SQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---LPGITFG 205
             C +L    ++C+   C Y   Y  G F+ G LA +T+ +G   G   A     G+ FG
Sbjct: 155 PLCQALPSAFRACNATGCVYDYRYAVG-FTAGYLAADTLAIGDGDGDGDASSSFAGVAFG 213

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------- 234
           C T NGG  +   +GIVGLG   +SL+SQ                               
Sbjct: 214 CSTANGGDMDGA-SGIVGLGRSALSLLSQIGVGRFSYCLRSDADAGASPILFGALANVTG 272

Query: 235 --------MRTTIAGNQR----------LGVSTPDIVIDS-----------------GTT 259
                   +R  +A  +R          + V + D+ + S                 GTT
Sbjct: 273 DKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIVDSGTT 332

Query: 260 LTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL-SQVPEVTIHFR-GA 313
            T+L + GY       LS  + ++    V+      +LC+   +  + VP +   F  GA
Sbjct: 333 FTYLAEAGYTMLRQAFLSQTAGLLTR--VSGAQFDFDLCFEAGAADTPVPRLVFRFAGGA 390

Query: 314 DVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +  + R ++F  V E   V  +    T  V + GN+MQ +  V YD++  T SF P DC
Sbjct: 391 EYAVPRQSYFDAVDEGGRVACLLVLPTRGVSVIGNVMQMDLHVLYDLDGATFSFAPADC 449


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  134 bits (336), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 28/207 (13%)

Query: 29  FSVELIHRDSPK-SPFYNSSETPYQRLRDALTRSLNRLNHFNQN---------------- 71
           +SV+L+HRDS       N++ +  +RL + L R   R+    Q                 
Sbjct: 71  WSVQLVHRDSLLFKGAANATASYERRLEEKLRREAARVRALEQRIERKLKLKKDPAGSYE 130

Query: 72  --SSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             + +++   S+  + +   +  Y  RI IGTP  E+  V DTGSD++W QCEPC   +C
Sbjct: 131 NVAGVTAEFGSEVVSGMEQGSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--REC 188

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S ++ ++ C S+ C+ L+   C G  C Y VSYGDGS++ G+ ATET+
Sbjct: 189 YSQADPIFNPSSSVSFSTVGCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETL 248

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G+T+ Q VA+     GCG +N GLF
Sbjct: 249 TFGTTSIQNVAI-----GCGHDNVGLF 270



 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 50/124 (40%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T L       L     +  +  P AD     + CY  ++L  V  P V  H
Sbjct: 372 IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFH 431

Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GA   L   N  + + S    C  F    +++ I GNI Q    V +D     V F 
Sbjct: 432 FSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFA 491

Query: 368 PTDC 371
              C
Sbjct: 492 IDQC 495


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/194 (42%), Positives = 116/194 (59%), Gaps = 11/194 (5%)

Query: 45  NSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN--ANYLIRISIGTPPT 102
           + S T  Q +R AL R ++R N     +S SS     A + P      +L+ ++IGTPP 
Sbjct: 38  DPSVTASQFVRAALHRDMHRHNARKLAAS-SSDGTVSAPVSPTTVPGEFLMTLAIGTPPL 96

Query: 103 ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS 162
             LA+ADTGSDLIWTQC PC   QC+ Q +PL++P  S+T+ +LPC+SS    L   +C+
Sbjct: 97  PFLAIADTGSDLIWTQCAPC-SRQCFQQPTPLYNPSSSTTFSALPCNSS--LGLCAPACA 153

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAVALPGITFGCGTNNGGLFNSKTTGI 221
              C Y+++YG G ++     TET T GS+T    V +PGI FGC   + G   S  +G+
Sbjct: 154 ---CMYNMTYGSG-WTYVFQGTETFTFGSSTPADQVRVPGIAFGCSNASSGFNASSASGL 209

Query: 222 VGLGGGDISLISQM 235
           VGLG G +SL+SQ+
Sbjct: 210 VGLGRGSLSLVSQL 223



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/136 (29%), Positives = 70/136 (51%), Gaps = 16/136 (11%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S++   P  D + +  L+LC+   S +     +P 
Sbjct: 303 LIIDSGTTITMLGNTAYQQVRAAVLSLVTL-PTTDGSAATGLDLCFELPSSTSAPPSMPS 361

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIV-----CSVFKGITNS----VPIYGNIMQTNFLVG 356
           +T+HF GAD+ L   N+ + +S+        C   +  T++    V I GN  Q N  + 
Sbjct: 362 MTLHFDGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHIL 421

Query: 357 YDIEQQTVSFKPTDCT 372
           YD+ ++T+SF P  C+
Sbjct: 422 YDVGKETLSFAPAKCS 437


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 121/231 (52%), Gaps = 36/231 (15%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------------- 73
           G  V L H D+      + + +  Q L+ A  RS +R++     ++              
Sbjct: 44  GLRVRLTHVDA------HGNYSRLQLLQRAARRSHHRMSRLVARATGAASTSSSKAAAAG 97

Query: 74  -ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             S  K  Q  +   N  +L+ +S+GTP     A+ DTGSDL+WTQC+PC   +C+ Q +
Sbjct: 98  DGSGGKDLQVPVHAGNGEFLMDLSVGTPALPYAAIVDTGSDLVWTQCKPC--VECFNQTT 155

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ--------YSVSYGDGSFSNGNLAT 184
           P+FDP  SSTY +LPCSS+ CA L   +C+  +          Y+ +YGD S + G LAT
Sbjct: 156 PVFDPAASSTYAALPCSSALCADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLAT 215

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           ET TL         +PG+ FGCG  N G   ++  G+VGLG G +SL+SQ+
Sbjct: 216 ETFTLARQK-----VPGVAFGCGDTNEGDGFTQGAGLVGLGRGPLSLVSQL 261



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 39/132 (29%), Positives = 67/132 (50%), Gaps = 12/132 (9%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS-------Q 302
           +++DSGT++T+L  + Y +   + ++ M  + P  D +   L+LC+   + +       Q
Sbjct: 345 VIVDSGTSITYLELRAYRALRKAFVAHM--SLPTVDASEIGLDLCFQGPAGAVDQDVQVQ 402

Query: 303 VPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           VP++ +HF G AD+ L   N+ V  S      +    +  + I GN  Q NF   YD+  
Sbjct: 403 VPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGLSIIGNFQQQNFQFVYDVAG 462

Query: 362 QTVSFKPTDCTK 373
            T+SF P +C K
Sbjct: 463 DTLSFAPAECNK 474


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  133 bits (334), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/188 (41%), Positives = 111/188 (59%), Gaps = 9/188 (4%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADI---IPNNANYLIRISIGTPPTERLAVADT 110
           +RDAL R ++R   F +  + S  +   A     +PN   Y++ ++IGTPP    A+ADT
Sbjct: 48  VRDALRRDMHRHARFTRELASSGDRTVAAPTRKDLPNGGEYIMTLAIGTPPLSYPAIADT 107

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQ 167
           GSDLIWTQC PC  SQC+ Q    ++P  S+T+  LPC+S  S CA+L   S   G +C 
Sbjct: 108 GSDLIWTQCAPC-GSQCFKQAGQPYNPSSSTTFGVLPCNSSVSMCAALAGPSPPPGCSCM 166

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
           Y+ +YG G ++ G  + ET T GST      +PGI FGC   +   +N  + G+VGLG G
Sbjct: 167 YNQTYGTG-WTAGIQSVETFTFGSTPADQTRVPGIAFGCSNASSDDWNG-SAGLVGLGRG 224

Query: 228 DISLISQM 235
            +SL+SQ+
Sbjct: 225 SMSLVSQL 232



 Score = 67.4 bits (163), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 42/131 (32%), Positives = 71/131 (54%), Gaps = 9/131 (6%)

Query: 249 TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ---- 302
           T  ++IDSGTT+T L       + + + S++   PVAD + S  L+LC++  S +     
Sbjct: 312 TGGLIIDSGTTITSLVDAAYQQVRAAIESLVTL-PVADGSDSTGLDLCFALTSETSTPPS 370

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQ 361
           +P +T HF GAD+ L   N+ + +   + C   +  T  ++  +GN  Q N  + YDI +
Sbjct: 371 MPSMTFHFDGADMVLPVDNYMI-LGSGVWCLAMRNQTVGAMSTFGNYQQQNVHLLYDIHE 429

Query: 362 QTVSFKPTDCT 372
           +T+SF P  C+
Sbjct: 430 ETLSFAPAKCS 440


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 104/355 (29%), Positives = 148/355 (41%), Gaps = 79/355 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  GTP      + DTGSD+ W QC PC    CY Q  P+FDP  S+TY  +
Sbjct: 131 DTLEFVVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSVV 189

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC   QCA+ +   CS   C Y V YGDGS S G L+ ET++L ST     ALPG  FGC
Sbjct: 190 PCGHPQCAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLTSTR----ALPGFAFGC 245

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN--------------QRLGVSTP-- 250
           G  N G F     G++GLG G +SL SQ   +  G                 +G +TP  
Sbjct: 246 GQTNLGDFG-DVDGLIGLGRGQLSLSSQAAASFGGTFSYCLPSDNTTHGYLTIGPTTPAS 304

Query: 251 -------------------------------------DIVIDSGT------TLTFL-PQG 266
                                                 +  D GT       LT+L P+ 
Sbjct: 305 NDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFTDDGTFLDSGTILTYLPPEA 364

Query: 267 YNS--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF- 323
           Y +  +      +  +  P  DP    + CY F   S +    + F+ +D  +   +FF 
Sbjct: 365 YTALRDRFKFTMTQYKPAPAYDP---FDTCYDFTGQSAIFIPAVSFKFSDGSVFDLSFFG 421

Query: 324 VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + +  D     I C  F    +++P  I GN+ Q N  V YD+  + + F    C
Sbjct: 422 ILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFASASC 476


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/192 (41%), Positives = 107/192 (55%), Gaps = 15/192 (7%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
           R+ L R   RL  F+ +   +S++      A+ +P+   YL+ ++IGTPP     + DTG
Sbjct: 378 REVLHRMAARL-LFSASGRAASARVDPGPYANGVPDT-EYLVHLAIGTPPQPVQLILDTG 435

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-----C 166
           SDL+WTQC PCP   C+ +     DP  SST+  LPCSS  C +L   SC   N     C
Sbjct: 436 SDLVWTQCRPCP--VCFSRALGPLDPSNSSTFDVLPCSSPVCDNLTWSSCGKHNWGNQTC 493

Query: 167 QYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            Y  +Y DGS + G+L  ET T  +   TGQA  +P + FGCG  N G+F S  TGI G 
Sbjct: 494 VYVYAYADGSITTGHLDAETFTFAAADGTGQAT-VPDLAFGCGLFNNGIFTSNETGIAGF 552

Query: 225 GGGDISLISQMR 236
           G G +SL SQ++
Sbjct: 553 GRGALSLPSQLK 564



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 62/130 (47%), Gaps = 11/130 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL--ELCYSF----NSLSQVPEV 306
           +IDSGT +T LPQ     +    ++ +   PV + T S    LC+SF     +   VP++
Sbjct: 646 IIDSGTGMTTLPQDAYKLVHDAFTAQVRL-PVDNATSSSLSRLCFSFSVPRRAKPDVPKL 704

Query: 307 TIHFRGADVKLSRSNFFVKVSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
            +HF GA + L R N+  +  +    + C       + + I GN  Q N  V YD+ +  
Sbjct: 705 VLHFEGATLDLPRENYMFEFEDAGGSVTCLAINA-GDDLTIIGNYQQQNLHVLYDLVRNM 763

Query: 364 VSFKPTDCTK 373
           +SF P  C +
Sbjct: 764 LSFVPAQCNR 773


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 87/245 (35%), Positives = 123/245 (50%), Gaps = 20/245 (8%)

Query: 1   MATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           M+  L+   I   L     +P    T     +L H D  +        T ++RL     R
Sbjct: 6   MSELLAYALIFTLLFTAAATPTAGLT--MRADLTHVDKGRG------FTRWERLSRMAVR 57

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQC 119
           S  R     Q       +   A  +P++  YLI  +IGTP  +R+A+  DTGSDL+WTQC
Sbjct: 58  SRARAASLYQRGG-HYGQPVTATAVPSSGEYLIHFNIGTPRPQRVALTMDTGSDLVWTQC 116

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC---ASLNQKSCS--GVNCQYSVSYGD 174
            PCP   C+ Q  PLFDP +SST++++ C    C   + L+  +C+     C Y  SYGD
Sbjct: 117 TPCP--VCFDQPFPLFDPSVSSTFRAVACPDPICRPSSGLSVSACALKTFRCFYLCSYGD 174

Query: 175 GSFSNGNLATETVTLGSTTGQA---VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
            S + G +  +T T  S  G+    VA+ G+ FGCG  N G+F S  +GI G G G +SL
Sbjct: 175 KSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAFGCGDYNTGVFASNESGIAGFGRGPLSL 234

Query: 232 ISQMR 236
            SQ+R
Sbjct: 235 PSQLR 239



 Score = 46.6 bits (109), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 56/127 (44%), Gaps = 7/127 (5%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQ---VPEVT 307
           VIDSGT +T  P      L +   + +   P  D T  +   LC+      +   VP++ 
Sbjct: 325 VIDSGTGVTTFPAAVFEQLKNEFVAQLPL-PRYDNTSEVGNLLCFQRPKGGKQVPVPKLI 383

Query: 308 IHFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
            H   AD+ L R N+  + ++  ++C +  G    + + GN  Q N  + YD+E   + F
Sbjct: 384 FHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVENSKLLF 443

Query: 367 KPTDCTK 373
               C K
Sbjct: 444 ASAQCDK 450


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 92/324 (28%), Positives = 138/324 (42%), Gaps = 66/324 (20%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DTGSD+ W QC+PCP  QCY Q   LF P  S+TYK LPC+S+ C  L     SC   +C
Sbjct: 6   DTGSDITWIQCDPCP--QCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLNSSC 63

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
            Y VSYGD S + G+ A ET+TL S     V++P   FGCG  N GLFN    G++GLG 
Sbjct: 64  NYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNG-AAGLMGLGK 122

Query: 227 GDISLISQMRTTIAGNQRLGVSTPDI-------------------------VIDS----- 256
             I   +Q  T++A  +      P +                         ++DS     
Sbjct: 123 SSIGFPAQ--TSVAFGKVFSYCLPSVSSTIPSGILHFGEAAMLDYDVRFTPLVDSSSGPS 180

Query: 257 --------------------------GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
                                     GT ++   Q     L    + ++     A     
Sbjct: 181 QYFVSMTGINVGDELLPISATVMVDSGTVISRFEQSAYERLRDAFTQILPGLQTAVSVAP 240

Query: 291 LELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 347
            + C+  +++    +P +T+HFR  A+++LS  +    V + ++C  F   ++   + GN
Sbjct: 241 FDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHILYPVDDGVMCFAFAPSSSGRSVLGN 300

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
             Q N    YDI +  +     +C
Sbjct: 301 FQQQNLRFVYDIPKSRLGISAFEC 324


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 121/429 (28%), Positives = 177/429 (41%), Gaps = 98/429 (22%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
           +HRDS  SP+  ++ T +  +R+ L R   RL   +   S+  +   K+S  + + N   
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 88  -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
                            +  Y + + +GTPP     VADTGSD++W QC PC    CY Q
Sbjct: 61  FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
             PLF+P  SST++S+ C SS C  L  + C    C Y VSYGDGSF+ G  +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178

Query: 191 STTGQAVALPGITFGCGTNNGGLFNS---------------------------------K 217
           S    +VA+     GCG NN GLF                                   +
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLFTGAAGLLGLGKGLLSFPSQVGQLYGSVFSYCLPTRE 233

Query: 218 TTGIVGLGGGDISLISQMR-TTIAGNQRL----------------GVSTP---------- 250
           +TG V L  G+ ++ S  + TT+  N +L                 VS P          
Sbjct: 234 STGSVPLIFGNQAVASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSST 293

Query: 251 ---DIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--P 304
               +++DSGT +T L    YN    +  + M     +       + CY  +  S +  P
Sbjct: 294 GNGGVILDSGTAVTRLVTSAYNPMRDAFRAGMPSDAKMTSGFSLFDTCYDLSGRSSIMLP 353

Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
            V+  F  GA + L   N  V V +    C  F   + +  I GNI Q +F + +D    
Sbjct: 354 AVSFVFNGGATMALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGN 413

Query: 363 TVSFKPTDC 371
            V      C
Sbjct: 414 RVGIGANQC 422


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 75/230 (32%), Positives = 119/230 (51%), Gaps = 27/230 (11%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------- 73
           + A +    + L+HRD      + ++ TP Q L   L R + R       ++        
Sbjct: 61  VAASSSTLHIRLLHRDR-----FAANATPAQLLARRLQRDVLRAAWIISKAAANGTPPPV 115

Query: 74  --ISSSKASQADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             +SS++   A ++   P +  Y+ +I++GTP  E L   DT SDL W QC+PC   +CY
Sbjct: 116 AGLSSARGFVAPVVSRAPTSGEYIAKIAVGTPGVEALLALDTASDLTWLQCQPC--RRCY 173

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDGSFSNGNLATE 185
            Q  P+FDP+ S++Y+ +  +++ C +L +          C Y+V YGDGS + G+   E
Sbjct: 174 PQSGPVFDPRHSTSYREMSFNAADCQALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEE 233

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           T+T        V LP I+ GCG +N GLF +   GI+GLG G +S  +Q+
Sbjct: 234 TLTFAG----GVRLPRISIGCGHDNKGLFGAPAAGILGLGRGLMSFPNQI 279



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/129 (31%), Positives = 65/129 (50%), Gaps = 11/129 (8%)

Query: 252 IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPE 305
           +++DSGT +T L +     +     +V   + +   +  P+G  + CY+     + +VP 
Sbjct: 364 VIVDSGTAVTRLARPAYTAFRDAFRAVAVDLGQVS-IGGPSGFFDTCYTVGGRGMKKVPT 422

Query: 306 VTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQ 362
           V++HF G+ +VKL   N+ + V S   VC  F     +SV I GNI Q  F + YDI  +
Sbjct: 423 VSMHFAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIGGR 482

Query: 363 TVSFKPTDC 371
            V F P  C
Sbjct: 483 -VGFAPNSC 490


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 26/221 (11%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
           ++HR  P SP           +  A  L R   R++  ++         S +  ++AS  
Sbjct: 73  VVHRHGPCSPVQARRRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132

Query: 81  ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
                 Q  I     NY++ + +GTP  +   + DTGSDL W QC+PC  + CY Q  PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           FDP +SSTY ++ C + +C  L+   CS    C+Y V YGD S ++GNL  +T+TL ++ 
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
                LPG  FGCG  N GLF  +  G+ GLG   +SL SQ
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQ 286



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 5/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
           VIDSGT +T LP    + L +  +  +     A     L+ CY F  +  +Q+P V + F
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF 420

Query: 311 RG-ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G A V L  +   +  KVS+  +        +S+ I GN  Q  F V YD+  Q + F 
Sbjct: 421 AGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFG 480

Query: 368 PTDCT 372
              C+
Sbjct: 481 AKGCS 485


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 100/340 (29%), Positives = 159/340 (46%), Gaps = 32/340 (9%)

Query: 43  FYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA-DIIPNNANYLIRISIGTPP 101
           +Y+ + T   R   A  RS+  LN+    +S SSS    +  ++P    Y++   +G P 
Sbjct: 8   YYDHNMTSTDRSIWAADRSIAXLNYLLSVTSSSSSLGDISSKLVPEYYEYIMMYYLGVPS 67

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
           T    +ADTGS+LIW QC PC  + CY Q  P+FDP  S TY+++   S  C ++ + SC
Sbjct: 68  TLVYGIADTGSELIWLQCLPC--THCYNQTPPIFDPAESYTYETVSSDSPICNAVRRISC 125

Query: 162 --SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
                +C Y  +YGDG+ + G L+T+       T   V +  +TFGC  +          
Sbjct: 126 REGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDTKARLKGHQA 185

Query: 220 GIVGLGGGDISLISQMRTTI-----------AGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
           G+VGL     SL+SQ++                  R+   +  +++   T L  L   Y+
Sbjct: 186 GVVGLNRHPNSLVSQLKVKKFSYCMVIPDDHGSGSRMYFGSRAVILGGKTPL--LKGDYS 243

Query: 269 SNLLSVMS-SMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVS 327
              +++   S+ E +  +D   S             P++T HF GAD  L++   +V+V 
Sbjct: 244 HYFVTLKGISVGEEKGRSDELASAG-----------PDITFHFYGADFILTKXTTYVEVE 292

Query: 328 EDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           + + C        T  + I GNI Q N+ VGYD+E Q V+
Sbjct: 293 KGLWCLAMLSSNSTRKLSILGNIQQQNYHVGYDLEAQEVA 332



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/112 (33%), Positives = 53/112 (47%), Gaps = 3/112 (2%)

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFS-NGN 181
           +QC+ Q  P+FDP  SSTY ++P  +  C      +C     +C Y +SYG GS S  G 
Sbjct: 332 AQCFNQTPPIFDPSKSSTYSTVPWDAPTCYQAGGYACHIDEEDCCYRISYGSGSTSTEGT 391

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           ++ +           V +  + FGC     G F     GIVGL    +SL+S
Sbjct: 392 ISIDAFAFEDNRQNMVDVXHLVFGCSDYTTGTFKGYEVGIVGLNQDSLSLVS 443


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 127/418 (30%), Positives = 180/418 (43%), Gaps = 79/418 (18%)

Query: 22  IEAQTGGFSVELIHRDSPKS--PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA 79
           +   +G  +V L HR  P S  P  N+        RD L  +     +   N S    + 
Sbjct: 50  VAPSSGVVTVPLHHRHGPCSTVPSTNAPTLEDMLRRDQLRAAYITRKYSGVNGSAGDVEG 109

Query: 80  SQADIIP------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           S   +        +   YLI + +G+P   +  + DTGSD+ W QC+PC  SQC+ Q   
Sbjct: 110 SDVTVPTTLGTSLDTLEYLITVGMGSPAVAQTMLIDTGSDVSWVQCKPC--SQCHSQADS 167

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           LFDP  SSTY +  C+S+ CA L Q+ CS   CQY+V YGDGS  +G  +++T+ LGS+T
Sbjct: 168 LFDPSSSSTYSAFSCTSAACAQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLALGSST 227

Query: 194 GQAVALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQMRTTI------------- 239
                +    FGC  + +G L   +T G++GLGGG  SL +Q   T              
Sbjct: 228 -----VENFQFGCSQSESGNLLQDQTAGLMGLGGGAESLATQTAGTFGKAFSYCLPPTPG 282

Query: 240 -AGNQRLGVSTPDIVI-----------------------------------------DSG 257
            +G   LG ST   V+                                         DSG
Sbjct: 283 SSGFLTLGASTSGFVVKTPMLRSTQVPSYYGVLLQAIRVGGRQLNIPASAFSAGSIMDSG 342

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADV 315
           T +T LP+   S L S   + ++  P A P G  + C+ F+  S V  P V + F G  V
Sbjct: 343 TIITRLPRTAYSALSSAFKAGMKQYPPAQPMGIFDTCFDFSGQSSVSIPTVALVFSGGAV 402

Query: 316 KLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
               S+  +  S    C  F   ++  S+ I GN+ Q  F V YD+    V FK   C
Sbjct: 403 VDLASDGIILGS----CLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 80/213 (37%), Positives = 115/213 (53%), Gaps = 21/213 (9%)

Query: 31  VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-N 87
           V + HRD+  P  P         QRL     R  + ++   +  S   S       IP  
Sbjct: 29  VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IPFE 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ +P
Sbjct: 83  SGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRVP 140

Query: 148 CSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           CSS QC +L    C     +G  C+Y V+YGDGS S G+LAT+ +   + T     +  +
Sbjct: 141 CSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNNV 196

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           T GCG +N GLF+S   G++G+G G IS+ +Q+
Sbjct: 197 TLGCGRDNEGLFDS-AAGLLGVGRGKISISTQV 228



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 13/134 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 306
           +V+DSGT ++   +   + L     +   A  +    G     + CY       +  P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371

Query: 307 TIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
            +HF G AD+ L   N+F+       + +    C  F+   + + + GN+ Q  F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 359 IEQQTVSFKPTDCT 372
           +E++ + F P  CT
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 69/154 (44%), Positives = 86/154 (55%), Gaps = 7/154 (4%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ +++GTPP       DTGSDL+WTQC PC    C+ Q  PL DP  SSTY +LPC 
Sbjct: 85  EYLVHLAVGTPPRPVALTLDTGSDLVWTQCAPC--RDCFDQGIPLLDPAASSTYAALPCG 142

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-----TGQAVALPGITF 204
           + +C +L   SC G +C Y   YGD S + G +AT+  T G        G   A   +TF
Sbjct: 143 APRCRALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTF 202

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           GCG  N G+F S  TGI G G G  SL SQ+  T
Sbjct: 203 GCGHFNKGVFQSNETGIAGFGRGRWSLPSQLNAT 236


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  132 bits (331), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 89/216 (41%), Positives = 124/216 (57%), Gaps = 18/216 (8%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRL--RD-ALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           S++L+HRD+     + S       L  RD A    L R    + + S +SS  S   I+ 
Sbjct: 58  SLQLLHRDTVSGTKHPSRRHAVLALASRDTARVAYLQRRLSPSPSPSSTSSVESGGTIVS 117

Query: 87  N-NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + +  YL+R+ IG+PP E+  VADTGSD+IW QC PC  S CY Q  PLFDP  S+++  
Sbjct: 118 HGSGEYLVRVGIGSPPLEQHLVADTGSDVIWVQCSPC--SDCYAQGDPLFDPANSASFSP 175

Query: 146 LPCSSSQCASLNQ-----KSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVAL 199
           +PC+S  C +  +         G  C+Y VSYGD S++NG LA ET+TL G T  Q VA+
Sbjct: 176 VPCNSGVCRAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTEVQGVAM 235

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                GCG  N GLF ++  G++GLG G +SL+ Q+
Sbjct: 236 -----GCGHENRGLF-AEAAGLLGLGWGPMSLVGQL 265



 Score = 46.2 bits (108), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 53/131 (40%), Gaps = 11/131 (8%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
           +V+D+GT +T LP + Y +   +   +  E  P A      + CY  +  +  +VP V +
Sbjct: 349 VVMDTGTAVTRLPAEAYAALRGAFAGAFEEGAPRAPGVSLFDTCYDLSGYASVRVPTVAL 408

Query: 309 HF-------RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           +F         A + L   N  V V +    C  F  + +   I GNI Q    +  D  
Sbjct: 409 YFGGGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSA 468

Query: 361 QQTVSFKPTDC 371
              V F P  C
Sbjct: 469 SGYVGFGPATC 479


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 79/221 (35%), Positives = 113/221 (51%), Gaps = 26/221 (11%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDA--LTRSLNRLNHFNQN--------SSISSSKAS-- 80
           ++HR  P SP           +  A  L R   R++  ++         S +  ++AS  
Sbjct: 73  VVHRHGPCSPVQARPRGGGGAVTHAEILERDQARVDSIHRKVAGAGGAPSVVDPARASEQ 132

Query: 81  ------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
                 Q  I     NY++ + +GTP  +   + DTGSDL W QC+PC  + CY Q  PL
Sbjct: 133 GVSLPAQRGISLGTGNYVVSVGLGTPAKQYAVIFDTGSDLSWVQCKPC--ADCYEQQDPL 190

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           FDP +SSTY ++ C + +C  L+   CS    C+Y V YGD S ++GNL  +T+TL ++ 
Sbjct: 191 FDPSLSSTYAAVACGAPECQELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTLSASD 250

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
                LPG  FGCG  N GLF  +  G+ GLG   +SL SQ
Sbjct: 251 ----TLPGFVFGCGDQNAGLFG-QVDGLFGLGREKVSLPSQ 286



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 57/125 (45%), Gaps = 5/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
           VIDSGT +T LP    + L +  +  +     A     L+ CY F  +  +Q+P V + F
Sbjct: 361 VIDSGTVITRLPPRAYAPLRAAFARSMAQYKKAPALSILDTCYDFTGHRTAQIPTVELAF 420

Query: 311 RG-ADVKLSRSN--FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G A V L  +   +  KVS+  +        +S+ I GN  Q  F V YD+  Q + F 
Sbjct: 421 AGGATVSLDFTGVLYVSKVSQACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFG 480

Query: 368 PTDCT 372
              C+
Sbjct: 481 AKGCS 485


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 102/372 (27%), Positives = 159/372 (42%), Gaps = 81/372 (21%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 140 ESGVAVGSGEYLVEVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFDQRGPVFDPMAS 197

Query: 141 STYKSLPCSSSQC-------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           ++Y+++ C  ++C       A    +S     C Y   YGD S + G+LA E  T+  T 
Sbjct: 198 TSYRNVTCGDTRCGLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 257

Query: 194 GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG------------ 225
             +  + G+  GCG  N GLF+                S+   + G              
Sbjct: 258 SSSRRVDGVVLGCGHRNRGLFHGAAGLLGLGRGPLSFASQLRAVYGHAFSYCLVDHGSAV 317

Query: 226 ------GGDISLIS---------------------QMRTTIAGNQRL-------GVSTPD 251
                 G D  L+S                     Q++  + G + L       GVS  D
Sbjct: 318 GSKIVFGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKED 377

Query: 252 ----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVP 304
                +IDSGTTL++ P+  Y +   + +  M +A P+      L  CY+ + +   +VP
Sbjct: 378 GSGGTIIDSGTTLSYFPEPAYKAIRQAFVDRMDKAYPLIADFPVLSPCYNVSGVERVEVP 437

Query: 305 EVTIHFR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQ 361
           E ++ F  GA       N+F+++ +E I+C    G   S + I GN  Q NF V YD+  
Sbjct: 438 EFSLLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHH 497

Query: 362 QTVSFKPTDCTK 373
             + F P  C +
Sbjct: 498 NRLGFAPRRCAE 509


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  131 bits (330), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 83/233 (35%), Positives = 121/233 (51%), Gaps = 24/233 (10%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS 77
           V +P +A     ++ ++H   P SP  +    P     + L R  +R++   +  +  ++
Sbjct: 52  VCTPTKAAPSSSALTVVHGHGPCSPQESRRGAPSHT--EILGRDQDRVDAIRRKVAAVTT 109

Query: 78  KASQADI--IP---------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
            AS +    +P         +  NY   + +GTP T+ L   DTGSD  W QC+PCP   
Sbjct: 110 AASSSKPKGVPLQVGWGKYLDTTNYFTSLRLGTPATDLLVELDTGSDQSWIQCKPCP--D 167

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNL 182
           CY Q   LFDP  SSTY  + CSS +C  L   ++ +CS    C Y ++Y D S++ GNL
Sbjct: 168 CYEQHEALFDPSKSSTYSDITCSSRECQELGSSHKHNCSSDKKCPYEITYADDSYTVGNL 227

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           A +T+TL  T     A+PG  FGCG NN G F  +  G++GLG G  SL SQ+
Sbjct: 228 ARDTLTLSPTD----AVPGFVFGCGHNNAGSFG-EIDGLLGLGRGKASLSSQV 275



 Score = 40.4 bits (93), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 34/125 (27%), Positives = 53/125 (42%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHF 310
           +IDSGT  + LP    + L S + S +     A  +   + CY    +   ++P V + F
Sbjct: 352 IIDSGTAFSCLPPSAYAALRSSVRSAMGRYKRAPSSTIFDTCYDLTGHETVRIPSVALVF 411

Query: 311 R-GADVKLSRSNF---FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
             GA V L  S     +  VS+  +  +      S+ + GN  Q    V YD++ Q V F
Sbjct: 412 ADGATVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGF 471

Query: 367 KPTDC 371
               C
Sbjct: 472 GANGC 476


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 69/140 (49%), Positives = 89/140 (63%), Gaps = 8/140 (5%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           IGTP     A+ DTGSDL+WTQC+PC    C+ Q +P+FDP  SSTY ++PCSS+ C+ L
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPC--VDCFKQSTPVFDPSSSSTYATVPCSSASCSDL 230

Query: 157 NQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
               C S   C Y+ +YGD S + G LATET TL  +      LPG+ FGCG  N G   
Sbjct: 231 PTSKCTSASKCGYTYTYGDSSSTQGVLATETFTLAKSK-----LPGVVFGCGDTNEGDGF 285

Query: 216 SKTTGIVGLGGGDISLISQM 235
           S+  G+VGLG G +SL+SQ+
Sbjct: 286 SQGAGLVGLGRGPLSLVSQL 305



 Score = 64.7 bits (156), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 45/133 (33%), Positives = 67/133 (50%), Gaps = 11/133 (8%)

Query: 249 TPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLS----Q 302
           T  +++DSGT++T+L  QGY +   +  + M  A P AD +G  L+LC+   +      +
Sbjct: 386 TGGVIVDSGTSITYLEVQGYRALKKAFAAQM--ALPAADGSGVGLDLCFRAPAKGVDQVE 443

Query: 303 VPEVTIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           VP +  HF G AD+ L   N+ V       +C    G +  + I GN  Q NF   YD+ 
Sbjct: 444 VPRLVFHFDGGADLDLPAENYMVLDGGSGALCLTVMG-SRGLSIIGNFQQQNFQFVYDVG 502

Query: 361 QQTVSFKPTDCTK 373
             T+SF P  C K
Sbjct: 503 HDTLSFAPVQCNK 515


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 116/410 (28%), Positives = 172/410 (41%), Gaps = 86/410 (20%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH--------FNQNSSISSSKASQAD 83
           +LIHRDS  SP+Y S++T   R    +  SL RL++        F+ N    +   S ++
Sbjct: 40  KLIHRDSIVSPYYRSNDTVADRTERTMKASLARLSYLYAKIERDFDINDLWLNLHPSASE 99

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSST 142
            +     +L+  S+G PP  +LA+ DTGS L+W QC PC    C  Q   P+FDP +SST
Sbjct: 100 PL-----FLVNFSMGQPPVPQLAIMDTGSSLLWIQCAPC--KSCSQQIIGPMFDPSISST 152

Query: 143 YKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           Y SL C +  C       C S   C Y+ +Y +G  S G +ATE +  GS+     A+  
Sbjct: 153 YDSLSCKNIICRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNN 212

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA--------------------G 241
           + FGC   NG   + + TG+ GLG G  S+++QM +  +                    G
Sbjct: 213 VLFGCSHRNGNYKDRRFTGVFGLGSGITSVVNQMGSKFSYCIGNIADPDYSYNQLVLSEG 272

Query: 242 NQRLGVSTP------------------------------------DIVIDSGTTLTFLPQ 265
               G STP                                     ++IDSGT  T+L +
Sbjct: 273 VNMEGYSTPLDVVDGHYQVILEGISVGETRLVIDPSAFKRTEKQRRVIIDSGTAPTWLAE 332

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHF-RGADVKLSRSN 321
                L   + ++++         S  LCY       L   P VT HF  GAD+      
Sbjct: 333 NEYRALEREVRNLLDRFLTPFMRESF-LCYKGKVGQDLVGFPAVTFHFAEGADL------ 385

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             V  +E    SV+        + G + Q  + V YD+ +  + F+  DC
Sbjct: 386 --VVDTEMRQASVYGKDFKDFSVIGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  131 bits (329), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 83/205 (40%), Positives = 115/205 (56%), Gaps = 32/205 (15%)

Query: 54  LRDALTRSLNRLNHF-----NQNSSISSSKASQADIIPNNA------NYLIRISIG---- 98
           LR  L    +R N F     N  ++ +S+++  A++   +       NY+  I++G    
Sbjct: 137 LRRLLAADESRANSFQLRIRNDRAAAASTQSGSAEVPLTSGIRFQTLNYVTTIALGGGSS 196

Query: 99  -TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASL 156
            +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY ++ C++S C ASL
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAAVRCNASACAASL 254

Query: 157 NQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
                   SC G N  C Y+++YGDGSFS G LAT+TV LG  +     L G  FGCG +
Sbjct: 255 KAATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVALGGAS-----LDGFVFGCGLS 309

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQ 234
           N GLF   T G++GLG  ++SL+SQ
Sbjct: 310 NRGLFGG-TAGLMGLGRTELSLVSQ 333



 Score = 43.9 bits (102), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 36/135 (26%), Positives = 56/135 (41%), Gaps = 9/135 (6%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ--PVADPTGSLELCYSFNSLSQV 303
           G+   +++IDSGT +T L       + +  +    A   P A     L+ CY      +V
Sbjct: 406 GLGASNVLIDSGTVITRLAPSVYRGVRAEFTRQFAAAGYPTAPGFSILDTCYDLTGHDEV 465

Query: 304 --PEVTIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVG 356
             P +T+   G A+V +  +     V +D   VC     ++  +  PI GN  Q N  V 
Sbjct: 466 KVPLLTLRLEGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVV 525

Query: 357 YDIEQQTVSFKPTDC 371
           YD     + F   DC
Sbjct: 526 YDTVGSRLGFADEDC 540


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  130 bits (328), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 102/386 (26%), Positives = 159/386 (41%), Gaps = 78/386 (20%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
           D +TR L+ L   N ++  ++S A Q  ++      +  Y  R+ IG+P  +   V DTG
Sbjct: 129 DGVTR-LD-LRPANGSAVFAASAAIQGPVVSGVGQGSGEYFSRVGIGSPARQLYMVLDTG 186

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYS 169
           SD+ W QC+PC  + CY Q  P+FDP +S++Y ++ C S +C  L+  +C      C Y 
Sbjct: 187 SDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGACLYE 244

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF--------------- 214
           V+YGDGS++ G+ ATET+TLG +T     +  +  GCG +N GLF               
Sbjct: 245 VAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAGLLALGGGPLS 300

Query: 215 -----------------NSKTTGIVGLGGGDIS-------LISQMRTTI----------A 240
                            +S     +  G G          L+   RT+            
Sbjct: 301 FPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVALSGISV 360

Query: 241 GNQRLGVSTPDIVIDS-----------GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
           G Q L +      +D+           GT +T L     + L         + P      
Sbjct: 361 GGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPRTSGVS 420

Query: 290 SLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIY 345
             + CY  +  +  +VP V++ F G   ++L   N+ + V      C  F     +V I 
Sbjct: 421 LFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSII 480

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           GN+ Q    V +D  +  V F P  C
Sbjct: 481 GNVQQQGTRVSFDTARGAVGFTPNKC 506


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 88/242 (36%), Positives = 127/242 (52%), Gaps = 25/242 (10%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRL- 65
           V I  +LC   V+   A  G   V+L H D+ K       E P + L R A+ RS  R  
Sbjct: 9   VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61

Query: 66  ------NHFNQNSSISSSKASQ-----ADIIPNNANYLIRISIGTPPTERLAVADTGSDL 114
                 N      SI+ ++  +     A     +  Y++ +++GTPP    A+ DTGSDL
Sbjct: 62  ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
           IWTQC+ C  + C  Q  PLF P+MSS+Y+ + C+   C  +   SC   + C Y  SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DG+ + G  ATE  T  S++G+  ++P + FGCGT N G  N+  +GIVG G   +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237

Query: 234 QM 235
           Q+
Sbjct: 238 QL 239



 Score = 57.8 bits (138), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 66/132 (50%), Gaps = 13/132 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYSFNSLS--------- 301
           ++IDSGT LT  P    + ++    S +   P A+ +   + +C++  +++         
Sbjct: 323 VIIDSGTALTLFPVAVLAEVVRAFRSQLRL-PFANGSSPDDGVCFAAPAVAAGGGRMARQ 381

Query: 302 -QVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
             VP +  HF+GAD+ L R N+ ++      +C +     +     GN +Q +  V YD+
Sbjct: 382 VAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDL 441

Query: 360 EQQTVSFKPTDC 371
           E++T+SF P +C
Sbjct: 442 ERETLSFAPVEC 453


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  130 bits (328), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 88/242 (36%), Positives = 127/242 (52%), Gaps = 25/242 (10%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRL- 65
           V I  +LC   V+   A  G   V+L H D+ K       E P + L R A+ RS  R  
Sbjct: 9   VLIACWLCGCPVAGEAAFAGDIRVDLTHVDAGK-------ELPKRELIRRAMQRSKARAA 61

Query: 66  ------NHFNQNSSISSSKASQ-----ADIIPNNANYLIRISIGTPPTERLAVADTGSDL 114
                 N      SI+ ++  +     A     +  Y++ +++GTPP    A+ DTGSDL
Sbjct: 62  ALSVVRNGGGFYGSIAQAREREREPGMAVRASGDLEYVLDLAVGTPPQPITALLDTGSDL 121

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYG 173
           IWTQC+ C  + C  Q  PLF P+MSS+Y+ + C+   C  +   SC   + C Y  SYG
Sbjct: 122 IWTQCDTC--TACLRQPDPLFSPRMSSSYEPMRCAGQLCGDILHHSCVRPDTCTYRYSYG 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           DG+ + G  ATE  T  S++G+  ++P + FGCGT N G  N+  +GIVG G   +SL+S
Sbjct: 180 DGTTTLGYYATERFTFASSSGETQSVP-LGFGCGTMNVGSLNN-ASGIVGFGRDPLSLVS 237

Query: 234 QM 235
           Q+
Sbjct: 238 QL 239



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 35/132 (26%), Positives = 66/132 (50%), Gaps = 13/132 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE-LCYSFNSLS--------- 301
           ++IDSGT LT  P    + ++    S +   P A+ +   + +C++  +++         
Sbjct: 323 VIIDSGTALTLFPAAVLAEVVRAFRSQLRL-PFANGSSPDDGVCFAAPAVAAGGGRMARQ 381

Query: 302 -QVPEVTIHFRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
             VP +  HF+GAD+ L R N+ ++      +C +     +     GN +Q +  V YD+
Sbjct: 382 VAVPRMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDL 441

Query: 360 EQQTVSFKPTDC 371
           E++T+SF P +C
Sbjct: 442 ERETLSFAPVEC 453


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/166 (42%), Positives = 92/166 (55%), Gaps = 11/166 (6%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
           RL   L R  N   H  ++ +   S A Q  ++   +     Y +R+ IG PP++   V 
Sbjct: 107 RLDLFLKRVSNSDLHPAESKAEFESNALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSD+ W QC PC  S+CY Q  P+FDP  S++Y  + C   QC SL+   C    C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPISSNSYSPIRCDEPQCKSLDLSECRNGTCLY 224

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
            VSYGDGS++ G  ATETVTLGS   + VA+     GCG NN GLF
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGSAAVENVAI-----GCGHNNEGLF 265



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 36/121 (29%), Positives = 50/121 (41%), Gaps = 4/121 (3%)

Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR--- 311
           DSGT +T L       L        +  P A+     + CY  +S   V   T+ FR   
Sbjct: 364 DSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVEIPTVSFRFPE 423

Query: 312 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G ++ L   N+ + V S    C  F   T+S+ I GN+ Q    VG+DI    V F    
Sbjct: 424 GRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDS 483

Query: 371 C 371
           C
Sbjct: 484 C 484


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 11/160 (6%)

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +Q+ +     NY++ + +GTP  +   + DTGSDL WTQC+PC  S CY Q  P+FDP  
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPSA 201

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S TY ++ C+S+ C+ L     N   CS  NC Y + YGD SF+ G  A +T+TL     
Sbjct: 202 SKTYSNISCTSTACSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----T 257

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           Q     G  FGCG NN GLF  KT G++GLG   +S++ Q
Sbjct: 258 QNDVFDGFMFGCGQNNRGLFG-KTAGLIGLGRDPLSIVQQ 296



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 58/125 (46%), Gaps = 5/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGT +T LP     +L S     +   P A     L+ CY  ++ +   +P+++ +F
Sbjct: 378 IIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNF 437

Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G A+V L  +   +      VC  F   G  +++ I+GNI Q    V YD+    + F 
Sbjct: 438 NGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQLGFG 497

Query: 368 PTDCT 372
              C+
Sbjct: 498 YKGCS 502


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 117/425 (27%), Positives = 174/425 (40%), Gaps = 88/425 (20%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP------ 86
           ++HR  P SP     + P     D L     R++  ++  +  ++   Q   +P      
Sbjct: 22  VMHRHGPCSPLQTPDDAPSDA--DLLEHDQARVDSIHRMIANETAVVGQDVSLPAERGIS 79

Query: 87  -NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
               NY++ + +GTP  +   V DTGSDL W QC PC    CY Q  PLF P  SST+ +
Sbjct: 80  VGTGNYVVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSA 139

Query: 146 LPCSSSQCASLNQKSCSGV----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
           + C   +C    Q SCS       C Y V YGD S + G+L  +T+TLG+T     +   
Sbjct: 140 VRCGEPECPRARQ-SCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENN 198

Query: 199 ---LPGITFGCGTNN-------GGLF---------NSKTTGIVGLG------------GG 227
              LPG  FGCG NN        GLF         +S+  G  G G             G
Sbjct: 199 SNKLPGFVFGCGENNTGLFGKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAHG 258

Query: 228 DISL--------------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
            +SL                          +  +   +AG     + R  +    +++DS
Sbjct: 259 YLSLGTPAPAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALWPAGLIVDS 318

Query: 257 GTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSF----NSLSQVPEVTIHF 310
           GT +T L P+ Y++   + +S+M +      P  S L+ CY F    N+   +P V + F
Sbjct: 319 GTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSILDTCYDFTAHANATVSIPAVALVF 378

Query: 311 RGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G     V  S   +  KV++  +     G   S  I GN  Q    V YD+ +Q + F 
Sbjct: 379 AGGATISVDFSGVLYVAKVAQACLAFAPNGNGRSAGILGNTQQRTVAVVYDVGRQKIGFA 438

Query: 368 PTDCT 372
              C+
Sbjct: 439 AKGCS 443


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score =  130 bits (327), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 91/155 (58%), Gaps = 7/155 (4%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N  NY++ I +GTP      V DTGSD  W QC+PC  + CY Q  PLF P  S+TY ++
Sbjct: 161 NTGNYVVPIRLGTPAARFTVVFDTGSDTTWVQCQPCV-AYCYQQKEPLFTPTKSATYANI 219

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C+SS C+ L+ + CSG +C Y+V YGDGS++ G  A +T+TLG  T     +    FGC
Sbjct: 220 SCTSSYCSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTLGYDT-----VKDFRFGC 274

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           G  N GLF  K  G++GLG G  S+  Q     +G
Sbjct: 275 GEKNRGLFG-KAAGLMGLGRGKTSVPVQAYDKYSG 308



 Score = 42.4 bits (98), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 52/127 (40%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS---QVPEVT 307
           ++DSGT +T LP      L S  +  +E      A     L+ CY          +P V+
Sbjct: 377 LVDSGTVITRLPPSAYEPLRSAFAKGMEGLGYKTAPAFSILDTCYDLTGYQGSIALPAVS 436

Query: 308 IHFRGA---DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           + F+G    DV  S   +   VS+  +          + I GN  Q  + V YD+ ++ V
Sbjct: 437 LVFQGGACLDVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVV 496

Query: 365 SFKPTDC 371
            F P  C
Sbjct: 497 GFAPGAC 503


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 69/166 (41%), Positives = 94/166 (56%), Gaps = 11/166 (6%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA----NYLIRISIGTPPTERLAVA 108
           RL   L R  N   H  ++++   + A Q  ++   +     Y +R+ IG PP++   V 
Sbjct: 107 RLDLVLKRVSNSDLHPAESNAEFEANALQGPVVSGTSQGSGEYFLRVGIGKPPSQAYVVL 166

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSD+ W QC PC  S+CY Q  P+FDP  S++Y  + C + QC SL+   C    C Y
Sbjct: 167 DTGSDVSWIQCAPC--SECYQQSDPIFDPVSSNSYSPIRCDAPQCKSLDLSECRNGTCLY 224

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
            VSYGDGS++ G  ATETVTLG+   + VA+     GCG NN GLF
Sbjct: 225 EVSYGDGSYTVGEFATETVTLGTAAVENVAI-----GCGHNNEGLF 265



 Score = 52.8 bits (125), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 52/121 (42%), Gaps = 4/121 (3%)

Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-R 311
           DSGT +T L       L        +  P A+     + CY  +S    QVP V+ HF  
Sbjct: 364 DSGTAVTRLRSEVYDALRDAFVKGAKGIPKANGVSLFDTCYDLSSRESVQVPTVSFHFPE 423

Query: 312 GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G ++ L   N+ + V S    C  F   T+S+ I GN+ Q    VG+DI    V F    
Sbjct: 424 GRELPLPARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADS 483

Query: 371 C 371
           C
Sbjct: 484 C 484


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/225 (35%), Positives = 117/225 (52%), Gaps = 35/225 (15%)

Query: 18  VVSPIEAQT---GGFSVELIHRDSPKSPFYNSSETPY-QRLRDALTRSLNRLNHFNQNSS 73
           VV P + +T     +S+ L+HRD+ K     ++E  Y +R++  L R   R+   N    
Sbjct: 45  VVQPAKEETLEIKPWSIPLVHRDAMKGNSNKNNELSYAERMQQRLKRDAARVAAINSRLE 104

Query: 74  IS-------------------SSKASQADIIPN----NANYLIRISIGTPPTERLAVADT 110
           ++                   +    Q+ ++      +  Y  RI +G P  ++L V DT
Sbjct: 105 LAVNGIKRSSLKPDSSSSFTMAESDFQSPVVSGMDQGSGEYFSRIGVGAPRRDQLMVLDT 164

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS-GVNCQYS 169
           GSD+ W QCEPC  S CY Q  P+++P +SS+YK + C ++ C  L+   CS   +C Y 
Sbjct: 165 GSDVTWIQCEPC--SDCYQQSDPIYNPALSSSYKLVGCQANLCQQLDVSGCSRNGSCLYQ 222

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
           VSYGDGS++ GN ATET+TLG    Q VA+     GCG +N GLF
Sbjct: 223 VSYGDGSYTQGNFATETLTLGGAPLQNVAI-----GCGHDNEGLF 262



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +++DSGT +T L      +L     +  +  P  D     + CY  +S     VP V  H
Sbjct: 362 VIVDSGTAVTRLQTAAYDSLRDAFRAGTKNLPSTDGVSLFDTCYDLSSKESVDVPTVVFH 421

Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G  + L   N+ V V S    C  F   ++S+ I GNI Q    V +D     V F 
Sbjct: 422 FSGGGSMSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFA 481

Query: 368 PTDC 371
              C
Sbjct: 482 VNKC 485


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 68/160 (42%), Positives = 93/160 (58%), Gaps = 11/160 (6%)

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
           +Q+ +     NY++ + +GTP  +   + DTGSDL WTQC+PC  S CY Q  P+FDP  
Sbjct: 143 AQSGLPLGTGNYIVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKS-CYAQQQPIFDPST 201

Query: 140 SSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S TY ++ C+S+ C+SL     N   CS  NC Y + YGD SF+ G  A + +TL     
Sbjct: 202 SKTYSNISCTSAACSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTL----T 257

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           Q     G  FGCG NN GLF  KT G++GLG   +S++ Q
Sbjct: 258 QNDVFDGFMFGCGQNNKGLFG-KTAGLIGLGRDPLSIVQQ 296



 Score = 55.8 bits (133), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 59/125 (47%), Gaps = 5/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGT +T LP     +L S     +   P A     L+ CY  ++ +   +P+++ +F
Sbjct: 378 IIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSLLDTCYDLSNYTSISIPKISFNF 437

Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G A+V+L  +   +      VC  F   G  +S+ I+GNI Q    V YD+    + F 
Sbjct: 438 NGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQLGFG 497

Query: 368 PTDCT 372
              C+
Sbjct: 498 YKGCS 502


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/200 (36%), Positives = 104/200 (52%), Gaps = 22/200 (11%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD--- 83
           G  S+ELIHR+S          T  Q L + L R   R+      + ++  K  +A    
Sbjct: 54  GTLSLELIHRNSLLREAKEKLHTHEQLLLETLQRDEQRVRWIESKAQLAGKKKDEASSTD 113

Query: 84  --------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
                   ++  +  Y +R+ +GTP      V DTGSDL W QC+PC    CY Q  P+F
Sbjct: 114 LNGPVTSGLLYGSGEYFVRLGVGTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIF 171

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLG 190
           DP+ SS+++ +PC S  C +L   SCSG       C Y V+YGDGSFS G+ +++  TLG
Sbjct: 172 DPRNSSSFQRIPCLSPLCKALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG 231

Query: 191 STTGQAVALPGITFGCGTNN 210
            T  +A++   + FGCG +N
Sbjct: 232 -TGSKAMS---VAFGCGFDN 247



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T  P    + +     +     P A      + CY+F+  +   VP + +H
Sbjct: 359 VIIDSGTSVTRFPTSVYATIRDAFRNATTNLPSAPRYSLFDTCYNFSGKASVDVPALVLH 418

Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GAD++L  +N+ + + +    C  F   +  + I GNI Q +F +G+D+++  ++F 
Sbjct: 419 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFA 478

Query: 368 PTDC 371
           P  C
Sbjct: 479 PQQC 482


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 101/197 (51%), Gaps = 18/197 (9%)

Query: 29  FSVELIHRDS-PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
           +++ L+HRD  P   + N     + R+R    R    L   +    ++SS +        
Sbjct: 59  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVVVASSDSRYEVNDFG 118

Query: 82  ADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +D++      +  Y +RI +G+PP ++  V D+GSD++W QC+PC    CY Q  P+FDP
Sbjct: 119 SDVVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDP 176

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             S +Y  + C SS C  +    C    C+Y V YGDGS++ G LA ET+T   T  + V
Sbjct: 177 AKSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNV 236

Query: 198 ALPGITFGCGTNNGGLF 214
           A+     GCG  N G+F
Sbjct: 237 AM-----GCGHRNRGMF 248



 Score = 47.8 bits (112), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 50/124 (40%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIH 309
           +V+D+GT +T LP G  +       S     P A      + CY  +     +VP V+ +
Sbjct: 348 VVMDTGTAVTRLPTGAYAAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 407

Query: 310 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G  + L   NF + V +    C  F      + I GNI Q    V +D     V F 
Sbjct: 408 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 467

Query: 368 PTDC 371
           P  C
Sbjct: 468 PNVC 471


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/213 (37%), Positives = 113/213 (53%), Gaps = 21/213 (9%)

Query: 31  VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-N 87
           V + HRD+  P  P         QRL     R  + ++   +  S   S       IP  
Sbjct: 29  VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG------IPFE 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ +P
Sbjct: 83  SGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRVP 140

Query: 148 CSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           CSS QC +L    C     +G  C+Y V+YGDGS S G LAT+ +   + T     +  +
Sbjct: 141 CSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAFANDT----YVNNV 196

Query: 203 TFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           T GCG +N GLF+S   G++G+  G IS+ +Q+
Sbjct: 197 TLGCGRDNEGLFDS-AAGLLGVARGKISISTQV 228



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/134 (23%), Positives = 59/134 (44%), Gaps = 13/134 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNS--LSQVPEV 306
           +V+DSGT ++   +   + L     +   A  +    G     + CY       +  P +
Sbjct: 312 VVVDSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLI 371

Query: 307 TIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
            +HF G AD+ L   N+F+       + +    C  F+   + + + GN+ Q  F V +D
Sbjct: 372 VLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 359 IEQQTVSFKPTDCT 372
           +E++ + F P  CT
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 115/223 (51%), Gaps = 28/223 (12%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-----SSISSSKASQADI 84
           S+E+IHR  P     +++ T      + L +  +R++  +        S+   + S+A  
Sbjct: 62  SLEVIHRHGPCGDEVSNAPTA----AEMLVKDQSRVDFIHSKIAGELESVDRLRGSKATK 117

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP        + NY++ + +GTP      + DTGSDL WTQC+PC    CY Q  P+F P
Sbjct: 118 IPAKSGATIGSGNYIVSVGLGTPKKYLSLIFDTGSDLTWTQCQPC-ARYCYNQKDPVFVP 176

Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
             S+TY ++ CSS  C+ L     NQ  CS    C Y + YGD SFS G  A ET+TL S
Sbjct: 177 SQSTTYSNISCSSPDCSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLTS 236

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           T      +    FGCG NN GLF S   G++GLG   IS++ Q
Sbjct: 237 TD----VIENFLFGCGQNNRGLFGS-AAGLIGLGQDKISIVKQ 274



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 55/129 (42%), Gaps = 5/129 (3%)

Query: 248 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 305
           ST   +IDSGT +T LP    S L S     +   P A     L+ CY  +  S  Q+P+
Sbjct: 345 STSGAIIDSGTVITRLPPDAYSALKSAFEKGMAKYPKAPELSILDTCYDLSKYSTIQIPK 404

Query: 306 VTIHFRGA-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
           V   F+G  ++ L         S   VC  F G  +  +V I GN+ Q    V YD+   
Sbjct: 405 VGFVFKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGG 464

Query: 363 TVSFKPTDC 371
            + F    C
Sbjct: 465 KIGFGYNGC 473


>gi|388498308|gb|AFK37220.1| unknown [Lotus japonicus]
          Length = 363

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 120/230 (52%), Gaps = 23/230 (10%)

Query: 25  QTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRLNHFNQNSSISSSKAS- 80
           + G   +E+  R   S K   ++        L D   RS+ NRL     + S+  S+   
Sbjct: 73  EKGAIMLEMKDRSYCSKKKVNWHRKLHNQLTLDDLHVRSMQNRLRKMVSSHSVEVSQIQI 132

Query: 81  --QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
              + +     NY++ + +G    +   + DTGSDL W QCEPC    CY Q  P+F P 
Sbjct: 133 PLASGVNFQTLNYIVTMELGG--QDMTVIIDTGSDLTWVQCEPC--MSCYNQQGPVFKPS 188

Query: 139 MSSTYKSLPCSSSQCASL-----NQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGS 191
            SS+Y+S+PC+SS C SL     N  +C     NC Y+V+YGDGS++NG L  E ++ G 
Sbjct: 189 TSSSYQSIPCNSSTCQSLQLTTGNAGACESNPSNCSYAVNYGDGSYTNGELGAEHLSFG- 247

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                +++    FGCG NN GLF    +G++GLG  ++SLISQ  +T  G
Sbjct: 248 ----GISVSNFVFGCGKNNKGLFGG-VSGLMGLGRSNLSLISQTNSTFGG 292


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score =  129 bits (324), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 69/196 (35%), Positives = 105/196 (53%), Gaps = 17/196 (8%)

Query: 29  FSVELIHRDS-PKSPFYNSSETPYQRLR---DALTRSLNRLNHFNQNSSISSSKASQ--A 82
           +++ L+HRD  P   + N     + R+R   D ++  L R++     SS S  + +   +
Sbjct: 59  YTLRLLHRDRFPSVTYRNHHHRLHARMRRDTDRVSAILRRISGKVIPSSDSRYEVNDFGS 118

Query: 83  DIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           DI+      +  Y +RI +G+PP ++  V D+GSD++W QC+PC    CY Q  P+FDP 
Sbjct: 119 DIVSGMDQGSGEYFVRIGVGSPPRDQYMVIDSGSDMVWVQCQPC--KLCYKQSDPVFDPA 176

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S +Y  + C SS C  +    C    C+Y V YGDGS++ G LA ET+T   T  + VA
Sbjct: 177 KSGSYTGVSCGSSVCDRIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTFAKTVVRNVA 236

Query: 199 LPGITFGCGTNNGGLF 214
           +     GCG  N G+F
Sbjct: 237 M-----GCGHRNRGMF 247



 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +V+D+GT +T LP            S     P A      + CY  +     +VP V+ +
Sbjct: 347 VVMDTGTAVTRLPTAAYVAFRDGFKSQTANLPRASGVSIFDTCYDLSGFVSVRVPTVSFY 406

Query: 310 F-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G  + L   NF + V +    C  F      + I GNI Q    V +D     V F 
Sbjct: 407 FTEGPVLTLPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFG 466

Query: 368 PTDC 371
           P  C
Sbjct: 467 PNVC 470


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 82/212 (38%), Positives = 109/212 (51%), Gaps = 15/212 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNR---LNHFNQNSSISSSKASQADI 84
           GF   L H D+       +  T  Q L  A+ RS  R   L      ++  +   ++  +
Sbjct: 29  GFQATLTHIDA------GAGYTEAQLLSRAVRRSKARVAALQSLATTTAADAITVARILV 82

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           + +   YL+ + IGTPP    A+ DTGSDLIWTQC PC    C  Q +P FDP  S +Y 
Sbjct: 83  LASEGEYLMSMGIGTPPRYYSAILDTGSDLIWTQCAPC--MLCVDQPTPFFDPAQSPSYA 140

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPC+S  C +L    C    C Y   YGD + + G L+ ET T G T    V +P I F
Sbjct: 141 KLPCNSPMCNALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFTFG-TNDTRVTVPRIAF 199

Query: 205 GCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
           GCG  N G LFN   +G+VG G G +SL+SQ+
Sbjct: 200 GCGNLNAGSLFNG--SGMVGFGRGPLSLVSQL 229



 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 59/129 (45%), Gaps = 12/129 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMI-----EAQPVADPTGSLELCYSF----NSLSQ 302
           ++IDSG+T+T+L +     +    +  +      A  +AD    L+ C+ +      +  
Sbjct: 315 VIIDSGSTITYLARAAYDMVHQAFADQVGLPLTNATSLAD---VLDTCFVWPPPPRKIVT 371

Query: 303 VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           +PE+  HF GA+++L   N+ +   +     +    ++   I G+    NF V YD E  
Sbjct: 372 MPELAFHFEGANMELPLENYMLIDGDTGNLCLAIAASDDGSIIGSFQHQNFHVLYDNENS 431

Query: 363 TVSFKPTDC 371
            +SF P  C
Sbjct: 432 LLSFTPATC 440


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/345 (30%), Positives = 145/345 (42%), Gaps = 61/345 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 176 GTGNYVVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCV-VVCYEQREKLFDPARSSTYANV 234

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------------- 193
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S               
Sbjct: 235 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERN 294

Query: 194 ----GQAVALPGITFGCGT-------NNGGLFN----SKTTGIVGLG------------- 225
               G+A  L G+  G  +         GG+F     +++TG   L              
Sbjct: 295 EGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSLAAASARL 354

Query: 226 -------GGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNL-- 271
                   G       M     G Q L +     +T   ++DSGT +T LP    S+L  
Sbjct: 355 TTPMLTDNGPTFYYVGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPAAYSSLRY 414

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSE 328
               +        A     L+ CY F  +SQV  P V++ F+G A + +  S      S 
Sbjct: 415 AFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA 474

Query: 329 DIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 475 SQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGAC 519


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 113/389 (29%), Positives = 172/389 (44%), Gaps = 89/389 (22%)

Query: 61  SLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           S+ RL +    ++  I +  +    IIP    +L+ ISIG+PP  +L   DT SDL+W Q
Sbjct: 55  SVERLEYLKAKATGDIIAHLSPNVPIIPQA--FLVNISIGSPPVTQLLHMDTASDLLWLQ 112

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSGVNCQYSVSYGD 174
           C PC    CY Q  P+FDP  S T+++  C +SQ +      N K+ S   C+YS+ Y D
Sbjct: 113 CRPC--INCYAQSLPIFDPSRSYTHRNESCRTSQYSMPSLRFNAKTRS---CEYSMRYMD 167

Query: 175 GSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           G+ S G LA E +   +   +  + AL  + FGCG +N G      TGI+GLG G+ SL+
Sbjct: 168 GTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLV 226

Query: 233 SQMRTTIA---------------------GNQRLGVSTP--------------------- 250
            +  T  +                     G   LG +TP                     
Sbjct: 227 HRFGTKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIYNGFYYVTIEAISVDGII 286

Query: 251 -----------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
                              +ID+G +LT L +     L + +    E +  A      ++
Sbjct: 287 LPIDPWVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNKIEDYFEGRFTAADVNQDDM 346

Query: 294 ----CYSFN-----SLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSV 342
               CY+ N       S  P VT HF  GA++ L   + F+K+S ++ C +V  G  NS+
Sbjct: 347 FKVECYNGNLERDLVESGFPIVTFHFSDGAELSLDVKSVFMKLSPNVFCLAVTPGNMNSI 406

Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              G   Q ++ +GYD+E + +SF+  DC
Sbjct: 407 ---GATAQQSYNIGYDLEAKKISFERIDC 432


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 91/246 (36%), Positives = 128/246 (52%), Gaps = 34/246 (13%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNR-- 64
           L  L F VV    A +G  SV +    IH D           T  Q +RDAL R ++R  
Sbjct: 27  LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77

Query: 65  -----------LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
                      L   +  +S + S  ++ D+ PN   YL+ ++IGTPP    AVADTGSD
Sbjct: 78  SRSFGRDRDRELAESDGRTSTTVSARTRKDL-PNGGEYLMTLAIGTPPLPYAAVADTGSD 136

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYS 169
           LIWTQC PC  +QC+ Q +PL++P  S+T+  LPC+S  S CA     +       C Y 
Sbjct: 137 LIWTQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYY 195

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
            +YG G ++ G   +ET T GS+      +PG+ FGC   +   +N  + G+VGLG G +
Sbjct: 196 QTYGTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSL 253

Query: 230 SLISQM 235
           SL+SQ+
Sbjct: 254 SLVSQL 259



 Score = 65.1 bits (157), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 39/130 (30%), Positives = 70/130 (53%), Gaps = 10/130 (7%)

Query: 252 IVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ-----V 303
           ++IDSGTT+T L    Y     +V S ++   P  D + S  L+LC++  + +      +
Sbjct: 342 LIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVL 401

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQ 362
           P +T+HF GAD+ L   ++ +  S  + C   +  T+ ++  +GN  Q N  + YD+ ++
Sbjct: 402 PSMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREE 460

Query: 363 TVSFKPTDCT 372
           T+SF P  C+
Sbjct: 461 TLSFAPAKCS 470


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 110/407 (27%), Positives = 174/407 (42%), Gaps = 93/407 (22%)

Query: 54  LRDALTRSLNRLNHFN--QNSSISSSKASQ---ADIIP----NNANYLIRISIGTPPTER 104
           +R A+ RS  R    +  +N +  S K  Q   A ++P     +  Y++ ++IGTPP   
Sbjct: 50  IRRAMRRSKARAAALSAVRNRARFSGKNEQQTPAGVLPVRPSGDLEYVVDLAIGTPPQPV 109

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A+ DTGSDLIWTQC PC  + C  Q  PLF P  S++Y+ + C+ + C+ +   SC   
Sbjct: 110 SALLDTGSDLIWTQCAPC--ASCLSQPDPLFAPGQSASYEPMRCAGTLCSDILHHSCERP 167

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT--FGCGTNNGG--------- 212
           + C Y  +YGDG+ + G  ATE  T  S+ G  +    +   FGCG+ N G         
Sbjct: 168 DTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVNVGSLNNGSGIV 227

Query: 213 ---------------------------------LFNSKTTGIVGLGGGDISLISQMRT-- 237
                                            LF S + G+ G   G +     +++  
Sbjct: 228 GFGRNPLSLVSQLSIRRFSYCLTSYASRRQSTLLFGSLSDGVYGDATGRVQTTPLLQSPQ 287

Query: 238 -----------TIAGNQRLGVS------TPD----IVIDSGTTLTFLPQGYNSNLLSVMS 276
                         G +RL +        PD    +++DSGT LT LP    + ++    
Sbjct: 288 NPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAAVLAEVVRAFR 347

Query: 277 SMIEAQPVADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFV- 324
             +   P A+  G+ E  +C+         S  S   VP + +HF+GAD+ L R N+ + 
Sbjct: 348 QQLRL-PFAN-GGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGADLDLPRRNYVLD 405

Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                 +C +     +     GN++Q +  V YD+E +T+S  P  C
Sbjct: 406 DHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 452


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 124/446 (27%), Positives = 186/446 (41%), Gaps = 99/446 (22%)

Query: 14  LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           L FY+ + I + T         + +LIHR+S   P Y+ +ET   R +   T S+ R + 
Sbjct: 17  LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76

Query: 68  FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               S I   K+    +++ +IP N  + +L+ +SIG+PP  +L V DTGS L+W QC P
Sbjct: 77  LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
           C    C+ Q +  FDP  S ++K+L C       +N   C+  N  +Y + Y  G  S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192

Query: 181 NLATETVTLGSTTGQAVALPGITFGCG-----TNNGGLFNSKTTGIVGLGG--------- 226
            LA E++   +     +    ITFGCG     TNN   +N    G+ GLG          
Sbjct: 193 ILAKESLLFETLDEGKIKKSNITFGCGHMNIKTNNDDAYN----GVFGLGAYPHITMATQ 248

Query: 227 ---------GDIS--------LISQMRTTIAGN---------------QRLGVSTPDIVI 254
                    GDI+        L+    + I G+               Q + V +  + I
Sbjct: 249 LGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVTLQSISVGSKTLKI 308

Query: 255 -----------------DSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPTGSLEL 293
                            DSG T T L  G        ++ +M  ++E  P         L
Sbjct: 309 DPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLERIPTQRKFEG--L 366

Query: 294 CYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIY 345
           C+       L   P VT HF  GAD+ L   + F +   D  C      +NS    + + 
Sbjct: 367 CFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILP-SNSELLNLSVI 425

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           G + Q N+ VG+D+EQ  V F+  DC
Sbjct: 426 GILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 86/244 (35%), Positives = 123/244 (50%), Gaps = 39/244 (15%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQ--------RLRDALTRSLNR---------LNHFNQ 70
           G  + L H  SP SP    S+ P+         R+    +R  N          L H ++
Sbjct: 42  GLHLTLHHPQSPCSPAPLPSDLPFSAVVTHDDARIAHLASRLANNHPTSPSSSSLLHGHR 101

Query: 71  NSSISSSKASQAD-----IIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
                    SQA      + P  +    NY+ R+ +GTP T  + V DTGS L W QC P
Sbjct: 102 KKKAGGVGGSQASSSSVPLTPGASVAVGNYVTRLGLGTPATSYVMVVDTGSSLTWLQCSP 161

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDG 175
           C  S C+ Q  P+FDP+ S TY ++ CSSS+C     A+LN  +CS  N C Y  SYGD 
Sbjct: 162 CSVS-CHRQAGPVFDPRASGTYAAVQCSSSECGELQAATLNPSACSVSNVCIYQASYGDS 220

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+S G L+ +TV+ GS +      PG  +GCG +N GLF  ++ G++GL    +SL+ Q+
Sbjct: 221 SYSVGYLSKDTVSFGSGS-----FPGFYYGCGQDNEGLFG-RSAGLIGLAKNKLSLLYQL 274

Query: 236 RTTI 239
             ++
Sbjct: 275 APSL 278



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 58/123 (47%), Gaps = 4/123 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCYSFNSLS-QVPEVTIHF 310
           +IDSGT +T LP    + L   +++ + +     PT S L+ C+  ++   +VP V + F
Sbjct: 348 IIDSGTVITRLPPNVYTALSRAVAAAMASAAPRAPTYSILDTCFRGSAAGLRVPRVDMAF 407

Query: 311 RG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            G A + LS  N  + V +   C  F   T    I GN  Q  F V YD+ Q  + F   
Sbjct: 408 AGGATLALSPGNVLIDVDDSTTCLAFA-PTGGTAIIGNTQQQTFSVVYDVAQSRIGFAAG 466

Query: 370 DCT 372
            C+
Sbjct: 467 GCS 469


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 97/348 (27%), Positives = 147/348 (42%), Gaps = 69/348 (19%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q  PLFDP  S+++  + 
Sbjct: 40  SGEYFVRIGLGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST--------------- 192
           CSS+ C  +    C+   C+Y VSYGDGS++ G LA ET+T G T               
Sbjct: 98  CSSAVCDRVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTFGRTVVRNVAIGCGHSNRG 157

Query: 193 -------------------------TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGG 227
                                    TG A +   ++ G  TN    F S+   +   G  
Sbjct: 158 MFVGAAGLLGLGGGSMSFMGQLSGQTGNAFSYCLVSRGTNTNGFLEFGSEAMPV---GAA 214

Query: 228 DISLISQMRTTI----------AGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
            I L+   R              G+ R+ VS          +  +V+D+GT +T  P   
Sbjct: 215 WIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVTRFPTVA 274

Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSNFFV 324
                +      +  P A      + CY+ F  LS +VP V+ +F G  +  +  +NF +
Sbjct: 275 YEAFRNAFIEQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTIPANNFLI 334

Query: 325 KVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            V +    C  F    + + I GNI Q    +  D   + V F P  C
Sbjct: 335 PVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 88/243 (36%), Positives = 125/243 (51%), Gaps = 30/243 (12%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVEL----IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
           L  L F VV    A +G  SV +    IH D           T  Q +RDAL R ++R  
Sbjct: 27  LAVLVFLVVCATLA-SGAASVRVGLTRIHSDP--------DTTAPQFVRDALRRDMHRQR 77

Query: 67  ----------HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
                        ++   ++  A     +PN   YL+ ++IGTPP    AVADTGSDLIW
Sbjct: 78  SRSFGRDRDRELAESDGRTTVSARTRKDLPNGGEYLMTLAIGTPPLPYAAVADTGSDLIW 137

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKSCSGVN--CQYSVSY 172
           TQC PC  +QC+ Q +PL++P  S+T+  LPC+S  S CA     +       C Y+ +Y
Sbjct: 138 TQCAPC-GTQCFEQPAPLYNPASSTTFSVLPCNSSLSMCAGALAGAAPPPGCACMYNQTY 196

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           G G ++ G   +ET T GS+      +PG+ FGC   +   +N  + G+VGLG G +SL+
Sbjct: 197 GTG-WTAGVQGSETFTFGSSAADQARVPGVAFGCSNASSSDWNG-SAGLVGLGRGSLSLV 254

Query: 233 SQM 235
           SQ+
Sbjct: 255 SQL 257



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 37/129 (28%), Positives = 70/129 (54%), Gaps = 9/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ-----VP 304
           ++IDSGTT+T L       + + + S++   P  D + S  L+LC++  + +      +P
Sbjct: 340 LIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDSTGLDLCFALPAPTSAPPAVLP 399

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
            +T+HF GAD+ L   ++ +  S  + C   +  T+ ++  +GN  Q N  + YD+ ++T
Sbjct: 400 SMTLHFDGADMVLPADSYMISGS-GVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVREET 458

Query: 364 VSFKPTDCT 372
           +SF P  C+
Sbjct: 459 LSFAPAKCS 467


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 117/426 (27%), Positives = 176/426 (41%), Gaps = 95/426 (22%)

Query: 29  FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
            S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  
Sbjct: 48  LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 105

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P
Sbjct: 106 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 164

Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
             S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+
Sbjct: 165 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 224

Query: 193 TGQAVALPGITFGCGTNNG----------------------------GLF------NSKT 218
                      FGCG  N                              LF      +S +
Sbjct: 225 N----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 280

Query: 219 TGIVGLGGG----------------------DISLISQMRTTIAGNQRLGVS----TPDI 252
            G + LGG                       DI+ +S       G ++L +     +   
Sbjct: 281 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLS------VGGRKLSIDESAFSAGT 334

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           VIDSGT +T L     S L S   +++   P        + CY F+     ++P+V + F
Sbjct: 335 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 394

Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
           +G  ++ +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F
Sbjct: 395 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 454

Query: 367 KPTDCT 372
            P  C+
Sbjct: 455 APGGCS 460


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 117/426 (27%), Positives = 176/426 (41%), Gaps = 95/426 (22%)

Query: 29  FSVELIHRDSPKSPFYNS---SETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI 84
            S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  
Sbjct: 60  LSLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATT 117

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P
Sbjct: 118 LPVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNP 176

Query: 138 KMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
             S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+
Sbjct: 177 STSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSS 236

Query: 193 TGQAVALPGITFGCGTNNG----------------------------GLF------NSKT 218
                      FGCG  N                              LF      +S +
Sbjct: 237 N----VFKNFLFGCGQQNNGLFGGAAGLLGLGRTKLALPSQTAKTYKKLFSYCLPASSSS 292

Query: 219 TGIVGLGGG----------------------DISLISQMRTTIAGNQRLGVS----TPDI 252
            G + LGG                       DI+ +S       G ++L +     +   
Sbjct: 293 KGYLSLGGQVSKSVKFTPLSADFDSTPFYGLDITGLS------VGGRKLSIDESAFSAGT 346

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           VIDSGT +T L     S L S   +++   P        + CY F+     ++P+V + F
Sbjct: 347 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 406

Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
           +G  ++ +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F
Sbjct: 407 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 466

Query: 367 KPTDCT 372
            P  C+
Sbjct: 467 APGGCS 472


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  127 bits (320), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 77/226 (34%), Positives = 112/226 (49%), Gaps = 19/226 (8%)

Query: 30  SVELIHRDSPKSPFYNSSETP---YQRLRDALTRS---LNRLNHFNQNSSISSSKASQAD 83
           SV L+HR  P +P   SS+ P     RLR    RS   ++R++          S  +   
Sbjct: 57  SVPLVHRHGPCAPTQLSSDKPSSFTDRLRRNRARSKYIMSRVSKGMMGDDADVSIPTHLG 116

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
              ++  Y++ + +GTP   ++ + DTGSDL W QC+PC  + CY Q  PLFDP  SSTY
Sbjct: 117 GSVDSLEYVVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTY 176

Query: 144 KSLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             +PC++  C  L      G          C ++++YGDGS + G  + ET+ L      
Sbjct: 177 APIPCNTDACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLALAP---- 232

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            VA+    FGCG +  G  N K  G++GLGG   SL+ Q  +   G
Sbjct: 233 GVAVKDFRFGCGHDQDGA-NDKYDGLLGLGGAPESLVVQTASVYGG 277



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 36/123 (29%), Positives = 55/123 (44%), Gaps = 6/123 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT +T L     + L +     + A P+    G L+ CY F+  S V  P+V + 
Sbjct: 349 MIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVR-NGELDTCYDFSGYSNVTLPKVALT 407

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G A + L   N  +   +D +     G  +   I GN+ Q    V YD  +  V F+ 
Sbjct: 408 FSGGATIDLDVPNGILL--DDCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGRVGFRA 465

Query: 369 TDC 371
             C
Sbjct: 466 AVC 468


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 70/163 (42%), Positives = 95/163 (58%), Gaps = 21/163 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 137 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 192

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 193 SSTCQDLVAATGNSGPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVLGDT-----K 247

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           L  + FGCG NN GLF    +G++GLG   +SL+SQ   T  G
Sbjct: 248 LENLVFGCGRNNKGLFGG-ASGLMGLGRSSVSLVSQTLKTFNG 289



 Score = 46.2 bits (108), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
           I+IDSGT +T LP      + +         P A     L+ C++  S     +P + + 
Sbjct: 359 ILIDSGTVITRLPPSIYKAVKTEFLKQFSGFPSAPGYSILDTCFNLTSYEDISIPTIKMI 418

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F G    +V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +
Sbjct: 419 FEGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 478

Query: 365 SFKPTDC 371
                +C
Sbjct: 479 GIAGENC 485


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 118/460 (25%), Positives = 189/460 (41%), Gaps = 106/460 (23%)

Query: 8   VFILFFLCFYVVSPIEAQT----GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLN 63
           V +L     Y   P+ +          V L H D+ K    + SE     +R A+ RS  
Sbjct: 7   VLVLAIASLYYACPVASAAFVGDDDVRVALKHVDAGKQ--LSRSEL----IRRAMQRSKA 60

Query: 64  RLNHFN--QNSSISSSKASQAD-----------IIPN-NANYLIRISIGTPPTERLAVAD 109
           R    +  +N + S+  + + D           + P+ +  Y++ ++IGTPP    A+ D
Sbjct: 61  RAAALSAVRNRAASARFSGKNDDQRTTPPTGVSVRPSGDLEYVVDLAIGTPPQPVSALLD 120

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQY 168
           TGSDLIWTQC PC  + C  Q  PLF P  S++Y+ + C+   C+ +    C   + C Y
Sbjct: 121 TGSDLIWTQCAPC--ASCLAQPDPLFAPGESASYEPMRCAGQLCSDILHHGCEMPDTCTY 178

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG---------------- 212
             +YGDG+ + G  ATE  T  S+ G  +    + FGCG+ N G                
Sbjct: 179 RYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVGSLNNGSGIVGFGRNPL 238

Query: 213 --------------------------LFNSKTTGIVGLGGGDISLISQMRTT-------- 238
                                     LF S + G+ G   G +     +++         
Sbjct: 239 SLVSQLSIRRFSYCLTSYGSGRKSTLLFGSLSGGVYGDATGPVQTTPLLQSLQNPTFYYV 298

Query: 239 -----IAGNQRLGVST------PD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
                  G +RL +        PD    +++DSGT LT LP    + ++      +   P
Sbjct: 299 HLAGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPGAVLAEVVRAFRQQLRL-P 357

Query: 284 VADPTGSLE--LCY---------SFNSLSQVPEVTIHFRGADVKLSRSNFFVK-VSEDIV 331
            A+  G+ E  +C+         S  S   VP +  HF+ AD+ L R N+ +    +  +
Sbjct: 358 FAN-GGNPEDGVCFLVPAAWRRSSSTSQVPVPRMVFHFQDADLDLPRRNYVLDDHRKGRL 416

Query: 332 CSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           C +     +     GN++Q +  V YD+E +T+SF P  C
Sbjct: 417 CLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSFAPAQC 456


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  127 bits (319), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 112/229 (48%), Gaps = 21/229 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           G +++++HR   +S    +    +      L R  NR+   ++  + +   A+    IP 
Sbjct: 59  GNTIQIVHRACLQSGDRKTVPDHHPHYTGILRRDHNRVRSIHRRLTGAGDTAA---TIPA 115

Query: 87  ------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
                 ++  Y++ I IGTP      + DTGSDL W QC+PC  S CY Q  PLFDP  S
Sbjct: 116 SLGLAFHSLEYVVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDS-CYQQQEPLFDPSKS 174

Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           STY  +PC + QC        +C G  C+YSV YGD S + GNLA E  TL  +   A  
Sbjct: 175 STYVDVPCGTPQCKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA- 233

Query: 199 LPGITFGCGTN-----NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
             G+ FGC         G        G++GLG GD S++SQ R   +G+
Sbjct: 234 --GVVFGCSHEYSSGVKGAEEEMSVAGLLGLGRGDSSILSQTRRGNSGD 280


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 93/348 (26%), Positives = 146/348 (41%), Gaps = 69/348 (19%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ +G+P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y S+
Sbjct: 163 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 220

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C + +C  L+  +C  S   C Y V+YGDGS++ G+ ATET+TLG +      +  +  
Sbjct: 221 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 276

Query: 205 GCGTNNGGLF--------------------------------NSKTTGIVGLGGGDIS-- 230
           GCG +N GLF                                +S ++  +  G    +  
Sbjct: 277 GCGHDNEGLFVGAAGLLALGGGPLSFPSQISATTFSYCLVDRDSPSSSTLQFGDAADAEV 336

Query: 231 ---LISQMRTTI----------AGNQRLGVSTPDIVIDS----------GTTLTFLPQGY 267
              LI   RT+            G Q L +      +DS          GT +T L    
Sbjct: 337 TAPLIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTAVTRLQSSA 396

Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFV 324
            + L        ++ P        + CY  +  +  +VP V++ F  G +++L   N+ +
Sbjct: 397 YAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLRFAGGGELRLPAKNYLI 456

Query: 325 KVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            V      C  F     +V I GN+ Q    V +D  + TV F    C
Sbjct: 457 PVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTNKC 504


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 109/345 (31%), Positives = 147/345 (42%), Gaps = 61/345 (17%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
              NY++ + +GTP +    V DTGSD  W QC+PC    CY Q   LFDP  SSTY ++
Sbjct: 174 GTGNYVVTVGLGTPASRYTVVFDTGSDTTWVQCQPCV-VVCYEQQEKLFDPVRSSTYANV 232

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT------------- 193
            C++  C+ LN   CSG +C Y V YGDGS+S G  A +T+TL S               
Sbjct: 233 SCAAPACSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTLSSYDAVKGFRFGCGERN 292

Query: 194 ----GQAVALPGITFGCGT-------NNGGLF------NSKTTGIVGLGGGDISLISQMR 236
               G+A  L G+  G  +         GG+F       S  TG +  G G  +  S   
Sbjct: 293 EGLFGEAAGLLGLGRGKTSLPVQTYDKYGGVFAHCLPARSTGTGYLDFGAGSPAAASARL 352

Query: 237 TT------------------IAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNL-- 271
           TT                    G Q L +     +T   ++DSGT +T LP    S+L  
Sbjct: 353 TTPMLTDNGPTFYYIGMTGIRVGGQLLSIPQSVFATAGTIVDSGTVITRLPPPAYSSLRY 412

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSE 328
               +        A     L+ CY F  +SQV  P V++ F+G A + +  S      S 
Sbjct: 413 AFAAAMAARGYKKAPAVSLLDTCYDFTGMSQVAIPTVSLLFQGGARLDVDASGIMYAASA 472

Query: 329 DIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             VC  F    +   V I GN     F V YDI ++ V F P  C
Sbjct: 473 SQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYPGVC 517


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  127 bits (318), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/234 (34%), Positives = 120/234 (51%), Gaps = 30/234 (12%)

Query: 30  SVELIHRDSPKSPFYNSS---ETP--YQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           SV L HR  P +P  +S+   + P   +RLR    R+ + L   +    +S    +    
Sbjct: 55  SVPLAHRHGPCAPKGSSATDKKKPSFAERLRSDRARADHILRKASGRRMMSEGGGAS--- 111

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       ++  Y++ + IGTP  ++  + DTGSDL W QC+PC  S CY Q  PLFDP
Sbjct: 112 IPTYLGGFVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDP 171

Query: 138 KMSSTYKSLPCSSSQCASL----------NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
             SST+ ++PC+S  C  L          N  S     C Y++ YG+G+ + G  +TET+
Sbjct: 172 SKSSTFATIPCASDACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETL 231

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            LGS+      +    FGCG++  G ++ K  G++GLGG   SL+SQ  +   G
Sbjct: 232 ALGSS----AVVKSFRFGCGSDQHGPYD-KFDGLLGLGGAPESLVSQTASVYGG 280


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 96/284 (33%), Positives = 141/284 (49%), Gaps = 37/284 (13%)

Query: 6   SCVFILFFLCFYVVSPI------------EAQTGGFSVELIHRDSPKSPFYNSSETPYQR 53
           S +F LF L  ++  P+            + +  GF   LIH  SP+SPFY  + TP + 
Sbjct: 8   SAIFRLFLLILHIPFPLSSSFSLPLKELAKGKAYGFKAPLIHWSSPESPFYEPNLTPGEL 67

Query: 54  LRDALTRSLNRLNHFNQ--NSSISSSK---ASQADIIPNNANYLIRISIGTPPTERLAVA 108
           +R ++  S  R +   +  +S IS+S+    S+  II  +  Y+++ +IG+PP E  A+ 
Sbjct: 68  MRASVRTSRARGDRIRKIRSSGISNSRKYPVSRISII--DKVYVMKFNIGSPPVETYAIP 125

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS--------LNQKS 160
           DTGS+++W QC     + CY Q  PLF+P  SSTY    C   +C          L  KS
Sbjct: 126 DTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHRECKQALWGLGEYLGCKS 185

Query: 161 CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP-GITFGCGTNN----GGLFN 215
              V C+Y +SY D SFS G ++T+ +T      +       + FGCG NN    G   N
Sbjct: 186 SVQV-CRYHISYEDHSFSEGTISTDIITFPEHIAEFGNYSLRMFFGCGYNNSETPGQDPN 244

Query: 216 SKTT-GIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGT 258
           S T  G+VGLG    SL+ Q+     G     +STPD+   +GT
Sbjct: 245 SFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKPNGT 285


>gi|358346736|ref|XP_003637421.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
 gi|355503356|gb|AES84559.1| Aspartic proteinase nepenthesin-1, partial [Medicago truncatula]
          Length = 280

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/200 (40%), Positives = 107/200 (53%), Gaps = 17/200 (8%)

Query: 20  SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDA-----LTRSLNRLNHFNQNSSI 74
           SP  + T   S++L  R S  S     S T  +  RD+     +T  LN+  + ++ S  
Sbjct: 61  SPFTSSTSTLSLQLHSRASLSSHADYKSLTLSRLDRDSARVKYITTKLNQNFNTDKLSGP 120

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
             S  SQ      +  Y  RI IG PP++   V DTGSD+ W QC PC  + CY Q  P+
Sbjct: 121 IISGTSQG-----SGEYFSRIGIGEPPSQAYMVLDTGSDISWVQCAPC--ADCYRQADPI 173

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           F+P  S++Y  L C ++QC  L+Q  C   NC Y VSYGDGS++ G+  TETVT+G    
Sbjct: 174 FEPTASASYAPLSCEAAQCRYLDQSQCRNGNCLYQVSYGDGSYTVGDFVTETVTIGVNKV 233

Query: 195 QAVALPGITFGCGTNNGGLF 214
           + VAL     GCG NN GLF
Sbjct: 234 KNVAL-----GCGHNNEGLF 248


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 76/204 (37%), Positives = 106/204 (51%), Gaps = 30/204 (14%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS---KASQADIIPN--- 87
           +HRDS  SP+  ++ T +  +R+ L R   RL   +   S+  +   K+S  + + N   
Sbjct: 1   MHRDSADSPYRPANATVHGLVRNRLHRDELRLLSISSRISLGVAGIPKSSLTNPLKNTNP 60

Query: 88  -----------------NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
                            +  Y + + +GTPP     VADTGSD++W QC PC    CY Q
Sbjct: 61  FLQQDFETPLRSGLSDGSGEYFVSLGVGTPPRTVNMVADTGSDVLWLQCLPC--QSCYGQ 118

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
             PLF+P  SST++S+ C SS C  L  + C    C Y VSYGDGSF+ G  +TET++ G
Sbjct: 119 TDPLFNPSFSSTFQSITCGSSLCQQLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSFG 178

Query: 191 STTGQAVALPGITFGCGTNNGGLF 214
           S    +VA+     GCG NN GLF
Sbjct: 179 SNAVNSVAI-----GCGHNNQGLF 197


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 107/207 (51%), Gaps = 28/207 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
           +SVE++HRD+       ++   Y+R       R+A     L R + R    N++      
Sbjct: 74  WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133

Query: 78  KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             ++ D          +   +  Y  RI +GTP  E+  V DTGSD+ W QCEPC   +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S+++ ++ C S+ C+ L+   C    C Y  SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G+T+   VA+     GCG  N GLF
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLF 273


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 83/229 (36%), Positives = 114/229 (49%), Gaps = 18/229 (7%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS- 80
           EA   G  + L H     SP    + + +  L   +  R   RLN     +S   +  S 
Sbjct: 64  EALKPGVKIRLDHIHGACSPLRPINSSSWIDLVSQSFERDNARLNTIRSKNSGPYTTMSN 123

Query: 81  ---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
              Q+       NY++    GTP    L + DTGSDL W QC+PC  + CY Q   +F+P
Sbjct: 124 LPLQSGTTVGTGNYIVTAGFGTPAKNSLLIIDTGSDLTWIQCKPC--ADCYSQVDAIFEP 181

Query: 138 KMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           K SS+YK+LPC S+ C  L     N   C    C Y ++YGDGS S G+ + ET+TLGS 
Sbjct: 182 KQSSSYKTLPCLSATCTELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTLGSD 241

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           + Q  A     FGCG  N GLF   ++G++GLG   +S  SQ ++   G
Sbjct: 242 SFQNFA-----FGCGHTNTGLFKG-SSGLLGLGQNSLSFPSQSKSKYGG 284


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/127 (50%), Positives = 77/127 (60%), Gaps = 3/127 (2%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y IR+S+GTPP     V DTGSD++W QC PC    CY Q   +FDP  SSTY +L C
Sbjct: 35  GEYFIRVSVGTPPRGMYLVMDTGSDILWLQCAPC--VSCYHQCDEVFDPYKSSTYSTLGC 92

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
           +S QC +L+   C G  C Y V YGDGSFS G  AT+ V+L ST+G   V L  I  GCG
Sbjct: 93  NSRQCLNLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCG 152

Query: 208 TNNGGLF 214
            +N G F
Sbjct: 153 HDNEGYF 159



 Score = 62.4 bits (150), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 44/124 (35%), Positives = 61/124 (49%), Gaps = 5/124 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T L     ++L     +      +       + CY+ + LS   VP VT+H
Sbjct: 262 VIIDSGTSVTRLQNAAYASLREAFRAGTSDLVLTTEFSLFDTCYNLSDLSSVDVPTVTLH 321

Query: 310 FRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F+G AD+KL  SN+ V V +    C  F G T    I GNI Q  F V YD     V F 
Sbjct: 322 FQGGADLKLPASNYLVPVDNSSTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQVGFV 380

Query: 368 PTDC 371
           P+ C
Sbjct: 381 PSQC 384


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/155 (47%), Positives = 89/155 (57%), Gaps = 18/155 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           YL+ ++IGTPP       DTGSDLIWTQC+PC    C+ Q  P FD   SST   LPC S
Sbjct: 35  YLVHLAIGTPPQPVQLTLDTGSDLIWTQCKPC--VSCFDQPLPYFDTSRSSTNALLPCES 92

Query: 151 SQC---------ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           +QC           LNQ   +   C Y  SYGD S + G LA +  T  + T    +LPG
Sbjct: 93  TQCKLDPTVTVCVKLNQTVQT---CAYYTSYGDNSVTIGLLAADKFTFVAGT----SLPG 145

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           +TFGCG NN G+FNS  TGI G G G +SL SQ++
Sbjct: 146 VTFGCGLNNTGVFNSNETGIAGFGRGPLSLPSQLK 180



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGT++T LP      +    ++ I+   V         C+S  S ++  VP++ +HF
Sbjct: 264 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 323

Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N  V YD++   +S
Sbjct: 324 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 381

Query: 366 FKPTDCTK 373
           F    C K
Sbjct: 382 FVAAQCDK 389


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 71/156 (45%), Positives = 98/156 (62%), Gaps = 8/156 (5%)

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           P    YL+ ++IGTPP    A+ADTGSDLIWTQC PC  SQC+ Q +PL++P  S+T+  
Sbjct: 27  PTAGEYLMALAIGTPPLPYQAIADTGSDLIWTQCAPC-TSQCFRQPTPLYNPSSSTTFAV 85

Query: 146 LPCSSSQ--CASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           LPC+SS   CA+    + +    G  C Y+V+YG G +++    +ET T GST      +
Sbjct: 86  LPCNSSLSVCAAALAGTGTAPPPGCACTYNVTYGSG-WTSVFQGSETFTFGSTPAGHARV 144

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           PGI FGC T + G   S  +G+VGLG G +SL+SQ+
Sbjct: 145 PGIAFGCSTASSGFNASSASGLVGLGRGRLSLVSQL 180



 Score = 64.3 bits (155), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 40/129 (31%), Positives = 66/129 (51%), Gaps = 8/129 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S++   P  D +    L+LC+   S +     +P 
Sbjct: 264 LIIDSGTTITLLGNTAYQQVRAAVVSLVTL-PTTDGSADTGLDLCFMLPSSTSAPPAMPS 322

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
           +T+HF GAD+ L   ++ +     + C   +  T+  V I GN  Q N  + YDI Q+T+
Sbjct: 323 MTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHILYDIGQETL 382

Query: 365 SFKPTDCTK 373
           SF P  C+ 
Sbjct: 383 SFAPAKCSA 391


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 110/411 (26%), Positives = 177/411 (43%), Gaps = 73/411 (17%)

Query: 30  SVELIHRDSPKSPFYNS-SETPY---QRLR-DALTRSLNRLNHFNQNSSISSSKASQADI 84
           S++++H+  P     N  S   +    +LR D++   L++++       + +   +Q+ I
Sbjct: 69  SLQVLHKYGPCMQVLNDRSHVEFLLQDQLRVDSIQARLSKISGHGIFEEMVTKLPAQSGI 128

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                NY++ + +GTP  +   V DTGS + WTQC+PC  S CY Q    FDP  S++Y 
Sbjct: 129 AIGTGNYVVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGS-CYPQKEQKFDPTKSTSYN 187

Query: 145 SLPCSSSQCASL--NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           ++ CSS+ C  L  +++ CS  N  C Y + YGD S+S G  ATET+T+ S+        
Sbjct: 188 NVSCSSASCNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTISSSD----VFT 243

Query: 201 GITFGCGTNNGGLFNSK----------------------------------TTGIVGLGG 226
              FGCG +N GLF                                     +TG +  GG
Sbjct: 244 NFLFGCGQSNNGLFGQAAGLLGLSSSSVSLPSQTAEKYQKQFSYCLPSTPSSTGYLNFGG 303

Query: 227 -----GDISLISQMRTTIAGNQRLGVS--------------TPDIVIDSGTTLTFLPQGY 267
                   + IS   ++  G   +G+S              T   +IDSGT +T LP   
Sbjct: 304 KVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTSGAIIDSGTVITRLPPTA 363

Query: 268 NSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA-DVKLSRSNFFV 324
              L       +   P  +    L+ CY F++ + V  P+V++ F+G  +V +  S    
Sbjct: 364 YKALKEAFDEKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGGVEVDIDASGILY 423

Query: 325 KVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            V+   +VC  F    +     I+GN  Q  + V YD  +  + F    C+
Sbjct: 424 LVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAGACS 474


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 117/234 (50%), Gaps = 27/234 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
           SV L+HR  P +P   S   P   +RLR    R+   + +       ++  S  A     
Sbjct: 18  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 77

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       N+  Y++ + IGTP  ++  + DTGSDL W QC+PC   +CY Q  PLFDP
Sbjct: 78  IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 137

Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
             SS+Y S+PC S  C  L   +    C+GV+      C+Y + YG+ + + G  +TET+
Sbjct: 138 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 197

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G
Sbjct: 198 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 246



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 7/125 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQV--PEVT 307
           +VIDSGT +T LP    + L S   S +    +  P+  G L+ CY F   + V  P ++
Sbjct: 319 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTIS 378

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           + F G A + L+     +   +  +     G  N++ I GN+ Q  F V YD  + TV F
Sbjct: 379 LTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGF 436

Query: 367 KPTDC 371
           +   C
Sbjct: 437 RAGAC 441


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 132/295 (44%), Gaps = 68/295 (23%)

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C S  C  L+   CS    C Y+  YGD S + G LA +T T  S TG+ V+L    FGC
Sbjct: 21  CDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFGC 80

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------------------------ 242
           G NN G FN    G++GLGGG  SLISQ+     G                         
Sbjct: 81  GHNNTGGFNDHEMGLIGLGGGPTSLISQIGPLFGGKKFSQCLVPFLTDIKISSRMSFGKG 140

Query: 243 --------------QR-----------LGVSTPD-------------IVIDSGTTLTFLP 264
                         QR           LG+S  D             +++DSGT    LP
Sbjct: 141 SQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYLPMNSTIEKGNMLVDSGTPPNILP 200

Query: 265 QG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNFF 323
           Q  Y+   + V +++       DP+   +LCY   +  + P +T HF GA++ L+    F
Sbjct: 201 QQLYDRVYVEVKNNVPLELITNDPSLGPQLCYRTQTNLKGPTLTYHFEGANLLLTPIQTF 260

Query: 324 V---KVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
           +     ++ + C      TNS   +YGN  Q+N+L+G+D+++Q VSFK TDCTKQ
Sbjct: 261 IPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQVVSFKATDCTKQ 315


>gi|297744129|emb|CBI37099.3| unnamed protein product [Vitis vinifera]
          Length = 299

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 98/333 (29%), Positives = 145/333 (43%), Gaps = 91/333 (27%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  GF V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +
Sbjct: 38  EKNGFRVSLRHVDS------GGNYTKFERLQRAVKRGRLRLQRLSAKTA-SFEPSVEAPV 90

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
              N  +L+ ++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDP+ SS++ 
Sbjct: 91  HAGNGEFLMNLAIGTPAETYSAIMDTGSDLIWTQCKPC--KVCFDQPTPIFDPEKSSSFS 148

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            LPCSS     L   S  GV                 LATET T G  +     +  I F
Sbjct: 149 KLPCSS----DLYHSSTQGV-----------------LATETFTFGDAS-----VSKIGF 182

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP 264
           GCG +N G   S+  G+          ISQM+  +                SG+T     
Sbjct: 183 GCGEDNRGRAYSQGAGL---------FISQMKLDVDA--------------SGST----- 214

Query: 265 QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSN 321
                                     LELC++     S   VP++  HF G D+KL + N
Sbjct: 215 -------------------------ELELCFTLPPDGSPVDVPQLVFHFEGVDLKLPKEN 249

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFL 354
           + ++ S   V  +  G ++ + I+GN  Q N +
Sbjct: 250 YIIEDSALRVICLTMGSSSGMSIFGNFQQQNIV 282


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 116/419 (27%), Positives = 173/419 (41%), Gaps = 85/419 (20%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDALTRSLNRLNHFNQNSSISSSKASQADI- 84
           G ++ L+HR  P SP  +  +  ++    RD L R+ N     +   + S+ +  Q+ + 
Sbjct: 58  GATLPLVHRHGPCSPVMSKEKPSHEETLGRDQL-RAANIHAKLSSPRNSSAKELQQSGVT 116

Query: 85  IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP ++        Y+I +S+GTP   ++   DTGSD+ W QC PC    C  Q   LFDP
Sbjct: 117 IPTSSGYSLGTPEYVITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDP 176

Query: 138 KMSSTYKSLPCSSSQCASLNQKS--CSGVNCQYSVSYGD-----GSFSNGNLATET---- 186
             S+TY +  CSS+QCA L  +   C   +CQY V Y D     G++ +  L   T    
Sbjct: 177 AKSATYSAFSCSSAQCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAV 236

Query: 187 ---------------------VTLGSTTGQAVALPGITFG---------CGTNNGGLFNS 216
                                + LG  T   V+    T+G           ++ GG    
Sbjct: 237 KNFQFGCSHRANGFVGQLDGLMGLGGDTESLVSQTAATYGKAFSYCLPPSSSSAGGFLT- 295

Query: 217 KTTGIVGLGGGDISLISQMRT------------------TIAG---NQRLGVSTPDIVID 255
                +G   G  S     RT                  T+AG   N    V +   V+D
Sbjct: 296 -----LGAAAGGTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVFSGASVVD 350

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RG 312
           SGT +T LP      L +     ++A P A P G L+ C+ F+ +   +VP VT+ F RG
Sbjct: 351 SGTVITQLPPTAYQALRTAFKKEMKAYPSAAPVGILDTCFDFSGIKTVRVPVVTLTFSRG 410

Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           A + L  S  F         +   G T    I GN+ Q  F + +D+   T+ F+P  C
Sbjct: 411 AVMDLDVSGIFYAGCLAFTATAQDGDTG---ILGNVQQRTFEMLFDVGGSTLGFRPGAC 466


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/153 (45%), Positives = 96/153 (62%), Gaps = 9/153 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y+++IS+GTPP +  A+ DTGSDL W QC PC  ++C+ Q  PLF P  SS+Y +  
Sbjct: 5   SGEYVLQISLGTPPQQFSAIVDTGSDLCWVQCAPC--ARCFEQPDPLFIPLASSSYSNAS 62

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C+ S C +L + +CS  N C YS SYGDGS + G+ A ETVTL  +T     L  I FGC
Sbjct: 63  CTDSLCDALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGST-----LARIGFGC 117

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           G N  G F +   G++GLG G +SL SQ+ ++ 
Sbjct: 118 GHNQEGTF-AGADGLIGLGQGPLSLPSQLNSSF 149



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 66/128 (51%), Gaps = 9/128 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG-SLELCYSFNSLSQ----VPEV 306
           +++DSGTT+T+        +L+ +   I + P ADPT   L LCY  +S+S     +P +
Sbjct: 226 VILDSGTTITYWRLAAFIPILAELRRQI-SYPEADPTPYGLNLCYDISSVSASSLTLPSM 284

Query: 307 TIHFRGADVKLSRSNFFVKVSE--DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           T+H    D ++  SN +V V    + VC+     ++   I GN+ Q N L+  D+    V
Sbjct: 285 TVHLTNVDFEIPVSNLWVLVDNFGETVCTAMS-TSDQFSIIGNVQQQNNLIVTDVANSRV 343

Query: 365 SFKPTDCT 372
            F  TDC+
Sbjct: 344 GFLATDCS 351


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 8/133 (6%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           I   + +Y  RI +GTP      VADTGSD+ W QC PC   +CY Q  P+F+P +SS++
Sbjct: 74  IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 131

Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           K L C+SS C  L  K CS  N C Y VSYGDGSF+ G+ +TET++ G    ++VA+   
Sbjct: 132 KPLACASSICGKLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 188

Query: 203 TFGCGTNNGGLFN 215
             GCG NN GLF+
Sbjct: 189 --GCGRNNQGLFH 199


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 80/219 (36%), Positives = 114/219 (52%), Gaps = 28/219 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP-- 86
             + ++HRD+   P   +    + R R A         H  Q  S+ S+ A+ AD++   
Sbjct: 30  LHIPVVHRDAVFPPRRGAPPGSF-RCRHAAP-------HTAQLESLHSATAA-ADLLRSP 80

Query: 87  -------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
                  ++  Y   I +G PPT  L V DTGSDLIW QC PC   +CY Q +PL+DP+ 
Sbjct: 81  VMSGVPFDSGEYFAVIGVGDPPTHALVVIDTGSDLIWLQCLPC--RRCYRQVTPLYDPRN 138

Query: 140 SSTYKSLPCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           S T++ +PC+S QC   L    C      C Y V YGDGS S+G+LAT+T+ L   T   
Sbjct: 139 SKTHRRIPCASPQCRGVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDTLVLPDDT--- 195

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             +  +T GCG +N GL  S   G++G G G +S  +Q+
Sbjct: 196 -RVHNVTLGCGHDNEGLLAS-AAGLLGAGRGQLSFPTQL 232



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 31/136 (22%), Positives = 62/136 (45%), Gaps = 13/136 (9%)

Query: 252 IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLS-----QV 303
           +V+DSGT ++ F    Y +   + +S    A  + + +     + CY  +        +V
Sbjct: 317 VVVDSGTAISRFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRV 376

Query: 304 PEVTIHFRGA-DVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
           P + +HF  A D+ L ++N+ + V         C   +   + + + GN+ Q  F V +D
Sbjct: 377 PSIVLHFAAAADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFD 436

Query: 359 IEQQTVSFKPTDCTKQ 374
           +E+  + F P  C+ +
Sbjct: 437 VERGRIGFTPNGCSGE 452


>gi|116789442|gb|ABK25248.1| unknown [Picea sitchensis]
          Length = 366

 Score =  125 bits (315), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 107/207 (51%), Gaps = 28/207 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL------RDA-----LTRSLNRLNHFNQNSSISSS 77
           +SVE++HRD+       ++   Y+R       R+A     L R + R    N++      
Sbjct: 74  WSVEVVHRDALLLKNAANATASYERRLKEKLRREAVRVRGLERQIERTLTLNKDPVNRYE 133

Query: 78  KASQAD----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
             ++ D          +   +  Y  RI +GTP  E+  V DTGSD+ W QCEPC   +C
Sbjct: 134 NVAEVDADFGGEVVSGMEQGSGEYFTRIGVGTPTREQYMVLDTGSDVAWIQCEPC--REC 191

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+F+P  S+++ ++ C S+ C+ L+   C    C Y  SYGDGS+S G+ ATET+
Sbjct: 192 YSQADPIFNPSYSASFSTVGCDSAVCSQLDAYDCHSGGCLYEASYGDGSYSTGSFATETL 251

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G+T+   VA+     GCG  N GLF
Sbjct: 252 TFGTTSVANVAI-----GCGHKNVGLF 273


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/164 (42%), Positives = 91/164 (55%), Gaps = 12/164 (7%)

Query: 56  DALTRSLNRLNHFNQNSSISSSKASQADIIPN----NANYLIRISIGTPPTERLAVADTG 111
           + LTRS +R        +   S+  QA ++      +  Y IRIS+GTPP     V DTG
Sbjct: 24  NGLTRSRSR-----DRQTKVPSQDFQAPVVSGLSLGSGEYFIRISVGTPPRRMYLVMDTG 78

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVS 171
           SD++W QC PC    CY Q   +FDP  SSTY +L CS+ QC +L+  +C    C Y V 
Sbjct: 79  SDILWLQCAPC--VNCYHQSDAIFDPYKSSTYSTLGCSTRQCLNLDIGTCQANKCLYQVD 136

Query: 172 YGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCGTNNGGLF 214
           YGDGSF+ G   T+ V+L ST+G   V L  I  GCG +N G F
Sbjct: 137 YGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGYF 180



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 45/127 (35%), Positives = 62/127 (48%), Gaps = 11/127 (8%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLS--QVPEV 306
           ++IDSGT++T L    N+   S+  +         PT    L   CY  + L+   VP V
Sbjct: 283 VIIDSGTSVTRLQ---NAAYASLRDAFRAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTV 339

Query: 307 TIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           T+HF+G  D+KL  SN+ + V + +  C  F G T    I GNI Q  F V YD     V
Sbjct: 340 TLHFQGGTDLKLPASNYLIPVDNSNTFCLAFAGTTGP-SIIGNIQQQGFRVIYDNLHNQV 398

Query: 365 SFKPTDC 371
            F P+ C
Sbjct: 399 GFVPSQC 405


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 97/352 (27%), Positives = 156/352 (44%), Gaps = 77/352 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
             +        +GIVGLG    SL++Q                                 
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGG 220

Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
                    I+GN          Q  G+   D +I           D+ + ++FL  G  
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
             +   ++  + A P+A P    +LC+  +  S   P++   FR GA + ++ SN+ +  
Sbjct: 281 QAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVAASNYLLDY 340

Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
               VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/210 (34%), Positives = 110/210 (52%), Gaps = 35/210 (16%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ++++HRDS  S   +++    + L++ L R   R++  N    +++   S+A++ P N +
Sbjct: 70  LQVVHRDSLSSS--SNTSLVKEILQERLKRDAARVDSINARVQLAAMGVSKAEMKPLNGS 127

Query: 91  -------------------------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
                                    Y  R+ +GTPP     V DTGSD++W QC PC  +
Sbjct: 128 SIDARFDAKDFSSSIISGLAQGSGEYFTRLGVGTPPRYTYMVLDTGSDIMWIQCLPC--A 185

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLAT 184
           +CY Q  PLF+P  SSTY+ +PC++  C  L+   C     C+Y VSYGDGSF+ G+ +T
Sbjct: 186 KCYGQTDPLFNPAASSTYRKVPCATPLCKKLDISGCRNKRYCEYQVSYGDGSFTVGDFST 245

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
           ET+T      + VAL     GCG +N GLF
Sbjct: 246 ETLTFRGQVIRRVAL-----GCGHDNEGLF 270



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 43/133 (32%), Positives = 62/133 (46%), Gaps = 22/133 (16%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL---------ELCYSFNSLS- 301
           ++IDSGT++T L         S  S+M +A  V   TG+L         + CY  + L  
Sbjct: 372 VIIDSGTSVTRLVD-------SAYSTMRDAFRVG--TGNLKSAGGFSLFDTCYDLSGLKT 422

Query: 302 -QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
            +VP +  HF+G A + L  +N+ + V S    C  F G T  + I GNI Q  + V +D
Sbjct: 423 VKVPTLVFHFQGGAHISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFD 482

Query: 359 IEQQTVSFKPTDC 371
                V FK   C
Sbjct: 483 SLANRVGFKAGSC 495


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 63/133 (47%), Positives = 83/133 (62%), Gaps = 8/133 (6%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           I   + +Y  RI +GTP      VADTGSD+ W QC PC   +CY Q  P+F+P +SS++
Sbjct: 7   IAGGSGDYFARIGVGTPARSVYMVADTGSDVSWLQCSPC--RKCYRQQDPIFNPSLSSSF 64

Query: 144 KSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           K L C+SS C  L  K CS  N C Y VSYGDGSF+ G+ +TET++ G    ++VA+   
Sbjct: 65  KPLACASSICGKLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSFGEHAVRSVAM--- 121

Query: 203 TFGCGTNNGGLFN 215
             GCG NN GLF+
Sbjct: 122 --GCGRNNQGLFH 132


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score =  125 bits (314), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 102/360 (28%), Positives = 160/360 (44%), Gaps = 53/360 (14%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS--ISSSKA 79
           +E  +   SV L+HR  P +     S+ P     + L  S  R N+    +S  ++S+  
Sbjct: 48  LEPSSATLSVPLVHRYGPCAA-SQYSDMPTPSFSETLRHSRARTNYIKSRASTGMASTPD 106

Query: 80  SQADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             A  +P       ++  Y++ +  GTP   ++ + DTGSD+ W QC PC  ++CY Q  
Sbjct: 107 DAAVTVPTRLGGFVDSLEYMVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKD 166

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           PLFDP  SSTY  + C +  C  L     N  +  G  C Y V YGDGS + G  + ET+
Sbjct: 167 PLFDPSKSSTYAPIACGADACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETI 226

Query: 188 TLGSTTGQAVALPGIT-----FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI--- 239
           T           PGIT     FGCG +  G  + K  G++GLGG   SL+ Q  +     
Sbjct: 227 TFA---------PGITVKDFHFGCGHDQRGP-SDKFDGLLGLGGAPESLVVQTASVYGGA 276

Query: 240 -----------AGNQRLGVSTPDIVIDSG----TTLTFLPQGYNSNLLSVMSSMIEAQPV 284
                      AG   LGV  P    ++     T +  LP    S ++++    +  +P+
Sbjct: 277 FSYCLPALNSEAGFLALGVR-PSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPL 335

Query: 285 ADPTGSLE---LCYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI-VCSVFKGITN 340
             P  +     L  S   ++++PE   +   A ++ + + + +  SED   C  F G +N
Sbjct: 336 DIPRSAFRGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASEDFDTCYNFTGYSN 395


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 61/127 (48%), Positives = 78/127 (61%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP+    V DTGSD+ W QC PC  ++CY Q  P+F+P  S+++ SL 
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPIFEPTSSASFTSLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TETVTLGST     +L  I  GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260

Query: 208 TNNGGLF 214
            NN GLF
Sbjct: 261 HNNEGLF 267



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGT +T L     + L             A      + CY  +S S  +VP V+ H
Sbjct: 363 IIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFH 422

Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G ++ L   N+ + V SE   C  F    +++ I GN  Q    VG+D+    V F 
Sbjct: 423 FANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFS 482

Query: 368 PTDC 371
           P  C
Sbjct: 483 PNKC 486


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 73/187 (39%), Positives = 105/187 (56%), Gaps = 10/187 (5%)

Query: 29  FSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           F  ELI+R+   SP  + + +TP +    A+ R   R     ++  ++  +  +  +   
Sbjct: 28  FRAELIYREHQSSPLRSETLKTPSEIFIAAVKRGHERRARLAKHV-LAGDQLFETPVASG 86

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  YLI IS G PP +  A+ DTGSDL W QC PC    CY   S  FDP  S++YK+L 
Sbjct: 87  NGEYLIDISYGNPPQKSTAIVDTGSDLNWVQCLPC--KSCYETLSAKFDPSKSASYKTLG 144

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C S+ C  L  +SC+  +CQY   YGDGS ++G L+T+ VT+G  TG+   +P + FGCG
Sbjct: 145 CGSNFCQDLPFQSCA-ASCQYDYMYGDGSSTSGALSTDDVTIG--TGK---IPNVAFGCG 198

Query: 208 TNNGGLF 214
            +N G F
Sbjct: 199 NSNLGTF 205



 Score = 62.0 bits (149), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 38/126 (30%), Positives = 66/126 (52%), Gaps = 10/126 (7%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ--VPE 305
           +++DSGTTLT+L    +N     +++++  A P  +  GS   LE C+S   ++    P 
Sbjct: 305 LILDSGTTLTYLDVDAFNP----MVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPT 360

Query: 306 VTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           V  HF GADV L+  N F+ +  +    +    +    I+GNI Q N ++ +D+  + + 
Sbjct: 361 VVFHFNGADVALAPDNTFIALDFEGTTCLAMASSTGFSIFGNIQQLNHVIVHDLVNKRIG 420

Query: 366 FKPTDC 371
           FK  +C
Sbjct: 421 FKSANC 426


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 117/234 (50%), Gaps = 27/234 (11%)

Query: 30  SVELIHRDSPKSPFYNSSETP--YQRLRDALTRS---LNRLNHFNQNSSISSSKASQADI 84
           SV L+HR  P +P   S   P   +RLR    R+   + +       ++  S  A     
Sbjct: 98  SVPLVHRHGPCAPSAASGGKPSLAERLRRDRARTNYIVTKATGGRTAATALSDAAGGGTS 157

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           IP       N+  Y++ + IGTP  ++  + DTGSDL W QC+PC   +CY Q  PLFDP
Sbjct: 158 IPTFLGDSVNSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDP 217

Query: 138 KMSSTYKSLPCSSSQCASLNQKS----CSGVN------CQYSVSYGDGSFSNGNLATETV 187
             SS+Y S+PC S  C  L   +    C+GV+      C+Y + YG+ + + G  +TET+
Sbjct: 218 SSSSSYASVPCDSDACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETL 277

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G
Sbjct: 278 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 326



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 59/125 (47%), Gaps = 7/125 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT--GSLELCYSFNSLSQV--PEVT 307
           +VIDSGT +T LP    + L S   S +    +  P+  G L+ CY F   + V  P ++
Sbjct: 399 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGGVLDTCYDFTGHANVTVPTIS 458

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           + F G A + L+     +   +  +     G  N++ I GN+ Q  F V YD  + TV F
Sbjct: 459 LTFSGGATIDLAAPAGVLV--DGCLAFAGAGTDNAIGIIGNVNQRTFEVLYDSGKGTVGF 516

Query: 367 KPTDC 371
           +   C
Sbjct: 517 RAGAC 521


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 71/167 (42%), Positives = 93/167 (55%), Gaps = 23/167 (13%)

Query: 90  NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           NY+  IS+G    +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY +
Sbjct: 143 NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 200

Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           + C++S CA           S          C Y+++YGDGSFS G LAT+TV LG  + 
Sbjct: 201 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALGGAS- 259

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
               L G  FGCG +N GLF   T G++GLG  ++SL+SQ  +   G
Sbjct: 260 ----LGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGG 301


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 121/418 (28%), Positives = 169/418 (40%), Gaps = 98/418 (23%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           ++LIH +S  SP YNS +T +      + +            + S+   S     P    
Sbjct: 45  IKLIHHESSLSP-YNSKDTIWDHYSHKILKQ-----------TFSNDYISNLVPSPRYVV 92

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           +L+  SIG PP  +LAV DTGS L W  C PC  S C  Q  P+FDP  SSTY +L CS 
Sbjct: 93  FLMNFSIGEPPIPQLAVMDTGSSLTWVMCHPC--SSCSQQSVPIFDPSKSSTYSNLSCSE 150

Query: 151 -SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-- 207
            ++C  +N +      C YSV Y     S G  A E +TL +     + +P + FGCG  
Sbjct: 151 CNKCDVVNGE------CPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRK 204

Query: 208 --TNNGGLFNSKTTGIVGLGGGDISL-----------ISQMRTTIAGNQRL--------- 245
              ++ G       G+ GLG G  SL           I  +R T     RL         
Sbjct: 205 FSISSNGYPYQGINGVFGLGSGRFSLLPSFGKKFSYCIGNLRNTNYKFNRLVLGDKANMQ 264

Query: 246 ----------------------GVSTPDI----------------VIDSGTTLTFLPQGY 267
                                 G    DI                +IDSG   T+L + Y
Sbjct: 265 GDSTTLNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTK-Y 323

Query: 268 NSNLLSV-MSSMIEAQPV---ADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 319
              +LS  + +++E   V    D      LCYS      LS  P VT HF  GA + L  
Sbjct: 324 GFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLDV 383

Query: 320 SNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           ++ F++ +E+  C      + F     S    G + Q N+ VGYD+ +  V F+  DC
Sbjct: 384 TSMFIQTTENEFCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 111/358 (31%), Positives = 154/358 (43%), Gaps = 79/358 (22%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY ++I +GTP      + DTGS L W QC+PC    C++Q  P+F P +S TYK+L 
Sbjct: 104 SGNYYVKIGVGTPAKYFSMIVDTGSSLSWLQCQPCV-IYCHVQVDPIFTPSVSKTYKALS 162

Query: 148 CSSSQCA-----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CSSSQC+     +LN   CS     C Y  SYGD SFS G L+ + +TL   T  A    
Sbjct: 163 CSSSQCSSLKSSTLNAPGCSNATGACVYKASYGDTSFSIGYLSQDVLTL---TPSAAPSS 219

Query: 201 GITFGCGTNNGGLF----------------------------------------NSKTTG 220
           G  +GCG +N GLF                                        NS  +G
Sbjct: 220 GFVYGCGQDNQGLFGRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSSVSG 279

Query: 221 IVGLGGG-------------------DISLISQMRTTIAGNQRLGVSTPDI----VIDSG 257
            + +G                      +  +     T+AG + LGVS        +IDSG
Sbjct: 280 FLSIGASSLSSSPYKFTPLVKNPKIPSLYFLGLTTITVAG-KPLGVSASSYNVPTIIDSG 338

Query: 258 TTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHFRG-A 313
           T +T LP   YN+   S +  M +    A     L+ C+  S   +S VPE+ I FRG A
Sbjct: 339 TVITRLPVAIYNALKKSFVMIMSKKYAQAPGFSILDTCFKGSVKEMSTVPEIRIIFRGGA 398

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            ++L   N  V++ +   C      +N + I GN  Q  F V YD+    + F P  C
Sbjct: 399 GLELKVHNSLVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDVANSKIGFAPGGC 456


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score =  125 bits (313), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 128/462 (27%), Positives = 199/462 (43%), Gaps = 106/462 (22%)

Query: 5   LSCVFILFFLCFYVV------SPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL 58
           +S  + LFF     +      S ++ +     ++L H  S KSP  NS+   +  +    
Sbjct: 1   MSLFWFLFFSAHLAIASSLKDSGLKHKQPDMQLKLYHMTSLKSP-PNSTSLLFAYM---F 56

Query: 59  TRSLNRLNHFNQN-SSISSSKASQADIIPNNA-------------NYLIRISIGTPPTER 104
            +   R+ +F+   +  S + AS   + P  A             NY +++ +G+P    
Sbjct: 57  AKDEERIRYFHSRLAKNSDANASSKKVGPKLAGIPLKSGLSMGSGNYYVKMGLGSPTKYY 116

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQK 159
             + DTGS   W QC+PC    C++Q+ P+F+P  S TYK++PCSSSQC     A+LN+ 
Sbjct: 117 TMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVPCSSSQCSSLKSATLNEP 175

Query: 160 SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF--- 214
           +CS  +  C Y  SYGD SFS G L+ + +TL  T  Q   L    +GCG +N GLF   
Sbjct: 176 TCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQT--LSSFVYGCGQDNQGLFGRT 231

Query: 215 ------------------------------------NSKTTGIVGLGGGDIS-------- 230
                                               NS   G + +G   ++        
Sbjct: 232 DGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGFLSIGTSSLTPSSSYKFT 291

Query: 231 ------------LISQMRTTIAGNQRLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSV 274
                        I     T+AG + LGV+        +IDSGT +T LP    + L + 
Sbjct: 292 PLLKNPNNPSLYFIDLESITVAG-RPLGVAASSYKVPTIIDSGTVITRLPTPVYTTLKNA 350

Query: 275 MSSMIEAQPVADPTGSL-ELCY--SFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSED 329
             +++  +    P  SL + C+  S   +S+V P++ I F+G AD++L   N  V++   
Sbjct: 351 YVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGGADLQLKGHNSLVELETG 410

Query: 330 IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           I C    G ++S+ I GN  Q    V YD+    V F P  C
Sbjct: 411 ITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 156/352 (44%), Gaps = 77/352 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
             +        +GIVGLG    SL++Q                                 
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGRNSALFLGSSAKLAGGG 220

Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
                    I+GN          Q  G+   D +I           D+ + ++FL  G  
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
             +   +++ + A P+A P    +LC+  +  S   P++   FR GA + +  +N+ +  
Sbjct: 281 QAVKKAVTAAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDY 340

Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
               VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 99/347 (28%), Positives = 137/347 (39%), Gaps = 88/347 (25%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
           V DTGSD++W QC PC   +CY Q  P+FDP+ SS+Y ++ C ++ C  L+   C     
Sbjct: 2   VLDTGSDVVWVQCAPC--RRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRG 59

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------- 214
            C Y V+YGDGS + G+  TET+T     G  VA   +  GCG +N GLF          
Sbjct: 60  ACMYQVAYGDGSVTAGDFVTETLTF--AGGARVAR--VALGCGHDNEGLFVAAAGLLGLG 115

Query: 215 ----------------------------------NSKTTGIVGLGGGDISLISQMRTTIA 240
                                              S  +  V  G G +   S   T + 
Sbjct: 116 RGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGSVGASSASFTPMV 175

Query: 241 GNQRL-------------------GVSTPDIVID-----------SGTTLTFLPQGYNSN 270
            N R+                   GV+  D+ +D           SGT++T L +   S 
Sbjct: 176 RNPRMETFYYVQLVGISVGGARVPGVAESDLRLDPSTGRGGVIVDSGTSVTRLARASYSA 235

Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYSFNS--LSQVPEVTIHFR-GADVKLSRSNFFVK 325
           L     +         P G    + CY      + +VP V++HF  GA+  L   N+ + 
Sbjct: 236 LRDAFRAAAAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEAALPPENYLIP 295

Query: 326 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           V S    C  F G    V I GNI Q  F V +D + Q V F P  C
Sbjct: 296 VDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 73/199 (36%), Positives = 102/199 (51%), Gaps = 24/199 (12%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
             S+ L H D+      +S++TP Q  +  L R   R+      ++++ S A ++     
Sbjct: 61  ALSLHLHHIDA-----LSSNKTPEQLFQLRLQRDAKRVEGVVALAALNQSHARRSGSSFS 115

Query: 84  ------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
                 +   +  Y  RI +GTP      V DTGSD++W QC PC   +CY Q  P+FDP
Sbjct: 116 SSIISGLAQGSGEYFTRIGVGTPARYVYMVLDTGSDVVWLQCAPC--RKCYTQADPVFDP 173

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             S TY  +PC +  C  L+   C+  N  CQY VSYGDGSF+ G+ +TET+T   T   
Sbjct: 174 TKSRTYAGIPCGAPLCRRLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRTRVT 233

Query: 196 AVALPGITFGCGTNNGGLF 214
            VAL     GCG +N GLF
Sbjct: 234 RVAL-----GCGHDNEGLF 247



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 57/123 (46%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T L +     L             A      + C+  + L++V  P V +H
Sbjct: 349 VIIDSGTSVTRLTRPAYIALRDAFRVGASHLKRAAEFSLFDTCFDLSGLTEVKVPTVVLH 408

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  +N+ + V      C  F G  + + I GNI Q  F V +D+    V F P
Sbjct: 409 FRGADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAP 468

Query: 369 TDC 371
             C
Sbjct: 469 RGC 471


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 6/159 (3%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  +  +   YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P F P 
Sbjct: 80  AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            S+TY+ +PC S  CA+L   +C   + C Y   YGD + + G LA+ET T G+     V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197

Query: 198 ALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
            +  + FGCG  N+G L NS  +G+VGLG G +SL+SQ+
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQL 234


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 60/128 (46%), Positives = 82/128 (64%), Gaps = 7/128 (5%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  RI IGTP  E+  V DTGSD++W QCEPC   +CY Q  P+F+P  S ++ ++
Sbjct: 4   GSGEYFTRIGIGTPTREQYMVLDTGSDVVWIQCEPC--RECYSQADPIFNPSSSVSFSTV 61

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C S+ C+ L+   C G  C Y VSYGDGS++ G+ ATET+T G+T+ Q VA+     GC
Sbjct: 62  GCDSAVCSQLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTFGTTSIQNVAI-----GC 116

Query: 207 GTNNGGLF 214
           G +N GLF
Sbjct: 117 GHDNVGLF 124



 Score = 46.2 bits (108), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 50/124 (40%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T L       L     +  +  P AD     + CY  ++L  V  P V  H
Sbjct: 226 IIIDSGTAVTRLQTSAYDALRDAFIAGTQHLPRADGISIFDTCYDLSALQSVSIPAVGFH 285

Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GA   L   N  + + S    C  F    +++ I GNI Q    V +D     V F 
Sbjct: 286 FSNGAGFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFA 345

Query: 368 PTDC 371
              C
Sbjct: 346 IDQC 349


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 69/159 (43%), Positives = 96/159 (60%), Gaps = 6/159 (3%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  +  +   YL+ ++IGTPP    A+ DTGSDLIWTQC PC    C  Q +P F P 
Sbjct: 80  AARILVAASQGEYLMDLAIGTPPLRYTAMVDTGSDLIWTQCAPC--VLCADQPTPYFRPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            S+TY+ +PC S  CA+L   +C   + C Y   YGD + + G LA+ET T G+     V
Sbjct: 138 RSATYRLVPCRSPLCAALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKV 197

Query: 198 ALPGITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
            +  + FGCG  N+G L NS  +G+VGLG G +SL+SQ+
Sbjct: 198 MVSDVAFGCGNINSGQLANS--SGMVGLGRGPLSLVSQL 234


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 70/157 (44%), Positives = 92/157 (58%), Gaps = 4/157 (2%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A++  ++ ++  YL+ + IGTP     A+ DTGSDLIWTQC PC    C  Q +P FDP 
Sbjct: 80  AARILVLASDGEYLMEMGIGTPARFYSAILDTGSDLIWTQCAPC--LLCVDQPTPYFDPA 137

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            SSTY+SL CS+  C +L    C    C Y   YGD + + G LA ET T G T    V 
Sbjct: 138 NSSTYRSLGCSAPACNALYYPLCYQKTCVYQYFYGDSASTAGVLANETFTFG-TNDTRVT 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           LP I+FGCG  N G   +  +G+VG G G +SL+SQ+
Sbjct: 197 LPRISFGCGNLNAGSL-ANGSGMVGFGRGSLSLVSQL 232



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 64/127 (50%), Gaps = 9/127 (7%)

Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           +IDSGTT+T+L +  Y +   + +  +    P+ D T +  L+ C+ +    +    +P+
Sbjct: 316 IIDSGTTITYLAEPAYYAVREAFVLYLNSTLPLLDVTETSVLDTCFQWPPPPRQSVTLPQ 375

Query: 306 VTIHFRGADVKLSRSNF-FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           + +HF GAD +L   N+  V  S   +C +    ++   I G+    NF V YD+E   +
Sbjct: 376 LVLHFDGADWELPLQNYMLVDPSTGGLC-LAMATSSDGSIIGSYQHQNFNVLYDLENSLL 434

Query: 365 SFKPTDC 371
           SF P  C
Sbjct: 435 SFVPAPC 441


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  124 bits (312), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 58/127 (45%), Positives = 81/127 (63%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +RI +G+PP  +  V D+GSD++W QC+PC  +QCY Q  PLFDP  S+++  + 
Sbjct: 40  SGEYFVRIGVGSPPRSQYMVIDSGSDIVWVQCKPC--TQCYHQTDPLFDPADSASFMGVS 97

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           CSS+ C  ++   C+   C+Y VSYGDGS + G LA ET+TLG T  Q VA+     GCG
Sbjct: 98  CSSAVCDQVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTLGRTVVQNVAI-----GCG 152

Query: 208 TNNGGLF 214
             N G+F
Sbjct: 153 HMNQGMF 159



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 45/159 (28%), Positives = 66/159 (41%), Gaps = 14/159 (8%)

Query: 220 GIVGLGGGDISL-ISQ--MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMS 276
           G+ GLG GD+ + IS+     T  GN         +V+D+GT +T  P            
Sbjct: 231 GLSGLGVGDMKVPISEDIFELTELGNG-------GVVMDTGTAVTRFPTVAYEAFRDAFI 283

Query: 277 SMIEAQPVADPTGSLELCYS-FNSLS-QVPEVTIHFRGADV-KLSRSNFFVKVSE-DIVC 332
                 P A      + CY+ F  LS +VP V+ +F G  +  L  +NF + V +    C
Sbjct: 284 DQTGNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPILTLPANNFLIPVDDAGTFC 343

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F    + + I GNI Q    +  D   + V F P  C
Sbjct: 344 FAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 107/423 (25%), Positives = 170/423 (40%), Gaps = 83/423 (19%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL--NHFNQNSSISSSKASQADII 85
           G   +L H DS +   +  +E   + +  +  R+  +L  +       +++  AS + ++
Sbjct: 30  GLRADLTHIDSGRG--FTRNELLRRMVLRSRARAAKQLCPSRSGTPVRVTAPVASGSHVV 87

Query: 86  PNNANYLIRISIGTPPTERLAV-ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                YLI   IGTP  +++A+  DTGSD++WTQC PC    C+ Q  P FD   S T  
Sbjct: 88  -GYTEYLIHFGIGTPRPQQVALEVDTGSDVVWTQCRPC--FDCFTQPLPRFDTSASDTVH 144

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            + C+   C +L   +C    C Y V+YGD S + G LA ++ T     G  V +P + F
Sbjct: 145 GVLCTDPICRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVF 204

Query: 205 GCGTNNGG----------------------------------LFNSKTTGIVGLGGGDIS 230
           GCG  N G                                  +F SK+T +  LGG    
Sbjct: 205 GCGQYNTGNFHSNETGIAGFGRGPLSLPRQLGVSSFSYCFTTIFESKSTPVF-LGGAPAD 263

Query: 231 LISQ----------------------MRTTIAGNQRLGVSTPDIV----------IDSGT 258
            +                        ++    G  RL V     V          IDSGT
Sbjct: 264 GLRAHATGPILSTPFLPNHPEYYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSGT 323

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLEL-CYSFNSLSQ-----VPEVTIHFR 311
            +T  P+    +L     + +     + + TG   L C+S  S+       VP++T+H  
Sbjct: 324 AITAFPRAVFRSLWEAFVAQVPLPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLHLE 383

Query: 312 GADVKLSRSNFFVKVSE-DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           GAD +L R N+  +  + D +C V     +   + GN  Q N  + +D+    +  +P  
Sbjct: 384 GADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEPAQ 443

Query: 371 CTK 373
           C K
Sbjct: 444 CDK 446


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 118/421 (28%), Positives = 180/421 (42%), Gaps = 83/421 (19%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSS------KASQAD 83
           S++++H+  P S       +      + L +  +R+   +   S S +      K + + 
Sbjct: 75  SLKVVHKHGPCSKLSQDEASAAPTHTEILLQDQSRVKSIHSRLSNSKTSGGKDVKVTDST 134

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
            IP        + NY++ + +GTP  +   + DTGSD+ WTQC+PC  S CY Q   +FD
Sbjct: 135 TIPAKDGSTVGSGNYIVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARS-CYKQKEQIFD 193

Query: 137 PKMSSTYKSLPCSSSQCASL-----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
           P  S++Y ++ CSSS C SL     N   C+   C Y + YGD SFS G   TE +TL S
Sbjct: 194 PSQSTSYTNISCSSSICNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTS 253

Query: 192 TTGQAVALPGITFGCGTNN----------------------------GGLFN------SK 217
           T     A   I FGCG NN                              +F+      S 
Sbjct: 254 TD----AFNNIYFGCGQNNQGLFGGSAGLLGLGRDKLSVVSQTAQKYNKIFSYCLPSSSS 309

Query: 218 TTGIVGLGG--GDISLISQMRTTIAG--------------NQRLGVS-----TPDIVIDS 256
           +TG +  GG     +  + + T  AG               ++L +S     T   +IDS
Sbjct: 310 STGFLTFGGSASKNAKFTPLSTISAGPSFYGLDFTGISVGGKKLAISASVFSTAGAIIDS 369

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGA 313
           GT +T LP    S L +   +++   P+      L+ CY F+S +   VP++   F  G 
Sbjct: 370 GTVITRLPPAAYSALRASFRNLMSKYPMTKALSILDTCYDFSSYTTISVPKIGFSFSSGI 429

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +V +  +      S   VC  F G +++  V I+GN+ Q    V YD     V F P  C
Sbjct: 430 EVDIDATGILYASSLSQVCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAPGGC 489

Query: 372 T 372
           +
Sbjct: 490 S 490


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  124 bits (311), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 102/207 (49%), Gaps = 31/207 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           FS++L  RDS     +N+    Y+ L    L+R  +R+         + S+  ++D+ P 
Sbjct: 76  FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131

Query: 87  -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
                               +  Y  R+ +G P      V DTGSD+ W QC+PC  + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+FDP+ SS++ SLPC S QC +L    C    C Y VSYGDGSF+ G   TET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVTETL 249

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G++      +  +  GCG +N GLF
Sbjct: 250 TFGNSG----MINDVAVGCGHDNEGLF 272



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 12/128 (9%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PE 305
           I++DSGT +T L  Q YN+    +  + +   P    T    L   CY  +S S+V  P 
Sbjct: 368 IIVDSGTAITRLQTQAYNT----LRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423

Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V+  F G   ++L   N+ + V S    C  F   T+S+ I GN+ Q    V YD+    
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSV 483

Query: 364 VSFKPTDC 371
           V F P  C
Sbjct: 484 VGFSPHKC 491


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 61/127 (48%), Positives = 77/127 (60%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP+    V DTGSD+ W QC PC  ++CY Q  P F+P  S+++ SL 
Sbjct: 148 SGEYFSRVGIGRPPSPVYMVLDTGSDVSWVQCAPC--AECYEQTDPXFEPTSSASFTSLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TETVTLGST     +L  I  GCG
Sbjct: 206 CETEQCKSLDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTLGST-----SLGNIAIGCG 260

Query: 208 TNNGGLF 214
            NN GLF
Sbjct: 261 HNNEGLF 267



 Score = 53.1 bits (126), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGT +T L     + L             A      + CY  +S S  +VP V+ H
Sbjct: 363 IIVDSGTAVTRLQTTVYNVLRDAFVKSTHDLQTARGVALFDTCYDLSSKSRVEVPTVSFH 422

Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G ++ L   N+ + V SE   C  F    +++ I GN  Q    VG+D+    V F 
Sbjct: 423 FANGNELPLPAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFS 482

Query: 368 PTDC 371
           P  C
Sbjct: 483 PNKC 486


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 105/402 (26%), Positives = 162/402 (40%), Gaps = 73/402 (18%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           V L+HR  P +P   S  T  +   D   RS  R ++  +   +S        ++  +  
Sbjct: 56  VPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLE 112

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R+S GTP   ++ V DTGSD+ W QC+PC   QC+ Q  PL+DP  SSTY ++PC+S
Sbjct: 113 YVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCAS 172

Query: 151 SQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C  L   +      SG  C +++SY DG+ + G  + + +TL         +    FG
Sbjct: 173 DVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFG 228

Query: 206 CGTNN------------------------GGLFN------SKTTGIVGLGGGD------- 228
           CG                           GG+F+      S   G + LG G        
Sbjct: 229 CGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPSVSSKPGFLALGAGKNPSGFVF 288

Query: 229 -----------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGTTLTFLPQGYNSNLLSV 274
                       S ++     + G +   R    +  +++DSGT +T L       L S 
Sbjct: 289 TPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMIVDSGTVITGLQSTAYRALRSA 348

Query: 275 MSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIV 331
               +EA  +  P G L+ CY+        VP++ + F  GA + L   N  +       
Sbjct: 349 FRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNGILVNG---- 403

Query: 332 CSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           C  F   G   S  + GN+ Q  F V +D       F+   C
Sbjct: 404 CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 445


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score =  124 bits (310), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/423 (25%), Positives = 168/423 (39%), Gaps = 73/423 (17%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           ++  +  YV    E       V L+HR  P +P   S  T  +   D   RS  R ++  
Sbjct: 1   MILHIYIYVSVKPEQNGSTVYVPLVHRHGPCAP-APSLSTDTRSFADIFRRSRARPSYIV 59

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           +   +S        ++  +  Y++R+S GTP   ++ V DTGSD+ W QC+PC   QC+ 
Sbjct: 60  RGKKVSVPAHLGTSVM--SLEYVVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFP 117

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLAT 184
           Q  PL+DP  SSTY ++PC+S  C  L   +      SG  C +++SY DG+ + G  + 
Sbjct: 118 QKDPLYDPSHSSTYSAVPCASDVCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQ 177

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNN------------------------GGLFN----- 215
           + +TL         +    FGCG                           GG+F+     
Sbjct: 178 DKLTL----APGAIVQNFYFGCGHGKHAVRGLFDGVLGLGRLRESLGARYGGVFSYCLPS 233

Query: 216 -SKTTGIVGLGGGD------------------ISLISQMRTTIAGNQ---RLGVSTPDIV 253
            S   G + LG G                    S ++     + G +   R    +  ++
Sbjct: 234 VSSKPGFLALGAGKNPSGFVFTPMGTVPGQPTFSTVTLAGINVGGKKLDLRPSAFSGGMI 293

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR 311
           +DSGT +T L       L S     +EA  +  P G L+ CY+        VP++ + F 
Sbjct: 294 VDSGTVITGLQSTAYRALRSAFRKAMEAYRLL-PNGDLDTCYNLTGYKNVVVPKIALTFT 352

Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
            GA + L   N  +       C  F   G   S  + GN+ Q  F V +D       F+ 
Sbjct: 353 GGATINLDVPNGILVNG----CLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRA 408

Query: 369 TDC 371
             C
Sbjct: 409 KAC 411


>gi|218200805|gb|EEC83232.1| hypothetical protein OsI_28526 [Oryza sativa Indica Group]
          Length = 450

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/331 (30%), Positives = 142/331 (42%), Gaps = 72/331 (21%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 125 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 182

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 183 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 237

Query: 211 GGLFNSKTT---------GIVGLGGGDISL---ISQMRTTIAGNQRLGVSTP-------- 250
            GL    +          G  G   G +SL    S  R     +    ++ P        
Sbjct: 238 RGLRRPGSAASSPTASPPGTSGDAAGSLSLGGDTSSYRNATPVSYTRMIADPAQPPFYFM 297

Query: 251 --------------------DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPT 288
                               ++++DSGT +T L P  Y +           E  P A P 
Sbjct: 298 NVTGASVGGAAVAAAGLGAANVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPF 357

Query: 289 GSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSED--IVCSVFKGIT--NS 341
             L+ CY+     +  VP +T+    GAD+ +  +       +D   VC     ++  + 
Sbjct: 358 SLLDACYNLTGHDEVKVPLLTLRLEAGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQ 417

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            PI GN  Q N  V YD     + F   DC+
Sbjct: 418 TPIIGNYQQKNKRVVYDTVGSRLGFADEDCS 448


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/174 (40%), Positives = 102/174 (58%), Gaps = 22/174 (12%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPP 124
           L H++  S+  SS    A +    A YL+ ++IGTPP   +A+ADTGSDL WTQC+PC  
Sbjct: 59  LLHYSTLST--SSDPGPARLRSGQAEYLMELAIGTPPVPFIALADTGSDLTWTQCKPC-- 114

Query: 125 SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNL 182
             C+ QD+P++D   SS++  LPCSS+ C  +    CS     C+Y  +Y DG++     
Sbjct: 115 KLCFGQDTPIYDTTTSSSFSPLPCSSATCLPIWSSRCSTPSATCRYRYAYDDGAY----- 169

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGL-FNSKTTGIVGLGGGDISLISQM 235
                   S     +++ GI FGCG +NGGL +NS  TG VGLG G +SL++Q+
Sbjct: 170 --------SPECAGISVGGIAFGCGVDNGGLSYNS--TGTVGLGRGSLSLVAQL 213



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 37/131 (28%), Positives = 63/131 (48%), Gaps = 11/131 (8%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCY-----SFNSLSQVPE 305
           +++DSGT  T L + G+   +  V    +  QPV + +     C+         L  +P+
Sbjct: 301 MIVDSGTIFTILVETGFRVVVDHVAG--VLGQPVVNASSLDRPCFPAPAAGVQELPDMPD 358

Query: 306 VTIHFRG-ADVKLSRSNF--FVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
           + +HF G AD++L R N+  F +       ++    + S  + GN  Q N  + +DI   
Sbjct: 359 MVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDITVG 418

Query: 363 TVSFKPTDCTK 373
            +SF PTDC+K
Sbjct: 419 QLSFMPTDCSK 429


>gi|297745479|emb|CBI40559.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 86/240 (35%), Positives = 119/240 (49%), Gaps = 39/240 (16%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q    SSS  S   +   +  Y  R+ +GTPP     V DTGSD++W QC PC   +CY 
Sbjct: 155 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 210

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q  P+FDPK S ++ S+ C S  C  L+   C S  +C Y V+YGDGSF+ G  +TET+T
Sbjct: 211 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 270

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL-----------GGGDISLI--SQM 235
                 +   +P +  GCG +N GLF     G++GL           GG  ++ I  S  
Sbjct: 271 F-----RGTRVPKVALGCGHDNEGLFVG-AAGLLGLGRQPRLNRPPVGGARVAGITASLF 324

Query: 236 RTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ---GYNSNLLSVMSSMIEAQ-----PVADP 287
           +   AGN         ++IDSGT++T L +   G +SN  S +SS          P A P
Sbjct: 325 KLDTAGNG-------GVIIDSGTSVTRLTRRAYGTSSNKGSGLSSTWRLHGSGSLPAAAP 377


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/167 (42%), Positives = 93/167 (55%), Gaps = 23/167 (13%)

Query: 90  NYLIRISIG----TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           NY+  IS+G    +P      + DTGSDL W QC+PC  S CY Q  PLFDP  S+TY +
Sbjct: 91  NYVTTISLGGSSGSPAANLTVIVDTGSDLTWVQCKPC--SACYAQRDPLFDPAGSATYAA 148

Query: 146 LPCSSSQCA-----------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           + C++S CA           S          C Y+++YGDGSFS G LAT+TV LG    
Sbjct: 149 VRCNASACADSLRAATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVALG---- 204

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
              +L G  FGCG +N GLF   T G++GLG  ++SL+SQ  +   G
Sbjct: 205 -GASLGGFVFGCGLSNRGLFGG-TAGLMGLGRTELSLVSQTASRYGG 249


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/195 (34%), Positives = 107/195 (54%), Gaps = 15/195 (7%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKA-----SQADIIPNNANYLIRISIGTPPTE 103
           T  + +R A+ RSL+R     ++   ++ +A     S+A ++P    YL+++  GTP   
Sbjct: 45  TDQELIRRAVQRSLDRPGIVARSGGGAADEAGKAVASEAPLVPGGGEYLVKLGTGTPQHF 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
             A  DT SDL+W QC+PC    CY Q  P+F+PK+SS+Y  +PC+S  CA L+   C  
Sbjct: 105 FSAAIDTASDLVWMQCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHE 162

Query: 164 VN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
            +   CQY+  Y     + G LA + + +G     AV      FGC  ++ G   ++ +G
Sbjct: 163 DDDGACQYTYKYSGHGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASG 217

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL+SQ+
Sbjct: 218 LVGLGRGPLSLVSQL 232


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 118/420 (28%), Positives = 174/420 (41%), Gaps = 85/420 (20%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-----ISSSKASQADI 84
           S+ L++R  P +P  +++ T      + L R   R NH  + +S     +  S  +    
Sbjct: 57  SMPLMYRHGPCAP-ASAAATNRPSPAEMLRRDRARRNHILRKASGRRITLGVSIPTSLGA 115

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
             ++  Y++ +  GTP   ++ + DTGSDL W QC+PC  S CY Q  P+FDP  SSTY 
Sbjct: 116 FVDSLQYVVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYA 175

Query: 145 SLPCSSSQCASLN--------QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            +PC S  C  L+          S SG + CQY + YG+G  + G  +TET+TL  +   
Sbjct: 176 PVPCGSEACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEA 233

Query: 196 AVALPGITFGCGT----------------------------NNGGLFN------SKTTGI 221
           A  +   +FGCG                               GG F+      + T G 
Sbjct: 234 ATVVNNFSFGCGLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYCLPAGNSTAGF 293

Query: 222 VGLG----GGD----------------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGT 258
           + LG    GG+                  L+     ++ G Q      V    ++IDSGT
Sbjct: 294 LALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVFAGGMIIDSGT 353

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTG--SLELCYSF--NSLSQVPEVTIHFRGA- 313
            +T LP+   S L +   S + A P+  P     L+ CY F  N+   VP V + F G  
Sbjct: 354 IVTGLPETAYSALRTAFRSAMSAYPLLPPNDDEDLDTCYDFTGNTNVTVPTVALTFEGGV 413

Query: 314 --DVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             D+ +  S   +      V     G T    I GN+ Q  F V YD  +  V F+   C
Sbjct: 414 TIDLDVP-SGVLLDGCLAFVAGASDGDTG---IIGNVNQRTFEVLYDSARGHVGFRAGAC 469


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 127/440 (28%), Positives = 183/440 (41%), Gaps = 108/440 (24%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           GGFSVELIHRDS KSPF++   T + R   A  R           S +SS      D+  
Sbjct: 25  GGFSVELIHRDSIKSPFHDPKLTRHDRFL-AAARRSRARAAALLASDVSS------DLFY 77

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM----------------- 129
            +  YL  +++GTPP   LAVADTGSDL+W +C     +   +                 
Sbjct: 78  GDFEYLAAVNVGTPPVRFLAVADTGSDLVWLKCNTTQNNNGIVSSDSGNNSNSSPPPPPP 137

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASL-NQKSCSGVN--CQYSVSYGDGSFSNGNLATET 186
           +    F+P  SS+Y  + C    C +L    SC+G +  C +  SY DG+ + G LA +T
Sbjct: 138 EAVVYFNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAADT 197

Query: 187 VTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM---------- 235
            T  G+      +   I FGC T   G    +  G+VGLG G +SL SQ+          
Sbjct: 198 FTFGGNINNDTTSTASIDFGCATGTAGR-EFQADGMVGLGAGPLSLASQLGRKFSFCLTA 256

Query: 236 ------RTTIAGNQRLGVSTPD------------------IVIDS--------------- 256
                  + +    R  VS P                   I IDS               
Sbjct: 257 YDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYAISIDSLKVAGQPVPGTTSVS 316

Query: 257 ------GTTLTFLPQG-----YNSNLLSVM--SSMIEAQPVADPTGSLELCYSFNSLSQV 303
                 GT LTFL +         +L  VM  + +  A P   P  +LELCY  + +  V
Sbjct: 317 KVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPP---PDETLELCYDVSRVKDV 373

Query: 304 ----PEVTIHF---RGADVKLSRSNFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQT 351
               P+VT+      G +V+L+    FV V E ++C     +T S     + + GN+   
Sbjct: 374 DGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLC--LAVVTTSPELQPLSVLGNVALQ 431

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
           +  VG D++ +T +F   +C
Sbjct: 432 DLHVGIDLDARTATFATANC 451


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/229 (37%), Positives = 118/229 (51%), Gaps = 29/229 (12%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 126 GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 180

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 181 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 238

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 239 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 298

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+
Sbjct: 299 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTL 341



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 413 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 472

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L  S   +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 473 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 527

Query: 368 PTDC 371
              C
Sbjct: 528 AGAC 531


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 105/353 (29%), Positives = 147/353 (41%), Gaps = 80/353 (22%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            +++ +  GTP      + DTGSD+ W QC PC    CY Q  P+FDP  S+TY ++PC 
Sbjct: 119 EFVVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCS-GHCYKQHDPIFDPTKSATYSAVPCG 177

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
             QCA+   K  S   C Y V YGDGS + G L+ ET++L S    A ALPG  FGCG  
Sbjct: 178 HPQCAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLTS----ARALPGFAFGCGET 233

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRT--------------TIAGNQRLGVSTP----- 250
           N G F     G++GLG G +SL SQ                 T  G   +G +TP     
Sbjct: 234 NLGDFG-DVDGLIGLGRGQLSLSSQAAASFGAAFSYCLPSYNTSHGYLTIGTTTPASGSD 292

Query: 251 -------------------DIV----------------------IDSGTTLTFL-PQGYN 268
                              D+V                      +DSGT LT+L P+ Y 
Sbjct: 293 GVRYTAMIQKQDYPSFYFVDLVSIVVGGFVLPVPPILFTRDGTLLDSGTVLTYLPPEAYT 352

Query: 269 S--NLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGAD-VKLSRSNFFVK 325
           +  +      +  +  P  DP    + CY F   + +    + F+ +D      S F V 
Sbjct: 353 ALRDRFKFTMTQYKPAPAYDP---FDTCYDFAGQNAIFMPLVSFKFSDGSSFDLSPFGVL 409

Query: 326 VSEDIV-----CSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +  D       C  F    +++P  I GN  Q N  + YD+  + + F    C
Sbjct: 410 IFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEKIGFVSGSC 462


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/245 (36%), Positives = 128/245 (52%), Gaps = 30/245 (12%)

Query: 10  ILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN 69
           +L  L   +++   A      +  IH D    P   +SE     +R AL R ++R   F 
Sbjct: 6   VLLILACTILASDAAAAVRVGLTRIHAD----PEVTASEF----VRGALRRDMHRHARFA 57

Query: 70  QNSSISSSKAS---------QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           +     SS A+         Q D+  N   Y++ +SIGTPP    A+ADTGSDLIWTQC 
Sbjct: 58  REQLAPSSAAAAGLTVGAPTQKDLR-NGGEYIMTLSIGTPPLSYRAIADTGSDLIWTQCA 116

Query: 121 PCPPS------QCYMQDSPLFDPKMSSTYKSLPCSS--SQCASLNQKS-CSGVNCQYSVS 171
           PC  +      QC+ Q   L++P  S+T+  LPC+S  S CA++   S   G  C Y+ +
Sbjct: 117 PCGDTVTDTDNQCFKQSGCLYNPSSSTTFGVLPCNSPLSMCAAMAGPSPPPGCACMYNQT 176

Query: 172 YGDGSFSNGNLATETVTLG-STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
           YG G ++ G  + ET T G S+T  AV +P I FGC   +   +N  + G+VGLG G +S
Sbjct: 177 YGTG-WTAGVQSVETFTFGSSSTPPAVRVPNIAFGCSNASSNDWNG-SAGLVGLGRGSMS 234

Query: 231 LISQM 235
           L+SQ+
Sbjct: 235 LVSQL 239



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 41/133 (30%), Positives = 71/133 (53%), Gaps = 10/133 (7%)

Query: 249 TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVA---DPTGSLELCYSFNSLS--- 301
           T  ++IDSGTT+T L    Y     +V S ++   P+A   D +  L+LC++  + +   
Sbjct: 322 TGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKASTPPP 381

Query: 302 QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDI 359
            +P +T+HF G AD+ L   N+ + +   + C   +  T  ++ + GN  Q N  V YD+
Sbjct: 382 AMPSMTLHFEGGADMVLPVENYMI-LGSGVWCLAMRNQTVGAMSMVGNYQQQNIHVLYDV 440

Query: 360 EQQTVSFKPTDCT 372
            ++T+SF P  C+
Sbjct: 441 RKETLSFAPAVCS 453


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 118/230 (51%), Gaps = 29/230 (12%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 56  GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSAACAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+ 
Sbjct: 229 SS-----AVKSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 272



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 60/124 (48%), Gaps = 10/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L  S   +       C  F   ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 368 PTDC 371
              C
Sbjct: 458 AGAC 461


>gi|297811181|ref|XP_002873474.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319311|gb|EFH49733.1| hypothetical protein ARALYDRAFT_909035 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 293

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 80/190 (42%), Positives = 105/190 (55%), Gaps = 12/190 (6%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQA----DIIPNNANYLIRISIGTPPTERLAVA 108
           R  +A   S++     N    +S +K+++      II  + NY++ I IGTP  +   + 
Sbjct: 92  RRDEARVESIHSKLSKNIADEVSKAKSTKLPAKNGIILGSPNYIVTIGIGTPKHDISLMF 151

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDL WTQCEPC  S CY Q  P F+P  SS+Y ++ CSS  C   N +SCS  NC Y
Sbjct: 152 DTGSDLTWTQCEPCLGS-CYSQKEPKFNPSSSSSYHNVSCSSPMCG--NPESCSASNCLY 208

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
            + YGDGS + G LA E  TL ++      L  I FGCG NN G+F   + GI+GLG G 
Sbjct: 209 GIGYGDGSVTVGFLAKEKFTLTNSD----VLDDIYFGCGENNKGVFIG-SAGILGLGPGK 263

Query: 229 ISLISQMRTT 238
            S   Q  TT
Sbjct: 264 FSFPLQTTTT 273


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 123/435 (28%), Positives = 186/435 (42%), Gaps = 98/435 (22%)

Query: 23  EAQTGGFSVELIHRD--SPKSPFYNSSETPYQRLRDALTRSL-NRL-------NHFNQNS 72
             + G   +E+  R   S +   +N          D   RS+ NR+       N   Q+S
Sbjct: 57  RKEKGAIVLEMKDRGYCSERKINWNRKLQKQLIFDDLRVRSMQNRIRAKVSGHNSSEQSS 116

Query: 73  SISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            I    AS  ++     NY++ I +G        + DTGSDL W QC+PC    CY Q  
Sbjct: 117 EIQIPLASGINL--ETLNYIVTIGLGNQ--NMTVIIDTGSDLTWVQCDPCMS--CYSQQG 170

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN---CQYSVSYGDGSFSNGNLAT 184
           P+F+P  SS+Y SL C+SS C +L     N ++C   N   C ++VSYGDGSF++G L  
Sbjct: 171 PVFNPSNSSSYNSLLCNSSTCQNLQFTTGNTEACESNNPSSCNHTVSYGDGSFTDGELGV 230

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR-------- 236
           E ++ G      +++    FGCG NN GLF    +GI+GLG  ++S+ISQ          
Sbjct: 231 EHLSFG-----GISVSNFVFGCGRNNKGLFGG-VSGIMGLGRSNLSMISQTNTTFGGVFS 284

Query: 237 -------------------------------TTIAGNQRL--------------GVSTPD 251
                                          T++  N +L              GV+  D
Sbjct: 285 YCLPTTDSGASGSLVIGNESSLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAIQD 344

Query: 252 -------IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV 303
                  I+IDSGT +T L P  YN+ L +         P+A     L+ C++   + +V
Sbjct: 345 TSFGNGGILIDSGTVITRLAPSLYNA-LKAEFLKQFSGYPIAPALSILDTCFNLTGIEEV 403

Query: 304 --PEVTIHFR-GADVKLSRSN-FFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGY 357
             P +++HF    D+ +      ++      VC     ++  N + I GN  Q N  V Y
Sbjct: 404 SIPTLSMHFENNVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIY 463

Query: 358 DIEQQTVSFKPTDCT 372
           D +Q  + F   DC+
Sbjct: 464 DAKQSKIGFAREDCS 478


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 76/171 (44%), Positives = 96/171 (56%), Gaps = 7/171 (4%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I +G+PP +  A+ DTGSDL+W QC+PC  SQCY Q  P++DP  SST+    CS+
Sbjct: 4   YTMEIELGSPPKKFNAIVDTGSDLVWIQCKPC--SQCYSQSDPIYDPSASSTFAKTSCST 61

Query: 151 SQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           S C SL    CS     C Y   YGD S + G+ A ET+TL S+ G + A P   FGCG 
Sbjct: 62  SSCQSLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGR 121

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTT 259
            N G F     GIVGLG G ISL +Q+ + I  N +      D   DS  T
Sbjct: 122 LNSGSFGG-AAGIVGLGQGKISLSTQLGSAI--NNKFSYCLVDFDDDSSKT 169



 Score = 56.2 bits (134), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 40/134 (29%), Positives = 65/134 (48%), Gaps = 5/134 (3%)

Query: 243 QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS- 301
           + L V++   + DSGTTLT L     S + S  +S +    V   +   +LCY  +    
Sbjct: 234 RALEVNSGGTIFDSGTTLTLLDDAVYSKVKSAFASSVSLPTVDASSSGFDLCYDVSKSKN 293

Query: 302 -QVPEVTIHFRGADVKLSRSNFFVKV--SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGY 357
            + P +T+ F+G      + N+FV V  +E + C ++    +  + I GN+MQ N+ V Y
Sbjct: 294 FKFPALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVY 353

Query: 358 DIEQQTVSFKPTDC 371
           D    T+S  P  C
Sbjct: 354 DRGTSTISMSPAQC 367


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 115/423 (27%), Positives = 176/423 (41%), Gaps = 100/423 (23%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  R  +       S+ S+   +A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARFAYIQARIEGSLVSNNEYKARVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               +  ISIG PP  +L V DTGSD++W  C PC  + C      LFDP MSST+  L 
Sbjct: 98  GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNHLGLLFDPSMSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C+      C  +   ++V+Y D S ++G    +TV   +T      +P + F
Sbjct: 156 KTPCDFKGCS-----RCDPI--PFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLF 208

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------ 234
           GCG N G   +    GI+GL  G  SL ++                              
Sbjct: 209 GCGHNIGQDTDPGHNGILGLNNGPDSLATKIGQKFSYCIGDLADPYYNYHQLILGEGADL 268

Query: 235 -----------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
                            M     G +RL ++          T  ++ID+G+T+TFL    
Sbjct: 269 EGYSTPFEVHNGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTITFLVDSV 328

Query: 268 N-------SNLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHFR-GA 313
           +        NLL  S   + IE  P          C+ + S+S+     P VT HF  GA
Sbjct: 329 HRLLSKEVRNLLGWSFRQTTIEKSPWMQ-------CF-YGSISRDLVGFPVVTFHFADGA 380

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKGIT----NSVP-IYGNIMQTNFLVGYDIEQQTVSFKP 368
           D+ L   +FF ++++++ C     ++     S P + G + Q ++ VGYD+  Q V F+ 
Sbjct: 381 DLALDSGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQR 440

Query: 369 TDC 371
            DC
Sbjct: 441 IDC 443


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 70/207 (33%), Positives = 101/207 (48%), Gaps = 31/207 (14%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           FS++L  RDS     +N+    Y+ L    L+R  +R+         + S+  ++D+ P 
Sbjct: 76  FSLQLHPRDS----LHNAGHKDYKSLVLSRLSRDSSRVKSIYDRLEFALSELKRSDLEPL 131

Query: 87  -------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
                               +  Y  R+ +G P      V DTGSD+ W QC+PC  + C
Sbjct: 132 KTEILPEDLSTPIISGTSQGSGEYFSRVGVGQPAKPFYMVLDTGSDINWLQCQPC--TDC 189

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           Y Q  P+FDP+ SS++ SLPC S QC +L    C    C Y VSYGDGSF+ G    ET+
Sbjct: 190 YQQTDPIFDPRSSSSFASLPCESQQCQALETSGCRASKCLYQVSYGDGSFTVGEFVIETL 249

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLF 214
           T G++      +  +  GCG +N GLF
Sbjct: 250 TFGNSG----MINNVAVGCGHDNEGLF 272



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 41/128 (32%), Positives = 59/128 (46%), Gaps = 12/128 (9%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLEL---CYSFNSLSQV--PE 305
           I++DSGT +T L  Q YN+    +  + +   P    T    L   CY  +S S+V  P 
Sbjct: 368 IIVDSGTAITRLQTQAYNT----LRDAFVSRTPYLKKTNGFALFDTCYDLSSQSRVTIPT 423

Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V+  F G   ++L   N+ + V S    C  F   T+S+ I GN+ Q    V YD+    
Sbjct: 424 VSFEFAGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSV 483

Query: 364 VSFKPTDC 371
           V F P  C
Sbjct: 484 VGFSPHKC 491


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 67/196 (34%), Positives = 99/196 (50%), Gaps = 27/196 (13%)

Query: 40  KSPFYNSSETPYQRLRDA-LTRSLNRLNHFNQNSSISSSKASQADIIP------------ 86
           ++  + SS   Y+ L  A L R  +R+        ++ +  +++D+ P            
Sbjct: 83  RTSIHKSSHKDYKSLVLARLERDSDRVRSLATRMDLAIAGITKSDLKPVEKELEAEALET 142

Query: 87  --------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
                    +  Y  R+ IG+PP     V DTGSD+ W QC PC  + CY Q  P+F+P 
Sbjct: 143 PLVSGASQGSGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPS 200

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            SS+Y  L C + QC SL+   C   +C Y VSYGDGS++ G+ ATET+TL  +     +
Sbjct: 201 FSSSYAPLTCETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDGS----AS 256

Query: 199 LPGITFGCGTNNGGLF 214
           L  +  GCG +N GLF
Sbjct: 257 LNNVAIGCGHDNEGLF 272



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 55/124 (44%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGT +T L     ++L        +  P        + CY  +S S  +VP V+ H
Sbjct: 368 IIVDSGTAVTRLQSDVYNSLRDSFVRGTQHLPSTSGVALFDTCYDLSSRSSVEVPTVSFH 427

Query: 310 F-RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  G  + L   N+ + V S    C  F   T+++ I GN+ Q    V YD+    V F 
Sbjct: 428 FPDGKYLALPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFS 487

Query: 368 PTDC 371
           P  C
Sbjct: 488 PNGC 491


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 118/230 (51%), Gaps = 29/230 (12%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-- 84
           G  +V L HR  P SP   + + P   L + L R   R  +  +    S    +  D+  
Sbjct: 56  GAATVPLHHRHGPCSPL-PTKKMP--TLEETLHRDQLRAAYIQRK--FSGGGGAGGDVQR 110

Query: 85  ----IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
               +P       N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  P
Sbjct: 111 SDATVPTALGTSLNTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADP 168

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           LFDP  SSTY    C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LG
Sbjct: 169 LFDPSSSSTYSPFSCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALG 228

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           S+     A+    FGC     G FN +T G++GLGGG  SL+SQ   T+ 
Sbjct: 229 SS-----AVRSFQFGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 272



 Score = 58.5 bits (140), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 343 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 402

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L  S   +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 403 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 457

Query: 368 PTDC 371
              C
Sbjct: 458 AGAC 461


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 83/374 (22%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 199

Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            +Y+++ C   +C  +      ++C   +   C Y   YGD S + G+LA E  T+  T 
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 194 -GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG----------- 225
            G +  +  + FGCG +N GLF+                S+   + G             
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSS 319

Query: 226 -------GGDISLISQ-----------------------MRTTIAGNQRLGV--STPDI- 252
                  G D +L+                         ++  + G ++L +  ST D+ 
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379

Query: 253 -------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
                  +IDSGTTL++  +  Y     + +  M +A P+      L  CY+ + +   +
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE 439

Query: 303 VPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDI 359
           VPE ++ F  GA       N+FV++  D I+C    G   S + I GN  Q NF V YD+
Sbjct: 440 VPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDL 499

Query: 360 EQQTVSFKPTDCTK 373
           +   + F P  C +
Sbjct: 500 QNNRLGFAPRRCAE 513


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 162/359 (45%), Gaps = 82/359 (22%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +G+P      + DTGS   W QC+PC    C++Q+ P+F+P  S TYK++P
Sbjct: 100 SGNYYVKMGLGSPTKYYTMIVDTGSSFSWLQCQPCT-IYCHIQEDPVFNPSASKTYKTVP 158

Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CSSSQC     A+LN+ +CS  +  C Y  SYGD SFS G L+ + +TL  T  Q   L 
Sbjct: 159 CSSSQCSSLKSATLNEPTCSKQSNACVYKASYGDSSFSLGYLSQDVLTL--TPSQT--LS 214

Query: 201 GITFGCGTNNGGLF---------------------------------------NSKTTGI 221
              +GCG +N GLF                                       NS   G 
Sbjct: 215 SFVYGCGQDNQGLFGRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSPKEGF 274

Query: 222 VGLGGGDIS--------------------LISQMRTTIAGNQRLGVSTPDI----VIDSG 257
           + +G   ++                     I     T+AG + LGV+        +IDSG
Sbjct: 275 LSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAG-RPLGVAASSYKVPTIIDSG 333

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQV-PEVTIHFRG- 312
           T +T LP    + L +   +++  +    P  SL + C+  S   +S+V P++ I F+G 
Sbjct: 334 TVITRLPTPVYTTLKNAYVTILSKKYQQAPGISLLDTCFKGSLAGISEVAPDIRIIFKGG 393

Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           AD++L   N  V++   I C    G ++S+ I GN  Q    V YD+    V F P  C
Sbjct: 394 ADLQLKGHNSLVELETGITCLAMAG-SSSIAIIGNYQQQTVKVAYDVGNSRVGFAPGGC 451


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/234 (33%), Positives = 114/234 (48%), Gaps = 29/234 (12%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF--------NQNSSISSSKASQ 81
           SV L+HR  P +P   S   P   L + L R   R N+            +++S +    
Sbjct: 44  SVPLVHRHGPCAPSAASGGKP--SLAERLRRDRARANYIVTKAAGGRTAATAVSDAVGGG 101

Query: 82  ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              IP       ++  Y++ + IGTP  +++ + DTGSDL W QC+PC   +CY Q  PL
Sbjct: 102 GTSIPTFLGDSVDSLEYVVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPL 161

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQ-------KSCSGVNCQYSVSYGDGSFSNGNLATETV 187
           FDP  SS+Y S+PC S  C  L          S +   C+Y + YG+ + + G  +TET+
Sbjct: 162 FDPSSSSSYASVPCDSDACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETL 221

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           TL       V +    FGCG +  G +  K  G++GLGG   SL+SQ  +   G
Sbjct: 222 TL----KPGVVVADFGFGCGDHQHGPYE-KFDGLLGLGGAPESLVSQTSSQFGG 270



 Score = 45.1 bits (105), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 57/125 (45%), Gaps = 7/125 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQV--PEVT 307
           +VIDSGT +T LP    + L S   S +    +  P+    L+ CY F   + V  P + 
Sbjct: 343 MVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNGAVLDTCYDFTGHTNVTVPTIA 402

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           + F G A + L+     +   +  +     G  +++ I GN+ Q  F V YD  + TV F
Sbjct: 403 LTFSGGATIDLATPAGVLV--DGCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSGKGTVGF 460

Query: 367 KPTDC 371
           +   C
Sbjct: 461 RAGAC 465


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 99/374 (26%), Positives = 160/374 (42%), Gaps = 83/374 (22%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 142 ESGVAVGSGEYLVDLYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPATS 199

Query: 141 STYKSLPCSSSQCASL----NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTT 193
            +Y+++ C   +C  +      ++C   +   C Y   YGD S + G+LA E  T+  T 
Sbjct: 200 LSYRNVTCGDPRCGLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTA 259

Query: 194 -GQAVALPGITFGCGTNNGGLFN----------------SKTTGIVGLG----------- 225
            G +  +  + FGCG +N GLF+                S+   + G             
Sbjct: 260 PGASRRVDDVVFGCGHSNRGLFHGAAGLLGLGRGALSFASQLRAVYGHAFSYCLVDHGSS 319

Query: 226 -------GGDISLISQ-----------------------MRTTIAGNQRLGV--STPDI- 252
                  G D +L+                         ++  + G ++L +  ST D+ 
Sbjct: 320 VGSKIVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVG 379

Query: 253 -------VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
                  +IDSGTTL++  +  Y     + +  M +A P+      L  CY+ + +   +
Sbjct: 380 KDGSGGTIIDSGTTLSYFAEPAYEVIRRAFVERMDKAYPLVADFPVLSPCYNVSGVERVE 439

Query: 303 VPEVTIHFR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDI 359
           VPE ++ F  GA       N+FV++  D I+C    G   S + I GN  Q NF V YD+
Sbjct: 440 VPEFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDL 499

Query: 360 EQQTVSFKPTDCTK 373
           +   + F P  C +
Sbjct: 500 QNNRLGFAPRRCAE 513


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 126/459 (27%), Positives = 187/459 (40%), Gaps = 112/459 (24%)

Query: 14  LCFYVVSPIEAQT------GGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNH 67
           L FY+ + I + T         + +LIHR+S   P Y+ +ET   R +   T S+ R + 
Sbjct: 17  LAFYLSTAIISSTLITTKPSRLATKLIHRNSYLHPLYDQNETVEDRSKREQTSSIERFDF 76

Query: 68  FNQNSSISSSKA----SQADIIPNN--ANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
               S I   K+    +++ +IP N  + +L+ +SIG+PP  +L V DTGS L+W QC P
Sbjct: 77  LE--SKIKELKSVGNEARSSLIPFNRGSGFLVNLSIGSPPVTQLVVVDTGSSLLWVQCLP 134

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNG 180
           C    C+ Q +  FDP  S ++K+L C       +N   C+  N  +Y + Y  G  S G
Sbjct: 135 CI--NCFQQSTSWFDPLKSVSFKTLGCGFPGYNYINGYKCNRFNQAEYKLRYLGGDSSQG 192

Query: 181 NLATETVTLG-------------STTGQAVALPGITFGCG-----TNNGGLFNSKTTGIV 222
            LA E++                ST    +    ITFGCG     TNN   +N    G+ 
Sbjct: 193 ILAKESLLFETLDEGRVFQYNAISTQISKIKKSNITFGCGHMNIKTNNDDAYN----GVF 248

Query: 223 GLGG------------------GDIS--------LISQMRTTIAGN-------------- 242
           GLG                   GDI+        L+    + I G+              
Sbjct: 249 GLGAYPHITMATQLGNKFSYCIGDINNPLYTHNHLVLGQGSYIEGDSTPLQIHFGHYYVT 308

Query: 243 -QRLGVSTPDIVI-----------------DSGTTLTFLPQG----YNSNLLSVMSSMIE 280
            Q + V +  + I                 DSG T T L  G        ++ +M  ++E
Sbjct: 309 LQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGLLE 368

Query: 281 AQPVADPTGSLELCYS---FNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFK 336
             P         LC+       L   P VT HF  GAD+ L   + F +   D  C    
Sbjct: 369 RIPTQRKFEG--LCFKGVVSRDLVGFPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAIL 426

Query: 337 GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             +NS    + + G + Q N+ VG+D+EQ  V F+  DC
Sbjct: 427 P-SNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 464


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 113/419 (26%), Positives = 167/419 (39%), Gaps = 88/419 (21%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN---------QNSSISSSKAS 80
           SV L HR+ P SP     E P   +   L R   R  +           Q+++ + S  +
Sbjct: 62  SVPLAHRNGPCSPVRGKGELPRAEM---LRRDRERTEYIIRRASRSRRLQDNNDAVSVPT 118

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q     ++  Y+  + +GTP   +  + DTGS L W QC+PC  SQCY Q  PLFDP  S
Sbjct: 119 QLGSSYDSQEYVATVGLGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTS 178

Query: 141 STYKSLPCSSSQC----ASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           S+Y  +PC S +C    A ++   C+      C Y + YG G+   G  +T+ +TLG   
Sbjct: 179 SSYSPVPCDSQECRALAAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALTLGP-- 236

Query: 194 GQAVALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN------QRLG 246
                +    FGCG +   G F+    G++GLG    SL  Q      G          G
Sbjct: 237 --GAIVKRFHFGCGHHQQRGKFD-MADGVLGLGRLPQSLAWQASARRGGGVFSHCLPPTG 293

Query: 247 VS-----------------TPDIVID---------------------------------- 255
           VS                 TP + +D                                  
Sbjct: 294 VSTGFLALGAPHDTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVFREGVITD 353

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-G 312
           SGT L+ L +   + L +   S +   P+A P G L+ C++F       VP V++ FR G
Sbjct: 354 SGTVLSALQETAYTALRTAFRSAMAEYPLAPPVGHLDTCFNFTGYDNVTVPTVSLTFRGG 413

Query: 313 ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           A V L  S+    V  D   + +        + G++ Q    V YD+  + V F+   C
Sbjct: 414 ATVHLDASS---GVLMDGCLAFWSSGDEYTGLIGSVSQRTIEVLYDMPGRKVGFRTGAC 469


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 57/127 (44%), Positives = 78/127 (61%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG PP++   + DTGSD+ W QC PC  + CY Q  P+F+P  S+++ +L 
Sbjct: 146 SGEYFSRVGIGKPPSQAYLILDTGSDVNWVQCAPC--ADCYQQADPIFEPASSASFSTLS 203

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C++ QC SL+   C    C Y VSYGDGS++ G+  TET+TLGS     VA+     GCG
Sbjct: 204 CNTRQCRSLDVSECRNDTCLYEVSYGDGSYTVGDFVTETITLGSAPVDNVAI-----GCG 258

Query: 208 TNNGGLF 214
            NN GLF
Sbjct: 259 HNNEGLF 265



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 45/157 (28%), Positives = 68/157 (43%), Gaps = 10/157 (6%)

Query: 220 GIVGLG-GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSM 278
           G+ GL  GG++  I +    I  +   GV     ++DSGT +T L     ++L       
Sbjct: 333 GLTGLSVGGELVSIPESAFQIDESGNGGV-----IVDSGTAITRLQTDVYNSLRDAFVKR 387

Query: 279 IEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKV-SEDIVCSV 334
               P  +     + CY  +S    +VP V+ HF  G ++ L   N+ V + SE   C  
Sbjct: 388 TRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPLPAKNYLVPLDSEGTFCFA 447

Query: 335 FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           F    +S+ I GN+ Q    V YD+    V F P  C
Sbjct: 448 FAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/150 (42%), Positives = 90/150 (60%), Gaps = 11/150 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y   + +GTPPT  L V DTGSD++W QC+PC    CY Q SPL+DP+ SSTY   P
Sbjct: 96  SGEYFASVGVGTPPTPALLVIDTGSDVVWLQCKPC--VHCYRQLSPLYDPRGSSTYAQTP 153

Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CS  QC   N ++C G    C Y + YGD S ++GNLAT+ +   + T    ++  +T G
Sbjct: 154 CSPPQCR--NPQTCDGTTGGCGYRIVYGDASSTSGNLATDRLVFSNDT----SVGNVTLG 207

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           CG +N GLF S   G++G+  G+ S  +Q+
Sbjct: 208 CGHDNEGLFGS-AAGLLGVARGNNSFATQV 236



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 8/128 (6%)

Query: 252 IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN--SLSQVPEV 306
           +V+DSGT++T F    Y +  +     ++ +  + V       + CY     +++  P V
Sbjct: 321 VVVDSGTSITRFARDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGV 380

Query: 307 TIHFRG-ADVKLSRSNFFV-KVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQT 363
            +HF G ADV L   N+ V + S    C   +    + + + GN++Q  F V +D+E + 
Sbjct: 381 VLHFAGGADVALPPENYLVPEESGRYHCFALEAAGHDGLSVIGNVLQQRFRVVFDVENER 440

Query: 364 VSFKPTDC 371
           V F+P  C
Sbjct: 441 VGFEPNGC 448


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 66/190 (34%), Positives = 105/190 (55%), Gaps = 13/190 (6%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVA 108
           T ++ +R A+ RSL+R     +N     +   +A ++P    YL+++ IGTP     A  
Sbjct: 49  TDHELIRRAVQRSLDRPGVAARNRK---AVVGEAPLVPRGGEYLVKLGIGTPQHYFSAAI 105

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
           DT SDL+W QC+PC    CY Q  P+F+P++SS+Y  +PCSS  C+ L+   C   +   
Sbjct: 106 DTASDLVWLQCQPC--VSCYRQLDPIFNPRLSSSYAVVPCSSDTCSQLDGHRCDEDDDQA 163

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C+Y+  Y   + +NG LA + + +G     AV L     GC  ++ G    + +G+VGL 
Sbjct: 164 CRYNYKYSGNAVTNGTLAIDKLAVGGNVFHAVVL-----GCSDSSVGGPPPQASGLVGLA 218

Query: 226 GGDISLISQM 235
            G +SL+SQ+
Sbjct: 219 RGPLSLLSQL 228


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 142/347 (40%), Gaps = 77/347 (22%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y++ +S+GTP   +    DTGSD+ W QC+PC    C  Q   LFDP  SSTY ++PC 
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
           +  C+ L   +  CSG  C Y VSYGDGS + G   ++T+ L  G+T G         FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLG----- 246
           CG    G+F +   G++ LG   +SL SQ               + + AG   LG     
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPTSA 314

Query: 247 -------------------------------VSTP------DIVIDSGTTLTFLPQGYNS 269
                                          V+ P        V+D+GT +T LP    +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374

Query: 270 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 324
            L S     I     P A   G L+ CY F+    V  P V + F  GA + L       
Sbjct: 375 ALRSAFRGAIAPYGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432

Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            +S   +     G      I GN+ Q +F V +D    TV F P  C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|125571841|gb|EAZ13356.1| hypothetical protein OsJ_03278 [Oryza sativa Japonica Group]
          Length = 447

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 75/202 (37%), Positives = 108/202 (53%), Gaps = 23/202 (11%)

Query: 31  VELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP- 86
           V + HRD+  P  P         QRL     R  + ++   + +S + S        IP 
Sbjct: 29  VPVFHRDALFPPPPGAKRGSLLRQRLAADAARYASLVDATGRLHSPVFSG-------IPF 81

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y   + +GTP T+ + V DTGSDL+W QC PC   +CY Q   +FDP+ SSTY+ +
Sbjct: 82  ESGEYFALVGVGTPSTKAMLVIDTGSDLVWLQCSPC--RRCYAQRGQVFDPRRSSTYRRV 139

Query: 147 PCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PCSS QC +L    C     +G  C+Y V+YGDGS S G+LAT+ +   + T     +  
Sbjct: 140 PCSSPQCRALRFPGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAFANDT----YVNN 195

Query: 202 ITFGCGTNNGGLFNSKTTGIVG 223
           +T GCG +N GLF+S   G++G
Sbjct: 196 VTLGCGRDNEGLFDS-AAGLLG 216



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/92 (26%), Positives = 43/92 (46%), Gaps = 10/92 (10%)

Query: 291 LELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFV-------KVSEDIVCSVFKGITN 340
            + CY       +  P + +HF G AD+ L   N+F+       + +    C  F+   +
Sbjct: 355 FDACYDLRGRPAASAPLIVLHFAGGADMALPPENYFLPVDGGRRRAASYRRCLGFEAADD 414

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            + + GN+ Q  F V +D+E++ + F P  CT
Sbjct: 415 GLSVIGNVQQQGFRVVFDVEKERIGFAPKGCT 446


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 104/347 (29%), Positives = 142/347 (40%), Gaps = 77/347 (22%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y++ +S+GTP   +    DTGSD+ W QC+PC    C  Q   LFDP  SSTY ++PC 
Sbjct: 142 QYVVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCG 201

Query: 150 SSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTL--GSTTGQAVALPGITFG 205
           +  C+ L   +  CSG  C Y VSYGDGS + G   ++T+ L  G+T G         FG
Sbjct: 202 ADACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLALAPGNTVGT------FLFG 255

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQM--------------RTTIAGNQRLG----- 246
           CG    G+F +   G++ LG   +SL SQ               + + AG   LG     
Sbjct: 256 CGHAQAGMF-AGIDGLLALGRQSMSLKSQAAGAYGGVFSYCLPSKQSAAGYLTLGGPSSA 314

Query: 247 -------------------------------VSTP------DIVIDSGTTLTFLPQGYNS 269
                                          V+ P        V+D+GT +T LP    +
Sbjct: 315 SGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAFAGGTVVDTGTVITRLPPTAYA 374

Query: 270 NLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFV 324
            L S     I     P A   G L+ CY F+    V  P V + F  GA + L       
Sbjct: 375 ALRSAFRGAIAPCGYPSAPANGILDTCYDFSRYGVVTLPTVALTFSGGATLALEAPGI-- 432

Query: 325 KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            +S   +     G      I GN+ Q +F V +D    TV F P  C
Sbjct: 433 -LSSGCLAFAPNGGDGDAAILGNVQQRSFAVRFD--GSTVGFMPGAC 476


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  121 bits (303), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 68/198 (34%), Positives = 109/198 (55%), Gaps = 18/198 (9%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSK----ASQADIIPNNANYLIRISIGTPPTER 104
           T ++ LR A+ RS +RL         +SS+     ++A ++     YL+++ +GTP    
Sbjct: 42  TDHELLRRAIQRSRDRLASIAPRLLPTSSRNKVVVAEAPVLSAGGEYLVKLGLGTPQHCF 101

Query: 105 LAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV 164
            A  DT SDLIWTQC+PC   +CY Q  P+F+P  S++Y  +PC+S  C  L+   C+  
Sbjct: 102 TAAIDTASDLIWTQCQPC--VKCYKQLDPVFNPVASTSYAVVPCNSDTCDELDTHRCARD 159

Query: 165 N-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSK 217
                   CQY+ SYG  + + G LA + + +G    +     G+ FGC +++ G    +
Sbjct: 160 GDSDDEDACQYTYSYGGNATTRGILAVDRLAIGDDVFR-----GVVFGCSSSSVGGPPPQ 214

Query: 218 TTGIVGLGGGDISLISQM 235
            +G+VGLG G +SL+SQ+
Sbjct: 215 VSGVVGLGRGALSLVSQL 232



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/125 (25%), Positives = 59/125 (47%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS---LSQV--PEV 306
           ++ID  +T+TFL +     ++  +   I     +     L+LC+       +S+V  P V
Sbjct: 339 MIIDIASTITFLEESLYEEMVDDLEEEIRLPRGSGSDLGLDLCFILPEGVPMSRVYAPPV 398

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           ++ F G  ++L +   FV+     +  +  G T+ V I GN  Q N  V Y++ +  ++F
Sbjct: 399 SLAFEGVWLRLDKEQMFVEDRASGMMCLMVGKTDGVSILGNYQQQNMQVMYNLRRGRITF 458

Query: 367 KPTDC 371
             T C
Sbjct: 459 IKTAC 463


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  121 bits (303), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 64/152 (42%), Positives = 89/152 (58%), Gaps = 10/152 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           ++  Y   I++G PPT  L V DTGSDLIW QC PC    CY Q +PL+DP+ SST++ +
Sbjct: 84  DSGEYFAVINVGDPPTRALVVIDTGSDLIWLQCVPC--RHCYRQVTPLYDPRSSSTHRRI 141

Query: 147 PCSSSQCAS-LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC+S +C   L    C      C Y V YGDGS S+G+LAT+ +     T     +  +T
Sbjct: 142 PCASPRCRDVLRYPGCDARTGGCVYMVVYGDGSASSGDLATDRLVFPDDT----HVHNVT 197

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            GCG +N GL  S   G++G+G G +S  +Q+
Sbjct: 198 LGCGHDNVGLLES-AAGLLGVGRGQLSFPTQL 228



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/136 (22%), Positives = 60/136 (44%), Gaps = 15/136 (11%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA----QPVADPTGSLELCYSFN------SLS 301
           IV+DSGT ++   +   + +     S   A    + +A      + CY         +  
Sbjct: 313 IVVDSGTAISRFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAV 372

Query: 302 QVPEVTIHFRG-ADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVG 356
           +VP + +HF G AD+ L ++N+ + V         C   +   + + + GN+ Q  F + 
Sbjct: 373 RVPSIVLHFAGGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLV 432

Query: 357 YDIEQQTVSFKPTDCT 372
           +D+E+  + F P  C+
Sbjct: 433 FDVERGRIGFTPNGCS 448


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 65/157 (41%), Positives = 93/157 (59%), Gaps = 13/157 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++ +GTPP     + DTGS L W QC+PC    C+ Q  PL+DP +S TYK L 
Sbjct: 122 SGNYYVKLGLGTPPKYYAMILDTGSSLSWLQCQPC-AVYCHAQADPLYDPSVSKTYKKLS 180

Query: 148 CSSSQC-----ASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+S +C     A+LN   C   +  C Y+ SYGD SFS G L+ + +TL S+      LP
Sbjct: 181 CASVECSRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLP 236

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
             T+GCG +N GLF  +  GI+GL    +S+++Q+ T
Sbjct: 237 QFTYGCGQDNQGLFG-RAAGIIGLARDKLSMLAQLST 272



 Score = 64.7 bits (156), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIH 309
           +IDSGT +T LP    + L      ++  +    P  S L+ C+  S  S+S VPE+ + 
Sbjct: 346 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 405

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           F+G AD+ L   +  ++  + I C  F G   TN + I GN  Q  + + YD+    + F
Sbjct: 406 FQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGF 465

Query: 367 KPTDC 371
            P  C
Sbjct: 466 APGSC 470


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 122/442 (27%), Positives = 192/442 (43%), Gaps = 91/442 (20%)

Query: 1   MATFL-SCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
           MA F  S +F L  LCF +     + +    + L+H        Y+        +++A  
Sbjct: 1   MAIFFTSPLFFLIILCFSISVVHLSASPTLVLNLVH----SYHIYSRKPPHVYHIKEA-- 54

Query: 60  RSLNRLNHFNQNSS--ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
            S+ RL +    ++  I +  +    IIP    +L+ ISIG+PP  +L   DT SDL+W 
Sbjct: 55  -SVERLEYLKAKTTGDIIAHLSPNVPIIPQA--FLVNISIGSPPITQLLHMDTASDLLWI 111

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGVNCQYSVSYGDGS 176
           QC PC    CY Q  P+FDP  S T+++  C +SQ +  + K + +  +C+YS+ Y D +
Sbjct: 112 QCLPC--INCYAQSLPIFDPSRSYTHRNETCRTSQYSMPSLKFNANTRSCEYSMRYVDDT 169

Query: 177 FSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            S G LA E +   +   +  + AL  + FGCG +N G      TGI+GLG G+ SL+ +
Sbjct: 170 GSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDNYGE-PLVGTGILGLGYGEFSLVHR 228

Query: 235 MRTTIA---------------------GNQRLGVSTP-----------------DIVI-- 254
                +                     G   LG +TP                 D +I  
Sbjct: 229 FGKKFSYCFGSLDDPSYPHNVLVLGDDGANILGDTTPLEIHNGFYYVTIEAISVDGIILP 288

Query: 255 -------------------DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL-- 293
                              D+G +LT L +     L + +  + E +  A      ++  
Sbjct: 289 IDPRVFNRNHQTGLGGTIIDTGNSLTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIK 348

Query: 294 --CYSFN-----SLSQVPEVTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPI 344
             CY+ N       S  P VT HF  GA++ L   + F+K+S ++ C +V  G  NS+  
Sbjct: 349 MECYNGNFERDLVESGFPIVTFHFSEGAELSLDVKSLFMKLSPNVFCLAVTPGNLNSI-- 406

Query: 345 YGNIMQTNFLVGYDIEQQTVSF 366
            G   Q ++ +GYD+E   VSF
Sbjct: 407 -GATAQQSYNIGYDLEAMEVSF 427


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 88/240 (36%), Positives = 122/240 (50%), Gaps = 43/240 (17%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD---- 83
           G  +EL H D+ +  F  S      R+R A  RS  R+N     +   ++   ++D    
Sbjct: 29  GIRLELTHVDA-RGDFTGS-----DRVRRAADRSHRRVNGLLAAAPPPAASTLRSDGGGG 82

Query: 84  ----------IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDS 132
                     +  + A YL+  +IGTPP    AV DTGSDLIWTQC+ PC   +C+ Q +
Sbjct: 83  GACAATAAASVHASTATYLVDFAIGTPPLALSAVLDTGSDLIWTQCDAPC--RRCFPQPA 140

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------------NCQYSVSYGDGSFSN 179
           PL+ P  S TY ++ C S  C +L     S                C Y  SYGDGS ++
Sbjct: 141 PLYAPARSVTYANVSCGSRLCDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTD 200

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQMRTT 238
           G LATET T G+ T     +  + FGCGT+N GG  NS  +G+VG+G G +SL+SQ+  T
Sbjct: 201 GVLATETFTFGAGT----TVHDLAFGCGTDNLGGTDNS--SGLVGMGRGPLSLVSQLGVT 254



 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 65/128 (50%), Gaps = 11/128 (8%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-----SLSQVPEV 306
           ++IDSGTT T L +     L   +++ +     +     L +C++           VP +
Sbjct: 333 LIIDSGTTFTALEERAFVVLARAVAARVALPLASGAHLGLSVCFAAPQGRGPEAVDVPRL 392

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVF-KGITNS--VPIYGNIMQTNFLVGYDIEQQT 363
            +HF GAD++L RS+    V ED V  V   GI ++  + + G++ Q N  V YD+ +  
Sbjct: 393 VLHFDGADMELPRSS---AVVEDRVAGVACLGIVSARGMSVLGSMQQQNMHVRYDVGRDV 449

Query: 364 VSFKPTDC 371
           +SF+P +C
Sbjct: 450 LSFEPANC 457


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  120 bits (302), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 97/208 (46%), Gaps = 33/208 (15%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           FS++L     P+    N     Y+ L    L R   R+N  N    ++ S  +++D+ P 
Sbjct: 77  FSLQL----HPRETLLNEQHPNYKTLVLSRLARDTARVNSLNTKLQLALSSLNRSDLYPT 132

Query: 88  N---------------------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                                   Y  R+ +G P      V DTGSD+ W QC+PC  S 
Sbjct: 133 ETELLRPEDLSTPVSSGTAQGSGEYFSRVGVGQPSKPFYMVLDTGSDVNWLQCKPC--SD 190

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
           CY Q  P+FDP  SS+Y  L C + QC  L   +C    C Y VSYGDGSF+ G   TET
Sbjct: 191 CYQQSDPIFDPTASSSYNPLTCDAQQCQDLEMSACRNGKCLYQVSYGDGSFTVGEYVTET 250

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLF 214
           V+ G+ +   VA+     GCG +N GLF
Sbjct: 251 VSFGAGSVNRVAI-----GCGHDNEGLF 273



 Score = 54.3 bits (129), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 57/125 (45%), Gaps = 6/125 (4%)

Query: 252 IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
           +++DSGT +T L  Q YNS   +        +P A+     + CY  +SL   +VP V+ 
Sbjct: 369 VIVDSGTAITRLRTQAYNSVRDAFKRKTSNLRP-AEGVALFDTCYDLSSLQSVRVPTVSF 427

Query: 309 HFRGADV-KLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G     L   N+ + V      C  F   T+S+ I GN+ Q    V +D+    V F
Sbjct: 428 HFSGDRAWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGF 487

Query: 367 KPTDC 371
            P  C
Sbjct: 488 SPNKC 492


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 76/227 (33%), Positives = 113/227 (49%), Gaps = 31/227 (13%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNS------------SISS 76
             V L+HRDS     +  + TP Q L   L R   R     + +             +SS
Sbjct: 61  LHVRLLHRDS-----FAVNATPAQLLARRLQRDELRAAWIIKAAAPAAAANDTPVVGLSS 115

Query: 77  SKASQADIIPN----NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
             A  A ++      +  Y+ +I++GTP  E L   DTGSD+ W QC+PC   +CY Q  
Sbjct: 116 GGAFVAPVVSRAPTTSGEYMAKIAVGTPAVEALLAMDTGSDITWLQCQPC--RRCYPQSG 173

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKS---CSGVNCQYSVSYG-DGSFSNGNLATETVT 188
           P+FDP+ S++Y+ +   +  C +L +        + C Y+V YG DGS + G+   ET+T
Sbjct: 174 PVFDPRHSTSYREMGYDAPDCQALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLT 233

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                   V +P ++ GCG +N GLF +   GI+GLG G IS  SQ+
Sbjct: 234 FAG----GVQVPHMSIGCGHDNKGLFAAPAAGILGLGRGQISCPSQI 276



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 66/127 (51%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVPEVT 307
           +++DSGT +T L +  Y +   +  ++ ++   V+   P+G  + CY+    + +VP V+
Sbjct: 367 VILDSGTAVTRLARRAYIAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAMKVPTVS 426

Query: 308 IHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
           +HF G  ++ L   N+ + V S   VC  F G  + SV I GNI Q  F V Y+I    V
Sbjct: 427 MHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNIGGGRV 486

Query: 365 SFKPTDC 371
            F P  C
Sbjct: 487 GFAPNSC 493


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           FGCG  N G+F S  TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228


>gi|297737850|emb|CBI27051.3| unnamed protein product [Vitis vinifera]
          Length = 256

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 57/127 (44%), Positives = 76/127 (59%), Gaps = 6/127 (4%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG+PP     V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y  L 
Sbjct: 50  SGEYFSRVGIGSPPKHVYMVVDTGSDVNWVQCAPC--ADCYQQADPIFEPSFSSSYAPLT 107

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C   +C Y VSYGDGS++ G+ ATET+TL      + +L  +  GCG
Sbjct: 108 CETHQCKSLDVSECRNDSCLYEVSYGDGSYTVGDFATETITLDG----SASLNNVAIGCG 163

Query: 208 TNNGGLF 214
            +N GLF
Sbjct: 164 HDNEGLF 170


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 69/151 (45%), Positives = 89/151 (58%), Gaps = 24/151 (15%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 179 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 236

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 237 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 291

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            GLF   T G++GLG  ++SL+SQ      G
Sbjct: 292 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGG 321



 Score = 44.7 bits (104), Expect = 0.066,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 9/131 (6%)

Query: 251 DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQV--PEV 306
           ++++DSGT +T L P  Y +           E  P A P   L+ CY+     +V  P +
Sbjct: 392 NVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLL 451

Query: 307 TIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQ 361
           T+   G AD+ +  +       +D   VC     ++  +  PI GN  Q N  V YD   
Sbjct: 452 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVG 511

Query: 362 QTVSFKPTDCT 372
             + F   DC+
Sbjct: 512 SRLGFADEDCS 522


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 69/151 (45%), Positives = 89/151 (58%), Gaps = 24/151 (15%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK-----S 160
           + DTGSDL W QC+PC  S CY Q  PLFDP  S++Y ++PC++S C ASL        S
Sbjct: 180 IVDTGSDLTWVQCKPC--SVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPGS 237

Query: 161 CSGV----------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
           C+ V           C YS++YGDGSFS G LAT+TV LG  +     + G  FGCG +N
Sbjct: 238 CATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVALGGAS-----VDGFVFGCGLSN 292

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            GLF   T G++GLG  ++SL+SQ      G
Sbjct: 293 RGLFGG-TAGLMGLGRTELSLVSQTAPRFGG 322



 Score = 44.7 bits (104), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 56/131 (42%), Gaps = 9/131 (6%)

Query: 251 DIVIDSGTTLTFL-PQGYNSNLLSVMSSM-IEAQPVADPTGSLELCYSFNSLSQV--PEV 306
           ++++DSGT +T L P  Y +           E  P A P   L+ CY+     +V  P +
Sbjct: 393 NVLLDSGTVITRLAPSVYRAVRAEFARQFGAERYPAAPPFSLLDACYNLTGHDEVKVPLL 452

Query: 307 TIHFRG-ADVKLSRSNFFVKVSED--IVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQ 361
           T+   G AD+ +  +       +D   VC     ++  +  PI GN  Q N  V YD   
Sbjct: 453 TLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTVG 512

Query: 362 QTVSFKPTDCT 372
             + F   DC+
Sbjct: 513 SRLGFADEDCS 523


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           FGCG  N G+F S  TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 81  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 138

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 139 STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 195

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           FGCG  N G+F S  TGI G G G +SL SQ++
Sbjct: 196 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 228



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 61/127 (48%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHF 310
           +IDSGT +T LP      +    ++ ++   V+  T     C S    +   VP++ +HF
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHF 368

Query: 311 RGADVKLSRSNFFVKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
            GA + L R N+  +V +    I+C ++ +G    V   GN  Q N  V YD++   +SF
Sbjct: 369 EGATMDLPRENYVFEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSF 426

Query: 367 KPTDCTK 373
            P  C K
Sbjct: 427 VPAQCDK 433


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 113/425 (26%), Positives = 177/425 (41%), Gaps = 89/425 (20%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF----NQNSSI---------SS 76
           + +LIHRDS  SP YN +++   R +  L  S  R ++      +NS++         ++
Sbjct: 36  TTKLIHRDSIFSPAYNPNDSIKDRAKRMLKNSNARFDYVQAISKRNSAVVDYDGGDTSAA 95

Query: 77  SKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             A +A ++     +L+  SIG PP  + AV DTGS L W QCEPC    C+ Q  PL++
Sbjct: 96  DDAYEASLLSELCTFLVNFSIGQPPVPQYAVMDTGSSLTWIQCEPC--INCHQQKGPLYN 153

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  SSTY S         +    +  G +C YS +Y D + + G  A E +   +     
Sbjct: 154 PSSSSTYVSCSDFDRTDTTFT--ATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGI 211

Query: 197 VALPGITFGCGTNNGGL--FNSKTTGIVGLGGGDISLISQM------------------- 235
             +  + FGCG NN  L       +G+ GLG    S+IS++                   
Sbjct: 212 TIMHDVIFGCGHNNTQLPGPTGYASGVFGLGDSGSSIISKLGFGFSYCIGNIGDPLYGFH 271

Query: 236 RTTIAGNQRL-GVSTP--------------------------------------DIVIDS 256
           R T+    ++ G STP                                       IVIDS
Sbjct: 272 RLTLGNKLKIEGYSTPLVPRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDS 331

Query: 257 GTTLTFLP-QGYN---SNLLSVMSSMIEAQPVADPTGSLELCY--SFN-SLSQVPEVTIH 309
           G TL+++P Q YN     + S++S  +           L LCY    N  L   P+ T H
Sbjct: 332 GATLSYIPRQAYNVVRDKVSSILSGFLSRYRYI--ARHLSLCYIGKLNQDLQGFPDATFH 389

Query: 310 F-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
              GAD+       F + +++++C       +     + G + Q  + V YD++QQ + F
Sbjct: 390 LADGADLVFQVEGLFFQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYF 449

Query: 367 KPTDC 371
           +  +C
Sbjct: 450 QRIEC 454


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 59/133 (44%), Positives = 79/133 (59%), Gaps = 10/133 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI IG+P  +   V DTGSD+ W QC PC  + CY Q  PLFDP +SS+Y ++P
Sbjct: 193 SGEYFSRIGIGSPARQLYMVLDTGSDVTWLQCAPC--ADCYAQSDPLFDPALSSSYATVP 250

Query: 148 CSSSQCASLNQKSC------SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C S  C +L+  +C         +C Y V+YGDGS++ G+ ATET+TLG     AV    
Sbjct: 251 CDSPHCRALDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTLGGDGSAAVH--D 308

Query: 202 ITFGCGTNNGGLF 214
           +  GCG +N GLF
Sbjct: 309 VAIGCGHDNEGLF 321



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +++DSGT +T L     S L        +A P A      + CY     S  QVP V++ 
Sbjct: 417 VIVDSGTAVTRLQSSAYSALRDAFVRGTQALPRASGVSLFDTCYDLAGRSSVQVPAVSLR 476

Query: 310 FRGA-DVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F G  ++KL   N+ + V      C  F     +V I GN+ Q    V +D  + TV F 
Sbjct: 477 FEGGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFS 536

Query: 368 PTDC 371
           P  C
Sbjct: 537 PNKC 540


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 80/231 (34%), Positives = 120/231 (51%), Gaps = 23/231 (9%)

Query: 25  QTGGFSVELIHRDSPKSPF--YNSSETPYQRLRDALTRSL-NRLNHF----NQNSSISSS 77
           + G   +E+ H+DS       +N     +  + D   RSL +R+       N + S+ + 
Sbjct: 62  ENGATILEMKHKDSCSGKILDWNKKLKKHLIMDDFQLRSLQSRMKSIISGRNIDDSVDAP 121

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + I     NY++ + +G    +   + DTGSDL W QC+PC   +CY Q  P+F+P
Sbjct: 122 IPLTSGIRLQTLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--KRCYNQQDPVFNP 177

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLG 190
             S +Y+++ CSS  C SL   + + GV      +C Y V+YGDGS++ G L TE + LG
Sbjct: 178 STSPSYRTVLCSSPTCQSLQSATGNLGVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDLG 237

Query: 191 STTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           ++T    A+    FGCG NN GLF    +G+VGLG   +SLISQ      G
Sbjct: 238 NST----AVNNFIFGCGRNNQGLFGG-ASGLVGLGRSSLSLISQTSAMFGG 283



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 55/128 (42%), Gaps = 7/128 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT +T LP      L           P A     L+ C++ +   +V  P + +H
Sbjct: 353 MMIDSGTVITRLPPSIYQALKDEFVKQFSGFPSAPAFMILDTCFNLSGYQEVEIPNIKMH 412

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F G    +V ++   +FVK     VC     ++  N V I GN  Q N  V YD +   +
Sbjct: 413 FEGNAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSML 472

Query: 365 SFKPTDCT 372
            F    CT
Sbjct: 473 GFAAEACT 480


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 69/178 (38%), Positives = 93/178 (52%), Gaps = 15/178 (8%)

Query: 45  NSSETPYQRLRDALTRSLNRLN------HFNQNSSISSSKASQADIIPNNANYLIRISIG 98
           +S++TP Q     L R   R+       H  +++  S S +  + +   +  Y  RI +G
Sbjct: 66  SSNKTPEQLFHLRLQRDAKRVEALLNQIHARRSAGSSFSSSIISGLAQGSGEYFTRIGVG 125

Query: 99  TPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           TP      V DTGSD++W QC PC   +CY Q   +FDP  S TY  +PC +  C  L+ 
Sbjct: 126 TPARYVYMVLDTGSDVVWLQCAPC--RKCYTQTDHVFDPTKSRTYAGIPCGAPLCRRLDS 183

Query: 159 KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
             CS  N  CQY VSYGDGSF+ G+ +TET+T        VAL     GCG +N GLF
Sbjct: 184 PGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTFRRNRVTRVAL-----GCGHDNEGLF 236



 Score = 65.9 bits (159), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 68/155 (43%), Gaps = 12/155 (7%)

Query: 222 VGLGGGDISLISQ--MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
           + +GG  +  +S    R   AGN         ++IDSGT++T L +     L        
Sbjct: 313 ISVGGAPVRGLSASLFRLDAAGNG-------GVIIDSGTSVTRLTRPAYIALRDAFRIGA 365

Query: 280 EAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFK 336
                A      + C+  + L++V  P V +HFRGADV L  +N+ + V      C  F 
Sbjct: 366 SHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRGADVSLPATNYLIPVDNSGSFCFAFA 425

Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           G  + + I GNI Q  F + YD+    V F P  C
Sbjct: 426 GTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  120 bits (300), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 70/188 (37%), Positives = 97/188 (51%), Gaps = 15/188 (7%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
            +++L H DS      + ++TP       L R   R++  N  ++  SS      +   +
Sbjct: 54  LTLDLHHLDS-----LSLNKTPTDLFNLRLHRDTLRVHALNSRAAGFSSSVVSG-LSQGS 107

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y  R+ +GTPP     V DTGSD++W QC PC   +CY Q  P+F+P  S ++  +PC
Sbjct: 108 GEYFTRLGVGTPPRYLYMVLDTGSDVVWLQCSPC--RKCYSQSDPIFNPYKSKSFAGIPC 165

Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           SS  C  L+   CS     C Y VSYGDGSF+ G+ ATET+T        VAL     GC
Sbjct: 166 SSPLCRRLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GC 220

Query: 207 GTNNGGLF 214
           G +N GLF
Sbjct: 221 GHHNEGLF 228



 Score = 68.6 bits (166), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 40/124 (32%), Positives = 59/124 (47%), Gaps = 3/124 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T L +   + L                    + CY  +  S  +VP V +H
Sbjct: 330 VIIDSGTSVTRLTRPAYTALRDAFRVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLH 389

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGAD+ L  +N+ + V E+   C  F G  + + I GNI Q  F V YD+    + F P
Sbjct: 390 FRGADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAP 449

Query: 369 TDCT 372
             CT
Sbjct: 450 RGCT 453


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/155 (44%), Positives = 95/155 (61%), Gaps = 13/155 (8%)

Query: 84  IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           ++ N+A  Y + +SIGTPP     +ADTGS LIWTQC PC  ++C  + +P F P  SST
Sbjct: 82  LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139

Query: 143 YKSLPCSSSQCASLNQ--KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +  LPC+SS C  L    ++C+   C Y   YG G F+ G LATET+ +G  +      P
Sbjct: 140 FSKLPCASSLCQFLTSPYRTCNATGCVYYYPYGMG-FTAGYLATETLHVGGAS-----FP 193

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           G+TFGC T NG    + ++GIVGLG   +SL+SQ+
Sbjct: 194 GVTFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 226


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 12/162 (7%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           SS + QA +      Y + IS+GTP      VADTGSDLIWTQC PC  ++C+ Q +P F
Sbjct: 71  SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128

Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            P  SST+  LPC+SS C  L  + ++C+   C Y+  YG G ++ G LATET+ +G   
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
               + P + FGC T NG    + T+GI GLG G +SLI Q+
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQL 222



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 38/134 (28%), Positives = 60/134 (44%), Gaps = 15/134 (11%)

Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS----QVPEVT 307
           ++DSGTTLT+L + GY     + +S   +   V + T  L+LC+           VP + 
Sbjct: 304 IVDSGTTLTYLAKDGYEMVKQAFLSQTADVTTV-NGTRGLDLCFKSTGGGGGGIAVPSLV 362

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNFLVGYDI 359
           + F G   + +   +F  V  D   SV       +P        + GN+MQ +  + YD+
Sbjct: 363 LRFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 421

Query: 360 EQQTVSFKPTDCTK 373
           +    SF P DC K
Sbjct: 422 DGGIFSFAPADCAK 435


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 107/365 (29%), Positives = 153/365 (41%), Gaps = 89/365 (24%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           ++ + +SIGTPP  R  + DTGSDLIWTQC+     Q   ++ PL+DP  SS++ + PC 
Sbjct: 88  HHTLTVSIGTPPQPRTLILDTGSDLIWTQCKLFDTRQ--HREKPLYDPAKSSSFAAAPCD 145

Query: 150 SSQC--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
              C   S N K+CS   C Y+ +YG  + + G LA+ET T G     +V+L    FGCG
Sbjct: 146 GRLCETGSFNTKNCSRNKCIYTYNYGSAT-TKGELASETFTFGEHRRVSVSL---DFGCG 201

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLI--------------------------------SQM 235
               G      +GI+G+    +SL+                                S+ 
Sbjct: 202 KLTSGSL-PGASGILGISPDRLSLVSQLQIPRFSYCLTPFLDRNTTSHIFFGAMADLSKY 260

Query: 236 RTT--------------------------IAGNQRLGVSTPDIVI-DSGTTLTFLPQGYN 268
           RTT                            G +RL V      I   G+  TF+  G  
Sbjct: 261 RTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDT 320

Query: 269 SNLLS--VMSSMIEAQ------PVADPTG---SLELCY--------SFNSLSQVPEVTIH 309
           + +L   VM ++ EA       PV + T      ELC+        +  +  QVP +  H
Sbjct: 321 TGMLPSVVMEALKEAMVEAVKLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVYH 380

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G A + L R ++ V+VS   +C V         I GN  Q N  V +D+E    SF P
Sbjct: 381 FDGGAAMLLRRDSYMVEVSAGRMCLVISSGARGA-IIGNYQQQNMHVLFDVENHEFSFAP 439

Query: 369 TDCTK 373
           T C +
Sbjct: 440 TQCNQ 444


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 119/428 (27%), Positives = 175/428 (40%), Gaps = 102/428 (23%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI-IP-- 86
           S+ L HR  P +P   SS   +  L + L R   R +H  + +  S    + +D+ IP  
Sbjct: 61  SMPLAHRHGPCAPATTSS---WPSLAERLRRDRARRDHITRKAKASGRTTTLSDVSIPTS 117

Query: 87  -----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                ++  Y++ + IGTP  ++  + DTGSDL W QC+PC  S CY Q  PL+DP  SS
Sbjct: 118 LGAAVDSLEYVVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASS 177

Query: 142 TYKSLPCSSSQCASL----NQKSC---SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTT 193
           TY  +PC S  C  L        C   SG + CQY + YG+   + G  +TET+TL    
Sbjct: 178 TYAPVPCDSKACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTL---- 233

Query: 194 GQAVALPGITFGCG----------------------------TNNGGLFN------SKTT 219
              V++    FGCG                               GG F+      + TT
Sbjct: 234 SPQVSVKDFGFGCGLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYCLPPGNSTT 293

Query: 220 GIVGLGG---------------------GDISLISQMRTTIAGNQRLGVSTPDIVI---- 254
           G + LG                          L++    ++ G     +  P  V+    
Sbjct: 294 GFLALGAPTNNNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKP---LDIPPTVLSGGM 350

Query: 255 --DSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ--VPEVTI 308
             DSGT +T LP    S L +   + + A P+  P     L+ CY+F  ++   VP V +
Sbjct: 351 IIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNNDDVLDTCYNFTGIANVTVPTVAL 410

Query: 309 HFRGADVKLSRSNFFVKVSEDIV---CSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQT 363
            F G       +   + V   ++   C  F G  +   V I GN+ Q  F V YD  +  
Sbjct: 411 TFDGG------ATIDLDVPSGVLIQDCLAFAGGASDGDVGIIGNVNQRTFEVLYDSGRGH 464

Query: 364 VSFKPTDC 371
           V F+P  C
Sbjct: 465 VGFRPGAC 472


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P  E   V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y+ L 
Sbjct: 145 SGEYFTRVGIGKPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 202

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC +L    C    C Y VSYGDGS++ G+ ATET+T+GST  Q VA+     GCG
Sbjct: 203 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 257

Query: 208 TNNGGLF 214
            +N GLF
Sbjct: 258 HSNEGLF 264



 Score = 46.6 bits (109), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 36/125 (28%), Positives = 58/125 (46%), Gaps = 6/125 (4%)

Query: 252 IVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
           I+IDSGT +T L  + YNS   S +   ++ +  A      + CY+ ++ +  +VP V  
Sbjct: 360 IIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAG-VAMFDTCYNLSAKTTVEVPTVAF 418

Query: 309 HFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G   + L   N+ + V S    C  F    +S+ I GN+ Q    V +D+    + F
Sbjct: 419 HFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGF 478

Query: 367 KPTDC 371
               C
Sbjct: 479 SSNKC 483


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 57/129 (44%), Positives = 78/129 (60%), Gaps = 9/129 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTPP     V DTGSD++W QC+PC  ++CY Q   +FDP  S ++  +P
Sbjct: 127 SGEYFTRLGVGTPPKYLYMVLDTGSDVVWLQCKPC--TKCYSQTDQIFDPSKSKSFAGIP 184

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C S  C  L+   CS  N  CQY VSYGDGSF+ G+ +TET+T      +  A+P +  G
Sbjct: 185 CYSPLCRRLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIG 239

Query: 206 CGTNNGGLF 214
           CG +N GLF
Sbjct: 240 CGHDNEGLF 248



 Score = 68.2 bits (165), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 42/123 (34%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T L +    +L             A      + CY  + LS+V  P V +H
Sbjct: 350 VIIDSGTSVTRLTRPAYVSLRDAFRVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLH 409

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  +N+ V V      C  F G  + + I GNI Q  F V +D+    V F P
Sbjct: 410 FRGADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAP 469

Query: 369 TDC 371
             C
Sbjct: 470 RGC 472


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 67/153 (43%), Positives = 83/153 (54%), Gaps = 11/153 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 34  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 91

Query: 150 SSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           S+ C  L   SC          C Y+ SYGD S + G L  +  T     G   ++PG+ 
Sbjct: 92  STLCQGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVA 148

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           FGCG  N G+F S  TGI G G G +SL SQ++
Sbjct: 149 FGCGLFNNGVFKSNETGIAGFGRGPLSLPSQLK 181



 Score = 58.2 bits (139), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGT++T LP      +    ++ I+   V         C+S  S ++  VP++ +HF
Sbjct: 265 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 324

Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N  V YD++   +S
Sbjct: 325 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 382

Query: 366 FKPTDCTK 373
           F    C K
Sbjct: 383 FVAAQCDK 390


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 50  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 109

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 110 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 168

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC
Sbjct: 169 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 216



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S+++  PV D + +  L+LC++  S S     +P 
Sbjct: 330 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 388

Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
           +T+HF G AD+ L   N+ + +   + C   +  T+  +   GN  Q N  + YD++++T
Sbjct: 389 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 447

Query: 364 VSFKPTDCT 372
           +SF P  C+
Sbjct: 448 LSFAPAKCS 456


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 110/206 (53%), Gaps = 26/206 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
           FS+ L  R +  +P Y    T  + RL RDA         L RSLN   HF +  SI+ S
Sbjct: 69  FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126

Query: 78  KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
               +   P        + A YL +I +G P      V DTGSD+ W QC+PC     CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDPK SS+Y  L C+S QC  L++ +C+   C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
            G++     ++P +  GCG +N GLF
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLF 268



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF- 310
           I++DSGT ++ LP     +L      +  +   A      + CY+F+  S V   TI F 
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423

Query: 311 --RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
              G  ++L   N+ + + +    C  F    +S+ I G+  Q    V YD+    V F 
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFS 483

Query: 368 PTDC 371
              C
Sbjct: 484 TNKC 487


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/127 (47%), Positives = 77/127 (60%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P  E   V DTGSD+ W QC PC  + CY Q  P+F+P  SS+Y+ L 
Sbjct: 148 SGEYFTRVGIGNPAREVYMVLDTGSDVNWLQCTPC--ADCYHQTEPIFEPSSSSSYEPLS 205

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC +L    C    C Y VSYGDGS++ G+ ATET+T+GST  Q VA+     GCG
Sbjct: 206 CDTPQCNALEVSECRNATCLYEVSYGDGSYTVGDFATETLTIGSTLVQNVAV-----GCG 260

Query: 208 TNNGGLF 214
            +N GLF
Sbjct: 261 HSNEGLF 267



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 60/128 (46%), Gaps = 12/128 (9%)

Query: 252 IVIDSGTTLTFLPQG-YNS---NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPE 305
           I+IDSGT +T L  G YNS   + L   S + +A  VA      + CY+ ++ +  +VP 
Sbjct: 363 IIIDSGTAVTRLQTGIYNSLRDSFLKGTSDLEKAAGVA----MFDTCYNLSAKTTIEVPT 418

Query: 306 VTIHFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
           V  HF G   + L   N+ + V S    C  F    +S+ I GN+ Q    V +D+    
Sbjct: 419 VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 478

Query: 364 VSFKPTDC 371
           + F    C
Sbjct: 479 IGFSSNKC 486


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 45  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 211



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S+++  PV D + +  L+LC++  S S     +P 
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 383

Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
           +T+HF G AD+ L   N+ + +   + C   +  T+  +   GN  Q N  + YD++++T
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 442

Query: 364 VSFKPTDCT 372
           +SF P  C+
Sbjct: 443 LSFAPAKCS 451


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 71/162 (43%), Positives = 96/162 (59%), Gaps = 12/162 (7%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           SS + QA +      Y + IS+GTP      VADTGSDLIWTQC PC  ++C+ Q +P F
Sbjct: 71  SSVSFQALLENGVGGYNMNISVGTPLLTFPVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128

Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            P  SST+  LPC+SS C  L  + ++C+   C Y+  YG G ++ G LATET+ +G   
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
               + P + FGC T NG    + T+GI GLG G +SLI Q+
Sbjct: 186 ---ASFPSVAFGCSTENG--VGNSTSGIAGLGRGALSLIPQL 222



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 59/133 (44%), Gaps = 14/133 (10%)

Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTI 308
           ++DSGTTLT+L + GY     + +S       V + T  L+LC+          VP + +
Sbjct: 304 IVDSGTTLTYLAKDGYEMVKQAFLSQTANVTTV-NGTRGLDLCFKSTGGGGGIAVPSLVL 362

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--------IYGNIMQTNFLVGYDIE 360
            F G   + +   +F  V  D   SV       +P        + GN+MQ +  + YD++
Sbjct: 363 RFDGG-AEYAVPTYFAGVETDSQGSVTVACLMMLPAKGDQPMSVIGNVMQMDMHLLYDLD 421

Query: 361 QQTVSFKPTDCTK 373
               SF P DC K
Sbjct: 422 GGIFSFSPADCAK 434


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 110/206 (53%), Gaps = 26/206 (12%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQ-RL-RDA---------LTRSLNRLNHFNQNSSISSS 77
           FS+ L  R +  +P Y    T  + RL RDA         L RSLN   HF +  SI+ S
Sbjct: 69  FSLPLYPRLALHNPSYKDYNTLVRARLTRDAARVQFLNRNLERSLNGGTHFGE--SINES 126

Query: 78  KASQADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ-CY 128
               +   P        + A YL +I +G P      V DTGSD+ W QC+PC     CY
Sbjct: 127 LIGDSITAPVVSGQSKGSGAEYLAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCY 186

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
            Q  P+FDPK SS+Y  L C+S QC  L++ +C+   C Y V YGDGSF+ G LATET++
Sbjct: 187 KQFDPIFDPKSSSSYSPLSCNSQQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLS 246

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
            G++     ++P +  GCG +N GLF
Sbjct: 247 FGNSN----SIPNLPIGCGHDNEGLF 268



 Score = 46.2 bits (108), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 52/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF- 310
           I++DSGT ++ LP     +L      +  +   A      + CY+F+  S V   TI F 
Sbjct: 364 IIVDSGTIISRLPSDVYESLREAFVKLTSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423

Query: 311 --RGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
              G  ++L   N+ + + +    C  F    +S+ I G+  Q    V YD+    V F 
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFS 483

Query: 368 PTDC 371
              C
Sbjct: 484 TNKC 487


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/208 (33%), Positives = 103/208 (49%), Gaps = 32/208 (15%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLR-DALTRSLNRLNHFNQNSSISSSKASQADIIP- 86
           FS+EL     P+   +  S   Y+ L    L R   R+   N    ++ S   ++D++P 
Sbjct: 80  FSLEL----HPRELLHGGSHKDYRALMLSRLARDSARVKAINTKLQLAVSGTDKSDLVPM 135

Query: 87  --------------------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                                +  Y +R+ IG P      V DTGSD+ W QC+PC    
Sbjct: 136 DTEILHPQDFSTPVTSGTSQGSGEYFLRVGIGRPSKTFYMVIDTGSDVNWLQCKPC--DD 193

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
           CY Q  P+FDP  SS++  L C + QC +L+  +C   +C Y VSYGDGS++ G+ ATET
Sbjct: 194 CYQQVDPIFDPASSSSFSRLGCQTPQCRNLDVFACRNDSCLYQVSYGDGSYTVGDFATET 253

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLF 214
           V+ G++     ++  +  GCG +N GLF
Sbjct: 254 VSFGNSG----SVDKVAIGCGHDNEGLF 277



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 47/158 (29%), Positives = 69/158 (43%), Gaps = 12/158 (7%)

Query: 220 GIVGLG-GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSS 277
           GI G+  GG+   I      + G+ + G     I++D GT +T L  Q YN+ L      
Sbjct: 345 GITGMSVGGEKLAIPPSIFEVDGSGKGG-----IIVDCGTAVTRLQTQAYNA-LRDTFVK 398

Query: 278 MIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKV-SEDIVCS 333
           + +  P        + CY+ +S +  +VP V   F G   + L  SN+ + V S    C 
Sbjct: 399 LTKDLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFDGGKSLPLPPSNYLIPVDSAGTFCL 458

Query: 334 VFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            F   T S+ I GN+ Q    V YD+    VSF    C
Sbjct: 459 AFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSRKC 496


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/169 (40%), Positives = 102/169 (60%), Gaps = 16/169 (9%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKASQADII--------PNNANYLIRISIGTPPTE 103
           Q +RDAL R ++R   F +  + SSS +S A  +        PN   Y++ ++IGTPP  
Sbjct: 45  QFVRDALRRDMHRRARFGRELASSSSSSSPAGTVSAPTRKDLPNGGEYIMTLAIGTPPQS 104

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS--QCASLNQKSC 161
             A+ADTGSDL+WTQC PC   +C+ Q SPL++P  S T++ LPCSS+   CA+  + + 
Sbjct: 105 YPAIADTGSDLVWTQCAPC-GERCFKQPSPLYNPSSSPTFRVLPCSSALNLCAAEARLAG 163

Query: 162 S----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +    G  C+Y+ +YG G +++G   +ET T GS+    V +PGI FGC
Sbjct: 164 ATPPPGCACRYNQTYGTG-WTSGLQGSETFTFGSSPADQVRVPGIAFGC 211



 Score = 60.5 bits (145), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 71/129 (55%), Gaps = 10/129 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLSQ----VPE 305
           ++IDSGTT+T L       + + + S+++  PV D + +  L+LC++  S S     +P 
Sbjct: 325 LIIDSGTTITSLVDAAYKRVRAAVRSLVKL-PVTDGSNATGLDLCFALPSSSAPPATLPS 383

Query: 306 VTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
           +T+HF G AD+ L   N+ + +   + C   +  T+  +   GN  Q N  + YD++++T
Sbjct: 384 MTLHFGGGADMVLPVENYMI-LDGGMWCLAMRSQTDGELSTLGNYQQQNLHILYDVQKET 442

Query: 364 VSFKPTDCT 372
           +SF P  C+
Sbjct: 443 LSFAPAKCS 451


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  119 bits (298), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 58/129 (44%), Positives = 76/129 (58%), Gaps = 9/129 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLF 214
           CG +N GLF
Sbjct: 252 CGHDNEGLF 260



 Score = 61.6 bits (148), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T L +     +        +    A      + C+  +++++V  P V +H
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLH 421

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FR ADV L  +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P
Sbjct: 422 FRRADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 481

Query: 369 TDC 371
             C
Sbjct: 482 GGC 484


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 75/232 (32%), Positives = 111/232 (47%), Gaps = 25/232 (10%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDAL-TRSLNRLNHFNQNSSISSSKASQ 81
             + G   +E+  R               Q + D L  RS+   NH  + +S S    S 
Sbjct: 48  RKEKGAIILEMKDRGECSESERKGDWVEKQLVLDGLHVRSIQ--NHIRKRTSSSQIADSS 105

Query: 82  ADIIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
              +P          NY++ + +G+       + DTGSDL W QCEPC    CY Q+ PL
Sbjct: 106 ETQVPLTSGIKFQTLNYIVTMGLGSQNMS--VIVDTGSDLTWVQCEPC--RSCYNQNGPL 161

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQKSC-----SGVNCQYSVSYGDGSFSNGNLATETVTL 189
           F P  S +Y+ + C+S+ C SL   +C     +   C Y V+YGDGS+++G L  E +  
Sbjct: 162 FKPSTSPSYQPILCNSTTCQSLELGACGSDPSTSATCDYVVNYGDGSYTSGELGIEKLGF 221

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           G      +++    FGCG NN GLF    +G++GLG  ++S+ISQ   T  G
Sbjct: 222 G-----GISVSNFVFGCGRNNKGLFGG-ASGLMGLGRSELSMISQTNATFGG 267



 Score = 43.9 bits (102), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 56/128 (43%), Gaps = 7/128 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           +++DSGT ++ L       L +         P A     L+ C++     QV  P ++++
Sbjct: 341 VILDSGTVISRLAPSVYKALKAKFLEQFSGFPSAPGFSILDTCFNLTGYDQVNIPTISMY 400

Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTV 364
           F G A++ +  +  F  V ED   VC     +++   + I GN  Q N  V YD +   V
Sbjct: 401 FEGNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQV 460

Query: 365 SFKPTDCT 372
            F    CT
Sbjct: 461 GFAKEPCT 468


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 58/129 (44%), Positives = 76/129 (58%), Gaps = 9/129 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+FDP+ S TY ++P
Sbjct: 139 SGEYFTRLGVGTPARYVYMVLDTGSDIVWLQCAPC--RRCYSQSDPIFDPRKSKTYATIP 196

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           CSS  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T      + VAL     G
Sbjct: 197 CSSPHCRRLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFRRNRVKGVAL-----G 251

Query: 206 CGTNNGGLF 214
           CG +N GLF
Sbjct: 252 CGHDNEGLF 260



 Score = 65.1 bits (157), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 59/123 (47%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T L +     +        +    A      + C+  +++++V  P V +H
Sbjct: 362 VIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLH 421

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  +N+ + V  +   C  F G    + I GNI Q  F V YD+    V F P
Sbjct: 422 FRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAP 481

Query: 369 TDC 371
             C
Sbjct: 482 GGC 484


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 70/157 (44%), Positives = 93/157 (59%), Gaps = 11/157 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   YLI + +G+P T +  + DTGSD+ W QC+PC  SQC+ Q  PLFDP  SSTY   
Sbjct: 48  NTLEYLITVGLGSPATSQTMLIDTGSDVSWVQCKPC--SQCHSQADPLFDPSSSSTYSPF 105

Query: 147 PCSSSQCASLNQKS---CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            C S+ CA L Q+     S   CQY V+YGDGS + G  +++T+ LGS+     A+    
Sbjct: 106 SCGSADCAQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLALGSS-----AVRSFQ 160

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           FGC     G FN +T G++GLGGG  SL+SQ   T+ 
Sbjct: 161 FGCSNVESG-FNDQTDGLMGLGGGAQSLVSQTAGTLG 196



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 43/124 (34%), Positives = 61/124 (49%), Gaps = 10/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
           V+DSGT +T LP    S L S   + ++  P A P+G L+ C+ F+  S V  P V + F
Sbjct: 267 VMDSGTVITRLPPTAYSALSSAFKAGMKQYPPAQPSGILDTCFDFSGQSSVSIPSVALVF 326

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L  S   +       C  F G ++  S+ I GN+ Q  F V YD+ +  V F+
Sbjct: 327 SGGAVVSLDASGIILS-----NCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFR 381

Query: 368 PTDC 371
              C
Sbjct: 382 AGAC 385


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 55/130 (42%), Positives = 79/130 (60%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ IG+P  E   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y ++
Sbjct: 165 GSGEYFSRVGIGSPARELYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAV 222

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C S +C  L+  +C      C Y V+YGDGS++ G+ ATET+TLG +T     +  +  
Sbjct: 223 SCDSPRCRDLDTAACRNATGACLYEVAYGDGSYTVGDFATETLTLGDST----PVTNVAI 278

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 279 GCGHDNEGLF 288



 Score = 48.1 bits (113), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 51/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +++DSGT +T L     + L         + P        + CY  +  +  +VP V++ 
Sbjct: 386 VIVDSGTAVTRLQSSAYAALRDAFVRGTPSLPRTSGVSLFDTCYDLSDRTSVEVPAVSLR 445

Query: 310 FRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F G   ++L   N+ + V      C  F     +V I GN+ Q    V +D  +  V F 
Sbjct: 446 FEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFT 505

Query: 368 PTDC 371
           P  C
Sbjct: 506 PNKC 509


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 74/191 (38%), Positives = 103/191 (53%), Gaps = 14/191 (7%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           + +R    RS  R      +S+ +  S  +  D +P    YL+ ++IGTPP       DT
Sbjct: 52  ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
           GSDL+WTQC+PC  + C+ Q  P +D   SST+    C S+QC  L+      VN     
Sbjct: 111 GSDLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C +S SYGD S + G L  ETV+  +      ++PG+ FGCG NN G+F S  TGI G G
Sbjct: 168 CAFSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223

Query: 226 GGDISLISQMR 236
            G +SL SQ++
Sbjct: 224 RGPLSLPSQLK 234



 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
           +IDSGT  T LP      +    ++ ++   V ++ TG L LC+S   L +   VP++ +
Sbjct: 315 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 373

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF GA + L R N+  +  +   CS+    I   + I GN  Q N  V YD++   +SF 
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433

Query: 368 PTDCTK 373
              C K
Sbjct: 434 RAKCDK 439


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 61/161 (37%), Positives = 85/161 (52%), Gaps = 3/161 (1%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
           A +      YL  + +GTP      + DTGSDL W QC PC   +CY Q+  LF P  S+
Sbjct: 4   APVAAARGEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GKCYSQNDALFLPNTST 61

Query: 142 TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           ++  L C S+ C  L    C+   C Y  SYGDGS + G+   +T+T+    GQ   +P 
Sbjct: 62  SFTKLACGSALCNGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPN 121

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
             FGCG +N G F +   GI+GLG G +S  SQ+++   G 
Sbjct: 122 FAFGCGHDNEGSF-AGADGILGLGQGPLSFHSQLKSVYNGK 161



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 41/127 (32%), Positives = 63/127 (49%), Gaps = 6/127 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVM--SSMIEAQPVADPTGSLELCYS---FNSLSQVPEVT 307
           + DSGTT+T L +     +L+ M  S+M  ++ + D    L+LC S    + L  VP +T
Sbjct: 239 IFDSGTTVTQLAEAAYKEVLAAMNASTMAYSRKIDD-ISRLDLCLSGFPKDQLPTVPAMT 297

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            HF G D+ L  SN+F+ +            +  V I G++ Q NF V YD   + + F 
Sbjct: 298 FHFEGGDMVLPPSNYFIYLESSQSYCFAMTSSPDVNIIGSVQQQNFQVYYDTAGRKLGFV 357

Query: 368 PTDCTKQ 374
           P DC  +
Sbjct: 358 PKDCVGR 364


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 113/444 (25%), Positives = 184/444 (41%), Gaps = 82/444 (18%)

Query: 5   LSCVFILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTR 60
           ++ V +L  +CF ++ SP     + + GFS  LIH  SP SP+ N       +   AL  
Sbjct: 15  MASVNLLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALES 73

Query: 61  SLNRLNHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
           +L+R  +    Q  ++  +      +I + + +L  +SIG PPT    V DTGSDL W Q
Sbjct: 74  TLSRHAYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQ 133

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGS 176
           CEPC    CY Q  P+++   S +Y  + C+   C SL ++  CS   +C Y  SY DGS
Sbjct: 134 CEPC--DVCYKQKDPIYNRTKSDSYTEMLCNEPPCLSLGREGQCSDSGSCLYQTSYADGS 191

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-------------------------- 210
            ++G L+ E V   S          + FGCG  N                          
Sbjct: 192 RTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFVTSSRDGGVLGLGPGLVSLVSQL 251

Query: 211 --------------GGLFNSKTTGIVGLG-----GGDIS--LISQMRTTIAGNQRLGVST 249
                         G L N    G +  G      GD++  +I++          LGV  
Sbjct: 252 SAIGKVSKSFAYCFGNLSNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVEE 311

Query: 250 PD-----------------IVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
           P                  ++IDSG+TL+ F P+ Y     +V+  + +   ++  T S 
Sbjct: 312 PRLDINSSSFERKPDGSGGVIIDSGSTLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSP 371

Query: 292 ELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
           + C+       L   P + ++     +   R + F++  +++ C  F      + I G +
Sbjct: 372 D-CFEGKIGRDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTL 429

Query: 349 MQTNFLVGYDIEQQTVSFKPT-DC 371
            Q ++  GY++E  T+S +   DC
Sbjct: 430 AQQSYKFGYNLELSTLSIESNPDC 453


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 153/352 (43%), Gaps = 77/352 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C   +C+ Q +PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------------------------- 238
             +        +GIVGLG    SL++Q                                 
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQTGVAAFSYCLAPHDAGKNSALFLGSSAKLAGGG 220

Query: 239 ---------IAGN----------QRLGVSTPDIVI-----------DSGTTLTFLPQGYN 268
                    I+GN          Q  G+   D +I           D+ + ++FL  G  
Sbjct: 221 KAASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTVLLDTFSPISFLVDGAY 280

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRSNFFVKV 326
             +   ++  + A P+A P    +LC+  +  S   P++   FR GA + +  +N+ +  
Sbjct: 281 QAVKKAVTVAVGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDY 340

Query: 327 SEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
               VC     S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 341 KNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 392


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 120/423 (28%), Positives = 167/423 (39%), Gaps = 95/423 (22%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL--RDAL-TRSLNRLNHFNQNSSISSSKASQADI 84
           G +V L HR  P SP  ++ E     L  RD L  + +      N  S     + S A  
Sbjct: 52  GTTVPLSHRHGPCSPAPSTVEPTMAELLRRDQLRAKYIQAKLSVNSGSGTDGVQQSAAIT 111

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           +P       +   Y+I +SIGTP   +  + DTGSD+ W  C     ++     S  FDP
Sbjct: 112 LPTTLGSALDTLAYVITVSIGTPAMTQAVMIDTGSDVSWVHCH----ARAGAGSSLFFDP 167

Query: 138 KMSSTYKSLPCSSSQCASLNQK--SCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
             SSTY    CSS+ C  L  +   CS    CQY+V YGDGS + G   ++T+ L ST  
Sbjct: 168 GKSSTYTPFSCSSAACTRLEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLALNSTE- 226

Query: 195 QAVALPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRTTI------------ 239
               +    FGC   +    GL   +T G++GLGGG  SL+SQ   T             
Sbjct: 227 ---KVENFQFGCSETSDPGEGLDEDQTDGLMGLGGGAPSLVSQTAATYGSAFSYCLPATT 283

Query: 240 --AGNQRLGVST-------------------------------------PDI-----VID 255
             +G   LG ST                                     P +     ++D
Sbjct: 284 RSSGFLTLGASTGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVFAAGSIMD 343

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGA 313
           SGT +T LP    S L +   + +   P A     L+ C+ F     V  P V + F G 
Sbjct: 344 SGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSILDTCFDFTGQDNVSIPAVELVFSGG 403

Query: 314 DVKLSRSNFFVKVSEDIV----CSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFKP 368
            V        V +  D +    C  F   T  +  I GN+ Q  F V +D+ Q  + F+P
Sbjct: 404 AV--------VDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSVLGFRP 455

Query: 369 TDC 371
             C
Sbjct: 456 GAC 458


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 58/127 (45%), Positives = 75/127 (59%), Gaps = 7/127 (5%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  R+ IG P +    V DTGSD+ W QC PC  + CY Q  P+F+P  S++Y  L 
Sbjct: 141 SGEYFSRVGIGKPSSPVYMVLDTGSDVNWIQCAPC--ADCYHQADPIFEPASSTSYSPLS 198

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           C + QC SL+   C    C Y VSYGDGS++ G+  TET+TLGS +   VA+     GCG
Sbjct: 199 CDTKQCQSLDVSECRNNTCLYEVSYGDGSYTVGDFVTETITLGSASVDNVAI-----GCG 253

Query: 208 TNNGGLF 214
            NN GLF
Sbjct: 254 HNNEGLF 260



 Score = 58.9 bits (141), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 56/124 (45%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I+IDSGT +T L     + L        +  PV       + CY  +  +  +VP VT H
Sbjct: 356 IIIDSGTAVTRLQTAAYNALRDAFVKGTKDLPVTSEVALFDTCYDLSRKTSVEVPTVTFH 415

Query: 310 FRGADV-KLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             G  V  L  +N+ + V  D   C  F   ++++ I GN+ Q    VG+D+    V F+
Sbjct: 416 LAGGKVLPLPATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFE 475

Query: 368 PTDC 371
           P  C
Sbjct: 476 PRQC 479


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 69/164 (42%), Positives = 93/164 (56%), Gaps = 14/164 (8%)

Query: 86  PNNANY---LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           P +A Y   L+ I +GTPP + + + DTGSDL W Q EPC    C+ Q  P+FDP  SST
Sbjct: 17  PESAGYGEFLVPIYLGTPPQKAVVIIDTGSDLTWIQSEPC--RACFEQADPIFDPSKSST 74

Query: 143 YKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           Y  + CSSS CA L   Q   +  NC Y+  YGDGS + G  + ET+T   T G+ V   
Sbjct: 75  YNKIACSSSACADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVK-- 132

Query: 201 GITFGCGTNNGGLF-NSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
              FG    N G F ++   GI+GLG G +S+ SQ+ + + GN+
Sbjct: 133 ---FGASVYNTGTFGDTGGEGILGLGQGPVSMPSQLGSVL-GNK 172



 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 40/122 (32%), Positives = 66/122 (54%), Gaps = 4/122 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGTT+T+L Q   + L++  +S +        TG L+LC++         P +TIH 
Sbjct: 249 IIDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG-LDLCFNTRGTGSPVFPAMTIHL 307

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            G  ++L  +N F+ +  +I+C  F    +  + I+GNI Q NF + YD++   + F P 
Sbjct: 308 DGVHLELPTANTFISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGFAPA 367

Query: 370 DC 371
           DC
Sbjct: 368 DC 369


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 125/433 (28%), Positives = 180/433 (41%), Gaps = 93/433 (21%)

Query: 30  SVELIHRDSPKSPFYNSS-ETPYQRL-RDALT-----RSLNRLNHFNQNSSISSS--KAS 80
           +V L HR  P SP  N    T  +RL RD L      R L+R        +      + S
Sbjct: 63  TVPLHHRHGPCSPLPNKKMPTLEERLHRDKLRAAYIHRKLSRGKKQGGGGAGGDVVVQQS 122

Query: 81  QADIIP-------NNANYLIRISIGTPPTE-RLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            A  +P       +   Y+I + +G+PP + +  + DTGSD+ W +C+PC   QC  Q  
Sbjct: 123 HAMTVPTTLGTSLDTLEYVITVRLGSPPGKSQTMLIDTGSDISWVRCKPCW-QQCRPQVD 181

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGV-NCQYSVSYGDGSF-SNGNLATET 186
           PLFDP +SSTY    CSS+ CA L    N   CS    CQY   YGDGS  + G  +++T
Sbjct: 182 PLFDPSLSSTYSPFSCSSAACAQLFQEGNANGCSSSGQCQYIAMYGDGSVGTTGTYSSDT 241

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGL----------------FNSKTTGIVG------- 223
           + LGS +   V +    FGC     G+                  S+T G  G       
Sbjct: 242 LALGSNS-NTVVVSKFRFGCSHAETGITGLTAGLMGLGGGAQSLVSQTAGTFGTTAFSYC 300

Query: 224 ------------LGGGDISLISQMRTTI------------------AGNQRLGVST---- 249
                       LG    S    ++T +                   G ++L + T    
Sbjct: 301 LPPTPSSSGFLTLGAAGTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVFS 360

Query: 250 PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV--P 304
             +++DSGT +T LP    S+L S   + ++  P A  +   G L+ C+  +  S V  P
Sbjct: 361 AGMIMDSGTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMP 420

Query: 305 EVTIHFRGAD---VKLSRSNFFVKV-SEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYD 358
            V + F GA    V L  S   +++ +  I C  F   ++  S  I GN+ Q  F V YD
Sbjct: 421 TVALVFSGAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYD 480

Query: 359 IEQQTVSFKPTDC 371
           +    V FK   C
Sbjct: 481 VAGGAVGFKAGAC 493


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  118 bits (295), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 61/154 (39%), Positives = 81/154 (52%), Gaps = 3/154 (1%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             YL  + +GTP      + DTGSDL W QC PC    CY Q+  LF P  S+++  L C
Sbjct: 1   GEYLATVRLGTPERVFSVIVDTGSDLTWVQCSPC--GTCYSQNDSLFIPNTSTSFTKLAC 58

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
            +  C  L    C+   C Y  SYGDGS S G+   +T+T+    GQ   +P   FGCG 
Sbjct: 59  GTELCNGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGH 118

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
           +N G F +   GI+GLG G +S  SQ++T   G 
Sbjct: 119 DNEGSF-AGADGILGLGQGPLSFPSQLKTVFNGK 151



 Score = 60.8 bits (146), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 48/154 (31%), Positives = 75/154 (48%), Gaps = 11/154 (7%)

Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM-SSMIEAQPV 284
           GG +  IS     I    R G      + DSGTT+T L    +  +L+ M +S ++    
Sbjct: 207 GGKLLNISSTAFDIDSVGRAGT-----IFDSGTTVTQLAGEVHQEVLAAMNASTMDYPRK 261

Query: 285 ADPTGSLELC---YSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN- 340
           +D +  L+LC   ++   L  VP +T HF G D++L  SN+F+ + E      F  +++ 
Sbjct: 262 SDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDMELPPSNYFIFL-ESSQSYCFSMVSSP 320

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTKQ 374
            V I G+I Q NF V YD   + + F P  C  +
Sbjct: 321 DVTIIGSIQQQNFQVYYDTVGRKIGFVPKSCVGR 354


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 148/363 (40%), Gaps = 81/363 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  YL+ +++GTPP    A+ DTGSDLIWTQC PC  + C  Q  P+F P  SS+Y+ +
Sbjct: 100 GDLEYLVDLAVGTPPQPVSALLDTGSDLIWTQCAPC--ASCLPQPDPIFSPGASSSYEPM 157

Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VALPG 201
            C+   C  +   SC   + C Y  SYGDG+ + G  ATE  T  S++       ++ P 
Sbjct: 158 RCAGELCNDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAP- 216

Query: 202 ITFGCGT------NNGG------------------------------------LFNSKTT 219
           + FGCGT      NNG                                     LF S   
Sbjct: 217 LGFGCGTMNKGSLNNGSGIVGFGRAPLSLVSQLAIRRFSYCLTPYASGRKSTLLFGSLRG 276

Query: 220 GIVGLGGGDISLISQMRT-------------TIAGNQRLGVS------TPD----IVIDS 256
           G+       +     +R+                G +RL +        PD     ++DS
Sbjct: 277 GVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAIVDS 336

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-------VPEVTIH 309
           GT LT  P    + ++    S +     A+ +   +    F + +        VP +  H
Sbjct: 337 GTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPDDGVCFAAAASRVPRPAVVPRMVFH 396

Query: 310 FRGADVKLSRSNFFVK-VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
            +GAD+ L R N+ +    +  +C +     +S    GN +Q +  V YD+E  T+SF P
Sbjct: 397 LQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTLSFAP 456

Query: 369 TDC 371
             C
Sbjct: 457 AQC 459


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 64/178 (35%), Positives = 94/178 (52%), Gaps = 22/178 (12%)

Query: 49  TPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-----------IIPNNANYLIRISI 97
           T  Q L + L R   R+      + ++  K  +A            ++  +  Y +R+ +
Sbjct: 1   THEQLLLETLQRDERRVRWIESKAKLAGKKKDEASSTDLNGPVTSGLLYGSGEYFVRLGL 60

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GTP      V DTGSDL W QC+PC    CY Q  P+FDP+ SS+++ +PC S  C +L 
Sbjct: 61  GTPARSLFMVVDTGSDLPWLQCQPC--KSCYKQADPIFDPRNSSSFQRIPCLSPLCKALE 118

Query: 158 QKSCSG-----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
             SCSG       C Y V+YGDGSFS G+ +++  TLG T  +A++   + FGCG +N
Sbjct: 119 VHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTLG-TGSKAMS---VAFGCGFDN 172



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/124 (27%), Positives = 64/124 (51%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T  P    + +     +     P A      + CY+F+  +   VP + +H
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAFRNATINLPSAPRYSLFDTCYNFSGKASVDVPALVLH 343

Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GAD++L  +N+ + + +    C  F   +  + I GNI Q +F +G+D+++  ++F 
Sbjct: 344 FENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKSHLAFA 403

Query: 368 PTDC 371
           P  C
Sbjct: 404 PQQC 407


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 89/331 (26%), Positives = 133/331 (40%), Gaps = 72/331 (21%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
           V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y ++ C S +C  L+  +C     
Sbjct: 2   VLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATG 59

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF---------- 214
            C Y V+YGDGS++ G+ ATET+TLG +T     +  +  GCG +N GLF          
Sbjct: 60  ACLYEVAYGDGSYTVGDFATETLTLGDST----PVGNVAIGCGHDNEGLFVGAAGLLALG 115

Query: 215 ----------------------NSKTTGIVGLGGGDIS-------LISQMRTTI------ 239
                                 +S     +  G G          L+   RT+       
Sbjct: 116 GGPLSFPSQISASTFSYCLVDRDSPAASTLQFGDGAAEAGTVTAPLVRSPRTSTFYYVAL 175

Query: 240 ----AGNQRLGVSTPDIVID-----------SGTTLTFLPQGYNSNLLSVMSSMIEAQPV 284
                G Q L +      +D           SGT +T L     + L         + P 
Sbjct: 176 SGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALRDAFVQGAPSLPR 235

Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHFRGAD-VKLSRSNFFVKVS-EDIVCSVFKGITN 340
                  + CY  +  +  +VP V++ F G   ++L   N+ + V      C  F     
Sbjct: 236 TSGVSLFDTCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYLIPVDGAGTYCLAFAPTNA 295

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +V I GN+ Q    V +D  +  V F P  C
Sbjct: 296 AVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score =  117 bits (294), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 81/225 (36%), Positives = 114/225 (50%), Gaps = 24/225 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 61  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 120

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 121 SVPTTLGSSLDTLEYVISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 180

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G  +++ +TL 
Sbjct: 181 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 239

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ 234
             +G  V + G  FGC     G   + KT G++GLGG   SL+SQ
Sbjct: 240 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSLVSQ 281



 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 62/140 (44%), Gaps = 14/140 (10%)

Query: 241 GNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
           G ++LG+S P +     ++DSGT +T LP    + L S   + +     A+P G L+ C+
Sbjct: 345 GGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAEPLGILDTCF 403

Query: 296 SFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQT 351
           +F  L +V  P V + F G  V    ++  V       C  F    +  +    GN+ Q 
Sbjct: 404 NFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAFGTIGNVQQR 459

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
            F V YD+      F+   C
Sbjct: 460 TFEVLYDVGGGVFGFRAGAC 479


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 85/247 (34%), Positives = 123/247 (49%), Gaps = 40/247 (16%)

Query: 18  VVSPIEAQTGGFSVELIHRD-------SPKSPFYNSSETPYQRLRDALTRSLN-RLNHFN 69
            + P   Q+GG     IH         +P+ P   S    +    DA  ++LN RL    
Sbjct: 28  ALGPRVNQSGGVVQMTIHHVHGPGSSLAPQPPVSFSDVLAWD---DARVKTLNSRLTR-- 82

Query: 70  QNSSISSSKASQADI-------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLI 115
           +++    S  ++ DI       +P N        NY +++  G+P      + DTGS L 
Sbjct: 83  KDTRFPKSVLTKKDIRFPKSVSVPLNPGASIGSGNYYVKVGFGSPARYYSMIVDTGSSLS 142

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC--SGVNCQY 168
           W QC+PC    C++Q  PLFDP  S TYKSL C+SSQC     A+LN   C  S   C Y
Sbjct: 143 WLQCKPCV-VYCHVQADPLFDPSASKTYKSLSCTSSQCSSLVDATLNNPLCETSSNVCVY 201

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
           + SYGD S+S G L+ + +TL  +      LPG  +GCG ++ GLF  +  GI+GLG   
Sbjct: 202 TASYGDSSYSMGYLSQDLLTLAPSQ----TLPGFVYGCGQDSDGLFG-RAAGILGLGRNK 256

Query: 229 ISLISQM 235
           +S++ Q+
Sbjct: 257 LSMLGQV 263


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  117 bits (293), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 72/197 (36%), Positives = 99/197 (50%), Gaps = 23/197 (11%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISS---------SKA 79
           FSV+L H D+     +NS  TP       L R   R+   +  +  +          S +
Sbjct: 60  FSVQLHHVDALS---FNS--TPETLFTTRLQRDAARVEAISYLAETAGTGKRVGTGFSSS 114

Query: 80  SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
             + +   +  Y  RI +GTPP     V DTGSD++W QC PC   +CY Q  P+FDP+ 
Sbjct: 115 VISGLAQGSGEYFTRIGVGTPPRYVYMVLDTGSDIVWIQCAPC--KRCYAQSDPVFDPRK 172

Query: 140 SSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           S ++ S+ C S  C  L+   C+     C Y VSYGDGSF+ G+ +TET+T   T    V
Sbjct: 173 SRSFASIACRSPLCHRLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFRRTRVARV 232

Query: 198 ALPGITFGCGTNNGGLF 214
           AL     GCG +N GLF
Sbjct: 233 AL-----GCGHDNEGLF 244



 Score = 63.2 bits (152), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 63/128 (49%), Gaps = 13/128 (10%)

Query: 252 IVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSLSQV--P 304
           ++IDSGT++T L +     +     +  S++  A     P  SL + C+  +  ++V  P
Sbjct: 346 VIIDSGTSVTRLTRPAYIAFRDAFRAGASNLKRA-----PQFSLFDTCFDLSGKTEVKVP 400

Query: 305 EVTIHFRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQT 363
            V +HFRGADV L  SN+ + V +    C  F G    + I GNI Q  F V YD+    
Sbjct: 401 TVVLHFRGADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSR 460

Query: 364 VSFKPTDC 371
           V F P  C
Sbjct: 461 VGFAPHGC 468


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 114/223 (51%), Gaps = 23/223 (10%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQA 82
             + L+HRDS  +    ++E   +RL+    R+   ++    N +      +S+ +   A
Sbjct: 64  LHIHLLHRDS-FAVNATAAELLARRLQRDELRAAWIISKAAANGTPPPVVGLSTGRGLVA 122

Query: 83  DII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKM 139
            ++   P +  Y+ +I++GTP  + L   DT SDL W QC+PC   +CY Q  P+FDP+ 
Sbjct: 123 PVVSRAPTSGEYMAKIAVGTPAVQALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPRH 180

Query: 140 SSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG----SFSNGNLATETVTLGST 192
           S++Y  +   +  C +L +          C Y+V YGDG    S S G+L  ET+T    
Sbjct: 181 STSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAGG 240

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             QA     ++ GCG +N GLF +   GI+GLG G IS+  Q+
Sbjct: 241 VRQAY----LSIGCGHDNKGLFGAPAAGILGLGRGQISIPHQI 279



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/128 (33%), Positives = 68/128 (53%), Gaps = 8/128 (6%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS--QVPEV 306
           +++DSGTT+T L +  Y +   +  ++      V+   P+G  + CY+    +  +VP V
Sbjct: 367 VILDSGTTVTRLARPAYVAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAGVKVPAV 426

Query: 307 TIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
           ++HF G  +V L   N+ + V S   VC  F G  + SV + GNI+Q  F V YD+  Q 
Sbjct: 427 SMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDLAGQR 486

Query: 364 VSFKPTDC 371
           V F P +C
Sbjct: 487 VGFAPNNC 494


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 155/367 (42%), Gaps = 86/367 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C++Q+ P +DPK SS++K++ C
Sbjct: 190 GEYFMDVFIGTPPRHFSLILDTGSDLNWIQCVPC--YDCFVQNGPYYDPKESSSFKNIGC 247

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++     + C   N  C Y   YGD S + G+ A ET T+  T+     +   
Sbjct: 248 HDPRCHLVSSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKR 307

Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
           +  + FGCG  N GL                F+S+   + G                   
Sbjct: 308 VENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 367

Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGVS------TPD-- 251
            + G D  L++                       Q+++ + G + L +       +P+  
Sbjct: 368 LIFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGA 427

Query: 252 --IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVT 307
              ++DSGTTL++  +     +       ++  PV      L+ CY+ + +   ++PE  
Sbjct: 428 GGTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPVIKDFPILDPCYNVSGVEKMELPEFR 487

Query: 308 IHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTV 364
           I F  GA       N+F+K+  E+IVC    G   S + I GN  Q NF + YD ++  +
Sbjct: 488 ILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSALSIIGNYQQQNFHILYDTKKSRL 547

Query: 365 SFKPTDC 371
            + P  C
Sbjct: 548 GYAPMKC 554


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/191 (38%), Positives = 102/191 (53%), Gaps = 14/191 (7%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISS-SKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           + +R    RS  R      +S+ +  S  +  D +P    YL+ ++IGTPP       DT
Sbjct: 52  ELMRRMALRSKARAPRLLSSSATAPVSPGAYDDGVPMT-EYLLHLAIGTPPQPVQLTLDT 110

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN----- 165
           GS L+WTQC+PC  + C+ Q  P +D   SST+    C S+QC  L+      VN     
Sbjct: 111 GSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCDSTQC-KLDPSVTMCVNQTVQT 167

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C YS SYGD S + G L  ETV+  +      ++PG+ FGCG NN G+F S  TGI G G
Sbjct: 168 CAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVFGCGLNNTGIFRSNETGIAGFG 223

Query: 226 GGDISLISQMR 236
            G +SL SQ++
Sbjct: 224 RGPLSLPSQLK 234



 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
           +IDSGT  T LP      +    ++ ++   V ++ TG L LC+S   L +   VP++ +
Sbjct: 315 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 373

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF GA + L R N+  +  +   CS+    I   + I GN  Q N  V YD++   +SF 
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433

Query: 368 PTDCTK 373
              C K
Sbjct: 434 RAKCDK 439


>gi|357127507|ref|XP_003565421.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 438

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 77/224 (34%), Positives = 113/224 (50%), Gaps = 37/224 (16%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ------ 81
           GFSVE IHRDS KS F++ + TP  RLR A  RS+ R  H  + ++ +++  +       
Sbjct: 3   GFSVEFIHRDSVKSLFHDPTLTPEARLRQAARRSMARHAHAARINNSAAAAGASGSDDSD 62

Query: 82  ----ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
               + ++P N  YL+ + + TPP   LA+ADTGS L+W +C+            P    
Sbjct: 63  ADVVSPMVPQNFEYLMALDVSTPPVRMLALADTGSSLVWLKCK-----------LPAAHT 111

Query: 138 KMSSTYKSLPCSSSQCASL-NQKSC----SGVN-CQYSVSYGDGSFSNGNLATETVTLGS 191
             SS+Y  LPC +  C +L +  SC    SG N C Y  ++ DGS + G +  +  T  +
Sbjct: 112 PASSSYARLPCDAFACKALGDAASCRATGSGNNICVYRYAFADGSCTAGPVTVDAFTFST 171

Query: 192 TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                     + FGC T   GL +    G+VGL  G ISL+SQ+
Sbjct: 172 R---------LDFGCATRTEGL-SVPDDGLVGLANGPISLVSQL 205



 Score = 63.5 bits (153), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 12/132 (9%)

Query: 245 LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--- 301
           L  +T  +++DSGT LT+LP+     L++ +++ I+   V  P     +CY     +   
Sbjct: 276 LQTTTTKLIVDSGTMLTYLPKAVLDPLVAALTAAIKLPRVKSPETLYAVCYDVRRRAPED 335

Query: 302 ---QVPEVTIHF-RGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFL 354
               +P+VT+    G +V+L   N F V+     VC     + + +P  I GN+ Q N  
Sbjct: 336 VGKSIPDVTLVLGGGGEVRLPWGNTFVVENKGTTVCLAL--VESHLPEFILGNVAQQNLH 393

Query: 355 VGYDIEQQTVSF 366
           VG+D+E++TVSF
Sbjct: 394 VGFDLERRTVSF 405


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 152/368 (41%), Gaps = 87/368 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y I + +GTPP     + DTGSDL W QC+PC    C+ Q+ P ++P  SS+Y+++ C  
Sbjct: 170 YFIDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGPHYNPNESSSYRNISCYD 227

Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
            +C  ++     + C   N  C Y   Y DGS + G+ A ET T+  T      +   + 
Sbjct: 228 PRCQLVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVV 287

Query: 201 GITFGCGTNNGGL----------------FNSKTTGIVG--------------------L 224
            + FGCG  N G                 F S+   I G                    +
Sbjct: 288 DVMFGCGHWNKGFFHGAGGLLGLGRGPLSFPSQLQSIYGHSFSYCLTDLFSNTSVSSKLI 347

Query: 225 GGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI--------- 252
            G D  L++                       Q+++ + G + L +              
Sbjct: 348 FGEDKELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPEKTWHWSSEGVGG 407

Query: 253 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
            +IDSG+TLTF P      +       I+ Q +A     +  CY+ +   QV  P+  IH
Sbjct: 408 TIIDSGSTLTFFPDSAYDVIKEAFEKKIKLQQIAADDFIMSPCYNVSGAMQVELPDYGIH 467

Query: 310 F-RGADVKLSRSNFFVKVSED-IVC-SVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVS 365
           F  GA       N+F +   D ++C ++ K   +S + I GN++Q NF + YD+++  + 
Sbjct: 468 FADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQQNFHILYDVKRSRLG 527

Query: 366 FKPTDCTK 373
           + P  C +
Sbjct: 528 YSPRRCAE 535


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/186 (39%), Positives = 99/186 (53%), Gaps = 20/186 (10%)

Query: 34  IHRDSPKSPFYNSSETPYQRLRDALTRS-LNRLNHFNQNSSIS---SSKASQADIIPNNA 89
           +HRDS +     +  T  Q + + +++S L  L    Q   +S   SS  SQ      + 
Sbjct: 106 LHRDSSR---VQAITTRLQLILNGVSKSDLKPLQTEIQPQDLSTPVSSGTSQG-----SG 157

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y  R+ +G P      V DTGSD+ W QC+PC  S CY Q  P+F P  SS+Y  L C 
Sbjct: 158 EYFTRVGVGNPAKSYYMVLDTGSDINWIQCQPC--SDCYQQSDPIFTPAASSSYSPLTCD 215

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGT 208
           S QC SL   SC    C+Y V+YGDGSF+ G+  TET++  GS T  ++AL     GCG 
Sbjct: 216 SQQCNSLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSIAL-----GCGH 270

Query: 209 NNGGLF 214
           +N GLF
Sbjct: 271 DNEGLF 276



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 54/125 (43%), Gaps = 6/125 (4%)

Query: 252 IVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTI 308
           +++D GT +T L  + YNS L     SM             + CY  +  S  +VP V+ 
Sbjct: 372 VIVDCGTAITRLQSEAYNS-LRDSFVSMSRHLRSTSGVALFDTCYDLSGQSSVKVPTVSF 430

Query: 309 HFRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G     L  +N+ + V S    C  F   T+S+ I GN+ Q    V +D+    V F
Sbjct: 431 HFDGGKSWDLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGF 490

Query: 367 KPTDC 371
               C
Sbjct: 491 STNKC 495


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 118/395 (29%), Positives = 165/395 (41%), Gaps = 88/395 (22%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADI-----IPNNA-NYLIRISIGTPPTERLAV 107
           L+D L R  +    F+  ++ S  K  QADI     IP  A NYL+++++GTP       
Sbjct: 3   LQDQL-RVKSMHARFSNKNAGSHFKEMQADIPVQSGIPLGAGNYLVKMALGTPKLSLSLA 61

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA----SLNQKSCSG 163
            DTGSD+ WTQCEPC  S CY Q    FDP+ SS+YK++ CSSS C     S   + C  
Sbjct: 62  LDTGSDITWTQCEPCVGS-CYRQAQTKFDPRKSSSYKNVSCSSSSCRIITDSGGARGCVS 120

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL---------- 213
             C Y V YGDGS+S G  ATE +T+  +      +    FGCG  N G           
Sbjct: 121 STCIYKVQYGDGSYSVGFFATEKLTISPSD----VISNFLFGCGQQNAGRFGRIAGLLGL 176

Query: 214 -------------------------FNSKTTGIVGLGGGDISLIS--------------- 233
                                    F+S +TG + LGG     +                
Sbjct: 177 GRGKLSLALQTSEKYNNLFTYCLPSFSSSSTGHLTLGGQVPKSVKFTPLSPAFKNTPFYG 236

Query: 234 -QMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
             ++    G   L +     S    +IDSGT +T L     S L S    +++  P  D 
Sbjct: 237 IDIKGLSVGGHVLPIDASVFSNAGAIIDSGTVITRLQPTVYSALSSKFQQLMKDYPKTDG 296

Query: 288 TGSLELCYSF--NSLSQVPEVTIHFRGA---DVKLSRSNFF----VKVSEDIVCSVF--K 336
              L+ CY F  N    VP ++  F+G    D+K     FF    V  + D VC  F   
Sbjct: 297 FSILDTCYDFSGNESISVPRISFFFKGGVEVDIK-----FFGILTVINAWDKVCLAFAPN 351

Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                  ++GN  Q  + V +D+ +  + F P+ C
Sbjct: 352 DDDGDFVVFGNSQQQTYDVVHDLAKGRIGFAPSGC 386


>gi|296085638|emb|CBI29432.3| unnamed protein product [Vitis vinifera]
          Length = 337

 Score =  116 bits (291), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 92/160 (57%), Gaps = 13/160 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY +++  G+P      + DTGS L W QC+PC    C++Q  PLFDP  S TYKSL 
Sbjct: 115 SGNYYVKVGFGSPARYYSMIVDTGSSLSWLQCKPCV-VYCHVQADPLFDPSASKTYKSLS 173

Query: 148 CSSSQC-----ASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+SSQC     A+LN   C  S   C Y+ SYGD S+S G L+ + +TL  +      LP
Sbjct: 174 CTSSQCSSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTLAPSQ----TLP 229

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           G  +GCG ++ GLF  +  GI+GLG   +S++ Q+ +   
Sbjct: 230 GFVYGCGQDSDGLFG-RAAGILGLGRNKLSMLGQVSSKFG 268


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/155 (43%), Positives = 93/155 (60%), Gaps = 13/155 (8%)

Query: 84  IIPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           ++ N+A  Y + +SIGTPP     +ADTGS LIWTQC PC  ++C  + +P F P  SST
Sbjct: 82  LLDNSAGAYNMNLSIGTPPVTFSVLADTGSSLIWTQCAPC--TECAARPAPPFQPASSST 139

Query: 143 YKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           +  LPC+SS C  L     +C+   C Y   YG G F+ G LATET+ +G       + P
Sbjct: 140 FSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLATETLHVG-----GASFP 193

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           G+ FGC T NG    + ++GIVGLG   +SL+SQ+
Sbjct: 194 GVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 226



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/137 (29%), Positives = 65/137 (47%), Gaps = 18/137 (13%)

Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSL---SQVPE 305
           ++DSGTTLT+L  +GY     + +S M  A       G+    +LC+   +    S VP 
Sbjct: 312 IVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPV 371

Query: 306 VTIHFR---GADVKLSRSNFFVKVSED------IVCSVFKGITN--SVPIYGNIMQTNFL 354
            T+  R   GA+  + R ++   V+ D      + C +    +   S+ I GN+MQ +  
Sbjct: 372 PTLVLRFAGGAEYAVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLH 431

Query: 355 VGYDIEQQTVSFKPTDC 371
           V YD++    SF P DC
Sbjct: 432 VLYDLDGGMFSFAPADC 448


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/127 (48%), Positives = 79/127 (62%), Gaps = 3/127 (2%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
           DTGSDLIWTQC PC    C  Q +P FD K S+TY++LPC SS+CASL+  SC    C Y
Sbjct: 2   DTGSDLIWTQCAPC--LLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVY 59

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
              YGD + + G LA ET T G+     V    I FGCG+ N G   + ++G+VG G G 
Sbjct: 60  QYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDL-ANSSGMVGFGRGP 118

Query: 229 ISLISQM 235
           +SL+SQ+
Sbjct: 119 LSLVSQL 125



 Score = 51.6 bits (122), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 55/124 (44%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF----NSLSQVPEVT 307
           ++IDSGT++T+L Q     +   + S I    + D    L+ C+ +    N    VP++ 
Sbjct: 210 VIIDSGTSITWLQQDAYEAVRRGLVSAIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLV 269

Query: 308 IHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            HF  A++ L   N+ +  S      +    T    I GN  Q N  + YDI    +SF 
Sbjct: 270 FHFDSANMTLLPENYMLIASTTGYLCLVMAPTGVGTIIGNYQQQNLHLLYDIGNSFLSFV 329

Query: 368 PTDC 371
           P  C
Sbjct: 330 PAPC 333


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 112/423 (26%), Positives = 172/423 (40%), Gaps = 101/423 (23%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  RL +       S+ S+   +A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIQHSAARLANIQARIEGSLVSNNDYKARVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               +  ISIG PP  +L V DTGSD++W  C PC  + C      LFDP  SST+  L 
Sbjct: 98  GRTIMANISIGQPPIPQLVVMDTGSDILWVMCTPC--TNCDNDLGLLFDPSKSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C       C  +   ++V+Y D S ++G    +TV   +T      +  + F
Sbjct: 156 KTPCDFEGC------RCDPI--PFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLF 207

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQ------------------------------ 234
           GCG N G   +    GI+GL  G  SL+++                              
Sbjct: 208 GCGHNIGHDTDPGHNGILGLNNGPDSLVTKLGQKFSYCIGNLADPYYNYHQLILGEGADL 267

Query: 235 -----------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQGY 267
                            M     G +RL ++             ++ID+G+T+TFL    
Sbjct: 268 EGYSTPFEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLVDSV 327

Query: 268 NS-------NLL--SVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTIHF-RGA 313
           +        NLL  S   + IE  P          C+ + S+S+     P VT HF  GA
Sbjct: 328 HKLLSKEVRNLLGWSFRQATIEKSPWMQ-------CF-YGSISRDLVGFPVVTFHFSDGA 379

Query: 314 DVKLSRSNFFVKVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           D+ L   +FF ++++++ C          I +   + G + Q ++ VGYD+  Q V F+ 
Sbjct: 380 DLALDSGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQR 439

Query: 369 TDC 371
            DC
Sbjct: 440 IDC 442


>gi|226509408|ref|NP_001141440.1| uncharacterized protein LOC100273550 precursor [Zea mays]
 gi|194704586|gb|ACF86377.1| unknown [Zea mays]
 gi|413938617|gb|AFW73168.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 478

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 107/352 (30%), Positives = 142/352 (40%), Gaps = 82/352 (23%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
               CA L      +CS   C Y VSYGDGS + G  +++T+TL +++    A+ G  FG
Sbjct: 199 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 254

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD 251
           CG    GLFN    G++GLG    SL+ Q               + + AG   LGV  P 
Sbjct: 255 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 313

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------------------- 280
                 +T   LP         VM + I                                
Sbjct: 314 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 373

Query: 281 ----------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 321
                             P A   G L+ CY+F     V  P V + F  GA V L    
Sbjct: 374 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 433

Query: 322 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                     C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 434 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 478


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 102/197 (51%), Gaps = 20/197 (10%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIPN-------NANYLIRISIGTPPTERLAV 107
           R+ L R   R     +++ + S +A+ A + P        +  YL+ ++IGTPP     +
Sbjct: 70  RELLHRMAARSK--ARSARLLSGRAASARVDPGSYTDGVPDTEYLVHMAIGTPPQPVQLI 127

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------ 161
            DTGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      
Sbjct: 128 LDTGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWG 185

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTT 219
           +G+ C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  T
Sbjct: 186 NGI-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNET 244

Query: 220 GIVGLGGGDISLISQMR 236
           GI G   G +S+ +Q++
Sbjct: 245 GIAGFSRGALSMPAQLK 261



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
           ++DSGT +T LP+   + +     +  +   V + T SL +LC+S    +   VP + +H
Sbjct: 348 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 406

Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           F GA + L R N+  ++ E     + C         + + GN  Q N  V YD+    +S
Sbjct: 407 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 465

Query: 366 FKPTDCTK 373
           F P  C K
Sbjct: 466 FVPARCNK 473


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/152 (43%), Positives = 87/152 (57%), Gaps = 12/152 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGS L+WTQC+PC  + C+ Q  P +D   SST+    C 
Sbjct: 34  EYLLHLAIGTPPQPVQLTLDTGSVLVWTQCQPC--AVCFNQSLPYYDASRSSTFALPSCD 91

Query: 150 SSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           S+QC  L+      VN     C YS SYGD S + G L  ETV+  +      ++PG+ F
Sbjct: 92  STQC-KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVA----GASVPGVVF 146

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           GCG NN G+F S  TGI G G G +SL SQ++
Sbjct: 147 GCGLNNTGIFRSNETGIAGFGRGPLSLPSQLK 178



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 40/126 (31%), Positives = 62/126 (49%), Gaps = 6/126 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV-ADPTGSLELCYSFNSLSQ---VPEVTI 308
           +IDSGT  T LP      +    ++ ++   V ++ TG L LC+S   L +   VP++ +
Sbjct: 259 IIDSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGPL-LCFSAPPLGKAPHVPKLVL 317

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF GA + L R N+  +  +   CS+    I   + I GN  Q N  V YD++   +SF 
Sbjct: 318 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 377

Query: 368 PTDCTK 373
              C K
Sbjct: 378 RAKCDK 383


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 14/195 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           + LR    RS  R       + +S      S  D +P+   YL+ ++IGTPP     + D
Sbjct: 71  ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 129

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
           TGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      +G
Sbjct: 130 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 187

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
           + C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  TGI
Sbjct: 188 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 246

Query: 222 VGLGGGDISLISQMR 236
            G   G +S+ +Q++
Sbjct: 247 AGFSRGALSMPAQLK 261



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
           ++DSGT +T LP+   + +     +  +   V + T SL +LC+S    +   VP + +H
Sbjct: 348 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 406

Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           F GA + L R N+  ++ E     + C         + + GN  Q N  V YD+    +S
Sbjct: 407 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 465

Query: 366 FKPTDCTK 373
           F P  C K
Sbjct: 466 FVPARCNK 473


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 60/146 (41%), Positives = 81/146 (55%), Gaps = 10/146 (6%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q    SSS  S   +   +  Y  R+ +GTPP     V DTGSD++W QC PC   +CY 
Sbjct: 128 QGGGFSSSVTS--GLAQGSGEYFTRLGVGTPPKYVYMVLDTGSDVVWIQCAPC--RKCYS 183

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q  P+FDPK S ++ S+ C S  C  L+   C S  +C Y V+YGDGSF+ G  +TET+T
Sbjct: 184 QTDPVFDPKKSGSFSSISCRSPLCLRLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLT 243

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF 214
                 +   +P +  GCG +N GLF
Sbjct: 244 F-----RGTRVPKVALGCGHDNEGLF 264



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/155 (27%), Positives = 71/155 (45%), Gaps = 12/155 (7%)

Query: 222 VGLGGGDISLI--SQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
           + +GG  ++ I  S  +   AGN         ++IDSGT++T L +    +L     +  
Sbjct: 341 ISVGGARVAGITASLFKLDTAGNG-------GVIIDSGTSVTRLTRRAYVSLRDAFRAGA 393

Query: 280 EAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSED-IVCSVFK 336
                A      + C+  +  ++V  P V +HFRGADV L  +N+ + V  + + C  F 
Sbjct: 394 ADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGADVSLPATNYLIPVDTNGVFCFAFA 453

Query: 337 GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           G  + + I GNI Q  F V +D+    + F    C
Sbjct: 454 GTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGC 488


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/195 (35%), Positives = 99/195 (50%), Gaps = 14/195 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           + LR    RS  R       + +S      S  D +P+   YL+ ++IGTPP     + D
Sbjct: 45  ELLRRMAARSKARSARLLSGRAASARMDPGSYTDGVPDT-EYLVHMAIGTPPQPVQLILD 103

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC------SG 163
           TGSDL WTQC PC    C+ Q  P F+P  S T+  LPC    C  L   SC      +G
Sbjct: 104 TGSDLTWTQCAPC--VSCFRQSLPRFNPSRSMTFSVLPCDLRICRDLTWSSCGEQSWGNG 161

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ--AVALPGITFGCGTNNGGLFNSKTTGI 221
           + C Y+ +Y D S + G+L ++T +  S        ++P +TFGCG  N G+F S  TGI
Sbjct: 162 I-CVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGCGLFNNGIFVSNETGI 220

Query: 222 VGLGGGDISLISQMR 236
            G   G +S+ +Q++
Sbjct: 221 AGFSRGALSMPAQLK 235



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIH 309
           ++DSGT +T LP+   + +     +  +   V + T SL +LC+S    +   VP + +H
Sbjct: 322 IVDSGTGMTMLPEAVYNLVCDAFVAQTKLT-VHNSTSSLSQLCFSVPPGAKPDVPALVLH 380

Query: 310 FRGADVKLSRSNFFVKVSE----DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           F GA + L R N+  ++ E     + C         + + GN  Q N  V YD+    +S
Sbjct: 381 FEGATLDLPRENYMFEIEEAGGIRLTCLAINA-GEDLSVIGNFQQQNMHVLYDLANDMLS 439

Query: 366 FKPTDCTK 373
           F P  C K
Sbjct: 440 FVPARCNK 447


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/221 (33%), Positives = 112/221 (50%), Gaps = 22/221 (9%)

Query: 30  SVELIHRDSPKSPFYNSSETPY------------QRLRDALTRSLNRLNHFNQNSSISSS 77
           S+E++H+  P S   +S +               +R++   +R    L   N+   + S+
Sbjct: 66  SLEVVHKHGPCSQLNHSGKAEATISHNDIMNLDNERVKYIQSRLSKNLGGENRVKELDST 125

Query: 78  KA-SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
              +++  +  +A+Y + + +GTP  +   + DTGS L WTQCEPC  S CY Q  P+FD
Sbjct: 126 TLPAKSGRLIGSADYYVVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGS-CYKQQDPIFD 184

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++ C+SS C       CS     +C Y V YGD S S G L+ E +T+ +T 
Sbjct: 185 PSKSSSYTNIKCTSSLCTQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD 244

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
                +    FGCG +N GLF   T G++GL    IS + Q
Sbjct: 245 ----IVHDFLFGCGQDNEGLFRG-TAGLMGLSRHPISFVQQ 280



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 54/124 (43%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGT +T LP    + L S     +   PVA  T  L+ CY F+   +  VP +   F
Sbjct: 358 IIDSGTVITRLPPTAYAALRSAFRQFMMKYPVAYGTRLLDTCYDFSGYKEISVPRIDFEF 417

Query: 311 RGA-DVKLSRSNFFVKVSEDIVCSVF--KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G   V+L         S   +C  F   G  N + I+GN+ Q    V YD+E   + F 
Sbjct: 418 AGGVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFG 477

Query: 368 PTDC 371
              C
Sbjct: 478 AAGC 481


>gi|413938607|gb|AFW73158.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 478

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 122/424 (28%), Positives = 164/424 (38%), Gaps = 96/424 (22%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++PC    CA L      +CS   C Y VSYGDGS + G  +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTI 239
               A+ G  FGCG    GLFN    G++GLG    SL+ Q               + + 
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301

Query: 240 AGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------- 280
           AG   LGV  P       +T   LP         VM + I                    
Sbjct: 302 AGYLTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361

Query: 281 ----------------------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
                                         P A   G L+ CY+F     V  P V + F
Sbjct: 362 DTGTVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421

Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L              C  F   G    + I GN+ Q +F V   I+  +V FK
Sbjct: 422 GSGATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474

Query: 368 PTDC 371
           P+ C
Sbjct: 475 PSSC 478


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 110/421 (26%), Positives = 165/421 (39%), Gaps = 84/421 (19%)

Query: 30  SVELIHRDSPKSPFYN-SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           S+ ++HR  P SP  +  S  P     + L R  +R++   +  + SS+K      +  N
Sbjct: 72  SLTVVHRHGPCSPLRSRGSGAPSHT--EILRRDQDRVDAIRRKVTASSNKPKGGVSLLAN 129

Query: 89  -------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
                   NY+  + +GTP TE +   DTGSD  W QC+PC  + CY Q  P+FDP  SS
Sbjct: 130 WGKSLSTTNYVASLRLGTPATELVVELDTGSDQSWVQCKPC--ADCYEQRDPVFDPTASS 187

Query: 142 TYKSLPCSSSQCASLNQKSCSGV-------NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           TY ++PC + +C  L   S S         NC Y VSY D S + G+LA +T+TL  +  
Sbjct: 188 TYSAVPCGARECQELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPS 247

Query: 195 QAVA--LPGITFGCGTNNGGLFNS----------------------------------KT 218
            + A  +PG  FGCG +N G F                                      
Sbjct: 248 PSPADTVPGFVFGCGHSNAGTFGEVDGLLGLGLGKASLPSQVAARYGAAFSYCLPSSPSA 307

Query: 219 TGIVGLGGGDISLISQMRTTIAGN----------------QRLGVSTPDIVIDSGTTL-- 260
            G +  GG      +Q    + G                 + + V        +GT +  
Sbjct: 308 AGYLSFGGAAARANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAAGTIIDS 367

Query: 261 -----TFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTIHFR- 311
                   P  Y +   S  S+M   +    P+  + + CY F  +   ++P V + F  
Sbjct: 368 GTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPIFDTCYDFTGHETVRIPAVELVFAD 427

Query: 312 GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           GA V L  S      + D+  +    + N  + I GN  Q    V YD+  Q + F    
Sbjct: 428 GATVHLHPSGVLYTWN-DVAQTCLAFVPNHDLGILGNTQQRTLAVIYDVGSQRIGFGRKG 486

Query: 371 C 371
           C
Sbjct: 487 C 487


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score =  115 bits (289), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 107/352 (30%), Positives = 142/352 (40%), Gaps = 82/352 (23%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 47  NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 106

Query: 149 SSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
               CA L      +CS   C Y VSYGDGS + G  +++T+TL +++    A+ G  FG
Sbjct: 107 GGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFG 162

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPD 251
           CG    GLFN    G++GLG    SL+ Q               + + AG   LGV  P 
Sbjct: 163 CGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPS 221

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIE------------------------------- 280
                 +T   LP         VM + I                                
Sbjct: 222 GAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPT 281

Query: 281 ----------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSN 321
                             P A   G L+ CY+F     V  P V + F  GA V L    
Sbjct: 282 AYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADG 341

Query: 322 FFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                     C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 342 IL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 386


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 113/224 (50%), Gaps = 25/224 (11%)

Query: 33  LIHRD----SPKSPFYNSSETPYQRLRDALTRSL-NRLNHF---NQNSSISSSKASQADI 84
           + HRD    S KS  +N        L D   RSL +R+      N   ++ S     + +
Sbjct: 1   MKHRDFCNSSGKSTDWNKKLQKSLILDDFRVRSLQSRIKSIFSGNNIDALDSQIPLSSGV 60

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                NY++ + IG        + DTGSDL W QC+PC    CY Q  PLF+P  S +Y+
Sbjct: 61  RLQTLNYIVTVEIGG--RNMTVIVDTGSDLTWVQCQPC--RLCYNQQDPLFNPSGSPSYQ 116

Query: 145 SLPCSSSQCASLNQKSCS-GV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           ++ C+SS C SL   + + GV       C Y V+YGDGS++ G+L  E + LG+T     
Sbjct: 117 TILCNSSTCQSLQYATGNLGVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNLGTT----- 171

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            +    FGCG NN GLF    +G++GLG  D+SL+SQ      G
Sbjct: 172 HVSNFIFGCGRNNKGLFGG-ASGLMGLGKSDLSLVSQTSAIFEG 214



 Score = 52.8 bits (125), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 36/128 (28%), Positives = 58/128 (45%), Gaps = 7/128 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T LP     +L +         P A P   L+ C++ N   +V  P + + 
Sbjct: 285 ILIDSGTVITRLPPPVYRDLKAEFLKQFSGFPSAPPFSILDTCFNLNGYDEVDIPTIRMQ 344

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F G     V ++   +FVK     VC     ++  + +PI GN  Q N  V Y+ ++  +
Sbjct: 345 FEGNAELTVDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKL 404

Query: 365 SFKPTDCT 372
            F    C+
Sbjct: 405 GFAAEACS 412


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 123/424 (29%), Positives = 172/424 (40%), Gaps = 96/424 (22%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           P  SS+Y ++PC    CA L      +CS   C Y VSYGDGS + G  +++T+TL +++
Sbjct: 187 PAQSSSYAAVPCGGPVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTLSASS 246

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTI 239
               A+ G  FGCG    GLFN    G++GLG    SL+ Q               + + 
Sbjct: 247 ----AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPST 301

Query: 240 AGNQRLGVSTPD---------------------IVIDSGTTL------------------ 260
           AG   LG+  P                      +V+ +G ++                  
Sbjct: 302 AGYLTLGLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVV 361

Query: 261 ------TFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFNSLSQV--PEVTIHF 310
                 T LP    + L S   S + +   P A   G L+ CY+F     V  P V + F
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTF 421

Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA V L              C  F   G    + I GN+ Q +F V   I+  +V FK
Sbjct: 422 GSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFK 474

Query: 368 PTDC 371
           P+ C
Sbjct: 475 PSSC 478


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/144 (43%), Positives = 78/144 (54%), Gaps = 4/144 (2%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y   + +GTP  +   V DTGSD+ W QC PC  + CY Q   LF+P  SS++K L C
Sbjct: 14  GEYFAVVGVGTPRRDMYLVVDTGSDITWLQCAPC--TNCYKQKDALFNPSSSSSFKVLDC 71

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA-VALPGITFGCG 207
           SSS C +L+   C    C Y   YGDGSF+ G L T+ V L    G   V L  I  GCG
Sbjct: 72  SSSLCLNLDVMGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCG 131

Query: 208 TNNGGLFNSKTTGIVGLGGGDISL 231
            +N G F +   GI+GLG G +S 
Sbjct: 132 HDNEGTFGT-AAGILGLGRGPLSF 154


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/354 (27%), Positives = 140/354 (39%), Gaps = 97/354 (27%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            YL+ ++IGTPP       DTGSDLIWTQC+PCP   C+ Q  P FDP  SST     C 
Sbjct: 88  EYLVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCP--ACFDQALPYFDPSTSSTLSLTSCD 145

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           S+ C  L   S                       ++  T     G   ++PG+ FGCG  
Sbjct: 146 STLCQGLPVASLP--------------------RSDKFTF---VGAGASVPGVAFGCGLF 182

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMR--------TTI---------------------- 239
           N G+F S  TGI G G G +SL SQ++        TTI                      
Sbjct: 183 NNGVFKSNETGIAGFGRGPLSLPSQLKVGNFSHCFTTITGAIPSTVLLDLPADLFSNGQG 242

Query: 240 -------------------------AGNQRLGVSTPDIVIDSG---------TTLTFLPQ 265
                                     G+ RL V   +  + +G         T +T LP 
Sbjct: 243 AVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALKNGTGGTIIDSGTAMTSLPT 302

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRSNFF 323
                +    ++ ++   V+  T     C S    +   VP++ +HF GA + L R N+ 
Sbjct: 303 RVYRLVRDAFAAQVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLPRENYV 362

Query: 324 VKVSE---DIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
            +V +    I+C ++ +G    V   GN  Q N  V YD++   +SF P  C K
Sbjct: 363 FEVEDAGSSILCLAIIEG--GEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQCDK 414


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 122/429 (28%), Positives = 178/429 (41%), Gaps = 80/429 (18%)

Query: 11  LFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQR---LRDALTRSLNRLNH 67
           L  + F +  P +  +  F++ L H  S K+      E+P  +   L    T + +RL+ 
Sbjct: 13  LLIILFALTCPKQCTSYRFTLRL-HTKSIKT-----KESPKIKPGYLHSKSTPAPSRLD- 65

Query: 68  FNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
            N  ++  +   S    IPN A +L  ISIG PP  +L + DTGSDL W QC PC   +C
Sbjct: 66  -NLWTTEIADIVSHVTPIPNPAAFLANISIGDPPVPQLLLIDTGSDLTWIQCLPC---KC 121

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
           Y Q  P F P  SSTY++  C S+  A   + +   +G NC+Y + Y D S + G LA E
Sbjct: 122 YPQTIPFFHPSRSSTYRNASCESAPHAMPQIFRDEKTG-NCRYHLRYRDFSNTRGILAKE 180

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGL--------------------FNSKTTGIVG-- 223
            +T  ++    ++ P I FGCG +N G                     F SK +   G  
Sbjct: 181 KLTFQTSDEGLISKPNIVFGCGQDNSGFTQYSGVLGLGPGTFSIVTRNFGSKFSYCFGSL 240

Query: 224 -----------LGGGD----------------------ISLISQMRTTIAGNQRLGVSTP 250
                      LG G                       ISL  ++     G  +   S  
Sbjct: 241 IDPTYPHNFLILGNGARIEGDPTPLQIFQDRYYLDLQAISLGEKLLDIEPGIFQRYRSKG 300

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGSLELCYSFN---SLSQVPE 305
             VID+G + T L +     L   +  ++    + V D       CY  N    L   P 
Sbjct: 301 GTVIDTGCSPTILAREAYETLSEEIDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPV 360

Query: 306 VTIHFRG-ADVKLSRSNFFVK-VSEDIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQ 362
           VT HF G A++ L   + FV   S D  C      T + + + G + Q N+ VGY++   
Sbjct: 361 VTFHFAGGAELALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTM 420

Query: 363 TVSFKPTDC 371
            V F+ TDC
Sbjct: 421 KVYFQRTDC 429


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 116/221 (52%), Gaps = 24/221 (10%)

Query: 35  HRDSPKSPFYNSSETPYQRL-------RDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           H+DS      + ++   +RL       R   +R  N +   N + S+ +     + I   
Sbjct: 3   HKDSCSGKILDWNKKLQKRLIMDNFQLRSLQSRIKNIILSGNIDDSVDTQIPLTSGIRLQ 62

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           + NY++ + +G    +   + DTGSDL W QC+PC  ++CY Q  P+F+P  S +Y+++ 
Sbjct: 63  SLNYIVTVELGG--RKMTVIVDTGSDLSWVQCQPC--NRCYNQQDPVFNPSKSPSYRTVL 118

Query: 148 CSSSQCASLNQKSC-SGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C+S  C SL   +  SGV       C Y V+YGDGS+++G +  E + LG+TT     + 
Sbjct: 119 CNSLTCRSLQLATGNSGVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNLGNTT-----VN 173

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
              FGCG  N GLF    +G+VGLG  D+SLISQ+     G
Sbjct: 174 NFIFGCGRKNQGLFGG-ASGLVGLGRTDLSLISQISPMFGG 213


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 14/195 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
           + +R  + +S  R+       NSS  SS A   D+     P+   Y++ IS+GTP     
Sbjct: 10  EAIRGLVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
           A+ADTGSDL+W Q EPC  + C      +FDP+ SST++ + CSS  C  L      G +
Sbjct: 70  AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCTELPGSCEPGSS 125

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  YG G  + G  A +T++LG+T+G +   P    GCG  N G       G+VGL
Sbjct: 126 ACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182

Query: 225 GGGDISLISQMRTTI 239
           G G +SL SQ+   I
Sbjct: 183 GQGPVSLTSQLSAAI 197



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 7/125 (5%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHF 310
           +IDSGTTLT++P G    +LS M SM+    V   +  L+LCY  S N   + P +TI  
Sbjct: 267 IIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL 326

Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
            GA +    SN+F+ V  S D VC +  G    +P  I GN+MQ  + + YD     +SF
Sbjct: 327 AGATMTPPSSNYFLVVDDSGDTVC-LAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSF 385

Query: 367 KPTDC 371
               C
Sbjct: 386 VQAKC 390


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 98/198 (49%), Gaps = 26/198 (13%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQAD------ 83
           V+L H D+      +S ETP       L R  +R+       +++ S+  ++A       
Sbjct: 80  VQLHHLDA-----LSSDETPQDLFNSRLARDASRVKSLTSLAAAVGSTNRTRARGPGFSS 134

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
                +   +  Y  R+ +GTP      V DTGSD++W QC PC   +CY Q  P+F+P 
Sbjct: 135 SVTSGLAQGSGEYFTRLGVGTPARYVFMVLDTGSDVVWIQCAPC--KKCYSQTDPVFNPT 192

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            S ++ ++PC S  C  L+   CS     C Y VSYGDGSF+ G  +TET+T   T    
Sbjct: 193 KSRSFANIPCGSPLCRRLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGR 252

Query: 197 VALPGITFGCGTNNGGLF 214
           VAL     GCG +N GLF
Sbjct: 253 VAL-----GCGHDNEGLF 265



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 40/123 (32%), Positives = 56/123 (45%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T L +     L             A      + C+  +  ++V  P V +H
Sbjct: 367 VIIDSGTSVTRLTRPAYVALRDAFRVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLH 426

Query: 310 FRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  SN+ + V      C  F G  + + I GNI Q  F V YD+    V F P
Sbjct: 427 FRGADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAP 486

Query: 369 TDC 371
             C
Sbjct: 487 RGC 489


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 79/130 (60%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ +G+P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y S+
Sbjct: 159 GSGEYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASV 216

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C + +C  L+  +C  S   C Y V+YGDGS++ G+ ATET+TLG +      +  +  
Sbjct: 217 ACDNPRCHDLDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTLGDS----APVSSVAI 272

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 273 GCGHDNEGLF 282



 Score = 48.5 bits (114), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 54/130 (41%), Gaps = 4/130 (3%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 303
           G     +++DSGT +T L     + L        ++ P        + CY  +  +  +V
Sbjct: 371 GTGAGGVIVDSGTAVTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFDTCYDLSDRTSVEV 430

Query: 304 PEVTIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           P V++ F  G +++L   N+ + V      C  F     +V I GN+ Q    V +D  +
Sbjct: 431 PAVSLRFAGGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAK 490

Query: 362 QTVSFKPTDC 371
            TV F    C
Sbjct: 491 STVGFTSNKC 500


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 79/210 (37%), Positives = 113/210 (53%), Gaps = 27/210 (12%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A  GGFSVE IHRDSP+SPF++ + T + R   A  RS+ R      ++S S+S    AD
Sbjct: 29  ASGGGFSVEFIHRDSPRSPFHDPAFTAHGRALAAARRSVARAAAIAGSASSSASGGGAAD 88

Query: 84  -----IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE---------PCPPSQCYM 129
                ++  +  YL+ +++G+PP   LA+ADTGSDL+W +C+           P +Q   
Sbjct: 89  DVVSKVVSRSFEYLMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQ--- 145

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
                FDP  SSTY  + C +  C +L + +C  G NC Y  +YGDGS + G L+TET T
Sbjct: 146 -----FDPSRSSTYGRVSCQTDACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFT 200

Query: 189 L----GSTTGQAVALPGITFGCGTNNGGLF 214
                   + + V + G+ FGC T   G F
Sbjct: 201 FDDGGSGRSPRQVRVGGVKFGCSTATAGSF 230



 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 49/128 (38%), Positives = 66/128 (51%), Gaps = 8/128 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS-----FNSLSQVPEV 306
           I++DSGTTLTFL       ++  +S  I   PV  P G L+LCY+       +   +P++
Sbjct: 323 IIVDSGTTLTFLDPSLLGPIVDELSRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPDL 382

Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQT 363
           T+ F  GA V L   N FV V E  +C      T   P  I GN+ Q N  VGYD++  T
Sbjct: 383 TLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDAGT 442

Query: 364 VSFKPTDC 371
           V+F   DC
Sbjct: 443 VTFAGADC 450


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score =  115 bits (287), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 113/225 (50%), Gaps = 24/225 (10%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD------ 83
           SV L HR  P SP   +S        + L R   R ++  +  S S+  A+  D      
Sbjct: 34  SVTLSHRYGPCSPADPNSGEKRPTDEELLRRDQLRADYIRRKFSGSNGTAAGEDGQSSKV 93

Query: 84  IIP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLF 135
            +P       +   Y+I + +G+P   +  V DTGSD+ W QCEPCP PS C+     LF
Sbjct: 94  SVPTTLGSSLDTLEYVISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALF 153

Query: 136 DPKMSSTYKSLPCSSSQCASL-NQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLG 190
           DP  SSTY +  CS++ CA L +    +G +    CQY V YGDGS + G  +++ +TL 
Sbjct: 154 DPAASSTYAAFNCSAAACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL- 212

Query: 191 STTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQ 234
             +G  V + G  FGC     G   + KT G++GLGG   S +SQ
Sbjct: 213 --SGSDV-VRGFQFGCSHAELGAGMDDKTDGLIGLGGDAQSPVSQ 254



 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 37/136 (27%), Positives = 60/136 (44%), Gaps = 14/136 (10%)

Query: 232 ISQMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
            + +     G ++LG+S P +     ++DSGT +T LP    + L S   + +     A+
Sbjct: 309 FAALEDIAVGGKKLGLS-PSVFAAGSLVDSGTVITRLPPAAYAALSSAFRAGMTRYARAE 367

Query: 287 PTGSLELCYSFNSLSQV--PEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITN--SV 342
           P G L+ C++F  L +V  P V + F G  V    ++  V       C  F    +  + 
Sbjct: 368 PLGILDTCFNFTGLDKVSIPTVALVFAGGAVVDLDAHGIVSGG----CLAFAPTRDDKAF 423

Query: 343 PIYGNIMQTNFLVGYD 358
              GN+ Q  F V YD
Sbjct: 424 GTIGNVQQRTFEVLYD 439


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 94/343 (27%), Positives = 145/343 (42%), Gaps = 78/343 (22%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIA--------------GNQRLG------------V 247
           F+    G + LGGG  SL+ Q  T                 G   LG            V
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTATRYGRVFSYCLPPTASSLGFLVLGVPPERAQLIPSFV 337

Query: 248 STPDI---------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--------- 283
           STP +               +I +G  L   P  ++++ +   S++I   P         
Sbjct: 338 STPLLSSSMAPTFYRVLLRAIIVAGRPLAVPPAVFSASSVIDSSTIISRLPPTAYQALRA 397

Query: 284 ----------VADPTGSLELCYSFNSLSQV--PEVTIHFR-GADVKLSRSNFFVKVSEDI 330
                      A P   L+ CY F  +  +  P + + F  GA V L  +   +      
Sbjct: 398 AFRSAMTMYRAAPPVSILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLG----- 452

Query: 331 VCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 371
            C  F    ++ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 453 SCLAFAPTASDRMPGFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score =  114 bits (286), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 109/439 (24%), Positives = 181/439 (41%), Gaps = 82/439 (18%)

Query: 10  ILFFLCF-YVVSP---IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRL 65
           +L  +CF ++ SP     + + GFS  LIH  SP SP+ N       +   AL  +L+R 
Sbjct: 7   LLLIICFTFIFSPCISAASDSKGFSTNLIHIHSPSSPYKNVKAESLAK-DTALESTLSRH 65

Query: 66  NHFN--QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
            +    Q  ++  +      +I + + +L  +SIG PPT    V DTGSDL W QCEPC 
Sbjct: 66  AYLRARQQKALQPADFVPPPLIRDKSAFLANLSIGNPPTNVYVVLDTGSDLFWIQCEPC- 124

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-SCSGV-NCQYSVSYGDGSFSNGN 181
              CY Q  P+++   S +Y  + C+   C SL ++  CS   +C Y  +Y DG+ ++G 
Sbjct: 125 -DVCYKQKDPIYNRTKSDSYTEMLCNEPPCVSLGREGQCSDSGSCLYQTAYADGARTSGL 183

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNN------------------------------- 210
           L+ E V   S          + FGCG  N                               
Sbjct: 184 LSYEKVAFTSHYSDEDKTAQVGFGCGLQNLNFITSNRDGGVLGLGPGLVSLVSQLSAIGK 243

Query: 211 ---------GGLFNSKTTGIVGLG-----GGDIS--LISQMRTTIAGNQRLGVSTPDI-- 252
                    G + N    G +  G      GD++  +I++          LGV  P +  
Sbjct: 244 VSKSFAYCFGNISNPNAGGFLVFGDATYLNGDMTPMVIAEFYYVNLLGIGLGVGEPRLDI 303

Query: 253 ---------------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 296
                          +IDSG+TL+ F P+ Y     +V+  + +   ++  T S + C+ 
Sbjct: 304 NSSSFERKPDGSGGVIIDSGSTLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD-CFE 362

Query: 297 ---FNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
                 L   P + ++     +   R + F++  +++ C  F      + I G + Q ++
Sbjct: 363 GKIERDLPLFPTLVLYLESTGILNDRWSIFLQRYDELFCLGFTS-GEGLSIIGTLAQQSY 421

Query: 354 LVGYDIEQQTVSFKPT-DC 371
             GY++E  T+S +   DC
Sbjct: 422 KFGYNLELSTLSIESNPDC 440


>gi|21717166|gb|AAM76359.1|AC074196_17 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433306|gb|AAP54835.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125575546|gb|EAZ16830.1| hypothetical protein OsJ_32301 [Oryza sativa Japonica Group]
          Length = 373

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 92/354 (25%), Positives = 149/354 (42%), Gaps = 83/354 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  ++IGTPP    A+     + +WTQC PC   +C+ QD PLF+   SSTY+  PC +
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFNRSASSTYRPEPCGT 85

Query: 151 SQCASLNQKSCSGVN-CQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           + C S+   +CSG   C Y V   +GD S   G   T+T  +G+ T        + FGC 
Sbjct: 86  ALCESVPASTCSGDGVCSYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCA 136

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTT-------------------------IAGN 242
            ++        +G+VGLG    SL+ QM  T                         +AG 
Sbjct: 137 MDSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGG 196

Query: 243 QRL--------------------GVSTPDIVI-----------DSGTTLTFLPQGYNSNL 271
           +                      G+   D++I           D+   ++FL       +
Sbjct: 197 KSAATTPLVNTSDDSSDYMIHLEGIKFGDVIIAPPPNGSVVLVDTIFGVSFLVDAAFQAI 256

Query: 272 LSVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRG-ADVKLSRSNFF 323
              ++  + A P+A PT   +LC+         NS   +P+V + F+G A + +  S + 
Sbjct: 257 KKAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYM 316

Query: 324 VKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                  VC     S    +T  + I G + Q N    +D++++T+SF+P DC+
Sbjct: 317 YDAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 370


>gi|238479902|ref|NP_001154646.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332643534|gb|AEE77055.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 350

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 91/350 (26%), Positives = 149/350 (42%), Gaps = 57/350 (16%)

Query: 40  KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
           KSPF + ++      R     SL R       S + S  AS       +  Y + + IG 
Sbjct: 39  KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92

Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           PP   L +ADTGSDL+W +C  C    C +   + +F P+ SST+    C    C  + +
Sbjct: 93  PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150

Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
              + +         C Y   Y DGS ++G  A ET +L +++G+   L  + FGCG   
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCG--- 207

Query: 211 GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
              F      + G GG                          V+DSGTTL FL +    +
Sbjct: 208 ---FRISGQSVSGNGG-------------------------TVVDSGTTLAFLAEPAYRS 239

Query: 271 LLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVTIHFRGADVKL-SRSNFFV 324
           +++ +   ++  P+AD  T   +LC + + +++    +P +   F G  V +    N+F+
Sbjct: 240 VIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLKFEFSGGAVFVPPPRNYFI 298

Query: 325 KVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           +  E I C   + +   V   + GN+MQ  FL  +D ++  + F    C 
Sbjct: 299 ETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLGFSRRGCA 348


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score =  114 bits (285), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 138/332 (41%), Gaps = 76/332 (22%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN- 165
           V DT SD+ W QC PCP  QC++Q  PL+DP  SST+  +PC S  C  L     +G + 
Sbjct: 172 VVDTSSDIPWVQCLPCPIPQCHLQKDPLYDPAKSSTFAPIPCGSPACKELGSSYGNGCSP 231

Query: 166 ----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGI 221
               C+Y V+YGDG  + G   T+T+T+  T    + +    FGC     G F+++  GI
Sbjct: 232 TTDECKYIVNYGDGKATTGTYVTDTLTMSPT----IVVKDFRFGCSHAVRGSFSNQNAGI 287

Query: 222 VGLGGGDISLISQM-------------RTTIAGNQRLG---------------------- 246
           + LGGG  SL+ Q              + + AG   LG                      
Sbjct: 288 LALGGGRGSLLEQTADAYGNAFSYCIPKPSSAGFLSLGGPVEASLKFSYTPLIKNKHAPT 347

Query: 247 ---------------VSTPDIVIDSGTTL------TFLP-QGYNSNLLSVMSSMIEAQPV 284
                          ++ P     +G  +      T LP Q Y +   +  S+M    P+
Sbjct: 348 FYIVHLEAIIVAGKQLAVPPTAFATGAVMDSGAVVTQLPPQVYAALRAAFRSAMAAYGPL 407

Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--T 339
           A P  +L+ CY F      +VP+V++ F  GA + L  ++  +       C  F      
Sbjct: 408 AAPVRNLDTCYDFTRFPDVKVPKVSLVFAGGATLDLEPASIILD-----GCLAFAATPGE 462

Query: 340 NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            SV   GN+ Q  + V YD+    V F+   C
Sbjct: 463 ESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 73/195 (37%), Positives = 102/195 (52%), Gaps = 14/195 (7%)

Query: 52  QRLRDALTRSLNRLNHF--NQNSSISSSKASQADI----IPNNANYLIRISIGTPPTERL 105
           + +R  + +S  R+       NSS  SS A   D+     P+   Y++ IS+GTP     
Sbjct: 10  EAIRALVAKSHARVRWMAARANSSSWSSMAGTTDVESPLHPDGGGYVMDISVGTPGKRFR 69

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN 165
           A+ADTGSDL+W Q EPC  + C      +FDP+ SST++ + CSS  CA L      G +
Sbjct: 70  AIADTGSDLVWVQSEPC--TGC--SGGTIFDPRQSSTFREMDCSSQLCAELPGSCEPGSS 125

Query: 166 -CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  YG G  + G  A +T++LG+T+  +   P    GCG  N G       G+VGL
Sbjct: 126 TCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGF--DGVDGLVGL 182

Query: 225 GGGDISLISQMRTTI 239
           G G +SL SQ+   I
Sbjct: 183 GQGPVSLTSQLSAAI 197



 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 47/125 (37%), Positives = 66/125 (52%), Gaps = 7/125 (5%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIHF 310
           +IDSGTTLT++P G    +LS M SM+    V   +  L+LCY  S N   + P +TI  
Sbjct: 267 IIDSGTTLTYVPSGVYGRVLSRMESMVTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRL 326

Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
            GA +    SN+F+ V  S D VC +  G  + +P  I GN+MQ  + + YD     +SF
Sbjct: 327 AGATMTPPSSNYFLVVDDSGDTVC-LAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSF 385

Query: 367 KPTDC 371
               C
Sbjct: 386 VQAKC 390


>gi|22165126|gb|AAM93742.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|31433307|gb|AAP54836.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575547|gb|EAZ16831.1| hypothetical protein OsJ_32302 [Oryza sativa Japonica Group]
          Length = 405

 Score =  114 bits (284), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 92/355 (25%), Positives = 144/355 (40%), Gaps = 82/355 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-- 206
             C S+ + S  C+   C Y      G  + G   T+T  +G+      A   + FGC  
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167

Query: 207 ---------------------------------------GTNNGGLF------------N 215
                                                  G ++G LF            N
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKN 227

Query: 216 SKTTGIVGLGGGD---------ISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQG 266
           S T  ++    G          +  ++ ++T  A  Q    S   +++D+ +  ++L  G
Sbjct: 228 SSTPFVIKTSAGSSDNGSNPYYMVKLAGIKTGGAPLQAASSSGSTVLLDTVSRASYLADG 287

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG-ADVKLSRSNFFVK 325
               L   +++ +  QPVA P    +LC+        PE+   F G A + +  +N+ + 
Sbjct: 288 AYKALKKALTAAVGVQPVASPPKPYDLCFPKAVAGDAPELVFTFDGGAALTVPPANYLLA 347

Query: 326 VSEDIVCSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                VC            G      I G++ Q N  V +D++++T+SFKP DC+
Sbjct: 348 SGNGTVCLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 152/369 (41%), Gaps = 88/369 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+   +DPK S+++K++ C
Sbjct: 160 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNEAFYDPKTSASFKNITC 217

Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
           +  +C+ ++       C   N  C Y   YGD S + G+ A ET T+  TT +       
Sbjct: 218 NDPRCSLISSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYK 277

Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
           +  + FGCG  N GL                F+S+   + G                   
Sbjct: 278 VENMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSK 337

Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGVS------TPD-- 251
            + G D  L++                       Q+++ + G + L +       +PD  
Sbjct: 338 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGA 397

Query: 252 --IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VP 304
              +IDSGTTL++  +  Y          M E   V      L+ C++ + + +    +P
Sbjct: 398 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLVFRDFPVLDPCFNVSGIEENNIHLP 457

Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQ 362
           E+ I F  GA       N F+ +SED+VC    G   S   I GN  Q NF + YD +  
Sbjct: 458 ELGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMS 517

Query: 363 TVSFKPTDC 371
            + F PT C
Sbjct: 518 RLGFTPTKC 526


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 87/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 74  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 131

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C    C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 132 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 191

Query: 201 GITFGCGTNNGGLF---NSKTTGIVG----------------------------LGGGDI 229
            + FGCG N  G     +S   GI+G                            + GG I
Sbjct: 192 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 251

Query: 230 SLISQMR------TTIAGNQ---------------------RLGVSTPD--IVIDSGTTL 260
             + ++       T I  NQ                      L  +  D   +IDSGTTL
Sbjct: 252 FAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTL 311

Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKL 317
            +LPQ  YNS +  + +       +   T +   C+SF  N+    P V +HF  + +KL
Sbjct: 312 AYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-LKL 367

Query: 318 SR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           S    ++   + ED+ C  ++  G+T      V + G+++ +N LV YD+E + + +   
Sbjct: 368 SVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADH 427

Query: 370 DCT 372
           +C+
Sbjct: 428 NCS 430


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 87/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 78  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKTSSTSKN 135

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C    C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 136 VGCEDDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQ 195

Query: 201 GITFGCGTNNGGLF---NSKTTGIVG----------------------------LGGGDI 229
            + FGCG N  G     +S   GI+G                            + GG I
Sbjct: 196 EVVFGCGKNQSGQLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGI 255

Query: 230 SLISQMR------TTIAGNQ---------------------RLGVSTPD--IVIDSGTTL 260
             + ++       T I  NQ                      L  +  D   +IDSGTTL
Sbjct: 256 FAVGEVESPVVKTTPIVPNQVHYNVILKGMDVDGDPIDLPPSLASTNGDGGTIIDSGTTL 315

Query: 261 TFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKL 317
            +LPQ  YNS +  + +       +   T +   C+SF  N+    P V +HF  + +KL
Sbjct: 316 AYLPQNLYNSLIEKITAKQQVKLHMVQETFA---CFSFTSNTDKAFPVVNLHFEDS-LKL 371

Query: 318 SR--SNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           S    ++   + ED+ C  ++  G+T      V + G+++ +N LV YD+E + + +   
Sbjct: 372 SVYPHDYLFSLREDMYCFGWQSGGMTTQDGADVILLGDLVLSNKLVVYDLENEVIGWADH 431

Query: 370 DCT 372
           +C+
Sbjct: 432 NCS 434


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 101/199 (50%), Gaps = 27/199 (13%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ----------NSSISSSKASQ 81
            ++HRD+     + ++ T  + LR  L R   R    ++          N + S   A  
Sbjct: 72  RVVHRDA-----FAANATAAELLRHRLQRDKRRAARISKAAAGGGAGAANGTRSRGGAVA 126

Query: 82  ADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           A ++   A     Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q  P+FDP
Sbjct: 127 APVVSGLAQGSGEYFTKIGVGTPSTPALMVLDTGSDVVWLQCAPC--RRCYDQSGPVFDP 184

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           + SS+Y ++ C++  C  L+   C      C Y V+YGDGS + G+ ATET+T     G 
Sbjct: 185 RRSSSYGAVDCAAPLCRRLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG--GA 242

Query: 196 AVALPGITFGCGTNNGGLF 214
            VA   +  GCG +N GLF
Sbjct: 243 RVAR--VALGCGHDNEGLF 259



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 40/125 (32%), Positives = 59/125 (47%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
           +++DSGT++T L +   S L     +      ++    SL + CY      + +VP V++
Sbjct: 369 VIVDSGTSVTRLARPSYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTVSM 428

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G A+  L   N+ + V S    C  F G    V I GNI Q  F V +D + Q V F
Sbjct: 429 HFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 488

Query: 367 KPTDC 371
            P  C
Sbjct: 489 APKGC 493


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 81/237 (34%), Positives = 117/237 (49%), Gaps = 28/237 (11%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLR-DA-----LTRSLNRLNHFNQNSSISS 76
            A++G   +EL H  S  S   + +E  +  L  DA     L R +        + + S+
Sbjct: 35  RAESGATVLELRHHASFSSGGKSRAEEAHAVLASDAARVSSLQRRIGSYGLIRSSDAASA 94

Query: 77  SKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           SK +Q  +         NY+  + IG    E   + DT S+L W QCEPC    C+ Q  
Sbjct: 95  SKLAQVPVTSGARLRTLNYVATVGIGG--GEATVIVDTASELTWVQCEPC--DACHDQQE 150

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL------NQKSCSG--VNCQYSVSYGDGSFSNGNLAT 184
           PLFDP  S +Y ++PC+SS C +L      + ++C      C Y++SY DGS+S G LA 
Sbjct: 151 PLFDPSSSPSYAAVPCNSSSCDALRVATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAH 210

Query: 185 ETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           + ++L     Q     G  FGCGT+N G F   T+G++GLG   +SLISQ      G
Sbjct: 211 DRLSLAGEDIQ-----GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 261



 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 55/127 (43%), Gaps = 9/127 (7%)

Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++DSGT +T  +P  Y +     +S + E  P A P   L+ C+    L   QVP + + 
Sbjct: 336 IVDSGTIITSLVPSVYAAVRAEFVSQLAE-YPQAAPFSILDTCFDLTGLREVQVPSLKLV 394

Query: 310 FR-GADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
           F  GA+V++        V+ D   VC     + +    PI GN  Q N  V +D     +
Sbjct: 395 FDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQI 454

Query: 365 SFKPTDC 371
            F    C
Sbjct: 455 GFAQETC 461


>gi|367066697|gb|AEX12632.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066699|gb|AEX12633.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066701|gb|AEX12634.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066703|gb|AEX12635.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066705|gb|AEX12636.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066707|gb|AEX12637.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066709|gb|AEX12638.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066711|gb|AEX12639.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066713|gb|AEX12640.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066715|gb|AEX12641.1| hypothetical protein 2_5918_01 [Pinus taeda]
 gi|367066717|gb|AEX12642.1| hypothetical protein 2_5918_01 [Pinus taeda]
          Length = 137

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           K  QA +   N  +L++++IG P     A+ DTGSDL WTQC PC  S CY Q +P++DP
Sbjct: 8   KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCMPC--SDCYKQPTPIYDP 65

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            +SSTY ++ C SS C +L   +C    C+Y  +YGD S + G L+ ET TL S +    
Sbjct: 66  SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121

Query: 198 ALPGITFGCGTNNGG 212
            +P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 18/153 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + IG   +    + DTGSDL W QC PC    CY Q  PLF+P  SS++ SLPC+
Sbjct: 144 NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 199

Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           S  C +L   +     CS  N   C Y + YGDGS+S G L  E +TLG T      +  
Sbjct: 200 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 254

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
             FGCG NN GLF    +G++GL   ++SL+SQ
Sbjct: 255 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQ 286


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 97/365 (26%), Positives = 156/365 (42%), Gaps = 86/365 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + + +G+PP     + DTGSDL W QC PC    C+ Q+   +DPK S++YK++ C+ 
Sbjct: 155 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--HDCFQQNGAFYDPKASASYKNITCND 212

Query: 151 SQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
            +C  ++     K C   N  C Y   YGD S + G+ A ET T+  TT     +   + 
Sbjct: 213 PRCNLVSPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVE 272

Query: 201 GITFGCGTNNGGL----------------FNSKTTGIVG--------------------L 224
            + FGCG  N GL                F+S+   + G                    +
Sbjct: 273 NMMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSDTNVSSKLI 332

Query: 225 GGGDISLIS-----------------------QMRTTIAGNQRLGV--STPDI------- 252
            G D  L+S                       Q+++ I   + L +   T +I       
Sbjct: 333 FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGG 392

Query: 253 -VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTI 308
            +IDSGTTL++  +     + + ++   + + PV      L+ C++ + +   Q+PE+ I
Sbjct: 393 TIIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIDSIQLPELGI 452

Query: 309 HF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSF 366
            F  GA       N F+ ++ED+VC    G   S   I GN  Q NF + YD ++  + +
Sbjct: 453 AFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILYDTKRSRLGY 512

Query: 367 KPTDC 371
            PT C
Sbjct: 513 APTKC 517


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/226 (32%), Positives = 113/226 (50%), Gaps = 26/226 (11%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS-------ISSSKASQ 81
             V L+HRDS  +     +E   +RL+    R+   ++    N +       +S+ +   
Sbjct: 70  MHVRLLHRDS-FAVNATGAELLARRLQRDELRAAWIISTAAANGTPPPDVVGLSTGRGLV 128

Query: 82  ADII---PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A ++   P + +Y+ +I++GTP  E L   DT SDL W QC+PC   +CY Q  P+FDP+
Sbjct: 129 APVVSRAPTSGDYIAKIAVGTPAVEALLALDTASDLTWLQCQPC--RRCYPQSGPVFDPR 186

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGDG------SFSNGNLATETVTL 189
            S++Y  +   +  C +L +          C Y+V YGDG      S S G+L  ET+T 
Sbjct: 187 HSTSYGEMNYDAPDCQALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTF 246

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                QA     ++ GCG +N GLF +   GI+GL  G IS+  Q+
Sbjct: 247 AGGVRQAY----LSIGCGHDNKGLFGAPAAGILGLSRGQISIPHQI 288



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/132 (32%), Positives = 67/132 (50%), Gaps = 12/132 (9%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS------Q 302
           +++DSGTT+T L +  Y +   +  ++      V+   P+G  + CY+    +      +
Sbjct: 376 VILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLRHCVK 435

Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDI 359
           VP V++HF G  ++ L   N+ + V S   VC  F G  + SV + GNI+Q  F V YDI
Sbjct: 436 VPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVYDI 495

Query: 360 EQQTVSFKPTDC 371
             Q V F P  C
Sbjct: 496 GGQRVGFAPNSC 507


>gi|21717162|gb|AAM76355.1|AC074196_13 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433286|gb|AAP54824.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 397

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 149/353 (42%), Gaps = 79/353 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A  D   +L+WTQC  C    C+ QD P+F P  SST+K  
Sbjct: 54  NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 107

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  C S+    C+   C Y    G G  + G +AT+T  +G+      A   + FGC
Sbjct: 108 PCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 162

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------IAGNQRL----------- 245
              +        +G +GLG    SL++QM+ T             N RL           
Sbjct: 163 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGG 222

Query: 246 GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLL---------------- 272
           G  TP +                  I +G     +P+G N+ L+                
Sbjct: 223 GAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDSVYQ 282

Query: 273 ----SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKV 326
               +VM+S + A P A P G+  E+C+    +S  P++   F+ GA + +  +N+   V
Sbjct: 283 EFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDV 341

Query: 327 SEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             D VC     I        + + I G+  Q N  + +D+++  +SF+P DC+
Sbjct: 342 GNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 394


>gi|367066719|gb|AEX12643.1| hypothetical protein 2_5918_01 [Pinus radiata]
          Length = 137

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 59/135 (43%), Positives = 81/135 (60%), Gaps = 7/135 (5%)

Query: 78  KASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
           K  QA +   N  +L++++IG P     A+ DTGSDL WTQC PC  S CY Q +P++DP
Sbjct: 8   KDVQAPVSAGNGEFLMQLAIGKPSLAYSAILDTGSDLTWTQCIPC--SDCYKQPTPIYDP 65

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            +SSTY ++ C SS C +L   +C    C+Y  +YGD S + G L+ ET TL S +    
Sbjct: 66  SLSSTYGTVSCKSSLCLALPASACISATCEYLYTYGDYSSTQGILSYETFTLSSQS---- 121

Query: 198 ALPGITFGCGTNNGG 212
            +P I FGCG +N G
Sbjct: 122 -IPHIAFGCGQDNEG 135


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 286



 Score = 43.9 bits (102), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T LP      +           P A     L+ C++  S   +  P + + 
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 415

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F+G    +V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +
Sbjct: 416 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 475

Query: 365 SFKPTDC 371
                +C
Sbjct: 476 GIVGENC 482


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/241 (33%), Positives = 116/241 (48%), Gaps = 31/241 (12%)

Query: 12  FFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN 71
            F C  +++   A++     +L H DS +        T ++ LR  + RS  RL      
Sbjct: 19  LFPCVLLLTFSLAESAALRADLTHVDSGRG------FTKHELLRRMVARSKARL------ 66

Query: 72  SSISSSKASQADIIP--------NNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPC 122
           +S+ SS    A   P         ++ YLI + IGTP  +R+ +  DTGSDL+WTQC  C
Sbjct: 67  ASLRSSACDTALTAPVDHGGSDVGSSEYLIHLGIGTPRPQRVVLHLDTGSDLVWTQCA-C 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSF 177
             + C+ Q  P+F   +S T+  +PCS   C        SG      +C Y+  Y D S 
Sbjct: 126 --TVCFDQPVPVFRASVSHTFSRVPCSDPLCGHAVYLPLSGCAARDRSCFYAYGYMDHSI 183

Query: 178 SNGNLATETVTLGS--TTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + G +A +T T  +      A A+P I FGCG  N GLF    +GI G G G +SL SQ+
Sbjct: 184 TTGKMAEDTFTFKAPDRADTAAAVPNIRFGCGMMNYGLFTPNQSGIAGFGTGPLSLPSQL 243

Query: 236 R 236
           +
Sbjct: 244 K 244



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 42/133 (31%), Positives = 61/133 (45%), Gaps = 15/133 (11%)

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIE---AQPVADPTGSLELCYSFNSLSQ---VPEVT 307
           IDSGT +TF PQ    +L     + +    A+   DP     LC+S  +  +   VP++ 
Sbjct: 333 IDSGTAITFFPQAVFRSLREAFVAQVPLPVAKGYTDPDN--LLCFSVPAKKKAPAVPKLI 390

Query: 308 IHFRGADVKLSRSNFFVKVSED------IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIE 360
           +H  GAD +L R N+ +   +D       +C V     NS   I GN  Q N  + YD+E
Sbjct: 391 LHLEGADWELPRENYVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQNMHIVYDLE 450

Query: 361 QQTVSFKPTDCTK 373
              + F P  C K
Sbjct: 451 SNKMVFAPARCDK 463


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 134 NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 189

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 190 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 244

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G
Sbjct: 245 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 286



 Score = 43.1 bits (100), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T LP      +           P A     L+ C++  S   +  P + + 
Sbjct: 356 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 415

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F+G    +V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +
Sbjct: 416 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERL 475

Query: 365 SFKPTDC 371
                +C
Sbjct: 476 GIVGENC 482


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 65/153 (42%), Positives = 86/153 (56%), Gaps = 18/153 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + IG   +    + DTGSDL W QC PC    CY Q  PLF+P  SS++ SLPC+
Sbjct: 65  NYIVTVGIGGQNST--LIVDTGSDLTWVQCLPC--RLCYNQQEPLFNPSNSSSFLSLPCN 120

Query: 150 SSQCASLNQKS-----CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           S  C +L   +     CS  N   C Y + YGDGS+S G L  E +TLG T      +  
Sbjct: 121 SPTCVALQPTAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTLGKT-----EIDN 175

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
             FGCG NN GLF    +G++GL   ++SL+SQ
Sbjct: 176 FIFGCGRNNKGLFGG-ASGLMGLARSELSLVSQ 207


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 69/163 (42%), Positives = 93/163 (57%), Gaps = 21/163 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY++ + +G        + DTGSDL W QC+PC    CY Q  PL+DP +SS+YK++ C+
Sbjct: 86  NYIVTVELGGKNMS--LIVDTGSDLTWVQCQPC--RSCYNQQGPLYDPSVSSSYKTVFCN 141

Query: 150 SSQCASL-----NQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C  L     N   C G N      C+Y VSYGDGS++ G+LA+E++ LG T      
Sbjct: 142 SSTCQDLVAATSNSGPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILLGDT-----K 196

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           L    FGCG NN GLF   +  ++GLG   +SL+SQ   T  G
Sbjct: 197 LENFVFGCGRNNKGLFGGSSG-LMGLGRSSVSLVSQTLKTFNG 238



 Score = 43.9 bits (102), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 55/127 (43%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T LP      +           P A     L+ C++  S   +  P + + 
Sbjct: 308 ILIDSGTVITRLPPSIYKAVKIEFLKQFSGFPTAPGYSILDTCFNLTSYEDISIPIIKMI 367

Query: 310 FRG---ADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTV 364
           F+G    +V ++   +FVK    +VC     ++  N V I GN  Q N  V YD  Q+ +
Sbjct: 368 FQGNAELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERL 427

Query: 365 SFKPTDC 371
                +C
Sbjct: 428 GIVGENC 434


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 60/168 (35%), Positives = 88/168 (52%), Gaps = 4/168 (2%)

Query: 48  ETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAV 107
           E    RLR    R  +  +   +  S+  +    + +   +  Y  R+ IG+P       
Sbjct: 2   ERDEARLRWIHHRIQSSDHRHRRGRSLLQTAQVSSGLSLGSGEYFARMGIGSPQRSYYLE 61

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
            DTGSD+ W QC PC  S CY Q  P++DP  SS+Y+ + C S+ C +L+  +C G+ C 
Sbjct: 62  LDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRVYCGSALCQALDYSACQGMGCS 119

Query: 168 YSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN 215
           Y V YGD S S+G+L  E+  LG  +  + A+  I FGCG +N GLF 
Sbjct: 120 YRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGCGHSNSGLFR 165



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 32/130 (24%), Positives = 59/130 (45%), Gaps = 4/130 (3%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QV 303
           G  T   ++DSGT++T +     + L     +     P A     L+ C++F  L   Q+
Sbjct: 261 GNGTGGAILDSGTSVTRVVPAAYAVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQI 320

Query: 304 PEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           P + +HF    D+ L   N  + V      C  F   +  + + GN+ Q  F +G+D+++
Sbjct: 321 PSLVLHFDNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQR 380

Query: 362 QTVSFKPTDC 371
             ++  P +C
Sbjct: 381 SLIAIAPREC 390


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score =  112 bits (280), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 81/236 (34%), Positives = 113/236 (47%), Gaps = 42/236 (17%)

Query: 16  FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSIS 75
            +V   + A+  GFSVE IHRDS KSPF++ + TP+ R   A  RS  R    +   +  
Sbjct: 27  LFVSPAVGAEEDGFSVEFIHRDSVKSPFHDPALTPHGRALAAARRSAARAAELHHLLARR 86

Query: 76  SSKASQ--------ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------- 120
           SS A          A+++     YL+ I +GTPP   LA+ADTGSDL+W +C+       
Sbjct: 87  SSGAPSPGTGAGVVAEVVSRQFEYLMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNN 146

Query: 121 -PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVSYGDGSF 177
              PPS         F P  SSTY  + C +  C +L+   SCS   +C+Y  SYGDGS 
Sbjct: 147 STAPPSV-------YFVPSASSTYGRVGCDTKACRALSSAASCSPDGSCEYLYSYGDGSR 199

Query: 178 SNGNLATETVTLGSTTGQA-----------------VALPGITFGCGTNNGGLFNS 216
           ++G L+TET T  +    +                 V +  + FGC T   G F +
Sbjct: 200 ASGQLSTETFTFSTIADSSKTNSHGNNNNNSSSHGQVEIAKLDFGCSTTTTGTFRA 255



 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 50/144 (34%), Positives = 74/144 (51%), Gaps = 9/144 (6%)

Query: 239 IAGNQR-LGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF 297
           +AG +R    +   I++DSGTTLT+L     + L+  ++  I+      P   L+LCY  
Sbjct: 334 VAGTKRPTTAAQAHIIVDSGTTLTYLDSALLTPLVKDLTRRIKLPRAESPEKILDLCYDI 393

Query: 298 NSLSQ-----VPEVTIHFRGA-DVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIM 349
           + +       +P+VT+   G  +V L   N FV V E ++C      +   SV I GNI 
Sbjct: 394 SGVRGEDALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSVSILGNIA 453

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
           Q N  VGYD+E+ TV+F   DC K
Sbjct: 454 QQNLHVGYDLEKGTVTFAAADCAK 477


>gi|125595861|gb|EAZ35641.1| hypothetical protein OsJ_19928 [Oryza sativa Japonica Group]
          Length = 629

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 82/144 (56%), Gaps = 8/144 (5%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 71  GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 130

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 131 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 186

Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
           F+    G + LGGG  SL+ Q  T
Sbjct: 187 FDYDVAGSLALGGGSQSLVQQTAT 210


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 77/202 (38%), Positives = 108/202 (53%), Gaps = 26/202 (12%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NYLIRISIGTPPTERLAV 107
           L D   RS+   N   + +S  + +ASQ  I  ++       NY++ + +G+       +
Sbjct: 24  LDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSK--NMTVI 79

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCS 162
            DTGSDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS C SL     N  +C 
Sbjct: 80  IDTGSDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACG 137

Query: 163 GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
             N   C Y V+YGDGS++NG L  E ++ G      V++    FGCG NN GLF    +
Sbjct: 138 SSNPSTCNYVVNYGDGSYTNGELGVEALSFG-----GVSVSDFVFGCGRNNKGLFGG-VS 191

Query: 220 GIVGLGGGDISLISQMRTTIAG 241
           G++GLG   +SL+SQ   T  G
Sbjct: 192 GLMGLGRSYLSLVSQTNATFGG 213



 Score = 47.8 bits (112), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 56/128 (43%), Gaps = 7/128 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           I+IDSGT +T LP      L +         P A     L+ C++     +V  P +++ 
Sbjct: 285 ILIDSGTVITRLPSSVYKALKAEFLKKFTGFPSAPGFSILDTCFNLTGYDEVSIPTISLR 344

Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
           F G A + +  +  F  V ED   VC     ++++    I GN  Q N  V YD +Q  V
Sbjct: 345 FEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 404

Query: 365 SFKPTDCT 372
            F    C+
Sbjct: 405 GFAEEPCS 412


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 96/369 (26%), Positives = 153/369 (41%), Gaps = 88/369 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+   +DPK S+++K++ C
Sbjct: 158 GEYFMDVLVGTPPKHFSLILDTGSDLNWLQCLPC--YDCFHQNGMFYDPKTSASFKNITC 215

Query: 149 SSSQCASLN------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA----VA 198
           +  +C+ ++      Q      +C Y   YGD S + G+ A ET T+  TT +       
Sbjct: 216 NDPRCSLISSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYK 275

Query: 199 LPGITFGCGTNNGGL----------------FNSKTTGIVG------------------- 223
           +  + FGCG  N GL                F+S+   + G                   
Sbjct: 276 VGNMMFGCGHWNRGLFSGASGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRNSNTNVSSK 335

Query: 224 -LGGGDISLIS-----------------------QMRTTIAGNQRLGV--STPDI----- 252
            + G D  L++                       Q+++ + G + L +   T +I     
Sbjct: 336 LIFGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGD 395

Query: 253 ---VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VP 304
              +IDSGTTL++  +  Y          M E  P+      L+ C++ + + +    +P
Sbjct: 396 GGTIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLP 455

Query: 305 EVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQ 362
           E+ I F  G        N F+ +SED+VC    G   S   I GN  Q NF + YD ++ 
Sbjct: 456 ELGIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRS 515

Query: 363 TVSFKPTDC 371
            + F PT C
Sbjct: 516 RLGFTPTKC 524


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 102/382 (26%), Positives = 157/382 (41%), Gaps = 106/382 (27%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-PLFDPKMSSTYKSLPC 148
            YL+ +S+GTPP       DTGSDL+WTQC PC    C+ Q + P+ DP  SST+ ++ C
Sbjct: 93  EYLVHLSVGTPPRPVALTLDTGSDLVWTQCAPC--LNCFDQGAIPVLDPAASSTHAAVRC 150

Query: 149 SSSQCASLNQKSCSG-------VNCQYSVSYGDGSFSNGNLATETVTLG---STTGQAVA 198
            +  C +L   SC          +C Y   YGD S + G LA++  T G   +  G  V+
Sbjct: 151 DAPVCRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFTFGPGDNADGGGVS 210

Query: 199 LPGITFGCGTNNGG----------------------------------LFNSKTTGI-VG 223
              +TFGCG  N G                                  +F S ++ + +G
Sbjct: 211 ERRLTFGCGHFNKGIFQANETGIAGFGRGRWSLPSQLGVTSFSYCFTSMFESTSSLVTLG 270

Query: 224 LGGGDISLISQMRTT-------------------IAGNQRLGV-------STPDIVIDSG 257
           +   ++ L  Q+++T                     G  R+ +            +IDSG
Sbjct: 271 VAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLREASAIIDSG 330

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQ---PVADPTGS-LELCYSFNSLS------------ 301
            ++T LP+    ++   + +   AQ   PV+   GS L+LC++  S +            
Sbjct: 331 ASITTLPE----DVYEAVKAEFVAQVGLPVSAVEGSALDLCFALPSAAAPKSAFGWRWRG 386

Query: 302 -------QVPEVTIHFRG-ADVKLSRSNF-FVKVSEDIVCSVFKGIT---NSVPIYGNIM 349
                  +VP +  H  G AD +L R N+ F      ++C V    T   +   + GN  
Sbjct: 387 RGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGDQTVVIGNYQ 446

Query: 350 QTNFLVGYDIEQQTVSFKPTDC 371
           Q N  V YD+E   +SF P  C
Sbjct: 447 QQNTHVVYDLENDVLSFAPARC 468


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 112/417 (26%), Positives = 169/417 (40%), Gaps = 90/417 (21%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ--NSSISSSKASQADIIPN-- 87
           +LIH  S   P Y  +ET   R+   +  S  RL +       S+  +    A + P+  
Sbjct: 38  KLIHPGSVHHPHYKPNETAKDRMELDIEHSAARLAYIQARIEGSLVYNNDYTASVSPSLT 97

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL- 146
               L+ +SIG P   +L V DTGSD++W  C PC  + C      LFDP MSST+  L 
Sbjct: 98  GRTILVNLSIGQPSIPQLVVMDTGSDILWIMCNPC--TNCDNHLGLLFDPSMSSTFSPLC 155

Query: 147 --PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             PC    C       C  +   +++SY D S ++G    + +   +T      +  +  
Sbjct: 156 KTPCGFKGC------KCDPI--PFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVII 207

Query: 205 GCGTNNGGLFNSK--TTGIVGLGGGDISLISQ---------------------------- 234
           GCG N G  FNS     GI+GL  G  SL +Q                            
Sbjct: 208 GCGHNIG--FNSDPGYNGILGLNNGPNSLATQIGRKFSYCIGNLADPYYNYNQLRLGEGA 265

Query: 235 -------------------MRTTIAGNQRLGVS----------TPDIVIDSGTTLTFLPQ 265
                              M     G +RL ++          T  +++DSGTT+T+L  
Sbjct: 266 DLEGYSTPFEVYHGFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLVD 325

Query: 266 GYNSNLLSVMSSMIEA--QPVADPTGSLELCYS---FNSLSQVPEVTIHF-RGADVKLSR 319
             +  L + + ++++   + V       +LCY       L   P VT HF  GAD+ L  
Sbjct: 326 SAHKLLYNEVRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLALDT 385

Query: 320 SNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            +FF +  +DI C     +     T S  + G + Q ++ VGYD+  Q V F+  DC
Sbjct: 386 GSFFSQ-RDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|218197465|gb|EEC79892.1| hypothetical protein OsI_21411 [Oryza sativa Indica Group]
          Length = 720

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 57/144 (39%), Positives = 82/144 (56%), Gaps = 8/144 (5%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP MS+TY ++PC+S+ CA L 
Sbjct: 162 GTSAVTQTVIIDSGSDVSWVQCKPCPLPMCHRQRDPLFDPAMSTTYAAVPCTSAACAQLG 221

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS    CQ+ ++YGDGS + G  + + +TLG        + G  FGC   + G  
Sbjct: 222 PYRRGCSANAQCQFGINYGDGSTATGTYSFDDLTLGPYD----VIRGFRFGCAHADRGSA 277

Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
           F+    G + LGGG  SL+ Q  T
Sbjct: 278 FDYDVAGSLALGGGSQSLVQQTAT 301


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 56/131 (42%), Positives = 82/131 (62%), Gaps = 13/131 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ IG+P   +  V DTGSD+ W QC PC    CY Q+  +FDP+ SS+++ L 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATET--VTLGSTTGQAVALPGIT 203
           CS+ QC  L+ K+C+  +  C Y VSYGDGSF+ G+LA+++  V+ G T+        + 
Sbjct: 69  CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFLVSRGRTS-------PVV 121

Query: 204 FGCGTNNGGLF 214
           FGCG +N GLF
Sbjct: 122 FGCGHDNEGLF 132



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T LP    + +     S  +  P A      + CY F++L+ V  P V+ H
Sbjct: 234 VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFH 293

Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GA V+L  SN+ V V +    C  F   +  + I GNI Q    V  D++   V F 
Sbjct: 294 FEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFA 353

Query: 368 PTDC 371
           P  C
Sbjct: 354 PRQC 357


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 52/130 (40%), Positives = 79/130 (60%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ +G P  +   V DTGSD+ W QC+PC  + CY Q  P++DP +S++Y ++
Sbjct: 159 GSGEYFSRVGVGRPARQLYMVLDTGSDVTWLQCQPC--ADCYAQSDPVYDPSVSTSYATV 216

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C S +C  L+  +C  S  +C Y V+YGDGS++ G+ ATET+TLG +      +  +  
Sbjct: 217 GCDSPRCRDLDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTLGDS----APVSNVAI 272

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 273 GCGHDNEGLF 282



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 54/124 (43%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +++DSGT +T L  G    L        ++ P A      + CY     S  QVP V + 
Sbjct: 377 VIVDSGTAVTRLQSGAYGALREAFVQGTQSLPRASGVSLFDTCYDLAGRSSVQVPAVALW 436

Query: 310 FRGA-DVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F G  ++KL   N+ + V +    C  F G +  V I GN+ Q    V +D  + TV F 
Sbjct: 437 FEGGGELKLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFT 496

Query: 368 PTDC 371
              C
Sbjct: 497 ADKC 500


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 160/370 (43%), Gaps = 89/370 (24%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMSSTYK 144
           ++  + + + IGTPP  R  + DTGSDLIWTQC+    +    +    P++DP  SST+ 
Sbjct: 87  SDQGHSLTVGIGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFA 146

Query: 145 SLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            LPCS   C     + K+C+  N C Y   YG  + + G LA+ET T G+   +AV+L  
Sbjct: 147 FLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAA-AVGVLASETFTFGAR--RAVSLR- 202

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISL------------------------------ 231
           + FGCG  + G      TGI+GL    +SL                              
Sbjct: 203 LGFGCGALSAGSLIG-ATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMA 261

Query: 232 -ISQMRTTIA-------------------------GNQRLGVST------PD----IVID 255
            +S+ +TT                           G++RL V        PD     ++D
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 321

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEV 306
           SG+T+ +L +     +   +  ++   PVA+ T    ELC+     +        QVP +
Sbjct: 322 SGSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPL 380

Query: 307 TIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQT 363
            +HF G A + L R N+F +    ++C      T+   V I GN+ Q N  V +D++   
Sbjct: 381 VLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHK 440

Query: 364 VSFKPTDCTK 373
            SF PT C +
Sbjct: 441 FSFAPTQCDQ 450


>gi|297794561|ref|XP_002865165.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297795163|ref|XP_002865466.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311000|gb|EFH41424.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311301|gb|EFH41725.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 134

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 77/138 (55%), Gaps = 18/138 (13%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           + + C F+ FF                +VELIH DSP SP YN   T    L  A  RS+
Sbjct: 7   SLVDCDFLFFF----------NDWENLTVELIHSDSPHSPLYNPHHTVSDGLNAAFLRSI 56

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   FN  + +      Q+ +I N   Y + ISIGTPP++ LA+ADTGSDL W QC+PC
Sbjct: 57  SRSRRFNTKTDL------QSGLISNGGEYFMSISIGTPPSKVLAIADTGSDLTWVQCKPC 110

Query: 123 PPSQCYMQDSPLFDPKMS 140
              QCY Q+SPLFD K+S
Sbjct: 111 --QQCYKQNSPLFDKKIS 126


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 56/131 (42%), Positives = 82/131 (62%), Gaps = 13/131 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ IG+P   +  V DTGSD+ W QC PC    CY Q+  +FDP+ SS+++ L 
Sbjct: 11  SGEYFVRVGIGSPTKLQYLVMDTGSDVPWIQCSPC--KSCYKQNDAVFDPRASSSFRRLS 68

Query: 148 CSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQAVALPGIT 203
           CS+ QC  L+ K+C+  +  C Y VSYGDGSF+ G+LA++  +V+ G T+        + 
Sbjct: 69  CSTPQCKLLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSVSRGRTS-------PVV 121

Query: 204 FGCGTNNGGLF 214
           FGCG +N GLF
Sbjct: 122 FGCGHDNEGLF 132



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 41/124 (33%), Positives = 60/124 (48%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT++T LP    + +     S  +  P A      + CY F++L+ V  P V+ H
Sbjct: 234 VIIDSGTSVTRLPTYAYTVMRDAFRSATQKLPRAADFSLFDTCYDFSALTSVTIPTVSFH 293

Query: 310 FR-GADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GA V+L  SN+ V V +    C  F   +  + I GNI Q    V  D++   V F 
Sbjct: 294 FEGGASVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFA 353

Query: 368 PTDC 371
           P  C
Sbjct: 354 PRQC 357


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 102/212 (48%), Gaps = 29/212 (13%)

Query: 22  IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK--- 78
           +   T   SV L H D+  S F ++S     +LR  L R   R+      +++S+ +   
Sbjct: 57  VSESTTSLSVHLSHVDALSS-FSDASPVDLFKLR--LQRDSLRVKSITSLAAVSTGRNAT 113

Query: 79  ------------ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQ 126
                       A  + +   +  Y +R+ +GTP T    V DTGSD++W QC PC    
Sbjct: 114 KRTPRSAGGFSGAVISGLSQGSGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KA 171

Query: 127 CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNL 182
           CY Q   +FDPK S T+ ++PC S  C  L+  S C       C Y VSYGDGSF+ G+ 
Sbjct: 172 CYNQSDVIFDPKKSKTFATVPCGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDF 231

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
           +TET+T          +  +  GCG +N GLF
Sbjct: 232 STETLTF-----HGARVDHVPLGCGHDNEGLF 258



 Score = 60.8 bits (146), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T L Q     L             A      + C+  + ++  +VP V  H
Sbjct: 364 VIIDSGTSVTRLTQSAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 423

Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G +V L  SN+ + V +E   C  F G   S+ I GNI Q  F V YD+    V F  
Sbjct: 424 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 483

Query: 369 TDC 371
             C
Sbjct: 484 RAC 486


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 89/158 (56%), Gaps = 8/158 (5%)

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
            S    IPN A +L  ISIG PP  +L + DTGSDL W  C PC   +CY Q  P F P 
Sbjct: 66  VSHVTPIPNPAAFLANISIGNPPVPQLLLIDTGSDLTWIHCLPC---KCYPQTIPFFHPS 122

Query: 139 MSSTYKSLPCSSSQCA--SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            SSTY++  C S+  A   + +   +G NCQY + Y D S + G LA E +T  ++    
Sbjct: 123 RSSTYRNASCVSAPHAMPQIFRDEKTG-NCQYHLRYRDFSNTRGILAEEKLTFETSDDGL 181

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           ++   I FGCG +N G   +K +G++GLG G  S++++
Sbjct: 182 ISKQNIVFGCGQDNSGF--TKYSGVLGLGPGTFSIVTR 217


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 114/445 (25%), Positives = 173/445 (38%), Gaps = 83/445 (18%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDSPK--SPFYNSSETPYQRLRDALTRSLNRLN 66
            +LF    Y V     +    +++LIHR+S    +P      TP   ++     S  R  
Sbjct: 9   LLLFITVSYFVVTESIKPNRMAMKLIHRESVARLNPNARVPITPEDHIKHLTDISSARFK 68

Query: 67  HFNQNSSISSSKAS--QADI--IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +  QNS      +S  Q D+      + +L+  S+G PP  +L + DTGS L+W QC+PC
Sbjct: 69  YL-QNSIDKELGSSNFQVDVEQAIKTSLFLVNFSVGQPPVPQLTIMDTGSSLLWIQCQPC 127

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGN 181
                     P+F+P +SST+    C    C       C   N C Y   Y  G+ S G 
Sbjct: 128 KHCSSDHMIHPVFNPALSSTFVECSCDDRFCRYAPNGHCGSSNKCVYEQVYISGTGSKGV 187

Query: 182 LATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA- 240
           LA E +T  +  G  V    I FGCG  NG    S  TGI+GLG    SL  Q+ +  + 
Sbjct: 188 LAKERLTFTTPNGNTVVTQPIAFGCGYENGEQLESHFTGILGLGAKPTSLAVQLGSKFSY 247

Query: 241 ----------GNQRLGVSTPDIVIDSGTTLTFLPQG--YNSNL--LSVMSSMIEAQPVA- 285
                     G  +L +     ++   T + F  +   Y  NL  +SV  + +  +PV  
Sbjct: 248 CIGDLANKNYGYNQLVLGEDADILGDPTPIEFETENSIYYMNLEGISVGDTQLNIEPVVF 307

Query: 286 ------------------------------------DPTGSLE-------LCYS---FNS 299
                                               DP   LE       LCY       
Sbjct: 308 KRRGPRTGVILDSGTLYTWLADIAYRELYNEIKSILDP--KLERFWFRDFLCYHGRVSEE 365

Query: 300 LSQVPEVTIHFR-GADVKLSRSNFFVKVSE----DIVCSVFK------GITNSVPIYGNI 348
           L   P VT HF  GA++ +  ++ F  +SE    ++ C   K      G        G +
Sbjct: 366 LIGFPVVTFHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKEHGGEYKEFTAIGLM 425

Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
            Q  + +GYD++++ +  +  DC +
Sbjct: 426 AQQYYNIGYDLKEKNIYLQRIDCVQ 450


>gi|125532796|gb|EAY79361.1| hypothetical protein OsI_34489 [Oryza sativa Indica Group]
          Length = 405

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 94/349 (26%), Positives = 146/349 (41%), Gaps = 70/349 (20%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVS---------YGDGSFSNGNLATETVTLG--------- 190
             C S+ + S  C+   C Y             G  +F+ G  A ET+  G         
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGDTGGMAGTDTFAIGA-AKETLGFGCVVMTDKRL 173

Query: 191 STTGQAVALPGI--------------TFG---CGTNNGGLF------------NSKTTGI 221
            T G    + G+               F     G ++G LF            NS T  +
Sbjct: 174 KTIGGPSGIVGLGRTPWSLVTQMNVTAFSYCLAGKSSGALFLGATAKQLAGGKNSSTPFV 233

Query: 222 VGLGGGD---------ISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLL 272
           +    G          +  ++ ++   A  Q    S   +++D+ +  ++L  G    L 
Sbjct: 234 IKTSAGSSDNGSNPYYMVKLAGIKAGGAPLQAASSSGSTVLLDTVSRASYLADGAYKALK 293

Query: 273 SVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIV 331
             +++ +  QPVA P    +LC+S       PE+   F  GA + +  +N+ +      V
Sbjct: 294 KALTAAVGVQPVASPPKPYDLCFSKAVAGDAPELVFTFDGGAALTVPPANYLLASGNGTV 353

Query: 332 CSV--------FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           C            G      I G++ Q N  V +D++++T+SFKP DC+
Sbjct: 354 CLTIGSSASLNLTGELEGASILGSLQQENVHVLFDLKEETLSFKPADCS 402


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 69/152 (45%), Positives = 94/152 (61%), Gaps = 13/152 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 187 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           +GCG +N GLF  ++ G++GL    +SL+ Q+
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 272



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
           +IDSGT +T LP G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F 
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 406

Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G       + N  V V     C  F     S  I GN  Q  F V YD++   + F    
Sbjct: 407 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 465

Query: 371 CT 372
           C+
Sbjct: 466 CS 467


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 89/253 (35%), Positives = 119/253 (47%), Gaps = 44/253 (17%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI------ 84
           V L+HRDS      N+S        D L R L R     + + I +  A+ AD       
Sbjct: 68  VRLVHRDSFA---VNASAA------DLLARRLQR--DMRRAAWIITKAATPADPENGTVV 116

Query: 85  --IPNNANYLIRISIGTPPT-----ERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
              P +  Y+ +I++GTP       E L   D GSD+ W QC PC   +CY Q  P+++ 
Sbjct: 117 TGAPTSGEYIAKITVGTPYENDSSFEALLSPDMGSDVTWLQCMPC--FRCYHQPGPVYNR 174

Query: 138 KMSSTYKSLPCSSSQCASLNQKS-CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
             SS+   + C +  C +L     C      CQY V YGDGS S G+   ET+T      
Sbjct: 175 LKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTFP---- 230

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVI 254
             V +PG+  GCG++N GLF +   GI+GLG G +S  SQ    IAG  R G S    + 
Sbjct: 231 PGVRVPGVAIGCGSDNQGLFPAPAAGILGLGRGSLSFPSQ----IAG--RYGRSFSYCLA 284

Query: 255 DSGT-----TLTF 262
             GT     TLTF
Sbjct: 285 GQGTGGRSSTLTF 297



 Score = 49.7 bits (117), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 47/168 (27%), Positives = 78/168 (46%), Gaps = 17/168 (10%)

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD--IVIDSGTTLTFLPQ-G 266
           N  ++     G+VG+  G +    ++R     + RL  ST    +++DSGT +T L    
Sbjct: 319 NSRMYTFYYVGLVGISVGGV----RVRGVTESDLRLDPSTGHGGVIVDSGTAVTRLSGPA 374

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSL---ELCYS---FNSLSQVPEVTIHFRGA-DVKLSR 319
           Y +   +   + ++      P G     + CYS      + +VP V++HF G  +VKL  
Sbjct: 375 YAAFRDAFRVAAVKELGWPSPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPP 434

Query: 320 SNFFVKV--SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTV 364
            N+ + V  ++  +C  F G  +  V I GNI    F V YD++ Q V
Sbjct: 435 QNYLIPVDSNKGTMCFAFAGSGDRGVSIIGNIQLQGFRVVYDVDGQRV 482


>gi|21717157|gb|AAM76350.1|AC074196_8 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433294|gb|AAP54832.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 396

 Score =  110 bits (276), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 154/354 (43%), Gaps = 75/354 (21%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + A Y++ ++IGTPP    A+ D G +L+WTQC + C   +C+ QD PLFD   SST++ 
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
            PC ++ C S+  +SC+G            SF    G + T+ V +G+      A   + 
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------------------------I 239
           FGC   +       ++G VGLG  ++SL +QM  T                        +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKL 219

Query: 240 AGNQRLGVSTPDI-------------------VIDSGTTLTFLPQGYNS----------- 269
           AG  +   +TP +                    I +G     +PQ  N+           
Sbjct: 220 AGAGKGAGTTPFVKTSTPPNSGLSRSYLLRLEAIRAGNATIAMPQSGNTITVSTATPVTA 279

Query: 270 -------NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRS 320
                  +L   ++  + A PV  P  + +LC+   S S   P++ + F+ GA++ +  S
Sbjct: 280 LVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVPVS 339

Query: 321 NFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++      D  C    G      V I G++ Q N  + +D++++T+SF+P DC+
Sbjct: 340 SYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|218184943|gb|EEC67370.1| hypothetical protein OsI_34481 [Oryza sativa Indica Group]
          Length = 367

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 148/353 (41%), Gaps = 79/353 (22%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A  D   +L+WTQC  C    C+ QD P+F P  SST+K  
Sbjct: 24  NVANF----TIGTPPQAASAFIDLTGELVWTQCSQC--IHCFKQDLPVFVPNASSTFKPE 77

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  C S+    C+   C +    G G  + G +AT+T  +G+      A   + FGC
Sbjct: 78  PCGTDVCKSIPTPKCASDVCAFDGVTGLGGHTVGIVATDTFAIGTA-----APASLGFGC 132

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT----------IAGNQRL----------- 245
              +        +G +GLG    SL++QM+ T             N RL           
Sbjct: 133 VVASDIDTMGGPSGFIGLGRTPWSLVAQMKLTRFSYCLAPHDTGKNSRLFLGASAKLAGG 192

Query: 246 GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLL---------------- 272
           G  TP +                  I +G     +P+G N+ L+                
Sbjct: 193 GAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGRNTVLVQTAVVRVSLLVDSVYQ 252

Query: 273 ----SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKV 326
               +VM+S + A P A P G   E+C+    +S  P++   F+ GA + +  +N+   V
Sbjct: 253 EFKKAVMAS-VGAAPTATPVGEPFEVCFPKAGVSGAPDLVFTFQAGAALTVPPANYLFDV 311

Query: 327 SEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             D VC     I        + + I G+  Q N  + +D+++  +SF+P DC+
Sbjct: 312 GNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFDLDKDMLSFEPADCS 364


>gi|125532792|gb|EAY79357.1| hypothetical protein OsI_34486 [Oryza sativa Indica Group]
          Length = 396

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/354 (25%), Positives = 154/354 (43%), Gaps = 75/354 (21%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           + A Y++ ++IGTPP    A+ D G +L+WTQC + C   +C+ QD PLFD   SST++ 
Sbjct: 47  SQAFYVVNLTIGTPPQPVSAIIDIGGELVWTQCAQHC--RRCFKQDLPLFDTNASSTFRP 104

Query: 146 LPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN--GNLATETVTLGSTTGQAVALPGIT 203
            PC ++ C S+  +SC+G            SF    G + T+ V +G+      A   + 
Sbjct: 105 EPCGAAVCESIPTRSCAGDGGGACGYEASTSFGRTVGRIGTDAVAIGT-----AATARLA 159

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------------------------I 239
           FGC   +       ++G VGLG  ++SL +QM  T                        +
Sbjct: 160 FGCAVASEMDTMWGSSGSVGLGRTNLSLAAQMNATAFSYCLAPPDTGKSSALFLGASAKL 219

Query: 240 AGNQRLGVSTPDI-------------------VIDSGTTLTFLPQGYNS----------- 269
           AG  +   +TP +                    I +G     +PQ  N+           
Sbjct: 220 AGAGKGAGTTPFVKTSTPPHSGLSRSYLLRLEAIRAGNATIAMPQSGNTIMVSTATPVTA 279

Query: 270 -------NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR-GADVKLSRS 320
                  +L   ++  + A PV  P  + +LC+   S S   P++ + F+ GA++ +  S
Sbjct: 280 LVDSVYRDLRKAVADAVGAAPVPPPVQNYDLCFPKASASGGAPDLVLAFQGGAEMTVPVS 339

Query: 321 NFFVKVSEDIVCSVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++      D  C    G      V I G++ Q N  + +D++++T+SF+P DC+
Sbjct: 340 SYLFDAGNDTACVAILGSPALGGVSILGSLQQVNIHLLFDLDKETLSFEPADCS 393


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 76/198 (38%), Positives = 108/198 (54%), Gaps = 23/198 (11%)

Query: 56  DALTRSL-NRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTG 111
           D   RS+ NR+     + ++ +S+      + I     NY++ + +G+  T    + DTG
Sbjct: 26  DLRVRSMQNRIRRVVSSHNVEASQTQIPLSSGINLQTLNYIVTMGLGS--TNMTVIIDTG 83

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCSGVN- 165
           SDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS C SL     N  +C G N 
Sbjct: 84  SDLTWVQCEPC--MSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGAC-GSNP 140

Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVG 223
             C Y V+YGDGS++NG L  E ++ G      V++    FGCG NN GLF    +G++G
Sbjct: 141 STCNYVVNYGDGSYTNGELGVEQLSFG-----GVSVSDFVFGCGRNNKGLFGG-VSGLMG 194

Query: 224 LGGGDISLISQMRTTIAG 241
           LG   +SL+SQ   T  G
Sbjct: 195 LGRSYLSLVSQTNATFGG 212



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 60/128 (46%), Gaps = 7/128 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           ++IDSGT +T LP      L ++        P A     L+ C++     +V  P +++H
Sbjct: 283 VLIDSGTVITRLPSSVYKALKALFLKQFTGFPSAPGFSILDTCFNLTGYDEVSIPTISMH 342

Query: 310 FRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
           F G A++K+  +  F  V ED   VC     ++++    I GN  Q N  V YD +Q  V
Sbjct: 343 FEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 402

Query: 365 SFKPTDCT 372
            F    C+
Sbjct: 403 GFAEESCS 410


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 74/130 (56%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C++  C  L+   C      C Y V+YGDGS + G+ ATET+T  S       +P +  
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 257 GCGHDNEGLF 266



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 44/156 (28%), Positives = 70/156 (44%), Gaps = 16/156 (10%)

Query: 232 ISQMRTTIAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
           +  M  ++ G +  GV+  D           +++DSGT++T L +   + L     +   
Sbjct: 345 VQLMGISVGGARVPGVAVSDLRLDPSTGRGGVIVDSGTSVTRLARPAYAALRDAFRAAAA 404

Query: 281 AQPVADPTGSL-ELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVF 335
              ++    SL + CY  + L   +VP V++HF G A+  L   N+ + V S    C  F
Sbjct: 405 GLRLSPGGFSLFDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAF 464

Query: 336 KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            G    V I GNI Q  F V +D + Q + F P  C
Sbjct: 465 AGTDGGVSIIGNIQQQGFRVVFDGDGQRLGFVPKGC 500


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 54/129 (41%), Positives = 74/129 (57%), Gaps = 4/129 (3%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  R+ IG P        DTGSD+ W QC PC  S CY Q  P++DP  SS+Y+ +
Sbjct: 8   GSGEYFARMGIGNPQRSYYLELDTGSDVTWIQCAPC--SSCYSQVDPIYDPSNSSSYRRV 65

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            C S+ C +L+  +C G+ C Y V YGD S S+G+L  E+  LG  +  + A+  I FGC
Sbjct: 66  YCGSALCQALDYSACQGMGCSYRVVYGDSSASSGDLGIESFYLGPNS--STAMRNIAFGC 123

Query: 207 GTNNGGLFN 215
           G +N GLF 
Sbjct: 124 GHSNSGLFR 132



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 6/131 (4%)

Query: 246 GVSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--Q 302
           G  T   ++DSGT++T  +P  Y + L     +     P A     L+ C++F  L   Q
Sbjct: 228 GNGTGGAILDSGTSVTRVVPPAY-AVLRDAYRAASRNLPPAPGVYLLDTCFNFQGLPTVQ 286

Query: 303 VPEVTIHF-RGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
           +P + +HF  G D+ L   N  + V      C  F   +  + + GN+ Q  F +G+D++
Sbjct: 287 IPSLVLHFDNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQ 346

Query: 361 QQTVSFKPTDC 371
           +  ++  P +C
Sbjct: 347 RSLIAIAPREC 357


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 11/131 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q   +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189

Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T          +  + 
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244

Query: 204 FGCGTNNGGLF 214
            GCG +N GLF
Sbjct: 245 LGCGHDNEGLF 255



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/123 (31%), Positives = 55/123 (44%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T L Q     L             A      + C+  + ++  +VP V  H
Sbjct: 361 VIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 420

Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G +V L  SN+ + V +E   C  F G   S+ I GNI Q  F V YD+    V F  
Sbjct: 421 FGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 480

Query: 369 TDC 371
             C
Sbjct: 481 RAC 483


>gi|14532550|gb|AAK64003.1| AT3g61820/F15G16_210 [Arabidopsis thaliana]
          Length = 362

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 55/131 (41%), Positives = 73/131 (55%), Gaps = 11/131 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q   +FDPK S T+ ++P
Sbjct: 132 SGEYFMRLGVGTPATNVYMVLDTGSDVVWLQCSPC--KACYNQTDAIFDPKKSKTFATVP 189

Query: 148 CSSSQCASLNQKS-C---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T          +  + 
Sbjct: 190 CGSRLCRRLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLTF-----HGARVDHVP 244

Query: 204 FGCGTNNGGLF 214
            GCG +N GLF
Sbjct: 245 LGCGHDNEGLF 255


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 96/156 (61%), Gaps = 13/156 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSM 274



 Score = 55.1 bits (131), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
           +IDSGT +T LP G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F 
Sbjct: 345 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 404

Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G       + N  V V     C  F     S  I GN  Q  F V YD++   + F    
Sbjct: 405 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAAG 463

Query: 371 CT 372
           C+
Sbjct: 464 CS 465


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 102/405 (25%), Positives = 158/405 (39%), Gaps = 79/405 (19%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           V L+HR  P +P  ++   P   + +   RS  RL++      +S        +   +  
Sbjct: 56  VPLLHRHGPCAPSLSTDTPP--SMSEMFRRSHARLSYIVSGKKVSVPAHLGTSV--KSLE 111

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  +S GTP   ++ V DTGSDL W QC+PC   QC  Q  PLFDP  SSTY ++PC+S
Sbjct: 112 YVATVSFGTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCAS 171

Query: 151 SQCASLNQKS----CS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            +C  L   +    CS G  C +++SY DG+ + G    + +TL         +    FG
Sbjct: 172 GECKKLAADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFG 227

Query: 206 CGTNN--------------------GGLF-------------NSKTTGIVGLGGGD---- 228
           CG +                     G  +             NSK  G +  G G     
Sbjct: 228 CGHSKSSLPGLFDGLLGLGRLSESLGAQYGGGGGFSYCLPAVNSK-PGFLAFGAGRNPSG 286

Query: 229 --------------ISLISQMRTTIAGNQ---RLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
                          S ++    T+ G +   R    +  +++DSGT +T L       L
Sbjct: 287 FVFTPMGRVPGQPTFSTVTLAGITVGGKKLDLRPSAFSGGMIVDSGTVVTVLQSTVYRAL 346

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSE 328
            +     ++A  +    G L+ CY         VP++ + F  GA + L   N  +    
Sbjct: 347 RAAFREAMKAYRLVH--GDLDTCYDLTGYKNVVVPKIALTFSGGATINLDVPNGILVNG- 403

Query: 329 DIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              C  F   G   +  + GN+ Q  F V +D       F+   C
Sbjct: 404 ---CLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 69/156 (44%), Positives = 96/156 (61%), Gaps = 13/156 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 126 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYASVSCS 184

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+LN  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 185 AQQCSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 239

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           +GCG +N GLF  ++ G++GL    +SL+ Q+  ++
Sbjct: 240 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQLAPSM 274



 Score = 54.3 bits (129), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
           +IDSGT +T LP G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F 
Sbjct: 345 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 404

Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G       + N  V V     C  F     S  I GN  Q  F V YD++   + F    
Sbjct: 405 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 463

Query: 371 CT 372
           C+
Sbjct: 464 CS 465


>gi|115466078|ref|NP_001056638.1| Os06g0121500 [Oryza sativa Japonica Group]
 gi|113594678|dbj|BAF18552.1| Os06g0121500 [Oryza sativa Japonica Group]
          Length = 442

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 169 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 228

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 229 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 284

Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
             +          + ++R    G +RL V         V+DS   +T L P  Y +  L+
Sbjct: 285 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 344

Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
             S+M     VA     L+ CY F   +   VP V++ F G  V          V  D +
Sbjct: 345 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 394

Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             + +G    VP          GN+ Q    V YD+   +V F+   C
Sbjct: 395 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 442


>gi|55296886|dbj|BAD68338.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|55296941|dbj|BAD68392.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
          Length = 424

 Score =  110 bits (274), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 266

Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
             +          + ++R    G +RL V         V+DS   +T L P  Y +  L+
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 326

Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
             S+M     VA     L+ CY F   +   VP V++ F G  V          V  D +
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 376

Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             + +G    VP          GN+ Q    V YD+   +V F+   C
Sbjct: 377 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVGGGSVGFRRGAC 424


>gi|55296937|dbj|BAD68388.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|218197467|gb|EEC79894.1| hypothetical protein OsI_21421 [Oryza sativa Indica Group]
          Length = 424

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/288 (29%), Positives = 124/288 (43%), Gaps = 39/288 (13%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G +    
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMFART 266

Query: 227 GDIS--------LISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLS 273
             +          + ++R    G +RL V         V+DS   +T L P  Y +  L+
Sbjct: 267 PLVRNPSIIPTLYLVRLRGIEVGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLA 326

Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIV 331
             S+M     VA     L+ CY F   +   VP V++ F G  V          V  D +
Sbjct: 327 FRSAMAAYPRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAM 376

Query: 332 CSVFKGITNSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             + +G    VP          GN+ Q    V YD+   +V F+   C
Sbjct: 377 GVMVEGCLAFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 424


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/130 (40%), Positives = 74/130 (56%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 143 GSGEYFTKIGVGTPVTPALMVLDTGSDVVWLQCAPC--RRCYDQSGQMFDPRASHSYGAV 200

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C++  C  L+   C      C Y V+YGDGS + G+ ATET+T  S       +P +  
Sbjct: 201 DCAAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAS----GARVPRVAL 256

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 257 GCGHDNEGLF 266



 Score = 53.5 bits (127), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 41/85 (48%), Gaps = 4/85 (4%)

Query: 291 LELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYG 346
            + CY  + L   +VP V++HF G A+  L   N+ + V S    C  F G    V I G
Sbjct: 417 FDTCYDLSGLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIG 476

Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
           NI Q  F V +D + Q + F P  C
Sbjct: 477 NIQQQGFRVVFDGDGQRLGFVPKGC 501


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 86/233 (36%), Positives = 122/233 (52%), Gaps = 33/233 (14%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-------------RDALTRSLNRLNHFNQNSSI 74
           G  + L H  SP SP    ++ P+  +             R A T S +R     + SS 
Sbjct: 40  GLHLTLHHPRSPCSPAPLPADVPFSAVLTHDHARIASLAARLAKTPS-SRPTKLRRGSSS 98

Query: 75  SSSKASQADII--PNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
           S    S A +   P  +    NY+ R+ +GTP    + V DTGS L W QC PC  S C+
Sbjct: 99  SPDAESLASVPLGPGTSVGVGNYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVS-CH 157

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNL 182
            Q  P+F+P+ SS+Y S+ CS+ QC     A+LN  +CS  N C Y  SYGD SFS G L
Sbjct: 158 RQSGPVFNPRSSSSYASVSCSAPQCDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYL 217

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + +TV+ GST+     +P   +GCG +N GLF  ++ G++GL    +SL+ Q+
Sbjct: 218 SKDTVSFGSTS-----VPNFYYGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 264



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/122 (31%), Positives = 57/122 (46%), Gaps = 3/122 (2%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFR 311
           +IDSGT +T LP    S L   ++  ++  P A     L+ C+    S  +VP+V++ F 
Sbjct: 338 IIDSGTVITRLPTDVYSALSKAVAGAMKGTPRASAFSILDTCFQGQASRLRVPQVSMAFA 397

Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            GA +KL  +N  V V     C  F     S  I GN  Q  F V YD++   + F    
Sbjct: 398 GGAALKLKATNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 456

Query: 371 CT 372
           C+
Sbjct: 457 CS 458


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  109 bits (272), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/168 (40%), Positives = 97/168 (57%), Gaps = 18/168 (10%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
           SSS   QA +      Y + IS+GTPP +   + DTGS+LIW QC PC  ++C+ +   +
Sbjct: 75  SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
           P+  P  SST+  LPC+ S C  L      ++C+    C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           T+G  T      P + FGC T NG      ++GIVGLG G +SL+SQ+
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL 231



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 18/139 (12%)

Query: 253 VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQV 303
           ++DSGTTLT+L + GY        S M+++ +  P +     L+LCY  ++       +V
Sbjct: 316 IVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRV 375

Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSED------IVCSVFKGITNSVPI--YGNIMQTNFL 354
           P + + F G A   +   N+F  V  D      + C +    T+ +PI   GN+MQ +  
Sbjct: 376 PRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMH 435

Query: 355 VGYDIEQQTVSFKPTDCTK 373
           + YDI+    SF P DC K
Sbjct: 436 LLYDIDGGMFSFAPADCAK 454


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 54/121 (44%), Positives = 75/121 (61%), Gaps = 5/121 (4%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCP-PSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           + +G P      V DTGSD+ W QC PC   + CY Q +P+FDP++SS+Y  + C S QC
Sbjct: 1   MRVGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQC 60

Query: 154 ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL 213
             L++  C+  +C Y V YGDGSF+ G LATET+T   +     ++P I+ GCG +N GL
Sbjct: 61  QLLDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSN----SIPNISIGCGHDNEGL 116

Query: 214 F 214
           F
Sbjct: 117 F 117



 Score = 47.0 bits (110), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 36/124 (29%), Positives = 51/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGTT+T LP      L      +    P A      + CY  +S S  +VP +   
Sbjct: 213 IIVDSGTTITQLPSDVYEVLREAFLGLTTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFI 272

Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             G + ++L   N  ++V S    C  F   T  + I GN  Q    V YD+    V F 
Sbjct: 273 LPGENSLQLPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFS 332

Query: 368 PTDC 371
              C
Sbjct: 333 TNKC 336


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 57/131 (43%), Positives = 74/131 (56%), Gaps = 11/131 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +R+ +GTP T    V DTGSD++W QC PC    CY Q  P+F+P  S T+ ++P
Sbjct: 133 SGEYFMRLGVGTPATNMYMVLDTGSDVVWLQCSPC--KVCYNQSDPVFNPAKSKTFATVP 190

Query: 148 CSSSQCASLNQKS-CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           C S  C  L+  S C       C Y VSYGDGSF+ G+ +TET+T        VAL    
Sbjct: 191 CGSRLCRRLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTFHGARVDHVAL---- 246

Query: 204 FGCGTNNGGLF 214
            GCG +N GLF
Sbjct: 247 -GCGHDNEGLF 256



 Score = 59.3 bits (142), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 38/123 (30%), Positives = 55/123 (44%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++IDSGT++T L Q     L             A      + C+  + ++  +VP V  H
Sbjct: 362 VIIDSGTSVTRLTQSAYVALRDAFRLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFH 421

Query: 310 FRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           F G +V L  SN+ + V+ +   C  F G   S+ I GNI Q  F V YD+    V F  
Sbjct: 422 FTGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLS 481

Query: 369 TDC 371
             C
Sbjct: 482 RAC 484


>gi|356513697|ref|XP_003525547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 252

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 76/202 (37%), Positives = 108/202 (53%), Gaps = 26/202 (12%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNA------NYLIRISIGTPPTERLAV 107
           L D   RS+   N   + +S  + +ASQ  I  ++       NY++ + +G+       +
Sbjct: 24  LDDLRVRSMQ--NRIRRVASTHNVEASQTQIPLSSGINLQTLNYIVTMGLGSK--NMTVI 79

Query: 108 ADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL-----NQKSCS 162
            DT SDL W QCEPC    CY Q  P+F P  SS+Y+S+ C+SS C SL     N  +C 
Sbjct: 80  IDTRSDLTWVQCEPCMS--CYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQFATGNTGACG 137

Query: 163 GVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
             N   C Y V+YGDGS++NG+L  E ++ G      V++    FGCG NN GLF    +
Sbjct: 138 SSNPSTCNYVVNYGDGSYTNGDLGVEALSFG-----GVSVSDFVFGCGRNNKGLFGG-VS 191

Query: 220 GIVGLGGGDISLISQMRTTIAG 241
           G++GLG   +SL+SQ   T  G
Sbjct: 192 GLMGLGRSYLSLVSQTNATFGG 213


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/168 (40%), Positives = 97/168 (57%), Gaps = 18/168 (10%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ--DS 132
           SSS   QA +      Y + IS+GTPP +   + DTGS+LIW QC PC  ++C+ +   +
Sbjct: 75  SSSVNVQAQLENGAGAYNMNISLGTPPLDFPVIVDTGSNLIWAQCAPC--TRCFPRPTPA 132

Query: 133 PLFDPKMSSTYKSLPCSSSQCASL----NQKSCSG-VNCQYSVSYGDGSFSNGNLATETV 187
           P+  P  SST+  LPC+ S C  L      ++C+    C Y+ +YG G ++ G LATET+
Sbjct: 133 PVLQPARSSTFSRLPCNGSFCQYLPTSSRPRTCNATAACAYNYTYGSG-YTAGYLATETL 191

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           T+G  T      P + FGC T NG      ++GIVGLG G +SL+SQ+
Sbjct: 192 TVGDGT-----FPKVAFGCSTENG---VDNSSGIVGLGRGPLSLVSQL 231



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 43/139 (30%), Positives = 68/139 (48%), Gaps = 18/139 (12%)

Query: 253 VIDSGTTLTFLPQ-GY---NSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-----LSQV 303
           ++DSGTTLT+L + GY        S M+++ +  P +     L+LCY  ++       +V
Sbjct: 316 IVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCYKPSAGGGGKAVRV 375

Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSED------IVCSVFKGITNSVPI--YGNIMQTNFL 354
           P + + F G A   +   N+F  V  D      + C +    T+ +PI   GN+MQ +  
Sbjct: 376 PRLALRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISIIGNLMQMDMH 435

Query: 355 VGYDIEQQTVSFKPTDCTK 373
           + YDI+    SF P DC K
Sbjct: 436 LLYDIDGGMFSFAPADCAK 454


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 98/197 (49%), Gaps = 23/197 (11%)

Query: 30  SVELIHRDSPKSPFYN---SSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADII 85
           S+E++HR  P     N    ++ P     +   R  NR++  +   SS       QA  +
Sbjct: 1   SLEVVHRHGPCIGIVNQEKGADAPSNM--EIFLRDQNRVDSIHARLSSRGMFPEKQATTL 58

Query: 86  P-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           P          +Y++ + +GTP  E   + DTGSD+ WTQCEPC  + CY Q  P  +P 
Sbjct: 59  PVQSGASIGAGDYVVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKT-CYKQKEPRLNPS 117

Query: 139 MSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            S++YK++ CSS+ C  +       +SCS   C Y V YGDGS+S G  ATET+TL S+ 
Sbjct: 118 TSTSYKNISCSSALCKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTLSSSN 177

Query: 194 GQAVALPGITFGCGTNN 210
                     FGCG  N
Sbjct: 178 ----VFKNFLFGCGQQN 190



 Score = 48.1 bits (113), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 57/126 (45%), Gaps = 6/126 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           VIDSGT +T L     S L S   +++   P        + CY F+     ++P+V + F
Sbjct: 287 VIDSGTVITRLSPTAYSELSSAFQNLMTDYPSTSGYSIFDTCYDFSKYDTVRIPKVGVTF 346

Query: 311 RGA-DVKLSRSNFFVKVSE-DIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSF 366
           +G  ++ +  S     V+    VC  F G  +     I+GN+ Q  + V YD  +  V F
Sbjct: 347 KGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKGRVGF 406

Query: 367 KPTDCT 372
            P  C+
Sbjct: 407 APGGCS 412


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 10/156 (6%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R +N  +  N  ++  +S ASQ         Y  RI +G P      V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212

Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           +PC     CY Q  P+FDPK SS+Y  L C S QC  L++ +C   +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
            G LATET +   +     ++P +  GCG +N GLF
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF 304



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 53/124 (42%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGTT+T +P      L      + +  P A      + CY  +S S  +VP +   
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFI 459

Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             G + ++L   N  ++V S    C  F   T  + I GN+ Q    V YD+    V F 
Sbjct: 460 LPGENSLQLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFS 519

Query: 368 PTDC 371
              C
Sbjct: 520 TDKC 523


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 64/156 (41%), Positives = 85/156 (54%), Gaps = 10/156 (6%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R +N  +  N  ++  +S ASQ         Y  RI +G P      V DTGSD+ W QC
Sbjct: 158 RRINGSDSTNSLTAPVTSGASQG-----AGEYFARIGVGQPVQSYFFVPDTGSDVSWLQC 212

Query: 120 EPCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFS 178
           +PC     CY Q  P+FDPK SS+Y  L C S QC  L++ +C   +C Y V YGDGSF+
Sbjct: 213 QPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSEQCHLLDEAACDANSCIYEVEYGDGSFT 272

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF 214
            G LATET +   +     ++P +  GCG +N GLF
Sbjct: 273 VGELATETFSFRHSN----SIPNLPIGCGHDNEGLF 304



 Score = 45.4 bits (106), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 52/124 (41%), Gaps = 4/124 (3%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           I++DSGTT+T +P      L      + +  P A      + CY  +S S  +VP +   
Sbjct: 400 IIVDSGTTITEIPSDVYDVLRDAFVGLTKNLPPAPGVSPFDTCYDLSSQSNVEVPTIAFI 459

Query: 310 FRGAD-VKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             G + ++L   N   +V S    C  F   T  + I GN+ Q    V YD+    V F 
Sbjct: 460 LPGENSLQLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFS 519

Query: 368 PTDC 371
              C
Sbjct: 520 TDKC 523


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/158 (41%), Positives = 88/158 (55%), Gaps = 13/158 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
            YL+ ++ GTPP E L +ADTGSDLIW QC     PP+ C  +     P F    S+T  
Sbjct: 53  QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 112

Query: 145 SLPCSSSQCASL-----NQKSCSG---VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +PCS++QC  +     +  SCS    V C Y+  Y DGS + G LA +T T+ + T   
Sbjct: 113 VVPCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 172

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            A+ G+ FGCGT N G   S T G++GLG G +S  +Q
Sbjct: 173 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQ 210



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 39/135 (28%), Positives = 67/135 (49%), Gaps = 13/135 (9%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT---GSLELCYSFNSLSQV------ 303
           VIDSG+TLT+L  G   +L+S  ++ +    +         LELCY+ +S S +      
Sbjct: 296 VIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSLAPANGG 355

Query: 304 -PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDI 359
            P +TI F +G  ++L   N+ V V++D+ C   +   +  +  + GN+MQ  + V +D 
Sbjct: 356 FPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDR 415

Query: 360 EQQTVSFKPTDCTKQ 374
               + F  T+C   
Sbjct: 416 ASARIGFARTECVAH 430


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score =  108 bits (271), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 68/152 (44%), Positives = 94/152 (61%), Gaps = 13/152 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R+ +GTP    + V DTGS L W QC PC  S C+ Q  P+F+PK SS+Y S+ CS
Sbjct: 128 NYVTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVS-CHRQSGPVFNPKASSSYTSVSCS 186

Query: 150 SSQC-----ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + QC     A+L+  SCS  N C Y  SYGD SFS G L+ +TV+ GST+     +P   
Sbjct: 187 AQQCSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSFGSTS-----VPNFY 241

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           +GCG +N GLF  ++ G++GL    +SL+ Q+
Sbjct: 242 YGCGQDNEGLFG-QSAGLIGLARNKLSLLYQL 272



 Score = 54.3 bits (129), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 38/122 (31%), Positives = 54/122 (44%), Gaps = 3/122 (2%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFR 311
           +IDSGT +T LP G  S L   ++  ++  P A     L+ C+   +   +VPEVT+ F 
Sbjct: 347 IIDSGTVITRLPTGVYSALSKAVAGAMKGTPRASAFSILDTCFQGQAARLRVPEVTMAFA 406

Query: 312 GADVKLSRS-NFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           G       + N  V V     C  F     S  I GN  Q  F V YD++   + F    
Sbjct: 407 GGAALKLAARNLLVDVDSATTCLAF-APARSAAIIGNTQQQTFSVVYDVKNSKIGFAAGG 465

Query: 371 CT 372
           C+
Sbjct: 466 CS 467


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 92/370 (24%), Positives = 152/370 (41%), Gaps = 87/370 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + IG+PP     + DTGSDL W QC PC    C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251

Query: 149 SSSQCASLNQ----KSCS--GVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
           +  +C  ++     + C     +C Y   YGD S + G+ A ET T+    STTG++   
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 198 ALPGITFGCGTNNGGL----------------FNSKTTGIVG------------------ 223
            +  + FGCG  N GL                F+S+   + G                  
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371

Query: 224 --LGGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI------ 252
             + G D  L++                       Q+++   G ++L +   +       
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431

Query: 253 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 306
               +IDSGTTL++        +       ++   + +    L  CY+ +   ++  PE 
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF 491

Query: 307 TIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 363
            I F  GA       N+F+++ + DIVC    G   S + I GN  Q NF + YD +   
Sbjct: 492 LIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSR 551

Query: 364 VSFKPTDCTK 373
           + + P  C +
Sbjct: 552 LGYAPMRCAE 561


>gi|222634868|gb|EEE65000.1| hypothetical protein OsJ_19937 [Oryza sativa Japonica Group]
          Length = 402

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 86/280 (30%), Positives = 119/280 (42%), Gaps = 45/280 (16%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G       TL  +T     +    FGC     G F++ T+G +G+  
Sbjct: 211 QYFVDYGDGRATSGRTWWTPSTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMGIE- 265

Query: 227 GDISLISQMRTTIAGNQRLGVS----TPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEA 281
                         G +RL V         V+DS   +T L P  Y +  L+  S+M   
Sbjct: 266 -------------VGGRRLNVPPVVFAGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAY 312

Query: 282 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGIT 339
             VA     L+ CY F   +   VP V++ F G  V          V  D +  + +G  
Sbjct: 313 PRVAGGRAGLDTCYDFVRFTSVTVPAVSLVFDGGAV----------VRLDAMGVMVEGCL 362

Query: 340 NSVP--------IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             VP          GN+ Q    V YD+   +V F+   C
Sbjct: 363 AFVPTPGDFALGFIGNVQQQTHEVLYDVVGGSVGFRRGAC 402


>gi|357118871|ref|XP_003561171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 506

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 125/458 (27%), Positives = 180/458 (39%), Gaps = 112/458 (24%)

Query: 17  YVVSPIEAQTGGFSVELIHRDSPKSPFYNSSET--PYQRLRDALTRSLNRLNHFN----- 69
           + VSP  + +GG    L H  SP SP     ++  P + L   L    +R  H       
Sbjct: 58  HRVSP--SSSGGSWAPLSHLHSPCSPAAGGRDSAPPPKTLSATLQWDEHRAGHIQRKLSG 115

Query: 70  -------------QNSSISSSKASQADIIPNNANYLIRISI-----------GTPPTERL 105
                        Q++ ++SS A+  ++  ++ +      I             P   + 
Sbjct: 116 NAAPMDDAGEETPQSTQVTSSPAANVNVGKSSTDSAFEQGIVPAATGPGGQKKLPGVAQS 175

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSG 163
            V DT SD+ W QC PCP  QCY Q   L+DP  S      PCSS QC SL + +  C+G
Sbjct: 176 MVVDTASDVPWVQCAPCPQPQCYAQSDVLYDPTKSILSAPFPCSSPQCRSLGRYANGCTG 235

Query: 164 VN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSK 217
                 CQY V Y DGS ++G   ++ +TL +    AV+     FGC       G FN+K
Sbjct: 236 AGNTGTCQYRVLYPDGSGTSGTYVSDLLTLNADPKGAVS--KFQFGCSHALLRPGSFNNK 293

Query: 218 TTGIVGLGGGDISLISQMRTTIA------------GNQR----LGVS---------TPD- 251
           T G + LG G  SL SQ + T +            G+ +    LGV          TP  
Sbjct: 294 TAGFMALGRGAQSLSSQTKGTFSKGNVFSYCLPPTGSHKGFLSLGVPQHAASRYAVTPML 353

Query: 252 -------------IVID-SGTTLTFLPQGYNSN-------------------LLSVMSSM 278
                        I ID +G  L   P  + +N                   L +   + 
Sbjct: 354 KSKMAPMIYMVRLIGIDVAGQRLPVPPAVFAANAAMDSRTIITRLPPTAYMALRAAFRAQ 413

Query: 279 IEAQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF 335
           + A     P G L+ CY F  +  V  P+VT+ F R A V+L  S   +       C  F
Sbjct: 414 MRAYRAVAPKGQLDTCYDFTGVPMVRLPKVTLVFDRNAAVELDPSGVMLD-----SCLAF 468

Query: 336 KGITNS-VP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
               N  +P I GN+ Q    V Y+++  +V F+   C
Sbjct: 469 APNANDFMPGIIGNVQQQTLEVLYNVDGASVGFRRAAC 506


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  108 bits (270), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 92/370 (24%), Positives = 152/370 (41%), Gaps = 87/370 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + IG+PP     + DTGSDL W QC PC    C+ Q+ P +DPK S +++++ C
Sbjct: 194 GEYFIDVFIGSPPKHFSLILDTGSDLNWIQCVPC--FDCFEQNGPYYDPKDSISFRNITC 251

Query: 149 SSSQCASLNQ----KSCS--GVNCQYSVSYGDGSFSNGNLATETVTLG---STTGQA--V 197
           +  +C  ++     + C     +C Y   YGD S + G+ A ET T+    STTG++   
Sbjct: 252 NDPRCQLVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFR 311

Query: 198 ALPGITFGCGTNNGGL----------------FNSKTTGIVG------------------ 223
            +  + FGCG  N GL                F+S+   + G                  
Sbjct: 312 RVENVMFGCGHWNRGLFHGAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLVDRDSDTSVSS 371

Query: 224 --LGGGDISLIS-----------------------QMRTTIAGNQRLGVSTPDI------ 252
             + G D  L++                       Q+++   G ++L +   +       
Sbjct: 372 KLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSADG 431

Query: 253 ----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEV 306
               +IDSGTTL++        +       ++   + +    L  CY+ +   ++  PE 
Sbjct: 432 AGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVEDFPILHPCYNVSGTDELNFPEF 491

Query: 307 TIHFR-GADVKLSRSNFFVKVSE-DIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQT 363
            I F  GA       N+F+++ + DIVC    G   S + I GN  Q NF + YD +   
Sbjct: 492 LIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSALSIIGNYQQQNFHILYDTKNSR 551

Query: 364 VSFKPTDCTK 373
           + + P  C +
Sbjct: 552 LGYAPMRCAE 561


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 107/393 (27%), Positives = 172/393 (43%), Gaps = 88/393 (22%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQAD--IIPN--NANYLIRISIGTPPTERLAVAD 109
           ++ A+ RS  RL      S++++ +    +  + P+  +  YLI+++IGTP     A+ D
Sbjct: 1   MKRAIQRSQERLEKLQITSAVNTHQMKDIETPVTPDIGSGEYLIQMAIGTPALSLSAIMD 60

Query: 110 TGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-NCQY 168
           TGSDL+WT+C PC  + C            SSTY  + C SS C   +  SC+   +C+Y
Sbjct: 61  TGSDLVWTKCNPC--TDCSTSSIYDP--SSSSTYSKVLCQSSLCQPPSIFSCNNDGDCEY 116

Query: 169 SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGD 228
              YGD S ++G L+ ET ++ S      +LP ITFGCG +N G    K  G+VG G G 
Sbjct: 117 VYPYGDRSSTSGILSDETFSISSQ-----SLPNITFGCGHDNQGF--DKVGGLVGFGRGS 169

Query: 229 ISLISQMRTTIAGNQ-----------------RLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
           +SL+SQ+  ++ GN+                  +G +        G+T        N   
Sbjct: 170 LSLVSQLGPSM-GNKFSYCLVSRTDSSKTSPLFIGNTASLEATTVGSTPLVQSSSTNHYY 228

Query: 272 LSVMSSMIEAQPVADPTGSLEL--------------------CYSFNSLSQVPEVTIHFR 311
           LS+    +  Q +A PTG+ ++                      +++++ +    +I+  
Sbjct: 229 LSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFLQQTAYDAVKEAMVSSINLP 288

Query: 312 GADVKLS----------------------------RSNF-FVKVSEDIVCSVFKGITNS- 341
            AD +L                             + N+ F   + DIVC      TNS 
Sbjct: 289 QADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVPKENYLFPDSTSDIVCLAMMP-TNSN 347

Query: 342 ---VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              + I+GN+ Q N+ + YD E   +SF PT C
Sbjct: 348 LGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 56/130 (43%), Positives = 74/130 (56%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +Y ++
Sbjct: 138 GSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYDQSGQVFDPRRSRSYGAV 195

Query: 147 PCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            CS+  C  L+   C      C Y V+YGDGS + G+ ATET+T     G  VA   I  
Sbjct: 196 GCSAPLCRRLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR--IAL 251

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 252 GCGHDNEGLF 261



 Score = 59.7 bits (143), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTI 308
           +++DSGT++T L +   S L     +      ++    SL + CY  S   + +VP V++
Sbjct: 370 VIVDSGTSVTRLARPAYSALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 429

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G A+  L   N+ + V S+   C  F G    V I GNI Q  F V +D + Q V F
Sbjct: 430 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGF 489

Query: 367 KPTDC 371
            P  C
Sbjct: 490 VPKGC 494


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 63/158 (39%), Positives = 85/158 (53%), Gaps = 13/158 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ---DSPLFDPKMSSTYK 144
            YL+ ++ GTPP E L +ADTGSDLIW QC     PP+ C  +     P F    S+T  
Sbjct: 52  QYLVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLS 111

Query: 145 SLPCSSSQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +PCS++QC  +      G        V C Y+  Y DGS + G LA +T T+ + T   
Sbjct: 112 VVPCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGG 171

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            A+ G+ FGCGT N G   S T G++GLG G +S  +Q
Sbjct: 172 AAVRGVAFGCGTRNQGGSFSGTGGVIGLGQGQLSFPAQ 209



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 67/135 (49%), Gaps = 13/135 (9%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV---ADPTGSLELCYSFNSLSQ------- 302
           VIDSG+TLT+L  G   +L+S  ++ +    +   A     LELCY+ +S S        
Sbjct: 295 VIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSSSSAPANGG 354

Query: 303 VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDI 359
            P +TI F +G  ++L   N+ V V++D+ C   +   +  +  + GN+MQ  + V +D 
Sbjct: 355 FPRLTIDFAQGLSLELPTGNYLVDVADDVKCLAIRPTLSPFAFNVLGNLMQQGYHVEFDR 414

Query: 360 EQQTVSFKPTDCTKQ 374
               + F  T+C   
Sbjct: 415 ASARIGFARTECVAH 429


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 157/360 (43%), Gaps = 82/360 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +GTP  +     DTGSD++W  C     CP     ++ +P +D   SST KS+ 
Sbjct: 85  YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDADASSTAKSVS 143

Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
           CS + C+ +NQ+S C SG  CQY + YGDGS +NG L  + V L   TG  Q  +  G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTI 203

Query: 203 TFGCGTNNGGLF---NSKTTGIVGLG----------------------------GGDISL 231
            FGCG+   G      +   GI+G G                            GG I  
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA 263

Query: 232 ISQM-----RTT----------------IAGNQRLGVST--------PDIVIDSGTTLTF 262
           I ++     +TT                  GN  L +S+          ++IDSGTTL +
Sbjct: 264 IGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLQLSSDAFDSGDDKGVIIDSGTTLVY 323

Query: 263 LPQG-YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKLS 318
           LP   YN  +  +++S  E     V D   S    +  + L + P VT  F +   + + 
Sbjct: 324 LPDAVYNPLMNQILASHQELNLHTVQD---SFTCFHYIDRLDRFPTVTFQFDKSVSLAVY 380

Query: 319 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
              +  +V ED  C  ++  G+      S+ I G++  +N LV YDIE Q + +   +C+
Sbjct: 381 PQEYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 62/163 (38%), Positives = 87/163 (53%), Gaps = 20/163 (12%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   V DT S+L W QC+PC    C+ Q  PLFDP  S +Y ++PC+
Sbjct: 119 NYVATVGLGA--AEATVVVDTASELTWVQCQPC--ESCHDQQDPLFDPSSSPSYAAVPCN 174

Query: 150 SSQCASLNQKSCSGVN-----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           SS C +L     +G +           C Y++SY DGS+S G LA + + L    GQ + 
Sbjct: 175 SSSCDALRVAMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRL---AGQDIE 231

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
             G  FGCGT+N G     T+G++GLG   +SL+SQ      G
Sbjct: 232 --GFVFGCGTSNQGAPFGGTSGLMGLGRSHVSLVSQTMDQFGG 272


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score =  108 bits (269), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 135/330 (40%), Gaps = 75/330 (22%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
           V D+ SD+ W QC PCP   C+ Q    +DP  S T  +  CSS  C +L      C+  
Sbjct: 32  VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPTSAAFSCSSPTCTALGPYANGCANN 91

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            CQY V Y DGS ++G    + +TL +  G AV+  G  FGC     G F+++  GI+ L
Sbjct: 92  QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 147

Query: 225 GGGDISLISQ-------------------------------------------------- 234
           GGG  SL+SQ                                                  
Sbjct: 148 GGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFY 207

Query: 235 ---MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
              +RT   G QRLGV+ P +     V+DS T +T LP      L +   S +     A 
Sbjct: 208 GVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAP 266

Query: 287 PTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSV 342
           P G L+ CY F  +   ++P++++ F R A + L  S           C  F     + +
Sbjct: 267 PKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDRM 321

Query: 343 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           P + G++ Q    V YD+    V F+   C
Sbjct: 322 PGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|297818124|ref|XP_002876945.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322783|gb|EFH53204.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 206

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/134 (44%), Positives = 76/134 (56%), Gaps = 9/134 (6%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSL 62
           +F   +  L+   F   S I A     +VELIHRDSP SP YN   T    L     RS+
Sbjct: 70  SFFEVILHLYTAIFCFSSTI-ANRENLTVELIHRDSPHSPLYNPHHTVSDGLNATFLRSI 128

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R   FN  + +      Q+ +I N   YL+ ISIGTPP++ LA+ADTGSDL W QC+P 
Sbjct: 129 SRSRRFNTKTDL------QSGLISNGGEYLMSISIGTPPSKVLAIADTGSDLTWVQCKPY 182

Query: 123 PPSQCYMQDSPLFD 136
              QCY Q+SPLFD
Sbjct: 183 --QQCYKQNSPLFD 194


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 93/194 (47%), Gaps = 22/194 (11%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD-------- 83
            L+HRD      ++ + T  + L   L R   R    +  +  ++               
Sbjct: 77  RLVHRDD-----FSVNATAAELLAYRLERDAKRAARLSAAAGPANGTRRGGGGVVAPVVS 131

Query: 84  -IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            +   +  Y  +I +GTP T  L V DTGSD++W QC PC   +CY Q   +FDP+ S +
Sbjct: 132 GLAQGSGEYFTKIGVGTPATPALMVLDTGSDVVWLQCAPC--RRCYEQSGQVFDPRRSRS 189

Query: 143 YKSLPCSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           Y ++ C++  C  L+   C      C Y V+YGDGS + G+ ATET+T     G  VA  
Sbjct: 190 YNAVGCAAPLCRRLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG--GARVAR- 246

Query: 201 GITFGCGTNNGGLF 214
            +  GCG +N GLF
Sbjct: 247 -VALGCGHDNEGLF 259



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 61/125 (48%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCY--SFNSLSQVPEVTI 308
           +++DSGT++T L +   S L            ++    SL + CY  S   + +VP V++
Sbjct: 368 VIVDSGTSVTRLARPAYSALRDAFRGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVSM 427

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           HF G A+  L   N+ + V S+   C  F G    V I GNI Q  F V +D + Q V+F
Sbjct: 428 HFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVAF 487

Query: 367 KPTDC 371
            P  C
Sbjct: 488 TPKGC 492


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  107 bits (267), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 151/358 (42%), Gaps = 89/358 (24%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + + +GTPP     + D GSDL+WTQC    P+    Q  P+FD   SS++  LPC S  
Sbjct: 109 LTVGVGTPPQPSKVILDLGSDLLWTQCSLVGPTA--KQLEPVFDAARSSSFSVLPCDSKL 166

Query: 153 C--ASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-N 209
           C   +   K+C+   C Y   YG  + + G LATET T G+  G +  L   TFGCG   
Sbjct: 167 CEAGTFTNKTCTDRKCAYENDYGIMT-ATGVLATETFTFGAHHGVSANL---TFGCGKLA 222

Query: 210 NGGL-------------------------------FNSKTTGIVGLGG----------GD 228
           NG +                               F  + T  V  G           G 
Sbjct: 223 NGTIAEASGILGLSPGPLSMLKQLAITKFSYCLTPFADRKTSPVMFGAMADLGKYKTTGK 282

Query: 229 ISLISQMRTTI-------------AGNQRLGVS------TPD----IVIDSGTTLTFLPQ 265
           +  I  ++  +              G++RL V        PD     V+DS TTL +L +
Sbjct: 283 VQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTLAYLVE 342

Query: 266 GYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLS----QVPEVTIHFRG-ADVKLS 318
              + L   +   I+  PVA+ +     +C+     +S    QVP + +HF G A++ L 
Sbjct: 343 PAFTELKKAVMEGIKL-PVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDGDAEMSLP 401

Query: 319 RSNFFVKVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           R N+F + S  ++C     + F+G  N   + GN+ Q N  V YD+  +  S+ PT C
Sbjct: 402 RDNYFQEPSPGMMCLAVMQAPFEGAPN---VIGNVQQQNMHVLYDVGNRKFSYAPTKC 456


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 78/235 (33%), Positives = 116/235 (49%), Gaps = 36/235 (15%)

Query: 27  GGFSV-ELIHRDSPKSPFYNSSETPYQRLRDALTR--SLN-RLNHFNQNSSISSSK---- 78
           GG +V EL H     +P  +  E     L     R  SL  R+ H+   ++ SS++    
Sbjct: 65  GGATVLELRHHSFSPAPANSREEEADALLSTDAARVSSLQGRIEHYRLTTTSSSAEVAVT 124

Query: 79  ASQADI-IPNNA-----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           AS+A + + + A     NY+  + +G    E   + DT S+L W QC PC    C+ Q  
Sbjct: 125 ASKAQVPVSSGARLRTLNYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQG 180

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN-------------CQYSVSYGDGSFSN 179
           PLFDP  S +Y ++PC S  C +L Q+  +G               C Y++SY DGS+S 
Sbjct: 181 PLFDPSSSPSYAAVPCDSPSCDALQQQLATGAGAGAPPCDAGRPAACSYALSYRDGSYSR 240

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           G LA + ++L         + G  FGCGT+N G     T+G++GLG   +SL+SQ
Sbjct: 241 GVLAHDRLSLAGEV-----IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQ 290


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 94/330 (28%), Positives = 134/330 (40%), Gaps = 75/330 (22%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
           V D+ SD+ W QC PCP   C+ Q    +DP  S +     CSS  C +L      C+  
Sbjct: 162 VLDSASDVPWVQCVPCPIPPCHPQVDSFYDPSRSPSSAPFSCSSPTCTALGPYANGCANN 221

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            CQY V Y DGS ++G    + +TL +  G AV+  G  FGC     G F+++  GI+ L
Sbjct: 222 QCQYLVRYPDGSSTSGAYIADLLTLDA--GNAVS--GFKFGCSHAEQGSFDARAAGIMAL 277

Query: 225 GGGDISLISQ-------------------------------------------------- 234
           GGG  SL+SQ                                                  
Sbjct: 278 GGGPESLLSQTASRYGNAFSYCIPATASDSGFFTLGVPRRASSRYVVTPMVRFRQAATFY 337

Query: 235 ---MRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVAD 286
              +RT   G QRLGV+ P +     V+DS T +T LP      L S   S +     A 
Sbjct: 338 GVLLRTITVGGQRLGVA-PAVFAAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAP 396

Query: 287 PTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSV 342
           P G L+ CY F  +   ++P++++ F R A + L  S           C  F     + +
Sbjct: 397 PKGYLDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILFN-----DCLAFTSNADDRM 451

Query: 343 P-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           P + G++ Q    V YD+    V F+   C
Sbjct: 452 PGVLGSVQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 54/128 (42%), Positives = 70/128 (54%), Gaps = 8/128 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y  RI +GTPP     V DTGSD++W QC PC    CY Q  P+F+P  S ++  + 
Sbjct: 126 SGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKVL 183

Query: 148 CSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C +  C  L    C+    C Y VSYGDGS++ G   TET+T   T  + VAL     GC
Sbjct: 184 CRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----GC 238

Query: 207 GTNNGGLF 214
           G +N GLF
Sbjct: 239 GHDNEGLF 246



 Score = 67.0 bits (162), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++ID GT++T L +     L     +   +   A      + CY  +  +  +VP V +H
Sbjct: 348 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLH 407

Query: 310 FRGADVKLSRSNFFVKVS-EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  SN+ + V      C  F G T+ + I GNI Q  F V YD+    V F P
Sbjct: 408 FRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSP 467

Query: 369 TDC 371
             C
Sbjct: 468 RGC 470


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 72/239 (30%), Positives = 119/239 (49%), Gaps = 26/239 (10%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           Q  G ++ELIH+DSP+SP Y  +  P +++          L+H  Q S +S++KA    +
Sbjct: 10  QLDGLTMELIHKDSPQSPLYPGNLPPGEQILQPAACPFAGLHH--QTSMMSTNKAVMNRM 67

Query: 85  IPNNANY------LIRISIGT--PPTERLAVA------DTGSDLIWTQCEPC--PPSQCY 128
           +    +Y      L ++ +G+    + R          DTG++L W QCE C    + C+
Sbjct: 68  MSPLTSYGDPFLFLAQVGVGSFQEKSHRTHFKTYYFQIDTGNELSWIQCEGCQNKGNMCF 127

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
               P +    S +YK + C+       NQ  C    C Y+V+YG GS+++GNLA ET T
Sbjct: 128 PHKDPPYTSSQSKSYKPVSCNQHSFCEPNQ--CKEGLCAYNVTYGPGSYTSGNLANETFT 185

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLF------NSKTTGIVGLGGGDISLISQMRTTIAG 241
             S  G+  AL  I+FGC T++  +        +  +G++G+G G  S ++Q+ +   G
Sbjct: 186 FYSNHGKHTALKSISFGCSTDSRNMIYAFLLDKNPVSGVLGMGWGPRSFLAQLGSISHG 244


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 81/245 (33%), Positives = 115/245 (46%), Gaps = 34/245 (13%)

Query: 8   VFILFFLCFYVVSPIEAQTG-GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN 66
           VF+L  LCF       + TG G  ++L H D        +  T  +R+R A+  S  RL 
Sbjct: 6   VFLLVLLCFRASLVTSSSTGAGLRMKLTHVDD------KAGYTTEERVRRAVAVSRERLA 59

Query: 67  HFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPS 125
           +  Q   + +S    A +      Y+    IG PP    A+ DTGS+LIWTQC   C   
Sbjct: 60  YTQQQQQLRASGDVSAPVHLATRQYIAEYLIGDPPQRAAALIDTGSNLIWTQCGTTCGLK 119

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGN 181
            C  QD P ++   SST+ ++PC+ S   CA+     C G++  C ++ SYG GS   G+
Sbjct: 120 ACAKQDLPYYNLSRSSTFAAVPCADSAKLCAANGVHLC-GLDGSCTFAASYGAGSV-FGS 177

Query: 182 LATETVTLGSTTGQAVALPGITFGC--------GTNNGGLFNSKTTGIVGLGGGDISLIS 233
           L TE  T  S   +      + FGC        G  NG       +G++GLG G +SL+S
Sbjct: 178 LGTEAFTFQSGAAK------LGFGCVSLTRITKGALNG------ASGLIGLGRGRLSLVS 225

Query: 234 QMRTT 238
           Q   T
Sbjct: 226 QTGAT 230



 Score = 47.8 bits (112), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 71/144 (49%), Gaps = 13/144 (9%)

Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE---AQPVADPTGS 290
           ++R   AG    GV     +ID+G+ +T L +   S L   ++  +     QP AD TG 
Sbjct: 304 ELRRVAAGYWSGGV-----IIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD-TG- 356

Query: 291 LELCYSFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
           L+LC +   + +V P +  HF G AD+ +S  +++  V +   C + +       I GN 
Sbjct: 357 LDLCVARQDVDKVVPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEGGYETVI-GNF 415

Query: 349 MQTNFLVGYDIEQQTVSFKPTDCT 372
            Q +  + YDI +  +SF+  DC+
Sbjct: 416 QQQDVHLLYDIGKGELSFQTADCS 439


>gi|21717175|gb|AAM76368.1|AC074196_26 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433292|gb|AAP54830.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 418

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 150/360 (41%), Gaps = 86/360 (23%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 67  NVANF----TIGTPPQPASAIIDVAGELVWTQCSMC--SRCFKQDLPLFVPNASSTFRPE 120

Query: 147 PCSSSQCASLNQKSCSGVNCQY--SVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           PC +  C S+   +CS   C Y  +++   G  + G +AT+T  +G+ T        + F
Sbjct: 121 PCGTDACKSIPTSNCSSNMCTYEGTINSKLGGHTLGIVATDTFAIGTATAS------LGF 174

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI----------AGNQRL--------- 245
           GC   +G       +G++GLG    SL+SQM  T             N RL         
Sbjct: 175 GCVVASGIDTMGGPSGLIGLGRAPSSLVSQMNITKFSYCLTPHDSGKNSRLLLGSSAKLA 234

Query: 246 ----GVSTPDIV-----------------IDSGTTLTFLPQGYNSNLLSVMSSM------ 278
                 +TP +                  I +G     LP   N+ L+  ++ M      
Sbjct: 235 GGGNSTTTPFVKTSPGDDMSQYYPIQLDGIKAGDAAIALPPSGNTVLVQTLAPMSFLVDS 294

Query: 279 ------------IEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHFR--GADVKLSRSNF 322
                       + A P A P    +LC+    LS    P++   F+   A + +    +
Sbjct: 295 AYQALKKEVTKAVGAAPTATPLQPFDLCFPKAGLSNASAPDLVFTFQQGAAALTVPPPKY 354

Query: 323 FVKVSED--IVCSVF--------KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            + V E+   VC             +  ++ I G++ Q N     D+E++T+SF+P DC+
Sbjct: 355 LIDVGEEKGTVCMAILSTSWLNTTALDENLNILGSLQQENTHFLLDLEKKTLSFEPADCS 414


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 59/138 (42%), Positives = 82/138 (59%), Gaps = 13/138 (9%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-----ASLNQKSC 161
           + DTGS L W QC+PC    C+ Q  PL+DP +S TYK L C+S +C     A+LN   C
Sbjct: 2   ILDTGSSLSWLQCQPCA-VYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 162 SGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTT 219
              +  C Y+ SYGD SFS G L+ + +TL S+      LP  T+GCG +N GLF  +  
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTLTSSQ----TLPQFTYGCGQDNQGLFG-RAA 115

Query: 220 GIVGLGGGDISLISQMRT 237
           GI+GL    +S+++Q+ T
Sbjct: 116 GIIGLARDKLSMLAQLST 133



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 39/125 (31%), Positives = 61/125 (48%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-LELCY--SFNSLSQVPEVTIH 309
           +IDSGT +T LP    + L      ++  +    P  S L+ C+  S  S+S VPE+ + 
Sbjct: 207 LIDSGTVITRLPMSMYAALRQAFVKIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMI 266

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           F+G AD+ L   +  ++  + I C  F G   TN + I GN  Q  + + YD+    + F
Sbjct: 267 FQGGADLTLRAPSILIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGF 326

Query: 367 KPTDC 371
            P  C
Sbjct: 327 APGSC 331


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 54/129 (41%), Positives = 70/129 (54%), Gaps = 8/129 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  RI +GTPP     V DTGSD++W QC PC    CY Q  P+F+P  S ++  +
Sbjct: 38  GSGEYFTRIGVGTPPKYVYMVLDTGSDIVWLQCAPC--KNCYSQTDPVFNPVKSGSFAKV 95

Query: 147 PCSSSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C +  C  L    C+    C Y VSYGDGS++ G   TET+T   T  + VAL     G
Sbjct: 96  LCRTPLCRRLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFRRTKVEQVAL-----G 150

Query: 206 CGTNNGGLF 214
           CG +N GLF
Sbjct: 151 CGHDNEGLF 159



 Score = 67.8 bits (164), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 41/123 (33%), Positives = 58/123 (47%), Gaps = 3/123 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           ++ID GT++T L +     L     +   +   A      + CY  +  +  +VP V +H
Sbjct: 261 VIIDCGTSVTRLNKPAYIALRDAFRAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLH 320

Query: 310 FRGADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           FRGADV L  SN+ + V      C  F G T+ + I GNI Q  F V YD+    V F P
Sbjct: 321 FRGADVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSP 380

Query: 369 TDC 371
             C
Sbjct: 381 RGC 383


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/212 (31%), Positives = 95/212 (44%), Gaps = 7/212 (3%)

Query: 30  SVELIHRDS--PKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           +++LI R+S    +P      TP   ++     S  R  +  QNS +    +S   +  +
Sbjct: 2   AMKLIRRESVVRHNPDARVPVTPEDHIQHMTDISSARFKYL-QNSIVKELGSSDFQVDVH 60

Query: 88  NAN----YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
            A     + +  S+G PP  +  + DTGS L+W QC PC          P+F+P +SST+
Sbjct: 61  QAIKTSLFFVNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTF 120

Query: 144 KSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
               C    C       CS   C Y   Y  G+ S G LA E +T  +  G  V    I 
Sbjct: 121 VECSCDDRFCRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIA 180

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           FGCG  NG    S+ TGI+GLG    SL  Q+
Sbjct: 181 FGCGHENGEQLESEFTGILGLGAKPTSLAVQL 212


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 60/138 (43%), Positives = 81/138 (58%), Gaps = 10/138 (7%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           SS + QA +      Y + IS+GTP      VADTGSDLIWTQC PC  ++C+ Q +P F
Sbjct: 71  SSVSFQALLENGVGGYNMNISVGTPLLTFSVVADTGSDLIWTQCAPC--TKCFQQPAPPF 128

Query: 136 DPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            P  SST+  LPC+SS C  L  + ++C+   C Y+  YG G ++ G LATET+ +G   
Sbjct: 129 QPASSSTFSKLPCTSSFCQFLPNSIRTCNATGCVYNYKYGSG-YTAGYLATETLKVGD-- 185

Query: 194 GQAVALPGITFGCGTNNG 211
               + P + FGC T NG
Sbjct: 186 ---ASFPSVAFGCSTENG 200


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 97/339 (28%), Positives = 139/339 (41%), Gaps = 80/339 (23%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCS 162
           V DT SD+ W QC PCP   C+ Q   L+DP  SS+  + PCSS  C +L    N  + +
Sbjct: 159 VIDTASDVPWVQCAPCPAPHCHAQTDVLYDPSKSSSSAAFPCSSPACRNLGPYANGCTPA 218

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTG 220
           G  CQY V Y DGS S G   ++ +TL      A A+    FGC       G F++KT+G
Sbjct: 219 GDQCQYRVQYPDGSASAGTYISDVLTLNPAK-PASAISEFRFGCSHALLQPGSFSNKTSG 277

Query: 221 IVGLGGGDISLISQMRTT-------------------IAGNQRLGVS----TPDI----- 252
           I+ LG G  SL +Q + T                   I G  R+  S    TP +     
Sbjct: 278 IMALGRGAQSLPTQTKATYGDVFSYCLPPTPVHSGFFILGVPRVAASRYAVTPMLRSKAA 337

Query: 253 -----------------------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP 283
                                        V+DS T +T LP      L +   + + A  
Sbjct: 338 PMLYLVRLIAIEVAGKRLPVPPAVFAAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYR 397

Query: 284 VADPTGSLELCYSFNSLS-------QVPEVTIHFRGAD--VKLSRSNFFVKVSEDIVCSV 334
            A P   L+ CY F+  +       ++P++T+ F G +  V+L  S   +       C  
Sbjct: 398 AAAPKEHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLD-----GCLA 452

Query: 335 FKGITNS--VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           F   T+     I GN+ Q    V Y+++  TV F+   C
Sbjct: 453 FAPNTDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score =  105 bits (261), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 93/332 (28%), Positives = 131/332 (39%), Gaps = 78/332 (23%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCS---- 162
           DT  D+ W QC PCP  QCY Q  PLFDP  SST  ++ C S  C SL      CS    
Sbjct: 153 DTTVDVPWIQCAPCPIPQCYPQRDPLFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSA 212

Query: 163 GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
              C+Y + Y D   + G   T+T+T+  TT    A+    FGC     G F+  T G +
Sbjct: 213 NAECRYLIEYSDDRATAGTYMTDTLTISGTT----AVRNFRFGCSHAVRGRFSDLTAGTM 268

Query: 223 GLGGGDISLISQ-----------------------------------------MRTTI-- 239
            LGGG  SL++Q                                         +R+ I  
Sbjct: 269 SLGGGAQSLLAQTARSLGNAFSYCVPQASASGFLSIGGPATTNSTTVFATTPLVRSAINP 328

Query: 240 -----------AGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV 284
                         +RLG+     +   V+DS   +T LP      L     + + A P 
Sbjct: 329 SLYLVRLQGIVVAGRRLGIPPVAFSAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPR 388

Query: 285 ADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNS 341
           +  TG+L+ CY F  L+  +VP V++ F  GA V L      +       C  F   ++ 
Sbjct: 389 SGATGTLDTCYDFLGLTNVRVPAVSLVFGGGAVVVLDPPAVMIG-----GCLAFTATSSD 443

Query: 342 VPI--YGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + +   GN+ Q    V YD+    V F+   C
Sbjct: 444 LALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 59/154 (38%), Positives = 79/154 (51%), Gaps = 6/154 (3%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  G+P        DTGSD+ W QC PC    CY Q  P+FDP  S+TY ++
Sbjct: 157 DTLEFVVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCS-GHCYKQHDPVFDPTKSATYSAV 215

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC   QCA+   K  +   C Y V+YGDGS + G L+ ET++L ST      LPG  FGC
Sbjct: 216 PCGHPQCAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGC 271

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
           G  N G F      +    G  +SL SQ   T  
Sbjct: 272 GQTNLGEFGGVDGLVGLGRGA-LSLPSQAAATFG 304


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 161/384 (41%), Gaps = 84/384 (21%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            RL  F  + ++S+++    D +  N  Y  R+ IGTPP +   + DTGS + +  C  C
Sbjct: 55  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
              QC     P FDP+ SSTYK + C+    C S       GV C Y   Y + S S+G 
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166

Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           L  + ++ G+   Q+  +P    FGC     G LF+ +  GI+GLG GD+SL+ Q+    
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223

Query: 240 AGNQRL-----------------GVSTP-DIV---------------------------- 253
           A N                    G+S P D++                            
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283

Query: 254 ------------IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 296
                       +DSGTT  +LP + +++   ++M  +   + +  P  +  ++C+S   
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343

Query: 297 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 349
                LS + P V + F  G  + L+  N+F    KV       +F+   +   + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
             N LV YD     + F  T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 110/219 (50%), Gaps = 12/219 (5%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQA----- 82
           GF+  LIH DSP SPFYN + T   R+   + RS +RLN+    + +S +          
Sbjct: 7   GFTARLIHHDSPLSPFYNHTMTDTARIEATVHRSRSRLNYLYYINKLSENALDNDVSLSP 66

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS- 141
            ++     YL+  +IG P ++ +   DT + LIW QC  C  SQC  +   L    +SS 
Sbjct: 67  TLVNEGGEYLMSFNIGNPSSQVMGFLDTSNGLIWVQCSNC-NSQCEPEKRGLTTKFLSSK 125

Query: 142 --TYKSLPCSSSQCASLNQ-KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
             TY+  PC S+ C SL   ++C+  +  C+Y + YGD   ++G L++++    ++ G  
Sbjct: 126 SFTYEMEPCGSNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFGFDTSDGML 185

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           V +  + FGC            TG VGL    +SLISQ+
Sbjct: 186 VDVGFLNFGCSEAPLTGDEQSYTGNVGLNQTPLSLISQL 224



 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 46/124 (37%), Positives = 66/124 (53%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIE-AQPVADPTGSLELCYSF---NSLSQVPEVTI 308
           +ID+G T + L      +LL+   ++ +  Q   DP    ELC+     N L   P+VT+
Sbjct: 298 IIDTGITYSSLETDAFDSLLAKFLTLKDFPQRKDDPKERFELCFELQNANDLESFPDVTV 357

Query: 309 HFRGADVKLSRSNFFVKVSED-IVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           HF GAD+ L+  + FVK+ +D I C       + V I GN    N+ VGYD+E Q +SF 
Sbjct: 358 HFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVISFA 417

Query: 368 PTDC 371
           P DC
Sbjct: 418 PVDC 421


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 161/384 (41%), Gaps = 84/384 (21%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
            RL  F  + ++S+++    D +  N  Y  R+ IGTPP +   + DTGS + +  C  C
Sbjct: 55  RRLRQFPTSDNLSNARMRLYDDLLLNGYYTTRLWIGTPPQQFALIVDTGSTVTYVPCSTC 114

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
              QC     P FDP+ SSTYK + C+    C S       GV C Y   Y + S S+G 
Sbjct: 115 --EQCGRHQDPKFDPESSSTYKPIKCNIDCICDS------DGVQCVYERQYAEMSTSSGV 166

Query: 182 LATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           L  + ++ G+   Q+  +P    FGC     G LF+ +  GI+GLG GD+SL+ Q+    
Sbjct: 167 LGEDVISFGN---QSELIPQRAVFGCENMETGDLFSQRADGIMGLGTGDLSLVDQLVEKG 223

Query: 240 AGNQRL-----------------GVSTP-DIV---------------------------- 253
           A N                    G+S P D++                            
Sbjct: 224 AINDSFSLCYGGMDIGGGAMVLGGISPPSDMIFTYSDPVRSPYYNVDLKEIHVAGKKLPL 283

Query: 254 ------------IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--- 296
                       +DSGTT  +LP + +++   ++M  +   + +  P  +  ++C+S   
Sbjct: 284 SSGIFDGRYGAVLDSGTTYAYLPAEAFSAFKDAIMDEIHSLKKIDGPDPNFKDICFSGAG 343

Query: 297 --FNSLS-QVPEVTIHFR-GADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIM 349
                LS + P V + F  G  + L+  N+F    KV       +F+   +   + G I+
Sbjct: 344 SDAAELSNKFPTVDMVFENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIV 403

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
             N LV YD     + F  T+C++
Sbjct: 404 VRNTLVMYDRANSKIGFWKTNCSE 427


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++
Sbjct: 118 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 175

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  
Sbjct: 176 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 231

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 232 GCGHDNEGLF 241



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
           +++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++
Sbjct: 351 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 410

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           H  G A V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F
Sbjct: 411 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 470

Query: 367 KPTDC 371
            P  C
Sbjct: 471 VPKSC 475


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++
Sbjct: 118 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 175

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  
Sbjct: 176 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 231

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 232 GCGHDNEGLF 241



 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
           +++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++
Sbjct: 351 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 410

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           H  G A V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F
Sbjct: 411 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 470

Query: 367 KPTDC 371
            P  C
Sbjct: 471 VPKSC 475


>gi|147794033|emb|CAN68918.1| hypothetical protein VITISV_035156 [Vitis vinifera]
          Length = 398

 Score =  104 bits (259), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 97/353 (27%), Positives = 145/353 (41%), Gaps = 55/353 (15%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 88  EIXGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTPP     + DTGS + WTQC+ C    C       FB   SSTY    C   
Sbjct: 129 LVDVAFGTPPQXFXLILDTGSSITWTQCKAC--VNCLQDSXRYFBXSASSTYSXGSCIPX 186

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN    T+TL  +           FG G NN 
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCXTMTLEPSD----VFQKFQFGXGRNNK 231

Query: 212 GLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
           G F S   G++GLG G +S +SQ  +     +      P+   DS  +L F  +  + + 
Sbjct: 232 GDFGSGADGMLGLGQGQLSTVSQTASKFX--KVFSYCLPE--EDSIGSLLFGEKATSQSS 287

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLSQV------PEVTIHF-RGADVKLSRSNFFV 324
               +S++        +G  E  Y F  L  +      PE+ +HF  GADV+L+ +N   
Sbjct: 288 SLKFTSLVNGPGT---SGLXESGYYFVKLLDISVDVLLPEIVLHFGGGADVRLNGTNIVW 344

Query: 325 KVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                 +C  F G + S     + I GN  Q +  V YDI+   + F+   C+
Sbjct: 345 GSDASRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 397


>gi|296082634|emb|CBI21639.3| unnamed protein product [Vitis vinifera]
          Length = 278

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 95/191 (49%), Gaps = 37/191 (19%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
           F V L H DS        + T ++RL+ A+ R   RL   +  ++ S   + +A +   N
Sbjct: 35  FRVSLRHVDS------GGNYTKFERLQRAMKRGKLRLQRLSAKTA-SFESSVEAPVHAGN 87

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             +L++++IGTP     A+ DTGSDLIWTQC+PC    C+ Q +P+FDPK SS++  LPC
Sbjct: 88  GEFLMKLAIGTPAETYSAIMDTGSDLIWTQCKPC--KDCFDQPTPIFDPKKSSSFSKLPC 145

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
           SS    S  Q                     G LATET   G  +     +  I FGCG 
Sbjct: 146 SSDLYYSSTQ---------------------GVLATETFAFGDAS-----VSKIGFGCGE 179

Query: 209 NNGGLFNSKTT 219
           +N G  NS TT
Sbjct: 180 DNDG--NSGTT 188


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 49/130 (37%), Positives = 71/130 (54%), Gaps = 8/130 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +  Y  ++ +GTP T  L V DTGSD++W QC PC    CY Q   +FDP+ S +Y ++
Sbjct: 124 GSGEYFAQVGVGTPATTALMVLDTGSDVVWLQCAPC--RHCYAQSGRVFDPRRSRSYAAV 181

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  +  
Sbjct: 182 DCVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTF----ARGARVQRVAI 237

Query: 205 GCGTNNGGLF 214
           GCG +N GLF
Sbjct: 238 GCGHDNEGLF 247



 Score = 51.2 bits (121), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 57/125 (45%), Gaps = 5/125 (4%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSF--NSLSQVPEVTI 308
           +++DSGT++T L +     +     +      V+    SL + CY+     + +VP V++
Sbjct: 357 VILDSGTSVTRLARPVYEAVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSM 416

Query: 309 HFRG-ADVKLSRSNFFVKV-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           H  G A V L   N+ + V +    C    G    V I GNI Q  F V +D + Q V F
Sbjct: 417 HLAGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGF 476

Query: 367 KPTDC 371
            P  C
Sbjct: 477 VPKSC 481


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 52/126 (41%), Positives = 72/126 (57%), Gaps = 6/126 (4%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            +++ +  G+P      + DTGSDL W QC+PC    CY Q  P+FDP  SS+Y  +PC 
Sbjct: 111 EFVVVVGFGSPAQTSATMFDTGSDLSWIQCQPCS-GHCYKQHDPVFDPAKSSSYAVVPCG 169

Query: 150 SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           +++CA+   + C+G  C Y V YGDGS + G LA ET+T  S++       G  FGCG  
Sbjct: 170 TTECAAAGGE-CNGTTCVYGVEYGDGSSTTGVLARETLTFSSSS----EFTGFIFGCGET 224

Query: 210 NGGLFN 215
           N G F 
Sbjct: 225 NLGDFG 230



 Score = 44.7 bits (104), Expect = 0.067,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 54/127 (42%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           ++DSGT LT+LP    + L       ++    A P   L+ CY F   S +    + F  
Sbjct: 326 LLDSGTILTYLPPPAYTALRDRFKFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNF 385

Query: 313 ADVKLSRSNFF--VKVSED----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
           +D  +   NFF  +   +D    + C  F      +P  + G+  Q +  V YD+  Q +
Sbjct: 386 SDGAVFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKI 445

Query: 365 SFKPTDC 371
            F P  C
Sbjct: 446 GFIPASC 452


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 114/231 (49%), Gaps = 25/231 (10%)

Query: 74  ISSSKASQADIIPNNANYLIRISIGTPPTERLAVA-DTGSDLIWTQCEPCPPSQCYMQDS 132
           ++      ADI   ++ YLI +SIGTP  +R+A+  DTGSDL+WTQC  C    C+ Q  
Sbjct: 86  LARGTVGDADI---DSEYLIHLSIGTPRPQRVALTLDTGSDLVWTQCA-C--HVCFAQPF 139

Query: 133 PLFDPKMSSTYKSLPCSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
           P FD   S T  ++PCS   C S    L+  + +   C Y   Y D S ++G +  +T T
Sbjct: 140 PTFDALASQTTLAVPCSDPICTSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFT 199

Query: 189 L-------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                   GS     VA+P + FGCG  N G+F S  +GI G   G +SL SQ++     
Sbjct: 200 FRSPQGNNGSKAHAGVAVPNVRFGCGQYNKGIFKSNESGIAGFSRGPMSLPSQLKVARFS 259

Query: 242 NQRLGVSTPDIVIDSGTTLTFLPQGYN-SNLLSVMSSMIEAQPVADPTGSL 291
           +    ++      D+ T+  FL       NL +  +  +++ P A+  GSL
Sbjct: 260 HCFTAIA------DARTSPVFLGGAPGPDNLGAHATGPVQSTPFANSNGSL 304



 Score = 48.5 bits (114), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 63/140 (45%), Gaps = 20/140 (14%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE---LCYS---------FNSL 300
           +IDSGT +  LP     +L +   + ++  PVA+ + +     LC+              
Sbjct: 339 IIDSGTGIRTLPGPMYRSLRAAFVARVKL-PVANESAADAESTLCFEAARSASLPPEAPA 397

Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKVSEDI------VCSVFKGITNS-VPIYGNIMQTNF 353
             +P+V +H  GAD  L R ++ + + ED       +C V     +S + I GN  Q N 
Sbjct: 398 PALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGNFQQQNM 457

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
            V YD+E+  + F P  C K
Sbjct: 458 HVAYDLEKNKLVFVPARCDK 477


>gi|413938618|gb|AFW73169.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 324

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/333 (30%), Positives = 132/333 (39%), Gaps = 82/333 (24%)

Query: 109 DTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN---QKSCSGV 164
           DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC    CA L      +CS  
Sbjct: 4   DTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGGPVCAGLGIYAASACSAA 63

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C Y VSYGDGS + G  +++T+TL +++    A+ G  FGCG    GLFN    G++GL
Sbjct: 64  QCGYVVSYGDGSNTTGVYSSDTLTLSASS----AVQGFFFGCGHAQSGLFNG-VDGLLGL 118

Query: 225 GGGDISLISQ--------------MRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
           G    SL+ Q               + + AG   LGV  P       +T   LP      
Sbjct: 119 GREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAAPGFSTTQLLPSPNAPT 178

Query: 271 LLSVMSSMIE-----------------------------------------------AQP 283
              VM + I                                                  P
Sbjct: 179 YYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYAALRSAFRSGMASYGYP 238

Query: 284 VADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFK--GI 338
            A   G L+ CY+F     V  P V + F  GA V L              C  F   G 
Sbjct: 239 TAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL-----SFGCLAFAPSGS 293

Query: 339 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 294 DGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 324


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 144/368 (39%), Gaps = 96/368 (26%)

Query: 83  DIIPNN------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
           D  PNN       N+L+ ++ GTPP +   + DTGS + WTQC+PC   +C       FD
Sbjct: 148 DHTPNNKLFDEDGNFLVDVAFGTPPQKFTLILDTGSSITWTQCKPC--VRCLKASRRHFD 205

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  S TY    C  S            V   Y+++YGD S S GN   +T+TL      +
Sbjct: 206 PSASLTYSLGSCIPST-----------VGNTYNMTYGDKSTSVGNYGCDTMTLE----HS 250

Query: 197 VALPGITFGCGTNNGG-------------------------------------------- 212
              P   FGCG NN G                                            
Sbjct: 251 DVFPKFQFGCGRNNEGDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSL 310

Query: 213 LFNSKTT---------------GIVGLGGGDISLISQMRTTIAGNQRLGV-----STPDI 252
           LF  K T               G  GL       +  +  ++ GN+RL +     ++P  
Sbjct: 311 LFGEKATSQSSSLKFTSLVNGPGTSGLEESGYYFVKLLDISV-GNKRLNIPSSVFASPGT 369

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----LELCYSFNSLSQV--PEV 306
           +IDSGT +T LPQ   S L +     +   P+++        L+ CY+ +    V  PE+
Sbjct: 370 IIDSGTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEI 429

Query: 307 TIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            +HF  GADV+L+            +C  F G  + + I GN  Q +  V YDI+   + 
Sbjct: 430 VLHFGEGADVRLNGKRVIWGNDASRLCLAFAG-NSELTIIGNRQQVSLTVLYDIQGGRIG 488

Query: 366 FKPTDCTK 373
           F    C+K
Sbjct: 489 FGGNGCSK 496


>gi|125595855|gb|EAZ35635.1| hypothetical protein OsJ_19925 [Oryza sativa Japonica Group]
          Length = 335

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 76/281 (27%), Positives = 133/281 (47%), Gaps = 27/281 (9%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 75  GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134

Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ C +   CQ+ ++Y +G+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLS 273
           F+    G + LGGG  S + Q  +    ++      P      G  +  +P    + + +
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQTASQY--SRVFSYCVPPSTSSFGFIMFGVPPQRAALVPT 248

Query: 274 VMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVC 332
            +S+ + +     PT        F S++ +P + + F  GA V L  +   ++      C
Sbjct: 249 FVSTPLLSSSTMSPT--------FYSIT-LPSIALVFDGGATVNLDAAGILLQ-----GC 294

Query: 333 SVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F    ++ +P + GN+ Q    V YD+  + + F+   C
Sbjct: 295 LAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 335


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/231 (32%), Positives = 106/231 (45%), Gaps = 29/231 (12%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLN---------HFNQNSS 73
           ++ + G +V L HR  P SP   S +       + L R   R N         H+ +   
Sbjct: 52  DSSSSGATVPLNHRHGPCSPV-PSGKKKQPTFTELLRRDQLRANYIQRQFSDEHYPRTGG 110

Query: 74  ISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           +  S+A+       + N   Y+I +SIG+P        DTGSD+ W +C+          
Sbjct: 111 LQQSEATVPIALGSLLNTLEYVITVSIGSPAVAXTMFIDTGSDVSWLRCK---------- 160

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYSVSYGDGSFSNGNLATETV 187
            S L+DP  SSTY    CS+  CA L ++     SG  C YSV YGDGS + G   ++T+
Sbjct: 161 -SRLYDPGTSSTYAPFSCSAPACAQLGRRGTGCSSGSTCVYSVKYGDGSNTTGTYGSDTL 219

Query: 188 TLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           TL  T+   ++  G  FGC     G     T G++GLGG   S +SQ   T
Sbjct: 220 TLAGTSEPLIS--GFQFGCSAVEHGFEEDNTDGLMGLGGDAQSFVSQTAAT 268



 Score = 45.4 bits (106), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 38/129 (29%), Positives = 53/129 (41%), Gaps = 15/129 (11%)

Query: 253 VIDSGTTLTFLP-QGYNSNLLSVMSSMI--EAQPVADPTGSLELCYSFNSLSQ-----VP 304
           ++DSGT +T LP   Y +   +    M   + QP A P G L+ C+ F    +     VP
Sbjct: 341 IVDSGTVITRLPPTAYGALSAAFRDGMARYQYQPAA-PRGLLDTCFDFTGHGEGNNFTVP 399

Query: 305 EVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQ 362
            V +   G  V     N  V+      C  F    +     I GN+ Q  F V YD+ Q 
Sbjct: 400 SVALVLDGGAVVDLHPNGIVQDG----CLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQS 455

Query: 363 TVSFKPTDC 371
              F+P  C
Sbjct: 456 VFGFRPGAC 464


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 86/161 (53%), Gaps = 19/161 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC  + C+ Q  PLFDP  S +Y  LPC+
Sbjct: 126 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 181

Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           SS C +L   + S           +C Y++SY DGS+S G LA + ++L         + 
Sbjct: 182 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 236

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           G  FGCGT+N G F   T+G++GLG   +SLISQ      G
Sbjct: 237 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 276


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 100/348 (28%), Positives = 134/348 (38%), Gaps = 80/348 (22%)

Query: 94  RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           R S   P   +L + DT SD+ W QC PCP SQCY Q   L+DP  S + +S  CSS  C
Sbjct: 172 RRSRLRPGVRQLMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTC 231

Query: 154 ASL-------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
             L       +  S S   CQY V Y DGS ++G L  + ++L  T+     +P   FGC
Sbjct: 232 RQLGPYANGCSSSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTS----QVPKFEFGC 287

Query: 207 GTNNGGLFN-SKTTGIVGLGGGDISLISQMRT---------------------------- 237
                G F+ SKT GI+ LG G  SL+SQ  T                            
Sbjct: 288 SHAARGSFSRSKTAGIMALGRGVQSLVSQTSTKYGQVFSYCFPPTASHKGFFVLGVPRRS 347

Query: 238 -----------------------TIAGNQRLGVS----TPDIVIDSGTTLTFLPQGYNSN 270
                                   +AG QRL V          +DS T +T LP      
Sbjct: 348 SSRYAVTPMLKTPMLYQVRLEAIAVAG-QRLDVPPTVFAAGAALDSRTVITRLPPTAYQA 406

Query: 271 LLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFR--GADVKLSRSNFFVKV 326
           L S     +     A   G L+ CY F  +S +  P +++ F   GA V+L  S      
Sbjct: 407 LRSAFRDKMSMYRPAAANGQLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLDPSGVLFG- 465

Query: 327 SEDIVCSVF---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                C  F    G   +  I G +      V Y++   +V F+   C
Sbjct: 466 ----SCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|125553836|gb|EAY99441.1| hypothetical protein OsI_21409 [Oryza sativa Indica Group]
          Length = 376

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 55/144 (38%), Positives = 81/144 (56%), Gaps = 8/144 (5%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 155 GTSAVRQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYSAVPCSSAACARLG 214

Query: 158 --QKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ CS  V CQ+  +Y DG+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 215 PYRRGCSANVQCQFGFTYTDGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADRGST 270

Query: 214 FNSKTTGIVGLGGGDISLISQMRT 237
           F+   +G + LGGG  S + Q  T
Sbjct: 271 FSFDVSGTLALGGGAQSFVQQTAT 294


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 62/161 (38%), Positives = 86/161 (53%), Gaps = 19/161 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC  + C+ Q  PLFDP  S +Y  LPC+
Sbjct: 125 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ASCHDQQGPLFDPASSPSYAVLPCN 180

Query: 150 SSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           SS C +L   + S           +C Y++SY DGS+S G LA + ++L         + 
Sbjct: 181 SSSCDALQVATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGEV-----ID 235

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
           G  FGCGT+N G F   T+G++GLG   +SLISQ      G
Sbjct: 236 GFVFGCGTSNQGPFGG-TSGLMGLGRSQLSLISQTMDQFGG 275


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 72/132 (54%), Gaps = 6/132 (4%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 167 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 226

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G + LGG
Sbjct: 227 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 282

Query: 227 GDISLISQMRTT 238
           G  SL+SQ   T
Sbjct: 283 GRQSLLSQTAAT 294


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 151/362 (41%), Gaps = 81/362 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W     C+ CP       D  L+DPK SS+  ++ 
Sbjct: 87  YYTKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVS 146

Query: 148 CSSSQCASLNQKS------CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---A 198
           C +  CA+            +G  C+Y   YGDGS + G+  ++++     +G A    A
Sbjct: 147 CDNKFCAATYGSGEKLPGCTAGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTRHA 206

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGN 242
              + FGCG   GG     N    GI+G G  + S +SQ+ +             TI G 
Sbjct: 207 KANVIFGCGAQQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTIKGG 266

Query: 243 QRLGV---------STP---------------DI--------------------VIDSGT 258
               +         STP               D+                    +IDSGT
Sbjct: 267 GIFAIGEVVQPKVKSTPLLPNMSHYNVNLQSIDVAGNALQLPPHIFETSEKRGTIIDSGT 326

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
           TLT+LP+    ++L+ +    +        G L   YS +     P++T HF   D+ L+
Sbjct: 327 TLTYLPELVYKDILAAVFQKHQDITFRTIQGFLCFEYSESVDDGFPKITFHFE-DDLGLN 385

Query: 319 R--SNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
               ++F +  +++ C  F+           + + G+++ +N +V YD+E+Q + +   +
Sbjct: 386 VYPHDYFFQNGDNLYCLGFQNGGFQPKDAKDMVLLGDLVLSNKVVVYDLEKQVIGWTDYN 445

Query: 371 CT 372
           C+
Sbjct: 446 CS 447


>gi|357143328|ref|XP_003572882.1| PREDICTED: uncharacterized protein LOC100846829 [Brachypodium
           distachyon]
          Length = 836

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 119/425 (28%), Positives = 173/425 (40%), Gaps = 97/425 (22%)

Query: 31  VELIHRDSPKSPFYNSSETPY--------QRLRDALTRSLNRLNHFNQNSSISSSKASQA 82
           + L HR  P +    S+  P         +R  + + R ++           +++ +S++
Sbjct: 425 LRLTHRHGPCAGPSRSASAPSFAEVLRADERRAEYIQRRMSGAKGPGGLQQFTAASSSKS 484

Query: 83  DIIPNNAN-------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
             IP N         Y++ +S+GTP   +    DTGSD+ W QC PC    CY Q   LF
Sbjct: 485 VTIPANIGHSIGTLQYVVTVSLGTPGVAQTVEVDTGSDVSWVQCAPCAAPACYAQKDQLF 544

Query: 136 DPKMSSTYKSLPCSSSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           DP  SS+Y ++PC++  C+ L+       +G  C Y VSYGDGS + G   ++T+TL   
Sbjct: 545 DPAKSSSYSAVPCAADACSELSTYGHGCAAGSQCGYVVSYGDGSNTTGVYGSDTLTLT-- 602

Query: 193 TGQAVALPGITFGCGTNNGGLF----------------NSKTTGIVGLG----------- 225
              A A+ G  FGCG    GLF                 S+T+G  G G           
Sbjct: 603 --DADAVTGFLFGCGHAQAGLFAGIDGLLALGRKGMSLTSQTSGAYGGGVFSYCLPPSPS 660

Query: 226 -------GG----------------DISLISQMRTT---IAGNQRLGVSTPDI----VID 255
                  GG                D+     +  T   + G Q  GV         V+D
Sbjct: 661 STGFLTLGGPSSASGFATTGLLTAWDVPTFYMVMLTGIGVGGQQLSGVPASAFAGGTVVD 720

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNSLSQV--PEVTIHFR 311
           +GT +T LP    + L +   + +     P A  TG L+ CY+F     V  P V++ F 
Sbjct: 721 TGTVITRLPPTAYAALRAAFRAAMAPYGYPAAPATGILDTCYNFTDYGTVTLPTVSLTFS 780

Query: 312 -GADVKLSRSNFFVKVSEDIVCSVFKGITNS----VPIYGNIMQTNFLVGYDIEQQTVSF 366
            GA +KL    F         C  F   TNS      I GN+ Q +F V +D    +V F
Sbjct: 781 GGATLKLDAPGFLSS-----GCLAFA--TNSGDGDPAILGNVQQRSFAVRFD--GSSVGF 831

Query: 367 KPTDC 371
            P  C
Sbjct: 832 MPHSC 836


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 60/166 (36%), Positives = 86/166 (51%), Gaps = 23/166 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+  + +G    E   + DT S+L W QC PC    C+ Q  PLFDP  S +Y ++PC+
Sbjct: 152 NYVATVGLGG--GEATVIVDTASELTWVQCAPC--ESCHDQQDPLFDPSSSPSYAAVPCN 207

Query: 150 SSQCASLN---------QKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           SS C +L            +C G +     C Y++SY DGS+S G LA + ++L      
Sbjct: 208 SSSCDALQLATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGEV-- 265

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
              + G  FGCGT+N G     T+G++GLG   +SL+SQ      G
Sbjct: 266 ---IDGFVFGCGTSNQGPPFGGTSGLMGLGRSQLSLVSQTMDQFGG 308



 Score = 37.7 bits (86), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 36/127 (28%), Positives = 53/127 (41%), Gaps = 9/127 (7%)

Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +IDSGT +T  +P  YN+     +S   E  P A     L+ C++   L   QVP + + 
Sbjct: 384 IIDSGTVITSLVPSIYNAVKAEFLSQFAE-YPQAPGFSILDTCFNMTGLREVQVPSLKLV 442

Query: 310 FRGA-DVKLSRSN--FFVKVSEDIVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
           F G  +V++      +FV      VC     + +     I GN  Q N  V +D     V
Sbjct: 443 FDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFDTSGSQV 502

Query: 365 SFKPTDC 371
            F    C
Sbjct: 503 GFAQETC 509


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score =  102 bits (253), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 15/158 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKS 145
           Y  +I +G+PP E     DTGSD++W  C PCP  +C ++        L+D K SST K+
Sbjct: 77  YFTKIKLGSPPKEYYVQVDTGSDILWVNCAPCP--KCPVKTDLGIPLSLYDSKASSTSKN 134

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           + C  + C+ + Q    G    C Y V YGDGS S+G+   + +TL   TG     P   
Sbjct: 135 VGCEDAFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQ 194

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            + FGCG N  G      S   GI+G G  + S+ISQ+
Sbjct: 195 EVVFGCGKNQSGQLGQTESAVDGIMGFGQSNTSVISQL 232


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 54/132 (40%), Positives = 72/132 (54%), Gaps = 6/132 (4%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ--KSCSGVNC 166
           DT  DL W QC PCP  +CY Q + LFDP+ S T  ++PC S+ C  L +    CS   C
Sbjct: 151 DTSIDLPWIQCAPCPMPECYPQQNALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQC 210

Query: 167 QYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGG 226
           QY V YGDG  ++G    + +TL  +T     +    FGC     G F++ T+G + LGG
Sbjct: 211 QYFVDYGDGRATSGTYMVDALTLNPST----VVMNFRFGCSHAVRGNFSASTSGTMSLGG 266

Query: 227 GDISLISQMRTT 238
           G  SL+SQ   T
Sbjct: 267 GRQSLLSQTAAT 278


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score =  101 bits (252), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 96/394 (24%), Positives = 162/394 (41%), Gaps = 88/394 (22%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDL 114
            +L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD+
Sbjct: 38  KKLEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDI 97

Query: 115 IWTQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYS 169
           +W  C+PCP  PS+  +     LFD   SST K + C    C+ ++Q  SC   V C Y 
Sbjct: 98  LWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDDDFCSFISQSDSCQPAVGCSYH 157

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVG 223
           + Y D S S GN   + +TL   TG     P    + FGCG++  G     +S   G++G
Sbjct: 158 IVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFGCGSDQSGQLGKSDSAVDGVMG 217

Query: 224 LGGGDISLISQMRTT-------------IAGNQRLGVST--------------------- 249
            G  + S++SQ+  T             + G     V                       
Sbjct: 218 FGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVM 277

Query: 250 --------------PDIVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPT 288
                         P I+ + G      TTL + P+    +L+    +++  QPV     
Sbjct: 278 LMGMDVDGTALDLPPSIMRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIV 334

Query: 289 GSLELCYSFNSLSQV--PEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK------GI 338
                C+SF+    V  P V+  F  + VKL+    ++   + +++ C  ++      G 
Sbjct: 335 EDTFQCFSFSENVDVAFPPVSFEFEDS-VKLTVYPHDYLFTLEKELYCFGWQAGGLTTGE 393

Query: 339 TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
              V + G+++ +N LV YD+E + + +   +C+
Sbjct: 394 RTEVILLGDLVLSNKLVVYDLENEVIGWADHNCS 427


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 58/154 (37%), Positives = 81/154 (52%), Gaps = 14/154 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +   +GTPP +   + D+GSDL+W QC PC   QCY QDSPL+ P  SST+  +P
Sbjct: 61  SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCSPC--RQCYAQDSPLYVPSNSSTFSPVP 118

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C SS C  +        +      C Y   Y D S S G  A E+ T+       V +  
Sbjct: 119 CLSSDCLLIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATV-----DGVRIDK 173

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + FGCG++N G F +   G++GLG G +S  SQ+
Sbjct: 174 VAFGCGSDNQGSF-AAAGGVLGLGQGPLSFGSQV 206



 Score = 66.6 bits (161), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 42/129 (32%), Positives = 65/129 (50%), Gaps = 9/129 (6%)

Query: 253 VIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
           + DSGTTLT+  P  Y S++L+   S +   P A+    L+LC     + Q   P  TI 
Sbjct: 291 IFDSGTTLTYWFPSAY-SHILAAFDSGVH-YPRAESVQGLDLCVELTGVDQPSFPSFTIE 348

Query: 310 FR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVS 365
           F  GA  +    N+FV V+ ++ C    G+ + +  +   GN++Q NF V YD E+  + 
Sbjct: 349 FDDGAVFQPEAENYFVDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLIG 408

Query: 366 FKPTDCTKQ 374
           F P  C+  
Sbjct: 409 FAPAKCSSH 417


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  100 bits (250), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 87/160 (54%), Gaps = 12/160 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +GTP  E + + DTGSD+ W QC PC    C     P F+P+ SS++  LPC+
Sbjct: 138 EYYVPLQVGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 195

Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
           SS C ++ Q      S SG  C +S+ YGDGS S+G LA ET+  G+T     G+ V L 
Sbjct: 196 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 254

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            IT GC   +     +  +G++G+    IS  SQ+ +  A
Sbjct: 255 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA 294



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/136 (28%), Positives = 61/136 (44%), Gaps = 22/136 (16%)

Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ------ 302
           +IDSGT  T+L     Q      L+  S + +     D       CY+  S +       
Sbjct: 378 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV----DDNSGFTPCYNITSGTAALESTI 433

Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQTNFLV 355
           +P +T+HFRG  DV L +++  + VS    +  +C  F  ++  +P  I GN  Q N  V
Sbjct: 434 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFL-MSGDIPFNIIGNYQQQNLWV 492

Query: 356 GYDIEQQTVSFKPTDC 371
            YD+E+  +   P  C
Sbjct: 493 EYDLEKLRLGIAPAQC 508


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/132 (43%), Positives = 74/132 (56%), Gaps = 8/132 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
           +   +++ + +GTP      + DTGSDL W QC+PC  S  C+ Q  PLFDP  SSTY +
Sbjct: 145 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 204

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + C   QCA+     CS  N  C Y V YGDGS + G L+ +T+ L S+     AL G  
Sbjct: 205 VHCGEPQCAAAGGL-CSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSSR----ALAGFP 259

Query: 204 FGCGTNNGGLFN 215
           FGCGT N G F 
Sbjct: 260 FGCGTRNLGDFG 271



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 37/125 (29%), Positives = 56/125 (44%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           ++DSGT LT+LP      L       +E    A P   L+ CY F   S+V    + FR 
Sbjct: 367 LLDSGTVLTYLPAQAYELLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVIVPAVSFRF 426

Query: 313 ADVKLSRSNFF---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
            D  +   +FF   + + E++ C  F  +    +P  I GN  Q +  V YD+  + + F
Sbjct: 427 GDGAVFELDFFGVMIFLDENVGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGF 486

Query: 367 KPTDC 371
            P  C
Sbjct: 487 VPASC 491


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/160 (37%), Positives = 87/160 (54%), Gaps = 12/160 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +GTP  E + + DTGSD+ W QC PC    C     P F+P+ SS++  LPC+
Sbjct: 137 EYYVPLQLGTPAVEVVLIMDTGSDVSWIQCVPC--KDCVPALRPPFNPRHSSSFFKLPCA 194

Query: 150 SSQCASLNQK-----SCSGVNCQYSVSYGDGSFSNGNLATETVTLGST----TGQAVALP 200
           SS C ++ Q      S SG  C +S+ YGDGS S+G LA ET+  G+T     G+ V L 
Sbjct: 195 SSTCTNVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIA-GNTPNFGDGEPVKLS 253

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIA 240
            IT GC   +     +  +G++G+    IS  SQ+ +  A
Sbjct: 254 NITLGCADIDREGLPTGASGLLGMDRRPISFPSQLSSRYA 293



 Score = 52.0 bits (123), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 39/137 (28%), Positives = 62/137 (45%), Gaps = 22/137 (16%)

Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ------ 302
           +IDSGT  T+L     Q      L+  S + +     D       CY+  S +       
Sbjct: 377 IIDSGTAFTYLKKPAFQAMRREFLARTSHLAKV----DDNSGFTPCYNITSGTAALESTI 432

Query: 303 VPEVTIHFRGA-DVKLSRSNFFVKVS----EDIVCSVFKGITNSVP--IYGNIMQTNFLV 355
           +P +T+HFRG  DV L +++  + VS    +  +C  F+ ++  +P  I GN  Q N  V
Sbjct: 433 LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQ-MSGDIPFNIIGNYQQQNLWV 491

Query: 356 GYDIEQQTVSFKPTDCT 372
            YD+E+  +   P  C 
Sbjct: 492 EYDLEKLRLGIAPAQCA 508


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 109/450 (24%), Positives = 172/450 (38%), Gaps = 111/450 (24%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
           +ELIHR SP+       +T  QRL++ +     R L  L H  +   I   KA +     
Sbjct: 3   LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59

Query: 82  -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
                  A  +P +         Y +   +GTP  + + VADTGSDL W  C+  C    
Sbjct: 60  SGRGSDDAIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
           C       ++   +F   +SS++K++PC +  C        SL         C Y   Y 
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG-------- 225
           DGS + G  A ETVT+    G+ + L  +  GC  +  G       G++GLG        
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 226 ------GGDIS-----------------------------------LISQMRTTIAGNQR 244
                 GG  S                                   L+  M  +      
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 245 LGVS-------TPDIVID----SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVAD 286
           +G+S        P  V D     GT L      TFL +  Y   + ++  S+++ + V  
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 287 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 342
             G LE C++     +  VP +  HF  GA+ +    ++ +  ++ + C  F  +     
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            + GNIMQ N L  +D+  + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 52/129 (40%), Positives = 71/129 (55%), Gaps = 6/129 (4%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +   +++ +  GTP      + DTGSDL W QC+PC    CY Q  P FDP  SS+Y ++
Sbjct: 133 DTLEFVVVVGFGTPAQTAAIILDTGSDLSWIQCKPC-SGHCYRQHDPDFDPAKSSSYAAV 191

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC +  CA+     C+G  C Y V YGDGS + G L+ +T+T  S++       G TFGC
Sbjct: 192 PCGTPVCAAAGGM-CNGTTCLYGVQYGDGSSTTGVLSRDTLTFNSSS----KFTGFTFGC 246

Query: 207 GTNNGGLFN 215
           G  N G F 
Sbjct: 247 GEKNIGDFG 255



 Score = 45.4 bits (106), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 34/127 (26%), Positives = 54/127 (42%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           ++DSGT LT+LP    ++L       ++    A P   L+ CY F     +    + F  
Sbjct: 351 LLDSGTILTYLPPPAYTSLRDRFKFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNF 410

Query: 313 ADVKLSRSNFF-VKVSED-----IVCSVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTV 364
           +D  +   +F+ + +  D     I C  F     ++P  I GN  Q    V YD+  Q +
Sbjct: 411 SDGAVFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKI 470

Query: 365 SFKPTDC 371
            F P  C
Sbjct: 471 GFIPISC 477


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 79/141 (56%), Gaps = 8/141 (5%)

Query: 98  GTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           GT    +  + D+GSD+ W QC+PCP   C+ Q  PLFDP  S+TY ++PCSS+ CA L 
Sbjct: 75  GTSAVSQTVIIDSGSDVPWVQCQPCPLLVCHPQRDPLFDPATSTTYAAVPCSSAACARLG 134

Query: 158 --QKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG-TNNGGL 213
             ++ C +   CQ+ ++Y +G+ + G  +++ +TLG        + G  FGC   + G  
Sbjct: 135 PYRRGCLANSQCQFGITYANGATATGTYSSDDLTLGPYD----VVRGFLFGCAHADQGST 190

Query: 214 FNSKTTGIVGLGGGDISLISQ 234
           F+    G + LGGG  S + Q
Sbjct: 191 FSYDVAGTLALGGGSQSFVQQ 211



 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 33/125 (26%), Positives = 58/125 (46%), Gaps = 12/125 (9%)

Query: 253 VIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIH 309
           VIDS T ++ +P   Y +   +  S+M   +P A P   L+ CY F+ +  +  P + + 
Sbjct: 291 VIDSATVISRIPPTAYQALRAAFRSAMTMYRP-APPVSILDTCYDFSGVRSITLPSIALV 349

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFK-GITNSVPIY-GNIMQTNFLVGYDIEQQTVSF 366
           F G A V L  +   ++      C  F    ++ +P + GN+ Q    V YD+  + + F
Sbjct: 350 FDGGATVNLDAAGILLQG-----CLAFAPTASDRMPGFIGNVQQRTLEVVYDVPGKAIRF 404

Query: 367 KPTDC 371
           +   C
Sbjct: 405 RSAAC 409


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 57/132 (43%), Positives = 74/132 (56%), Gaps = 8/132 (6%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKS 145
           +   +++ + +GTP      + DTGSDL W QC+PC  S  C+ Q  PLFDP  SSTY +
Sbjct: 140 DTLEFVVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAA 199

Query: 146 LPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           + C   QCA+     CS  N  C Y V YGDGS + G L+ +T+ L S+     AL G  
Sbjct: 200 VHCGEPQCAAAGDL-CSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSSR----ALTGFP 254

Query: 204 FGCGTNNGGLFN 215
           FGCGT N G F 
Sbjct: 255 FGCGTRNLGDFG 266



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 38/125 (30%), Positives = 58/125 (46%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           ++DSGT LT+LP    + L       +E    A P   L+ CY F   S+V    + FR 
Sbjct: 362 LLDSGTVLTYLPAQAYALLRDRFRLTMERYTPAPPNDVLDACYDFAGESEVVVPAVSFRF 421

Query: 313 ADVKLSRSNFF---VKVSEDIVCSVFKGI-TNSVP--IYGNIMQTNFLVGYDIEQQTVSF 366
            D  +   +FF   + + E++ C  F  + T  +P  I GN  Q +  V YD+  + + F
Sbjct: 422 GDGAVFELDFFGVMIFLDENVGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGF 481

Query: 367 KPTDC 371
            P  C
Sbjct: 482 VPASC 486


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score =  100 bits (248), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 111/406 (27%), Positives = 170/406 (41%), Gaps = 100/406 (24%)

Query: 51  YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           Y+ LR+   R L R+        + +   S  D       Y  RI +GTPP +     DT
Sbjct: 13  YRTLREHDQRRLRRIL-----PEVVAFPISGDDDTFTTGLYYTRIYLGTPPQQFYVHVDT 67

Query: 111 GSDLIWTQCEPCPPSQCYMQDS-----PLFDPKMSSTYKSLPCSSSQCASLNQKSCS--G 163
           GSD+ W  C PC  + C    +      +FDP+ S++  S+ C+  +C   +   CS   
Sbjct: 68  GSDVAWVNCVPC--TNCKRASNVALPISIFDPEKSTSKTSISCTDEECYLASNSKCSFNS 125

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
           ++C YS  YGDGS + G L  + ++     +G + A  G   +TFGCG+N  G +   T 
Sbjct: 126 MSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLTFGCGSNQTGTW--LTD 183

Query: 220 GIVGLGGGDISLISQMR---------------------TTIAGNQR-------------- 244
           G+VG G  ++SL SQ+                      T + G+ R              
Sbjct: 184 GLVGFGQAEVSLPSQLSKQNVSVNIFAHCLQGDNKGSGTLVIGHIREPGLVYTPIVPKQS 243

Query: 245 --------LGVS-----TP---------DIVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEA 281
                   +GVS     TP          +++DSGTTLT+L Q  Y+     V   M   
Sbjct: 244 HYNVELLNIGVSGTNVTTPTAFDLSNSGGVIMDSGTTLTYLVQPAYDQFQAKVRDCM--- 300

Query: 282 QPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVK--VSEDIVCSVFK 336
                 +G L + + F    +   P VT++F  GA + LS S++  K  ++  +    F 
Sbjct: 301 -----RSGVLPVAFQFFCTIEGYFPNVTLYFAGGAAMLLSPSSYLYKEMLTTGLSAYCFS 355

Query: 337 GITNSVPIYGNIMQTNF--------LVGYDIEQQTVSFKPTDCTKQ 374
            +  S  +YG +  T F        LV YD     + +K  DCTK+
Sbjct: 356 WL-ESTSVYGYLSYTIFGDNVLKDQLVVYDNVNNRIGWKNFDCTKE 400


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 158/391 (40%), Gaps = 86/391 (21%)

Query: 67  HFNQNSSISSSKASQADI------IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQ- 118
           H   +S+      + AD+      +P +   Y   I IGTPP +     DTGSD++W   
Sbjct: 52  HLTHDSNRRGRLLAAADVPLGGLGLPTDTGLYYTEIEIGTPPKQYHVQVDTGSDILWVNC 111

Query: 119 --CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG----VNCQYSVSY 172
             C  CP       D  L+DPK SS+  ++ C    CA+       G    + C+YSV Y
Sbjct: 112 ISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFCAATYGGKLPGCAKNIPCEYSVMY 171

Query: 173 GDGSFSNGNLATETVTLGSTTGQAV---ALPGITFGCGTNNGGLF---NSKTTGIVGLGG 226
           GDGS + G   ++++     +G      A   + FGCG   GG     N    GI+G G 
Sbjct: 172 GDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGCGAQQGGDLGSTNQALDGIIGFGQ 231

Query: 227 GDISLISQMRT-------------TIAGNQRLGVS------------TPDI--------- 252
            + S++SQ+               TI G     +              PD+         
Sbjct: 232 SNTSMLSQLAAAGEVKKIFSHCLDTIKGGGIFAIGDVVQPKVKSTPLVPDMPHYNVNLES 291

Query: 253 -----------------------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTG 289
                                  +IDSGTTLT+LP+    ++L+ + +            
Sbjct: 292 INVGGTTLQLPSHMFETGEKKGTIIDSGTTLTYLPELVYKDVLAAVFAKHPDTTFHSVQD 351

Query: 290 SLELCYSFNSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----NS 341
            L + Y  +     P++T HF   D+ L+    ++F +  +++ C  F+  G+       
Sbjct: 352 FLCIQYFQSVDDGFPKITFHFE-DDLGLNVYPHDYFFQNGDNLYCFGFQNGGLQSKDGKD 410

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           + + G+++ +N +V YD+E Q V +   +C+
Sbjct: 411 MVLLGDLVLSNKVVVYDLENQVVGWTDYNCS 441


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 60/179 (33%), Positives = 86/179 (48%), Gaps = 9/179 (5%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
           S  LIH  S  SPF   + T    + + +    NRL    + S  S   A +   +   +
Sbjct: 53  SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKQDANANVPVRSGS 112

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y+I++  GTP      + DTGSD+ W  C+ C   Q     +P+FDP  SS+YK   C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            S  C  ++        CQ+ VSYGDG+  +G LA++ +TLGS       LP  +FGC 
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 12/146 (8%)

Query: 235 MRTTIAGNQRLGVSTPDI------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADP 287
           ++    GN R+ V   +I      +IDSGTT+T L P  Y +   +    +   QP   P
Sbjct: 306 LKAISVGNTRISVPGTNIASGGGTIIDSGTTITHLVPSAYTALRDAFRQQLSSLQPT--P 363

Query: 288 TGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 345
              ++ CY  +S S  VP +T+H  R  D+ L + N  +     + C  F   T+S  I 
Sbjct: 364 VEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLACLAFSS-TDSRSII 422

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           GN+ Q N+ + +D+    V F    C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 56/154 (36%), Positives = 80/154 (51%), Gaps = 14/154 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +   +GTPP +   + D+GSDL+W QC PC   QCY QD+PL+ P  SST+  +P
Sbjct: 62  SGQYFVDFFLGTPPQKFSLIVDSGSDLLWVQCAPC--LQCYAQDTPLYAPSNSSTFNPVP 119

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           C S +C  +        +      C Y   Y D S S G  A E+ T+       V +  
Sbjct: 120 CLSPECLLIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATVDD-----VRIDK 174

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           + FGCG +N G F +   G++GLG G +S  SQ+
Sbjct: 175 VAFGCGRDNQGSF-AAAGGVLGLGQGPLSFGSQV 207



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 43/129 (33%), Positives = 65/129 (50%), Gaps = 9/129 (6%)

Query: 253 VIDSGTTLTF-LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
           + DSGTT+T+ LP  Y  N+L+     +   P A     L+LC     + Q   P  TI 
Sbjct: 292 IFDSGTTVTYWLPPAYR-NILAAFDKNVR-YPRAASVQGLDLCVDVTGVDQPSFPSFTIV 349

Query: 310 FRGADV-KLSRSNFFVKVSEDIVCSVFKGITNSVPIY---GNIMQTNFLVGYDIEQQTVS 365
             G  V +  + N+FV V+ ++ C    G+ +SV  +   GN++Q NFLV YD E+  + 
Sbjct: 350 LGGGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIG 409

Query: 366 FKPTDCTKQ 374
           F P  C+  
Sbjct: 410 FAPAKCSSH 418


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 97/375 (25%), Positives = 158/375 (42%), Gaps = 80/375 (21%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
            NS + ++     D + +N  Y  R+ IGTPP E   + DTGS + +  C  C   QC  
Sbjct: 67  HNSDLPNAHMRLYDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSTC--EQCGK 124

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
              P F P+ SSTYK + C+ S C   ++    G  C Y   Y + S S+G LA + ++ 
Sbjct: 125 HQDPRFQPESSSTYKPMQCNPS-CNCDDE----GKQCTYERRYAEMSSSSGLLAEDVLSF 179

Query: 190 GSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM------------- 235
           G+ +   +      FGC T   G LF+ +  GI+GLG G +S++ Q+             
Sbjct: 180 GNES--ELTPQRAIFGCETVETGELFSQRADGIMGLGRGPLSVVDQLVIKEVVGNSFSLC 237

Query: 236 ---RTTIAGNQRLG--VSTPDIV------------------------------------- 253
                 + G   LG     PD+V                                     
Sbjct: 238 YGGMDVVGGAMVLGNIPPPPDMVFAHSDPYRSAYYNIELKELHVAGKRLKLNPRVFDGKH 297

Query: 254 ---IDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV 303
              +DSGTT  +LP + + +   +++  +   + +  P  S  ++C+S      + LS++
Sbjct: 298 GTVLDSGTTYAYLPEEAFVAFKDAIIKEIKFLKQIHGPDPSYNDICFSGAGRDVSQLSKI 357

Query: 304 -PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
            PEV + F  G  + LS  N+     KVS      +F+   +   + G I+  N LV YD
Sbjct: 358 FPEVNMVFGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYD 417

Query: 359 IEQQTVSFKPTDCTK 373
            +   + F  T+C++
Sbjct: 418 RDNDKIGFWKTNCSE 432


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 144/360 (40%), Gaps = 79/360 (21%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     CE CP       D  L+DPK SST   + 
Sbjct: 86  YYTEIKLGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVM 145

Query: 148 CSSSQCASLNQ----KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C  + CA+       K  + V C+YSV+YGDGS + G+  T+ +     T      P   
Sbjct: 146 CDQAFCAATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANA 205

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            + FGCG   GG     N    GI+G G  + S++SQ+ T             TI G   
Sbjct: 206 SVIFGCGAQQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGI 265

Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
             +         +TP +            ID G                        TTL
Sbjct: 266 FSIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLQLPAHIFEPGEKKGTIIDSGTTL 325

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR- 319
           T+LP+     ++  + +  +     D  G L   Y  +     P +T HF   D+ L   
Sbjct: 326 TYLPELVFKEVMLAVFNKHQDITFHDVQGFLCFQYPGSVDDGFPTITFHFE-DDLALHVY 384

Query: 320 -SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
              +F     D+ C  F+ G + S     + + G+++ +N LV YD+E + + +   +C+
Sbjct: 385 PHEYFFANGNDVYCVGFQNGASQSKDGKDIVLMGDLVLSNKLVIYDLENRVIGWTDYNCS 444


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 89/172 (51%), Gaps = 13/172 (7%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +A YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 136 ESGVAVGSAEYLMDVYVGTPPRRFQMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 193

Query: 141 STYKSLPCSSSQCASLNQKSCSGVN---------CQYSVSYGDGSFSNGNLATETVTLGS 191
           S+Y++L C   +C  +                  C Y   YGD S S G+LA E+ T+  
Sbjct: 194 SSYRNLTCGDPRCGHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNL 253

Query: 192 TT-GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
           T  G +  + G+ FGCG  N GLF+     ++GLG G +S  SQ+R    G+
Sbjct: 254 TAPGASSRVDGVVFGCGHRNRGLFHGAAG-LLGLGRGPLSFASQLRAVYGGH 304



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 61/127 (48%), Gaps = 6/127 (4%)

Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +IDSGTTL+ F+   Y     + +  M  + P       L  CY+ + +   +VPE+++ 
Sbjct: 385 IIDSGTTLSYFVEPAYQVIRRAFIDRMSGSYPPVPDFPVLSPCYNVSGVERPEVPELSLL 444

Query: 310 FR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSF 366
           F  GA       N+F+++  D I+C    G   + + I GN  Q NF V YD+    + F
Sbjct: 445 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVAYDLHNNRLGF 504

Query: 367 KPTDCTK 373
            P  C +
Sbjct: 505 APRRCAE 511


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 100/203 (49%), Gaps = 30/203 (14%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYN--SSETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS 80
           +A+     +E +HR + +S      +S +P + L + +  ++                  
Sbjct: 97  KAEKDAVRIETMHRRAARSGVARMPASSSPRRALSERMVATV------------------ 138

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YLI + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 139 ESGVAVGSGEYLIDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 196

Query: 141 STYKSLPCSSSQCASL----NQKSC---SGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           S+Y+++ C   +C  +      ++C   +  +C Y   YGD S + G+LA E+ T+  T 
Sbjct: 197 SSYRNVTCGDQRCGLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTA 256

Query: 194 -GQAVALPGITFGCGTNNGGLFN 215
            G +  + G+ FGCG  N GLF+
Sbjct: 257 PGASRRVDGVVFGCGHRNRGLFH 279



 Score = 58.2 bits (139), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 51/173 (29%), Positives = 82/173 (47%), Gaps = 25/173 (14%)

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLT-FLPQGYNS-- 269
            +  K  G+  L GGD+  IS     +  +   G      +IDSGTTL+ F+   Y    
Sbjct: 350 FYYVKLKGV--LVGGDLLNISSDTWDVGKDGSGGT-----IIDSGTTLSYFVEPAYQVIR 402

Query: 270 ----NLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 322
               +L+S +  +I   PV +P      CY+ + +   +VPE+++ F  GA       N+
Sbjct: 403 QAFVDLMSRLYPLIPDFPVLNP------CYNVSGVERPEVPELSLLFADGAVWDFPAENY 456

Query: 323 FVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           FV++  D I+C   +G   + + I GN  Q NF V YD++   + F P  C +
Sbjct: 457 FVRLDPDGIMCLAVRGTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 509


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 97/185 (52%), Gaps = 9/185 (4%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFN-QNSSISSSKASQADIIP 86
           GFS+E++HR S +SPFY  + T Y+R+   +  S  R ++     SS  S +A +  I  
Sbjct: 27  GFSLEIVHRYSRESPFYPGNITDYERITRLVELSKIRAHNLAITTSSGFSPEAFRLRISQ 86

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           ++  YL+++ IG+P      V DTGS L WTQCEPC  ++ + Q  P+F+   S TY+ L
Sbjct: 87  DDTCYLVKVIIGSPGVPLYLVPDTGSGLFWTQCEPC--TRRFRQLPPIFNSTASRTYRDL 144

Query: 147 PCSSSQCA-SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           PC    C  + N   C    C Y ++Y  GS + G  A + +     + +   +P   FG
Sbjct: 145 PCQHQFCTNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQDIL----QSAENDRIP-FYFG 199

Query: 206 CGTNN 210
           C  +N
Sbjct: 200 CSRDN 204


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score = 99.4 bits (246), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/362 (25%), Positives = 144/362 (39%), Gaps = 83/362 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     CE CP       D   +DPK SS+  ++ 
Sbjct: 84  YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVS 143

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+       G    V C+YSV YGDGS + G   T+ +     TG     PG  
Sbjct: 144 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNA 203

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            +TFGCG   GG     N    GI+G G  + S++SQ+               TI G   
Sbjct: 204 TVTFGCGAQQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGI 263

Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
             +         +TP +            ID G                        TTL
Sbjct: 264 FAIGNVVQPKVKTTPLVADMPHYNVNLKSIDVGGTTLQLPAHVFETGERKGTIIDSGTTL 323

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTIHFRGADVKLS 318
           T+LP+     +++ + +  + Q +        +C+ +  +     P +T HF   D+ L 
Sbjct: 324 TYLPELVFKEVMAAIFN--KHQDIVFHNVQDFMCFQYPGSVDDGFPTITFHFE-DDLALH 380

Query: 319 R--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
                +F     D+ C  F+ G   S     + + G+++ +N LV YD+E Q + +   +
Sbjct: 381 VYPHEYFFPNGNDMYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVIYDLENQVIGWTDYN 440

Query: 371 CT 372
           C+
Sbjct: 441 CS 442


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 60/167 (35%), Positives = 82/167 (49%), Gaps = 27/167 (16%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC------PPSQCYMQDSPLFDPKMSS 141
           +  Y + I +GTPP   L VADTGSDL+W +C  C      PPS  ++       P+ SS
Sbjct: 85  SGQYFVDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFL-------PRHSS 137

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           ++    C    C  L        N       C++  SY DGS S+G  + ET TL S +G
Sbjct: 138 SFSPFHCFDPHCRLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSG 197

Query: 195 QAVALPGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQM 235
             + L G++FGCG        +G  FN    G++GLG G IS  SQ+
Sbjct: 198 SEIHLKGLSFGCGFRISGPSVSGAQFNG-ARGVMGLGRGSISFSSQL 243



 Score = 58.9 bits (141), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/125 (28%), Positives = 62/125 (49%), Gaps = 6/125 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEV-TIHFR 311
           V+DSGTTLT+L +     +L  +   ++    A+ T   +LC + +  S+ P +  + FR
Sbjct: 332 VVDSGTTLTYLTKTAYEEVLKSVRRRVKLPNAAELTPGFDLCVNASGESRRPSLPRLRFR 391

Query: 312 ---GADVKLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSF 366
              GA       N+F++  E ++C   + +   N   + GN+MQ  FL+ +D E+  + F
Sbjct: 392 LGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFLLEFDKEESRLGF 451

Query: 367 KPTDC 371
               C
Sbjct: 452 TRRGC 456


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 109/450 (24%), Positives = 172/450 (38%), Gaps = 111/450 (24%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDAL----TRSLNRLNHFNQNSSISSSKASQ----- 81
           +ELIHR SP+       +T  QRL++ +     R L  L H  +   I   KA +     
Sbjct: 3   LELIHRHSPQ--VMGRPKTQLQRLKELVHSDSVRQLMIL-HKLRGGQIPRRKAKEVLSSS 59

Query: 82  -------ADIIPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQ 126
                  A  +P +         Y +   +GTP  + + VADTGSDL W  C+  C    
Sbjct: 60  SGRGSDDAIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRN 119

Query: 127 C------YMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYG 173
           C       ++   +F   +SS++K++PC +  C        SL         C Y   Y 
Sbjct: 120 CSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYS 179

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG-------- 225
           DGS + G  A ETVT+    G+ + L  +  GC  +  G       G++GLG        
Sbjct: 180 DGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAI 239

Query: 226 ------GGDIS-----------------------------------LISQMRTTIAGNQR 244
                 GG  S                                   L+  M  +      
Sbjct: 240 KAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNM 299

Query: 245 LGVS-------TPDIVID----SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVAD 286
           +G+S        P  V D     GT L      TFL +  Y   + ++  S+++ + V  
Sbjct: 300 MGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEM 359

Query: 287 PTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SV 342
             G LE C++     +  VP +  HF  GA+ +    ++ +  ++ + C  F  +     
Sbjct: 360 DIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGT 419

Query: 343 PIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            + GNIMQ N L  +D+  + + F P+ CT
Sbjct: 420 SVVGNIMQQNHLWEFDLGLKKLGFAPSSCT 449


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 110/451 (24%), Positives = 173/451 (38%), Gaps = 96/451 (21%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
           T LSC+     L   +      +     ++L HRD+  PK         P  R+ D +  
Sbjct: 26  TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 73

Query: 61  SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              R  L    +NS++       + I    A Y   I +GTP  +   V DTGS+L W  
Sbjct: 74  DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 133

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
           C      +    +  +F    S ++K++ C +  C        SL         C Y   
Sbjct: 134 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 190

Query: 172 YGDGSFSNGNLATETVTL-----------GSTTGQAVALPGITFGCGTNNGGL------F 214
           Y DGS + G  A ET+T+           G   G + +  G +F       GL      F
Sbjct: 191 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 250

Query: 215 NSKTTGIVG----------LGGGDIS---LISQMRTTIAGNQR----------------- 244
            S  T + G          L   ++S   +    R+T    +R                 
Sbjct: 251 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 310

Query: 245 ----LGVSTPDI-------------VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVAD 286
               LG    DI             ++DSGT+LT L    Y   +  +   ++E + V  
Sbjct: 311 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 370

Query: 287 PTGSLELCYSFNS---LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNS 341
               +E C+SF S   +S++P++T H +G A  +  R ++ V  +  + C  F    T +
Sbjct: 371 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 430

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             + GNIMQ N+L  +D+   T+SF P+ CT
Sbjct: 431 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 56/159 (35%), Positives = 84/159 (52%), Gaps = 13/159 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC+PC    C+ Q+   + PK SSTY+++ C
Sbjct: 169 GEYFLDMFVGTPPKHVWLILDTGSDLSWIQCDPC--YDCFEQNGSHYYPKDSSTYRNISC 226

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGST----TGQAVA 198
              +C  ++     + C   N  C Y   Y DGS + G+ A+ET T+  T      +   
Sbjct: 227 YDPRCQLVSSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQ 286

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
           +  + FGCG  N G F    +G++GLG G IS  SQ+++
Sbjct: 287 VVDVMFGCGHWNKGFFYG-ASGLLGLGRGPISFPSQIQS 324



 Score = 58.5 bits (140), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 41/155 (26%), Positives = 74/155 (47%), Gaps = 7/155 (4%)

Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
           GG++  IS+     +       +    +IDSG+TLTF P      +       I+ Q +A
Sbjct: 388 GGEVLDISEQTWHWSSEGAAADAGGGTIIDSGSTLTFFPDSAYDIIKEAFEKKIKLQQIA 447

Query: 286 DPTGSLELCYSFN-SLSQV--PEVTIHFRGADV-KLSRSNFFVKVSED-IVC-SVFKGIT 339
                +  CY+ + ++ QV  P+  IHF    V      N+F +   D ++C ++ K   
Sbjct: 448 ADDFVMSPCYNVSGAMMQVELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPN 507

Query: 340 NS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           +S + I GN++Q NF + YD+++  + + P  C +
Sbjct: 508 HSHLTIIGNLLQQNFHILYDVKRSRLGYSPRRCAE 542


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 98/369 (26%), Positives = 151/369 (40%), Gaps = 82/369 (22%)

Query: 85  IPNNAN-YLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMS 140
           IP +   Y  +I IGTP        DTGSD++W  C   + CP       D  L+DP  S
Sbjct: 82  IPTDTGLYFTQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTAS 141

Query: 141 STYKSLPCSSSQCASLNQK----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           ++ K++ C    CA+        SC+  + CQYS++YGDGS + G    + +     +G 
Sbjct: 142 ASSKTVTCGQEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGD 201

Query: 196 A---VALPGITFGCGTNNGGLFNSKTT---GIVGLG------------------------ 225
               +A   +TFGCG   GG   S      GI+G G                        
Sbjct: 202 GQTNLANASVTFGCGAKIGGALGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCL 261

Query: 226 ----GGDISLISQ---------------------MRTTIAGNQRL---------GVSTPD 251
               GG I  I                       ++T   G   L         G  +  
Sbjct: 262 DTVNGGGIFAIGNVVQPKVKTTPLVPGMPHYNVVLKTIDVGGSTLQLPTNIFDIGGGSRG 321

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR 311
            +IDSGTTL +LP+     +LS + S      + +    L   YS +  +  PEVT HF 
Sbjct: 322 TIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDFLCFQYSGSVDNGFPEVTFHFD 381

Query: 312 GADVKLSR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQT 363
           G D+ L     ++  + +ED+ C  F+  G+ +     + + G++  +N LV YD+E Q 
Sbjct: 382 G-DLPLVVYPHDYLFQNTEDVYCVGFQSGGVQSKDGKDMVLLGDLALSNKLVVYDLENQV 440

Query: 364 VSFKPTDCT 372
           + +   +C+
Sbjct: 441 IGWTNYNCS 449


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 100/380 (26%), Positives = 155/380 (40%), Gaps = 87/380 (22%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q ++I  S     D I  N  + + IS+GTP    L   DTGS + W QC+ C    CY 
Sbjct: 3   QAANIPDSAVIGDDSIRKN-QFFMGISLGTPAVFNLVTIDTGSTISWVQCQYC-IVHCYT 60

Query: 130 QDS---PLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV-----NCQYSVSYGDGSFSN 179
           QD    P F+   SSTY+ + CS+  C  ++  Q   SG      +C YS+ Y  G +S 
Sbjct: 61  QDQRAGPTFNTSSSSTYRRVGCSAQVCHDMHVSQNIPSGCVEEEDSCIYSLRYASGEYSA 120

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT- 238
           G L+ + +TL ++     ++    FGCG++N   +N  + GI+G G    S  +Q+    
Sbjct: 121 GYLSQDRLTLANS----YSIQKFIFGCGSDN--RYNGHSAGIIGFGNKSYSFFNQIAQLT 174

Query: 239 --------IAGNQR------------------------LGVSTP-------DIVIDSGTT 259
                      NQ                          G   P       D++++ G  
Sbjct: 175 NYSAFSYCFPSNQENEGFLSIGPYVRDSNKLILTQLFDYGAHLPVYALQQFDMMVN-GMR 233

Query: 260 LTFLPQGYNSNLLSVMSSMIE--------------------AQPVADPTGSLELCYSFN- 298
           L   P  Y + +  V S  +E                    A+     + S E+C+  N 
Sbjct: 234 LQVDPPVYTTRMTVVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKEICFHSNG 293

Query: 299 ---SLSQVPEVTIHFRGADVKLSRSN-FFVKVSEDIVCSVFKGITNSVP---IYGNIMQT 351
                S++P V I F  + +KL   N F+ + S+  +CS F+     VP   I GN    
Sbjct: 294 DSVDWSKLPVVEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATR 353

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
           +F V +DI+Q+   F+   C
Sbjct: 354 SFRVVFDIQQRNFGFEAGAC 373


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 142/358 (39%), Gaps = 88/358 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
           Y  ++ +GTPP       DTGSDL+W  C PC     +     P+  +D K S++   +P
Sbjct: 36  YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           CS   C  + Q S SG N    C YS  YGDGS + G L  + +          A   + 
Sbjct: 96  CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT----------IAGNQRLG---- 246
           FGCG    G  ++      GI+G G  D+S  SQ+             + G +R G    
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210

Query: 247 ---VSTPDI-----------------------------------------VIDSGTTLTF 262
              V  PDI                                         + DSGTTL +
Sbjct: 211 LGNVIEPDIQYTPLVPYMSHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAY 270

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
           LP          +S ++    + D   S  +   F      P V ++F GA + L+ + +
Sbjct: 271 LPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTPAEY 324

Query: 323 FVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            ++     +  I C  ++ + ++       I+G+++  N LV YD+E+  + ++P DC
Sbjct: 325 LIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 142/358 (39%), Gaps = 88/358 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ-DSPL--FDPKMSSTYKSLP 147
           Y  ++ +GTPP       DTGSDL+W  C PC     +     P+  +D K S++   +P
Sbjct: 36  YFTQVQLGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVP 95

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           CS   C  + Q S SG N    C YS  YGDGS + G L  + +          A   + 
Sbjct: 96  CSDPSCTLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHY-----MVNATATVI 150

Query: 204 FGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT----------IAGNQRLG---- 246
           FGCG    G  ++      GI+G G  D+S  SQ+             + G +R G    
Sbjct: 151 FGCGFKQSGDLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHCLDGGERGGGILV 210

Query: 247 ---VSTPDI-----------------------------------------VIDSGTTLTF 262
              V  PDI                                         + DSGTTL +
Sbjct: 211 LGNVIEPDIQYTPLVPYMYHYNVVLQSISVNNANLTIDPKLFSNDVMQGTIFDSGTTLAY 270

Query: 263 LPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSRSNF 322
           LP          +S ++    + D   S  +   F      P V ++F GA + L+ + +
Sbjct: 271 LPDEAYQAFTQAVSLVVAPFLLCDTRLSRFIYKLF------PNVVLYFEGASMTLTPAEY 324

Query: 323 FVK----VSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            ++     +  I C  ++ + ++       I+G+++  N LV YD+E+  + ++P DC
Sbjct: 325 LIRQASAANAPIWCMGWQSMGSAESELQYTIFGDLVLKNKLVVYDLERGRIGWRPFDC 382


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score = 97.8 bits (242), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 59/179 (32%), Positives = 85/179 (47%), Gaps = 9/179 (5%)

Query: 30  SVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKA-SQADIIPNN 88
           S  LIH  S  SPF   + T    + + +    NRL    + S  S   A +   +   +
Sbjct: 53  SFPLIHIYSECSPFRPPNRTWESLMSEKIRGDANRLRFLKRTSRSSKEDANANVPVRSGS 112

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y+I++  GTP      + DTGSD+ W  C+ C   Q     +P+FDP  SS+YK   C
Sbjct: 113 GEYIIQVDFGTPKQSMYTLIDTGSDVAWIPCKQC---QGCHSTAPIFDPAKSSSYKPFAC 169

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            S  C  ++        CQ+ V YGDG+  +G LA++ +TLGS       LP  +FGC 
Sbjct: 170 DSQPCQEISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITLGSQ-----YLPNFSFGCA 223



 Score = 63.5 bits (153), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/146 (29%), Positives = 67/146 (45%), Gaps = 12/146 (8%)

Query: 235 MRTTIAGNQRLGVSTPDI------VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADP 287
           ++    GN R+ V   +I      +IDSGTT+T+L P  Y     +    +   QP   P
Sbjct: 306 LKAISVGNTRISVPATNIASGGGTIIDSGTTITYLVPSAYKDLRDAFRQQLSSLQPT--P 363

Query: 288 TGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIY 345
              ++ CY  +S S  VP +T+H  R  D+ L + N  +     + C  F   T+S  I 
Sbjct: 364 VEDMDTCYDLSSSSVDVPTITLHLDRNVDLVLPKENILITQESGLSCLAFSS-TDSRSII 422

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDC 371
           GN+ Q N+ + +D+    V F    C
Sbjct: 423 GNVQQQNWRIVFDVPNSQVGFAQEQC 448


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 97.4 bits (241), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 65/338 (19%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD-SPLFDPKMSSTYKSLPCS 149
           ++  I  G+P  ++    DTGS L WTQC PC  S CY Q   P + P  S TY+   C 
Sbjct: 58  FMAEIHFGSPQKKQFLHMDTGSSLTWTQCFPC--SDCYAQKIYPKYRPAASITYRDAMCE 115

Query: 150 SSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            S   S    +   +   C Y   Y D +   G LA E +T+ +  G    + G+ FGC 
Sbjct: 116 DSHPKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCN 175

Query: 208 T-NNGGLFNSKTTGIVGLGGGDISLIS--------------------------------- 233
           T ++G  F    TGI+GLG G  S+I                                  
Sbjct: 176 TLSDGSYFTG--TGILGLGVGKYSIIGEFGSKFSFCLGEISEPKASHNLILGDGANVQGH 233

Query: 234 -------------QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
                        Q+ + I G +        + +D+G+TL+ L        +     +I 
Sbjct: 234 PTVINITEGHTIFQLESIIVGEEITLDDPVQVFVDTGSTLSHLSTNLYYKFVDAFDDLIG 293

Query: 281 AQPVA-DPTGSLELCYSFNSLSQVPEVTIHFR---GADVKLSRSNFFVKVS-EDIVCSVF 335
           ++P++ +PT    LCY  +++ ++ ++ + F+   GA++ ++  N F++    +I C   
Sbjct: 294 SRPLSYEPT----LCYKADTIERLEKMDVGFKFDVGAELSVNIHNIFIQQGPPEIRCLAI 349

Query: 336 KGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +    S    I G I    + VGYD+  +T      DC
Sbjct: 350 QNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYINKQDC 387


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score = 97.4 bits (241), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 116/245 (47%), Gaps = 15/245 (6%)

Query: 4   FLSCVFILFFLCFYVVSP----IEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALT 59
           F+ C+  L  LCF    P    ++    GF V L+H  S +SPFY  + T  +  + ++ 
Sbjct: 9   FMICIQTL--LCFSSSLPDHVLLKDNRLGFKVPLLHWLSTESPFYEPNLTLAELTQASIR 66

Query: 60  RSLNR---LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
            S  R   +      +  SS K   + +   +  Y+++ SIG+P  +  A+ D+GS L+W
Sbjct: 67  TSGARGDSIRSIMSGNITSSMKYPISRMSYTDKAYVMKFSIGSPAVDTYAIPDSGSSLVW 126

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC-ASLNQK--SCSGVN--CQYSVS 171
            QC       CY Q  PLF+P  S TY    C++++C  +L  +   C   N  C+Y   
Sbjct: 127 LQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTAECRVALGDEYWRCKKPNQICKYHED 186

Query: 172 YGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDIS 230
           Y D S++ G ++T+  T     +G       I FGCG NN    +    G+VGL     S
Sbjct: 187 YLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIFGCGYNNSDPQHFYPPGLVGLTNNKAS 246

Query: 231 LISQM 235
           L+ QM
Sbjct: 247 LVGQM 251


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 71/221 (32%), Positives = 109/221 (49%), Gaps = 36/221 (16%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADI 84
           +  G  +EL H D+ ++    S+E   +R+R A  R+  RL    + S+      SQ   
Sbjct: 20  RAAGLRLELTHVDAKQN---CSTE---ERMRRATERTHRRLASMGEASAPVHWAESQ--- 70

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
                 Y+    IG PP +  A+ DTGS+LIWTQC  C P+ C+ Q+   +DP  S T +
Sbjct: 71  ------YIAEYLIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTAR 124

Query: 145 SLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
            + C+ + CA  ++  C+  N  C    +YG G    G L TE  T    + + V+L   
Sbjct: 125 PVACNDTACALGSETRCARDNKACAVLTAYGAGVI-GGVLGTEAFTFQPQS-ENVSL--- 179

Query: 203 TFGC--------GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            FGC        G+ +G       +GI+GLG G++SL+SQ+
Sbjct: 180 AFGCIAATRLTPGSLDG------ASGIIGLGRGNLSLVSQL 214



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 40/135 (29%), Positives = 64/135 (47%), Gaps = 15/135 (11%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF---NSLSQVPEVT 307
           +IDSG+  T L       L   +   + A  V  P G+  L+LC +    +    VP + 
Sbjct: 304 LIDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGAEGLDLCAAVAHGDVGKLVPPLV 363

Query: 308 IHFR--GADVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGNIMQTNFLVGY 357
           +HF   G DV +   N++  V +   C V     G  +++P     I GN MQ +  + Y
Sbjct: 364 LHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLY 423

Query: 358 DIEQQTVSFKPTDCT 372
           D+E+  +SF+P DC+
Sbjct: 424 DLEKGMLSFQPADCS 438


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 71/214 (33%), Positives = 101/214 (47%), Gaps = 21/214 (9%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  +EL H D+ ++       T  +R+R A  R+  RL      S       + A I  N
Sbjct: 32  GLRLELTHVDAKQN------CTTKERMRRATERTHRRLA-----SMAGGGGEASAPIHWN 80

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y+    IG PP +  A+ DTGS+LIWTQC  C  + C+ QD   +DP  S T K + 
Sbjct: 81  ETQYIAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVA 140

Query: 148 CSSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+ + C   ++  C+  G  C    +YG G+   G L TE  T G        +  + FG
Sbjct: 141 CNDTACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFTFGHGQSSENNV-SLAFG 198

Query: 206 CGTNN----GGLFNSKTTGIVGLGGGDISLISQM 235
           C T +    G L     +GI+GLG G +SL SQ+
Sbjct: 199 CITASRLTPGSL--DGASGIIGLGRGKLSLPSQL 230



 Score = 46.6 bits (109), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 39/139 (28%), Positives = 62/139 (44%), Gaps = 19/139 (13%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSF----NSLSQVPEV 306
           +IDSG+  T L       L   +   + A  V  P G+  L+LC       ++   VP +
Sbjct: 320 LIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGAEGLDLCVGGVAPGDAGKLVPPL 379

Query: 307 TIHFRGA-----DVKLSRSNFFVKVSEDIVCSVF---KGITNSVP-----IYGNIMQTNF 353
            +HF        DV +   N++  V +   C V     G  +++P     I GN MQ + 
Sbjct: 380 VLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQQDM 439

Query: 354 LVGYDIEQQTVSFKPTDCT 372
            + YD+ Q  +SF+P DC+
Sbjct: 440 HLLYDLGQGVLSFQPADCS 458


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 95/392 (24%), Positives = 163/392 (41%), Gaps = 88/392 (22%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
           L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD++W
Sbjct: 40  LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99

Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
             C+PCP  P++  +     LFD   SST K + C    C+ ++Q  SC   + C Y + 
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
           Y D S S+G    + +TL   TG     P    + FGCG++  G     +S   G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219

Query: 226 GGDISLISQMRTT-------------IAGNQRLGVSTPD--------------------- 251
             + S++SQ+  T             + G     V   D                     
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279

Query: 252 --------------IVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSL 291
                         IV + G      TTL + P+    +L+    +++  QPV       
Sbjct: 280 GMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEE 336

Query: 292 EL-CYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----N 340
              C+SF  N     P V+  F  + VKL+    ++   + E++ C  ++  G+T    +
Sbjct: 337 TFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            V + G+++ +N LV YD++ + + +   +C+
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGWADHNCS 427


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 94/360 (26%), Positives = 143/360 (39%), Gaps = 79/360 (21%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   + +GTPP       DTGSD++W     C+ CP       D  L+DPK SST  ++ 
Sbjct: 88  YYTEVRLGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVM 147

Query: 148 CSSSQCASL---NQKSCSG-VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA         CS  V C+YSV+YGDGS + G+   + +     TG     P   
Sbjct: 148 CDQGFCADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANA 207

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            + FGCG   GG   S +    GI+G G  + S++SQ+ T             TI G   
Sbjct: 208 SVIFGCGAQQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGI 267

Query: 245 LGV---------STPDIV-----------IDSG------------------------TTL 260
             +         +TP +            ID G                        TTL
Sbjct: 268 FAIGDVVQPKVKTTPLVADKPHYNVNLKTIDVGGTTLELPADIFKPGEKRGTIIDSGTTL 327

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR- 319
           T+LP+     ++  + +  +     D    L   YS +     P +T HF   D+ L   
Sbjct: 328 TYLPELVFKKVMLAVFNKHQDITFHDVQDFLCFEYSGSVDDGFPTLTFHFE-DDLALHVY 386

Query: 320 -SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
              +F     D+ C  F+ G   S     + + G+++ +N LV YD+E + + +   +C+
Sbjct: 387 PHEYFFPNGNDVYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENRVIGWTDYNCS 446


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 51/148 (34%), Positives = 78/148 (52%), Gaps = 15/148 (10%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           ++ +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  S
Sbjct: 141 ESGVAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFEQRGPVFDPAAS 198

Query: 141 STYKSLPCSSSQCASL---------NQKSCSGVN---CQYSVSYGDGSFSNGNLATETVT 188
           S+Y+++ C   +C  +         + ++C       C Y   YGD S + G+LA E+ T
Sbjct: 199 SSYRNVTCGDHRCGHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFT 258

Query: 189 LGSTT-GQAVALPGITFGCGTNNGGLFN 215
           +  T  G +  + G+ FGCG  N GLF+
Sbjct: 259 VNLTAPGASRRVDGVVFGCGHRNRGLFH 286



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 44/158 (27%), Positives = 75/158 (47%), Gaps = 18/158 (11%)

Query: 234 QMRTTIAGNQRLGVS--TPDI--------VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQ 282
           +++  + G + L +S  T D+        +IDSGTTL+ F+   Y     + M  M  + 
Sbjct: 367 KLKGVLVGGELLNISSDTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSRSY 426

Query: 283 PVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNFFVKVSED---IVCSVFK 336
           P+      L  CY+ + +   +VPE+++ F  GA       N+F+++  D   I+C    
Sbjct: 427 PLVPEFPVLSPCYNVSGVERPEVPELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVL 486

Query: 337 GITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           G   + + I GN  Q NF V YD++   + F P  C +
Sbjct: 487 GTPRTGMSIIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524


>gi|125575538|gb|EAZ16822.1| hypothetical protein OsJ_32294 [Oryza sativa Japonica Group]
          Length = 392

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 56/147 (38%), Positives = 82/147 (55%), Gaps = 11/147 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C  S+C+ QD+PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--SRCFEQDTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
           +  C S+  + ++CSG  C Y  S   G  + G + T+T  +G+      A   + FGC 
Sbjct: 108 TPLCESIPSDSRNCSGNVCAYQASTNAGD-TGGKVGTDTFAVGT------AKASLAFGCV 160

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
             +        +GIVGLG    SL++Q
Sbjct: 161 VASDIDTMGGPSGIVGLGRTPWSLVTQ 187


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/417 (25%), Positives = 154/417 (36%), Gaps = 113/417 (27%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 87  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 127

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTPP +   + DTGS + WTQC+ C    C       FD   SSTY    C  S
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKAC--VHCLKDSHRHFDSLASSTYSFGSCIPS 185

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 186 T-----------VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNE 230

Query: 212 G--------------------------------------------LFNSKTT-------- 219
           G                                            LF  K T        
Sbjct: 231 GDFGSGADGMLGLGQGQLSTVSQTASKFKKVFSYCLPEENSIGSLLFGEKATSQSSSLKF 290

Query: 220 -------GIVGLGGGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGY 267
                  G  GL       +  +  ++ GN+RL +     ++P  +IDSGT +T LPQ  
Sbjct: 291 TSLVNGPGTSGLEESGYYFVKLLDISV-GNKRLNIPSSVFASPGTIIDSGTVITRLPQRA 349

Query: 268 NSNLLSVMSSMIEAQPVAD----PTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRS 320
            S L +     +   P+++        L+ CY+ +    V  PE  +HF  GADV+L+  
Sbjct: 350 YSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFGDGADVRLNGK 409

Query: 321 NFFVKVSEDIVCSVFKGITNS-----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                     +C  F G + S     + I GN  Q +  V YDI  + + F    C+
Sbjct: 410 RVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIGFGGNGCS 466


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 54/157 (34%), Positives = 83/157 (52%), Gaps = 13/157 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y++ +SIGTPP    A+ DTGSDL+W +C+ C           +F    SS+YK LP
Sbjct: 2   EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
           C+S+ C+ +   S +G+       C+Y   YGDGS ++G++ ++ ++    G+       
Sbjct: 62  CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             G  FGCG    G +N  T G++GLG    SLI Q+
Sbjct: 119 FDGFLFGCGRKLKGDWNF-TQGLIGLGQKSHSLIQQL 154



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 55/118 (46%), Gaps = 6/118 (5%)

Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIH 309
           VIDSGTT T L P  Y +   S+   +I   P    +  L+LC+  S ++    P VT +
Sbjct: 244 VIDSGTTYTLLTPPVYEAMRKSIEEQVI--LPTLGNSAGLDLCFNSSGDTSYGFPSVTFY 301

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           F     + L   N F   S D+VC         + I GN+ Q NF + YD+    +SF
Sbjct: 302 FANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 83/392 (21%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R  L R  +RL H        SS A     D +  N  Y  R+ IG+PP E   + DTGS
Sbjct: 52  RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SSTY+ + C++      N     GV C Y   Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGL------- 224
            + S S+G LA + ++ G  +   +      FGC T  +G L+  +  GI+GL       
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221

Query: 225 -----------------------GGGDISL-----------------------ISQMRTT 238
                                  GGG + L                       I      
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281

Query: 239 IAGNQ-RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 291
           +AG   +L   T D     ++DSGTT  + P+  Y +   ++M  +   + ++ P  +  
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341

Query: 292 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 341
           ++C+S        L +V PEV + F  G  + LS  N+     KVS      +FK   + 
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             + G I+  N LV Y+ E  T+ F  T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 61/158 (38%), Positives = 79/158 (50%), Gaps = 19/158 (12%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y +  S+GTP  +   + DTGSDL + QC PC    CY QD PL+ P  SST+  +P
Sbjct: 31  SGQYFVDFSLGTPEQKFHLIVDTGSDLAFVQCAPC--DLCYEQDGPLYQPSNSSTFTPVP 88

Query: 148 CSSSQ-----------CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           C S++           C+S   +S     C Y   YGD S + G  A ET T+G      
Sbjct: 89  CDSAECLLIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATVGGIRVNH 148

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           VA     FGCG  N G F S   G++GLG G +S  SQ
Sbjct: 149 VA-----FGCGNRNQGSFVS-AGGVLGLGQGALSFTSQ 180



 Score = 62.8 bits (151), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 68/124 (54%), Gaps = 6/124 (4%)

Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIH 309
           + DSGTT+T+  PQ Y   + +   S+   +    P G L LC + + +     P  TI 
Sbjct: 266 IFDSGTTVTYWSPQAYARIIAAFEKSVPYPRAPPSPQG-LPLCVNVSGIDHPIYPSFTIE 324

Query: 310 F-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
           F +GA  + ++ N+F++VS +I C ++ +  ++   + GNI+Q N+LV YD E+  + F 
Sbjct: 325 FDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEHRIGFA 384

Query: 368 PTDC 371
             +C
Sbjct: 385 HANC 388


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 71/231 (30%), Positives = 104/231 (45%), Gaps = 30/231 (12%)

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C Y+++YGDGSF+ G L  E +  G+     + +    FGCG NN GLF    +G++GLG
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKFGT-----ILVKDFIFGCGRNNKGLFGG-VSGLMGLG 186

Query: 226 GGDISLISQMRT---------------TIAGN--QRLGVSTPDIVIDSGTTLTFLPQGYN 268
             D+SLISQ                  +I G   Q   V    I++DSGT +T LP    
Sbjct: 187 RSDLSLISQTSENPQLYNFYFINLTGISIGGVALQAPSVGPSRILVDSGTVITRLPPTIY 246

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV--PEVTIHFRG---ADVKLSRSNFF 323
             L +         P A     L+ C++ ++  +V  P + +HF G     V ++   +F
Sbjct: 247 KALKAEFLKQFTGFPPAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYF 306

Query: 324 VKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           VK     VC     +   + V I GN  Q N  V YD ++  V F    C+
Sbjct: 307 VKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357


>gi|212721496|ref|NP_001131929.1| uncharacterized protein LOC100193320 precursor [Zea mays]
 gi|194692946|gb|ACF80557.1| unknown [Zea mays]
          Length = 424

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 96/384 (25%), Positives = 155/384 (40%), Gaps = 91/384 (23%)

Query: 71  NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYM 129
           +SSI++      D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C  
Sbjct: 38  SSSIAAVFPLYGDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNE 94

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNL 182
              PL+ P  S   K +PC    CASL+     +  C   +  C Y + Y D   S G L
Sbjct: 95  VPHPLYRPTKS---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVL 151

Query: 183 ATETVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR--- 236
             ++  L  T G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++   
Sbjct: 152 INDSFALRLTNG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRG 210

Query: 237 --------------------------------TTIA-----------------GNQRLGV 247
                                           T +A                 G++ LGV
Sbjct: 211 VTKNVVGHCLSLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGV 270

Query: 248 STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQV 303
               +V DSG++ T+        L++ +   +      +P  SL LC+     F S+  V
Sbjct: 271 RLAKVVFDSGSSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDV 330

Query: 304 PE----VTIHFRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIM 349
            +    + ++F       +++   N+ +       C    GI N        + I G+I 
Sbjct: 331 RKEFKSLVLNFASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDIT 387

Query: 350 QTNFLVGYDIEQQTVSFKPTDCTK 373
             + +V YD E+  + +    C +
Sbjct: 388 MQDHMVIYDNEKGKIGWIRAPCDR 411


>gi|226509616|ref|NP_001152116.1| aspartic proteinase Asp1 precursor [Zea mays]
 gi|195652765|gb|ACG45850.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 432

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 93/374 (24%), Positives = 148/374 (39%), Gaps = 92/374 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 56  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 112

Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--------CQYSVSYGDGSFSNGNLATETVTLGST 192
              K +PC    CASL+     G +        C Y + Y D   S G L  ++  L  T
Sbjct: 113 ---KLVPCVHRLCASLHNALTGGKHRCESPHEQCDYVIKYADQGSSTGVLVNDSFALRLT 169

Query: 193 TGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR------------- 236
            G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++             
Sbjct: 170 NG-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCL 228

Query: 237 ----------------------TTIA-----------------GNQRLGVSTPDIVIDSG 257
                                 T +A                 G++ LGV    +V DSG
Sbjct: 229 SLRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSG 288

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIH 309
           ++ T+        L++ +   +      +P  SL LC+     F S+  V +    + ++
Sbjct: 289 SSFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLN 348

Query: 310 FRGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 359
           F       +++   N+ +       C    GI N        + I G+I   + +V YD 
Sbjct: 349 FASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDN 405

Query: 360 EQQTVSFKPTDCTK 373
           E+  + +    C +
Sbjct: 406 EKGKIGWIRAPCDR 419


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 103/392 (26%), Positives = 158/392 (40%), Gaps = 83/392 (21%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R  L R  +RL H        SS A     D +  N  Y  R+ IG+PP E   + DTGS
Sbjct: 52  RRVLDRD-HRLRHLQNLVKPHSSNARMRLHDDLLTNGYYTTRLWIGSPPQEFALIVDTGS 110

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SSTY+ + C++      N     GV C Y   Y
Sbjct: 111 TVTYVPCSNC--VQCGNHQDPRFQPELSSTYQPVKCNADCNCDEN-----GVQCTYERRY 163

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGL------- 224
            + S S+G LA + ++ G  +   +      FGC T  +G L+  +  GI+GL       
Sbjct: 164 AEMSTSSGVLAEDVMSFGKES--ELVPQRAVFGCETMESGDLYTQRADGIMGLGRGTLSV 221

Query: 225 -----------------------GGGDISL-----------------------ISQMRTT 238
                                  GGG + L                       I      
Sbjct: 222 MDQLVGKGVVSNSFSLCYGGMDVGGGAMVLGGISSPPGMVFSHSDPSRSPYYNIELKEIH 281

Query: 239 IAGNQ-RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL- 291
           +AG   +L   T D     ++DSGTT  + P+  Y +   ++M  +   + ++ P  +  
Sbjct: 282 VAGKPLKLNPRTFDGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQISGPDPNFK 341

Query: 292 ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNS 341
           ++C+S        L +V PEV + F  G  + LS  N+     KVS      +FK   + 
Sbjct: 342 DICFSGAGRDVTELPKVFPEVDMVFANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQ 401

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             + G I+  N LV Y+ E  T+ F  T+C++
Sbjct: 402 TTLLGGIIVRNTLVTYNRENSTIGFWKTNCSE 433


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 100/206 (48%), Gaps = 16/206 (7%)

Query: 42  PFYNSSETPYQRLRDAL-------TRSLNRLNHFNQNS-SISSSKASQADIIPNNANYLI 93
           PF+N  E P      +          SL   +H ++N  S+  + +    I    +N+L+
Sbjct: 130 PFHNQEEFPQTFSSSSSFKLKLYPAASLYNTHHQHKNYYSLDLNASLNPGITTGTSNFLV 189

Query: 94  RISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQC 153
           +I +G PP +   + D  +D  W QC+PC   +CY Q   +FDP  SS+Y  L C +  C
Sbjct: 190 QIGVGGPPQKFYMIFDLQTDFTWLQCQPCI--KCYDQPDSIFDPSQSSSYTLLSCETKHC 247

Query: 154 ASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
             L   SCS    C+Y+++Y DG+ + G L  ETV+  S+      +  ++ GC   N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSFESSG----WVDRVSLGCSNKNQG 303

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTT 238
            F   + G  GLG G +S  S++  +
Sbjct: 304 PF-VGSDGTFGLGRGSLSFPSRINAS 328


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 86/361 (23%), Positives = 146/361 (40%), Gaps = 86/361 (23%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
            Y +++ +GTP  E   VADTGSDL W +C    PP +       +F PK S ++  +PC
Sbjct: 115 QYFVKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGR-------VFRPKTSRSWAPIPC 167

Query: 149 SSSQCA-----SLNQKSCSGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGI 202
           SS  C      +L   S     C Y   Y +GS  + G + TE+ T+    G+   L  +
Sbjct: 168 SSDTCKLDVPFTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDV 227

Query: 203 TFGCGTNNGG-----------LFNSK---------------------------TTGIVGL 224
             GC +++ G           L N+K                            TG +  
Sbjct: 228 VLGCSSSHDGQSFRSADGVLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATGYLAF 287

Query: 225 GGGDISLISQMRTT------------------IAGN------QRLGVSTPDIVIDSGTTL 260
           G G +      +T                   +AG       +     +  +++DSG TL
Sbjct: 288 GPGQVPRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWDAKSGGVILDSGNTL 347

Query: 261 TFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFNSLSQ-----VPEVTIHFRG-A 313
           T L       +++ +S  ++  P V+ P    E CY++ +        +P++ + F G A
Sbjct: 348 TVLAAPAYKAVVAALSKHLDGVPKVSFPP--FEHCYNWTARRPGAPEIIPKLAVQFAGSA 405

Query: 314 DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            ++    ++ + V   + C  V +G    + + GNIMQ   L  +D++   V FK ++CT
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465

Query: 373 K 373
           +
Sbjct: 466 R 466


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 102/358 (28%), Positives = 156/358 (43%), Gaps = 78/358 (21%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +GTP  +     DTGSD++W  C     CP     ++ +P +D   SST KS+ 
Sbjct: 85  YFAKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTP-YDVDASSTAKSVS 143

Query: 148 CSSSQCASLNQKS-C-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG-I 202
           CS + C+ +NQ+S C SG  CQY + YGDGS +NG L  + V L   TG  Q  +  G I
Sbjct: 144 CSDNFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTI 203

Query: 203 TFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLGV-S 248
            FGCG+   G      +   GI+G G  + S ISQ+ +           +  N   G+ +
Sbjct: 204 IFGCGSKQSGQLGESQAAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCLDNNNGGGIFA 263

Query: 249 TPDIVIDSGTTLTFLPQ----GYNSNLLSVMSSMIEAQPVADPTGSLE---------LCY 295
             ++V     T   L +      N N + V +S++E    A  +G  +         L Y
Sbjct: 264 IGEVVSPKVKTTPMLSKSAHYSVNLNAIEVGNSVLELSSNAFDSGDDKGVIIDSGTTLVY 323

Query: 296 SFNS---------LSQVPEVTIH--------FRGADVKLSR------------------S 320
             ++         L+  PE+T+H        F   D KL R                   
Sbjct: 324 LPDAVYNPLLNEILASHPELTLHTVQESFTCFHYTD-KLDRFPTVTFQFDKSVSLAVYPR 382

Query: 321 NFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            +  +V ED  C  ++  G+      S+ I G++  +N LV YDIE Q + +   +C+
Sbjct: 383 EYLFQVREDTWCFGWQNGGLQTKGGASLTILGDMALSNKLVVYDIENQVIGWTNHNCS 440


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 143/363 (39%), Gaps = 84/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W    QC+ CP       D  L+D K SS+ K +P
Sbjct: 83  YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVP 142

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C    C  +N    +G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 143 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 202

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQ---------------------- 234
            I FGCG    G  +S       GI+G G  + S+ISQ                      
Sbjct: 203 SIVFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 262

Query: 235 ---------------------------MRTTIAGNQRLGVST--------PDIVIDSGTT 259
                                      M     G+  L +ST           +IDSGTT
Sbjct: 263 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHTFLSLSTDTSAQGDRKGTIIDSGTT 322

Query: 260 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADV 315
           L +LP+G    L+  M S    ++ Q + D     +  YS +     P VT  F  G  +
Sbjct: 323 LAYLPEGIYEPLVYKMISQHPDLKVQTLHDEYTCFQ--YSESVDDGFPAVTFFFENGLSL 380

Query: 316 KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           K+   ++    S +  C  ++        + ++ + G+++ +N LV YD+E Q + +   
Sbjct: 381 KVYPHDYLFP-SVNFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQAIGWAEY 439

Query: 370 DCT 372
           +C+
Sbjct: 440 NCS 442


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 74/158 (46%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I +GTPP       DTGSD++W     C  CP       D   +DPK SS+  ++ 
Sbjct: 87  YFTEIKLGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVS 146

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+       G    V C+YSV YGDGS + G   T+ +     TG     PG  
Sbjct: 147 CDQGFCAATYGGKLPGCTANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNA 206

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            ITFGCG   GG     N    GI+G G  + S++SQ+
Sbjct: 207 TITFGCGAQQGGDLGNSNQALDGILGFGQANTSMLSQL 244



 Score = 46.6 bits (109), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 37/128 (28%), Positives = 60/128 (46%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           +IDSGTTLT+LP+     ++ V+ S        +    L   YS +     P +T HF  
Sbjct: 335 IIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDFLCFQYSGSVDDGFPTITFHFE- 393

Query: 313 ADVKLSR--SNFFVKVSEDIVCSVFK-GITNS-----VPIYGNIMQTNFLVGYDIEQQTV 364
            D+ L      +F     DI C  F+ G   S     + + G+++ +N LV YD+E Q +
Sbjct: 394 DDLALHVYPHEYFFPNGNDIYCVGFQNGALQSKDGKDIVLMGDLVLSNKLVVYDLENQVI 453

Query: 365 SFKPTDCT 372
            +   +C+
Sbjct: 454 GWTDYNCS 461


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 68/203 (33%), Positives = 99/203 (48%), Gaps = 21/203 (10%)

Query: 50  PYQRLRDALTRSLNRLNHF----NQNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
           P+     AL+   +RL+ F    +   S+ S   S A     +  Y + + +GTPP + L
Sbjct: 46  PFTTPSQALSFDSHRLSFFFSALHTPQSLKSPVVSGAST--GSGQYFVDLRLGTPPQKLL 103

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL---NQKSCS 162
            VADTGSDL+W +C  C     +   S  F  + S+T+    C  S C  +       C+
Sbjct: 104 LVADTGSDLVWVKCSACRNCTRHTPGS-AFLARHSTTFSPNHCYDSACQLVPLPKHHRCN 162

Query: 163 GVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN------NGG 212
                  C+Y  SYGDGS ++G  + ET TL +++G+   L GI FGC         +G 
Sbjct: 163 HARLHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFGCAFRISGPSVSGA 222

Query: 213 LFNSKTTGIVGLGGGDISLISQM 235
            FN    G++GLG G ISL SQ+
Sbjct: 223 SFNG-AHGVMGLGRGPISLSSQL 244



 Score = 64.7 bits (156), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 64/124 (51%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           ++DSGTTLTFLP+     +L+V+   +     A+PT   +LC + + +   ++P+++   
Sbjct: 332 IVDSGTTLTFLPEPAYLQILTVIKRRVRLPSPAEPTPGFDLCVNVSEIEHPRLPKLSFKL 391

Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKGIT--NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G  V      N+FV   ED+ C   + +   +   + GN+MQ  FL+ +D ++  + F 
Sbjct: 392 GGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQQGFLLEFDKDRTRLGFS 451

Query: 368 PTDC 371
              C
Sbjct: 452 RHGC 455


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 64/229 (27%), Positives = 111/229 (48%), Gaps = 16/229 (6%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A+ GG   + IH  +P+S    +        +     S +      +N +  + ++S   
Sbjct: 35  ARGGGIGFKAIHVAAPQSRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +      Y   I +G+P  E + + DTGS+L W QC PC    C      ++D   S++Y
Sbjct: 93  LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLQCLPC--KVCAPSVDTIYDAARSASY 150

Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
           + + C++SQ C++ +Q +      G  CQ++  YGDGSFS G+L+T+T+ + +   G+ V
Sbjct: 151 RPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
            +    FGC   +  L  +  +GI+GL  G ++L  Q+       QR G
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQL------GQRFG 253


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 160/386 (41%), Gaps = 88/386 (22%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPPTERLAVADTGSDLIW 116
           L HF  + +   S+   +  +P   +        Y  +I +G+PP E     DTGSD++W
Sbjct: 40  LEHFKSHDTRRHSRMLASIDLPLGGDSRVDSVGLYFTKIKLGSPPKEYHVQVDTGSDILW 99

Query: 117 TQCEPCP--PSQCYMQ-DSPLFDPKMSSTYKSLPCSSSQCASLNQ-KSCS-GVNCQYSVS 171
             C+PCP  P++  +     LFD   SST K + C    C+ ++Q  SC   + C Y + 
Sbjct: 100 INCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCDDDFCSFISQSDSCQPALGCSYHIV 159

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNGGLF---NSKTTGIVGLG 225
           Y D S S+G    + +TL   TG     P    + FGCG++  G     +S   G++G G
Sbjct: 160 YADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVFGCGSDQSGQLGNGDSAVDGVMGFG 219

Query: 226 GGDISLISQMRTT-------------IAGNQRLGVSTPD--------------------- 251
             + S++SQ+  T             + G     V   D                     
Sbjct: 220 QSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLM 279

Query: 252 --------------IVIDSG------TTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGS 290
                         IV + G      TTL + P+    +L+    +++  QPV       
Sbjct: 280 GMDVDGTSLDLPRSIVRNGGTIVDSGTTLAYFPKVLYDSLI---ETILARQPVKLHIVEE 336

Query: 291 LELCYSF--NSLSQVPEVTIHFRGADVKLSR--SNFFVKVSEDIVCSVFK--GIT----N 340
              C+SF  N     P V+  F  + VKL+    ++   + E++ C  ++  G+T    +
Sbjct: 337 TFQCFSFSTNVDEAFPPVSFEFEDS-VKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERS 395

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSF 366
            V + G+++ +N LV YD++ + + +
Sbjct: 396 EVILLGDLVLSNKLVVYDLDNEVIGW 421


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 145/363 (39%), Gaps = 84/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTPP       DTGSD++W    QC+ CP       D  L+D K SS+ K +P
Sbjct: 85  YYAKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVP 144

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C    C  +N    +G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 145 CDQEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANG 204

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQ---------------------- 234
            I FGCG    G  +S       GI+G G  + S+ISQ                      
Sbjct: 205 SIVFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCLNGVNGGG 264

Query: 235 ---------------------------MRTTIAGNQRLGVST--------PDIVIDSGTT 259
                                      M     G+  L +ST           +IDSGTT
Sbjct: 265 IFAIGHVVQPKVNMTPLLPDQPHYSVNMTAVQVGHAFLSLSTDTSTQGDRKGTIIDSGTT 324

Query: 260 LTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GADV 315
           L +LP+G Y   +  ++S    ++ + + D     +  YS +     P VT +F  G  +
Sbjct: 325 LAYLPEGIYEPLVYKIISQHPDLKVRTLHDEYTCFQ--YSESVDDGFPAVTFYFENGLSL 382

Query: 316 KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           K+   ++    S D  C  ++        + ++ + G+++ +N LV YD+E Q + +   
Sbjct: 383 KVYPHDYLFP-SGDFWCIGWQNSGTQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWTEY 441

Query: 370 DCT 372
           +C+
Sbjct: 442 NCS 444


>gi|238006986|gb|ACR34528.1| unknown [Zea mays]
 gi|413916290|gb|AFW56222.1| aspartic proteinase Asp1 [Zea mays]
          Length = 433

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 149/373 (39%), Gaps = 91/373 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C   +  C Y + Y D   S G L  ++  L  T 
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171

Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
           G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++              
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230

Query: 237 ---------------------TTIA-----------------GNQRLGVSTPDIVIDSGT 258
                                T +A                 G++ LGV    +V DSG+
Sbjct: 231 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGS 290

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
           + T+        L++ +   +      +P  SL LC+     F S+  V +    + ++F
Sbjct: 291 SFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCWKGQEPFKSVLDVRKEFKSLVLNF 350

Query: 311 RGAD---VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
                  +++   N+ +       C    GI N        + I G+I   + +V YD E
Sbjct: 351 ASGKKTLMEIPPENYLIVTENGNAC---LGILNGSEIGLKDLSIIGDITMQDHMVIYDNE 407

Query: 361 QQTVSFKPTDCTK 373
           +  + +    C +
Sbjct: 408 KGKIGWIRAPCDR 420


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 94.4 bits (233), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 84/363 (23%), Positives = 143/363 (39%), Gaps = 84/363 (23%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYK 144
               Y +++ +GTP  E   VADTGS+L W +C     PP         +F P+ S ++ 
Sbjct: 87  GTGQYFVKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGL-------VFRPEASKSWA 139

Query: 145 SLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSN-GNLATETVTLGSTTGQAVA 198
            +PCSS  C      SL   S S   C Y   Y +GS    G + T++ T+    G+   
Sbjct: 140 PVPCSSDTCKLDVPFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQ 199

Query: 199 LPGITFGCGTNNGG-----------LFNSK---------------------------TTG 220
           L  +  GC + + G           L N+K                            TG
Sbjct: 200 LQDVVLGCSSTHDGQSFKSVDGVLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATG 259

Query: 221 IVGLGGGDISLISQMRTTI-----------------AGNQRLGV-------STPDIVIDS 256
            +  G G +      +T +                    Q L +        +  +++DS
Sbjct: 260 YLAFGPGQVPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPKSGGVILDS 319

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS----LSQVPEVTIHFRG 312
           GTTLT L       +++ ++ ++   P  D     E CY++ +      ++P++ + F G
Sbjct: 320 GTTLTVLATPAYKAVVAALTKLLAGVPKVD-FPPFEHCYNWTAPRPGAPEIPKLAVQFTG 378

Query: 313 -ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            A ++    ++ + V   + C  + +G    V + GNIMQ   L  +D++   V F P+ 
Sbjct: 379 CARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMPST 438

Query: 371 CTK 373
           CT+
Sbjct: 439 CTR 441


>gi|297838267|ref|XP_002887015.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297332856|gb|EFH63274.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 324

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 85/303 (28%), Positives = 123/303 (40%), Gaps = 76/303 (25%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +I + IGTPP  +  V DTGS L W QC  +  PP     +    FDP +SS++ +LPCS
Sbjct: 75  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 129

Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
              C       +L     S   C YS  Y DG+F+ GNL  E +T  +T       P + 
Sbjct: 130 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 185

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFL 263
            GC T      +S   GI+G+  G +S +SQ + T     +     P             
Sbjct: 186 LGCATE-----SSDDRGILGMNRGRLSFVSQAKIT-----KFSYCIPP------------ 223

Query: 264 PQGYNSNLLSVMSSMIEAQPVADPTGSLEL-------CYSFNSLSQVPEVTIHFRGADVK 316
                SN           +P   PTGS  L        + + SL   PE        ++ 
Sbjct: 224 ----KSN-----------RPGFTPTGSFYLGDNPNSKGFKYVSLLTFPE------RVEIL 262

Query: 317 LSRSNFFVKVSEDIVC------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           + +    V V + I C      S+    +N   I GN+ Q N  V +D+  + V F   D
Sbjct: 263 VPKERVLVNVGDGIHCVGIGRSSMLGAASN---IIGNVHQQNLWVEFDVTNRRVGFARAD 319

Query: 371 CTK 373
           C++
Sbjct: 320 CSR 322


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 103/222 (46%), Gaps = 40/222 (18%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQ-NSSISSSKASQADIIP 86
           G  +EL H D+ +   Y   E    R+R A  R+  RL       + I     SQ     
Sbjct: 22  GIRLELTHVDAKE--HYTVEE----RVRRATERTHRRLASMGGVTAPIHWGGQSQ----- 70

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
               Y+    IG PP    A+ DTGS+LIWTQC  C P+ C+ Q+ P +DP  S   +++
Sbjct: 71  ----YIAEYLIGDPPQRAEAIIDTGSNLIWTQCSRCRPT-CFRQNLPYYDPSRSRAARAV 125

Query: 147 PCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C+ + CA  ++  C   N  C     YG G+ + G LATE +T  S T   V      F
Sbjct: 126 GCNDAACALGSETQCLSDNKTCAVVTGYGAGNIA-GTLATENLTFQSETVSLV------F 178

Query: 205 GC--------GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           GC        G+ NG       +GI+GLG G +SL SQ+  T
Sbjct: 179 GCIVVTKLSPGSLNGA------SGIIGLGRGKLSLPSQLGDT 214



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/135 (30%), Positives = 62/135 (45%), Gaps = 17/135 (12%)

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ-VPEVTIH 309
           IDSG  LT L       L + ++  + A   QP+A  TG  +LC +     + VP + +H
Sbjct: 305 IDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGTTG-FDLCVALKDAERLVPPLVLH 363

Query: 310 F-----RGADVKLSRSNFFVKVSEDIVCSV-FKGI------TNSVPIYGNIMQTNFLVGY 357
           F      G D+ +  +N++  V     C V F  +       N   + GN MQ N  V Y
Sbjct: 364 FGGGSGTGTDLVVPPANYWAPVDSATACMVVFSSVDRKSLPMNETTVIGNYMQQNMHVLY 423

Query: 358 DIEQQTVSFKPTDCT 372
           D+    +SF+P DC+
Sbjct: 424 DLAGGVLSFQPADCS 438


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 74/137 (54%), Gaps = 12/137 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C+ Q+ P +DPK SS+++++ C
Sbjct: 88  GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--HDCFEQNGPYYDPKESSSFRNIGC 145

Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATE--TVTLGSTTGQA--VA 198
              +C  ++       C   N  C Y   YGD S + G+ ATE  TV L S TG++    
Sbjct: 146 HDPRCHLVSSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKR 205

Query: 199 LPGITFGCGTNNGGLFN 215
           +  + FGCG  N GLF+
Sbjct: 206 VENVMFGCGHWNRGLFH 222


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 156/375 (41%), Gaps = 80/375 (21%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           QNS + +++    D + +N  Y  R+ IGTPP E   + DTGS + +  C  C   QC  
Sbjct: 56  QNSELPNARMRLFDDLLSNGYYTTRLFIGTPPQEFALIVDTGSTVTYVPCSSC--EQCGK 113

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
              P F P +SSTY+ + C+ S C   ++    G  C Y   Y + S S+G +A + V+ 
Sbjct: 114 HQDPRFQPDLSSTYRPVKCNPS-CNCDDE----GKQCTYERRYAEMSSSSGVIAEDVVSF 168

Query: 190 GSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVG------------------------- 223
           G+ +   +      FGC     G L++ +  GI+G                         
Sbjct: 169 GNES--ELKPQRAVFGCENVETGDLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLC 226

Query: 224 -----LGGGDISL-----------------------ISQMRTTIAGNQ-RLGVSTPD--- 251
                +GGG + L                       I      +AG   +L     D   
Sbjct: 227 YGGMDVGGGAMVLGQISPPPNMVFSHSNPYRSPYYNIELKELHVAGKPLKLKPKVFDEKH 286

Query: 252 -IVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV 303
             V+DSGTT  + P+  +++   ++M  +   + +  P  +  ++C+S      + LS+V
Sbjct: 287 GTVLDSGTTYAYFPEAAFHALKDAIMKEIRHLKQIPGPDPNYHDICFSGAGREVSHLSKV 346

Query: 304 -PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYD 358
            PEV + F  G  + LS  N+     KVS      +F+   +   + G I+  N LV YD
Sbjct: 347 FPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTLVTYD 406

Query: 359 IEQQTVSFKPTDCTK 373
            E   + F  T+C++
Sbjct: 407 RENDKIGFWKTNCSE 421


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 82/157 (52%), Gaps = 13/157 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y++ +SIGTPP    A+ DTGSDL+W +C+ C           +F    SS+YK LP
Sbjct: 2   EGEYMMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLP 61

Query: 148 CSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTL---GSTTGQAVA 198
           C+S+ C+ +   S +G+       C+Y   YGDGS ++G++ ++ ++    G+       
Sbjct: 62  CNSTHCSGM---SSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSF 118

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             G  FGC     G +N  T G++GLG    SLI Q+
Sbjct: 119 FDGFLFGCARKLKGDWNF-TQGLIGLGQKSHSLIQQL 154



 Score = 48.9 bits (115), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 39/118 (33%), Positives = 55/118 (46%), Gaps = 6/118 (5%)

Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGSLELCY--SFNSLSQVPEVTIH 309
           VIDSGTT T L P  Y +   S+   +I   P    +  L+LC+  S ++    P VT +
Sbjct: 244 VIDSGTTYTLLTPPVYEAMRKSIEEQVI--LPTLGNSAGLDLCFNSSGDTSYGFPSVTFY 301

Query: 310 FRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
           F     + L   N F   S D+VC         + I GN+ Q NF + YD+    +SF
Sbjct: 302 FANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDLSIIGNMQQQNFHILYDLVASQISF 359


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 85/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 4   YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 63

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 64  CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 123

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            +TFGCG+  GG     N    GI+G G  + S++SQ+               TI G   
Sbjct: 124 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 183

Query: 245 LGVST----------------------------------PDIVIDSG----------TTL 260
             +                                    P  + D+G          TTL
Sbjct: 184 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTL 243

Query: 261 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKL 317
           T+LP+  Y   +L+V +   + + +        LC+ +        P++T HF   D+ L
Sbjct: 244 TYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN-DLPL 299

Query: 318 SR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           +    ++F +  +++ C  F+  G+ +     + + G+++ +N LV YD+E Q + +   
Sbjct: 300 NVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEY 359

Query: 370 DCT 372
           +C+
Sbjct: 360 NCS 362


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 143/369 (38%), Gaps = 96/369 (26%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP +     DTGSD++W     C  CP       D  L++PK SST   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G      CQY V YGDGS + G    + + L    G         
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT-------------IAG--- 241
            I FGCG    G   S +    GI+G G  + S+ISQ+  T             I+G   
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252

Query: 242 ------------------NQRL------GVSTPDIVID-----------------SGTTL 260
                             NQ        GV   D  +D                 SGTTL
Sbjct: 253 FAIGEVVEPKLXNTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTL 312

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTIHF 310
            +LP+   S  L +M  ++ AQP       L+L        C+ F  N     P VT  F
Sbjct: 313 AYLPE---SIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTFKF 363

Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQT 363
             + +  +    +  ++ +D+ C  ++         N V + G+++  N LV Y++E QT
Sbjct: 364 EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQT 423

Query: 364 VSFKPTDCT 372
           + +   +C+
Sbjct: 424 IGWTEYNCS 432


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 59/200 (29%), Positives = 94/200 (47%), Gaps = 24/200 (12%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A      +E +HR + +S    +  +P    R AL+  +                  ++ 
Sbjct: 98  ADKDAVRIETMHRRAARSGGDRTPASPSSSPRRALSERM--------------VATVESG 143

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +   +  YL+ + +GTPP     + DTGSDL W QC PC    C+ Q  P+FDP  SS+Y
Sbjct: 144 VAVGSGEYLMDVYVGTPPRRFRMIMDTGSDLNWLQCAPC--LDCFDQVGPVFDPAASSSY 201

Query: 144 KSLPCSSSQCASLN----QKSCSGV---NCQYSVSYGDGSFSNGNLATETVTLGSTT-GQ 195
           +++ C   +C  +      ++C      +C Y   YGD S + G+LA E+ T+  T  G 
Sbjct: 202 RNVTCGDQRCGLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGA 261

Query: 196 AVALPGITFGCGTNNGGLFN 215
           +  +  + FGCG  N GLF+
Sbjct: 262 SRRVDDVVFGCGHWNRGLFH 281



 Score = 55.1 bits (131), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 38/127 (29%), Positives = 63/127 (49%), Gaps = 6/127 (4%)

Query: 253 VIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +IDSGTTL+ F+   Y     + +  M  + P+      L  CY+ + +   +VPE+++ 
Sbjct: 390 IIDSGTTLSYFVEPAYQVIRQAFIDRMGRSYPLIPDFPVLSPCYNVSGVDRPEVPELSLL 449

Query: 310 FR-GADVKLSRSNFFVKVSED-IVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSF 366
           F  GA       N+F+++  D I+C    G   + + I GN  Q NF V YD++   + F
Sbjct: 450 FADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNFHVVYDLKNNRLGF 509

Query: 367 KPTDCTK 373
            P  C +
Sbjct: 510 APRRCAE 516


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 85/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 89  YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 148

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 149 CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 208

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            +TFGCG+  GG     N    GI+G G  + S++SQ+               TI G   
Sbjct: 209 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGI 268

Query: 245 LGVST----------------------------------PDIVIDSG----------TTL 260
             +                                    P  + D+G          TTL
Sbjct: 269 FAIGNVVQPKVKTTPLVPNMPHYNVNLKSIDVGGTALKLPSHMFDTGEKKGTIIDSGTTL 328

Query: 261 TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADVKL 317
           T+LP+  Y   +L+V +   + + +        LC+ +        P++T HF   D+ L
Sbjct: 329 TYLPEIVYKEIMLAVFA---KHKDITFHNVQEFLCFQYVGRVDDDFPKITFHFEN-DLPL 384

Query: 318 SR--SNFFVKVSEDIVCSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
           +    ++F +  +++ C  F+  G+ +     + + G+++ +N LV YD+E Q + +   
Sbjct: 385 NVYPHDYFFENGDNLYCVGFQNGGLQSKDGKGMVLLGDLVLSNKLVVYDLENQVIGWTEY 444

Query: 370 DCT 372
           +C+
Sbjct: 445 NCS 447


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 143/369 (38%), Gaps = 96/369 (26%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP +     DTGSD++W     C  CP       D  L++PK SST   + 
Sbjct: 73  YYARIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLIT 132

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G      CQY V YGDGS + G    + + L    G         
Sbjct: 133 CDQPFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNG 192

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT-------------IAG--- 241
            I FGCG    G   S +    GI+G G  + S+ISQ+  T             I+G   
Sbjct: 193 SIVFGCGAKQSGELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGI 252

Query: 242 ------------------NQRL------GVSTPDIVID-----------------SGTTL 260
                             NQ        GV   D  +D                 SGTTL
Sbjct: 253 FAIGEVVEPKLKTTPVVPNQAHYNVVLNGVKVGDTALDLPLGLFETSYKRGAIIDSGTTL 312

Query: 261 TFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL--------CYSF--NSLSQVPEVTIHF 310
            +LP   +S  L +M  ++ AQP       L+L        C+ F  N     P VT  F
Sbjct: 313 AYLP---DSIYLPLMEKILGAQP------DLKLRTVDDQFTCFVFDKNVDDGFPTVTFKF 363

Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQT 363
             + +  +    +  ++ +D+ C  ++         N V + G+++  N LV Y++E QT
Sbjct: 364 EESLILTIYPHEYLFQIRDDVWCVGWQNSGAQSKDGNEVTLLGDLVLQNKLVYYNLENQT 423

Query: 364 VSFKPTDCT 372
           + +   +C+
Sbjct: 424 IGWTEYNCS 432


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 51/138 (36%), Positives = 73/138 (52%), Gaps = 12/138 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y + + +GTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ 
Sbjct: 194 SGEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNIS 251

Query: 148 CSSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA--- 198
           C   +C  ++     K C   N  C Y   YGDGS + G+ A ET T+  TT    +   
Sbjct: 252 CHDPRCQLVSAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELK 311

Query: 199 -LPGITFGCGTNNGGLFN 215
            +  + FGCG  N GLF+
Sbjct: 312 HVENVMFGCGHWNRGLFH 329



 Score = 51.6 bits (122), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 30/124 (24%), Positives = 60/124 (48%), Gaps = 4/124 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+  I F
Sbjct: 436 IIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVEGLPPLKPCYNVSGIEKMELPDFGILF 495

Query: 311 RGADV-KLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
               V      N+F+ +  ++VC ++     +++ I GN  Q NF + YD+++  + + P
Sbjct: 496 ADEAVWNFPVENYFIWIDPEVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 555

Query: 369 TDCT 372
             C 
Sbjct: 556 MKCA 559


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 51/137 (37%), Positives = 71/137 (51%), Gaps = 12/137 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + IGTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ C
Sbjct: 190 GEYFMDVFIGTPPKHYSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKESSSFENITC 247

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++     K C   N  C Y   YGD S + G+ A ET T+  TT     +   
Sbjct: 248 HDPRCKLVSSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKH 307

Query: 199 LPGITFGCGTNNGGLFN 215
           +  + FGCG  N GLF+
Sbjct: 308 VENVMFGCGHWNRGLFH 324



 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 35/124 (28%), Positives = 61/124 (49%), Gaps = 4/124 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+  I F
Sbjct: 431 IIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVEGFPPLKPCYNVSGIEKMELPDFGILF 490

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFKP 368
             GA       N+F+++  D+VC    G   S + I GN  Q NF + YD+++  + + P
Sbjct: 491 SDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSRLGYAP 550

Query: 369 TDCT 372
             CT
Sbjct: 551 MKCT 554


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 22/195 (11%)

Query: 57  ALTRSLNRLNHFN----QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           A+ RS +RL+        N+  +  +++Q  +   + +Y +   IGTP T     ADTGS
Sbjct: 54  AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
           DLIWT+C  C  ++C  + SP + P  SS+   + C    C  L +  CS V        
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
           NC Y  +YG+      ++ G L TET T G     A A PGI FGC   + G F +  +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL++Q+
Sbjct: 228 LVGLGRGKLSLVTQL 242



 Score = 47.4 bits (111), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 7/115 (6%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIH 309
           ++ DSGTTLT LP   Y      ++S M   +P         +C++  +S +  P + +H
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLH 385

Query: 310 FR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
           F  GAD+ LS  N+  ++     E   C      + ++ I GNIMQ +F V +D+
Sbjct: 386 FDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/384 (25%), Positives = 161/384 (41%), Gaps = 99/384 (25%)

Query: 79  ASQADIIP-NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-----PSQCYMQDS 132
           A+   + P ++  + + + IGTPP  R  + DTGSDLIWTQC          +    Q  
Sbjct: 71  AADVPVAPLSDQGHSLTVGIGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQRE 130

Query: 133 PLFDPKMSSTYKSLPCSSSQC--ASLNQKSCS---------------------------G 163
           PL++P+ SS++  LPCS   C     + K+C+                           G
Sbjct: 131 PLYEPRRSSSFAYLPCSDRLCQEGQFSYKNCARNNRCMYDELYGSAEAGGVLASETFTFG 190

Query: 164 VNCQYSV--SYGDGSFSNGNLATETVTLGSTTG-----QAVALPGITFGCGTNNGGLFNS 216
           VN + S+   +G G+ S G+L   +  +G + G       +++P  ++ C T     F  
Sbjct: 191 VNAKVSLPLGFGCGALSAGDLVGASGLMGLSPGIMSLVSQLSVPRFSY-CLTP----FAE 245

Query: 217 KTTGIVGLGG----------GDISLISQMRTTIA--------------GNQRL------- 245
           + T  +  G           G +   S +R                  G +RL       
Sbjct: 246 RKTSPLLFGAMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSL 305

Query: 246 GVSTPD----IVIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPTGS----LELCY 295
           G+  PD     ++DSG+T+++L +   +   +V  +++EA   PVA+ T       ELC+
Sbjct: 306 GMIKPDGSGGTIVDSGSTMSYLEE---TAFRAVKKAVVEAVRLPVANGTDEDYDDYELCF 362

Query: 296 SFNS-----LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGN 347
           +  +       + P + +HF G A + L R N+F +    ++C       +   V I GN
Sbjct: 363 ALPTGVAMEAVKTPPLVLHFDGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGN 422

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
           + Q N  V +D+  Q  SF PT C
Sbjct: 423 VQQQNMHVLFDVRNQKFSFAPTKC 446


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 57/162 (35%), Positives = 81/162 (50%), Gaps = 17/162 (10%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
           +  Y + + IGTPP   L VADTGSDLIW +C PC    C +      F  + S+TY ++
Sbjct: 83  SGQYFVSLRIGTPPQTLLLVADTGSDLIWVKCSPC--RNCSHRSPGSAFFARHSTTYSAI 140

Query: 147 PCSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
            C S QC  +     +  N       C+Y  +Y D S + G  + E +TL ++TG+   L
Sbjct: 141 HCYSPQCQLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKL 200

Query: 200 PGITFGCGTN------NGGLFNSKTTGIVGLGGGDISLISQM 235
            G++FGCG         G  F     G++GLG   IS  SQ+
Sbjct: 201 NGLSFGCGFRISGPSLTGASFEG-AQGVMGLGRAPISFSSQL 241



 Score = 57.4 bits (137), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 31/124 (25%), Positives = 63/124 (50%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGTTLTF+ +   + +L      ++    A+PT   +LC + + +++  +P ++ + 
Sbjct: 329 IIDSGTTLTFITEPAYTEILKAFKKRVKLPSPAEPTPGFDLCMNVSGVTRPALPRMSFNL 388

Query: 311 RGADV-KLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            G  V      N+F++  + I C   + ++      + GN+MQ  FL+ +D ++  + F 
Sbjct: 389 AGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLEFDRDKSRLGFT 448

Query: 368 PTDC 371
              C
Sbjct: 449 RRGC 452


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 68/195 (34%), Positives = 100/195 (51%), Gaps = 22/195 (11%)

Query: 57  ALTRSLNRLNHFNQ----NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           A+ RS +RL+        N+  +  +++Q  +   + +Y +   IGTP T     ADTGS
Sbjct: 54  AVQRSRSRLSMLAARAVSNAGAAPGESAQTPLKKGSGDYAMSFGIGTPATGLSGEADTGS 113

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-------- 164
           DLIWT+C  C  ++C  + SP + P  SS+   + C    C  L +  CS V        
Sbjct: 114 DLIWTKCGAC--ARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSG 171

Query: 165 NCQYSVSYGDG----SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTG 220
           NC Y  +YG+      ++ G L TET T G     A A PGI FGC   + G F +  +G
Sbjct: 172 NCSYHYAYGNARDTHHYTEGILMTETFTFGD---DAAAFPGIAFGCTLRSEGGFGTG-SG 227

Query: 221 IVGLGGGDISLISQM 235
           +VGLG G +SL++Q+
Sbjct: 228 LVGLGRGKLSLVTQL 242



 Score = 47.0 bits (110), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 35/115 (30%), Positives = 57/115 (49%), Gaps = 7/115 (6%)

Query: 252 IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYS-FNSLSQVPEVTIH 309
           ++ DSGTTLT LP   Y      ++S M   +P         +C++  +S +  P + +H
Sbjct: 326 VIFDSGTTLTMLPDPAYTLVRDELLSQMGFQKPPPAANDDDLICFTGGSSTTTFPSMVLH 385

Query: 310 FR-GADVKLSRSNFFVKV----SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
           F  GAD+ LS  N+  ++     E   C      + ++ I GNIMQ +F V +D+
Sbjct: 386 FDGGADMDLSTENYLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDL 440


>gi|255685712|gb|ACU28345.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 92.8 bits (229), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IGTPP E  AV DTGS+LIWTQC PC    CY Q +P+FDP  SST+K   C++  
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                       +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59  -----------HSCXYKIVYDDKSYTQGTLATETVTIHSTSG 89


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 73/232 (31%), Positives = 109/232 (46%), Gaps = 27/232 (11%)

Query: 18  VVSPIEAQTGGFSVELIHRDSPKSPFYNSSETP-------YQRLRDA-LTRSLNRLNHFN 69
           V S   A + G +V L HR  P SP   S++ P       + +LR   + R L+  +   
Sbjct: 52  VCSVTPASSSGTTVPLNHRYGPCSP-APSAKVPTILELLEHDQLRAKYIQRKLSGTDGLQ 110

Query: 70  Q-NSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY 128
             + ++ ++  S  D +     Y+I + IG+P   +  + DTGSD+ W +C         
Sbjct: 111 PLDLTVPTTLGSALDTM----EYVITVGIGSPAVTQTMMIDTGSDVSWVRCNS------- 159

Query: 129 MQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
                LFDP  S+TY    CSS+ CA L  N   CS   CQY V YGDGS + G  +++T
Sbjct: 160 TDGLTLFDPSKSTTYAPFSCSSAACAQLGNNGDGCSNSGCQYRVQYGDGSNTTGTYSSDT 219

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           + L ++      +    FGC  +       K  G++GLGG   SL+SQ   T
Sbjct: 220 LALSASD----TVTDFHFGCSHHEEDFDGEKIDGLMGLGGDAQSLVSQTAAT 267



 Score = 48.1 bits (113), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 42/124 (33%), Positives = 57/124 (45%), Gaps = 11/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVM-SSMIE-AQPVADPTGSLELCYSFNSLSQV--PEVTI 308
           V+DSGT +T+LP+   S L S   SSM       A P G L+ CY F  L  V  P V++
Sbjct: 339 VMDSGTVITWLPRRAYSALSSAFRSSMTRLRHQRAAPLGILDTCYDFTGLVNVSIPAVSL 398

Query: 309 HFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
               GA V L  +   ++      C  F   T+   I GN+ Q  F V +D+ Q    F+
Sbjct: 399 VLDGGAVVDLDGNGIMIQ-----DCLAFAA-TSGDSIIGNVQQRTFEVLHDVGQGVFGFR 452

Query: 368 PTDC 371
              C
Sbjct: 453 SGAC 456


>gi|255685714|gb|ACU28346.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IGTPP E  AV DTGS+LIWTQC PC    CY Q +P+FDP  SST+K   C++  
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                       +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59  -----------HSCSYKIVYDDKSYTQGTLATETVTIHSTSG 89


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 89/369 (24%), Positives = 142/369 (38%), Gaps = 85/369 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQC------YMQDSPLFDPKMSS 141
             Y +   +GTP  + + VADTGSDL W  C+  C    C       ++   +F   +SS
Sbjct: 10  GQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSS 69

Query: 142 TYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           ++K++PC +  C        SL         C Y   Y DGS + G  A ETVT+    G
Sbjct: 70  SFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEG 129

Query: 195 QAVALPGITFGCGTNNGGLFNSKTTGIVGLG--------------GGDIS---------- 230
           + + L  +  GC  +  G       G++GLG              GG  S          
Sbjct: 130 RKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHK 189

Query: 231 -------------------------LISQMRTTIAGNQRLGVS-------TPDIVID--- 255
                                    L+  M  +      +G+S        P  V D   
Sbjct: 190 NVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKG 249

Query: 256 -SGTTL------TFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPE 305
             GT L      TFL +  Y   + ++  S+++ + V    G LE C++     +  VP 
Sbjct: 250 AGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPR 309

Query: 306 VTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQT 363
           +  HF  GA+ +    ++ +  ++ + C  F  +      + GNIMQ N L  +D+  + 
Sbjct: 310 LVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKK 369

Query: 364 VSFKPTDCT 372
           + F P+ CT
Sbjct: 370 LGFAPSSCT 378


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 111/451 (24%), Positives = 172/451 (38%), Gaps = 96/451 (21%)

Query: 3   TFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDS--PKSPFYNSSETPYQRLRDALTR 60
           T LSC+     L   +      +     ++L HRD+  PK         P  R+ D +  
Sbjct: 4   TLLSCLITTLLL---ITVADSMKDTSVRLKLAHRDTLLPK---------PLSRIEDVIGA 51

Query: 61  SLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQ 118
              R  L    +NS++       + I    A Y   I +GTP  +   V DTGS+L W  
Sbjct: 52  DQKRHSLISRKRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVN 111

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVS 171
           C      +    +  +F    S ++K++ C +  C        SL         C Y   
Sbjct: 112 CRYRARGK---DNRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYR 168

Query: 172 YGDGSFSNGNLA--TETVTL---------GSTTGQAVALPGITFGCGTNNGGL------F 214
           Y DGS + G  A  T TV L         G   G + +  G +F       GL      F
Sbjct: 169 YADGSAAQGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSF 228

Query: 215 NSKTTGIVG----------LGGGDIS---LISQMRTTIAGNQR----------------- 244
            S  T + G          L   ++S   +    R+T    +R                 
Sbjct: 229 TSTATSLYGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRTTPLDLTRIPPFYAINV 288

Query: 245 ----LGVSTPDI-------------VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVAD 286
               LG    DI             ++DSGT+LT L    Y   +  +   ++E + V  
Sbjct: 289 IGISLGYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKP 348

Query: 287 PTGSLELCYSFNS---LSQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNS 341
               +E C+SF S   +S++P++T H +G A  +  R ++ V  +  + C  F    T +
Sbjct: 349 EGVPIEYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPA 408

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             + GNIMQ N+L  +D+   T+SF P+ CT
Sbjct: 409 TNVIGNIMQQNYLWEFDLMASTLSFAPSACT 439


>gi|357157325|ref|XP_003577760.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 413

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 156/368 (42%), Gaps = 84/368 (22%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 44  GDVYPT-GHYYVTMNIGDPAKPYFLDIDTGSDLTWLQCD-APCQSCNKVPHPLYKPTKN- 100

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K      C Y + Y D + S G L T+  TL      
Sbjct: 101 --KLVPCAASICTTLHSAQSPNKKCAVPQQCDYQIKYTDSASSLGVLVTDNFTLPLRNSS 158

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMRT-------------- 237
           +V  P  TFGCG +      G+  + T G++GLG G +SL+SQ++               
Sbjct: 159 SVR-PSFTFGCGYDQQVGKNGVVQATTDGLLGLGKGSVSLVSQLKVLGITKNVLGHCLST 217

Query: 238 -------------------------TIAGN-------------QRLGVSTPDIVIDSGTT 259
                                    + +GN             + LGV   ++V DSG+T
Sbjct: 218 NGGGFLFFGDNVVPTSRATWVPMVRSTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGST 277

Query: 260 LT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTIH 309
            T F  Q Y + + ++ + + ++ Q V+DP  SL LC+     F S+S V      + + 
Sbjct: 278 YTYFAAQPYQATVSALKAGLSKSLQQVSDP--SLPLCWKGQKVFKSVSDVKNDFKSLFLS 335

Query: 310 F-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVS 365
           F + + +++   N+ +       C  +  G    +   I G+I   + L+ YD E+  + 
Sbjct: 336 FVKNSVLEIPPENYLIVTKNGNACLGILDGSAAKLTFNIIGDITMQDQLIIYDNERGQLG 395

Query: 366 FKPTDCTK 373
           +    C++
Sbjct: 396 WIRGSCSR 403


>gi|125552105|gb|EAY97814.1| hypothetical protein OsI_19735 [Oryza sativa Indica Group]
          Length = 424

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 102/415 (24%), Positives = 168/415 (40%), Gaps = 86/415 (20%)

Query: 29  FSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNN 88
             +EL H D+      N   T  +R+R A  R+ +R    + +++ ++   +        
Sbjct: 23  LRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPLRWSGK 75

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLFDPKMS 140
             Y+    IG PP    AV DTGSDL+WTQC  C            C+ Q+ P ++  +S
Sbjct: 76  TQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYYNFSLS 135

Query: 141 STYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLG 190
            T +++PC          + + A   +   SG + C  + SYG G  + G L T+  T  
Sbjct: 136 RTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTDAFTFP 194

Query: 191 STTGQAVALPGITFGCGTNN----GGLFNSKTTGIVGLGGGDISL--------------- 231
           S++   +A     FGC +      G L  +  +GI+GLG G +SL               
Sbjct: 195 SSSSVTLA-----FGCVSQTRISPGAL--TGASGIIGLGRGALSLNPKDSPFSTFYYLPL 247

Query: 232 --ISQMRTTI---AGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVMSSMIEA 281
             ++    T+   AG   L  + P +     +IDSG+  T L    +  L   ++  +  
Sbjct: 248 VGLAAGNATVALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKELARQLRG 307

Query: 282 Q-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSRSNFFVK 325
                 P A   G+LELC          + + VP + + F      G ++ +    ++ +
Sbjct: 308 SGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPSLVLRFDDGVGGGRELVIPAEKYWAR 367

Query: 326 VSEDIVCSVFKGI--------TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           V     C              TN   I GN MQ +  V YD+    +SF+P +C+
Sbjct: 368 VEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANCS 422


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 96/210 (45%), Gaps = 22/210 (10%)

Query: 40  KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
           KSPF + ++      R     SL R       S + S  AS       +  Y + + IG 
Sbjct: 39  KSPFPSPTQALALDTRRLHFLSLRRKPIPFVKSPVVSGAAS------GSGQYFVDLRIGQ 92

Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           PP   L +ADTGSDL+W +C  C    C +   + +F P+ SST+    C    C  + +
Sbjct: 93  PPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPAHCYDPVCRLVPK 150

Query: 159 KSCSGV--------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
              + +         C Y   Y DGS ++G  A ET +L +++G+   L  + FGCG   
Sbjct: 151 PDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFGCGFRI 210

Query: 211 GGLFNSKTT-----GIVGLGGGDISLISQM 235
            G   S T+     G++GLG G IS  SQ+
Sbjct: 211 SGQSVSGTSFNGANGVMGLGRGPISFASQL 240



 Score = 53.5 bits (127), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 34/128 (26%), Positives = 64/128 (50%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP-TGSLELCYSFNSLSQ----VPEVT 307
           V+DSGTTL FL +    ++++ +   ++  P+AD  T   +LC + + +++    +P + 
Sbjct: 324 VVDSGTTLAFLAEPAYRSVIAAVRRRVKL-PIADALTPGFDLCVNVSGVTKPEKILPRLK 382

Query: 308 IHFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTV 364
             F G  V +    N+F++  E I C   + +   V   + GN+MQ  FL  +D ++  +
Sbjct: 383 FEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRL 442

Query: 365 SFKPTDCT 372
            F    C 
Sbjct: 443 GFSRRGCA 450


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 73/137 (53%), Gaps = 12/137 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I + +GTPP     + DTGSDL W QC PC   +C+ Q+ P +DP  SS+Y+++ C
Sbjct: 179 GEYFIDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YECFEQNGPHYDPGQSSSYRNIGC 236

Query: 149 SSSQCASLNQ----KSCSGVN--CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
             S+C  ++     + C   N  C Y   YGD S + G+ A ET    +T+ S   +   
Sbjct: 237 HDSRCHLVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRR 296

Query: 199 LPGITFGCGTNNGGLFN 215
           +  + FGCG  N GLF+
Sbjct: 297 VENVMFGCGHWNRGLFH 313



 Score = 60.1 bits (144), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 42/151 (27%), Positives = 72/151 (47%), Gaps = 10/151 (6%)

Query: 226 GGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA 285
           GG++  I + +  IA +   G      +IDSGTTL++  +     +     + ++  PV 
Sbjct: 398 GGEVVNIPEEKWQIATDGSGGT-----IIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVV 452

Query: 286 DPTGSLELCYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVS-EDIVCSVFKGIT-N 340
                LE CY+   + Q  +P+  I F  GA       N+F+++   ++VC    G   +
Sbjct: 453 KDFPVLEPCYNVTGVEQPDLPDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPS 512

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           ++ I GN  Q NF + YD ++  + F PT C
Sbjct: 513 ALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 543


>gi|255685716|gb|ACU28347.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685726|gb|ACU28352.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685728|gb|ACU28353.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IGTPP E  AV DTGS+LIWTQC PC    CY Q +P+FDP  SST+K   C++  
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPD 58

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                       +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59  -----------HSCPYKIVYDDKSYTQGTLATETVTIHSTSG 89


>gi|255685722|gb|ACU28350.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 13/102 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IGTPP E  AV DTGS+LIWTQC PC    CY Q +P+FDP  SST+K   C++  
Sbjct: 1   MKLQIGTPPFEXEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFKETRCNTPB 58

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                       +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 59  -----------HSCPYKJVYDDKSYTXGTLATETVTIHSTSG 89


>gi|255685718|gb|ACU28348.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685720|gb|ACU28349.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
 gi|255685724|gb|ACU28351.1| At2g28040-like protein [Arabidopsis lyrata subsp. petraea]
          Length = 91

 Score = 92.0 bits (227), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 49/102 (48%), Positives = 65/102 (63%), Gaps = 13/102 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IGTPP E  AV DTGS+LIWTQC PC    CY Q +P+FDP  SST+K      ++
Sbjct: 1   MKLQIGTPPFEIEAVLDTGSELIWTQCLPC--LHCYDQKAPIFDPSKSSTFK-----ETR 53

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           C + N       +C Y + Y D S++ G LATETVT+ ST+G
Sbjct: 54  CNTPNH------SCPYKIVYDDKSYTLGTLATETVTIHSTSG 89


>gi|168065778|ref|XP_001784824.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663621|gb|EDQ50376.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 102/391 (26%), Positives = 151/391 (38%), Gaps = 77/391 (19%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPF-----------YNSSETPYQRLRDALTRSLNRLNHFN 69
           P E  +    V+LIHRD P SP              ++   YQ+ R  LT    RL  F 
Sbjct: 39  PSENGSQSIRVDLIHRDHPLSPIAPAEGTTLSERLQAAAKRYQKRRTLLTSRGRRLGAFT 98

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
                         I   +  Y+I +  GTP      + DTGSDL+W QC PC    CY 
Sbjct: 99  ------------TPITAESFEYVIPLFFGTPLQPFTGMVDTGSDLVWIQCLPC--INCYT 144

Query: 130 -QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVT 188
               P FDP  SS+   +PC+       +   C+  N            + G+L      
Sbjct: 145 THPHPEFDPTTSSSEAYVPCT-------DPALCTASNFS----------ARGDLDVFAFC 187

Query: 189 LGSTTGQAVALPGITFGC--GTNNGGLF-------NSKTTGIVGLGGGDISLISQMRTTI 239
           L   T        + FG    TN  GL         S +   VG+ G  ++ +     T 
Sbjct: 188 LVPYTAATTLTSALVFGSRDATNALGLVYTPLLQGTSPSFYWVGMVGVSVAGVDAGIPT- 246

Query: 240 AGNQRLGVSTPDIVIDSGTTLT-FLPQGYNSNLLSVMSSMIEAQPVA-DPTGSL------ 291
                L  ST  ++ DSGT LT F P+ Y+    S+  ++    PVA DP  ++      
Sbjct: 247 ----ALFASTDGVLFDSGTPLTYFAPEIYDPLHQSIAGAI--PYPVAPDPVDAVVAKPLN 300

Query: 292 ELCYSFNSLSQ--VPEVTIHFRGADV-------KLSRSNFFVKVSEDIVC-SVFKGITNS 341
            LC+    +    +P +  HF  AD         L   N ++     + C ++ +G + +
Sbjct: 301 RLCFDLAGVQSPVLPTMAYHFTDADAAGATVDFDLGLENIYMNDMNTVWCLAIVRGESGN 360

Query: 342 VPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             I GNI Q N  + +D+    + +   DCT
Sbjct: 361 PSIVGNIQQANHYIEHDVALNRIGWTSKDCT 391


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 87/371 (23%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P   
Sbjct: 44  QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP--- 98

Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           +  + +PC+++ C +L      N K  S   C Y + Y D + S G L  ++ +L   + 
Sbjct: 99  TANRLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158

Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
                PG+TFGCG +      G   +   G++GLG G +SL+SQ++              
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216

Query: 237 -------------------TTIAGNQR-------------------LGVSTPDIVIDSGT 258
                              T +   QR                   LGV   ++V DSG+
Sbjct: 217 TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276

Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTI 308
           T T F  Q Y + + ++   + ++ + V+DPT  L LC+         F+  ++   + +
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFL 334

Query: 309 HF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
            F   + A +++   N+ +      VC  +  G     S  + G+I   + +V YD E+ 
Sbjct: 335 SFSSAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKS 394

Query: 363 TVSFKPTDCTK 373
            + +    CT+
Sbjct: 395 QLGWARGACTR 405


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 73/137 (53%), Gaps = 12/137 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q+ P +DPK SS++K++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--YACFEQNGPYYDPKDSSSFKNITC 250

Query: 149 SSSQCASLNQ----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA---- 198
              +C  ++     + C G   +C Y   YGD S + G+ A ET T+  TT +       
Sbjct: 251 HDPRCQLVSSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKI 310

Query: 199 LPGITFGCGTNNGGLFN 215
           +  + FGCG  N GLF+
Sbjct: 311 VENVMFGCGHWNRGLFH 327



 Score = 59.7 bits (143), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 62/124 (50%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGTTLT+  +     +       I+  P+ +    L+ CY+ + +   ++PE  I F
Sbjct: 434 IIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETFPPLKPCYNVSGVEKMELPEFAILF 493

Query: 311 R-GADVKLSRSNFFVKVS-EDIVCSVFKGITNS-VPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA       N+F+++  ED+VC    G   S + I GN  Q NF + YD+++  + + 
Sbjct: 494 ADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSRLGYA 553

Query: 368 PTDC 371
           P  C
Sbjct: 554 PMKC 557


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 154/371 (41%), Gaps = 87/371 (23%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           Q D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  +
Sbjct: 44  QGDVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRPTAN 101

Query: 141 STYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
              + +PC+++ C +L      N K  S   C Y + Y D + S G L  ++ +L   + 
Sbjct: 102 ---RLVPCANALCTALHSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSS 158

Query: 195 QAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
                PG+TFGCG +      G   +   G++GLG G +SL+SQ++              
Sbjct: 159 N--IRPGLTFGCGYDQQVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLS 216

Query: 237 -------------------TTIAGNQR-------------------LGVSTPDIVIDSGT 258
                              T +   QR                   LGV   ++V DSG+
Sbjct: 217 TNGGGFLFFGDDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276

Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTI 308
           T T F  Q Y + + ++   + ++ + V+DPT  L LC+         F+  ++   + +
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFL 334

Query: 309 HF---RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQ 362
            F   + A +++   N+ +      VC  +  G     S  + G+I   + +V YD E+ 
Sbjct: 335 SFASAKNAAMEIPPENYLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKS 394

Query: 363 TVSFKPTDCTK 373
            + +    CT+
Sbjct: 395 QLGWARGACTR 405


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 50/137 (36%), Positives = 71/137 (51%), Gaps = 12/137 (8%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +GTPP     + DTGSDL W QC PC    C+ Q  P +DPK SS+++++ C
Sbjct: 193 GEYFMDVFVGTPPKHFSLILDTGSDLNWIQCVPC--IACFEQSGPYYDPKDSSSFRNISC 250

Query: 149 SSSQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT----GQAVA 198
              +C  ++       C   N  C Y   YGDGS + G+ A ET T+  TT     +   
Sbjct: 251 HDPRCQLVSSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKH 310

Query: 199 LPGITFGCGTNNGGLFN 215
           +  + FGCG  N GLF+
Sbjct: 311 VENVMFGCGHWNRGLFH 327



 Score = 56.2 bits (134), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 62/124 (50%), Gaps = 4/124 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHF 310
           +IDSGTTLT+  +     +       I+   + +    L+ CY+ + +   ++P+  I F
Sbjct: 434 IIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVEGLPPLKPCYNVSGIEKMELPDFGILF 493

Query: 311 R-GADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
             GA       N+F+++  D+VC ++     +++ I GN  Q NF + YD+++  + + P
Sbjct: 494 ADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNFHILYDMKKSRLGYAP 553

Query: 369 TDCT 372
             C 
Sbjct: 554 MKCA 557


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 143/367 (38%), Gaps = 90/367 (24%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM---QDSPLFDPKMSSTY 143
           +   + + IS+GTPP   L   DTGS L W  C+ C  S C+    +   +FDP  S+TY
Sbjct: 71  HEGKFFMDISLGTPPVANLVTVDTGSTLSWVVCQRCQIS-CHTTAPEAGSVFDPDKSTTY 129

Query: 144 KSLPCSSSQCASLNQKSCSGVN-------CQYSVSYG---DGSFSNGNLATETVTLGSTT 193
           + + CSS  CA + +   +          C YS+ YG    G +S G L T+ +TL S++
Sbjct: 130 ELVGCSSRDCADVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSS 189

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT---------------T 238
                + G  FGC  ++   F    +G++G GG + S  +Q+                 T
Sbjct: 190 S---IIDGFIFGCSGDDS--FKGYESGVIGFGGANFSFFNQVARQTNYRAFSYCFPGDHT 244

Query: 239 IAGNQRLG-----------------------VSTPDIVIDSGTTLTFLPQGYNSNLLSVM 275
             G   +G                       +   D+++D G  L      Y   ++ V 
Sbjct: 245 AEGFLSIGAYPKDELVYTNLIPHFGDRSVYSLQQIDMMVD-GNRLQVDQSEYTKRMMVVD 303

Query: 276 SSMIEA---QPVADP-----------------TGSLELCYSFNSLSQV-----PEVTIHF 310
           S  ++     PV D                  T   E C+  N    V     P V + F
Sbjct: 304 SGTVDTFLLGPVFDAFSKAMASAMQAKGFLSDTVGTETCFRPNGGDSVDSGDLPTVEMRF 363

Query: 311 RGADVKLSRSNFFVKV--SEDIVCSVFK----GITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            G  +KL   N F  +  S D +C  FK    G+ N V I GN    +F V YD++    
Sbjct: 364 IGTTLKLPPENVFHDLLPSHDKICLAFKPDVAGVRN-VQILGNKATXSFRVVYDLQAMYF 422

Query: 365 SFKPTDC 371
            F+   C
Sbjct: 423 GFQAGAC 429


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 92/333 (27%), Positives = 124/333 (37%), Gaps = 78/333 (23%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
           V DT SD+ W QC PCP   CY Q   L+DP  SS+     C+S  C  L   +  C+  
Sbjct: 147 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 206

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
           N CQY V Y DG+ + G   ++ +T+   T    A+    FGC  G      F S   GI
Sbjct: 207 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 262

Query: 222 VGLGGGDISLISQMRTTIA------------------GNQRLG----------------- 246
           + LGGG  SL+SQ   T                    G  R+                  
Sbjct: 263 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPP 322

Query: 247 ----------------VSTPDIVIDSGTTL-------TFLPQGYNSNLLSVMSSMIEAQP 283
                           ++ P  V  +G  L          P  Y +   +    M   QP
Sbjct: 323 TFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQP 382

Query: 284 VADPTGSLELCYSFNSLSQ--VPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGIT 339
            A P G L+ CY    +    +P +T+ F + A V+L  S    +      C  F  G  
Sbjct: 383 -APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAGPN 436

Query: 340 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + VP I GNI      V Y+I    V F+   C
Sbjct: 437 DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 469


>gi|413916291|gb|AFW56223.1| hypothetical protein ZEAMMB73_420944 [Zea mays]
          Length = 383

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 76/277 (27%), Positives = 114/277 (41%), Gaps = 70/277 (25%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       D+GSDL W QC+ PC    C     PL+ P  S
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDSGSDLTWLQCDAPC--RSCNEVPHPLYRPTKS 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C   +  C Y + Y D   S G L  ++  L  T 
Sbjct: 115 ---KLVPCVHRLCASLHNGLTGKHRCDSPHEQCDYVIKYADQGSSTGVLINDSFALRLTN 171

Query: 194 GQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMR-------------- 236
           G +VA P + FGCG +     G  +S T G++GLG G +SL+SQ++              
Sbjct: 172 G-SVARPSVAFGCGYDQQVRSGDLSSPTDGVLGLGTGSVSLLSQLKQRGVTKNVVGHCLS 230

Query: 237 ---------------------TTIA-----------------GNQRLGVSTPDIVIDSGT 258
                                T +A                 G++ LGV    +V DSG+
Sbjct: 231 LRGGGFLFFGDDLVPYQRATWTPMARSAFRNYYSPGSASLYFGDRSLGVRLAKVVFDSGS 290

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCY 295
           + T+        L++ +   +      +P  SL LC+
Sbjct: 291 SFTYFAAKPYQALVTALKDGLSRTLEEEPDTSLPLCW 327


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 92/333 (27%), Positives = 124/333 (37%), Gaps = 78/333 (23%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--CSGV 164
           V DT SD+ W QC PCP   CY Q   L+DP  SS+     C+S  C  L   +  C+  
Sbjct: 172 VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 231

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
           N CQY V Y DG+ + G   ++ +T+   T    A+    FGC  G      F S   GI
Sbjct: 232 NQCQYRVRYPDGTSTAGTYISDLLTITPAT----AVRSFQFGCSHGVQGSFSFGSSAAGI 287

Query: 222 VGLGGGDISLISQMRTTIA------------------GNQRLG----------------- 246
           + LGGG  SL+SQ   T                    G  R+                  
Sbjct: 288 MALGGGPESLVSQTAATYGRVFSHCFPPPTRRGFFTLGVPRVAAWRYVLTPMLKNPAIPP 347

Query: 247 ----------------VSTPDIVIDSGTTL-------TFLPQGYNSNLLSVMSSMIEAQP 283
                           ++ P  V  +G  L          P  Y +   +    M   QP
Sbjct: 348 TFYMVRLEAIAVAGQRIAVPPTVFAAGAALDSRTAITRLPPTAYQALRQAFRDRMAMYQP 407

Query: 284 VADPTGSLELCYSFNSLS--QVPEVTIHF-RGADVKLSRSNFFVKVSEDIVCSVF-KGIT 339
            A P G L+ CY    +    +P +T+ F + A V+L  S    +      C  F  G  
Sbjct: 408 -APPKGPLDTCYDMAGVRSFALPRITLVFDKNAAVELDPSGVLFQ-----GCLAFTAGPN 461

Query: 340 NSVP-IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + VP I GNI      V Y+I    V F+   C
Sbjct: 462 DQVPGIIGNIQLQTLEVLYNIPAALVGFRHAAC 494


>gi|72384474|gb|AAZ67590.1| 80A08_5 [Brassica rapa subsp. pekinensis]
          Length = 632

 Score = 91.3 bits (225), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 111/444 (25%), Positives = 166/444 (37%), Gaps = 81/444 (18%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHR------DSPKSP--FYNSSETPYQRLRDALT 59
            FIL F+   V     A    FS  LIHR       S KSP  F       Y RL  ++ 
Sbjct: 6   AFILLFILSLVSEKSLASL--FSSRLIHRFSDEGRASIKSPGSFPEKRSFEYYRLLTSID 63

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNN---ANYLIRISIGTPPTERLAVADTGSDLIW 116
               ++N   +  S+  S+ S+  I P N     +   I IGTP    L   D+GSDL+W
Sbjct: 64  SRRQKMNLGAKFQSLVPSEGSKT-ISPGNYFGWLHYTWIDIGTPSVSFLVALDSGSDLLW 122

Query: 117 TQCE--PCPP------SQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
             C    C P      S    +D   FDP  S+T K  PCS   C S          C Y
Sbjct: 123 IPCNCVQCAPLSSAYYSSLATKDLNEFDPSASTTSKVFPCSHKLCESAPACESPKEQCPY 182

Query: 169 SVSYG-DGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFNSKTT--GIVGL 224
           +V+Y  + + S+G L  + + L  +   + ++   +  GCG    G F       G++GL
Sbjct: 183 TVTYASENTSSSGLLVEDVLHLAYSANASSSVKARVVVGCGEKQSGEFLKGIAPDGVMGL 242

Query: 225 GGGDISLIS------QMRTTIA-------------------------------------- 240
           G G+IS+ S       MR + +                                      
Sbjct: 243 GPGEISVPSFLAKAGLMRNSFSMCFDEEDSGRIYFGDVGPSTQQSTRFLPYKNEFVAYFV 302

Query: 241 -------GNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
                  GN  L  S+   +IDSG + TFLP+     +   + S I A       G  E 
Sbjct: 303 GVEVCCVGNSCLKQSSFTTLIDSGQSFTFLPEEIYREVALEIDSHINATVKKIEGGPWEY 362

Query: 294 CYSFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNF 353
           CY  +   +VP + + F   +  +     FV    + +      I+ S    G ++  N+
Sbjct: 363 CYETSFEPKVPAIKLKFSSNNTFVIHKPLFVLQRSEGLVQFCLPISASEEGTGGVIGQNY 422

Query: 354 LVGY----DIEQQTVSFKPTDCTK 373
           + GY    D E   + +  + C +
Sbjct: 423 MAGYRIVFDRENMKLGWSASKCQE 446


>gi|224144963|ref|XP_002325476.1| predicted protein [Populus trichocarpa]
 gi|222862351|gb|EEE99857.1| predicted protein [Populus trichocarpa]
          Length = 372

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 92/346 (26%), Positives = 143/346 (41%), Gaps = 72/346 (20%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G P  +     DTGSD++W     C+ CP          L+DP  S +   + 
Sbjct: 27  YFAKIGLGNPSKDYYVQVDTGSDILWVNCIGCDKCPTKSDLGIKLTLYDPASSVSATRVS 86

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C S    L       + CQY+V YGDGS + G   ++ V     TG     ++  
Sbjct: 87  CDDDFCTSTYNGLLPDCKKELPCQYNVVYGDGSSTAGYFVSDAVQFERVTGNLQTGLSNG 146

Query: 201 GITFGCGT-NNGGLFNSKTT--GIVG--------LGGGDISLISQ--------------- 234
            +TFGCG   +GGL  S     GI+G        + GG I  I +               
Sbjct: 147 TVTFGCGAQQSGGLGTSGEALDGILGAFAHCLDNVNGGGIFAIGELVSPKVNTTPMVPNQ 206

Query: 235 ------MRTTIAGNQRLGVSTP--------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
                 M+    G   L + T           +IDSGTTL +LP+        V  SM+ 
Sbjct: 207 AHYNVYMKEIEVGGTVLELPTDVFDSGDRRGTIIDSGTTLAYLPE-------VVYDSMMN 259

Query: 281 AQPVADPTGSLE------LC--YSFNSLSQVPEVTIHFRGA-DVKLSRSNFFVKVSEDIV 331
                 P  SL       +C  YS N     P++  HF+ +  + +   ++  ++SEDI 
Sbjct: 260 EIRSQQPGLSLHTVEEQFICFKYSGNVDDGFPDIKFHFKDSLTLTVYPHDYLFQISEDIW 319

Query: 332 CSVFK--GITNS----VPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           C  ++  G+ +     + + G+++ +N LV YDIE Q + +   +C
Sbjct: 320 CFGWQNGGMQSKDGRDMTLLGDLVLSNKLVLYDIENQAIGWTEYNC 365


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score = 90.9 bits (224), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 58/153 (37%), Positives = 82/153 (53%), Gaps = 11/153 (7%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +++NY+I++  GTPP     V DTGS++ W  C PC  S C  +  P F+P  SSTY  L
Sbjct: 120 SSSNYIIKLGFGTPPQSFYTVLDTGSNIAWIPCNPC--SGCSSKQQP-FEPSKSSTYNYL 176

Query: 147 PCSSSQCASLN--QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C+S QC  L    KS + VNC  +  YGD S  +  L++ET+++GS       +    F
Sbjct: 177 TCASQQCQLLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSVGSQ-----QVENFVF 231

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRT 237
           GC     GL   +T  +VG G   +S +SQ  T
Sbjct: 232 GCSNAARGLIQ-RTPSLVGFGRNPLSFVSQTAT 263



 Score = 42.7 bits (99), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 41/159 (25%), Positives = 71/159 (44%), Gaps = 11/159 (6%)

Query: 222 VGLGGGDISLISQMRTTIAGNQRLGVST-PDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE 280
           VGL G  IS+  ++ +  AG   L  ST    +IDSGT +T L +   + +     S + 
Sbjct: 314 VGLNG--ISVGEELVSIPAGTLSLDESTGRGTIIDSGTVITRLVEPAYNAMRDSFRSQLS 371

Query: 281 AQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVF- 335
              +A PT   + CY+  S   + P +T+HF    D+ L   N     ++D  ++C  F 
Sbjct: 372 NLTMASPTDLFDTCYNRPSGDVEFPLITLHFDDNLDLTLPLDNILYPGNDDGSVLCLAFG 431

Query: 336 ---KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
               G  + +  +GN  Q    + +D+ +  +     +C
Sbjct: 432 LPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGIASENC 470


>gi|21717173|gb|AAM76366.1|AC074196_24 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433291|gb|AAP54829.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 413

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 87/170 (51%), Gaps = 18/170 (10%)

Query: 76  SSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           ++ A  A ++P   +   Y+   +IGTPP    A+ D   +L+WTQC  C   +C+ QD 
Sbjct: 44  ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDL 101

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVT 188
           P+F P  SST+K  PC ++ C S+  +SCSG  C Y    G  +   GN     AT+T  
Sbjct: 102 PVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFA 158

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           +G+ T +      + FGC   +        +G +GLG    SL++QM+ T
Sbjct: 159 IGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT 202



 Score = 37.7 bits (86), Expect = 9.0,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 13/95 (13%)

Query: 291 LELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVFKGIT----- 339
            +LC+      S +  P++   F+GA  + +  + + + V E  D  C+    +      
Sbjct: 316 FDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRT 375

Query: 340 --NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
               V + G++ Q +    YD++++T+SF+P DC+
Sbjct: 376 GLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 410


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 90.9 bits (224), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 63/229 (27%), Positives = 109/229 (47%), Gaps = 16/229 (6%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A+ GG   + IH  +P+     +        +     S +      +N +  + ++S   
Sbjct: 35  ARGGGIGFKAIHVAAPQFRVKANPSPSSAAQKSLFPYSAHIFQQHTKNPA--ALRSSTTT 92

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +      Y   I +G+P  E + + DTGS+L W +C PC    C      ++D   S +Y
Sbjct: 93  LGRKFGEYYTSIKLGSPGQEAILIVDTGSELTWLKCLPC--KVCAPSVDTIYDAARSVSY 150

Query: 144 KSLPCSSSQ-CASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT-GQAV 197
           K + C++SQ C++ +Q +      G  CQ++  YGDGSFS G+L+T+T+ + +   G+ V
Sbjct: 151 KPVTCNNSQLCSNSSQGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVGGKPV 210

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
            +    FGC   +  L  +  +GI+GL  G ++L  Q+       QR G
Sbjct: 211 TVQDFAFGCAQGDLELVPTGASGILGLNAGKMALPMQL------GQRFG 253


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/129 (34%), Positives = 72/129 (55%), Gaps = 10/129 (7%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y + + +G PP   L + DTGSDL W QC+PC    C+ Q  P+FDP  S+++K +PC+
Sbjct: 86  EYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPCN 143

Query: 150 SSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALPG 201
           ++ C  +    C       S   C+Y   YGD S ++G+LA E++++  S    ++ +  
Sbjct: 144 AAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRD 203

Query: 202 ITFGCGTNN 210
           +  GCG +N
Sbjct: 204 MVIGCGHSN 212



 Score = 57.0 bits (136), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP--EVTIHF 310
           +IDSGTTLT+L +     + S   + I + P ADP   L +CY+    + VP   ++I F
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRAAVPFPALSIVF 383

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           + GA++ L + N+F++            + T+ + I GN  Q N    YD++   + F  
Sbjct: 384 QNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGFAN 443

Query: 369 TDCT 372
           TDC+
Sbjct: 444 TDCS 447


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 150/365 (41%), Gaps = 89/365 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            ++FGCG   GG   S      GI+G G  + S++SQ+               T+ G   
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269

Query: 245 LGVS------------TPDI----------------------VIDSG----------TTL 260
             +              PD+                      + DSG          TTL
Sbjct: 270 FAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTL 329

Query: 261 TFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 315
            ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF G DV
Sbjct: 330 AYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG-DV 383

Query: 316 KL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            L  S  ++  +  +++ C  F+  G+       + + G+++ +N LV YD+E Q + + 
Sbjct: 384 SLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIGWA 443

Query: 368 PTDCT 372
             +C+
Sbjct: 444 DYNCS 448


>gi|358345193|ref|XP_003636666.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502601|gb|AES83804.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 161

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 73/123 (59%), Gaps = 2/123 (1%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA-DPTGSLELCYSFNSLSQVPEVTIHF 310
           ++IDSGT ++ LP+ +   LL  +   +  +P+  DP+   +LCY   +  + P +  HF
Sbjct: 1   MLIDSGTPISILPEDFFHRLLEQVRKKVALEPMPFDPSLGYQLCYRTPTNLKGPTLVAHF 60

Query: 311 RGADVKLSRSNFFVKVSEDIVCSVF-KGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            GADV L+ +  F+ V   I C  F    +N    YG+ +Q+N+L+G+D+E+Q VSFK T
Sbjct: 61  EGADVLLTPTQIFIPVQYGIFCFAFTSSFSNEYGTYGSYVQSNYLIGFDLEKQVVSFKAT 120

Query: 370 DCT 372
           DCT
Sbjct: 121 DCT 123


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 87/170 (51%), Gaps = 18/170 (10%)

Query: 76  SSKASQADIIPNNAN---YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
           ++ A  A ++P   +   Y+   +IGTPP    A+ D   +L+WTQC  C   +C+ QD 
Sbjct: 27  ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVAGELVWTQCSAC--RRCFKQDL 84

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN----LATETVT 188
           P+F P  SST+K  PC ++ C S+  +SCSG  C Y    G  +   GN     AT+T  
Sbjct: 85  PVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSYK---GPPTQLRGNTSGFAATDTFA 141

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           +G+ T +      + FGC   +        +G +GLG    SL++QM+ T
Sbjct: 142 IGTATVR------LAFGCVVASDIDTMDGPSGFIGLGRTPWSLVAQMKLT 185



 Score = 37.7 bits (86), Expect = 9.2,   Method: Compositional matrix adjust.
 Identities = 22/95 (23%), Positives = 46/95 (48%), Gaps = 13/95 (13%)

Query: 291 LELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVFKGIT----- 339
            +LC+      S +  P++   F+GA  + +  + + + V E  D  C+    +      
Sbjct: 299 FDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAILSMAWLNRT 358

Query: 340 --NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
               V + G++ Q +    YD++++T+SF+P DC+
Sbjct: 359 GLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 393


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 44/130 (33%), Positives = 72/130 (55%), Gaps = 10/130 (7%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y + + +G PP   L + DTGSDL W QC+PC    C+ Q  P+FDP  S+++K +PC
Sbjct: 169 GEYFMDVFVGNPPRHFLLIIDTGSDLTWLQCKPC--KACFDQSGPVFDPSQSTSFKIIPC 226

Query: 149 SSSQCASLNQKSC-------SGVNCQYSVSYGDGSFSNGNLATETVTLG-STTGQAVALP 200
           +++ C  +    C       S   C+Y   YGD S ++G+LA E++++  S    ++ + 
Sbjct: 227 NAAACDLVVHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIR 286

Query: 201 GITFGCGTNN 210
            +  GCG +N
Sbjct: 287 DMVIGCGHSN 296



 Score = 56.6 bits (135), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 39/124 (31%), Positives = 64/124 (51%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVP--EVTIHF 310
           +IDSGTTLT+L +     + S   + I + P ADP   L +CY+    + VP   ++I F
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAFLARI-SYPRADPFDILGICYNATGRTAVPFPTLSIVF 467

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           + GA++ L + N+F++            + T+ + I GN  Q N    YD++   + F  
Sbjct: 468 QNGAELDLPQENYFIQPDPQEAKHCLAILPTDGMSIIGNFQQQNIHFLYDVQHARLGFAN 527

Query: 369 TDCT 372
           TDC+
Sbjct: 528 TDCS 531


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 150/367 (40%), Gaps = 93/367 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM------RTTIA----------- 240
            ++FGCG   GG   S      GI+G G  + S++SQ+      R   A           
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269

Query: 241 --------------------------------GNQRLGVSTPDIVIDSG----------T 258
                                           G   LG+  P  + DSG          T
Sbjct: 270 FAIGNVVQPKVKTTPLVSDMPHYNVILKGIDVGGTALGL--PTNIFDSGNSKGTIIDSGT 327

Query: 259 TLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA 313
           TL ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF G 
Sbjct: 328 TLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG- 381

Query: 314 DVKL--SRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 365
           DV L  S  ++  +  +++ C  F+  G+       + + G+++ +N LV YD+E Q + 
Sbjct: 382 DVSLIVSPHDYLFQNGKNLYCMGFQNGGVQTKDGKDMVLLGDLVLSNKLVLYDLENQAIG 441

Query: 366 FKPTDCT 372
           +   +C+
Sbjct: 442 WADYNCS 448


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 90/203 (44%), Gaps = 36/203 (17%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 88  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTP TE   + DTGS + WTQC+ C    C    +  FD   SSTY    C  S
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKAC--VNCLQDSNRYFDSSASSTYSFGSCIPS 186

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231

Query: 212 GLFNSKTTGIVGLGGGDISLISQ 234
           G F S   G++GLG G +S +SQ
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQ 254



 Score = 62.0 bits (149), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 45/145 (31%), Positives = 72/145 (49%), Gaps = 13/145 (8%)

Query: 241 GNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS----L 291
           GN+RL +     ++P  +IDS T +T LPQ   S L +     +   P+++        L
Sbjct: 317 GNERLNIPSSVFASPGTIIDSRTVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDIL 376

Query: 292 ELCYSFNSLSQV--PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNI 348
           + CY+ +    V  PE+ +HF G ADV+L+ +N         +C  F G T+ + I GN 
Sbjct: 377 DTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDASRLCLAFAG-TSELTIIGNR 435

Query: 349 MQTNFLVGYDIEQQTVSFKPTDCTK 373
            Q +  V YDI+ + + F    C+K
Sbjct: 436 QQLSLTVLYDIQGRRIGFGGNGCSK 460


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 15/155 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 43  NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC +  C S    +CSG  C Y  +     D   + G + TET  +G+ T        + 
Sbjct: 97  PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           FGC   +       T+G +GLG    SL++QM+ T
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT 185



 Score = 44.7 bits (104), Expect = 0.082,   Method: Compositional matrix adjust.
 Identities = 26/104 (25%), Positives = 51/104 (49%), Gaps = 13/104 (12%)

Query: 282 QPVADPTGSLELCYSFN---SLSQVPEVTIHFRGAD-VKLSRSNFFVKVSE--DIVCSVF 335
           QP+A P    +LC+      S +  P++   F+GA  + +  + + + V E  D  C+  
Sbjct: 290 QPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALTVPPAKYLIDVGEEKDTACAAI 349

Query: 336 KGIT-------NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             +          V + G++ Q +    YD++++T+SF+P DC+
Sbjct: 350 LSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFEPADCS 393


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 144/367 (39%), Gaps = 92/367 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G PP +     DTGSD++W  C   + CP          L+DP+ S++   + 
Sbjct: 82  YFAKIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIY 141

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    CA+    + Q     + CQYSV YGDGS + G    + +     TG    + A  
Sbjct: 142 CDDDFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANG 201

Query: 201 GITFGCGTNNGGLFNSKTT---GIVGLG----------------------------GGDI 229
            + FGCG    G   + +    GI+G G                            GG I
Sbjct: 202 SVIFGCGAKQSGELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGI 261

Query: 230 SLISQ---------------------MRTTIAGNQRLGVSTPDI---------VIDSGTT 259
             I +                     M+    G   L + T DI         +IDSGTT
Sbjct: 262 FAIGEVVSPKVNTTPMVPNQPHYNVVMKEIEVGGNVLELPT-DIFDTGDRRGTIIDSGTT 320

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVTIHFRGA-D 314
           L +LP+       S+M+ ++  QP        E      Y+ N     P V  HF G+  
Sbjct: 321 LAYLPEVVYE---SMMTKIVSEQPGLKLHTVEEQFTCFQYTGNVNEGFPVVKFHFNGSLS 377

Query: 315 VKLSRSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVS 365
           + ++  ++  ++ E++ C    G  NS         + + G+++ +N LV YD+E Q + 
Sbjct: 378 LTVNPHDYLFQIHEEVWCF---GWQNSGMQSKDGRDMTLLGDLVLSNKLVLYDLENQAIG 434

Query: 366 FKPTDCT 372
           +   +C+
Sbjct: 435 WTDYNCS 441


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 88/369 (23%), Positives = 143/369 (38%), Gaps = 89/369 (24%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKS 145
           +N L    IG  P +     DTGSD +W  C     CP       D  L+DP +S T K+
Sbjct: 72  SNGLYYTKIGLGPKDYYVQVDTGSDTLWVNCVGCTACPKKSGLGMDLTLYDPNLSKTSKA 131

Query: 146 LPCSSSQCASLNQKSCS----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
           +PC    C S      S    G++C YS++YGDGS ++G+   + +T     G    +P 
Sbjct: 132 VPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPD 191

Query: 201 --GITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQM------------------- 235
              + FGCG+   G  +S T     GI+G G  + S++SQ+                   
Sbjct: 192 NTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQLAAAGKVKRIFSHCLDSISG 251

Query: 236 --------------RTT--IAGNQRLGVSTPDI----------------------VIDSG 257
                         +TT  + G     V   DI                      +IDSG
Sbjct: 252 GGIFAIGEVVQPKVKTTPLLQGMAHYNVVLKDIEVAGDPIQLPSDILDSSSGRGTIIDSG 311

Query: 258 TTLTFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSLSQVPEV--TIHF-- 310
           TTL +LP      LL  +    S ++   V D       C+ ++    V ++  T+ F  
Sbjct: 312 TTLAYLPVSIYDQLLEKILAQRSGMKLYLVEDQF----TCFHYSDEESVDDLFPTVKFTF 367

Query: 311 -RGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQT 363
             G  +     ++     ED+ C  ++           + + G+++  N LV YD++   
Sbjct: 368 EEGLTLTTYPRDYLFLFKEDMWCVGWQKSMAQTKDGKELILLGDLVLANKLVVYDLDNMA 427

Query: 364 VSFKPTDCT 372
           + +   +C+
Sbjct: 428 IGWADYNCS 436


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 135/347 (38%), Gaps = 76/347 (21%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD------SPLFDPKMSSTY 143
            Y  ++ +GTP T  L V DTGSD++W      PP    ++       +P   P+ +   
Sbjct: 121 EYFAQVGVGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWN--- 177

Query: 144 KSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
               C +  C  L+   C      C Y V+YGDGS + G+ A+ET+T      +   +  
Sbjct: 178 ----CVAPICRRLDSAGCDRRRNSCLYQVAYGDGSVTAGDFASETLTFA----RGARVQR 229

Query: 202 ITFGCGTNNGGL----------------FNSKTTGIVG---------------------L 224
           +  GCG +N GL                F S+     G                      
Sbjct: 230 VAIGCGHDNEGLFIAASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRW 289

Query: 225 GG----GDISLISQMRTTIAGNQRLGVSTPD-----------IVIDSGTTLTFLPQGYNS 269
           GG         +  +  ++ G +  GVS  D           +++DSGT++T L +    
Sbjct: 290 GGTPRMATFYYVHLLGFSVGGARVKGVSQSDLRLNPTTGRGGVILDSGTSVTRLARPVYE 349

Query: 270 NLLSVMSSMIEAQPVADPTGSL-ELCYSFNS--LSQVPEVTIHFRG-ADVKLSRSNFFVK 325
            +     +      V+    SL + CY+ +   + +VP V++H  G A V L   N+ + 
Sbjct: 350 AVRDAFRAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGASVALPPENYLIP 409

Query: 326 V-SEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           V +    C    G    V I GNI Q  F V +D + Q V F P  C
Sbjct: 410 VDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|21717169|gb|AAM76362.1|AC074196_20 putative nucleoid DNA binding protein, 3'-partial [Oryza sativa
           Japonica Group]
          Length = 377

 Score = 90.1 bits (222), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/152 (38%), Positives = 77/152 (50%), Gaps = 13/152 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+   +IGTPP    AV D   +L+WTQC PC P  C+ QD PLFDP  SST++ LPC S
Sbjct: 57  YVANFTIGTPPQPVSAVVDLTGELVWTQCTPCQP--CFEQDLPLFDPTKSSTFRGLPCGS 114

Query: 151 SQCASLNQKS--CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
             C S+ + S  C+   C Y      G  + G   T+T  +G+      A   + FGC  
Sbjct: 115 HLCESIPESSRNCTSDVCIYEAPTKAGD-TGGKAGTDTFAIGA------AKETLGFGCVV 167

Query: 209 NNGGLFNS--KTTGIVGLGGGDISLISQMRTT 238
                  +    +GIVGLG    SL++QM  T
Sbjct: 168 MTDKRLKTIGGPSGIVGLGRTPWSLVTQMNVT 199


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/160 (35%), Positives = 82/160 (51%), Gaps = 12/160 (7%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y + I +G+PP   L VADTGSDL W +C  C  +         F  + S+T+    
Sbjct: 80  SGQYFVSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTH 139

Query: 148 CSSSQCASLNQKSCSGVN-------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           C SS C  + Q + +  N       C+Y   Y DGS ++G  + ET TL +++G+ + L 
Sbjct: 140 CFSSLCQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLK 199

Query: 201 GITFGCGTNNGG--LFNSK---TTGIVGLGGGDISLISQM 235
            I FGCG +  G  L  S     +G++GLG G IS  SQ+
Sbjct: 200 SIAFGCGFHASGPSLIGSSFNGASGVMGLGRGPISFASQL 239



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/130 (27%), Positives = 61/130 (46%), Gaps = 12/130 (9%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS-----LELCYSFNSLS--QVPE 305
           VIDSGTTLTFL +     +LS     ++  P   P G+      +LC +   +S  + P 
Sbjct: 327 VIDSGTTLTFLTEPAYREILSAFKREVKL-PSPTPGGASTRSGFDLCVNVTGVSRPRFPR 385

Query: 306 VTIHFRGADV-KLSRSNFFVKVSEDIVCSVFKGI---TNSVPIYGNIMQTNFLVGYDIEQ 361
           +++   G  +      N+F+ +SE I C   + +   +    + GN+MQ  FL+ +D  +
Sbjct: 386 LSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNLMQQGFLLEFDRGK 445

Query: 362 QTVSFKPTDC 371
             + F    C
Sbjct: 446 SRLGFSRRGC 455


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 54/162 (33%), Positives = 80/162 (49%), Gaps = 16/162 (9%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-YMQDSPLFDPKMSSTYKSL 146
           +  Y + + IG PP   L +ADTGSDL+W +C  C    C +   + +F P+ SST+   
Sbjct: 80  SGQYFVDLRIGQPPQSLLLIADTGSDLVWVKCSAC--RNCSHHSPATVFFPRHSSTFSPA 137

Query: 147 PCSSSQCASLNQ----KSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            C    C  + +      C+       C Y   Y DGS ++G  A ET +L +++G+   
Sbjct: 138 HCYDPVCRLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAK 197

Query: 199 LPGITFGCGTNNGGLFNSKTT-----GIVGLGGGDISLISQM 235
           L  + FGCG    G   S T+     G++GLG G IS  SQ+
Sbjct: 198 LKSVAFGCGFRISGQSVSGTSFNGANGVMGLGRGPISFASQL 239



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/127 (25%), Positives = 59/127 (46%), Gaps = 7/127 (5%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ----VPEVTI 308
           V+DSGTTL FL       +++ +   I+     + T   +LC + + +++    +P +  
Sbjct: 323 VMDSGTTLAFLADPAYRLVIAAVKQRIKLPNADELTPGFDLCVNVSGVTKPEKILPRLKF 382

Query: 309 HFRGADVKL-SRSNFFVKVSEDIVCSVFKGITNSV--PIYGNIMQTNFLVGYDIEQQTVS 365
            F G  V +    N+F++  E I C   + +   V   + GN+MQ  FL  +D ++  + 
Sbjct: 383 EFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFLFEFDRDRSRLG 442

Query: 366 FKPTDCT 372
           F    C 
Sbjct: 443 FSRRGCA 449


>gi|21717154|gb|AAM76347.1|AC074196_5 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433293|gb|AAP54831.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125532791|gb|EAY79356.1| hypothetical protein OsI_34485 [Oryza sativa Indica Group]
          Length = 397

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 56/155 (36%), Positives = 79/155 (50%), Gaps = 15/155 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N AN+    +IGTPP    A+ D   +L+WTQC  C  S+C+ QD PLF P  SST++  
Sbjct: 43  NVANF----TIGTPPQPASAIIDVAGELVWTQCSRC--SRCFKQDLPLFIPNASSTFRPE 96

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYG---DGSFSNGNLATETVTLGSTTGQAVALPGIT 203
           PC +  C S    +CSG  C Y  +     D   + G + TET  +G+ T        + 
Sbjct: 97  PCGTDACKSTPTSNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAIGTATAS------LA 150

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           FGC   +       T+G +GLG    SL++QM+ T
Sbjct: 151 FGCVVASDIDTMDGTSGFIGLGRTPRSLVAQMKLT 185



 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 23/96 (23%), Positives = 45/96 (46%), Gaps = 14/96 (14%)

Query: 291 LELCYSFN---SLSQVPEVTIHFRGADVKLSR--SNFFVKVSE--DIVCSVFKGITN--- 340
            +LC+      S +  P++   F+G    L+   + + + V E  D  C+    +     
Sbjct: 299 FDLCFKKAAGFSRATAPDLVFTFQGGGAALTVPPAKYLIDVGEEKDTACAAILSMARLNR 358

Query: 341 ----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                V + G++ Q N    YD++++T+SF+P DC+
Sbjct: 359 TGLEGVSVLGSLQQENVHFLYDLKKETLSFEPADCS 394


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/170 (33%), Positives = 85/170 (50%), Gaps = 20/170 (11%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           SSSK +   +  +N      ++IGTPP     V DTGS+L W +C+  P        + +
Sbjct: 51  SSSKTTGKLLFHHNVTLTASLTIGTPPQNITMVLDTGSELSWLRCKKEP------NFTSI 104

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           F+P  S TY  +PCSS  C +         +C     C + +SY D S   G+LA ET  
Sbjct: 105 FNPLASKTYTKIPCSSQTCKTRTSDLTLPVTCDPAKLCHFIISYADASSVEGHLAFETFR 164

Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            GS T      P   FGC   G+++    ++KTTG++G+  G +S ++QM
Sbjct: 165 FGSLTR-----PATVFGCMDSGSSSNTEEDAKTTGLMGMNRGSLSFVNQM 209


>gi|326520109|dbj|BAK03979.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 70/229 (30%), Positives = 112/229 (48%), Gaps = 28/229 (12%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHF-NQNSSISSSKA 79
           P    T    + ++HR+ P +P   +S+ P +R   AL     R+    N+ SS  + +A
Sbjct: 52  PNSPSTSTIRLTILHREHPCAP---ASKRPVRRSPSALQEYHTRVRRLANRLSSCPADEA 108

Query: 80  SQADIIPNNA------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           + + +I  N       +Y+ ++ +GTP      + DT S L W  CEPC  + C +   P
Sbjct: 109 TASGLIFANGVPWDYYSYVTQVQLGTPAKTHNVLVDTASSLSWVGCEPC-INACLI---P 164

Query: 134 LFDPKMSSTYKSLPCSSSQC-----ASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATET 186
            F+P  SSTYK + C S+ C     A++ +KSC      C Y  SY D S S G ++++T
Sbjct: 165 TFNPNASSTYKVVGCGSALCNAVPSATMARKSCMAPTEGCSYRQSYHDYSLSVGVVSSDT 224

Query: 187 VTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           +T G  + + +      FGC     G+   + +GI+G+     SL SQM
Sbjct: 225 LTYGLGSQKFI------FGCCNLFRGV-GGRYSGILGMSVNKFSLFSQM 266


>gi|413938616|gb|AFW73167.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 452

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 122/349 (34%), Gaps = 102/349 (29%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFDPKMSSTYKSLPC 148
           NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFDP  SS+Y ++PC
Sbjct: 139 NYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPC 198

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
               CA L                         +   +    +  G   A+ G  FGCG 
Sbjct: 199 GGPVCAGL------------------------GIYAASACSAAQCG---AVQGFFFGCGH 231

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGNQRLGVSTPDIVI 254
              GLFN    G++GLG    SL+ Q               + + AG   LGV  P    
Sbjct: 232 AQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGYLTLGVGGPSGAA 290

Query: 255 DSGTTLTFLPQGYNSNLLSVMSSMIE---------------------------------- 280
              +T   LP         VM + I                                   
Sbjct: 291 PGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTGTVVTRLPPTAYA 350

Query: 281 -------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFV 324
                          P A   G L+ CY+F     V  P V + F  GA V L       
Sbjct: 351 ALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSGATVTLGADGIL- 409

Query: 325 KVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                  C  F   G    + I GN+ Q +F V   I+  +V FKP+ C
Sbjct: 410 ----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSSC 452


>gi|194699094|gb|ACF83631.1| unknown [Zea mays]
 gi|413938606|gb|AFW73157.1| hypothetical protein ZEAMMB73_333672 [Zea mays]
          Length = 452

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 108/421 (25%), Positives = 144/421 (34%), Gaps = 116/421 (27%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSS------ISSSKASQADI 84
           + L HR  P +P   SS      + D L     R  +  +  S        S  A+ A  
Sbjct: 68  LRLTHRHGPCAPSRASSLA-APSVADTLRADQRRAEYILRRVSGRAPQLWDSKAAAAAAT 126

Query: 85  IP-------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS-QCYMQDSPLFD 136
           +P          NY++  S+GTP   +    DTGSDL W QC+PC  +  CY Q  PLFD
Sbjct: 127 VPASWGYDIGTLNYVVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFD 186

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           P  SS+Y ++PC    CA L                         +   +    +  G  
Sbjct: 187 PAQSSSYAAVPCGGPVCAGL------------------------GIYAASACSAAQCG-- 220

Query: 197 VALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ--------------MRTTIAGN 242
            A+ G  FGCG    GLFN    G++GLG    SL+ Q               + + AG 
Sbjct: 221 -AVQGFFFGCGHAQSGLFNG-VDGLLGLGREQPSLVEQTAGTYGGVFSYCLPTKPSTAGY 278

Query: 243 QRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIE---------------------- 280
             LGV  P       +T   LP         VM + I                       
Sbjct: 279 LTLGVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTVVDTG 338

Query: 281 -------------------------AQPVADPTGSLELCYSFNSLSQV--PEVTIHF-RG 312
                                      P A   G L+ CY+F     V  P V + F  G
Sbjct: 339 TVVTRLPPTAYAALRSAFRSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPNVALTFGSG 398

Query: 313 ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
           A V L              C  F   G    + I GN+ Q +F V   I+  +V FKP+ 
Sbjct: 399 ATVTLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRSFEV--RIDGTSVGFKPSS 451

Query: 371 C 371
           C
Sbjct: 452 C 452


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 93/368 (25%), Positives = 161/368 (43%), Gaps = 84/368 (22%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 49  GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K  +   C Y + Y D + S G L T++ +L     +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSL-PLRNK 162

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR--------------- 236
           +   P ++FGCG +      G   + T G++GLG G +SL+SQ++               
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222

Query: 237 ------------------------TTIAGN-----------QRLGVSTP--DIVIDSGTT 259
                                    + +GN            R  +ST   ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282

Query: 260 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 310
            T+   Q Y + + ++  S+ ++ + V+DP  SL LC+    +F S+S V +   ++ F 
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340

Query: 311 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
             + A +++   N+ +      VC  +  G     S  I G+I   + +V YD E+  + 
Sbjct: 341 FGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400

Query: 366 FKPTDCTK 373
           +    C++
Sbjct: 401 WIRGSCSR 408


>gi|115465777|ref|NP_001056488.1| Os05g0591300 [Oryza sativa Japonica Group]
 gi|113580039|dbj|BAF18402.1| Os05g0591300 [Oryza sativa Japonica Group]
          Length = 453

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 62/174 (35%), Positives = 92/174 (52%), Gaps = 20/174 (11%)

Query: 72  SSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQD 131
           +SIS +K S+     N+  +LI + +GTP  + L   DTGS L W QC PC   +C++Q 
Sbjct: 38  TSISVTKDSKL----NDFAFLIPVKLGTPAVQYLVTMDTGSSLSWVQCRPC-TIKCHVQP 92

Query: 132 S---PLFDPKMSSTYKSLPCSSSQCASLNQ------KSCSGVN--CQYSVSYGDG-SFSN 179
           +   P+FDP  SST++ + CS+S C+ L +      K+C      C Y++SYG G ++S 
Sbjct: 93  AKVGPIFDPSNSSTFRHVGCSTSICSYLGRTLRIQSKACMEWEDICLYTMSYGGGWAYSV 152

Query: 180 GNLATETVTL--GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISL 231
           G   T+ + L  G TT   ++L    FGC  +       K  GI GLG  + S 
Sbjct: 153 GKAVTDRLVLGGGETTRTTLSLANFVFGCSMDT-QYSTHKEAGIFGLGTSNYSF 205


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 20/170 (11%)

Query: 75  SSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL 134
           ++SK +   +  +N    + ++ GTP      V DTGS+L W  C+  P        + +
Sbjct: 51  TTSKTTDKLLFHHNVTLTVSLTAGTPLQNITMVLDTGSELSWLHCKKEP------NFNSI 104

Query: 135 FDPKMSSTYKSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVT 188
           F+P  S TY  +PCSS  C +  +      SC     C + +SY D S   GNLA ET  
Sbjct: 105 FNPLASKTYTKIPCSSPTCETRTRDLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFR 164

Query: 189 LGSTTGQAVALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           +GS TG     P   FGC   G ++    ++KTTG++G+  G +S ++QM
Sbjct: 165 VGSVTG-----PATVFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVNQM 209


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 75/153 (49%), Gaps = 19/153 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   Y+    IGTPP +     D  SDL+WT C    P          F+P  S+T   +
Sbjct: 96  NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145

Query: 147 PCSSSQCASLNQKSC--SGVNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVALPGIT 203
           PC+   C     ++C      C Y+  YG G+  + G L TE  T G T      + G+ 
Sbjct: 146 PCTDDACQQFAPQTCGAGASECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----IDGVV 200

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           FGCG  N G F S  +G++GLG G++SL+SQ++
Sbjct: 201 FGCGLKNVGDF-SGVSGVIGLGRGNLSLVSQLQ 232


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 48/135 (35%), Positives = 71/135 (52%), Gaps = 12/135 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + + +G+PP     + DTGSDL W QC PC    C+ Q+   +DPK S++YK++ C+ 
Sbjct: 170 YFMDVLVGSPPKHFSLILDTGSDLNWIQCLPC--YDCFQQNGAFYDPKASASYKNITCND 227

Query: 151 SQCASLNQKS----CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTG----QAVALP 200
            +C  ++       C   N  C Y   YGD S + G+ A ET T+  TT     +   + 
Sbjct: 228 QRCNLVSSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVE 287

Query: 201 GITFGCGTNNGGLFN 215
            + FGCG  N GLF+
Sbjct: 288 NMMFGCGHWNRGLFH 302



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 37/124 (29%), Positives = 63/124 (50%), Gaps = 5/124 (4%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQ-PVADPTGSLELCYSFNSLS--QVPEVTIH 309
           +IDSGTTL++  +     + + ++   + + PV      L+ C++ + +   Q+PE+ I 
Sbjct: 409 IIDSGTTLSYFAEPAYEFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGIHNVQLPELGIA 468

Query: 310 F-RGADVKLSRSNFFVKVSEDIVCSVFKGITNSV-PIYGNIMQTNFLVGYDIEQQTVSFK 367
           F  GA       N F+ ++ED+VC    G   S   I GN  Q NF + YD ++  + + 
Sbjct: 469 FADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYA 528

Query: 368 PTDC 371
           PT C
Sbjct: 529 PTKC 532


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 89.0 bits (219), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 109/210 (51%), Gaps = 26/210 (12%)

Query: 52  QRLRDALTR-----SLNRLNHFN-QNSSISSSKASQADIIPNNANYLIRISIGTPPTERL 105
           +++R++L+R       N+ NH + + +  +S   S    + + A + +++ IG+      
Sbjct: 55  EQVRESLSRIQSQVQDNQNNHLDLRGNRPTSGVRSVVTPLEDYALFSMQLGIGSLQKNLS 114

Query: 106 AVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-- 163
           A+ DTGS+ +  QC          +  P+FDP  S +Y+ +PC S  C ++ Q++ +G  
Sbjct: 115 AIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQLCLAVQQQTSNGSS 166

Query: 164 -------VNCQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPGITFGCGTN-NGGL 213
                    C YS+SYGD   S G+ + + + L ST  +GQAV    + FGC  +  G L
Sbjct: 167 QPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHSPQGFL 226

Query: 214 FNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
            +  + GIVG   G++SL SQ++  + G++
Sbjct: 227 VDLGSLGIVGFNRGNLSLPSQLKDRLGGSK 256



 Score = 43.5 bits (101), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 46/146 (31%), Positives = 68/146 (46%), Gaps = 18/146 (12%)

Query: 244 RLGVSTPD--IVIDSGTTLT-FLPQGYNS--NLLSVMSSMIEAQPVADPTGSLELCYSFN 298
           +L  ST D   V+DSGTT T  +   Y +  N  +  +     + V    G  + CY+ +
Sbjct: 326 KLDPSTGDGGTVLDSGTTFTRVVDDAYTAFRNAFAASNRSGLRKKVGAAAG-FDDCYNIS 384

Query: 299 ---SLSQVPEVTIHFRG-ADVKLSRSNFFVKVS----EDIVC----SVFKGITNSVPIYG 346
              SL  VPEV +  +    ++L   + FV VS    E  VC    S  K     + + G
Sbjct: 385 AGSSLPGVPEVRLSLQNNVRLELRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLG 444

Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDCT 372
           N  Q+N+LV YD E+  V F+  DC+
Sbjct: 445 NYQQSNYLVEYDNERSRVGFERADCS 470


>gi|356558489|ref|XP_003547539.1| PREDICTED: uncharacterized protein LOC100817234 [Glycine max]
          Length = 739

 Score = 88.6 bits (218), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 51/122 (41%), Positives = 73/122 (59%), Gaps = 3/122 (2%)

Query: 251 DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSF--NSLSQVPEVTI 308
           +I+IDSGTTLT L + + + L + + + I  + V      L LCY    N+  +VP +T 
Sbjct: 173 NIIIDSGTTLTILLENFYTKLEAEVEAHINLERVNSTDQILSLCYKSPPNNAIEVPIITT 232

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           HF G D+ L+  N FV V +D +   F  +  S  I+GN+ Q N LVGYD+ ++TVSFKP
Sbjct: 233 HFAGVDIVLNSLNTFVSVFDDAMWFAFAPVA-SGSIFGNLAQMNHLVGYDLLRKTVSFKP 291

Query: 369 TD 370
           TD
Sbjct: 292 TD 293


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 60/164 (36%), Positives = 87/164 (53%), Gaps = 19/164 (11%)

Query: 83  DIIP--NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           D +P  +N +  + +++GTPP     V DTGS+L W  C     SQ     S  F+P  S
Sbjct: 63  DKLPFRHNISLTVSLTVGTPPQNVTMVIDTGSELSWLHCN---TSQNSSSSSSTFNPVWS 119

Query: 141 STYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
           S+Y  +PCSSS C    +      SC S   C  ++SY D S S GNLAT+T  +GS+  
Sbjct: 120 SSYSPIPCSSSTCTDQTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYIGSS-- 177

Query: 195 QAVALPGITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
               +P + FGC     ++    +SK TG++G+  G +S +SQM
Sbjct: 178 ---GIPNVVFGCMDSIFSSNSEEDSKNTGLMGMNRGSLSFVSQM 218


>gi|225455900|ref|XP_002275943.1| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 686

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 62/175 (35%), Positives = 84/175 (48%), Gaps = 16/175 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + D+ PN   Y   I +G+PP       DTGSDL W QC+ PC  + C    +PL+ PK 
Sbjct: 305 RGDVYPNGL-YFTHIFVGSPPRRYFLDMDTGSDLTWIQCDAPC--TSCAKGPNPLYKPKK 361

Query: 140 SSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
            +    +P   S C  + +   +G       C Y + Y D S S G LA++ + L    G
Sbjct: 362 GNL---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANG 418

Query: 195 QAVALPGITFGCGTNNGG-LFNS--KTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
               L GI FGC  +  G L NS  KT GI+GL    +SL SQ+ +    N  LG
Sbjct: 419 SLTKL-GIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 472


>gi|413936471|gb|AFW71022.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 315

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 57/177 (32%), Positives = 83/177 (46%), Gaps = 27/177 (15%)

Query: 4   FLSCVFILFFLC---------FYVV-----SPIEAQTGGFS----------VELIHRDSP 39
            LSC+F+ F+L          F  V      P    +G F           V L+HR  P
Sbjct: 5   LLSCIFLCFYLSTVHGAGEDSFVTVPSSSFEPESVCSGEFVKPEQNGSTVYVPLVHRHGP 64

Query: 40  KSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGT 99
            +P   S  T  +   D   RS  R ++  +   +S        ++  +  Y++R+S GT
Sbjct: 65  CAP-APSLSTDTRSFADIFRRSRARPSYIVRGKKVSVPAHLGTSVM--SLEYVVRVSFGT 121

Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           P   ++ V DTGSD+ W QC+PC   QC+ Q  PL+DP  SSTY ++PC+S  C  L
Sbjct: 122 PAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASDVCKKL 178


>gi|218186446|gb|EEC68873.1| hypothetical protein OsI_37494 [Oryza sativa Indica Group]
          Length = 353

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 17/155 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
           Y + IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + 
Sbjct: 6   YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 64

Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CS+  C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++ 
Sbjct: 65  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 120

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
              FGCG +N  L+N    GI+G G    S  +Q+
Sbjct: 121 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 153


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 59/161 (36%), Positives = 85/161 (52%), Gaps = 21/161 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  I I++GTPP     V DTGS+L W  C     +       P F+P +SS+Y  +
Sbjct: 62  HNVSLTISITVGTPPQNMSMVIDTGSELSWLHCN---TNTTATIPYPFFNPNISSSYTPI 118

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            CSS  C +  +      SC   N C  ++SY D S S GNLA++T   GS+       P
Sbjct: 119 SCSSPTCTTRTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTFGFGSSFN-----P 173

Query: 201 GITFGC-----GTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
           GI FGC      TN+    +S TTG++G+  G +SL+SQ++
Sbjct: 174 GIVFGCMNSSYSTNSES--DSNTTGLMGMNLGSLSLVSQLK 212


>gi|449529533|ref|XP_004171754.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 93/363 (25%), Positives = 149/363 (41%), Gaps = 90/363 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I+IG          D+GSDL W QC+  P + C      L+ P  ++    L C  
Sbjct: 55  YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C SL+      C   +  CQY + Y D   S G L  + V L  T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168

Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQM------RTTIA---------------- 240
           CG ++       +  T G++GLG G++S ISQ+      R  +                 
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEGGFLFFGDEF 228

Query: 241 -----------------------------GNQRLGVSTPDIVIDSGTTLTFL-PQGYNSN 270
                                        G +  G+    +V DSG++ T+   Q YNS 
Sbjct: 229 VPSSGVTWTSMSHESIGSYYSSGPAEVYFGGKATGIKDLTLVFDSGSSYTYFNSQAYNS- 287

Query: 271 LLSVMSSMIEAQPVADP--TGSLELCYS----FNSLSQVPE----VTIHF---RGADVKL 317
           +L+++ + +  +P+ D     SL +C+     F SL  V +    + + F   + A ++L
Sbjct: 288 ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNLLALRFTKTKNAQIQL 347

Query: 318 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
              N+ +      VC    GI N        + I G+I   + +V YD E++ + + PT+
Sbjct: 348 PPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTN 404

Query: 371 CTK 373
           C K
Sbjct: 405 CNK 407


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 88.2 bits (217), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 23/146 (15%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y   I++G+PP +   V DTGSDL W +C+PC P  C    S  FD   S+TYK+L C+ 
Sbjct: 3   YYSTITLGSPPKDFSLVMDTGSDLTWVRCDPCSP-DC----SSTFDRLASNTYKALTCAD 57

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL-GSTTGQAVALPGITFGCGTN 209
                            YS  YGDGSF+ G+L+ +T+ + G+ + +    PG  FGCG+ 
Sbjct: 58  ----------------DYSYGYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSL 101

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQM 235
             GL  S   GI+ L  G +S  SQ+
Sbjct: 102 LKGLI-SGEVGILALSPGSLSFPSQI 126



 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 48/142 (33%), Positives = 74/142 (52%), Gaps = 14/142 (9%)

Query: 241 GNQRLGVS--------TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
           GNQRL +S            + DSGTTLT LP G   ++   ++SM+         G L+
Sbjct: 194 GNQRLDLSPSAFLNGQDKPTIFDSGTTLTMLPPGVCDSIKQSLASMVSGAEFVAIKG-LD 252

Query: 293 LCYSF--NSLSQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIM 349
            C+    +S   +P++T HF  GAD     SN+ + +   + C +F   TN V I+GN+ 
Sbjct: 253 ACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDLGS-LQCLIFV-PTNEVSIFGNLQ 310

Query: 350 QTNFLVGYDIEQQTVSFKPTDC 371
           Q +F V +D++ + + FK TDC
Sbjct: 311 QQDFFVLHDMDNRRIGFKETDC 332


>gi|77553049|gb|ABA95845.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 372

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 55/155 (35%), Positives = 79/155 (50%), Gaps = 17/155 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLP 147
           Y + IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + 
Sbjct: 25  YFMGISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVG 83

Query: 148 CSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           CS+  C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++ 
Sbjct: 84  CSTEACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SID 139

Query: 201 GITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
              FGCG +N  L+N    GI+G G    S  +Q+
Sbjct: 140 NFIFGCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 172


>gi|168051774|ref|XP_001778328.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670305|gb|EDQ56876.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 165

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 67/140 (47%), Gaps = 16/140 (11%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y I I I TPP   L + DTGSDL W QC PC    CY+Q   +F+P  S +Y  + C
Sbjct: 10  GEYFIDIFIDTPPRHILVIIDTGSDLTWVQCTPCL--HCYLQKGLVFNPHSSESYDPVAC 67

Query: 149 SSSQCA----SLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTL--------GSTTG 194
              + A    S N+ +C      C Y   YGD S +  + ATET T+        G    
Sbjct: 68  GEPKRAFVESSNNRSTCVTDSQGCSYFYWYGDSSNTTSDFATETFTVNKTIKNDEGGGED 127

Query: 195 QAVALPGITFGCGTNNGGLF 214
             + +  I FGCG NN GLF
Sbjct: 128 DTLQISKIMFGCGHNNQGLF 147


>gi|224066811|ref|XP_002302227.1| predicted protein [Populus trichocarpa]
 gi|222843953|gb|EEE81500.1| predicted protein [Populus trichocarpa]
          Length = 422

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 145/362 (40%), Gaps = 86/362 (23%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
            +Y + ++IG PP       DTGSDL W QC+  P   C      L+ PK +     +PC
Sbjct: 66  GHYSVILNIGNPPKAFDLDIDTGSDLTWVQCD-APCKGCTKPLDKLYKPKNN----RVPC 120

Query: 149 SSSQCASLNQKSCS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +SS C ++   +C      C Y V Y D   S G L ++   L    G  +  P I FGC
Sbjct: 121 ASSLCQAIQNNNCDIPTEQCDYEVEYADLGSSLGVLLSDYFPLRLNNGSLLQ-PRIAFGC 179

Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-------------------------- 237
           G +    G      T GI+GLG G  S++SQ+RT                          
Sbjct: 180 GYDQKYLGPHSPPDTAGILGLGRGKASILSQLRTLGITQNVVGHCFSRVTGGFLFFGDHL 239

Query: 238 --------------------------TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNL 271
                                      + G +  G+    ++ DSG++ T+       ++
Sbjct: 240 LPPSGITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQSI 299

Query: 272 LSVMSSMIEAQPVADP--TGSLELCYS--------FNSLSQVPEVTIHF---RGADVKLS 318
           L+++   +   P+ D     +L +C+          +  S    +TI+F   +   ++L+
Sbjct: 300 LNLVRKDLSGMPLKDAPEEKALAVCWKTAKPIKSILDIKSFFKPLTINFIKAKNVQLQLA 359

Query: 319 RSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             ++ +   +  VC    GI N       ++ + G+I   + +V YD E+Q + + PT+C
Sbjct: 360 PEDYLIITKDGNVCL---GILNGGEQGLGNLNVIGDIFMQDRVVVYDNERQQIGWFPTNC 416

Query: 372 TK 373
            +
Sbjct: 417 NR 418


>gi|297734190|emb|CBI15437.3| unnamed protein product [Vitis vinifera]
          Length = 473

 Score = 87.8 bits (216), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 61/174 (35%), Positives = 83/174 (47%), Gaps = 14/174 (8%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
           + D+ PN   Y   I +G+PP       DTGSDL W QC+  P + C    +PL+ PK  
Sbjct: 92  RGDVYPNGL-YFTHIFVGSPPRRYFLDMDTGSDLTWIQCD-APCTSCAKGPNPLYKPKKG 149

Query: 141 STYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           +    +P   S C  + +   +G       C Y + Y D S S G LA++ + L    G 
Sbjct: 150 NL---VPLKDSLCVEVQRNLKTGYCETCEQCDYEIEYADHSSSMGVLASDDLHLMLANGS 206

Query: 196 AVALPGITFGCGTNNGG-LFNS--KTTGIVGLGGGDISLISQMRTTIAGNQRLG 246
              L GI FGC  +  G L NS  KT GI+GL    +SL SQ+ +    N  LG
Sbjct: 207 LTKL-GIMFGCAYDQQGLLLNSLAKTDGILGLSKAKVSLPSQLASQRIINNVLG 259


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 57/190 (30%), Positives = 93/190 (48%), Gaps = 18/190 (9%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
           L R L+R     +  + +++      ++P   + A Y+   +IGTPP     + D   +L
Sbjct: 26  LRRGLDRQGMRGRILADATAAPPGGAVVPLHWSGACYVANFTIGTPPQAVSGIVDLSGEL 85

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
           +WTQC  C  S C+ Q+ P+FDP  S+TY++  C S  C S+  ++CSG   C Y     
Sbjct: 86  VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
           +GD   + G  +T+ + +G+  G+      + FGC   + G  +      +G VGLG   
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196

Query: 229 ISLISQMRTT 238
            SL+ Q   T
Sbjct: 197 WSLVGQSNVT 206



 Score = 52.0 bits (123), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 32/122 (26%), Positives = 63/122 (51%), Gaps = 9/122 (7%)

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG-ADVKLS 318
           L++LP      L  V+++ + +  +A+P    +LC+   ++S VP++   F+G A +   
Sbjct: 295 LSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAP 354

Query: 319 RSNFFV-------KVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            S + +        V   I+ S       + V I G+++Q N    +D+E++T+SF+P D
Sbjct: 355 PSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPAD 414

Query: 371 CT 372
           C+
Sbjct: 415 CS 416


>gi|297805186|ref|XP_002870477.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316313|gb|EFH46736.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/158 (36%), Positives = 84/158 (53%), Gaps = 12/158 (7%)

Query: 86  PNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKS 145
           P +  Y   + IGTPP E   V DTGSD++W  C  C    C +Q+   FDP  SS+   
Sbjct: 77  PISRIYYTTLQIGTPPREFNVVIDTGSDVLWVSCISC--VGCPLQNVTFFDPGASSSAVK 134

Query: 146 LPCSSSQCAS-LNQKS-CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           L CS  +C S L++KS CS +  +Y V Y DGSF++G   ++ ++  +     + +    
Sbjct: 135 LACSDKRCFSDLHKKSGCSPL--EYKVEYSDGSFTSGYYISDLISFETVMSSNLTVKSSA 192

Query: 202 -ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQM 235
              FGC   + GL +   T   GIVGLG G + ++SQ+
Sbjct: 193 PFVFGCSNLHAGLISLPETSIHGIVGLGKGRLLVVSQL 230


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 75/249 (30%), Positives = 109/249 (43%), Gaps = 25/249 (10%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           ATF   +F L F     V P   Q+    + +I   S  SPF    +  +  +   +T +
Sbjct: 8   ATFF--LFALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
                     S+++  K +   I P       ANY++R+ +GTP  +   V DT +D  W
Sbjct: 64  SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
             C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ SYG
Sbjct: 124 VPC-----SGCTGCSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
             S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231

Query: 233 SQMRTTIAG 241
           SQ     +G
Sbjct: 232 SQAGAMYSG 240


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/177 (34%), Positives = 87/177 (49%), Gaps = 27/177 (15%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP     V DTGS+L W  C+  P        +  F+P +SS+Y   
Sbjct: 56  HNVTLTVSLTVGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 109

Query: 147 PCSSSQCASLNQ-----KSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PC+SS C +  +      SC   N  C   VSY D S + G LA ET +L        A 
Sbjct: 110 PCNSSICTTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 164

Query: 200 PGITFGCGTNNGGLF----NSKTTGIVGLGGGDISLISQMRT-----TIAGNQRLGV 247
           PG  FGC  + G       +SKTTG++G+  G +SL++QM        I+G   LGV
Sbjct: 165 PGTLFGCMDSAGYTSDINEDSKTTGLMGMNRGSLSLVTQMSLPKFSYCISGEDALGV 221



 Score = 44.7 bits (104), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 36/132 (27%), Positives = 60/132 (45%), Gaps = 13/132 (9%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEA--QPVADPT----GSLELCYSF-NSLSQVPE 305
           ++DSGT  TFL     S+L        +     + DP     G+++LCY    S + VP 
Sbjct: 286 MVDSGTQFTFLLGSVYSSLKDEFLEQTKGVLTRIEDPNFVFEGAMDLCYHAPASFAAVPA 345

Query: 306 VTIHFRGADVKLSRSNFFVKVSED---IVCSVFKG---ITNSVPIYGNIMQTNFLVGYDI 359
           VT+ F GA++++S      +VS+    + C  F     +     + G+  Q N  + +D+
Sbjct: 346 VTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEFDL 405

Query: 360 EQQTVSFKPTDC 371
            +  V F  T C
Sbjct: 406 LKSRVGFTQTTC 417


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 20/158 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++G+PP     V DTGS+L W  C+  P          +FDP  SS+Y  +
Sbjct: 59  HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 112

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC+S  C +  +      SC     C   +SY D S   GNLA++T  +G++     A+P
Sbjct: 113 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 167

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
              FGC   G ++    +SKTTG++G+  G +S ++QM
Sbjct: 168 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 205


>gi|108862257|gb|ABA96612.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 451

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ+               
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
                            R T A                    G + LGV   ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
           + T+        L+  +   +       P  SL LC+     F S+  V +    V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSF 342

Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
              + A +++   N+ +       C    GI N        + I G+I   + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399

Query: 361 QQTVSFKPTDCTK 373
           +  + +    C +
Sbjct: 400 RGQIGWIRAPCDR 412


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 83/158 (52%), Gaps = 20/158 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++G+PP     V DTGS+L W  C+  P          +FDP  SS+Y  +
Sbjct: 52  HNVSLTVSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHS------VFDPLRSSSYSPI 105

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC+S  C +  +      SC     C   +SY D S   GNLA++T  +G++     A+P
Sbjct: 106 PCTSPTCRTRTRDFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHIGNS-----AIP 160

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
              FGC   G ++    +SKTTG++G+  G +S ++QM
Sbjct: 161 ATIFGCMDSGFSSNSDEDSKTTGLIGMNRGSLSFVTQM 198


>gi|115487628|ref|NP_001066301.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|108862256|gb|ABA96613.2| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113648808|dbj|BAF29320.1| Os12g0177500 [Oryza sativa Japonica Group]
 gi|215693997|dbj|BAG89196.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 421

 Score = 87.4 bits (215), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ+               
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
                            R T A                    G + LGV   ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
           + T+        L+  +   +       P  SL LC+     F S+  V +    V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFRTVVLSF 342

Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
              + A +++   N+ +       C    GI N        + I G+I   + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399

Query: 361 QQTVSFKPTDCTK 373
           +  + +    C +
Sbjct: 400 RGQIGWIRAPCDR 412


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 94/190 (49%), Gaps = 18/190 (9%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTGSDL 114
           L R L++     +  + +++      ++P   + A+Y+   +IGTPP     + D   +L
Sbjct: 26  LRRGLDQQGMRGRILADATAAPPGGAVVPLHWSGAHYVANFTIGTPPQAVSGIVDLSGEL 85

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG-VNCQYSVS-- 171
           +WTQC  C  S C+ Q+ P+FDP  S+TY++  C S  C S+  ++CSG   C Y     
Sbjct: 86  VWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSIPTRNCSGDGECGYEAPSM 145

Query: 172 YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT---TGIVGLGGGD 228
           +GD   + G  +T+ + +G+  G+      + FGC   + G  +      +G VGLG   
Sbjct: 146 FGD---TFGIASTDAIAIGNAEGR------LAFGCVVASDGSIDGAMDGPSGFVGLGRTP 196

Query: 229 ISLISQMRTT 238
            SL+ Q   T
Sbjct: 197 WSLVGQSNVT 206



 Score = 52.4 bits (124), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 30/122 (24%), Positives = 63/122 (51%), Gaps = 9/122 (7%)

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLSR 319
           L++LP      L  V+++ + +  +A+P    +LC+   ++S VP++   F+G     ++
Sbjct: 295 LSYLPDAAYQALEKVVTAALGSPSMANPPEPFDLCFQNAAVSGVPDLVFTFQGGATLTAQ 354

Query: 320 SNFFV--------KVSEDIVCSV-FKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
            + ++         V   I+ S       + V I G+++Q N    +D+E++T+SF+P D
Sbjct: 355 PSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLEKETLSFEPAD 414

Query: 371 CT 372
           C+
Sbjct: 415 CS 416


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 78/279 (27%), Positives = 121/279 (43%), Gaps = 74/279 (26%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ P   +Y + ++IG P        DTGSDL W QC+ PC    C     PL+ P  
Sbjct: 45  QGNVYPT-GHYYVTMNIGNPAKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTA 101

Query: 140 SSTYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           +S    +PC+++ C +L      N K  S   C Y + Y D + S G L  +  +L   +
Sbjct: 102 NSL---VPCANALCTALHSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRS 158

Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR------------- 236
                 PG+TFGCG +      G   + T G++GLG G +SL+SQ++             
Sbjct: 159 SN--IRPGLTFGCGYDQQVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCL 216

Query: 237 -------------------------TTIAGN-------------QRLGVSTPDIVIDSGT 258
                                      I+GN             + LGV   ++V DSG+
Sbjct: 217 STNGGGFLFFGDDIVPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGS 276

Query: 259 TLT-FLPQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY 295
           T T F  Q Y + + ++ S + ++ + V+DP  SL LC+
Sbjct: 277 TYTYFTAQPYQAVVSALKSGLSKSLKQVSDP--SLPLCW 313


>gi|218186522|gb|EEC68949.1| hypothetical protein OsI_37668 [Oryza sativa Indica Group]
          Length = 421

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/373 (24%), Positives = 143/373 (38%), Gaps = 91/373 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ+               
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
                            R T A                    G + LGV   ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPE----VTIHF 310
           + T+        L+  +   +       P  SL LC+     F S+  V +    V + F
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFKTVVLSF 342

Query: 311 ---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIE 360
              + A +++   N+ +       C    GI N        + I G+I   + +V YD E
Sbjct: 343 SNGKKALMEIPPENYLIVTKYGNAC---LGILNGSEVGLKDLNIVGDITMQDQMVIYDNE 399

Query: 361 QQTVSFKPTDCTK 373
           +  + +    C +
Sbjct: 400 RGQIGWIRAPCDR 412


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 53/157 (33%), Positives = 75/157 (47%), Gaps = 23/157 (14%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           N   Y+    IGTPP +     D  SDL+WT C    P          F+P  S+T   +
Sbjct: 96  NAGMYVFSYGIGTPPQQVSGALDISSDLVWTACGATAP----------FNPVRSTTVADV 145

Query: 147 PCSSSQCASLNQKSCSG------VNCQYSVSYGDGSF-SNGNLATETVTLGSTTGQAVAL 199
           PC+   C     ++C          C Y+  YG G+  + G L TE  T G T      +
Sbjct: 146 PCTDDACQQFAPQTCGAGAGAGSSECAYTYMYGGGAANTTGLLGTEAFTFGDTR-----I 200

Query: 200 PGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            G+ FGCG  N G F S  +G++GLG G++SL+SQ++
Sbjct: 201 DGVVFGCGLQNVGDF-SGVSGVIGLGRGNLSLVSQLQ 236


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 53/140 (37%), Positives = 72/140 (51%), Gaps = 11/140 (7%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           +GTPP       + G++LIW    P P  +C+ Q  P F+P   S  + LP +S  C S 
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSP--ECFEQAFPYFEPLTFS--RGLPFAS--CGS- 53

Query: 157 NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS 216
             K      C Y+ SYGD S + G L  +  T     G   ++PG+ FGCG  N G+F S
Sbjct: 54  -PKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTF---VGAGASVPGVAFGCGLFNNGVFKS 109

Query: 217 KTTGIVGLGGGDISLISQMR 236
             TGI G G G +SL SQ++
Sbjct: 110 NETGIAGFGRGPLSLPSQLK 129



 Score = 57.8 bits (138), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 40/128 (31%), Positives = 64/128 (50%), Gaps = 9/128 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           +IDSGT++T LP      +    ++ I+   V         C+S  S ++  VP++ +HF
Sbjct: 213 IIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHF 272

Query: 311 RGADVKLSRSNFFVKVSED----IVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            GA + L R N+  +V +D    I+C ++ KG  +   I GN  Q N  V YD++   +S
Sbjct: 273 EGATMDLPRENYVFEVPDDAGNSIICLAINKG--DETTIIGNFQQQNMHVLYDLQNNMLS 330

Query: 366 FKPTDCTK 373
           F    C K
Sbjct: 331 FVAAQCDK 338


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 95/369 (25%), Positives = 150/369 (40%), Gaps = 80/369 (21%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 70  SARMRLHDDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 127

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + CS+  C   + KS     C Y   Y + S S+G L  + V+ G  T  
Sbjct: 128 QPDLSSTYSPVKCSAD-CTCDSDKS----QCTYERQYAEMSSSSGVLGEDIVSFG--TES 180

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTI------------- 239
            +      FGC  +  G LF+    GI+GLG G +S++ Q+  +  I             
Sbjct: 181 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 240

Query: 240 -AGNQRLGV--STPDIVID-------------------SGTTLTFLPQGYNSNLLSVMSS 277
             G   LG   + PD+V                     +G  L   P+ ++S   +V+ S
Sbjct: 241 GGGAMVLGAMPAPPDMVFSRSDPVRSPYYNIELKEIHVAGKALRLDPRIFDSKHGTVLDS 300

Query: 278 MI-------------------EAQPVADPTGS----LELCYS-----FNSLSQV-PEVTI 308
                                + +P+    G      ++C++      + LSQ  P+V +
Sbjct: 301 GTTYAYLPEQAFVAFKDAVTSKVRPLKKIRGPDPNYKDICFAGAGRNVSQLSQAFPDVDM 360

Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   + +
Sbjct: 361 VFGDGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 420

Query: 365 SFKPTDCTK 373
            F  T+C++
Sbjct: 421 GFWKTNCSE 429


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 65  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P+MSSTY+ + C +  C   + +      C Y   Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKC-NMDCNCDDDRE----QCVYEREYAEHSSSKGVL 177

Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
             + ++ G+ +   +      FGC T   G L++ +  GI+GLG GD+SL+ Q+
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 229


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 17/195 (8%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           P   SS  P  R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP
Sbjct: 34  PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
            E   + DTGS + +  C  C   QC     P F P++S++Y++L C+   C   ++   
Sbjct: 87  QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
            G  C Y   Y + S S+G L+ + ++ G+ +   ++     FGC     G LF+ +  G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197

Query: 221 IVGLGGGDISLISQM 235
           I+GLG G +S++ Q+
Sbjct: 198 IMGLGRGKLSVVDQL 212


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/350 (26%), Positives = 133/350 (38%), Gaps = 94/350 (26%)

Query: 100 PPTERLAVADTGSDLI-WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
           PP+ +  +A+   D I WTQC+PC   +C       FDP  S TY    C  S       
Sbjct: 83  PPSPQEILAEMNPDSITWTQCKPC--VRCLKDSHRHFDPSASLTYSLGSCIPST------ 134

Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG------ 212
                V   Y+++YGD S S GN   +T+TL  +       P   FGCG NN G      
Sbjct: 135 -----VGNTYNMTYGDKSTSVGNYGCDTMTLEPSD----VFPKFQFGCGRNNEGDFGSGA 185

Query: 213 --------------------------------------LFNSKTT--------------G 220
                                                 LF  K T              G
Sbjct: 186 DGMLGLGQGQLSTVSQTASKFKKVFSYCLPEEDSIGSLLFGEKATSQSSLKFTSLVNGPG 245

Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGV-----STPDIVIDSGTTLTFLPQGYNSNLLSVM 275
             GL       +  +  ++ GN+RL V     ++P  +IDSGT +T LPQ   S L +  
Sbjct: 246 TSGLEESGYYFVKLLDISV-GNKRLNVPSSVFASPGTIIDSGTVITCLPQRAYSALTAAF 304

Query: 276 SSMIEAQPVADPTGS----LELCYSFNSLSQV--PEVTIHF-RGADVKLSRSNFFVKVSE 328
              +   P+++        L+ CY+ +    V  PE+ +HF  GADV+L+          
Sbjct: 305 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRVIWGNDA 364

Query: 329 DIVCSVFKG-----ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             +C  F G     + + + I GN  Q +  V YDI+   + F    C+K
Sbjct: 365 SRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCSK 414


>gi|222613193|gb|EEE51325.1| hypothetical protein OsJ_32293 [Oryza sativa Japonica Group]
          Length = 371

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 77/314 (24%), Positives = 128/314 (40%), Gaps = 72/314 (22%)

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
            C+ QD P+F P  SST+K  PC +  C S+    C+   C Y    G G  + G +AT+
Sbjct: 60  HCFKQDLPVFVPNASSTFKPEPCGTDVCKSIPTPKCASDVCAYDGVTGLGGHTVGIVATD 119

Query: 186 TVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT------- 238
           T  +G+         G ++   +       +  +G +GLG    SL++QM+ T       
Sbjct: 120 TFAIGTAAPARPPASGASWRATSTPW----AGPSGFIGLGRTPWSLVAQMKLTRFSYCLA 175

Query: 239 ---IAGNQRL-----------GVSTPDIV-----------------IDSGTTLTFLPQGY 267
                 N RL           G  TP +                  I +G     +P+G 
Sbjct: 176 PHDTGKNSRLFLGASAKLAGGGAWTPFVKTSPNDGMSQYYPIELEEIKAGDATITMPRGR 235

Query: 268 NSNLL--------------------SVMSSMIEAQPVADPTGS-LELCYSFNSLSQVPEV 306
           N+ L+                    +VM+S + A P A P G+  E+C+    +S  P++
Sbjct: 236 NTVLVQTAVVRVSLLVDSVYQEFKKAVMAS-VGAAPTATPVGAPFEVCFPKAGVSGAPDL 294

Query: 307 TIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGIT-------NSVPIYGNIMQTNFLVGYD 358
              F+ GA + +  +N+   V  D VC     I        + + I G+  Q N  + +D
Sbjct: 295 VFTFQAGAALTVPPANYLFDVGNDTVCLSVMSIALLNITALDGLNILGSFQQENVHLLFD 354

Query: 359 IEQQTVSFKPTDCT 372
           +++  +SF+P DC+
Sbjct: 355 LDKDMLSFEPADCS 368


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 97/388 (25%), Positives = 154/388 (39%), Gaps = 88/388 (22%)

Query: 65  LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
            +HFN    +  S +           D +  N  Y  R+ IGTPP     + DTGS + +
Sbjct: 59  FSHFNPRRQLKESDSEHHPNARMRLYDDLLRNGYYTARLWIGTPPQRFALIVDTGSTVTY 118

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
             C  C    C     P F P+ S TY+ + C + QC   N +      C Y   Y + S
Sbjct: 119 VPCSTC--RHCGSHQDPKFRPEDSETYQPVKC-TWQCNCDNDRK----QCTYERRYAEMS 171

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
            S+G L  + V+ G+ T   ++     FGC  +  G ++N +  GI+GLG GD+S++ Q+
Sbjct: 172 TSSGALGEDVVSFGNQT--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229

Query: 236 --RTTIAGNQRL---------------GVSTP-DIV-------------ID------SGT 258
             +  I+ +  L               G+S P D+V             ID      +G 
Sbjct: 230 VEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMVFTRSDPVRSPYYNIDLKEIHVAGK 289

Query: 259 TLTFLPQGYNSNLLSVMSS------------MIEAQPVADPTGSL-----------ELCY 295
            L   P+ ++    +V+ S            +     +   T SL           ++C+
Sbjct: 290 RLHLNPKVFDGKHGTVLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPRYNDICF 349

Query: 296 SFNSL--SQV----PEVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIY 345
           S   +  SQ+    P V + F  G  + LS  N+     KV       VF    +   + 
Sbjct: 350 SGAEIDVSQISKSFPVVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLL 409

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           G I+  N LV YD E   + F  T+C++
Sbjct: 410 GGIVVRNTLVMYDREHTKIGFWKTNCSE 437


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 96/195 (49%), Gaps = 17/195 (8%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           P   SS  P  R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP
Sbjct: 34  PLSYSSLPPRPRVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPP 86

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC 161
            E   + DTGS + +  C  C   QC     P F P++S++Y++L C+   C   ++   
Sbjct: 87  QEFALIVDTGSTVTYVPCSTC--KQCGKHQDPKFQPELSTSYQALKCNPD-CNCDDE--- 140

Query: 162 SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTG 220
            G  C Y   Y + S S+G L+ + ++ G+ +   ++     FGC     G LF+ +  G
Sbjct: 141 -GKLCVYERRYAEMSSSSGVLSEDLISFGNES--QLSPQRAVFGCENEETGDLFSQRADG 197

Query: 221 IVGLGGGDISLISQM 235
           I+GLG G +S++ Q+
Sbjct: 198 IMGLGRGKLSVVDQL 212


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 93/191 (48%), Gaps = 18/191 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P        + +  F P+ S T+ S+
Sbjct: 62  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 121

Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC S+QC S +  S   C G +  C+ S+SY DGS S+G LATE  T+G       A   
Sbjct: 122 PCDSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 181

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD-----IVIDS 256
           +     T+  G+    T G++G+  G +S +SQ     A  +R      D     +++  
Sbjct: 182 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQ-----ASTRRFSYCISDRDDAGVLLLG 233

Query: 257 GTTLTFLPQGY 267
            + L FLP  Y
Sbjct: 234 HSDLPFLPLNY 244


>gi|110738505|dbj|BAF01178.1| hypothetical protein [Arabidopsis thaliana]
          Length = 284

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 65  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 124

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P+MSSTY+ + C +  C   + +      C Y   Y + S S G L
Sbjct: 125 --EQCGKHQDPKFQPEMSSTYQPVKC-NMDCNCDDDRE----QCVYEREYAEHSSSKGVL 177

Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
             + ++ G+ +   +      FGC T   G L++ +  GI+GLG GD+SL+ Q+
Sbjct: 178 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 229


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 76/158 (48%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y   I IGTP        DTGSD++W     C+ CP       +  L+DPK SST   + 
Sbjct: 33  YYTEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVS 92

Query: 148 CSSSQCAS----LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    CA+    L     + + C+YSV+YGDGS + G   ++ +     +G     P   
Sbjct: 93  CDQGFCAATYGGLLPGCTTSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANS 152

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            +TFGCG+  GG     N    GI+G G  + S++SQ+
Sbjct: 153 TVTFGCGSQQGGDLGSSNQALDGIIGFGQSNTSMLSQL 190


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 71/239 (29%), Positives = 113/239 (47%), Gaps = 28/239 (11%)

Query: 8   VFILFFLCFYVVSPIE------AQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTR 60
           +F L FL F +   +        Q  G ++++ H  SP SPF+ S    ++  +     +
Sbjct: 5   LFSLAFLFFTLAQGMHLNPKCGIQDQGSNLQVFHVYSPCSPFWPSKPLKWEESVLQMQAK 64

Query: 61  SLNRLNHFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLI 115
              RL      SS+ + K     AS   I+  +  Y++R  IGTP    L   DT +D  
Sbjct: 65  DQARLQFL---SSLVARKSVVPIASGRQIV-QSPTYIVRAKIGTPAQTMLLAMDTSNDAA 120

Query: 116 WTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDG 175
           W  C     S C    S +F+   S+T+K++ C + QC  +    C G  C ++++YG  
Sbjct: 121 WIPC-----SGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQVPNSKCGGSACAFNMTYGSS 175

Query: 176 SFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
           S +  NL+ + VTL + +     +P  TFGC T   G  +    G++GLG G +SL+SQ
Sbjct: 176 SIA-ANLSQDVVTLATDS-----IPSYTFGCLTEATG-SSIPPQGLLGLGRGPMSLLSQ 227


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 144/365 (39%), Gaps = 86/365 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTG+D++W    QC+ CP       D  L++ K SS+ K +P
Sbjct: 73  YYAKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVP 132

Query: 148 CSSSQCASLNQKSCSGV------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           C    C  +N    +G       +C Y   YGDGS + G    + V     +G    A A
Sbjct: 133 CDQELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASA 192

Query: 199 LPGITFGCGTNNGGLFN----SKTTGIVGLG----------------------------G 226
              + FGCG    G  +        GI+G G                            G
Sbjct: 193 NGSVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCLNGVNG 252

Query: 227 GDISLISQ---------------------MRTTIAGNQRLGVST--------PDIVIDSG 257
           G I  I                       M     G+  L +ST           +IDSG
Sbjct: 253 GGIFAIGHVVQPTVNTTPLLPDQPHYSVNMTAIQVGHTFLNLSTDASEQRDSKGTIIDSG 312

Query: 258 TTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR-GA 313
           TTL +LP G Y   +  ++S    ++ Q + D     +  YS +     P VT +F  G 
Sbjct: 313 TTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ--YSGSVDDGFPNVTFYFENGL 370

Query: 314 DVKLSRSNFFVKVSEDIVCSVFKG------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
            +K+   ++   +SE++ C  ++        + ++ + G+++ +N LV YD+E Q + + 
Sbjct: 371 SLKVYPHDYLF-LSENLWCIGWQNSGAQSRDSKNMTLLGDLVLSNKLVFYDLENQVIGWT 429

Query: 368 PTDCT 372
             +C+
Sbjct: 430 EYNCS 434


>gi|449464178|ref|XP_004149806.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 437

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 150/363 (41%), Gaps = 90/363 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I+IG          D+GSDL W QC+  P + C      L+ P  ++    L C  
Sbjct: 55  YSVSINIGKGDEAFEFDIDSGSDLTWVQCD-APCTHCTKPREQLYKPNNNA----LNCFE 109

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C SL+      C   +  CQY + Y D   S G L  + V L  T G ++A P I FG
Sbjct: 110 PLCTSLHPITNHHCKSADDQCQYEIEYADHGSSLGVLVNDHVPLKLTNG-SLAAPRIAFG 168

Query: 206 CGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT-----TIAGN--------------- 242
           CG ++       +  T G++GLG G++S ISQ+ +      + G+               
Sbjct: 169 CGYDHKYSVPDSSPPTAGVLGLGNGEVSFISQLSSMGVVRNVVGHCLSDEGGFLFFGDEF 228

Query: 243 -------------------------------QRLGVSTPDIVIDSGTTLTFL-PQGYNSN 270
                                          +  G+    +V DSG++ T+   Q YNS 
Sbjct: 229 VPSSGVTWTSMSHESIGSYYSSGPAEVYFSGKATGIKDLTLVFDSGSSYTYFNSQAYNS- 287

Query: 271 LLSVMSSMIEAQPVADP--TGSLELCYS----FNSLSQVPE----VTIHF---RGADVKL 317
           +L+++ + +  +P+ D     SL +C+     F SL  V +    + + F   + A ++L
Sbjct: 288 ILALVKNNLRGKPLEDAPEDKSLPVCWKGTRPFKSLRDVKKYFNPLALRFTKTKNAQIQL 347

Query: 318 SRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTD 370
              N+ +      VC    GI N        + I G+I   + +V YD E++ + + PT+
Sbjct: 348 PPENYLIITKYGNVCF---GILNGTEVGLGDLNIIGDISLKDKMVIYDNERRRIGWFPTN 404

Query: 371 CTK 373
           C K
Sbjct: 405 CNK 407


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/160 (36%), Positives = 75/160 (46%), Gaps = 13/160 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI +GTPP       DTGSD++W  C+P   CP +         FDP+ SST   L 
Sbjct: 41  YYTRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLS 100

Query: 148 CSSSQCASLNQKSCSGVN----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---ALP 200
           C  S+C S NQ S S       C YS  YGDGS + G   ++         Q V   A  
Sbjct: 101 CIDSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASA 160

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
            ITFGC  N  G     +    GI G G  D+S++SQ+ +
Sbjct: 161 KITFGCSYNQSGDLTKPDRAVDGIFGFGQNDLSVVSQLNS 200


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 159/368 (43%), Gaps = 84/368 (22%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSS 141
            D+ P   +Y + ++IG P        DTGSDL W QC+  P   C     PL+ P  + 
Sbjct: 49  GDVYPT-GHYYVTMNIGDPAKPYFLDVDTGSDLTWLQCD-APCQSCNKVPHPLYRPTKN- 105

Query: 142 TYKSLPCSSSQCASL------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             K +PC++S C +L      N+K  +   C Y + Y D + S G L  ++ +L     +
Sbjct: 106 --KLVPCANSICTALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSL-PLRNK 162

Query: 196 AVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR--------------- 236
           +   P ++FGCG +      G   + T G++GLG G +SL+SQ++               
Sbjct: 163 SNVRPSLSFGCGYDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCLST 222

Query: 237 ------------------------TTIAGN-----------QRLGVSTP--DIVIDSGTT 259
                                    + +GN            R  +ST   ++V DSG+T
Sbjct: 223 SGGGFLFFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEVVFDSGST 282

Query: 260 LTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCY----SFNSLSQVPE--VTIHF- 310
            T+   Q Y + + ++  S+ ++ + V+DP  SL LC+    +F S+S V +   ++ F 
Sbjct: 283 YTYFSAQPYQATISAIKGSLSKSLKQVSDP--SLPLCWKGQKAFKSVSDVKKDFKSLQFI 340

Query: 311 --RGADVKLSRSNFFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVS 365
             + A + +   N+ +      VC  +  G     S  I G+I   + +V YD E+  + 
Sbjct: 341 FGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNEKAQLG 400

Query: 366 FKPTDCTK 373
           +    C++
Sbjct: 401 WIRGSCSR 408


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 61/191 (31%), Positives = 93/191 (48%), Gaps = 18/191 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P        + +  F P+ S T+ S+
Sbjct: 61  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASV 120

Query: 147 PCSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC S+QC S +  S   C G +  C+ S+SY DGS S+G LATE  T+G       A   
Sbjct: 121 PCGSAQCRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTVGQGPPLRAAFGC 180

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPD-----IVIDS 256
           +     T+  G+    T G++G+  G +S +SQ     A  +R      D     +++  
Sbjct: 181 MATAFDTSPDGV---ATAGLLGMNRGALSFVSQ-----ASTRRFSYCISDRDDAGVLLLG 232

Query: 257 GTTLTFLPQGY 267
            + L FLP  Y
Sbjct: 233 HSDLPFLPLNY 243


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 69/191 (36%), Positives = 93/191 (48%), Gaps = 25/191 (13%)

Query: 57  ALTRSLNRLNHFNQNSSISSSKASQADIIP--NNANYLIRISIGTPPTERLAVADTGSDL 114
           A  RS  RL+ +      +S   ++A +        Y+++ SIG PP    A  DTGSDL
Sbjct: 57  AAERSRRRLSVY------TSGTGTKAPVTKSQKGGKYIMQFSIGEPPLLIWAEVDTGSDL 110

Query: 115 IWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ-----KSCSGVN--CQ 167
           +W +C PC  + C    SPL+DP  S +   LPCSS  C +L +       CS     C 
Sbjct: 111 MWVKCSPC--NGCNPPPSPLYDPARSRSSGKLPCSSQLCQALGRGRIISDQCSDDPPLCG 168

Query: 168 YSVSYGD-GSFS-NGNLATETVTLGSTTGQAVALPGITFG-CGTNNGGLFNSKTTGIVGL 224
           Y  +YG  G  S  G L TET T     G       ++FG   T +G  F   T G+VGL
Sbjct: 169 YHYAYGHSGDHSTQGVLGTETFTF----GDGYVANNVSFGRSDTIDGSQFGG-TAGLVGL 223

Query: 225 GGGDISLISQM 235
           G G +SL+SQ+
Sbjct: 224 GRGHLSLVSQL 234


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 92/208 (44%), Gaps = 25/208 (12%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN--------YLIRISIGTPP 101
           P QR  +   RSL+ +   +        +   A  +P   N        Y  ++ +G+P 
Sbjct: 26  PVQRKFNGPHRSLDAIKAHDDRRR---GRFLAAIDVPLGGNGLPSSTGLYYTKVGLGSPA 82

Query: 102 TERLAVADTGSDLIWTQCE---PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQ 158
            E     DTGSD++W  C     CP       D  L+DP  S T  ++PC    C     
Sbjct: 83  KEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYS 142

Query: 159 KSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---GITFGCGTNNG 211
              SG    ++C YS++YGDGS ++G+   +++T    +G     P    + FGCG    
Sbjct: 143 GPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQS 202

Query: 212 GLFNSKT----TGIVGLGGGDISLISQM 235
           G  +S +     GI+G G  + S++SQ+
Sbjct: 203 GSLSSNSDEALDGIIGFGQANSSVLSQL 230



 Score = 43.1 bits (100), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 34/129 (26%), Positives = 57/129 (44%), Gaps = 11/129 (8%)

Query: 253 VIDSGTTLTFLPQG-YNSNLLSVMSSM--IEAQPVADPTGSLELCYSFNSLSQVPEVTIH 309
           +IDSGTTL +LP   YN  L  V+     ++   V D        YS       P V  H
Sbjct: 305 IIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVEDQFTCFH--YSDKLDEGFPVVKFH 362

Query: 310 FRGADVKLSRSNFFVKVSEDIVCSVFKGIT------NSVPIYGNIMQTNFLVGYDIEQQT 363
           F G  + +   ++     EDI C  ++  +        + + G+++ +N LV YD+E   
Sbjct: 363 FEGLSLTVHPHDYLFLYKEDIYCIGWQKSSTQTKEGRDLILIGDLVLSNKLVVYDLENMV 422

Query: 364 VSFKPTDCT 372
           + +   +C+
Sbjct: 423 IGWTNFNCS 431


>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q +P++DP  SSTY  + C S  C +L    C S   C+Y  +YGD S + G L+ ET+T
Sbjct: 2   QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETLT 61

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           L S +G    +P   FGCG NN G    +  GIVGLG G +SLISQ+  ++
Sbjct: 62  LTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/418 (24%), Positives = 160/418 (38%), Gaps = 86/418 (20%)

Query: 21  PIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN--SSISSSK 78
           P+     GF  EL H      P+  SS   +   R +   S  R+          +S   
Sbjct: 30  PVAGSDAGFRAELHH------PYAGSSLPVHDMWRRSARASKARVARLEARLTGDMSVPL 83

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           A  +D       Y + I IGTPP     +ADT SDL WTQC     +    Q  PLFDP 
Sbjct: 84  ARISD-----EGYTVTIGIGTPPQLHTLIADTASDLTWTQCNLF--NDTAKQVEPLFDPA 136

Query: 139 MSSTYKSLPCSSSQCASLN--QKSCSGVNCQY---------------------------- 168
            SS++  + CSS  C   N   K CS   C+Y                            
Sbjct: 137 KSSSFAFVTCSSKLCTEDNPGTKRCSNKTCRYVYPYVSVEAAGVLAYESFTLSDNNQHIC 196

Query: 169 -SVSYGDGSFSNGNLATETVTLGSTTG-----QAVALPGIT---------------FGCG 207
            S  +G G+ ++GNL   +  LG +         +A+P  +               FG  
Sbjct: 197 MSFGFGCGALTDGNLLGASGILGMSPAILSMVSQLAIPKFSYCLTPYTDRKSSPLFFGAW 256

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGY 267
            + G     KTTG +         +  +  ++ G +RL V      +  G T+  L    
Sbjct: 257 ADLG---RYKTTGPIQKSLTFYYYVPLVGLSL-GTRRLDVPAATFALKQGGTVVDLGCTV 312

Query: 268 NSNLLSVMSSMIEA------QPVADPT-GSLELCYSFNS-----LSQVPEVTIHFRG-AD 314
                   +++ EA       P+ + T    ++C++  S       Q P + ++F G AD
Sbjct: 313 GQLAEPAFTALKEAVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYFDGGAD 372

Query: 315 VKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + L R N+F + +  ++C ++  G    + I GN+ Q NF + +D+      F PT C
Sbjct: 373 MVLPRDNYFQEPTAGLMCLALVPG--GGMSIIGNVQQQNFHLLFDVHDSKFLFAPTIC 428


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 86.3 bits (212), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/174 (31%), Positives = 85/174 (48%), Gaps = 10/174 (5%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 66  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 125

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              QC     P F P++SSTY+ + C +  C   + K      C Y   Y + S S G L
Sbjct: 126 --EQCGKHQDPKFQPELSSTYQPVKC-NMDCNCDDDKE----QCVYEREYAEHSSSKGVL 178

Query: 183 ATETVTLGSTTGQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
             + ++ G+ +   +      FGC T   G L++ +  GI+GLG GD+SL+ Q+
Sbjct: 179 GEDLISFGNES--QLTPQRAVFGCETVETGDLYSQRADGIIGLGQGDLSLVDQL 230


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/160 (35%), Positives = 81/160 (50%), Gaps = 22/160 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    I ++IG+PP     V DTGS+L W  C+  P        +  F+P +SS+Y   
Sbjct: 55  HNVTLTISLTIGSPPQNVTMVLDTGSELSWLHCKKLP------NLNSTFNPLLSSSYTPT 108

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PC+SS C +  +      SC   N  C   VSY D S + G LA ET +L        A 
Sbjct: 109 PCNSSVCMTRTRDLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLA-----GAAQ 163

Query: 200 PGITFGCGTNNGGL----FNSKTTGIVGLGGGDISLISQM 235
           PG  FGC  + G       ++KTTG++G+  G +SL++QM
Sbjct: 164 PGTLFGCMDSAGYTSDINEDAKTTGLMGMNRGSLSLVTQM 203



 Score = 45.1 bits (105), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 42/136 (30%), Positives = 64/136 (47%), Gaps = 21/136 (15%)

Query: 253 VIDSGTTLTFL-PQGYNS---NLLSVMSSMIEAQPVADPT----GSLELCYSF-NSLSQV 303
           ++DSGT  TFL    YNS     L     ++    + DP     G+++LCY    SL+ V
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGVLTR--IEDPNFVFEGAMDLCYHAPASLAAV 342

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFK-----GITNSVPIYGNIMQTNFLV 355
           P VT+ F GA++++S      +VS+    + C  F      GI   V   G+  Q N  +
Sbjct: 343 PAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYV--IGHHHQQNVWM 400

Query: 356 GYDIEQQTVSFKPTDC 371
            +D+ +  V F  T C
Sbjct: 401 EFDLVKSRVGFTETTC 416


>gi|222637182|gb|EEE67314.1| hypothetical protein OsJ_24556 [Oryza sativa Japonica Group]
          Length = 304

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 92/327 (28%), Positives = 145/327 (44%), Gaps = 70/327 (21%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP-----LFDPKMSSTYKSLP 147
           + +++GTPP    A+    SDL W +C PC  S C    +P     L+D   SS++   P
Sbjct: 1   MELAVGTPPVTVQALFGI-SDLCWVECTPC--SGCNNNAAPPAGARLYDRANSSSFS--P 55

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + ++C         G    Y  +  D ++  G L TET+  GS    A  +   TFGC 
Sbjct: 56  LADTEC---------GYRYVYGATDTDRNYVKGILGTETIKFGSN--DAATVQSFTFGC- 103

Query: 208 TN---NGGLFNSKTTGIVGLGGGDISLISQMRTT-----IAGNQRL-------------- 245
           TN      LF+  T G+VGLG   +SL+ Q+        +A N  +              
Sbjct: 104 TNTVYRNDLFDGNT-GVVGLGRSKLSLVGQLGLDRFSYCLASNPNVASPVLFGSTASMDG 162

Query: 246 -GVS-TPDIVIDSGTTLTFLPQGYNSNLLSV------MSSMIEAQPVADPTGSLELCYSF 297
            GVS TP +  D+   +  L    +   L++      MS   EA       GS  LC+  
Sbjct: 163 NGVSSTPLLPDDANYYVNLLGISVDGTRLAIPNDTARMSRTYEAV-----NGSGLLCFLV 217

Query: 298 NSLSQ----VPEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKGITNSVPIYG 346
           +  S+    VP +T+HF G D++L   N+F    +       D++C +  G +++    G
Sbjct: 218 DDASKNVVTVPTMTMHFDGMDMELLFGNYFAYTGKQSGGGGGDVLC-LMIGKSSTGSRIG 276

Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           N +Q +F V Y+++   +S +P DC K
Sbjct: 277 NYLQMDFHVLYELKNSVLSVQPADCGK 303


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 13/180 (7%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L+ S   L     +S+ ++      D+IP    Y  RI IGTPP     + DTGS L + 
Sbjct: 60  LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
            C  C   QC     P F P  SSTY+ L C S +C   ++     ++C Y   Y + S 
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171

Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+G L  + V+ G    Q+   P  T FGC     G +++ +  GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 86/158 (54%), Gaps = 20/158 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N + ++ +++GTPP     V DTGS+L W  C         +     FDP  S++Y+++
Sbjct: 27  HNVSLIVSLTVGTPPQNVSMVIDTGSELSWLHCNKT------LSYPTTFDPTRSTSYQTI 80

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSS  C +  Q      SC   N C  ++SY D S S+GNLA++   +GS+      + 
Sbjct: 81  PCSSPTCTNRTQDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHIGSSD-----IS 135

Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
           G+ FGC     ++    +SK+TG++G+  G +S +SQ+
Sbjct: 136 GLVFGCMDSVFSSNSDEDSKSTGLMGMNRGSLSFVSQL 173



 Score = 39.7 bits (91), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 36/140 (25%), Positives = 62/140 (44%), Gaps = 23/140 (16%)

Query: 253 VIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADP----TGSLELCY----SFNSL 300
           ++DSGT  TFL         S  L+  SS++    + DP     G+++LCY    S   L
Sbjct: 256 MVDSGTQFTFLLGPVYNALRSAFLNQTSSVLRV--LEDPDFVFQGAMDLCYLVPLSQRVL 313

Query: 301 SQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKG---ITNSVPIYGNIMQT 351
             +P VT+ FRGA++ +S      +V      ++ + C  F     +     + G+  Q 
Sbjct: 314 PLLPTVTLVFRGAEMTVSGDRVLYRVPGELRGNDSVHCLSFGNSDLLGVEAYVIGHHHQQ 373

Query: 352 NFLVGYDIEQQTVSFKPTDC 371
           N  + +D+E+  +      C
Sbjct: 374 NVWMEFDLEKSRIGLAQVRC 393


>gi|296084856|emb|CBI28265.3| unnamed protein product [Vitis vinifera]
          Length = 446

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 145/363 (39%), Gaps = 63/363 (17%)

Query: 50  PYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVAD 109
           PY+ LR  ++ SL R  H     +  +S         +   Y I +S GTPP     + D
Sbjct: 51  PYRNLRHLVSASLIRARHLKNPKTTPTSTTPLFTH--SYGAYSIPLSFGTPPQTLPLIMD 108

Query: 110 TGSDLIWTQCEPCPPSQCYM----------QDSPLFDPKMSSTYKSLPCSSSQCASLN-- 157
           TGSDL+W  C     +  Y+            S +F PK SS+ K L C + +C  ++  
Sbjct: 109 TGSDLVWFPC-----THRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLGCVNPKCGWIHGS 163

Query: 158 ------------QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
                         +C+ +   Y V YG G  + G + +ET+ L    G+ V  P    G
Sbjct: 164 KVQSRCRDCEPTSPNCTQICPPYLVFYGSG-ITGGIMLSETLDL---PGKGV--PNFIVG 217

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ 265
           C      L  S+  GI G G G  SL SQ+         L     D    S      +  
Sbjct: 218 CSV----LSTSQPAGISGFGRGPPSLPSQLGLKKFSYCLLSRRYDDTTESSSLIFELVAA 273

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFR-GADVKLSRSNF 322
            +   + S  ++ +E          L  C++ + L+    PE+T+ FR GA+++L  +N+
Sbjct: 274 EFEKQVQSKRATEVEG------ITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANY 327

Query: 323 FVKV-SEDIVC----------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
              +  +D+VC            F G      I GN  Q NF V YD+  + + F+   C
Sbjct: 328 VAFLGGDDVVCLTIVTDGAAGKEFSG--GPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 385

Query: 372 TKQ 374
              
Sbjct: 386 NPH 388


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 13/180 (7%)

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           L+ S   L     +S+ ++      D+IP    Y  RI IGTPP     + DTGS L + 
Sbjct: 60  LSHSRRHLQRSESHSTATARMPLYDDLIPY-GYYTTRIWIGTPPQTFALIVDTGSTLTYV 118

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
            C  C   QC     P F P  SSTY+ L C S +C   ++     ++C Y   Y + S 
Sbjct: 119 PCSTC--EQCGKHQDPNFQPDWSSTYQPLKC-SMECTCDSEM----MHCVYDRQYAEMSS 171

Query: 178 SNGNLATETVTLGSTTGQAVALPGIT-FGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           S+G L  + V+ G    Q+   P  T FGC     G +++ +  GI+GLG GD+S++ Q+
Sbjct: 172 SSGVLGEDIVSFGK---QSELKPQRTVFGCENVETGDIYSQRADGIMGLGRGDLSIVDQL 228


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 20/158 (12%)

Query: 87   NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
            +N    + +++G+PP +   V DTGS+L W  C+  P        + +F+P  SS+Y  +
Sbjct: 996  HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 1049

Query: 147  PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
            PCSS  C +  +   + V       C   VSY D S   GNLA++   +GS+     ALP
Sbjct: 1050 PCSSPICRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 1104

Query: 201  GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            G  FGC   G ++    ++KTTG++G+  G +S ++Q+
Sbjct: 1105 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL 1142


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 110/446 (24%), Positives = 176/446 (39%), Gaps = 98/446 (21%)

Query: 4   FLSCVFILFFLCFYVVSPIEAQTGGFSVELI---HRDSPKSPFYNSSETPYQRLRDALTR 60
            +S + IL F+  Y  S  +    G    +I   +  SPKS  +          R A+  
Sbjct: 8   LISAIVILSFVTIYSSSASQIPNRGVRRPMIFPLYFASPKSSGH----------RQAIEG 57

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           S  R +  +      +++    D + +N  Y  R+ IGTPP E   + DTGS + +  C 
Sbjct: 58  SYWRRHLKSDPYHHPNARMRLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCS 117

Query: 121 PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFSN 179
            C    C     P F P  SSTY  + C+    C         GVNC Y   Y + S S+
Sbjct: 118 DC--EHCGKHQDPRFQPDESSTYHPVKCNMDCNCDH------DGVNCVYERRYAEMSSSS 169

Query: 180 GNLATETVTLGSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLG------------ 225
           G L  + ++ G+   Q+  +P    FGC     G L++ +  GI+GLG            
Sbjct: 170 GVLGEDIISFGN---QSEVVPQRAVFGCENVETGDLYSQRADGIMGLGRGQLSIVDQLVD 226

Query: 226 ------------------GGDISL-----------------------ISQMRTTIAGNQ- 243
                             GG + L                       I      +AG   
Sbjct: 227 KNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMVFSRSDPYRSPYYNIELKEIHVAGKPL 286

Query: 244 RLGVSTPD----IVIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS- 296
           +L  ST D     V+DSGTT  +LP+  + +   +++      + +  P  +  ++C+S 
Sbjct: 287 KLSPSTFDRKHGTVLDSGTTYAYLPEEAFVAFRDAIIKKSHNLKQIHGPDPNYNDICFSG 346

Query: 297 ----FNSLSQV-PEVTIHF-RGADVKLSRSNFF---VKVSEDIVCSVFKGITNSVPIYGN 347
                + LS+  PEV + F  G  + L+  N+     KV       +F+   +S  + G 
Sbjct: 347 AGRDVSQLSKAFPEVDMVFSNGQKLSLTPENYLFQHTKVHGAYCLGIFRN-GDSTTLLGG 405

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDCTK 373
           I+  N LV YD E + + F  T+C++
Sbjct: 406 IIVRNTLVTYDRENEKIGFWKTNCSE 431


>gi|222616654|gb|EEE52786.1| hypothetical protein OsJ_35257 [Oryza sativa Japonica Group]
          Length = 346

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 77/151 (50%), Gaps = 17/151 (11%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSLPCSSS 151
           IS+GTPP   L   DTGS L W QC+ C   +CY Q +    +F+P  SSTY  + CS+ 
Sbjct: 3   ISLGTPPVFNLVTIDTGSTLSWVQCKNCQI-KCYDQAAKAGQIFNPYNSSTYSKVGCSTE 61

Query: 152 QCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
            C  ++     +  C   +  C YS+ YG G +S G L  + +TL S      ++    F
Sbjct: 62  ACNGMHMDLAVEYGCVEEDDTCIYSLRYGSGEYSVGYLGKDRLTLASNR----SIDNFIF 117

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           GCG +N  L+N    GI+G G    S  +Q+
Sbjct: 118 GCGEDN--LYNGVNAGIIGFGTKSYSFFNQV 146


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 93/188 (49%), Gaps = 25/188 (13%)

Query: 53  RLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGS 112
           R+ D   R L++       S + ++     D + +N  Y  R+ IGTPP E   + DTGS
Sbjct: 49  RVEDFRRRRLHQ-------SQLPNAHMKLYDDLLSNGYYTTRLWIGTPPQEFALIVDTGS 101

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
            + +  C  C   QC     P F P++SS+YK+L C+   C   ++    G  C Y   Y
Sbjct: 102 TVTYVPCSTC--KQCGKHQDPKFQPELSSSYKALKCNPD-CNCDDE----GKLCVYERRY 154

Query: 173 GDGSFSNGNLATETVTLGS----TTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGG 227
            + S S+G L+ + ++ G+    T  +AV      FGC     G LF+ +  GI+GLG G
Sbjct: 155 AEMSSSSGVLSEDLISFGNESQLTPQRAV------FGCENVETGDLFSQRADGIMGLGRG 208

Query: 228 DISLISQM 235
            +S++ Q+
Sbjct: 209 KLSVVDQL 216


>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q +P++DP  SSTY  + C S  C +L    C S   C+Y  +YGD S + G L+ ET+T
Sbjct: 2   QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLT 61

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           L S +G    +P   FGCG NN G    +  GIVGLG G +SLISQ+  ++
Sbjct: 62  LTSKSGAEQLIPNFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112


>gi|51091919|dbj|BAD35188.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|125596474|gb|EAZ36254.1| hypothetical protein OsJ_20576 [Oryza sativa Japonica Group]
 gi|196212950|gb|ACG76111.1| S5 [Oryza sativa Japonica Group]
 gi|340810891|gb|AEK75372.1| S5 [Oryza sativa]
 gi|340810893|gb|AEK75373.1| S5 [Oryza sativa]
 gi|340810899|gb|AEK75376.1| S5 [Oryza sativa]
 gi|340810901|gb|AEK75377.1| S5 [Oryza sativa]
 gi|340810933|gb|AEK75393.1| S5 [Oryza sativa]
 gi|340810947|gb|AEK75400.1| S5 [Oryza sativa]
 gi|340810949|gb|AEK75401.1| S5 [Oryza sativa]
 gi|340810967|gb|AEK75410.1| S5 [Oryza sativa]
 gi|340810969|gb|AEK75411.1| S5 [Oryza sativa]
 gi|340810999|gb|AEK75426.1| S5 [Oryza rufipogon]
 gi|340811017|gb|AEK75435.1| S5 [Oryza rufipogon]
 gi|340811029|gb|AEK75441.1| S5 [Oryza nivara]
 gi|340811051|gb|AEK75452.1| S5 [Oryza nivara]
 gi|340811075|gb|AEK75464.1| S5 [Oryza nivara]
 gi|340811077|gb|AEK75465.1| S5 [Oryza rufipogon]
 gi|340811085|gb|AEK75469.1| S5 [Oryza nivara]
 gi|340811096|gb|AEK75474.1| S5 [Oryza rufipogon]
 gi|340811100|gb|AEK75476.1| S5 [Oryza rufipogon]
 gi|340811114|gb|AEK75483.1| S5 [Oryza nivara]
          Length = 472

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|340810993|gb|AEK75423.1| S5 [Oryza rufipogon]
 gi|340811015|gb|AEK75434.1| S5 [Oryza nivara]
 gi|340811021|gb|AEK75437.1| S5 [Oryza nivara]
          Length = 474

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 91  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 239


>gi|340810907|gb|AEK75380.1| S5 [Oryza sativa]
          Length = 472

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|125554529|gb|EAZ00135.1| hypothetical protein OsI_22138 [Oryza sativa Indica Group]
          Length = 472

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 48/111 (43%), Positives = 65/111 (58%), Gaps = 1/111 (0%)

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVT 188
           Q +P++DP  SSTY  + C S  C +L    C S   C+Y  +YGD S + G L+ ET+T
Sbjct: 2   QPTPIYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLT 61

Query: 189 LGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTI 239
           L S +G    +P   FGCG NN G    +  GIVGLG G +SLISQ+  ++
Sbjct: 62  LTSKSGAEQLIPKFAFGCGQNNEGNGFDQGAGIVGLGRGPLSLISQLSASM 112


>gi|196212952|gb|ACG76112.1| S5 [Oryza sativa Indica Group]
 gi|338809989|gb|AEJ08560.1| S5 [Oryza barthii]
 gi|340810883|gb|AEK75368.1| S5 [Oryza sativa]
 gi|340810885|gb|AEK75369.1| S5 [Oryza sativa]
 gi|340810889|gb|AEK75371.1| S5 [Oryza sativa]
 gi|340810895|gb|AEK75374.1| S5 [Oryza sativa]
 gi|340810897|gb|AEK75375.1| S5 [Oryza sativa]
 gi|340810905|gb|AEK75379.1| S5 [Oryza sativa]
 gi|340810909|gb|AEK75381.1| S5 [Oryza sativa]
 gi|340810911|gb|AEK75382.1| S5 [Oryza sativa]
 gi|340810913|gb|AEK75383.1| S5 [Oryza sativa]
 gi|340810923|gb|AEK75388.1| S5 [Oryza sativa]
 gi|340810925|gb|AEK75389.1| S5 [Oryza sativa]
 gi|340810929|gb|AEK75391.1| S5 [Oryza sativa]
 gi|340810935|gb|AEK75394.1| S5 [Oryza sativa]
 gi|340810937|gb|AEK75395.1| S5 [Oryza sativa]
 gi|340810939|gb|AEK75396.1| S5 [Oryza sativa]
 gi|340810941|gb|AEK75397.1| S5 [Oryza sativa]
 gi|340810943|gb|AEK75398.1| S5 [Oryza sativa]
 gi|340810951|gb|AEK75402.1| S5 [Oryza sativa]
 gi|340810953|gb|AEK75403.1| S5 [Oryza sativa]
 gi|340810963|gb|AEK75408.1| S5 [Oryza sativa]
 gi|340810965|gb|AEK75409.1| S5 [Oryza sativa]
 gi|340810973|gb|AEK75413.1| S5 [Oryza nivara]
 gi|340811003|gb|AEK75428.1| S5 [Oryza rufipogon]
 gi|340811005|gb|AEK75429.1| S5 [Oryza rufipogon]
 gi|340811009|gb|AEK75431.1| S5 [Oryza rufipogon]
 gi|340811023|gb|AEK75438.1| S5 [Oryza rufipogon]
 gi|340811025|gb|AEK75439.1| S5 [Oryza nivara]
 gi|340811031|gb|AEK75442.1| S5 [Oryza rufipogon]
 gi|340811033|gb|AEK75443.1| S5 [Oryza rufipogon]
 gi|340811035|gb|AEK75444.1| S5 [Oryza nivara]
 gi|340811039|gb|AEK75446.1| S5 [Oryza rufipogon]
 gi|340811049|gb|AEK75451.1| S5 [Oryza nivara]
 gi|340811053|gb|AEK75453.1| S5 [Oryza rufipogon]
 gi|340811055|gb|AEK75454.1| S5 [Oryza nivara]
 gi|340811057|gb|AEK75455.1| S5 [Oryza rufipogon]
 gi|340811059|gb|AEK75456.1| S5 [Oryza rufipogon]
 gi|340811061|gb|AEK75457.1| S5 [Oryza rufipogon]
 gi|340811065|gb|AEK75459.1| S5 [Oryza nivara]
 gi|340811067|gb|AEK75460.1| S5 [Oryza nivara]
 gi|340811069|gb|AEK75461.1| S5 [Oryza nivara]
 gi|340811071|gb|AEK75462.1| S5 [Oryza rufipogon]
 gi|340811081|gb|AEK75467.1| S5 [Oryza nivara]
 gi|340811083|gb|AEK75468.1| S5 [Oryza nivara]
 gi|340811087|gb|AEK75470.1| S5 [Oryza nivara]
 gi|340811092|gb|AEK75472.1| S5 [Oryza nivara]
 gi|340811102|gb|AEK75477.1| S5 [Oryza rufipogon]
 gi|340811106|gb|AEK75479.1| S5 [Oryza rufipogon]
 gi|340811108|gb|AEK75480.1| S5 [Oryza rufipogon]
 gi|340811110|gb|AEK75481.1| S5 [Oryza rufipogon]
 gi|340811112|gb|AEK75482.1| S5 [Oryza rufipogon]
 gi|340811118|gb|AEK75485.1| S5 [Oryza nivara]
 gi|340811120|gb|AEK75486.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|340810931|gb|AEK75392.1| S5 [Oryza sativa]
 gi|340810983|gb|AEK75418.1| S5 [Oryza nivara]
 gi|340810985|gb|AEK75419.1| S5 [Oryza nivara]
 gi|340810997|gb|AEK75425.1| S5 [Oryza nivara]
 gi|340811011|gb|AEK75432.1| S5 [Oryza nivara]
 gi|340811013|gb|AEK75433.1| S5 [Oryza nivara]
 gi|340811041|gb|AEK75447.1| S5 [Oryza nivara]
 gi|340811043|gb|AEK75448.1| S5 [Oryza nivara]
          Length = 474

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 91  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 149

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 150 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 209

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 210 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 239


>gi|340810987|gb|AEK75420.1| S5 [Oryza rufipogon]
 gi|340810989|gb|AEK75421.1| S5 [Oryza rufipogon]
 gi|340810991|gb|AEK75422.1| S5 [Oryza rufipogon]
 gi|340811001|gb|AEK75427.1| S5 [Oryza rufipogon]
 gi|340811019|gb|AEK75436.1| S5 [Oryza rufipogon]
 gi|340811104|gb|AEK75478.1| S5 [Oryza rufipogon]
 gi|340811124|gb|AEK75488.1| S5 [Oryza rufipogon]
          Length = 472

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKENSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|224082314|ref|XP_002306645.1| predicted protein [Populus trichocarpa]
 gi|222856094|gb|EEE93641.1| predicted protein [Populus trichocarpa]
          Length = 410

 Score = 85.1 bits (209), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 88/362 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P   C      L+ PK +     +PCS+
Sbjct: 54  YSVILNIGNPPKAFDFDIDTGSDLTWVQCD-APCKGCTKPRDKLYKPKNN----LVPCSN 108

Query: 151 SQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C ++       C   +  C Y + Y D   S G L +++  L  + G  +  P + FG
Sbjct: 109 SLCQAVSTGENYHCDAPDDQCDYEIEYADLGSSIGVLLSDSFPLRLSNGTLLQ-PKMAFG 167

Query: 206 CGTNNGGLFNS---KTTGIVGLGGGDISLISQMRT------------------------- 237
           CG +   L       T GI+GLG G +S++SQ+RT                         
Sbjct: 168 CGYDQKHLGPHPPPDTAGILGLGRGKVSILSQLRTLGITQNVVGHCFSRARGGFLFFGDH 227

Query: 238 ---------------------------TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                                       + G +  G+    ++ DSG++ T+       +
Sbjct: 228 LFPSSRITWTPMLRSSSDTLYSSGPAELLFGGKPTGIKGLQLIFDSGSSYTYFNAQVYQS 287

Query: 271 LLSVMSSMIEAQPVAD-PTGSLELCYS--------FNSLSQVPEVTIHFRGAD---VKLS 318
           +L+++   +  +P+ D P   L +C+          +  S    +TI F  A    ++L+
Sbjct: 288 ILNLVRKDLAGKPLKDAPEKELAVCWKTAKPIKSILDIKSYFKPLTISFMNAKNVQLQLA 347

Query: 319 RSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             ++ +   +  VC    GI N       +  + G+I   + +V YD E+Q + + P +C
Sbjct: 348 PEDYLIITKDGNVC---LGILNGSEQQLGNFNVIGDIFMQDRVVIYDNEKQQIGWFPANC 404

Query: 372 TK 373
            +
Sbjct: 405 DR 406


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 60/190 (31%), Positives = 93/190 (48%), Gaps = 21/190 (11%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLNRLNHFNQNSSISSSK--- 78
           E    G +++++H  SP SPF       ++  +     +   RL      SS+ + K   
Sbjct: 31  ETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFL---SSLVARKSVV 87

Query: 79  --ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             AS   I+  N  Y++R  IGTP    L   DT SD+ W  C       C    S LF+
Sbjct: 88  PIASGRQIV-QNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFN 141

Query: 137 PKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
              S+TYKSL C ++QC  + + +C G  C ++++YG  S +  NL+ +T+TL +     
Sbjct: 142 SPASTTYKSLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLA-ANLSQDTITLATD---- 196

Query: 197 VALPGITFGC 206
            A+PG +FGC
Sbjct: 197 -AVPGYSFGC 205


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 95/385 (24%), Positives = 151/385 (39%), Gaps = 110/385 (28%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCE------PCPPSQCYMQDSPLFDPKMS 140
           +N +  + +++GTPP     V DTGS+L W  C           +   M +S  F P+ S
Sbjct: 59  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGES--FRPRAS 116

Query: 141 STYKSLPCSSSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
           +T+ ++PC S+QC+S +     SC G +  C  S+SY DGS S+G LAT+   +G     
Sbjct: 117 ATFAAVPCGSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAVGEAPPL 176

Query: 196 AVALPGITFGC------------------GTNNGGLF---------------NSKTTGIV 222
             A     FGC                  G N G L                +    G++
Sbjct: 177 RSA-----FGCMSTAYDSSPDGVATAGLLGMNRGTLSFVTQASTRRFSYCISDRDDAGVL 231

Query: 223 GLGGGDISLIS-----------------------QMRTTIAGNQRL----GVSTPD---- 251
            LG  D+  +                        Q+     G + L     V  PD    
Sbjct: 232 LLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHTGA 291

Query: 252 --IVIDSGTTLTFLP----QGYNSNLLSVMSSMIEAQPVADPT----GSLELCYSFNS-- 299
              ++DSGT  TFL         +  L     ++ A  + DP+     +L+ C+   +  
Sbjct: 292 GQTMVDSGTQFTFLLGDAYSALKAEFLKQTKPLLRA--LDDPSFAFQEALDTCFRVPAGR 349

Query: 300 ---LSQVPEVTIHFRGADVKLSRSNFFVKV------SEDIVCSVFKGITNSVP----IYG 346
               +++P VT+ F GA++ ++      KV      ++ + C  F G  + VP    + G
Sbjct: 350 PPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTF-GNADMVPLTAYVIG 408

Query: 347 NIMQTNFLVGYDIEQQTVSFKPTDC 371
           +  Q N  V YD+E+  V   P  C
Sbjct: 409 HHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|222616728|gb|EEE52860.1| hypothetical protein OsJ_35411 [Oryza sativa Japonica Group]
          Length = 395

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 83/304 (27%), Positives = 117/304 (38%), Gaps = 73/304 (24%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCSKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CA+L+     +  C      C Y + Y D   S G L T++  L    
Sbjct: 107 ---KLVPCVDQMCAALHGGLTGRHKCDSPKQQCDYEIKYADQGSSLGVLVTDSFAL-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQM--------------- 235
             ++  PG+ FGCG +         S T G++GLG G +SL+SQ+               
Sbjct: 163 NSSIVRPGLAFGCGYDQQVGSSTEVSATDGVLGLGSGSVSLLSQLKQHGITKNVVGHCLS 222

Query: 236 -----------------RTTIA--------------------GNQRLGVSTPDIVIDSGT 258
                            R T A                    G + LGV   ++V DSG+
Sbjct: 223 TRGGGFLFFGDDIVPYSRATWAPMARSTSRNYYSPGSANLYFGGRPLGVRPMEVVFDSGS 282

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS-LSQVPEVTIHFRGADVKL 317
           + T+        L+  +   +       P  SL LC+        V +V   FR   V L
Sbjct: 283 SFTYFSAQPYQALVDAIKGDLSKNLKEVPDHSLPLCWKGKKPFKSVLDVKKEFR--TVVL 340

Query: 318 SRSN 321
           S SN
Sbjct: 341 SFSN 344


>gi|168022164|ref|XP_001763610.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685103|gb|EDQ71500.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 308

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 75/252 (29%), Positives = 116/252 (46%), Gaps = 39/252 (15%)

Query: 51  YQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADT 110
           Y  LR    R L R+        +S   +   DI      Y  RIS+GTPP +     DT
Sbjct: 6   YHTLRKHDQRRLRRM----LPEVVSFPISGDNDIFAMGL-YYTRISLGTPPQQFYVDVDT 60

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSLPCSSSQCASLNQK-SCS--G 163
           GS++ W +C PC   + +  D P+    FDP+ S+T  S+ C+ ++C  LN+K  CS   
Sbjct: 61  GSNVAWVKCAPCTGCE-HSGDVPVPMSTFDPRKSTTKISISCTDAECGVLNKKLQCSPER 119

Query: 164 VNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVALPG---ITFGCGTNNGGLFNSKTT 219
           ++C YS+ YGDGS + G    +  T     +  + A  G   + FGCG    G ++    
Sbjct: 120 LSCPYSLLYGDGSSTAGYYLNDVFTFNQVPSDNSTAKSGTARLVFGCGGTQTGSWS--VD 177

Query: 220 GIVGLGGGDISLISQ--------------MRTTIAGNQRLGVST---PDIVIDSGTTLTF 262
           G++G G   +SL +Q              ++  ++G   L + T   PD+V    T + F
Sbjct: 178 GLLGFGPTTVSLPNQLAQQNISVNIFAHCLQGDVSGRGSLVIGTIREPDLVY---TPMVF 234

Query: 263 LPQGYNSNLLSV 274
               YN  LL++
Sbjct: 235 GEDHYNVQLLNI 246


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 53/180 (29%), Positives = 85/180 (47%), Gaps = 18/180 (10%)

Query: 65  LNHFNQNSSISSSKASQA--------DIIPNNANYLIRISIGTPPTERLAVADTGSDLIW 116
           L+HFN    +  S++           D +  N  Y  R+ IGTPP     + DTGS + +
Sbjct: 59  LSHFNPRRHLQGSQSEHHPNARMRLFDDLLRNGYYTTRLWIGTPPQRFALIVDTGSTVTY 118

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGS 176
             C  C    C     P F P+ S TY+ + C + QC   + +      C Y   Y + S
Sbjct: 119 VPCSTC--KHCGSHQDPKFRPEASETYQPVKC-TWQCNCDDDRK----QCTYERRYAEMS 171

Query: 177 FSNGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
            S+G L  + V+ G+ +   ++     FGC  +  G ++N +  GI+GLG GD+S++ Q+
Sbjct: 172 TSSGVLGEDVVSFGNQS--ELSPQRAIFGCENDETGDIYNQRADGIMGLGRGDLSIMDQL 229



 Score = 38.9 bits (89), Expect = 4.2,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 12/133 (9%)

Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQ-VP 304
           V+DSGTT  +LP+  + +   ++M      + ++ P     ++C+S      + LS+  P
Sbjct: 305 VLDSGTTYAYLPESAFLAFKHAIMKETHSLKRISGPDPHYNDICFSGAEINVSQLSKSFP 364

Query: 305 EVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
            V + F  G  + LS  N+     KV       VF    +   + G I+  N LV YD E
Sbjct: 365 VVEMVFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDRE 424

Query: 361 QQTVSFKPTDCTK 373
              + F  T+C++
Sbjct: 425 HSKIGFWKTNCSE 437


>gi|340811122|gb|AEK75487.1| S5 [Oryza rufipogon]
          Length = 277

 Score = 84.7 bits (208), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 53/156 (33%), Positives = 83/156 (53%), Gaps = 23/156 (14%)

Query: 70  QNSSISSSKASQADIIP----NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPS 125
           Q   I+SS +++ D+I     N+  +L+ +S+G PP   L   DTGS L W QC+PC   
Sbjct: 89  QEEEITSSSSTKIDVIEDSSINDFLFLMAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AV 147

Query: 126 QCYMQDS---PLFDPKMSSTYKSLPCSSSQCASLN------QKSC--SGVNCQYSVSYGD 174
            C+ Q +   P+FDP  S T + + CSS +C  L       Q +C     +C YSV+YG+
Sbjct: 148 HCHTQSAKAGPIFDPGRSYTSRRVRCSSVKCGELRYDLRLQQANCMEKEDSCTYSVTYGN 207

Query: 175 G-SFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
           G ++S G + T+T+ +G +         + FGC  +
Sbjct: 208 GWAYSVGKMVTDTLRIGDS------FMDLMFGCSMD 237


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 64/196 (32%), Positives = 96/196 (48%), Gaps = 29/196 (14%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C P      +   S  F P+ SST+ ++
Sbjct: 81  HNVSLTVSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMS--FRPRASSTFAAV 138

Query: 147 PCSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PC+S+QC S +  S   C G    C  S+SY DGS S+G LAT+   +GS      A   
Sbjct: 139 PCASAQCRSRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAVGSGPPLRAA--- 195

Query: 202 ITFGCGTNNGGLFNS-----KTTGIVGLGGGDISLISQMRTT-----IAGNQRLGVSTPD 251
             FGC ++    F+S      + G++G+  G +S +SQ  T      I+     GV    
Sbjct: 196 --FGCMSSA---FDSSPDGVASAGLLGMNRGALSFVSQASTRRFSYCISDRDDAGV---- 246

Query: 252 IVIDSGTTLTFLPQGY 267
           +++      TFLP  Y
Sbjct: 247 LLLGHSDLPTFLPLNY 262


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 74/249 (29%), Positives = 108/249 (43%), Gaps = 25/249 (10%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRS 61
           ATF   +  L F     V P   Q+    + +I   S  SPF    +  +  +   +T +
Sbjct: 8   ATFF--LVALLFSTTKAVDPCATQSDTSDLSVIPIYSKCSPFVPPKQESW--VNTVITMA 63

Query: 62  LNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSDLIW 116
                     S+++  K +   I P       ANY++R+ +GTP  +   V DT +D  W
Sbjct: 64  SKDPERLKYLSTLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSNDAAW 123

Query: 117 TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYG 173
             C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ SYG
Sbjct: 124 VPC-----SGCTGFSSTTFLPNASTTLGSLDCSGAQCSQVRGFSCPATGSSACLFNQSYG 178

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLI 232
             S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G ISLI
Sbjct: 179 GDSSLTATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPISLI 231

Query: 233 SQMRTTIAG 241
           SQ     +G
Sbjct: 232 SQAGAMYSG 240


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   S  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
              I FGC  +  G     +    GI G G   +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 68/219 (31%), Positives = 98/219 (44%), Gaps = 31/219 (14%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA    L  L+     + +SS+  +     P+   
Sbjct: 27  LSVYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS--- 80

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +G+P  + L   DT +D  W  C PC    C    S LF P  SS+Y SLPCSS
Sbjct: 81  YVVRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSS 136

Query: 151 SQCASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           S C     ++C                  C +S  + D SF    LA++T+ LG      
Sbjct: 137 SWCPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD---- 191

Query: 197 VALPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQ 234
            A+P  TFGC ++  G   N    G++GLG G ++L+SQ
Sbjct: 192 -AIPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQ 229


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 54/168 (32%), Positives = 86/168 (51%), Gaps = 12/168 (7%)

Query: 70  QNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM 129
           Q S+  +++    D +  N  Y  RI IGTPP     + DTGS + +  C  C   QC  
Sbjct: 69  QGSARPNARMRLYDDLLLNGYYTTRIWIGTPPQTFALIVDTGSTVTYVPCSTC--EQCGR 126

Query: 130 QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTL 189
              P F+P++SSTY+ + C+   C   N++      C Y   Y + S S+G L  + ++ 
Sbjct: 127 HQDPKFEPELSSTYQPVSCNID-CTCDNERK----QCVYERQYAEMSSSSGVLGEDIISF 181

Query: 190 GSTTGQAVALP-GITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           G+   Q+  +P    FGC     G L++ +  GI+GLG GD+S++ Q+
Sbjct: 182 GN---QSELVPQRAIFGCENQETGDLYSQRADGIMGLGRGDLSIVDQL 226


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 84.3 bits (207), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 45/121 (37%), Positives = 68/121 (56%), Gaps = 10/121 (8%)

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSVSYGD 174
           QC+PC    CY Q  P+F+PK+SS+Y  +PC+S  CA L+   C   +   CQY+  Y  
Sbjct: 2   QCQPC--VSCYRQLDPVFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSG 59

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQ 234
              + G LA + + +G     AV      FGC  ++ G   ++ +G+VGLG G +SL+SQ
Sbjct: 60  HGVTKGTLAIDKLAIGGDVFHAV-----VFGCSDSSVGGPAAQASGLVGLGRGPLSLVSQ 114

Query: 235 M 235
           +
Sbjct: 115 L 115


>gi|21618176|gb|AAM67226.1| unknown [Arabidopsis thaliana]
          Length = 430

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 75/153 (49%), Gaps = 22/153 (14%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +I + IGTPP  +  V DTGS L W QC  +  PP     +    FDP +SS++ +LPCS
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127

Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
              C       +L     S   C YS  Y DG+F+ GNL  E +T  +T       P + 
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            GC T      +S   GI+G+  G +S +SQ +
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAK 211


>gi|18408451|ref|NP_564867.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322615|gb|AAG51309.1|AC026480_16 unknown protein [Arabidopsis thaliana]
 gi|14334808|gb|AAK59582.1| unknown protein [Arabidopsis thaliana]
 gi|15293195|gb|AAK93708.1| unknown protein [Arabidopsis thaliana]
 gi|332196351|gb|AEE34472.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 430

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 54/153 (35%), Positives = 75/153 (49%), Gaps = 22/153 (14%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC--EPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +I + IGTPP  +  V DTGS L W QC  +  PP     +    FDP +SS++ +LPCS
Sbjct: 73  IISLPIGTPPQAQQMVLDTGSQLSWIQCHRKKLPP-----KPKTSFDPSLSSSFSTLPCS 127

Query: 150 SSQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
              C       +L     S   C YS  Y DG+F+ GNL  E +T  +T       P + 
Sbjct: 128 HPLCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKITFSNTE----ITPPLI 183

Query: 204 FGCGTNNGGLFNSKTTGIVGLGGGDISLISQMR 236
            GC T      +S   GI+G+  G +S +SQ +
Sbjct: 184 LGCATE-----SSDDRGILGMNRGRLSFVSQAK 211


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 20/158 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP +   V DTGS+L W  C+  P        + +F+P  SS+Y  +
Sbjct: 36  HNVTLTVSLTVGSPPQQVTMVLDTGSELSWLHCKKSP------NLTSVFNPLSSSSYSPI 89

Query: 147 PCSSSQCASLNQKSCSGVN------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSS  C +  +   + V       C   VSY D S   GNLA++   +GS+     ALP
Sbjct: 90  PCSSPVCRTRTRDLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRIGSS-----ALP 144

Query: 201 GITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           G  FGC   G ++    ++KTTG++G+  G +S ++Q+
Sbjct: 145 GTLFGCMDSGFSSNSEEDAKTTGLMGMNRGSLSFVTQL 182


>gi|413923781|gb|AFW63713.1| hypothetical protein ZEAMMB73_300584, partial [Zea mays]
          Length = 190

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 67/186 (36%), Positives = 95/186 (51%), Gaps = 11/186 (5%)

Query: 16  FYVVSPIEAQTGGFSVELIHRDSPKSPFYNSS-ETPYQRLRDALTRSLNRLNHFNQNSSI 74
           F +V+ +   + G +V L HR  P SP  +    T  +RLR    R+      F+    I
Sbjct: 2   FGLVASVTPPSTGVTVPLHHRYDPCSPVPSKKVPTLEERLRRDQLRAAYIKRKFSGAGDI 61

Query: 75  SSSKASQADIIPNNA----NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
             S A+        +     Y+I + IG+P   +    DTGSD+ W QC+PC  SQC+ +
Sbjct: 62  EQSDAATVPTTLGTSLSTLEYVITVGIGSPAVTQTMSMDTGSDVSWVQCKPC--SQCHSE 119

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASLNQKS----CSGVNCQYSVSYGDGSFSNGNLATET 186
              LFDP  SSTY    CSS+ CA L+Q      C    CQY V+YGD S + G  +++T
Sbjct: 120 VDSLFDPSSSSTYSPFSCSSAPCAQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDT 179

Query: 187 VTLGST 192
           +TLGS+
Sbjct: 180 LTLGSS 185


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 64/187 (34%), Positives = 90/187 (48%), Gaps = 42/187 (22%)

Query: 60  RSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           RS N+L HF+ N S++                 + +++GTPP     V DTGS+L W +C
Sbjct: 72  RSPNKL-HFHHNVSLT-----------------VSLTVGTPPQNVSMVLDTGSELSWLRC 113

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK-----SC-SGVNCQYSVSYG 173
                 Q        FDP  SS+Y  +PCSS  C    +      SC S   C   +SY 
Sbjct: 114 NKTQTFQT------TFDPNRSSSYSPVPCSSLTCTDRTRDFPIPASCDSNQLCHAILSYA 167

Query: 174 DGSFSNGNLATETVTLGSTTGQAVALPGITFGC-----GTNNGGLFNSKTTGIVGLGGGD 228
           D S S GNLA++T  +G++      +PG  FGC      TN     +SK TG++G+  G 
Sbjct: 168 DASSSEGNLASDTFYIGNSD-----MPGTIFGCMDSSFSTNTEE--DSKNTGLMGMNRGS 220

Query: 229 ISLISQM 235
           +S +SQM
Sbjct: 221 LSFVSQM 227


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 93/362 (25%), Positives = 148/362 (40%), Gaps = 80/362 (22%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC     P F P +SST
Sbjct: 83  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQDPRFQPDLSST 140

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           Y  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  +   +     
Sbjct: 141 YSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKES--ELKPQRA 193

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GNQRLGVST--- 249
            FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+       G   +G  T   
Sbjct: 194 VFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDVGGGTMVL 253

Query: 250 ------PDIVID-------------------SGTTLTFLPQGYNSNLLSVMSS------- 277
                 PD+V                     +G  L   P+ +NS   +V+ S       
Sbjct: 254 GGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTVLDSGTTYAYL 313

Query: 278 -----MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PEVTIHF-RGAD 314
                +     V +   SL           ++C++      + LS+V P+V + F  G  
Sbjct: 314 PEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPDVDMVFGNGQK 373

Query: 315 VKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   + + F  T+C
Sbjct: 374 LSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKTNC 433

Query: 372 TK 373
           ++
Sbjct: 434 SE 435


>gi|61213864|sp|P69477.1|NEP2_NEPDI RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II
          Length = 178

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 66/216 (30%), Positives = 93/216 (43%), Gaps = 69/216 (31%)

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSY 172
           DLIWTQCEPC  +QC+ QD        SS++ +LPC S  C  L  ++C   +CQY+  Y
Sbjct: 20  DLIWTQCEPC--TQCFSQD--------SSSFSTLPCESQYCQDLPSETC---DCQYTYGY 66

Query: 173 GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLI 232
           GDGS + G +A E            ++P I FGCG N                       
Sbjct: 67  GDGSSTQGYMAXE---------DGSSVPNIAFGCGDN----------------------- 94

Query: 233 SQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE 292
                              + IDSGTTLT+LPQ   + +    +  I    V + +  L 
Sbjct: 95  -------------------LQIDSGTTLTYLPQDAYNAVAQAFTDQINLPTVDESSSGLS 135

Query: 293 LCY---SFNSLSQVPEVTIHFRGA--DVKLSRSNFF 323
            C+   S  S  QVPE+++   G   D++    +FF
Sbjct: 136 TCFQEPSDGSTVQVPEISMQDGGVLNDLQNLAVSFF 171


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 68/217 (31%), Positives = 97/217 (44%), Gaps = 31/217 (14%)

Query: 33  LIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
           + H   P SP    S     R  DA    L  L+     + +SS+  +     P+   Y+
Sbjct: 27  VYHNVHPSSPSPLESIIALARDDDA---RLLFLSSKAATAGVSSAPVASGQAPPS---YV 80

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +R  +G+P  + L   DT +D  W  C PC    C    S LF P  SS+Y SLPCSSS 
Sbjct: 81  VRAGLGSPSQQLLLALDTSADATWAHCSPC--GTC--PSSSLFAPANSSSYASLPCSSSW 136

Query: 153 CASLNQKSCSGVN--------------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           C     ++C                  C +S  + D SF    LA++T+ LG       A
Sbjct: 137 CPLFQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASF-QAALASDTLRLGKD-----A 190

Query: 199 LPGITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQ 234
           +P  TFGC ++  G   N    G++GLG G ++L+SQ
Sbjct: 191 IPNYTFGCVSSVTGPTTNMPRQGLLGLGRGPMALLSQ 227


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 149/369 (40%), Gaps = 80/369 (21%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + C+   C   + K+     C Y   Y + S S+G L  + V+ G+ +  
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGL------------------------------ 224
            +      FGC  +  G LF+    GI+GL                              
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243

Query: 225 GGGDISL-----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
           GGG + L                       I      +AG     + R+       V+DS
Sbjct: 244 GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDS 303

Query: 257 GTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTI 308
           GTT  +LP Q + +   +V S +   + +  P  +  ++C++      + LS+V P+V +
Sbjct: 304 GTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDSNYKDICFAGAGRNVSQLSEVFPKVDM 363

Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   + +
Sbjct: 364 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 423

Query: 365 SFKPTDCTK 373
            F  T+C++
Sbjct: 424 GFWKTNCSE 432


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 146/363 (40%), Gaps = 85/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPL----FDPKMSSTYKSL 146
           Y  +I +GTPP       DTGSD+ W  C PC       Q   +    +DP  SST  +L
Sbjct: 37  YYTKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGAL 96

Query: 147 PCSSSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALP 200
            C  S C +    N+ SC+    C YS +YGDGS + G    + +T        Q     
Sbjct: 97  SCRDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTA 156

Query: 201 GITFGCGTNNGG--LFNSKT-TGIVGLGGGDISLISQMRT-------------------- 237
            + FGCGT   G  L +S+   G++G G   +S+ SQ+ +                    
Sbjct: 157 SVYFGCGTTQSGNLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGGG 216

Query: 238 ----------------------------TIAGNQRLGVSTP-----------DIVIDSGT 258
                                        IA N R  V+TP            +++DSGT
Sbjct: 217 TIVIGSVSEPNISYTPIVSRNHYAVGMQNIAVNGR-NVTTPASFDTTSTSAGGVIMDSGT 275

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHF-RGADVKL 317
           TL +L     +  ++ +S+  E+   +  +  L+L +  +  +  P V + F  GA + L
Sbjct: 276 TLAYLVDPAYTQFVNAVST-FESSMFSSHSQCLQLAWC-SLQADFPTVKLFFDAGAVMNL 333

Query: 318 SRSNFF----VKVSEDIVCSVFKGITN-----SVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           +  N+     ++  +   C  ++  T      S  I G+I+  + LV YD + + V +K 
Sbjct: 334 TPRNYLYSQPLQNGQAAYCMGWQKSTTKAGYLSYSILGDIVLKDHLVVYDNDNRVVGWKS 393

Query: 369 TDC 371
            DC
Sbjct: 394 FDC 396


>gi|297798582|ref|XP_002867175.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297313011|gb|EFH43434.1| hypothetical protein ARALYDRAFT_328390 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 40  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 99  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +       
Sbjct: 155 FSMNYTKGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213

Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
                                                         + G +  G+     
Sbjct: 214 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 273

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
           V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  +
Sbjct: 274 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 333

Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
           F+   +       S++ F +     ++ S    V  GI N   I        G+I   + 
Sbjct: 334 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 393

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           ++ YD E+Q++ + P DC +
Sbjct: 394 MIIYDNEKQSIGWMPADCDE 413


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
              I FGC  +  G     +    GI G G   +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 87/355 (24%), Positives = 145/355 (40%), Gaps = 86/355 (24%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           IG P        DTGSDL W QC+  P   C     PL+ P   +  + +PC+++ C +L
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCD-APCRSCNKVPHPLYRP---TANRLVPCANALCTAL 56

Query: 157 ------NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN- 209
                 N K  S   C Y + Y D + S G L  ++ +L   +      PG+TFGCG + 
Sbjct: 57  HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 114

Query: 210 ---NGGLFNSKTTGIVGLGGGDISLISQMR------------------------------ 236
                G   +   G++GLG G +SL+SQ++                              
Sbjct: 115 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLSTNGGGFLFFGDDVVPS 174

Query: 237 ---TTIAGNQR-------------------LGVSTPDIVIDSGTTLT-FLPQGYNSNLLS 273
              T +   QR                   LGV   ++V DSG+T T F  Q Y + + +
Sbjct: 175 SRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEVVFDSGSTYTYFTAQPYQAVVSA 234

Query: 274 VMSSMIEA-QPVADPTGSLELCYS--------FNSLSQVPEVTIHF---RGADVKLSRSN 321
           +   + ++ + V+DPT  L LC+         F+  ++   + + F   + A +++   N
Sbjct: 235 LKGGLSKSLKQVSDPT--LPLCWKGQKAFKSVFDVKNEFKSMFLSFASAKNAAMEIPPEN 292

Query: 322 FFVKVSEDIVC-SVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           + +      VC  +  G     S  + G+I   + +V YD E+  + +    CT+
Sbjct: 293 YLIVTKNGNVCLGILDGTAAKLSFNVIGDITMQDQMVIYDNEKSQLGWARGACTR 347


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 105/440 (23%), Positives = 163/440 (37%), Gaps = 119/440 (27%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--------- 81
           +EL+HR   +           + ++  + R   R    NQ   + S+  S+         
Sbjct: 35  LELVHRHHERFAGGGGDVDRVEAVKGFVKRDKLRRQRMNQRWGVVSNYDSRRKGFEMTTT 94

Query: 82  -ADI-IPNNA-------NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS 132
            A++ +P ++        Y   + +G+P      V DTGS+  W  C             
Sbjct: 95  PAEVEMPMHSGRDDALGEYFAEVKVGSPGQRFWLVVDTGSEFTWLNC------------- 141

Query: 133 PLFDPKMSSTYKSLPCSSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE 185
                  S +++++ C+S +C        SL+        C Y +SY DGS + G   T+
Sbjct: 142 -------SKSFEAVTCASRKCKVDLSELFSLSVCPKPSDPCLYDISYADGSSAKGFFGTD 194

Query: 186 TVTLGSTTGQAVALPGITFGCGTN--NGGLFNSKTTGIVGLGGGDISLISQMRT------ 237
           ++T+G T G+   L  +T GC  +  NG  FN +T GI+GLG    S I +         
Sbjct: 195 SITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEETGGILGLGFAKDSFIDKAANKYGAKF 254

Query: 238 -----------------TIAG--NQRL----------------GVSTPDIVIDSGTTLTF 262
                            TI G  N +L                GV+   I I  G  L  
Sbjct: 255 SYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTELILFPPFYGVNVVGISI-GGQMLKI 313

Query: 263 LPQGYNSN----------------LLSVMSSMIEA-----QPVADPTG----SLELCYSF 297
            PQ ++ N                LL    ++ EA       V   TG    +LE C+  
Sbjct: 314 PPQVWDFNAEGGTLIDSGTTLTSLLLPAYEAVFEALTKSLTKVKRVTGEDFDALEFCFDA 373

Query: 298 NSL--SQVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTN 352
                S VP +  HF  GA  +    ++ + V+  + C     I       + GNIMQ N
Sbjct: 374 EGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQN 433

Query: 353 FLVGYDIEQQTVSFKPTDCT 372
            L  +D+   TV F P+ CT
Sbjct: 434 HLWEFDLSTNTVGFAPSTCT 453


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score = 83.6 bits (205), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 150/365 (41%), Gaps = 89/365 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGNQR 244
            ++FGCG   GG   S      GI+G G  + S++SQ+               T+ G   
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNGGGI 269

Query: 245 LGVS------------TPDI----------------------VIDSG----------TTL 260
             +              PD+                      + DSG          TTL
Sbjct: 270 FAIGNVVQPKVKTTPLVPDMPHYNVILKGIDVGGTALGLPTNIFDSGNSKGTIIDSGTTL 329

Query: 261 TFLPQGYNSNLLSVM---SSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGADV 315
            ++P+G    L +++      I  Q + D +     C+ ++       PEVT HF G DV
Sbjct: 330 AYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDDGFPEVTFHFEG-DV 383

Query: 316 KL--SRSNFFVKVSEDIVCSVFK---GITNS---VPIYGNIMQTNFLVGYDIEQQTVSFK 367
            L  S  ++  +  +++ C  F+   G T     + + G+++ +N LV YD+E Q + + 
Sbjct: 384 SLIVSPHDYLFQNGKNLYCMGFQNGGGKTKDGKDLGLLGDLVLSNKLVLYDLENQAIGWA 443

Query: 368 PTDCT 372
             +C+
Sbjct: 444 DYNCS 448


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 88/359 (24%), Positives = 144/359 (40%), Gaps = 84/359 (23%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C    C     P F P +S TY+ + 
Sbjct: 86  NGYYTTRLWIGTPPQRFALIVDTGSTVTYVPCSTC--EHCGRHQDPKFQPDLSETYQPVK 143

Query: 148 CSSSQCASLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C +  C      +C G    C Y   Y + S S+G L  + V+ G+ +   +A     FG
Sbjct: 144 C-TPDC------NCDGDTNQCMYDRQYAEMSSSSGVLGEDVVSFGNLS--ELAPQRAVFG 194

Query: 206 CGTNN-GGLFNSKTTGIVGLGGGDISLISQM--RTTIAGNQRL---------------GV 247
           C  +  G L++ +  GI+GLG GD+S++ Q+  +  I+ +  L               G+
Sbjct: 195 CENDETGDLYSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGI 254

Query: 248 STPDIVI-----------------------------------------DSGTTLTFLPQ- 265
           S P+ ++                                         DSGTT  +LP+ 
Sbjct: 255 SPPEDMVFTHSDPDRSPYYNINLKEMHVAGKKLQLNPKVFDGKHGTVLDSGTTYAYLPET 314

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYSFNSL--SQV----PEVTIHFR-GADVKL 317
            + +   ++M      + +  P  +  ++C++   +  SQ+    P V + F  G  + L
Sbjct: 315 AFLAFKRAIMKERNSLKQINGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENGHKLSL 374

Query: 318 SRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           S  N+     KV       VF    +   + G I   N LV YD E   + F  T+C++
Sbjct: 375 SPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKTNCSE 433


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 31/178 (17%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
           +  Y + + +GTP  +   + DTGSDL W QC P         PP       +P +D   
Sbjct: 56  SGQYFVELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 108

Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
           SS+Y+ +PC+  +C  L      SCS  +   C Y+  Y D S + G LA ET+++    
Sbjct: 109 SSSYREIPCTDDECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 168

Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                 G+   + + +  +  GC   + G      +G++GLG G ISL +Q R T  G
Sbjct: 169 RSGKRAGNHKTRRIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALG 226



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 4/123 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIHFR 311
           + DSGTTL++L +   S +L  +++ I      +     ELCY+   + + +P++ + F+
Sbjct: 304 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEKGMPKLGVEFQ 363

Query: 312 GADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           G  V +L  +N+ V V+E++ C   + +  TN   I GN++Q +  + YD+ +  + FK 
Sbjct: 364 GGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKW 423

Query: 369 TDC 371
           + C
Sbjct: 424 SPC 426


>gi|334187133|ref|NP_001190905.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|21592493|gb|AAM64443.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332660834|gb|AEE86234.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 40  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 98

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 99  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 154

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +       
Sbjct: 155 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 213

Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
                                                         + G +  G+     
Sbjct: 214 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 273

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
           V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  +
Sbjct: 274 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 333

Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
           F+   +       S++ F +     ++ S    V  GI N   I        G+I   + 
Sbjct: 334 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 393

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           ++ YD E+Q++ + P DC +
Sbjct: 394 MIIYDNEKQSIGWMPVDCDE 413


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 91  YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 150

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 151 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 210

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
              I FGC  +  G     +    GI G G   +S++SQ+ +
Sbjct: 211 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 252


>gi|26452545|dbj|BAC43357.1| putative nucellin [Arabidopsis thaliana]
          Length = 413

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 85/380 (22%), Positives = 147/380 (38%), Gaps = 89/380 (23%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 28  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 86

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 87  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 142

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +       
Sbjct: 143 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 201

Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
                                                         + G +  G+     
Sbjct: 202 IGHCLSSLGGGILFFGDDLYDSSRVSWTPMSREYSKHYSPAMGGELLFGGRTTGLKNLLT 261

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
           V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  +
Sbjct: 262 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 321

Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
           F+   +       S++ F +     ++ S    V  GI N   I        G+I   + 
Sbjct: 322 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 381

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           ++ YD E+Q++ + P DC +
Sbjct: 382 MIIYDNEKQSIGWMPVDCDE 401


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 54/161 (33%), Positives = 81/161 (50%), Gaps = 20/161 (12%)

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
           +  +N +  + +++G+PP     V DTGS+L W  C+     +    +S +F+P  S TY
Sbjct: 62  LFHHNVSLTVSLTVGSPPQNVTMVLDTGSELSWLHCK-----KTQFLNS-VFNPLSSKTY 115

Query: 144 KSLPCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
             +PC S  C +  +      SC     C   VSY D +   GNLA ET  LGS T    
Sbjct: 116 SKVPCLSPTCKTRTRDLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRLGSLTK--- 172

Query: 198 ALPGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             P   FGC   G ++    +SKTTG++G+  G +S ++QM
Sbjct: 173 --PATIFGCMDSGFSSNSEEDSKTTGLIGMNRGSLSFVNQM 211


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 97/369 (26%), Positives = 150/369 (40%), Gaps = 82/369 (22%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS--PLFDPKMS 140
           +I+ ++  + + + I  P   R  + DTGSDLIWTQC+    +    +    P++DP  S
Sbjct: 8   NILLSDQGHSLTVGIVQP---RKLIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGES 64

Query: 141 STYKSLPCSSSQC--ASLNQKSCSGVN-CQYSVSYGDGS---------FSNGNLATETVT 188
           ST+  LPCS   C     + K+C+  N C Y   YG  +         F+ G     ++ 
Sbjct: 65  STFAFLPCSDRLCQEGQFSFKNCTSKNRCVYEDVYGSAAAVGVLASETFTFGARRAVSLR 124

Query: 189 LGSTTGQ--AVALPGITFGCGTNNGGL-----------------FNSKTTGIVGLGG-GD 228
           LG   G   A +L G T   G +   L                 F  K T  +  G   D
Sbjct: 125 LGFGCGALSAGSLIGATGILGLSPESLSLITQLKIQRFSYCLTPFADKKTSPLLFGAMAD 184

Query: 229 ISLISQMR----TTIA------------------GNQRLGVST------PD----IVIDS 256
           +S     R    T I                   G++RL V        PD     ++DS
Sbjct: 185 LSRHKTTRPIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDS 244

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSFNSLS--------QVPEVT 307
           G+T+ +L +     +   +  ++   PVA+ T    ELC+     +        QVP + 
Sbjct: 245 GSTVAYLVEAAFEAVKEAVMDVVRL-PVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLV 303

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNS--VPIYGNIMQTNFLVGYDIEQQTV 364
           +HF G A + L R N+F +    ++C      T+   V I GN+ Q N  V +D++    
Sbjct: 304 LHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKF 363

Query: 365 SFKPTDCTK 373
           SF PT C +
Sbjct: 364 SFAPTQCDQ 372


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 147/369 (39%), Gaps = 80/369 (21%)

Query: 76  SSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           S++    D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F
Sbjct: 73  SARMRLHDDLLTNGYYTTRLHIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRF 130

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P +SSTY  + C+   C   + K+     C Y   Y + S S+G L  + V+ G+ +  
Sbjct: 131 QPDLSSTYSPVKCNVD-CTCDSDKN----QCTYERQYAEMSSSSGVLGEDIVSFGTES-- 183

Query: 196 AVALPGITFGC-GTNNGGLFNSKTTGIVGL------------------------------ 224
            +      FGC  +  G LF+    GI+GL                              
Sbjct: 184 ELKPQRAVFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDI 243

Query: 225 GGGDISL-----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDS 256
           GGG + L                       I      +AG     + R+       V+DS
Sbjct: 244 GGGAMVLGAMPAPPGMIYTHSNAVRSPYYNIELKEMHVAGKALRVDPRIFDGKHGTVLDS 303

Query: 257 GTTLTFLPQGYNSNLLSVMSSMIEA-QPVADPTGSL-ELCYS-----FNSLSQV-PEVTI 308
           GTT  +LP+         +SS +   + +  P  +  ++C++      + LS+V P+V +
Sbjct: 304 GTTYAYLPEQAFVAFKDAVSSQVHPLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPKVDM 363

Query: 309 HF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   + +
Sbjct: 364 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 423

Query: 365 SFKPTDCTK 373
            F  T+C++
Sbjct: 424 GFWKTNCSE 432


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/163 (32%), Positives = 87/163 (53%), Gaps = 20/163 (12%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           +++ IG+      A+ DTGS+ +  QC          +  P+FDP  S +Y+ +PC S  
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLVQCGS--------RSRPVFDPAASQSYRQVPCISQL 52

Query: 153 CASLNQKSCSG-----VN----CQYSVSYGDGSFSNGNLATETVTLGST--TGQAVALPG 201
           C ++ Q++ +G     VN    C YS+SYGD   S G+ + + + L ST  + QAV    
Sbjct: 53  CLAVQQQTSNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRD 112

Query: 202 ITFGCGTN-NGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQ 243
           + FGC  +  G L +  + GIVG   G++SL SQ++  + G++
Sbjct: 113 VAFGCAHSPQGFLVDLGSLGIVGFNRGNLSLPSQLKDRLGGSK 155


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 82/367 (22%), Positives = 138/367 (37%), Gaps = 86/367 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P    +   DTGSD++W  C P   CP          ++DP+ SST   + 
Sbjct: 2   YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 61

Query: 148 CSSSQCA---SLNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
           CS   C       +  CS    NC+Y  SYGDGS S G    + +     S+ G A    
Sbjct: 62  CSDPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 121

Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQ----------------------- 234
            + FGC     G  ++      GI+G G  ++S+ +Q                       
Sbjct: 122 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 181

Query: 235 ---------------------------MRTTIAGNQRLGVSTPD--------IVIDSGTT 259
                                      +R     + RL +   D        +++DSGTT
Sbjct: 182 ILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTT 241

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKLS 318
           L + P G  +  +  +     A PV       +       LS + P VT++F G  ++L 
Sbjct: 242 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMELQ 301

Query: 319 RSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQTV 364
             N+ +        + D+ C  ++  ++S        + I G+I+  + LV YD++   +
Sbjct: 302 PDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRI 361

Query: 365 SFKPTDC 371
            +   +C
Sbjct: 362 GWMSYNC 368


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 78/162 (48%), Gaps = 15/162 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 117 YFTRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIP 176

Query: 148 CSSSQCASLNQKS---CSGVN---CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVA 198
           CS  +C +  Q S   C   +   C Y+ +YGDGS ++G   ++T+   +  G    A +
Sbjct: 177 CSDDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANS 236

Query: 199 LPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
              I FGC  +  G     +    GI G G   +S++SQ+ +
Sbjct: 237 SASIVFGCSNSQSGDLTKTDRAVDGIFGFGQHQLSVVSQLNS 278


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 93/371 (25%), Positives = 144/371 (38%), Gaps = 92/371 (24%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---LFDPKMSSTYKSL 146
            Y +R+ +GTP    + VADTGSDL W +C     S      SP   +F P  S ++  L
Sbjct: 103 QYFVRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPL 162

Query: 147 PCSSSQCAS---LNQKSCSGVN--CQYSVSYGDGSFSNG--NLATETVTLGSTTG-QAVA 198
           PC S  C S    +  +CS     C Y   Y D S + G   L + TV+L    G +   
Sbjct: 163 PCDSDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAK 222

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLG--------------GGDIS-------------- 230
           L  +  GC T+  G     + G++ LG              GG  S              
Sbjct: 223 LQEVVLGCTTSYDGQSFKSSDGVLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNATS 282

Query: 231 -----------------------LISQMRT-----------TIAGNQRLGVSTPDI---- 252
                                  L+   RT           T+AG +RL +  PD+    
Sbjct: 283 FLTFGNGDSSPGDDSSSRRTPLVLLEDARTRPFYFVSVDAVTVAG-ERLEI-LPDVWDFR 340

Query: 253 -----VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVA--DPTGSLELCYSFNSLS-QVP 304
                ++DSGT+LT L       ++  +S      P    DP    E CY++  +S ++P
Sbjct: 341 KNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNMDP---FEYCYNWTGVSAEIP 397

Query: 305 EVTIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
            + + F GA  +     ++ +  +  + C  V +G    V + GNI+Q   L  +D+  +
Sbjct: 398 RMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEFDLANR 457

Query: 363 TVSFKPTDCTK 373
            + FK + C  
Sbjct: 458 WLRFKQSRCAH 468


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score = 82.8 bits (203), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 98/205 (47%), Gaps = 15/205 (7%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H +SP SPF   +   ++     L +   RL + +  +   S   +    I  +  
Sbjct: 34  LRVFHVNSPCSPFKQPNTVSWE---STLLKDKARLQYLSSLAKKPSVPIASGRAIVQSPT 90

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C +
Sbjct: 91  YIVRANIGTPAQPMLVALDTSNDAAWVPCSGC----VGCASSVLFDPSKSSSSRNLQCDA 146

Query: 151 SQCASLNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
            QC      +C +G +C ++++YG GS    +L  +T+TL +       +   TFGC + 
Sbjct: 147 PQCKQAPNPTCTAGKSCGFNMTYG-GSTIEASLTQDTLTLAND-----VIKSYTFGCISK 200

Query: 210 NGGLFNSKTTGIVGLGGGDISLISQ 234
             G  +    G++GLG G +SLISQ
Sbjct: 201 ATGT-SLPAQGLMGLGRGPLSLISQ 224


>gi|312282457|dbj|BAJ34094.1| unnamed protein product [Thellungiella halophila]
          Length = 424

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 85/380 (22%), Positives = 146/380 (38%), Gaps = 89/380 (23%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A+ + + P + N      Y + I+IG PP       DTGSDL W QC+  P   C   
Sbjct: 37  TRAASSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVHCLEA 95

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    N +  +   C Y V Y DG  S G L  + 
Sbjct: 96  PHPLYQP----SNDLIPCNDPLCKALHFNGNHRCETPEQCDYEVEYADGGSSLGVLVRDV 151

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRTT------ 238
            +L  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +       
Sbjct: 152 FSLNYTKGLRLT-PRLALGCGYDQIPGASGHHPLDGVLGLGRGKVSILSQLHSQGYVKNV 210

Query: 239 ----------------------------------------------IAGNQRLGVSTPDI 252
                                                         + G +  G+     
Sbjct: 211 VGHCLSSLGGGILFFGNDLYDSSRVSWTPMARENSKHYSPAMGGELLFGGRTTGLKNLLT 270

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCYSFNS-LSQVPEVTIH 309
           V DSG++ T+        +  ++   +  +P+  A    +L LC+        + EV  +
Sbjct: 271 VFDSGSSYTYFNSKAYQAVTYLLKRELSGKPLKEARDDHTLPLCWQGRRPFMSIEEVKKY 330

Query: 310 FRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITNSVPI-------YGNIMQTNF 353
           F+   +       S++ F +     ++ S    V  GI N   I        G+I   + 
Sbjct: 331 FKPLALSFKTGWRSKTLFEIPPEAYLIISMKGNVCLGILNGTEIGLQNLNLIGDISMQDQ 390

Query: 354 LVGYDIEQQTVSFKPTDCTK 373
           ++ YD E+Q++ + P DC +
Sbjct: 391 MIIYDNEKQSIGWIPADCDE 410


>gi|297827153|ref|XP_002881459.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327298|gb|EFH57718.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 507

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSFTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC T   G     +    GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 10/149 (6%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P +SSTY+S+ 
Sbjct: 10  NGYYTTRLWIGTPPQRFALIVDTGSSVTYVPCSSC--EQCGRHQDPKFQPDLSSTYQSVK 67

Query: 148 CSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC- 206
           C+   C   ++K      C Y   Y + S S+G L  + ++ G+ +  A+A     FGC 
Sbjct: 68  CNID-CNCDDEKQ----QCVYERQYAEMSTSSGVLGEDIISFGNLS--ALAPQRAVFGCE 120

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
               G L++    GI+G+G GD+S++  +
Sbjct: 121 NMETGDLYSQHADGIMGMGRGDLSIVDHL 149



 Score = 43.1 bits (100), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 36/133 (27%), Positives = 65/133 (48%), Gaps = 12/133 (9%)

Query: 253 VIDSGTTLTFLPQ-GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----P 304
           ++DSGTT  +LP+  + S   ++M  +   +P+  P  +  ++C+S   + +SQ+    P
Sbjct: 225 ILDSGTTYAYLPEAAFVSFKDAIMKELHSLKPIRGPDPNYNDICFSGAGSDISQLSSSFP 284

Query: 305 EVTIHF-RGADVKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIE 360
            V + F  G  + LS  N+     KV       +F+   +   + G I+  N LV YD E
Sbjct: 285 AVEMVFGNGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRE 344

Query: 361 QQTVSFKPTDCTK 373
              + F  T+C++
Sbjct: 345 NSKIGFWKTNCSE 357


>gi|42571079|ref|NP_973613.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|110737616|dbj|BAF00749.1| putative protease [Arabidopsis thaliana]
 gi|330254187|gb|AEC09281.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 507

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC T   G     +    GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257


>gi|42569679|ref|NP_181205.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|330254186|gb|AEC09280.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 512

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 105 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 164

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 165 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 224

Query: 202 -ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC T   G     +    GI G G G +S++SQ+
Sbjct: 225 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 262


>gi|30699261|ref|NP_850981.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17065172|gb|AAL32740.1| nucellin-like protein [Arabidopsis thaliana]
 gi|24899795|gb|AAN65112.1| nucellin-like protein [Arabidopsis thaliana]
 gi|332197863|gb|AEE35984.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 466

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 90/364 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P + C    +  + P     + +LPCS 
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 121

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C+ L+    + C+     C Y + Y D + S G L T+ V L    G  + L  +TFG
Sbjct: 122 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 180

Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
           CG    N G      T GI+GLG G + L +Q++                          
Sbjct: 181 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 240

Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                    T++A N                 +  GV   ++V DSG++ T+        
Sbjct: 241 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 300

Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
           +L ++   +  +P+ D     SL +C+       SL +V +    +T+ F     G   +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
           +   ++ +   +  VC    GI N          I G+I     +V YD E+Q + +  +
Sbjct: 361 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 417

Query: 370 DCTK 373
           DC K
Sbjct: 418 DCDK 421


>gi|30699263|ref|NP_177872.3| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332197862|gb|AEE35983.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 432

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 90/364 (24%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P + C    +  + P     + +LPCS 
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 121

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C+ L+    + C+     C Y + Y D + S G L T+ V L    G  + L  +TFG
Sbjct: 122 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 180

Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
           CG    N G      T GI+GLG G + L +Q++                          
Sbjct: 181 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 240

Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                    T++A N                 +  GV   ++V DSG++ T+        
Sbjct: 241 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 300

Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
           +L ++   +  +P+ D     SL +C+       SL +V +    +T+ F     G   +
Sbjct: 301 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 360

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
           +   ++ +   +  VC    GI N          I G+I     +V YD E+Q + +  +
Sbjct: 361 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 417

Query: 370 DCTK 373
           DC K
Sbjct: 418 DCDK 421


>gi|240255485|ref|NP_189841.4| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332644216|gb|AEE77737.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 430

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 82/327 (25%), Positives = 134/327 (40%), Gaps = 53/327 (16%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +A Y   + IGTPP E   V DTGSDL+W  C  C    C + +   FDP  SS+   L 
Sbjct: 75  SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132

Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           CS  +C+S  QK   CS + +C Y V YGDGS ++G   ++ ++  + +         T+
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSDW-------TY 185

Query: 205 GCGTNNGGLFNSKTTG-IVGLGGGDISL---------------ISQMRTTIAGNQRLGVS 248
               +N         G I+G      S                 S M T    + RL + 
Sbjct: 186 IAFRDNSTWHPWVRQGAIIGTFPALCSTPCSTVSSQPLYYNPQFSHMMTVAVNDLRLPID 245

Query: 249 TPDI---------VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNS 299
            P +         +IDSGTTL   P      L+  + +++       P  S + C++  S
Sbjct: 246 -PSVFSVAKGYGTIIDSGTTLVHFPGEAYDPLIQAILNVVSQYGRPIPYESFQ-CFNITS 303

Query: 300 --------LSQVPEVTIHFR-GADVKLSRSNF----FVKVSEDIVC-SVFKGITNSVPIY 345
                       PEV + F  GA + +    +    F+ ++  I C   +   +  + I 
Sbjct: 304 GISSHLVIADMFPEVHLGFAGGASMVIKPEAYLFQKFLDLTNAIWCLGFYSSTSRRITII 363

Query: 346 GNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           G +   + +  YD++ Q + +   +C+
Sbjct: 364 GEVAIRDKMFVYDLDHQRIGWAEYNCS 390


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 54/178 (30%), Positives = 83/178 (46%), Gaps = 31/178 (17%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEP--------CPPSQCYMQDSPLFDPKM 139
           +  Y + + +GTP  +   + DTGSDL W QC P         PP       +P +D   
Sbjct: 24  SGQYFVELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPP-------APWYDKSS 76

Query: 140 SSTYKSLPCSSSQCASLNQ---KSCSGVN---CQYSVSYGDGSFSNGNLATETVTL---- 189
           SS+Y+ +PC+  +C  L      SCS  +   C Y+  Y D S + G LA ET+++    
Sbjct: 77  SSSYREIPCTDDECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRK 136

Query: 190 ------GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
                 G+   + + +  +  GC   + G      +G++GLG G ISL +Q R T  G
Sbjct: 137 RSGKRAGNHKTRTIRIKNVALGCSRESVGASFLGASGVLGLGQGPISLATQTRHTALG 194



 Score = 65.5 bits (158), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 35/123 (28%), Positives = 68/123 (55%), Gaps = 4/123 (3%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ-VPEVTIHFR 311
           + DSGTTL++L +   S +L  +++ I      +     ELCY+   + + +P++ + F+
Sbjct: 272 IFDSGTTLSYLREPAYSKVLGALNASIYLPRAQEIPEGFELCYNVTRMEKGMPKLGVEFQ 331

Query: 312 GADV-KLSRSNFFVKVSEDIVCSVFKGI--TNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           G  V +L  +N+ V V+E++ C   + +  TN   I GN++Q +  + YD+ +  + FK 
Sbjct: 332 GGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNLLQQDHHIEYDLAKARIGFKW 391

Query: 369 TDC 371
           + C
Sbjct: 392 SPC 394


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 51/158 (32%), Positives = 74/158 (46%), Gaps = 14/158 (8%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC---EPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           L    IG  P +     DTGSD +W  C     CP       +  L+DP  S T K +PC
Sbjct: 76  LYYTKIGLGPNDYYVQVDTGSDTLWVNCVGCTTCPKKSGLGMELTLYDPNSSKTSKVVPC 135

Query: 149 SSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
               C S      SG    ++C YS++YGDGS ++G+   + +T     G    +P    
Sbjct: 136 DDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGSYIKDDLTFDRVVGDLRTVPDNTS 195

Query: 202 ITFGCGTNNGGLFNSKT----TGIVGLGGGDISLISQM 235
           + FGCG+   G  +S T     GI+G G  + S++SQ+
Sbjct: 196 VIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQL 233


>gi|255588450|ref|XP_002534607.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223524923|gb|EEF27776.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 260

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 57/178 (32%), Positives = 90/178 (50%), Gaps = 18/178 (10%)

Query: 61  SLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE 120
           SL   NH + N+ +        DI+ +   Y  ++ IGTPP E   V DTGS++ +  C 
Sbjct: 25  SLANYNHLHPNARM----PLYGDIL-SYGYYATKLYIGTPPQEFTLVVDTGSNMTFVPC- 78

Query: 121 PCPPSQ-CYMQDSPLFDPKMSSTYKSLPCSSS-QCASLNQKSCSGVNCQYSVSYGDGSFS 178
            C   + C   + P F  + SSTY+ + C  S  C  L  +      C Y + YGDGS+S
Sbjct: 79  -CGSEEYCGKHEDPAFQTESSSTYQPVNCHPSCDCDYLRSQ------CSYKMHYGDGSYS 131

Query: 179 NGNLATETVTLGSTTGQAVALPGITFGCGTNN-GGLFNSKTTGIVGLGGGDISLISQM 235
            G LA + ++ G+ +    A   + FGC  +  G L++ +  GI+GLG G  +++ Q+
Sbjct: 132 RGVLAEDIISFGNES--EFAPQRLVFGCELDAIGSLYSLRADGIIGLGRGRSTIVDQL 187


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 143/359 (39%), Gaps = 84/359 (23%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P+ SSTY+ + 
Sbjct: 109 NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 166

Query: 148 CSSSQCASLNQKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           C+   C      +C G  + C Y   Y + S S+G L  + ++ G+ +   +A     FG
Sbjct: 167 CTID-C------NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQS--ELAPQRAVFG 217

Query: 206 C-GTNNGGLFNSKTTGIVGLGGG---------DISLISQMRTTIAGNQRLG--------- 246
           C     G L++    GI+GLG G         D  +IS   +   G   +G         
Sbjct: 218 CENVETGDLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMVLGGI 277

Query: 247 ---------VSTPD-------------------------------IVIDSGTTLTFLPQ- 265
                     S PD                                V+DSGTT  +LP+ 
Sbjct: 278 SPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFDGKHGTVLDSGTTYAYLPEA 337

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS--FNSLSQV----PEVTIHF-RGADVKL 317
            + +   +++  +   + ++ P  +  ++C+S   N +SQ+    P V + F  G    L
Sbjct: 338 AFLAFKDAIVKELQSLKQISGPDPNYNDICFSGAGNDVSQLSKSFPVVDMVFGNGHKYSL 397

Query: 318 SRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           S  N+     KV       +F+   +   + G I+  N LV YD EQ  + F  T+C +
Sbjct: 398 SPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIGFWKTNCAE 456


>gi|297842525|ref|XP_002889144.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297334985|gb|EFH65403.1| hypothetical protein ARALYDRAFT_476912 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 467

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 145/361 (40%), Gaps = 84/361 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P + C    +  + P     + +LPCS 
Sbjct: 68  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGCTKPRAKQYKP----NHNTLPCSH 122

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C+ L+    + C      C Y + Y D + S G L T+   L    G ++  P +TFG
Sbjct: 123 LLCSGLDLTQNRPCDDPEDQCDYEIGYSDHASSIGALVTDEFPLKLANG-SIMNPHLTFG 181

Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
           CG    N G      T GI+GLG G + + +Q++                          
Sbjct: 182 CGYDQQNPGPHPPPPTAGILGLGRGKVGISTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 241

Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                    T++A N                 +  GV   ++V DSG++ T+        
Sbjct: 242 LVPSSGVTWTSLATNSASKNYMTGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 301

Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
           +L ++   +  +P+ D     SL +C+       SL +V +    +T+ F     G   +
Sbjct: 302 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGYQKNGQLFQ 361

Query: 317 LSRSNFFVKVSEDIVC-SVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           +   ++ +   +  VC  +  G     +S  I G+I     +V YD E+Q + +  +DC 
Sbjct: 362 VPPESYLIITEKGNVCLGILNGTEVGLDSYNIVGDISFQGIMVIYDNEKQRIGWISSDCD 421

Query: 373 K 373
           K
Sbjct: 422 K 422


>gi|4415912|gb|AAD20143.1| putative protease [Arabidopsis thaliana]
          Length = 469

 Score = 82.0 bits (201), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 56/158 (35%), Positives = 79/158 (50%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PPTE     DTGSD++W   + C  CP S     D   FD   S T  S+ 
Sbjct: 100 YFTKVKLGSPPTEFNVQIDTGSDILWVTCSSCSNCPHSSGLGIDLHFFDAPGSLTAGSVT 159

Query: 148 CSSSQCASLNQKS---CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---P 200
           CS   C+S+ Q +   CS  N C YS  YGDGS ++G   T+T    +  G+++      
Sbjct: 160 CSDPICSSVFQTTAAQCSENNQCGYSFRYGDGSGTSGYYMTDTFYFDAILGESLVANSSA 219

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC T   G     +    GI G G G +S++SQ+
Sbjct: 220 PIVFGCSTYQSGDLTKSDKAVDGIFGFGKGKLSVVSQL 257


>gi|326514838|dbj|BAJ99780.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 430

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 150/355 (42%), Gaps = 81/355 (22%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           +Y + ++IG P        DTGSDL W QC+ PC    C     P + P  +   K +PC
Sbjct: 72  HYYVTMNIGDPAKPYFLDVDTGSDLTWLQCDAPC--QSCNKVPHPWYKPTKN---KIVPC 126

Query: 149 SSSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           ++S C SL  N+K      C Y + Y D + S G L  +  TL S    +     +TFGC
Sbjct: 127 AASLCTSLTPNKKCAVPQQCDYQIKYTDKASSLGVLIADNFTL-SLRNSSTVRANLTFGC 185

Query: 207 GTN----NGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
           G +      G   + T G++GLG G +SL+SQ++                          
Sbjct: 186 GYDQQVGKNGAVQAATDGLLGLGKGAVSLLSQLKQQGVTKNVLGHCFSTNGGGFLFFGDD 245

Query: 237 -------------TTIAGN-------------QRLGVSTPDIVIDSGTTLT-FLPQGYNS 269
                         T +GN             + LG+   ++V DSG+T   F  + Y +
Sbjct: 246 IVPTSRVTWVPMARTTSGNYYSPGSGTLYFDRRSLGMKPMEVVFDSGSTYAYFAAEPYQA 305

Query: 270 NLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTIHF-RGADVKLSR 319
            + ++ + + ++ + V+D   SL LC+     F S+S+V      + + F + + +++  
Sbjct: 306 TVSALKAGLSKSLKEVSDV--SLPLCWKGQKVFKSVSEVKNDFKSLFLSFGKNSVMEIPP 363

Query: 320 SNFFVKVSEDIVC-SVFKGITNSVP--IYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            N+ +      VC  +  G T  +   I G+I   + ++ YD E+  + +    C
Sbjct: 364 ENYLIVTKYGNVCLGILDGTTAKLKFNIIGDITMQDQMIIYDNEKGQLGWIRGSC 418


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 61  HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA ET  +GS T      
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVTR----- 169

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL 208


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 84/173 (48%), Gaps = 10/173 (5%)

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP 123
           R  H +++    +++    D +  N  Y  R+ IGTPP     + DTGS + +  C  C 
Sbjct: 54  RQLHGSESKRHPNARMRLHDDLLLNGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC- 112

Query: 124 PSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLA 183
             QC     P F P +SSTY+ + C+   C   N +    + C Y   Y + S S+G L 
Sbjct: 113 -EQCGRHQDPKFQPDLSSTYQPVKCTLD-CNCDNDR----MQCVYERQYAEMSTSSGVLG 166

Query: 184 TETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            + V+ G+ +   +A     FGC     G L++    GI+GLG GD+S++ Q+
Sbjct: 167 EDVVSFGNQS--ELAPQRAVFGCENVETGDLYSQHADGIMGLGRGDLSIMDQL 217


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 80/159 (50%), Gaps = 21/159 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 61  HNVTLTVTLAVGDPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 114

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA ET  +GS T      
Sbjct: 115 PCSSPICRTRTRDLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVIGSVT-----R 169

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+
Sbjct: 170 PGTLFGCMDSGLSSNSEEDAKSTGLMGMNRGSLSFVNQL 208


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 81.6 bits (200), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 90/192 (46%), Gaps = 24/192 (12%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
           ++   RL +    S+++  K +   I P       ANY++R+ +GTP  +   V DT +D
Sbjct: 11  SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
             W  C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ 
Sbjct: 68  AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
           SYG  S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175

Query: 230 SLISQMRTTIAG 241
           SLISQ     +G
Sbjct: 176 SLISQAGAMYSG 187


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 90/204 (44%), Gaps = 20/204 (9%)

Query: 36  RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
           R  P+ P +      Y    RL  +L R L    H N       ++    D +  N  Y 
Sbjct: 38  RPVPRPPLFLPLTRSYPNASRLAASLRRGLGDGAHPN-------ARMRLHDDLLTNGYYT 90

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
            R+ IGTPP E   + D+GS + +  C  C   QC     P F P +SS+Y  + C+   
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 148

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
               ++K C+     Y   Y + S S+G L  + V+ G  +   +      FGC  +  G
Sbjct: 149 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKAQRAVFGCENSETG 201

Query: 212 GLFNSKTTGIVGLGGGDISLISQM 235
            LF+    GI+GLG G +S++ Q+
Sbjct: 202 DLFSQHADGIMGLGRGQLSIMDQL 225


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 62/192 (32%), Positives = 90/192 (46%), Gaps = 24/192 (12%)

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNN-----ANYLIRISIGTPPTERLAVADTGSD 113
           ++   RL +    S+++  K +   I P       ANY++R+ +GTP  +   V DT +D
Sbjct: 11  SKDPERLKYL---STLADQKTTAVPIAPGQQVLKIANYVVRVKLGTPGQQMFMVLDTSND 67

Query: 114 LIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN---CQYSV 170
             W  C     S C    S  F P  S+T  SL CS +QC+ +   SC       C ++ 
Sbjct: 68  AAWVPC-----SGCTGCSSTTFLPNASTTLGSLDCSEAQCSQVRGFSCPATGSSACLFNQ 122

Query: 171 SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDI 229
           SYG  S     L  + +TL +       +PG TFGC    +GG    +  G++GLG G I
Sbjct: 123 SYGGDSSLAATLVQDAITLANDV-----IPGFTFGCINAVSGGSIPPQ--GLLGLGRGPI 175

Query: 230 SLISQMRTTIAG 241
           SLISQ     +G
Sbjct: 176 SLISQAGAMYSG 187


>gi|56692305|dbj|BAD80835.1| nucellin-like protein [Daucus carota]
          Length = 426

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 86/358 (24%), Positives = 139/358 (38%), Gaps = 82/358 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y ++ +IG PP       DTGSDL W QC+  P  QC     PL+ P    T   + C  
Sbjct: 67  YHVQFNIGQPPKPYFLDPDTGSDLTWLQCD-APCIQCTPAPHPLYQP----TNDLVVCKD 121

Query: 151 SQCASL---NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
             CASL   N +      C Y V Y DG  S G L  +   +  T+G   A P +T GCG
Sbjct: 122 PICASLHPDNYRCDDPDQCDYEVEYADGGSSIGVLVNDLFPVNLTSGMR-ARPRLTIGCG 180

Query: 208 TNN-GGLFNSKTTGIVGLGGGDISLISQMRTT---------------------------- 238
            +   G+      G++GLG G  S+++Q+ +                             
Sbjct: 181 YDQLPGIAYHPLDGVLGLGRGSSSIVAQLSSQGLVRNVVGHCFSRRGGGYLFFGDDIYDS 240

Query: 239 -----------------------IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVM 275
                                  I   +  G+    +V DSG++ T+        LLS +
Sbjct: 241 SKVIWTPMSRDYLKHYTPGFAELILNGRSSGLKNLLVVFDSGSSYTYFNTQTYQTLLSFI 300

Query: 276 SSMIEAQPVADPT--GSLELCYSFNS-LSQVPEVTIHFR------GADVKLSRSNFFVKV 326
              +  +P+ +     +L +C+        + +   +F+      G+  K ++S F ++ 
Sbjct: 301 KKDLHGKPLKEAVEDDTLPVCWRGKKPFKSIRDAKKYFKPLALSFGSGWK-TKSQFEIQQ 359

Query: 327 SEDIVC----SVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
              ++     SV  GI N          I G+I     LV YD E+Q + ++P++C +
Sbjct: 360 ESYLIISSKGSVCLGILNGTEVGLQNYNIIGDISMQEKLVIYDNEKQVIGWQPSNCDR 417


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 82/367 (22%), Positives = 138/367 (37%), Gaps = 86/367 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPC---PPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G P    +   DTGSD++W  C PC   P          ++DP+ SST   + 
Sbjct: 29  YFTQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVS 88

Query: 148 CSSSQCAS---LNQKSCSGV--NCQYSVSYGDGSFSNGNLATETVTLG--STTGQAVALP 200
           CS   C       +  CS    NC+Y  SYGDGS S G    + +     S+ G A    
Sbjct: 89  CSDPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTS 148

Query: 201 GITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQ----------------------- 234
            + FGC     G  ++      GI+G G  ++S+ +Q                       
Sbjct: 149 QVLFGCSIRQTGDLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCLEGEKRGGG 208

Query: 235 ---------------------------MRTTIAGNQRLGVSTPD--------IVIDSGTT 259
                                      +R     + RL +   D        +++DSGTT
Sbjct: 209 ILVIGGIAEPGMTYTPLVPDSVHYNVVLRGISVNSNRLPIDAEDFSSTNDTGVIMDSGTT 268

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQV-PEVTIHFRGADVKLS 318
           L + P G  +  +  +     A PV       +       LS + P VT++F G  ++L 
Sbjct: 269 LAYFPSGAYNVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFPNVTLNFEGGAMELQ 328

Query: 319 RSNFFV------KVSEDIVCSVFKGITNS--------VPIYGNIMQTNFLVGYDIEQQTV 364
             N+ +        + D+ C  ++  ++S        + I G+I+  + LV YD++   +
Sbjct: 329 PDNYLMWGGTAPTGTTDVWCIGWQSSSSSAGPKDGSQLTILGDIVLKDKLVVYDLDNSRI 388

Query: 365 SFKPTDC 371
            +   +C
Sbjct: 389 GWMSYNC 395


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 98/412 (23%), Positives = 162/412 (39%), Gaps = 90/412 (21%)

Query: 36  RDSPKSPFYNSSETPY---QRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL 92
           R +P  P +      Y    RL  +L R L    H N       ++    D +  N  Y 
Sbjct: 37  RPAPGPPLFLPLTRSYPNASRLAASLRRGLGDGVHPN-------ARMRLHDDLLTNGYYT 89

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
            R+ IGTPP E   + D+GS + +  C  C   QC     P F P +SS+Y  + C+   
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCSSC--EQCGNHQDPRFQPDLSSSYSPVKCNVDC 147

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNG 211
               ++K C+     Y   Y + S S+G L  + V+ G  +   +      FGC  +  G
Sbjct: 148 TCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQHAIFGCENSETG 200

Query: 212 GLFNSKTTGIVGLGGGDISLISQM--RTTIA--------------GNQRLG--VSTPDIV 253
            LF+    GI+GLG G +S++ Q+  +  I+              G   LG  ++ PD++
Sbjct: 201 DLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260

Query: 254 I----------------------------------DSGTTL------TFLP-QGYNSNLL 272
                                                GT L       +LP Q + +   
Sbjct: 261 FSNSDPLRSPYYNIELKEIHVAGKALRVESRIFNSKHGTVLDSGTTYAYLPEQAFVAFKE 320

Query: 273 SVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGADVKLSRSNFFV 324
           +V S +   + +  P  S  ++C++      + L +V P+V + F  G  + L+  N+  
Sbjct: 321 AVTSKVHSLKKIRGPDPSYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQKLSLTPENYLF 380

Query: 325 ---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
              KV       VF+   +   + G I+  N LV YD   + + F  T+C++
Sbjct: 381 RHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNCSE 432


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 155/382 (40%), Gaps = 87/382 (22%)

Query: 65  LNHFNQNSSISSSKASQA--DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
             +    +S  S+KA Q   D     + Y+I + +GTP   ++   DTGS   W  CE C
Sbjct: 54  FRYITNKTSRLSTKAVQVGWDRGLQTSLYVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C 112

Query: 123 PPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN-----QKSCSGVNCQYSVSYGDGSF 177
               C+         + S+T   + C +S C         Q S +  +C + VSY DGS 
Sbjct: 113 --DGCHTNPRTFLQSR-STTCAKVSCGTSMCLLGGSDPHCQDSENYPDCPFRVSYQDGSA 169

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGL------------------------ 213
           S G L  +T+T          +PG +FGC  ++ G                         
Sbjct: 170 SYGILYQDTLTFSDVQ----KIPGFSFGCNMDSFGANEFGNVDGLLGMGAGPMSVLKQSS 225

Query: 214 ------------------FNSKTTGIVGLGG----GDI---SLISQMRTTI--------- 239
                             F SKTTG   LG      D+    ++++ + T          
Sbjct: 226 PTFDCFSYCLPLQKSERGFFSKTTGYFSLGKVATRTDVRYTKMVARKKNTELFFVDLTAI 285

Query: 240 -AGNQRLGVS-----TPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
               +RLG+S        +V DSG+ L+++P    S L   +  ++  +  A+   S   
Sbjct: 286 SVDGERLGLSPSVFSRKGVVFDSGSELSYIPDRALSVLSQRIRELLLKRGAAEEE-SERN 344

Query: 294 CYSFNSLSQ--VPEVTIHFR-GADVKLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGN 347
           CY   S+ +  +P +++HF  GA   L     FV+ S   +D+ C  F   T SV I G+
Sbjct: 345 CYDMRSVDEGDMPAISLHFDDGARFDLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGS 403

Query: 348 IMQTNFLVGYDIEQQTVSFKPT 369
           +MQT+  V YD+++Q +   P+
Sbjct: 404 LMQTSKEVVYDLKRQLIGIGPS 425


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 74/156 (47%), Gaps = 16/156 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y+    IG PP    A+ DTGSDL+WTQC  C    C  Q  P ++   SST+  +PC+
Sbjct: 89  QYVAEYLIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCA 148

Query: 150 SSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           +  CA+ +     C     C     YG G  + G L TE     S T +      + FGC
Sbjct: 149 ARICAANDDIIHFCDLAAGCSVIAGYGAGVVA-GTLGTEAFAFQSGTAE------LAFGC 201

Query: 207 GT----NNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
            T      G L  +  +G++GLG G +SL+SQ   T
Sbjct: 202 VTFTRIVQGALHGA--SGLIGLGRGRLSLVSQTGAT 235



 Score = 50.4 bits (119), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 62/133 (46%), Gaps = 6/133 (4%)

Query: 246 GVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE--LCYSFNSLSQV 303
           G+ +  ++IDSG+  T L       L S +++ +    VA P  + +  LC +   + +V
Sbjct: 312 GLFSGGVIIDSGSPFTSLVHDAYDALASELAARLNGSLVAPPPDADDGALCVARRDVGRV 371

Query: 304 -PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFK--GITNSVPIYGNIMQTNFLVGYDI 359
            P V  HFRG AD+ +   +++  V +   C      G      + GN  Q N  V YD+
Sbjct: 372 VPAVVFHFRGGADMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDL 431

Query: 360 EQQTVSFKPTDCT 372
                SF+P DC+
Sbjct: 432 ANGDFSFQPADCS 444


>gi|242082978|ref|XP_002441914.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
 gi|241942607|gb|EES15752.1| hypothetical protein SORBIDRAFT_08g004800 [Sorghum bicolor]
          Length = 429

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 21/167 (12%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + ++IG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 58  GDVYPHGL-YYVAMNIGNPPKPYFLDVDTGSDLTWLQCDAPC--RSCNKVPHPLYRPTKN 114

Query: 141 STYKSLPCSSSQCASLN-----QKSCSG--VNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    CASL+     +  C      C Y + Y D   S G L  ++  L    
Sbjct: 115 ---KLVPCVDQLCASLHNGLNRKHKCDSPYEQCDYVIKYADQGSSTGVLVNDSFALRLAN 171

Query: 194 GQAVALPGITFGCGTN----NGGLFNSKTTGIVGLGGGDISLISQMR 236
           G +V  P + FGCG +    +G +  S T G++GLG G +SL+SQ +
Sbjct: 172 G-SVVRPSLAFGCGYDQQVSSGEM--SPTDGVLGLGTGSVSLLSQFK 215


>gi|125595845|gb|EAZ35625.1| hypothetical protein OsJ_19916 [Oryza sativa Japonica Group]
          Length = 152

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 67/137 (48%), Gaps = 9/137 (6%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGV 164
           V DT SD+ W QC PCP   CY Q   L+DP  SS+     C+S  C  L      C+  
Sbjct: 20  VLDTASDVTWVQCSPCPTPPCYPQKDVLYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNN 79

Query: 165 N-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC--GTNNGGLFNSKTTGI 221
           N CQY V Y DG+ + G   ++ +T+      A A+    FGC  G      F S   GI
Sbjct: 80  NQCQYRVRYPDGTSTAGTYISDLLTI----TPATAVRSFQFGCSKGVKGSFSFGSSAAGI 135

Query: 222 VGLGGGDISLISQMRTT 238
           + LGGG  SL+SQ   T
Sbjct: 136 MALGGGPESLVSQTGAT 152


>gi|326503488|dbj|BAJ86250.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 551

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 149/371 (40%), Gaps = 90/371 (24%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + ++ P+   Y   I +G PP       DTGSDL W QC+ PC  + C     PL+ P  
Sbjct: 182 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 236

Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +  K +P   S C  L  +Q  C     C Y + Y D S S G LA + + L +T G  
Sbjct: 237 -AKEKIVPPRDSLCQELQGDQNYCETCKQCDYEIEYADRSSSMGVLAKDDMHLIATNGGR 295

Query: 197 VALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISLISQM---------------RTT 238
             L    FGC  +  G   S   KT GI+GL    ISL SQ+               R T
Sbjct: 296 EKL-DFVFGCAYDQQGQLLSSPAKTDGILGLSSAAISLPSQLASKGIISNVFGHCITRET 354

Query: 239 IAGNQ---------RLGVST------PD------------------------IVIDSGTT 259
             G           R G++       PD                        ++ DSG++
Sbjct: 355 NGGGYMFLGDDYVPRWGMTWAPIRGGPDNLYHTEAQKVNYGDQELHAGNSVQVIFDSGSS 414

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQP--VADPTG-SLELCYS--FNSLSQVPEVTIHFRGAD 314
            T+LP+    NL+  +    E  P  V D +  +L LC+   F+  S    + +HF    
Sbjct: 415 YTYLPEEMYKNLIDAIK---EDSPSFVQDSSDTTLPLCWKADFSVRSFFKPLNLHFGRRW 471

Query: 315 VKLSRSNFFVKVSEDIVC-----SVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQ 362
             + ++  F  V +D +      +V  G+ N       S  I G++     LV YD E++
Sbjct: 472 FVVPKT--FTIVPDDYLIISDKGNVCLGLLNGTEINHGSTIIVGDVSLRGKLVVYDNERR 529

Query: 363 TVSFKPTDCTK 373
            + +  ++CTK
Sbjct: 530 QIGWANSECTK 540


>gi|449434470|ref|XP_004135019.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
 gi|449517144|ref|XP_004165606.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 508

 Score = 81.3 bits (199), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/345 (26%), Positives = 136/345 (39%), Gaps = 67/345 (19%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPL----FDPKMSSTYK 144
           Y   +SIGTP    L   DTGSDL W  CE   CP       +       +    SST  
Sbjct: 104 YYANVSIGTPGLYFLVALDTGSDLFWLPCECTKCPTYLTKRDNGKFWLNHYSSNASSTSI 163

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALP-GI 202
            +PCSSS C   NQ S +  +C Y   Y  + S S G L  + + + +   Q   +   +
Sbjct: 164 RVPCSSSLCELANQCSSNKSSCPYQTHYLSENSSSAGYLVQDILHMATDDSQLKPVDVKV 223

Query: 203 TFGCGTNNGGLFNSKTT--GIVGLGGGDIS----LISQMRTTIAGNQRLG------VSTP 250
           T GCG    G F++ T   G++GLG G +S    L SQ  TT + +   G      +   
Sbjct: 224 TLGCGKVQTGKFSNVTAPNGLIGLGMGKVSVPSFLASQGLTTDSFSMCFGYYGYGRIDFG 283

Query: 251 DI-------------------------------------VIDSGTTLTFLPQGYNSNLLS 273
           DI                                     +IDSG + T+L   + S +  
Sbjct: 284 DIGPVGQRETPFNPASLSYNVTILQIIVTNRPTNVHLTAIIDSGASFTYLTDPFYSIITE 343

Query: 274 VMSSMIEAQPV-ADPTGSLELCY--SFNSLSQVPEVTIHFRGADVKLSRSNFFVKVSEDI 330
            M + +E + + +D     E CY  S  ++ Q P +     G   K      +V V  D 
Sbjct: 344 NMDAAMELERIKSDSDFPFEYCYRLSLATIFQQPNLNFTMEGGR-KFDVITSYVSVDTDD 402

Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDI----EQQTVSFKPTDC 371
             ++   I  S  I  N++  NF  GY +    E+ T+ +K  DC
Sbjct: 403 GPALCLAIVKSTDI--NVIGHNFFGGYRVVFNREKMTLGWKEVDC 445


>gi|79495937|ref|NP_567922.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660833|gb|AEE86233.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 401

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 53/173 (30%), Positives = 83/173 (47%), Gaps = 18/173 (10%)

Query: 77  SKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQ 130
           ++A  + + P + N      Y + I+IG PP       DTGSDL W QC+  P  +C   
Sbjct: 37  TRAVSSVVFPVHGNVYPLGYYNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEA 95

Query: 131 DSPLFDPKMSSTYKSLPCSSSQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATET 186
             PL+ P    +   +PC+   C +L    NQ+  +   C Y V Y DG  S G L  + 
Sbjct: 96  PHPLYQP----SSDLIPCNDPLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDV 151

Query: 187 VTLGSTTGQAVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMRT 237
            ++  T G  +  P +  GCG +   G   +    G++GLG G +S++SQ+ +
Sbjct: 152 FSMNYTQGLRLT-PRLALGCGYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 203


>gi|4490316|emb|CAB38807.1| nucellin-like protein [Arabidopsis thaliana]
 gi|7270297|emb|CAB80066.1| nucellin-like protein [Arabidopsis thaliana]
          Length = 420

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 50/153 (32%), Positives = 75/153 (49%), Gaps = 12/153 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + I+IG PP       DTGSDL W QC+  P  +C     PL+ P    +   +PC+ 
Sbjct: 38  YNVTINIGQPPRPYYLDLDTGSDLTWLQCD-APCVRCLEAPHPLYQP----SSDLIPCND 92

Query: 151 SQCASL----NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
             C +L    NQ+  +   C Y V Y DG  S G L  +  ++  T G  +  P +  GC
Sbjct: 93  PLCKALHLNSNQRCETPEQCDYEVEYADGGSSLGVLVRDVFSMNYTQGLRLT-PRLALGC 151

Query: 207 GTNN--GGLFNSKTTGIVGLGGGDISLISQMRT 237
           G +   G   +    G++GLG G +S++SQ+ +
Sbjct: 152 GYDQIPGASSHHPLDGVLGLGRGKVSILSQLHS 184


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 77/159 (48%), Gaps = 14/159 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +GTPP E     DTGSD++W   + C  CP +         FD   SST + +P
Sbjct: 81  YFTRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVP 140

Query: 148 CSSSQCASLNQKSCSGV-----NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV---AL 199
           CS   C S  Q + +        C Y+  YGDGS ++G   ++T    +  G+++   + 
Sbjct: 141 CSHPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSS 200

Query: 200 PGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
             I FGC T   G     +    GI G G G++S+ISQ+
Sbjct: 201 AAIVFGCSTYQSGDLTKTDKAVDGIFGFGQGELSVISQL 239



 Score = 40.0 bits (92), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 32/127 (25%), Positives = 60/127 (47%), Gaps = 10/127 (7%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPT-GSLELCYSF-NSLSQV-PEVTIH 309
           +ID+GTTL +L +      +S +++ +    +A PT      CY   NS+S+V P V+ +
Sbjct: 315 IIDTGTTLAYLVEEAYDPFVSAITAAVSQ--LATPTINKGNQCYLVSNSVSEVFPPVSFN 372

Query: 310 FRGADVKLSRSNFFVK-----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
           F G    L +   ++          + C  F+ I   + I G+++  + +  YD+  Q +
Sbjct: 373 FAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDKIFVYDLAHQRI 432

Query: 365 SFKPTDC 371
            +   DC
Sbjct: 433 GWANYDC 439


>gi|217073884|gb|ACJ85302.1| unknown [Medicago truncatula]
          Length = 259

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 94/194 (48%), Gaps = 17/194 (8%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
           + Q  G ++++IH  SP SPF  S    ++    +++   T  L  L+      SI    
Sbjct: 23  DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           AS   II  +  Y++R  IGTPP   L   DT +D  W  C  C    C    S LF P+
Sbjct: 82  ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGCA---STLFAPE 135

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+T+K++ C++ +C  +    C   +C ++++YG  S +  NL  +T+TL +       
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSCNFNLTYGSSSIA-ANLVQDTITLATD-----P 189

Query: 199 LPGITFGCGTNNGG 212
           +P  TFGC +   G
Sbjct: 190 VPSYTFGCVSKTTG 203


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/362 (24%), Positives = 143/362 (39%), Gaps = 80/362 (22%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
           D +  N  Y  R+ IGTPP E   + D+GS + +  C  C   QC     P F P +SS+
Sbjct: 81  DDLLTNGYYTTRLYIGTPPQEFALIVDSGSTVTYVPCASC--EQCGNHQDPRFQPDLSSS 138

Query: 143 YKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           Y  + C+       ++K C+     Y   Y + S S+G L  + V+ G  +   +     
Sbjct: 139 YSPVKCNVDCTCDSDKKQCT-----YERQYAEMSSSSGVLGEDIVSFGRES--ELKPQRA 191

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLG------------------------------GGDISL 231
            FGC  +  G LF+    GI+GLG                              GG + L
Sbjct: 192 VFGCENSETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVL 251

Query: 232 -----------------------ISQMRTTIAG-----NQRLGVSTPDIVIDSGTTLTFL 263
                                  I      +AG     + R+  S    V+DSGTT  +L
Sbjct: 252 GGVPAPSDMVFSHSDPLRSPYYNIELKEIHVAGKALRVDSRVFNSKHGTVLDSGTTYAYL 311

Query: 264 P-QGYNSNLLSVMSSMIEAQPVADPTGSL-ELCYS-----FNSLSQV-PEVTIHF-RGAD 314
           P Q + +   +V S +   + +  P  +  ++C++      + L +V P+V + F  G  
Sbjct: 312 PEQAFVAFKDAVTSKVHSLKKIRGPDPNYKDICFAGAGRNVSKLHEVFPDVDMVFGNGQK 371

Query: 315 VKLSRSNFFV---KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + L+  N+     KV       VF+   +   + G I+  N LV YD   + + F  T+C
Sbjct: 372 LSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKTNC 431

Query: 372 TK 373
           ++
Sbjct: 432 SE 433


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 63/209 (30%), Positives = 92/209 (44%), Gaps = 29/209 (13%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + IGTPP  +  V DTGS L W QC    P++     S  FDP +SST+ +LPC+  
Sbjct: 98  IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTAS--FDPSLSSTFSTLPCTHP 155

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L         C YS  Y DG+++ GNL  E  T      +++  P +  G
Sbjct: 156 VCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSLFTPPLILG 211

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLT---F 262
           C T      ++   GI+G+  G +S  SQ + T     +     P  V   G T T   +
Sbjct: 212 CATE-----STDPRGILGMNRGRLSFASQSKIT-----KFSYCVPTRVTRPGYTPTGSFY 261

Query: 263 LPQGYNSNLLSVMSSMIEAQ----PVADP 287
           L    NSN    +  +  A+    P  DP
Sbjct: 262 LGHNPNSNTFRYIEMLTFARSQRMPNLDP 290


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 45/122 (36%), Positives = 66/122 (54%), Gaps = 11/122 (9%)

Query: 85  IPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYK 144
           I  N  Y++R  IGTP    L   DT SD+ W  C       C    S LF+   S+TYK
Sbjct: 30  IVQNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFNSPASTTYK 84

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
           SL C ++QC  + + +C G  C ++++YG  S +  NL+ +T+TL +      A+PG +F
Sbjct: 85  SLGCQAAQCKQVPKPTCGGGVCSFNLTYGGSSLA-ANLSQDTITLATD-----AVPGYSF 138

Query: 205 GC 206
           GC
Sbjct: 139 GC 140


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 51/159 (32%), Positives = 81/159 (50%), Gaps = 21/159 (13%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP     V DTGS+L W  C+  P          +F+P  SSTY  +
Sbjct: 57  HNVTLTVTLAVGSPPQNISMVLDTGSELSWLHCKKSP------NLGSVFNPVSSSTYSPV 110

Query: 147 PCSSSQCASLNQK-----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           PCSS  C +  +      SC      C  ++SY D +   GNLA +T  +GS T      
Sbjct: 111 PCSSPICRTRTRDLPIPASCDPKTHFCHVAISYADATSIEGNLAHDTFVIGSVT-----R 165

Query: 200 PGITFGC---GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           PG  FGC   G ++    ++K+TG++G+  G +S ++Q+
Sbjct: 166 PGTLFGCMDSGLSSDSEEDAKSTGLMGMNRGSLSFVNQL 204


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 80.9 bits (198), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 54/158 (34%), Positives = 84/158 (53%), Gaps = 20/158 (12%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++GTPP     V DTGS+L W  C+           + +F+P +SS+Y  +
Sbjct: 66  HNVTLTVSLTVGTPPQSVTMVLDTGSELSWLHCKK------QQNINSVFNPHLSSSYTPI 119

Query: 147 PCSSSQCASLNQK-----SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PC S  C +  +      SC   N C  +VSY D +   GNLA++T  + S +GQ    P
Sbjct: 120 PCMSPICKTRTRDFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAI-SGSGQ----P 174

Query: 201 GITFG---CGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           GI FG    G ++    +SKTTG++G+  G +S ++QM
Sbjct: 175 GIIFGSMDSGFSSNANEDSKTTGLMGMNRGSLSFVTQM 212


>gi|357520119|ref|XP_003630348.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355524370|gb|AET04824.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 435

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 88/363 (24%), Positives = 150/363 (41%), Gaps = 94/363 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           Y + ++IG PP       DTGS+L W QC+ PC  SQC     PL+ P    +   +PC 
Sbjct: 74  YNVTLNIGQPPRPYFLDVDTGSELTWLQCDAPC--SQCSETPHPLYKP----SNDFIPCK 127

Query: 150 SSQCASL---NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
              CASL   +  +C   N C Y + Y D   + G L  +   L  T G  + +  +  G
Sbjct: 128 DPLCASLQPTDDYTCEDPNQCDYEIKYADQYSTLGVLLNDVYLLNFTNGVQLKV-RMALG 186

Query: 206 CGTNNGGLFNSKT----TGIVGLGGGDISLISQMRTT----------------------- 238
           CG +   +F+  T     GI+GLG G  SLISQ+ +                        
Sbjct: 187 CGYDQ--IFSPSTYHPLDGILGLGRGKASLISQLNSQGLVRNVMGHCLSSRGGGYIFFGN 244

Query: 239 ----------------------------IAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                                       + G ++ GV + +I+ D+G++ T+        
Sbjct: 245 VYDSSRMSWTPISSIDSGKHYSAGPAELVFGGRKTGVGSLNIIFDTGSSYTYFNSQAYQA 304

Query: 271 LLSVMSSMIEAQPV--ADPTGSLELCYS----FNSLSQVPE----VTIHF-RGADVKLSR 319
           ++S+++  +  +P+  A    +L +C+     F S+++V +    +T+ F  G  VK   
Sbjct: 305 MISLLNKELHRKPIKAAPDDQTLPMCWHGKRPFRSINEVKKYFKPLTLSFTNGGRVK--- 361

Query: 320 SNFFVKVSEDIVCS----VFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
             F +     ++ S    V  GI N        + + G+I   + ++ +D E+Q + + P
Sbjct: 362 PQFEIPPEAYLIISNMGNVCLGILNGPEVGLGELNLIGDISMLDKVMVFDNEKQLIGWGP 421

Query: 369 TDC 371
            DC
Sbjct: 422 ADC 424


>gi|125532795|gb|EAY79360.1| hypothetical protein OsI_34488 [Oryza sativa Indica Group]
          Length = 342

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 80/353 (22%), Positives = 133/353 (37%), Gaps = 112/353 (31%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+  ++IGTPP    A+     + +WTQC PC   +C+ QD PLF+              
Sbjct: 28  YMANLTIGTPPQPASAIIHLAGEFVWTQCSPC--RRCFKQDLPLFN-------------- 71

Query: 151 SQCASLNQKSCSGVNCQYSVS--YGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT 208
                           +Y V   +GD S   G   T+T  +G+ T        + FGC  
Sbjct: 72  ----------------RYEVETMFGDTSGIGG---TDTFAIGTATAS------LAFGCAM 106

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTT-------------------------IAGNQ 243
           ++        +G+VGLG    SL+ QM  T                         +AG +
Sbjct: 107 DSNIKQLLGASGVVGLGRTPWSLVGQMNATAFSYCLAPHGAAGKKSALLLGASAKLAGGK 166

Query: 244 RL--------------------GVSTPDIVI-----------DSGTTLTFLPQGYNSNLL 272
                                 G+   D++I           D+   ++FL       + 
Sbjct: 167 SAATTPLVNTSDDSSDYMIHLEGIKFGDVIIEPPPNGSVVLVDTIFGVSFLVDAAFHAIK 226

Query: 273 SVMSSMIEAQPVADPTGSLELCY-------SFNSLSQVPEVTIHFRGAD-VKLSRSNFFV 324
             ++  + A P+A PT   +LC+         NS   +P+V + F+GA  + +  S +  
Sbjct: 227 KAVTVAVGAAPMATPTKPFDLCFPKAAAAAGANSSLPLPDVVLTFQGAAALTVPPSKYMY 286

Query: 325 KVSEDIVC-----SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                 VC     S    +T  + I G + Q N    +D++++T+SF+P DC+
Sbjct: 287 DAGNGTVCLAMMSSAMLNLTTELSILGRLHQENIHFLFDLDKETLSFEPADCS 339


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 75/159 (47%), Gaps = 14/159 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I +G+P  +     DTGSD++W    +C  CP          L+DPK S T + + 
Sbjct: 69  YFTKIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVS 128

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C  + C+S  +     C   N C YS+SYGDGS + G    + +T     G    A    
Sbjct: 129 CEHNFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNS 188

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
            I FGCG    G F S +     GI+G G  + S++SQ+
Sbjct: 189 SIIFGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQL 227


>gi|224063191|ref|XP_002301033.1| predicted protein [Populus trichocarpa]
 gi|222842759|gb|EEE80306.1| predicted protein [Populus trichocarpa]
          Length = 536

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 111/444 (25%), Positives = 168/444 (37%), Gaps = 99/444 (22%)

Query: 22  IEAQTG-GFSVELIHR--DSPKS--------------PFYNSSETPYQRLRDALTRSLNR 64
            EA  G  FS +LIHR  D  KS              P   S E     L + L R   +
Sbjct: 20  FEASIGLTFSSKLIHRFSDEAKSISISRKGNASGDLWPKRYSFEYFQLLLGNDLKRQRMK 79

Query: 65  LNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIWTQCE-- 120
           L    +N  +  S+ SQA    N  ++L    I IGTP    L   D GSDL+W  C+  
Sbjct: 80  LGS-QKNQLLFPSQGSQALFFGNELDWLHYTWIDIGTPNVSFLVALDAGSDLLWVPCDCI 138

Query: 121 PCPP-SQCYM-----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGD 174
            C P S  Y      +D   + P +SST + L C    C   +        C Y  +Y D
Sbjct: 139 QCAPLSASYYNISLDRDLSEYSPSLSSTSRHLSCDHQLCEWGSNCKNPKDPCPYIFNYDD 198

Query: 175 --GSFSNGNLATETVTL---GSTTGQAVALPGITFGCGTNNGGLF--NSKTTGIVGLGGG 227
              + S G L  + + L   G  T + +    +  GCG   GG F   +   G++GLG G
Sbjct: 199 FENTTSAGFLVEDKLHLASVGDHTARKMLQASVVLGCGRKQGGSFFDGAAPDGVMGLGPG 258

Query: 228 DISLISQM---------------------------------------------------R 236
           DIS+ S +                                                    
Sbjct: 259 DISVPSLLAKAGLIQNCFSLCFDENDSGRILFGDRGHASQQSTPFLPIQGTYVAYFVGVE 318

Query: 237 TTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS 296
           +   GN  L  S    ++DSG++ T+LP    + L+S     + A+ ++   G  + CY+
Sbjct: 319 SYCVGNSCLKRSGFKALVDSGSSFTYLPSEVYNELVSEFDKQVNAKRISFQDGLWDYCYN 378

Query: 297 FNS--LSQVPEVTIHF-RGADVKLSRSNFFVKVSED--IVCSVFKGITNSVPIYGNIMQT 351
            +S  L  +P + + F R  +  +    + +   +   + C   +    S   YG I Q 
Sbjct: 379 ASSQELHDIPAIQLKFPRNQNFVVHNPTYSIPHHQGFTMFCLSLQPTDGS---YGIIGQ- 434

Query: 352 NFLVGY----DIEQQTVSFKPTDC 371
           NF++GY    DIE   + +  + C
Sbjct: 435 NFMIGYRMVFDIENLKLGWSNSSC 458


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/259 (28%), Positives = 109/259 (42%), Gaps = 34/259 (13%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
           A  L  + +  + CFY  S +  Q  G   E   R+  +S   P Y  +           
Sbjct: 83  ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140

Query: 48  -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
            +   +R+ D   ++ NR+      ++ ++S A    + ++ P+   Y   I IG PP  
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199

Query: 104 RLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKS 160
                DTGSDL W QC+ PC  + C     PL+ P   +  K +P     C  L  NQ  
Sbjct: 200 YFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNY 254

Query: 161 CSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--- 216
           C     C Y + Y D S S G LA + + + +T G    L    FGC  +  G   S   
Sbjct: 255 CETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPA 313

Query: 217 KTTGIVGLGGGDISLISQM 235
           KT GI+GL    IS  SQ+
Sbjct: 314 KTDGILGLSSAAISFPSQL 332


>gi|12323376|gb|AAG51657.1|AC010704_1 nucellin-like protein; 27671-25467 [Arabidopsis thaliana]
          Length = 427

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 143/364 (39%), Gaps = 95/364 (26%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y + ++IG PP       DTGSDL W QC+  P + C          K    + +LPCS 
Sbjct: 67  YYVLLNIGNPPKLFDLDIDTGSDLTWVQCD-APCNGC---------TKYKPNHNTLPCSH 116

Query: 151 SQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
             C+ L+    + C+     C Y + Y D + S G L T+ V L    G  + L  +TFG
Sbjct: 117 ILCSGLDLPQDRPCADPEDQCDYEIGYSDHASSIGALVTDEVPLKLANGSIMNLR-LTFG 175

Query: 206 CG---TNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------------- 236
           CG    N G      T GI+GLG G + L +Q++                          
Sbjct: 176 CGYDQQNPGPHPPPPTAGILGLGRGKVGLSTQLKSLGITKNVIVHCLSHTGKGFLSIGDE 235

Query: 237 ---------TTIAGN-----------------QRLGVSTPDIVIDSGTTLTFLPQGYNSN 270
                    T++A N                 +  GV   ++V DSG++ T+        
Sbjct: 236 LVPSSGVTWTSLATNSPSKNYMAGPAELLFNDKTTGVKGINVVFDSGSSYTYFNAEAYQA 295

Query: 271 LLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHF----RGADVK 316
           +L ++   +  +P+ D     SL +C+       SL +V +    +T+ F     G   +
Sbjct: 296 ILDLIRKDLNGKPLTDTKDDKSLPVCWKGKKPLKSLDEVKKYFKTITLRFGNQKNGQLFQ 355

Query: 317 LSRSNFFVKVSEDIVCSVFKGITNSVP-------IYGNIMQTNFLVGYDIEQQTVSFKPT 369
           +   ++ +   +  VC    GI N          I G+I     +V YD E+Q + +  +
Sbjct: 356 VPPESYLIITEKGRVCL---GILNGTEIGLEGYNIIGDISFQGIMVIYDNEKQRIGWISS 412

Query: 370 DCTK 373
           DC K
Sbjct: 413 DCDK 416


>gi|449439393|ref|XP_004137470.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449486840|ref|XP_004157418.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 570

 Score = 80.1 bits (196), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 75/161 (46%), Gaps = 10/161 (6%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + DI P+   Y   I +G PP       DTGSDL W QC+ PC  S C    SPL+ P+ 
Sbjct: 190 RGDIYPDGL-YYTYIMVGEPPRPYFLDIDTGSDLTWVQCDAPC--SSCGKGRSPLYKPRR 246

Query: 140 SS--TYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
            +  ++K   C   Q      +  +   C Y V Y D S S G L  +  TL  + G   
Sbjct: 247 ENVVSFKDSLCMEVQRNYDGDQCAACQQCNYEVQYADQSSSLGVLVKDEFTLRFSNGSLT 306

Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            L  I FGC  +  GL     SKT GI+GL    +SL SQ+
Sbjct: 307 KLNAI-FGCAYDQQGLLLNTLSKTDGILGLSRAKVSLPSQL 346


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 41/125 (32%), Positives = 72/125 (57%), Gaps = 3/125 (2%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS---QVPEVTI 308
           ++IDSGTT+T+L +     L     S    Q     +  L++C+S  S +   +VP++  
Sbjct: 79  VIIDSGTTITYLEKSVFDTLKKEFISQSNLQLDKSSSTGLDVCFSLPSETTQVEVPKLVF 138

Query: 309 HFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
           HF+G D++L   ++ +  S+  V  +  G +N + I+GN+ Q N LV +D+E++T+SF P
Sbjct: 139 HFKGGDLELPAESYMIADSKLGVACLAMGASNGMSIFGNVQQQNILVNHDLEKETISFVP 198

Query: 369 TDCTK 373
           T C +
Sbjct: 199 TQCDQ 203


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/167 (34%), Positives = 80/167 (47%), Gaps = 15/167 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP        DTGSD++W     C+ CP       +  L+DP  SS+   + 
Sbjct: 81  YFTQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVT 140

Query: 148 CSSSQCASLNQK---SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQA---VALP 200
           C    C + +     SC     CQYS+SYGDGS + G   T+ +     +G +   +A  
Sbjct: 141 CGQDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANT 200

Query: 201 GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRTTIAGNQR 244
            ITFGCG   GG   S +    GI+G G  + S++SQ+    AG  R
Sbjct: 201 SITFGCGAKIGGDLGSSSQALDGILGFGQSNSSMLSQLAA--AGKVR 245



 Score = 40.0 bits (92), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 31/127 (24%), Positives = 60/127 (47%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           +IDSGTTL +LP    + ++S + +     P+ +        YS +     P +T HF G
Sbjct: 313 IIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQDFQCFRYSGSVDDGFPIITFHFEG 372

Query: 313 A-DVKLSRSNFFVKVSEDIVCSVFK--GIT----NSVPIYGNIMQTNFLVGYDIEQQTVS 365
              + +   ++  +  E + C  F+  G+       + + G++  +N LV YD+E Q + 
Sbjct: 373 GLPLNIHPHDYLFQNGE-LYCMGFQTGGLQTKDGKDMVLLGDLAFSNRLVLYDLENQVIG 431

Query: 366 FKPTDCT 372
           +   +C+
Sbjct: 432 WTDYNCS 438


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/158 (34%), Positives = 76/158 (48%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W     C  CP S         FDP  SST   + 
Sbjct: 68  YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 127

Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
           CS  +C+   Q S   CS  G  C Y+  YGDGS ++G   ++ +   +  G +V  +  
Sbjct: 128 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 187

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC  +  G     +    GI G G  D+S+ISQM
Sbjct: 188 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQM 225


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/194 (31%), Positives = 89/194 (45%), Gaps = 17/194 (8%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
           + Q  G ++E+ H  SP SPF  S    +     +L+      L  L       SI    
Sbjct: 27  DTQDHGSTLEVFHVFSPCSPFRPSKPLSWAESVLQLQAKDQARLQFLASMVAGRSIVP-I 85

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           AS   II  +  Y++R  IGTPP   L   DT +D  W  C  C    C    S LF P+
Sbjct: 86  ASGRQII-QSPTYIVRAKIGTPPQTLLLAIDTSNDAAWIPCTAC--DGC---TSTLFAPE 139

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+T+K++ C S +C  +   SC    C ++++YG  S +  N+  +TVTL +       
Sbjct: 140 KSTTFKNVSCGSPECNKVPSPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD-----P 193

Query: 199 LPGITFGCGTNNGG 212
           +PG TFGC     G
Sbjct: 194 IPGYTFGCVAKTTG 207


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/158 (34%), Positives = 76/158 (48%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G+PP E     DTGSD++W     C  CP S         FDP  SST   + 
Sbjct: 83  YFTRVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLIS 142

Query: 148 CSSSQCASLNQKS---CS--GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV--ALP 200
           CS  +C+   Q S   CS  G  C Y+  YGDGS ++G   ++ +   +  G +V  +  
Sbjct: 143 CSDQRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNSSA 202

Query: 201 GITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            I FGC  +  G     +    GI G G  D+S+ISQM
Sbjct: 203 SIVFGCSISQTGDLTKSDRAVDGIFGFGQQDMSVISQM 240


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 66/220 (30%), Positives = 96/220 (43%), Gaps = 24/220 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA  R L   +    +  ++S+  +     P+   
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +GTP  + L   DT +D  W+ C PC    C       F P  SS+Y SLPC+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134

Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
             C     + C            C +S  + D SF   +L ++T+ LG       A+ G 
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAG 241
            FGC G   G   N    G++GLG G +SL+SQ  +T  G
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQTGSTYNG 228


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 64/189 (33%), Positives = 87/189 (46%), Gaps = 34/189 (17%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNA----NY----------LIRISIGTPPTERLAVADT 110
           L+  ++NS  SSS ASQ    PN      NY          ++ + IGTPP  +  V DT
Sbjct: 38  LSSHSKNSLFSSSLASQFKQNPNTKTTSYNYRSSFKYSMALIVSLPIGTPPQTQQMVLDT 97

Query: 111 GSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGV 164
           GS L W QC+  PP          FDP +SS++  LPC+ S C       +L        
Sbjct: 98  GSQLSWIQCK-VPPK----TPPTAFDPLLSSSFSVLPCNHSLCKPRVPDYTLPTSCDQNR 152

Query: 165 NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGL 224
            C YS  Y DG+++ GNL  E  T  S+       P +  GC T+     +S T GI+G+
Sbjct: 153 LCHYSYFYADGTYAEGNLVREKFTFSSSQ----TTPPLILGCATD-----SSDTQGILGM 203

Query: 225 GGGDISLIS 233
             G +S  S
Sbjct: 204 NLGRLSFSS 212


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 91/208 (43%), Gaps = 15/208 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPN 87
           G  ++L H D+        + T  +R+R A+  S       N  S+ +      A +   
Sbjct: 33  GIRMKLTHVDA------KGNYTAPERVRRAIALS----RQINLASTRAEGGGVSAPVHWA 82

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
              Y+    +G PP    A+ DTGS LIWTQC  C    C  QD P F+   S ++  +P
Sbjct: 83  TRQYIAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVP 142

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C    CA      C+    C + V+YG G    G L T+  T   + G  +A   ++F  
Sbjct: 143 CQDKACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTF-QSGGATLAFGCVSFTR 200

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
                 L  +  +G++GLG G +SL SQ
Sbjct: 201 FAAPDVLHGA--SGLIGLGRGRLSLASQ 226



 Score = 53.9 bits (128), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 35/127 (27%), Positives = 62/127 (48%), Gaps = 8/127 (6%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP---TGSLELCYSFNSLSQV-PEVT 307
           ++IDSG+  T L +     L+  ++  +    V  P    G + LC +   L +V P + 
Sbjct: 317 VIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPTLV 376

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           +HF G AD+ L   N++  + +   C ++ +G   S  I GN  Q N  + +D+    +S
Sbjct: 377 LHFSGGADMALPPENYWAPLEKSTACMAIVRGYLQS--IIGNFQQQNMHILFDVGGGRLS 434

Query: 366 FKPTDCT 372
           F+  DC+
Sbjct: 435 FQNADCS 441


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 144/357 (40%), Gaps = 91/357 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y+I + +GTP   ++   DTGS   W  CE C    C+         + S+T   + C +
Sbjct: 82  YVISVGLGTPAKTQIVEIDTGSSTSWVFCE-C--DGCHTNPRTFLQSR-STTCAKVSCGT 137

Query: 151 SQCASLN-----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
           S C         Q S +  +C + VSY DGS S G L  +T+T          +P  TFG
Sbjct: 138 SMCLLGGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQ----KIPSFTFG 193

Query: 206 CGTNNGGL------------------------------------------FNSKTTGIVG 223
           C  ++ G                                           F SKTTG   
Sbjct: 194 CNLDSFGANEFGNVDGLLGMGAGPMSVLKQSSPRFDGFSYCLPLQKSERGFFSKTTGYFS 253

Query: 224 LGG----GDI---SLISQMRTTI----------AGNQRLGVS-----TPDIVIDSGTTLT 261
           LG      D+    ++++ + T              +RLG+S        +V DSG+ L+
Sbjct: 254 LGKVATRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFSRKGVVFDSGSELS 313

Query: 262 FLPQGYNSNLLSVMSSMIEA---QPVADPTGSLELCYSFNSLSQ--VPEVTIHFR-GADV 315
           ++P       LSV+S  I     +  A    S   CY   S+ +  +P +++HF  GA  
Sbjct: 314 YIPD----RALSVLSQRIRELLLRRGAAEEESERNCYDMRSVDEGDMPAISLHFDDGARF 369

Query: 316 KLSRSNFFVKVS---EDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            L     FV+ S   +D+ C  F   T SV I G++MQT+  V YD+++Q +   P+
Sbjct: 370 DLGSHGVFVERSVQEQDVWCLAF-APTESVSIIGSLMQTSKEVVYDLKRQLIGIGPS 425


>gi|125575536|gb|EAZ16820.1| hypothetical protein OsJ_32292 [Oryza sativa Japonica Group]
          Length = 253

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 40/103 (38%), Positives = 59/103 (57%), Gaps = 5/103 (4%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+   +IGTPP    AV D   +L+WTQC+ C   +C+ Q +PLFDP  S+TY++ PC 
Sbjct: 50  NYVANFTIGTPPQPASAVIDLAGELVWTQCKQC--GRCFEQGTPLFDPTASNTYRAEPCG 107

Query: 150 SSQCASL--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG 190
           +  C S+  + ++CSG  C Y  S   G  + G    + VT+ 
Sbjct: 108 TPLCESIPSDVRNCSGNVCAYEASTNAGD-TGGKAVKKAVTVA 149



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 29/102 (28%), Positives = 52/102 (50%), Gaps = 7/102 (6%)

Query: 279 IEAQPVADPTGSLELCYSFNSLS-QVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVC---- 332
           + A P+A P    +LC+  +  S   P++   FRG A + +  +N+ +      VC    
Sbjct: 150 VGAPPMATPVEPFDLCFPKSGASGAAPDLVFTFRGGAAMTVPATNYLLDYKNGTVCLAML 209

Query: 333 -SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
            S     T  + + G++ Q N    +D++++T+SF+P DCTK
Sbjct: 210 SSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADCTK 251


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 52/149 (34%), Positives = 78/149 (52%), Gaps = 8/149 (5%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           ++  IGTPP E L + DT S+L W Q   C  + C     P F+P +SS++ S PC+SS 
Sbjct: 1   MQTKIGTPPREVLLLVDTASELTWVQGTSC--TNCSPTKVPPFNPGLSSSFISEPCTSSV 58

Query: 153 CASLN----QKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C   +    Q +C  S  +C + V+Y DGS + G +A E  +L S  G A  L  + FGC
Sbjct: 59  CLGRSKLGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGC 118

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
            + +       ++G +GL  G  S  +Q+
Sbjct: 119 ASKDLQRPVDFSSGTLGLNRGSFSFPAQI 147



 Score = 49.3 bits (116), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 33/132 (25%), Positives = 63/132 (47%), Gaps = 14/132 (10%)

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSS-MIEAQPVADPTGSLELCYSFNS----LSQVPEVTI 308
            DSGTT++FL +  ++ L+      ++     +    + ELCY   +    L   P VT+
Sbjct: 239 FDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTKELCYDVAAGDARLPTAPLVTL 298

Query: 309 HFRG-ADVKLSRSNFFVKVSED----IVCSVFKGI----TNSVPIYGNIMQTNFLVGYDI 359
           HF+   D++L  ++ +V ++       +C  F          V + GN  Q ++L+ +D+
Sbjct: 299 HFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYLIEHDL 358

Query: 360 EQQTVSFKPTDC 371
           E+  + F P +C
Sbjct: 359 ERSRIGFAPANC 370


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 89/203 (43%), Gaps = 36/203 (17%)

Query: 32  ELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANY 91
           E+  RD  +  F NS    Y         S N  NH + N           ++   + N+
Sbjct: 88  EIFGRDESRVSFINSKCNQYT--------SGNLKNHAHNN-----------NLFDEDGNF 128

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ ++ GTPP   + + DTGS + WTQC+ C    C       F+   SSTY S  C   
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITWTQCKAC--VNCLQDSHRYFNWSASSTYSSGSCIPG 186

Query: 152 QCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNG 211
                       V   Y+++YGD S S GN   +T+TL  +           FGCG NN 
Sbjct: 187 T-----------VENNYNMTYGDDSTSVGNYGCDTMTLEPSD----VFQKFQFGCGRNNK 231

Query: 212 GLFNSKTTGIVGLGGGDISLISQ 234
           G F S   G++GLG G +S +SQ
Sbjct: 232 GDFGSGVDGMLGLGQGQLSTVSQ 254


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 49/159 (30%), Positives = 73/159 (45%), Gaps = 14/159 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP +     DTGSD++W    +C  CP       D  L+DPK S T   + 
Sbjct: 70  YFTKLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVS 129

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G    + C YS++YGDGS + G    + +T     G     P   
Sbjct: 130 CDQDFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNS 189

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
            I FGCG    G   S +     GI+G G  + S++SQ+
Sbjct: 190 SIIFGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQL 228


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/127 (34%), Positives = 77/127 (60%), Gaps = 7/127 (5%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS--LELCYSFNSLS---QVPEV 306
           ++IDSGTT+T++ +    +L    +S  +  PV D +GS  L++C+S  S     ++P++
Sbjct: 68  VIIDSGTTITYIEENAFDSLKKEFTSQTKL-PV-DKSGSTGLDVCFSLPSGKTEVEIPKL 125

Query: 307 TIHFRGADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
             HF+G D++L   N+ +  S   V  +  G +N + I+GNI Q N LV +D++++T++F
Sbjct: 126 VFHFKGGDLELPGENYMIADSSLGVACLAMGASNGMSIFGNIQQQNILVNHDLQKETITF 185

Query: 367 KPTDCTK 373
            PT C K
Sbjct: 186 IPTQCNK 192


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 89/370 (24%), Positives = 137/370 (37%), Gaps = 98/370 (26%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ IGTP  +     DTGSD++W    QC  CP +     +  L++ K S + K +P
Sbjct: 86  YYAKVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVP 145

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C    C  +N    SG    ++C Y   YGDGS + G    + V     +G  Q  +  G
Sbjct: 146 CDEEFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNG 205

Query: 202 -ITFGCGTNNGGLF----NSKTTGIVGLG----------------------------GGD 228
            + FGCG    G           GI+G G                            GG 
Sbjct: 206 SVIFGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCLDGINGGG 265

Query: 229 ISLISQ---------------------MRTTIAGNQRLGVSTPDI--------VIDSGTT 259
           I  I                       M     G   L + T +         +IDSGTT
Sbjct: 266 IFAIGHVVQPKVNMTPLIPNQPHYNVNMTAVQVGEDFLHLPTEEFEAGDRKGAIIDSGTT 325

Query: 260 LTFLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVK 316
           L +LP+     L+S + S    ++   V D     +  YS +     P VT HF      
Sbjct: 326 LAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ--YSGSVDDGFPNVTFHF------ 377

Query: 317 LSRSNFFVKVSEDIVCSVFKGI--------------TNSVPIYGNIMQTNFLVGYDIEQQ 362
              ++ F+KV        F+G+                ++ + G+++ +N LV YD+E Q
Sbjct: 378 --ENSVFLKVHPHEYLFPFEGLWCIGWQNSGMQSRDRRNMTLLGDLVLSNKLVLYDLENQ 435

Query: 363 TVSFKPTDCT 372
            + +   +C+
Sbjct: 436 AIGWTEYNCS 445


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 53/158 (33%), Positives = 82/158 (51%), Gaps = 17/158 (10%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C     +         F+   S +Y+ +
Sbjct: 27  HNISLTVSLTVGTPPQNVSMVIDTGSELSWLYCN---KTTTTTSYPTTFNQTRSISYRPI 83

Query: 147 PCSSSQCASLNQK-----SC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
           PCSSS C +  +      SC S   C  ++SY D S S GNLA++T  +G     A  +P
Sbjct: 84  PCSSSTCTNQTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHMG-----ASDIP 138

Query: 201 GITFGCGT---NNGGLFNSKTTGIVGLGGGDISLISQM 235
           G+ FGC     ++    +SK TG++G+  G +S +SQM
Sbjct: 139 GMVFGCMDSVFSSNSDEDSKNTGLMGMNRGSLSFVSQM 176


>gi|449508697|ref|XP_004163385.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase Asp1-like
           [Cucumis sativus]
          Length = 418

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 75/163 (46%), Gaps = 15/163 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ PN   Y + + +G PP       DTGSDL W QC+ PC   QC     PL+ P  
Sbjct: 48  QGNVYPNGF-YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPC--QQCTETLHPLYQP-- 102

Query: 140 SSTYKSLPCSSSQCASLN---QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             +   +PC    C SL+      C   + C Y V Y DG  S G L  +   L  T G 
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD 160

Query: 196 AVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMR 236
            +  P +  GCG +   G        GI+GLG G +S++SQ+ 
Sbjct: 161 PIR-PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLH 202


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
           + FGCG    G   S +    GI+G G  + S++SQ+ ++
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 155 YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 214

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 215 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 274

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
           + FGCG    G   S +    GI+G G  + S++SQ+ ++
Sbjct: 275 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 314


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 169/414 (40%), Gaps = 91/414 (21%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
           P  ++ E    R RDAL     R     Q+S+     + Q    P     Y  ++ +GTP
Sbjct: 30  PTNHTVELSQLRARDAL-----RHRRMLQSSNGVVDFSVQGTFDPFQVGLYYTKVQLGTP 84

Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           P E     DTGSD++W  C     CP +         FDP  SST   + CS  +C +  
Sbjct: 85  PVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGI 144

Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
           Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + A P + FGC 
Sbjct: 145 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTA-P-VVFGCS 202

Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-------V 247
               G     +    GI G G  ++S+ISQ+ +           + G+   G       +
Sbjct: 203 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGGGILVLGEI 262

Query: 248 STPDIV--------------------------IDS---------------GTTLTFLPQG 266
             P+IV                          IDS               GTTL +L + 
Sbjct: 263 VEPNIVYTSLVPAQPHYNLNLQSIAVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEE 322

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNFF 323
                +S +++ I  Q V         CY   +S+++V P+V+++F  GA + L   ++ 
Sbjct: 323 AYDPFVSAITASIP-QSVHTVVSRGNQCYLITSSVTEVFPQVSLNFAGGASMILRPQDYL 381

Query: 324 VKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++ +      + C  F+ I    + I G+++  + +V YD+  Q + +   DC+
Sbjct: 382 IQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 435


>gi|356509399|ref|XP_003523437.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 89/367 (24%), Positives = 148/367 (40%), Gaps = 97/367 (26%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           Y + ++IG PP       DTGSDL W QC+ PC    C +  + L+ P        + C 
Sbjct: 64  YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPC--KGCTLPRNRLYKPHGD----LVKCV 117

Query: 150 SSQCASLN---QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
              CA++       C+G N  C Y V Y D   S G L  + + L  T G ++A P + F
Sbjct: 118 DPLCAAIQSAPNHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNG-SLARPMLAF 176

Query: 205 GCG---TNNGGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN-------------- 242
           GCG   T++G      T G++GLG G  S++SQ+ +      + G+              
Sbjct: 177 GCGYDQTHHGQNPPPSTAGVLGLGNGRTSILSQLHSLGLIRNVVGHCLSGRGGGFLFFGD 236

Query: 243 -------------------QRLGVSTPDIVID--------------SGTTLTFLPQGYNS 269
                              Q       D+  D              SG++ T+     + 
Sbjct: 237 QLIPPSGVVWTPLLQSSSAQHYKTGPADLFFDRKTTSVKGLELIFDSGSSYTYFNSQAHK 296

Query: 270 NLLSVMSSMIEAQPVADPTG--SLELCYS----FNSLSQVPEVTIHFRGADVKLSRSN-- 321
            L++++++ +  +P++  TG  SL +C+     F SL    +VT +F+   +  ++S   
Sbjct: 297 ALVNLIANDLRGKPLSRATGDPSLPICWKGPKPFKSLH---DVTSNFKPLLLSFTKSKNS 353

Query: 322 --------FFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSF 366
                   + +      VC        +  G TN   I G+I   + LV YD E+Q + +
Sbjct: 354 PLQLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN---IIGDISLQDKLVIYDNEKQQIGW 410

Query: 367 KPTDCTK 373
              +C +
Sbjct: 411 ASANCDR 417


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score = 79.0 bits (193), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 48/132 (36%), Positives = 65/132 (49%), Gaps = 10/132 (7%)

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN--QKSCSGVN- 165
           DT  D+ W QC PC   QCY Q +  FDP+ SST   + C S  C +L      CS  N 
Sbjct: 164 DTTEDVPWIQCLPCLIPQCYPQRNAFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNS 223

Query: 166 ---CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
              C Y + Y D   + G   T+T+T+  +T          FGC     G F+++ +G +
Sbjct: 224 TGDCLYRIEYSDHRLTLGTYMTDTLTISPST----TFLNFRFGCSHAVRGKFSAQASGTM 279

Query: 223 GLGGGDISLISQ 234
            LGGG  SL+SQ
Sbjct: 280 SLGGGPQSLLSQ 291



 Score = 39.3 bits (90), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 33/124 (26%), Positives = 54/124 (43%), Gaps = 10/124 (8%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQ--VPEVTIHF 310
           V+DS   +T LP      L     + + A     PTG+L+ C+ F  +S+  VP V++ F
Sbjct: 373 VMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGNLDTCFDFVGVSKVTVPTVSLVF 432

Query: 311 R-GADVKLSRSNFFVKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFK 367
             GA ++L   +  +       C  F  +    ++   GN+ Q    V YD+    V F+
Sbjct: 433 DGGAVIELGLLSVLLD-----SCLAFAPMAADFALGFIGNVQQQTHEVLYDVAGGAVGFR 487

Query: 368 PTDC 371
              C
Sbjct: 488 HGAC 491


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 52/162 (32%), Positives = 71/162 (43%), Gaps = 14/162 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP +     D  L++   S T K +P
Sbjct: 78  YYAKIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVP 137

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C  +N     G    ++C Y   YGDGS + G    + V     +G      A  
Sbjct: 138 CDQEFCYEINGGQLPGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAANG 197

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQMRTT 238
            + FGCG    G   S       GI+G G  + S+ISQ+  T
Sbjct: 198 SVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVT 239


>gi|312282765|dbj|BAJ34248.1| unnamed protein product [Thellungiella halophila]
          Length = 515

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 108/443 (24%), Positives = 176/443 (39%), Gaps = 85/443 (19%)

Query: 9   FILFFLCFYVVSPIEAQTGGFSVELIHRDS-------PKSPFYNSSETPYQRL---RDAL 58
            IL  +  +V+   E   G F  E  HR S       P     N   + Y R+   RD L
Sbjct: 14  LILMLVSSWVLDRCEG-LGEFGFEFHHRFSDQVVGVLPGDGLPNRDSSKYYRVMAHRDRL 72

Query: 59  TRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGTPPTERLAVADTGSDLIW 116
            R   RL   +++ S+ +       I  N   +L    +++GTP    L   DTGSDL W
Sbjct: 73  IRG-RRLA--SEDQSLVTFADGNETIRVNALGFLHYANVTVGTPSDWFLVALDTGSDLFW 129

Query: 117 TQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
             C+ C  + C  +         D  ++ P  SST   +PC+S+ C  +++ +    +C 
Sbjct: 130 LPCD-CS-TNCVRELKAPGGSSLDLNIYSPNASSTSSKVPCNSTLCTRVDRCASPLSDCP 187

Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNNGGLFN--SKTTGIVG 223
           Y + Y  +G+ S G L  + + L S    +  +   IT GCG    G+F+  +   G+ G
Sbjct: 188 YQIRYLSNGTSSTGVLVEDVLHLVSMEKNSKPIRARITLGCGLVQTGVFHDGAAPNGLFG 247

Query: 224 LGGGDISLISQM-RTTIAGN-----------------------QR---LGVSTP------ 250
           LG  DIS+ S + +  IA N                       QR   L +  P      
Sbjct: 248 LGLEDISVPSVLAKEGIAANSFSMCFGDDGAGRISFGDKGSVDQRETPLNIRQPHPTYNV 307

Query: 251 ----------------DIVIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLEL 293
                           D V D+GT+ T+L    Y     S  S  ++ +   D     E 
Sbjct: 308 TVTQISVGGNTGDLEFDAVFDTGTSFTYLTDAPYTLISESFNSLALDKRYQTDSELPFEY 367

Query: 294 CYSF--NSLS-QVPEVTIHFR-GADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGNIM 349
           CY+   N  S + P+V +  + G+   +      V + + +V  +    +  + I G   
Sbjct: 368 CYAVSPNKKSFEYPDVNLTMKGGSSYPVYHPLIVVPIEDTVVYCLAIMKSEDISIIGQNF 427

Query: 350 QTNFLVGYDIEQQTVSFKPTDCT 372
            T + V +D E+  + +K +DC+
Sbjct: 428 MTGYRVVFDREKLILGWKESDCS 450


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 65/213 (30%), Positives = 93/213 (43%), Gaps = 24/213 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA  R L   +    +  I+S+  +     P+   
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGITSAPVASGQTPPS--- 78

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +GTP  + L   DT +D  W+ C PC    C       F P  SS+Y SLPC+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134

Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
             C     + C            C +S  + D SF   +L ++T+ LG       A+ G 
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            FGC G   G   N    G++GLG G +SL+SQ
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQ 221


>gi|357160697|ref|XP_003578847.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 421

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 91/375 (24%), Positives = 151/375 (40%), Gaps = 95/375 (25%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
            D+ P+   Y + +SIG PP       DTGSDL W QC+ PC    C     PL+ P  +
Sbjct: 50  GDVYPHGL-YYVAMSIGNPPRPYFLDVDTGSDLTWLQCDAPC--VSCNKVPHPLYRPTKN 106

Query: 141 STYKSLPCSSSQCASLN-----QKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTT 193
              K +PC    C+SL+     +  C      C Y + Y D   S G L T++  +    
Sbjct: 107 ---KIVPCVDQLCSSLHGGLSGKHKCDSPKQQCDYEIKYADQGSSLGVLLTDSFAV-RLA 162

Query: 194 GQAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMR-----TTIAGN--- 242
             ++  P + FGCG +         + T G++GLG G ISL+SQ++       + G+   
Sbjct: 163 NSSIVRPSLAFGCGYDQQVGSSTEVAPTDGVLGLGSGSISLLSQLKQHGITKNVVGHCLS 222

Query: 243 --------------------------------------------QRLGVSTPDIVIDSGT 258
                                                       + LGV   ++V+DSG+
Sbjct: 223 IRGGGFLFFGDNLVPYSRATWVPMVRSAFKNYYSPGTASLYFGGRSLGVRPMEVVLDSGS 282

Query: 259 TLTFL-PQGYNSNLLSVMSSMIEA-QPVADPTGSLELCYS----FNSLSQVPE----VTI 308
           + T+   Q Y + + ++ S + +  + V DP  SL LC+     F S+  V +    + +
Sbjct: 283 SFTYFGAQPYQALVTALKSDLSKTLKEVFDP--SLPLCWKGKKPFKSVLDVKKEFKSLVL 340

Query: 309 HF---RGADVKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYD 358
            F   + A +++   N+ +       C    GI N        + I G+I   + +V YD
Sbjct: 341 SFSNGKKALMEIPPENYLIVTKFGNAC---LGILNGSEIGLKDLNIVGDITMQDQMVIYD 397

Query: 359 IEQQTVSFKPTDCTK 373
            E+  + +    C +
Sbjct: 398 NERGQIGWIRAPCDR 412


>gi|383156234|gb|AFG60356.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156236|gb|AFG60358.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156239|gb|AFG60361.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
          Length = 154

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 73/153 (47%), Gaps = 12/153 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G ++ L H     SP   ++ + +  L   +L R  +RL      +S S +  S   +  
Sbjct: 3   GVNIRLDHIHGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRNSGSYTTMSNLPLQS 62

Query: 87  NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            N     NY++    GTP  + L + DTGSDL W QC+PC    CY Q  P+F+P  SS+
Sbjct: 63  GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
           YKSLPC S+ C  L     N   C    C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEI 153


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 83/281 (29%), Positives = 113/281 (40%), Gaps = 75/281 (26%)

Query: 159 KSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKT 218
           + CSG +C Y V YGDGS++ G  A +T+TL S      A+ G  FGCG  N GLF  + 
Sbjct: 14  RGCSGGHCLYGVQYGDGSYTIGFFAMDTLTLSSHD----AIKGFRFGCGERNEGLFG-EA 68

Query: 219 TGIVGLGGGDISLISQ--------------MRTTIAGNQRLG----------VSTPDIVI 254
            G++GLG G  SL  Q               R++  G    G          +ST  ++I
Sbjct: 69  AGLLGLGRGKTSLPVQTYDKYGGVFAHCFPARSSGTGYLEFGPGSSPAVSAKLSTTPMLI 128

Query: 255 D-----------------------------------SGTTLTFLPQGYNSNLLSVMSSMI 279
           D                                   SGT +T LP    S+L S  ++ +
Sbjct: 129 DTGPTFYYVGMTGIRVGGKLLPIPQSVFAAAGTIVDSGTVITRLPPAAYSSLRSAFAASM 188

Query: 280 EAQPV--ADPTGSLELCYSFNSLSQV--PEVTIHFRGA---DVKLSRSNFFVKVSEDIVC 332
            A+    A     L+ CY     S+V  P V++ F+G    DV  S   +   VS+   C
Sbjct: 189 AARGYKRAPALSLLDTCYDLTGASEVAIPTVSLLFQGGVSLDVDASGIIYAASVSQ--AC 246

Query: 333 SVFKG--ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             F G    + V I GN     F V YDI  + V F P  C
Sbjct: 247 LGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGFCPGAC 287


>gi|449459186|ref|XP_004147327.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 418

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 55/163 (33%), Positives = 75/163 (46%), Gaps = 15/163 (9%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ PN   Y + + +G PP       DTGSDL W QC+ PC   QC     PL+ P  
Sbjct: 48  QGNVYPNGF-YNVTLYVGQPPKPYFLDPDTGSDLTWLQCDAPC--QQCTETLHPLYQP-- 102

Query: 140 SSTYKSLPCSSSQCASLN---QKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
             +   +PC    C SL+      C   + C Y V Y DG  S G L  +   L  T G 
Sbjct: 103 --SNDLVPCKDPLCMSLHSSMDHRCENPDQCDYEVEYADGGSSLGVLVRDVFPLNLTNGD 160

Query: 196 AVALPGITFGCGTNN--GGLFNSKTTGIVGLGGGDISLISQMR 236
            +  P +  GCG +   G        GI+GLG G +S++SQ+ 
Sbjct: 161 PIR-PRLALGCGYDQDPGSSSYHPMDGILGLGRGAVSIVSQLH 202


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/358 (24%), Positives = 143/358 (39%), Gaps = 95/358 (26%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N    + +++G+PP     V DTGS+L W  C+  P        + +F+P +SS+Y   
Sbjct: 32  HNVTLTVSLTVGSPPQRVTMVLDTGSELSWLHCKKLP------NLNFIFNPLVSSSYTPT 85

Query: 147 PCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           PC+S  C +  +   + V+C  +      +F  G  A                 G+ FGC
Sbjct: 86  PCTSPICTTQTRDLINPVSCDANKLCHIITFFVGGPAQR---------------GMVFGC 130

Query: 207 ---GTNNGGLFNSKTTGIVGLGGGDISLISQMR-------------------TTIAGNQR 244
              GT++G   +SKTTG++G+  G +S  +QMR                     IA   R
Sbjct: 131 MDTGTSSGDE-DSKTTGLMGMDLGSLSFSNQMRLPKFSYCISNKDSTGVLVLENIANPPR 189

Query: 245 LG---------VSTP------------------------DIVIDSGTTLTFLPQGYNSNL 271
           LG          +TP                          ++DS T  TFL Q   + L
Sbjct: 190 LGPLHYTPLVKKTTPLPYFNRNCCLFQKSAFLPDHTGAGQTMVDSATQFTFLRQPVYTAL 249

Query: 272 LSVMSSMIE--AQPVADPT----GSLELCYSF---NSLSQVPEVTIHFRGADVKLSRSNF 322
            +  +   +    P+ DP     G ++LC+     ++L  +P VT+ F GA+++++    
Sbjct: 250 KNEFAIQTKNILTPLGDPKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFDGAELRVTGERL 309

Query: 323 FVKVSE------DIVCSVFKG---ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             KVS        I C  F     +     I G+  Q N  + YD+    + F  T+C
Sbjct: 310 LYKVSNVAKSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 58/166 (34%), Positives = 83/166 (50%), Gaps = 19/166 (11%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMS 140
           A +      Y+    IG+PP    A+ DTGSDLIWTQC   C P  C  Q  P ++   S
Sbjct: 77  AQVHRATRQYIASYLIGSPPQRTEALIDTGSDLIWTQCATTCLPKSCAKQGLPYYNLSQS 136

Query: 141 STYKSLPCSSSQ--CASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           ST+  +PC+     CA+     C G++  C +  SYG G    G+L TE+    S T   
Sbjct: 137 STFVPVPCADKAGFCAANGVHLC-GLDGSCTFIASYGAGRVI-GSLGTESFAFESGT--- 191

Query: 197 VALPGITFGCGT----NNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
                + FGC +     +G L  +  +G++GLG G +SL+SQ+  T
Sbjct: 192 ---TSLAFGCVSLTRITSGAL--NDASGLIGLGRGRLSLVSQIGAT 232



 Score = 50.1 bits (118), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 42/144 (29%), Positives = 67/144 (46%), Gaps = 12/144 (8%)

Query: 234 QMRTTIAGNQRLGVSTPDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS-L 291
           Q+R    G    GV     +ID+G+ LT L    Y +    V + +     V  P  S L
Sbjct: 304 QLRQLFKGYWAGGV-----IIDTGSPLTQLASHAYEALKEEVAAQLGNGSLVPAPEDSGL 358

Query: 292 ELCYSFNSLSQV-PEVTIHFRG-ADVKLSRSNFFVKVSEDIVCS-VFKGITNSVPIYGNI 348
           ELC +     +V P +  HF G AD+ +  ++++  V +   C  + +G  +S  I GN 
Sbjct: 359 ELCVAREGFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYDS--IIGNF 416

Query: 349 MQTNFLVGYDIEQQTVSFKPTDCT 372
            Q +  + YD+ +   SF+  DCT
Sbjct: 417 QQQDMHLLYDLRRGRFSFQTADCT 440


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 52/163 (31%), Positives = 80/163 (49%), Gaps = 10/163 (6%)

Query: 74  ISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
           + S++    D +     Y  R+ IGTPP E   + DTGS + +  C  C  + C     P
Sbjct: 18  LGSARMDLHDDLLTKGYYTSRVKIGTPPHEFSLIVDTGSTVTYVPCSSC--THCGNHQDP 75

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            F P +SS+YK L C  S+C++     C G   +Y   Y + S S+G L  + +  G + 
Sbjct: 76  RFSPALSSSYKPLEC-GSECST---GFCDGSR-KYQRQYAEKSTSSGVLGKDVI--GFSN 128

Query: 194 GQAVALPGITFGCGT-NNGGLFNSKTTGIVGLGGGDISLISQM 235
              +    + FGC T   G L++    GI+GLG G +S+I Q+
Sbjct: 129 SSDLGGQRLVFGCETAETGDLYDQTADGIIGLGRGPLSIIDQL 171


>gi|30680102|ref|NP_849967.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|17978947|gb|AAL47439.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|22655368|gb|AAM98276.1| At2g17760/At2g17760 [Arabidopsis thaliana]
 gi|330251585|gb|AEC06679.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 513

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 96/399 (24%), Positives = 154/399 (38%), Gaps = 77/399 (19%)

Query: 45  NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPP 101
           N   + Y R+   RD L R   RL + +Q+    S       +      +   +++GTP 
Sbjct: 56  NRDSSKYYRVMAHRDRLIRG-RRLANEDQSLVTFSDGNETVRVDALGFLHYANVTVGTPS 114

Query: 102 TERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSSSQ 152
              +   DTGSDL W    PC  + C  +         D  ++ P  SST   +PC+S+ 
Sbjct: 115 DWFMVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNSTL 171

Query: 153 CASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGTNN 210
           C   ++ +    +C Y + Y  +G+ S G L  + + L S    + A+P  +TFGCG   
Sbjct: 172 CTRGDRCASPESDCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTFGCGQVQ 231

Query: 211 GGLFN--SKTTGIVGLGGGDISLISQM-RTTIAGN-----------------------QR 244
            G+F+  +   G+ GLG  DIS+ S + +  IA N                       QR
Sbjct: 232 TGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVDQR 291

Query: 245 ---LGVSTP----------------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMI 279
              L +  P                      D V DSGT+ T+L     + +    +S+ 
Sbjct: 292 ETPLNIRQPHPTYNITVTKISVGGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNSLA 351

Query: 280 --EAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFV--KVSEDIVC 332
             +     D     E CY+ +      Q P V +  +G           V      D+ C
Sbjct: 352 LDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDVYC 411

Query: 333 SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
                I + + I G    T + V +D E+  + +K +DC
Sbjct: 412 LAIMKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449


>gi|242062640|ref|XP_002452609.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
 gi|241932440|gb|EES05585.1| hypothetical protein SORBIDRAFT_04g028990 [Sorghum bicolor]
          Length = 557

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 61/193 (31%), Positives = 88/193 (45%), Gaps = 15/193 (7%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVA 108
           +R+ D   ++ N++      ++ ++S A    + ++ P+   Y   I +G PP       
Sbjct: 146 RRIDDGWRKARNKMEVAKAAAAGTNSTALLPIKGNVFPD-GQYYTSIFVGNPPRPYFLDV 204

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSCSGVN- 165
           DTGSDL W QC+  P + C     PL+ P   +  K +P     C  L  NQ  C     
Sbjct: 205 DTGSDLTWIQCD-APCTNCAKGPHPLYKP---TKEKIVPPRDLLCQELQGNQNYCETCKQ 260

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---KTTGIV 222
           C Y + Y D S S G LA + + L +T G    L    FGC  +  G   S   KT GI+
Sbjct: 261 CDYEIEYADQSSSMGVLARDDMHLIATNGGREKL-DFVFGCAYDQQGQLLSSPAKTDGIL 319

Query: 223 GLGGGDISLISQM 235
           GL    ISL SQ+
Sbjct: 320 GLSNAAISLPSQL 332


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 105/414 (25%), Positives = 166/414 (40%), Gaps = 91/414 (21%)

Query: 42  PFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN-YLIRISIGTP 100
           P  +  E    R RD L     R     Q+SS     + Q    P     Y  ++ +GTP
Sbjct: 33  PTNHGVELSQLRARDEL-----RHRRMLQSSSGVVDFSVQGTFDPFQVGLYYTKVQLGTP 87

Query: 101 PTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLN 157
           P E     DTGSD++W  C     CP +         FDP  SST   + CS  +C +  
Sbjct: 88  PVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSDQRCNNGK 147

Query: 158 QKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAVALPGITFGCG 207
           Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + A P + FGC 
Sbjct: 148 QSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSMTTNSTA-P-VVFGCS 205

Query: 208 TNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-------V 247
               G     +    GI G G  ++S+ISQ+ +           + G+   G       +
Sbjct: 206 NQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGDSSGGGILVLGEI 265

Query: 248 STPDIV--------------------------IDS---------------GTTLTFLPQG 266
             P+IV                          IDS               GTTL +L + 
Sbjct: 266 VEPNIVYTSLVPAQPHYNLNLQSISVNGQTLQIDSSVFATSNSRGTIVDSGTTLAYLAEE 325

Query: 267 YNSNLLSVMSSMIEAQPVADPTGSLELCYSF-NSLSQV-PEVTIHFR-GADVKLSRSNFF 323
                +S +++ I  Q V         CY   +S++ V P+V+++F  GA + L   ++ 
Sbjct: 326 AYDPFVSAITAAIP-QSVRTVVSRGNQCYLITSSVTDVFPQVSLNFAGGASMILRPQDYL 384

Query: 324 VKVSE----DIVCSVFKGIT-NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++ +      + C  F+ I    + I G+++  + +V YD+  Q + +   DC+
Sbjct: 385 IQQNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVYDLAGQRIGWANYDCS 438


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 48/159 (30%), Positives = 74/159 (46%), Gaps = 14/159 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP +     DTGSD++W    +C  CP       D  L+DPK S T + + 
Sbjct: 70  YFTKLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELIS 129

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP--- 200
           C    C++       G    + C YS++YGDGS + G    + +T           P   
Sbjct: 130 CDQEFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNS 189

Query: 201 GITFGCGTNNGGLFNSKTT----GIVGLGGGDISLISQM 235
            I FGCG    G  +S +     GI+G G  + S++SQ+
Sbjct: 190 SIIFGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQL 228


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 88/362 (24%), Positives = 134/362 (37%), Gaps = 86/362 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP       +  L+D K S T K + 
Sbjct: 98  YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157

Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C ++N        + ++C Y+  Y DGS S G    + V     +G      A  
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217

Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLG----------------------------GGDIS 230
            + FGC     G  +S+    GI+G G                            GG I 
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277

Query: 231 LISQ---------------------MRTTIAGNQRLGVSTP--DI------VIDSGTTLT 261
            I                       M+    G   L + T   D+      +IDSGTTL 
Sbjct: 278 AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLA 337

Query: 262 FLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
           +LP+     LLS + S    ++   + D     +  YS +     P VT HF  +     
Sbjct: 338 YLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYLKV 395

Query: 319 RSNFFVKVSEDIVCSVFKGITNS---------VPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
             + ++   + + C    G  NS         + + G++  +N LV YD+E Q + +   
Sbjct: 396 HPHEYLFSYDGLWCI---GWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEY 452

Query: 370 DC 371
           +C
Sbjct: 453 NC 454


>gi|297832400|ref|XP_002884082.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329922|gb|EFH60341.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 513

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 99/401 (24%), Positives = 156/401 (38%), Gaps = 81/401 (20%)

Query: 45  NSSETPYQRL---RDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYL--IRISIGT 99
           N   + Y R+   RD L R   RL   N++ S+ +       I  +   +L    +++GT
Sbjct: 56  NRDSSKYYRVMAHRDRLIRG-RRLA--NEDQSLVTFSDGNETIRVDALGFLHYANVTVGT 112

Query: 100 PPTERLAVADTGSDLIWTQCEPCPPSQCYMQ---------DSPLFDPKMSSTYKSLPCSS 150
           P    L   DTGSDL W    PC  + C  +         D  ++ P  SST   +PC+S
Sbjct: 113 PSDWFLVALDTGSDLFWL---PCDCTNCVRELKAPGGSSLDLNIYSPNASSTSTKVPCNS 169

Query: 151 SQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPG-ITFGCGT 208
           + C   ++ +    NC Y + Y  +G+ S G L  + + L S    + A+P  +T GCG 
Sbjct: 170 TLCTRGDRCASPESNCPYQIRYLSNGTSSTGVLVEDVLHLVSNDKSSKAIPARVTLGCGQ 229

Query: 209 NNGGLFN--SKTTGIVGLGGGDISLISQM-RTTIAGN----------------------- 242
              G+F+  +   G+ GLG  DIS+ S + +  IA N                       
Sbjct: 230 VQTGVFHDGAAPNGLFGLGLEDISVPSVLAKEGIAANSFSMCFGNDGAGRISFGDKGSVD 289

Query: 243 QR---LGVSTP----------------------DIVIDSGTTLTFLPQGYNSNLLSVMSS 277
           QR   L +  P                      D V DSGT+ T+L     + +    +S
Sbjct: 290 QRETPLNIRQPHPTYNITVTKISVEGNTGDLEFDAVFDSGTSFTYLTDAAYTLISESFNS 349

Query: 278 MI--EAQPVADPTGSLELCYSFNSLS---QVPEVTIHFRGADVKLSRSNFFV--KVSEDI 330
           +   +     D     E CY+ +      Q P V +  +G           V      D+
Sbjct: 350 LALDKRYQTTDSELPFEYCYALSPNKDSFQYPAVNLTMKGGSSYPVYHPLVVIPMKDTDV 409

Query: 331 VCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
            C     I + + I G    T + V +D E+  + +K +DC
Sbjct: 410 YCLAILKIED-ISIIGQNFMTGYRVVFDREKLILGWKESDC 449


>gi|91806508|gb|ABE65981.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 203

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 62/112 (55%), Gaps = 5/112 (4%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +A Y   + IGTPP E   V DTGSDL+W  C  C    C + +   FDP  SS+   L 
Sbjct: 75  SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132

Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
           CS  +C+S  QK   CS + +C Y V YGDGS ++G   ++ ++  + +G A
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMSGVA 184


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 74  YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 133

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 134 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 193

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
           + FGCG    G   S +    GI+G G  + S++SQ+ ++
Sbjct: 194 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 233


>gi|297849132|ref|XP_002892447.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338289|gb|EFH68706.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 86/363 (23%), Positives = 149/363 (41%), Gaps = 82/363 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W   T C  CP +         FDP +SS+   + 
Sbjct: 84  YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143

Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PG 201
           CS  +C S    +  CS  N C YS  YGDGS ++G   ++ ++  +     +A+     
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGFYISDFMSFDTVITSTLAINSSAP 203

Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-- 246
             FGC     G          GI GLG G +S+ISQ+             + G++  G  
Sbjct: 204 FVFGCSNLQTGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGI 263

Query: 247 -----VSTPDIV-----------------------------------------IDSGTTL 260
                +  PD V                                         ID+GTTL
Sbjct: 264 MVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTL 323

Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVK 316
            +LP    S  +  +++ +    +P+   T     C+   +  +   PEV++ F G    
Sbjct: 324 AYLPDEAYSPFIQAIANAVSQYGRPI---TYESYQCFEITAGDVDVFPEVSLSFAGGASM 380

Query: 317 LSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           + R + ++++       I C  F+ +++  + I G+++  + +V YD+ +Q + +   DC
Sbjct: 381 VLRPHAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440

Query: 372 TKQ 374
           + +
Sbjct: 441 SLE 443


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 23/154 (14%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQC-EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           +I + IGTPP  +  V DTGS L W QC +  PP+         FDP +SST+  LPC+ 
Sbjct: 76  IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTAS-------FDPSLSSTFSILPCTH 128

Query: 151 SQCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITF 204
             C       +L         C YS  Y DG+++ GNL  E  T      ++V+ P +  
Sbjct: 129 PLCKPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF----SRSVSTPPLIL 184

Query: 205 GCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           GC T      ++   GI+G+  G +S   Q + T
Sbjct: 185 GCATE-----STDPRGILGMNLGRLSFAKQSKIT 213


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score = 78.2 bits (191), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 64/213 (30%), Positives = 93/213 (43%), Gaps = 24/213 (11%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN 90
           + + H   P SP    S     R  DA  R L   +    +  ++S+  +     P+   
Sbjct: 24  LSVYHNVHPPSPSPLESIIALARADDA--RLLFLSSKAASSGGVTSAPVASGQTPPS--- 78

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y++R  +GTP  + L   DT +D  W+ C PC    C       F P  SS+Y SLPC+S
Sbjct: 79  YVVRAGLGTPVQQLLLALDTSADATWSHCAPC--DTCPAGSR--FIPASSSSYASLPCAS 134

Query: 151 SQCASLNQKSCSG--------VNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
             C     + C            C +S  + D SF   +L ++T+ LG       A+ G 
Sbjct: 135 DWCPLFEGQPCPANQDASAPLPACAFSKPFADTSF-QASLGSDTLRLGKD-----AIAGY 188

Query: 203 TFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
            FGC G   G   N    G++GLG G +SL+SQ
Sbjct: 189 AFGCVGAVAGPTTNLPKQGLLGLGRGPMSLLSQ 221


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 95/205 (46%), Gaps = 21/205 (10%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLN 66
           +F L FL   +V  +  +  G +V++ H  SP+SPF  S    ++  +   L     RL 
Sbjct: 5   LFSLAFLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQ 64

Query: 67  HFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
                SS+   K     AS   I+  +  Y+++ ++GTP    L   DT +D  W  C  
Sbjct: 65  FL---SSLVGRKSWVPIASGRQIV-QSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN- 119

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
                C    S +F+   S+T+K+L C + QC  +   +C G  C ++ +YG GS    N
Sbjct: 120 ----GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSN 174

Query: 182 LATETVTLGSTTGQAVALPGITFGC 206
           L  +T+ L +       +PG TFGC
Sbjct: 175 LTRDTIALSTDI-----VPGYTFGC 194


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 49/150 (32%), Positives = 73/150 (48%), Gaps = 12/150 (8%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           N  Y  R+ IGTPP     + DTGS + +  C  C   QC     P F P+ SSTY+ + 
Sbjct: 81  NGYYTTRLWIGTPPQMFALIVDTGSTVTYVPCSTC--EQCGRHQDPKFQPESSSTYQPVK 138

Query: 148 CS-SSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C+    C S        + C Y   Y + S S+G L  + ++ G+ +   +A     FGC
Sbjct: 139 CTIDCNCDS------DRMQCVYERQYAEMSTSSGVLGEDLISFGNQS--ELAPQRAVFGC 190

Query: 207 -GTNNGGLFNSKTTGIVGLGGGDISLISQM 235
                G L++    GI+GLG GD+S++ Q+
Sbjct: 191 ENVETGDLYSQHADGIMGLGRGDLSIMDQL 220


>gi|212722898|ref|NP_001132197.1| pepsin A precursor [Zea mays]
 gi|194693730|gb|ACF80949.1| unknown [Zea mays]
 gi|195605492|gb|ACG24576.1| pepsin A [Zea mays]
 gi|413938914|gb|AFW73465.1| pepsin A [Zea mays]
          Length = 519

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 94/387 (24%), Positives = 149/387 (38%), Gaps = 72/387 (18%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSD 113
           LR  L R   RL   NQ  S+S   ++ +        Y   + +GTP T  L   DTGSD
Sbjct: 63  LRSDLQRQKRRLAGKNQLLSLSKGGSTFSPGNDLGWLYYAWVDVGTPTTSFLVALDTGSD 122

Query: 114 LIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQ 167
           L W  C+   C P   Y     +D  ++ P  S+T + LPCS   C   +  +     C 
Sbjct: 123 LFWVPCDCIQCAPLSSYRGNLDRDLGIYKPAESTTSRHLPCSHELCQPGSGCTNPKQPCT 182

Query: 168 YSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTGIVGL 224
           Y++ Y  + + S+G L  +++ L S  G A     +  GCG    G  L      G++GL
Sbjct: 183 YNIDYFSENTTSSGLLIEDSLHLNSREGHAPVNASVIIGCGRKQSGDYLDGIAPDGLLGL 242

Query: 225 GGGDIS--------------------------------------------LISQMRT--- 237
           G  DIS                                            L  +++T   
Sbjct: 243 GMADISVPSFLARAGLVRNSFSMCFKEDSSGRIFFGDQGVSSQQSTPFVPLYGKLQTYAV 302

Query: 238 ----TIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLEL 293
               +  G++ L  S+   ++DSGT+ T LP        +     I A  V     + + 
Sbjct: 303 NVDKSCIGHKCLEGSSFQALVDSGTSFTSLPPDVYKAFTTEFDKQINASRVPYEDSTWKY 362

Query: 294 CYSFNSLS--QVPEVTIHFRGADVKLSRSNFFVKVSED---IVCSVFKGITNSVPIYGNI 348
           CYS + L    VP + + F  A+      N  +  +++   +       + ++ PI   I
Sbjct: 363 CYSASPLEMPDVPTIILAF-AANKSFQAVNPILPFNDEQGALARFCLAVLPSTEPI--GI 419

Query: 349 MQTNFLVGY----DIEQQTVSFKPTDC 371
           +  NFLVGY    D E   + +  ++C
Sbjct: 420 IGQNFLVGYHVVFDRESMKLGWYRSEC 446


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 95/205 (46%), Gaps = 21/205 (10%)

Query: 8   VFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLN 66
           +F L FL   +V  +  +  G +V++ H  SP+SPF  S    ++  +   L     RL 
Sbjct: 5   LFSLAFLFLSLVQGLNTRGQGTTVKVFHVYSPQSPFRPSKPVSWEDSVLQMLAEDQARLQ 64

Query: 67  HFNQNSSISSSK-----ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEP 121
                SS+   K     AS   I+  +  Y+++ ++GTP    L   DT +D  W  C  
Sbjct: 65  FL---SSLVGRKSWVPIASGRQIV-QSPTYIVKANVGTPAQTFLMALDTSNDAAWIPCN- 119

Query: 122 CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGN 181
                C    S +F+   S+T+K+L C + QC  +   +C G  C ++ +YG GS    N
Sbjct: 120 ----GCVGCSSTVFNSVTSTTFKTLGCDAPQCKQVPNPTCGGSTCTWNTTYG-GSTILSN 174

Query: 182 LATETVTLGSTTGQAVALPGITFGC 206
           L  +T+ L +       +PG TFGC
Sbjct: 175 LTRDTIALSTDI-----VPGYTFGC 194


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 51/160 (31%), Positives = 76/160 (47%), Gaps = 12/160 (7%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W     C+ CP       D  L+D K S+T  ++ 
Sbjct: 78  YFAKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVG 137

Query: 148 CSSSQCASLNQ--KSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP---G 201
           C  + C+  +     C  G+ C YSV YGDGS + G    + V     +G     P    
Sbjct: 138 CDDNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGT 197

Query: 202 ITFGCGTNNGGLFNSKTT---GIVGLGGGDISLISQMRTT 238
           + FGCG    G   S +    GI+G G  + S++SQ+ ++
Sbjct: 198 VVFGCGNKQSGELGSSSEALDGILGFGQANSSMLSQLASS 237


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 46/112 (41%), Positives = 64/112 (57%), Gaps = 10/112 (8%)

Query: 126 QCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK--SCSGVNCQYSVSYGDGSFSNGNLA 183
           +C  + +P F P  SST+  LPC+SS C  L     +C+   C Y   YG G F+ G LA
Sbjct: 87  ECAARPAPPFQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYGMG-FTAGYLA 145

Query: 184 TETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
           TET+ +G  +      PG+ FGC T NG    + ++GIVGLG   +SL+SQ+
Sbjct: 146 TETLHVGGAS-----FPGVAFGCSTENG--VGNSSSGIVGLGRSPLSLVSQV 190



 Score = 51.6 bits (122), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/137 (29%), Positives = 61/137 (44%), Gaps = 18/137 (13%)

Query: 253 VIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPVADPTGS---LELCYSFNSLSQ-----V 303
           ++DSGTTLT+L  +GY     + +S M  A       G+    +LC+  N+        V
Sbjct: 275 IVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDANAAGGGSGVPV 334

Query: 304 PEVTIHFRGADVKLSRSNFFVKVSE-------DIVCSVFKGITN--SVPIYGNIMQTNFL 354
           P + + F G      R   +V V E        + C +    +   S+ I GN+MQ +  
Sbjct: 335 PTLVLRFAGGAEYAVRRRSYVGVVEVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDLH 394

Query: 355 VGYDIEQQTVSFKPTDC 371
           V YD++    SF P DC
Sbjct: 395 VLYDLDGGMFSFAPADC 411


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 68/207 (32%), Positives = 99/207 (47%), Gaps = 19/207 (9%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRD-ALTRSLNRLNHFNQNS-SISSSKASQADIIPNN 88
           + + H +S  SPF  S       L+D A    L+ L    ++S  I+S +A     I  +
Sbjct: 31  LRVFHINSQCSPFKTSVSWADTLLQDKARFLYLSSLAGVRKSSVPIASGRA-----IVQS 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C
Sbjct: 86  PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141

Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + QC      SC+   +C ++++YG GS     L  +T+TL S       +P  TFGC 
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
            N     +    G++GLG G +SLISQ
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQ 221


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 54/169 (31%), Positives = 80/169 (47%), Gaps = 25/169 (14%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           +Y++R  +G+P    L   DT +D  W  C PC    C    S LF P  S++Y  LPCS
Sbjct: 76  SYVVRAGLGSPAQPILLALDTSADATWAHCSPC--GTCPSSGS-LFAPANSTSYAPLPCS 132

Query: 150 SSQCASLNQKSCSGVN----------CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL 199
           S+ C  L  + C   +          C ++  + D SF   +LA++ + LG       A+
Sbjct: 133 STMCTVLQGQPCPAQDPYDSSAPLPMCAFTKPFADASF-QASLASDWLHLGKD-----AI 186

Query: 200 PGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGV 247
           P   FGC    +G   N    G++GLG G ++L+SQ+     GN   GV
Sbjct: 187 PNYAFGCVSAVSGPTANLPKQGLLGLGRGPMALLSQV-----GNMYNGV 230


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 77.8 bits (190), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 51/153 (33%), Positives = 74/153 (48%), Gaps = 17/153 (11%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ + IGTPP  +  + DTGS L W QC    P +     S +FDP +SS++  LPC+  
Sbjct: 78  LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKP--PPSTVFDPSLSSSFSVLPCNHP 135

Query: 152 QCA----SLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C          +   +N  C YS  Y DG+ + GNL  E +T   +T Q+   P +  G
Sbjct: 136 LCKPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITF--STSQST--PPLILG 191

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           C  +      S   GI+G+  G +S  SQ + T
Sbjct: 192 CAED-----ASDDKGILGMNLGRLSFASQAKIT 219


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 138/360 (38%), Gaps = 80/360 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP       +  L+D K S T K + 
Sbjct: 98  YYAKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVS 157

Query: 148 CSSSQCASLN----QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ---AVALP 200
           C    C ++N        + ++C Y+  Y DGS S G    + V     +G      A  
Sbjct: 158 CDQDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANG 217

Query: 201 GITFGCGTNNGGLFNSKTT--GIVGLG----------------------------GGDIS 230
            + FGC     G  +S+    GI+G G                            GG I 
Sbjct: 218 SVIFGCSATQSGDLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277

Query: 231 LISQ---------------------MRTTIAGNQRLGVSTP--DI------VIDSGTTLT 261
            I                       M+    G   L + T   D+      +IDSGTTL 
Sbjct: 278 AIGHIVQPKVNTTPLVPNQTHYNVNMKAVEVGGYFLNLPTDVFDVGDKKGTIIDSGTTLA 337

Query: 262 FLPQGYNSNLLSVMSSM---IEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKLS 318
           +LP+     LLS + S    ++   + D     +  YS +     P VT HF  +     
Sbjct: 338 YLPEVVYDQLLSKIFSWQSDLKVHTIHDQFTCFQ--YSESLDDGFPAVTFHFENSLYLKV 395

Query: 319 RSNFFVKVSEDIVCSVFK--GITN----SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             + ++   + + C  ++  G+ +    ++ + G++  +N LV YD+E Q + +   +C+
Sbjct: 396 HPHEYLFSYDGLWCIGWQNSGMQSRDRRNITLLGDLALSNKLVLYDLENQVIGWTEYNCS 455


>gi|326532354|dbj|BAK05106.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 564

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 87/331 (26%), Positives = 124/331 (37%), Gaps = 63/331 (19%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQ----DSPLFDPKMSSTYK 144
           Y   + +GTP T  +   DTGSDL W  C+   C P   Y +    D  ++ P  S+T +
Sbjct: 143 YYTWVDVGTPNTSFMVALDTGSDLFWVPCDCIECAPLAGYRETLDRDLGIYKPAESTTSR 202

Query: 145 SLPCSSSQCASLNQKSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            LPCS   C   +  S     C YS  Y  + + S+G L  + + L S    A     + 
Sbjct: 203 HLPCSHELCPPGSGCSSPKQPCPYSTDYLQENTTSSGLLIEDILHLDSRESHAPVKASVV 262

Query: 204 FGCGTNNGG--LFNSKTTGIVGLGGGDISLISQM------RTTIA------------GNQ 243
            GCG    G  L      G++GLG  DIS+ S +      R + +            G+Q
Sbjct: 263 IGCGRKQSGSYLDGIAPDGLLGLGMADISVPSFLARAGLVRNSFSMCFKEDSGRIFFGDQ 322

Query: 244 RLGV--STP------------------------------DIVIDSGTTLTFLPQGYNSNL 271
            + +  STP                              + ++DSGT+ T LP      +
Sbjct: 323 GVSIQQSTPFVPLYGKYQTYAVNVDKSCVGHKCFEATSFEALVDSGTSFTALPLNVYKAV 382

Query: 272 LSVMSSMIEAQPVADPTGSLELCYSFNSLS--QVPEVTIHFRG-ADVKLSRSNFFVKVSE 328
                  + A  +     S E CYS + L    VP VT+ F      +       +K  E
Sbjct: 383 AVEFDKQVHAPRITQEDASFEYCYSASPLKMPDVPTVTLTFAANKSFQAVNPTIVLKDGE 442

Query: 329 DIVCSVFKGITNSVPIYGNIMQTNFLVGYDI 359
             V      +  S    G I Q NFL GY I
Sbjct: 443 GSVAGFCLALQKSPEPIGIIGQ-NFLTGYHI 472


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 53/188 (28%), Positives = 88/188 (46%), Gaps = 16/188 (8%)

Query: 27  GGFSVELIHRDSPKSPFYNSSETPYQR--LRDALTRSLNRLNHFNQNSSISSSKA----S 80
            G ++++ H   P SP    +  P     L D  +R  +RL + +  ++   ++A    +
Sbjct: 40  AGNTLQVSHAFGPCSPLGPGTTAPSWAGFLADQASRDASRLLYLDSLAARGKARAYAPIA 99

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMS 140
               +     Y++R  +GTPP + L   DT +D  W  C  C  + C    +P FDP  S
Sbjct: 100 SGRQLLQTPTYVVRARLGTPPQQLLLAVDTSNDAAWIPCAGC--AGCPTSSAPPFDPAAS 157

Query: 141 STYKSLPCSSSQCASLNQKSC--SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           ++Y+S+PC S  CA     +C   G  C +S++Y D S     L+ +++ +    G AV 
Sbjct: 158 TSYRSVPCGSPLCAQAPNAACPPGGKACGFSLTYADSSL-QAALSQDSLAVA---GDAVK 213

Query: 199 LPGITFGC 206
               TFGC
Sbjct: 214 T--YTFGC 219


>gi|225438361|ref|XP_002273988.1| PREDICTED: aspartic proteinase Asp1 [Vitis vinifera]
 gi|296082608|emb|CBI21613.3| unnamed protein product [Vitis vinifera]
          Length = 426

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 136/359 (37%), Gaps = 84/359 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           Y + +SIG PP       DTGSDL W QC+ PC   +C     PL+ P  +       C 
Sbjct: 67  YYVSLSIGQPPKPYFLDPDTGSDLSWLQCDAPC--VRCTKAPHPLYRPNNNLVI----CK 120

Query: 150 SSQCASLN---QKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
              CASL+    K      C Y V Y DG  S G L  +   L  T G  +A P +  GC
Sbjct: 121 DPMCASLHPPGYKCEHPEQCDYEVEYADGGSSLGVLVKDVFPLNFTNGLRLA-PRLALGC 179

Query: 207 GTNN-GGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLG------------------- 246
           G +   G       G++GLG G  S++SQ+ +       +G                   
Sbjct: 180 GYDQIPGQSYHPLDGVLGLGKGKSSIVSQLHSQGVIRNVVGHCVSSRGGGFLFFGDDLYD 239

Query: 247 ----VSTPDI----------------------------VIDSGTTLTFLPQGYNSNLLSV 274
               V TP +                              DSG++ T+L       L+ +
Sbjct: 240 SSRVVWTPMLRDQHTHYSSGYAELILGGKTTVFKNLLVTFDSGSSYTYLNSLAYQALVHL 299

Query: 275 MSSMIEAQPVADPTG--SLELCYS----FNSLSQVPE----VTIHFRGA-------DVKL 317
           +   +  +PV +     +L LC+     F S+  V +    + + F G        D+ L
Sbjct: 300 VRKELSEKPVREALDDQTLPLCWRGKRPFKSVRDVKKFFKPLALSFPGGGRTKTQYDIPL 359

Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVP---IYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             S   + +  ++   +  G    +    + G+I   + +V YD E+  + + PT+C +
Sbjct: 360 -ESYLIISLKGNVCLGILNGTEAGLQDFNLIGDISMQDKMVVYDNEKNQIGWAPTNCDR 417


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 58/194 (29%), Positives = 93/194 (47%), Gaps = 17/194 (8%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ----RLRDALTRSLNRLNHFNQNSSISSSK 78
           + Q  G ++++IH  SP SPF  S    ++    +++   T  L  L+      SI    
Sbjct: 23  DVQDNGSTLQVIHVFSPCSPFRPSKPLSWEESVLQMQAKDTTRLQFLDSLVARKSIVP-I 81

Query: 79  ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPK 138
           AS   II  +  Y++R  IGTPP   L   DT +D  W  C  C    C    S LF P+
Sbjct: 82  ASGRQII-QSPTYIVRAKIGTPPQTLLLAMDTSNDAAWIPCTAC--DGC---ASTLFAPE 135

Query: 139 MSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            S+T+K++ C++ +C  +    C   +  ++++YG  S +  NL  +T+TL +       
Sbjct: 136 KSTTFKNVSCAAPECKQVPNPGCGVSSRNFNLTYGSSSIA-ANLVQDTITLATD-----P 189

Query: 199 LPGITFGCGTNNGG 212
           +P  TFGC +   G
Sbjct: 190 VPSYTFGCVSKTTG 203


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 72/153 (47%), Gaps = 17/153 (11%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           L+ + IGTPP  +  + DTGS L W QC    P +     S +FDP +SS++  LPC+  
Sbjct: 83  LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRK--PPPSSVFDPSLSSSFSVLPCNHP 140

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L         C YS  Y DG+ + GNL  E +T      ++ + P +  G
Sbjct: 141 LCKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITF----SRSQSTPPLILG 196

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           C        +S   GI+G+  G +S  SQ + T
Sbjct: 197 CAEE-----SSDAKGILGMNLGRLSFASQAKLT 224


>gi|255586860|ref|XP_002534040.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223525947|gb|EEF28344.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 518

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/359 (26%), Positives = 145/359 (40%), Gaps = 94/359 (26%)

Query: 95  ISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQ--CYMQDSPL--FDPKMSSTYKSLPC 148
           + +GTP  + +   DTGSDL W  C+   C P+Q   Y  D  L  +DPK SST K + C
Sbjct: 105 VELGTPGMKFMVALDTGSDLFWVPCDCSKCAPTQGVAYASDFELSIYDPKQSSTSKKVTC 164

Query: 149 SSSQCASLNQKSCSGVNCQYSVSYGDGSFS-NGNLATETVTLGSTTGQAVALPG-ITFGC 206
           +++ CA  N+   +  +C Y VSY     S +G L  + + L S      ++   +TFGC
Sbjct: 165 NNNLCAHRNRCLGTFSSCPYMVSYVSAQTSTSGILVEDVLHLTSEDSNQESIKAYVTFGC 224

Query: 207 GTNNGGLF--NSKTTGIVGLGGGDISLIS------------------------------- 233
           G    G F   +   G+ GLG   IS+ S                               
Sbjct: 225 GQVQSGSFLNTAAPNGLFGLGMDQISVPSILSREGLTADSFSMCFGHDGVGRISFGDKGS 284

Query: 234 --QMRTTIAGNQ------------RLGVSTPDI----VIDSGTTLTFLPQGYNSNLLSVM 275
             Q  T    N             R+G +  D+    + DSGT+ T+L     + + +++
Sbjct: 285 PDQEETPFNSNPSHPSYNISVTQVRVGTTLVDVDFTALFDSGTSFTYLI----NPIYAMV 340

Query: 276 SSMIEAQPV-----ADPTGSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSNFFV--- 324
           S    AQ        DP    E CY  +   + S +P +++  +G      R +F V   
Sbjct: 341 SENFHAQAQDKRRPPDPRIPFEYCYDMSPGANSSLIPSMSLTMKG------RGHFTVFDP 394

Query: 325 -----KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDCTKQ 374
                  +E + C     I  S  +  NI+  NF+ GY    D E+  + +K TDC  Q
Sbjct: 395 IIVITTQNELVYC---LAIVKSTEL--NIIGQNFMTGYRVVFDREKLVLGWKETDCYDQ 448


>gi|356515904|ref|XP_003526637.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 421

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 152/365 (41%), Gaps = 93/365 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLP-C 148
           Y + ++IG PP       DTGSDL W QC+ PC    C +  + L+ P  +      P C
Sbjct: 64  YTVSLAIGNPPKVYDLDIDTGSDLTWVQCDAPC--QGCTIPRNRLYKPNGNLVKCGDPLC 121

Query: 149 SSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
            + Q A      C+G N  C Y V Y D   S G L  + + L  T G ++A P + FGC
Sbjct: 122 KAIQSAP--NHHCAGPNEQCDYEVEYADQGSSLGVLLRDNIPLKFTNG-SLARPILAFGC 178

Query: 207 GTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---QRLG--------- 246
           G +    G   ++ T G++GLG G  S++SQ+ +      + G+   +R G         
Sbjct: 179 GYDQKHVGHNPSASTAGVLGLGNGKTSILSQLHSLGLIRNVVGHCLSERGGGFLFFGDQL 238

Query: 247 ------VSTP-----------------------------DIVIDSGTTLTFLPQGYNSNL 271
                 V TP                              ++ DSG++ T+     +  L
Sbjct: 239 VPQSGVVWTPLLQSSSTQHYKTGPADLFFDRKPTSVKGLQLIFDSGSSYTYFNSKAHKAL 298

Query: 272 LSVMSSMIEAQPVADPT--GSLELCYS----FNSLSQVPEVTIHFRGADVKLSRSN---- 321
           ++++++ +  +P++  T   SL +C+     F SL    +VT +F+   +  ++S     
Sbjct: 299 VNLVTNDLRGKPLSRATEDSSLPICWRGPKPFKSLH---DVTSNFKPLLLSFTKSKNSLL 355

Query: 322 ------FFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKP 368
                 + +      VC        +  G TN   I G+I   + LV YD E+Q + +  
Sbjct: 356 QLPPEAYLIVTKHGNVCLGILDGTEIGLGNTN---IIGDISLQDKLVIYDNEKQQIGWAS 412

Query: 369 TDCTK 373
            +C +
Sbjct: 413 ANCDR 417


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 88/212 (41%), Gaps = 16/212 (7%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQ--ADIIPNN 88
           ++L HRD+           P  R+ D +     R +  ++             + I    
Sbjct: 33  LKLAHRDT-------LWPNPLSRIEDIIGADQKRHSLISRKRKFKGGVKMDLGSGIDYGT 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
           A Y   + +GTP  +   V DTGS+L W  C      +  +++  +F  + S ++K++ C
Sbjct: 86  AQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGC 145

Query: 149 SSSQCA-------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
            +  C        SL+        C Y   Y DGS + G  A ET+T+G T G+   L G
Sbjct: 146 FTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRG 205

Query: 202 ITFGCGTNNGGLFNSKTTGIVGLGGGDISLIS 233
           +  GC ++  G       G++GL   D S  S
Sbjct: 206 LLVGCSSSFSGQSFQGADGVLGLAFSDFSFTS 237



 Score = 63.9 bits (154), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 40/127 (31%), Positives = 68/127 (53%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQG-YNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVT 307
           ++DSGT+LT L +  Y   +  +   ++E + V      +E C+S    FN  S++P++T
Sbjct: 324 ILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIPIEYCFSSTSGFNE-SKLPQLT 382

Query: 308 IHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGI-TNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            H +G A  +  R ++ V  +  + C  F    T +  + GNIMQ N+L  +D+   T+S
Sbjct: 383 FHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVVGNIMQQNYLWEFDLMASTLS 442

Query: 366 FKPTDCT 372
           F P+ CT
Sbjct: 443 FAPSTCT 449


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 142/360 (39%), Gaps = 78/360 (21%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP +     +   +D + S+T K + 
Sbjct: 87  YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C    C  +N    SG    ++C Y   YGDGS + G    + V     +G  +  A  G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206

Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT----------IAGNQRLG 246
            I FGCG    G   S       GI+G G  + S+ISQ+ +T          + G    G
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCLDGTNGGG 266

Query: 247 VSTPDIVIDSGTTLT-FLPQ--GYNSNLLSVMSSMI------EAQPVADPTGSL------ 291
           +     V+     +T  +P    YN N+  V    I      +     D  G++      
Sbjct: 267 IFAMGHVVQPKVNMTPLVPNQPHYNVNMTGVQVGHIILNISADVFEAGDRKGTIIDSGTT 326

Query: 292 -----ELCYS---FNSLSQ-------------------------VPEVTIHFRGADVKLS 318
                EL Y       LSQ                          P V  HF  + +   
Sbjct: 327 LAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQYSERVDDGFPPVIFHFENSLLLKV 386

Query: 319 RSNFFVKVSEDIVCSVFK--GI----TNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
             + ++   E++ C  ++  G+      +V ++G+++ +N LV YD+E QT+ +   +C+
Sbjct: 387 YPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDLENQTIGWTEYNCS 446


>gi|361067983|gb|AEW08303.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
          Length = 154

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/153 (33%), Positives = 72/153 (47%), Gaps = 12/153 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G ++ L H     SP    + + +  L   +L R  +RL      +S   +  S   +  
Sbjct: 3   GVNIRLDHIHGACSPLRPINSSSWIDLVSQSLERDDDRLKTIRSRNSSPYTTMSNLPLQS 62

Query: 87  NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            N     NY++   +GTP    L + DTGSDL W QC+PC    CY Q  P+F+P+ SS+
Sbjct: 63  GNKVGTGNYIVTAGLGTPTKNFLLIIDTGSDLTWIQCKPC--LDCYSQVDPIFEPRQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
           YKSLPC S+ C  L     N   C    C Y +
Sbjct: 121 YKSLPCLSATCTELLISESNLTPCLLGGCSYEI 153


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 149/372 (40%), Gaps = 90/372 (24%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC    S          
Sbjct: 84  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 141

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           P F P +SSTY  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  
Sbjct: 142 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 196

Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GN 242
           +   +      FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+       G 
Sbjct: 197 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 254

Query: 243 QRLGVST---------PDIVID-------------------SGTTLTFLPQGYNSNLLSV 274
             +G  T         PD+V                     +G  L   P+ +NS   +V
Sbjct: 255 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 314

Query: 275 MSS------------MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PE 305
           + S            +     V +   SL           ++C++      + LS+V P+
Sbjct: 315 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 374

Query: 306 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   
Sbjct: 375 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 434

Query: 362 QTVSFKPTDCTK 373
           + + F  T+C++
Sbjct: 435 EKIGFWKTNCSE 446


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 91/201 (45%), Gaps = 30/201 (14%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           + Q  G ++ + H DSP SPF +SS   ++ R+   L +   RL +        SS  + 
Sbjct: 29  KTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYL-------SSLVAG 81

Query: 82  ADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP--PSQCYMQD 131
             ++P         +  Y+++  IGTP    L   DT SD+ W  C  C   PS      
Sbjct: 82  RSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTA--- 138

Query: 132 SPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGS 191
              F P  S+++K++ CS+ QC  +   +C    C ++++YG  S +  NL+ +T+ L  
Sbjct: 139 ---FSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLA- 193

Query: 192 TTGQAVALPGITFGCGTNNGG 212
               A  +   TFGC     G
Sbjct: 194 ----ADPIKAFTFGCVNKVAG 210


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 64/207 (30%), Positives = 95/207 (45%), Gaps = 19/207 (9%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSI--SSSKASQADIIPNN 88
           + + H +S  SPF  S         D L +   R  + +  + +  SS   +    I  +
Sbjct: 31  LRVFHINSLCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVRKSSVPIASGRAIVQS 85

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPC 148
             Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L C
Sbjct: 86  PTYIVRANIGTPAQPMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQC 141

Query: 149 SSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCG 207
            + QC      SC+   +C ++++YG GS     L  +T+TL S       +P  TFGC 
Sbjct: 142 EAPQCKQAPNPSCTVSKSCGFNMTYG-GSTIEAYLTQDTLTLASDV-----IPNYTFGC- 194

Query: 208 TNNGGLFNSKTTGIVGLGGGDISLISQ 234
            N     +    G++GLG G +SLISQ
Sbjct: 195 INKASGTSLPAQGLMGLGRGPLSLISQ 221


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/252 (28%), Positives = 111/252 (44%), Gaps = 31/252 (12%)

Query: 8   VFILFFLCFYVVSPIEA---------QTGGFSVELIHRDSPKSPFYNSSETPY-QRLRDA 57
           +F  F     VVS  +A         ++ G  + +IH     SPF       +   + + 
Sbjct: 3   IFTAFVFLTLVVSTTKAFDPCASPSSESKGSDLSVIHVYGQCSPFNQHKAGSWVNTVINM 62

Query: 58  LTRSLNRLNHFNQNSSISSSKASQADI-----IPNNANYLIRISIGTPPTERLAVADTGS 112
            ++   R+ + +  S ++S KA+   I     + N  NY++R+ +GTP      V DT  
Sbjct: 63  ASKDPARVTYLS--SLVASPKATSVPIASGQQVLNIGNYVVRVKLGTPGQLMFMVLDTSR 120

Query: 113 DLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSC---SGVNCQYS 169
           D  W  C  C  + C    SP F P  SSTY SL CS  QC  +   SC       C ++
Sbjct: 121 DAAWVPCADC--AGC---SSPTFSPNTSSTYASLQCSVPQCTQVRGLSCPTTGTAACFFN 175

Query: 170 VSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDI 229
            +YG  S  +  L+ +++ L   T     LP  +FGC     G       G++GLG G +
Sbjct: 176 QTYGGDSSFSAMLSQDSLGLAVDT-----LPSYSFGCVNAVSG-STLPPQGLLGLGRGPM 229

Query: 230 SLISQMRTTIAG 241
           SL+SQ  +  +G
Sbjct: 230 SLLSQSGSLYSG 241



 Score = 40.4 bits (93), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 28/112 (25%), Positives = 49/112 (43%), Gaps = 7/112 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRG 312
           +IDSGT +T   +   + +       ++  P A   G+ + C++  +    P VT HF G
Sbjct: 317 IIDSGTVITRFVEPVYAAIRDEFRKQVKG-PFAT-IGAFDTCFAATNEDIAPPVTFHFTG 374

Query: 313 ADVKLSRSNFFVKVSE-DIVCSVFKGITNSV----PIYGNIMQTNFLVGYDI 359
            D+KL   N  +  S   + C       N+V     +  N+ Q N  + +D+
Sbjct: 375 MDLKLPLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDV 426


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 54/199 (27%), Positives = 91/199 (45%), Gaps = 26/199 (13%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQ 81
           + Q  G ++ + H DSP SPF +SS   ++ R+   L +   RL +        SS  + 
Sbjct: 45  KTQDQGSTLRIFHIDSPCSPFKSSSPLSWEARVLQTLAQDQARLQYL-------SSLVAG 97

Query: 82  ADIIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP 133
             ++P         +  Y+++  IGTP    L   DT SD+ W  C  C         + 
Sbjct: 98  RSVVPIASGRQMLQSTTYIVKALIGTPAQPLLLAMDTSSDVAWIPCSGC----VGCPSNT 153

Query: 134 LFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTT 193
            F P  S+++K++ CS+ QC  +   +C    C ++++YG  S +  NL+ +T+ L +  
Sbjct: 154 AFSPAKSTSFKNVSCSAPQCKQVPNPTCGARACSFNLTYGSSSIA-ANLSQDTIRLAADP 212

Query: 194 GQAVALPGITFGCGTNNGG 212
            +A      TFGC     G
Sbjct: 213 IKA-----FTFGCVNKVAG 226


>gi|376337722|gb|AFB33417.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337724|gb|AFB33418.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337726|gb|AFB33419.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337728|gb|AFB33420.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337730|gb|AFB33421.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337732|gb|AFB33422.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
          Length = 154

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 53/153 (34%), Positives = 72/153 (47%), Gaps = 12/153 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS----QA 82
           G ++ L H     SP   ++ + +  L   +L R  +RL      +S   +  S    Q+
Sbjct: 3   GVNIRLDHIHGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
                  NY++    GTP  + L V DTGSDL W QC+PC    CY Q  P+FDP  SS+
Sbjct: 63  GSEVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPC--LGCYSQVDPIFDPSQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
           YKSLPC S+ C  L     N   C    C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCLLGGCSYEI 153


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 90/202 (44%), Gaps = 18/202 (8%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNAN-YLIRISIGTPPTERLAV 107
            R+  A  ++ +R  H      ++        Q    PN+   Y  ++ +GTPP E    
Sbjct: 35  HRVEVAALKARDRARHARMLRGVAGGVVDFSVQGTSDPNSVGLYYTKVKMGTPPKEFNVQ 94

Query: 108 ADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS---C 161
            DTGSD++W  C     CP S     +   FD   SST   +PCS   C S  Q +   C
Sbjct: 95  IDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSDPICTSRVQGAAAEC 154

Query: 162 S-GVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PGITFGCGTNNGGLF-- 214
           S  VN C Y+  YGDGS ++G   ++ +      GQ  A+     I FGC  +  G    
Sbjct: 155 SPRVNQCSYTFQYGDGSGTSGYYVSDAMYFSLIMGQPPAVNSSATIVFGCSISQSGDLTK 214

Query: 215 -NSKTTGIVGLGGGDISLISQM 235
            +    GI G G G +S++SQ+
Sbjct: 215 TDKAVDGIFGFGPGPLSVVSQL 236



 Score = 38.1 bits (87), Expect = 7.1,   Method: Compositional matrix adjust.
 Identities = 30/127 (23%), Positives = 60/127 (47%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQV-PEVTIHF 310
           ++D GTTL +L Q     L++ +++ + +Q           CY  + S+  + P V+++F
Sbjct: 313 IVDCGTTLAYLIQEAYDPLVTAINTAV-SQSARQTNSKGNQCYLVSTSIGDIFPSVSLNF 371

Query: 311 RG-ADVKLSRSNFFVK----VSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
            G A + L    + +        ++ C  F+       I G+++  + +V YDI QQ + 
Sbjct: 372 EGGASMVLKPEQYLMHNGYLDGAEMWCIGFQKFQEGASILGDLVLKDKIVVYDIAQQRIG 431

Query: 366 FKPTDCT 372
           +   DC+
Sbjct: 432 WANYDCS 438


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/372 (25%), Positives = 149/372 (40%), Gaps = 90/372 (24%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---------- 132
           D +  N  Y  R+ IGTP  E   + D+GS + +  C  C   QC    S          
Sbjct: 83  DDLLTNGYYTTRLYIGTPSQEFALIVDSGSTVTYVPCATC--EQCGNHQSESPNIIEAHD 140

Query: 133 PLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST 192
           P F P +SSTY  + C+   C   N++S     C Y   Y + S S+G L  + ++ G  
Sbjct: 141 PRFQPDLSSTYSPVKCNVD-CTCDNERS----QCTYERQYAEMSSSSGVLGEDIMSFGKE 195

Query: 193 TGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQM--RTTIA-------GN 242
           +   +      FGC  T  G LF+    GI+GLG G +S++ Q+  +  I+       G 
Sbjct: 196 S--ELKPQRAVFGCENTETGDLFSQHADGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGG 253

Query: 243 QRLGVST---------PDIVID-------------------SGTTLTFLPQGYNSNLLSV 274
             +G  T         PD+V                     +G  L   P+ +NS   +V
Sbjct: 254 MDVGGGTMVLGGMPAPPDMVFSHSNPVRSPYYNIELKEIHVAGKALRLDPKIFNSKHGTV 313

Query: 275 MSS------------MIEAQPVADPTGSL-----------ELCYS-----FNSLSQV-PE 305
           + S            +     V +   SL           ++C++      + LS+V P+
Sbjct: 314 LDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPDPNYKDICFAGAGRNVSQLSEVFPD 373

Query: 306 VTIHF-RGADVKLSRSNFFVKVS--EDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           V + F  G  + LS  N+  + S  E   C  VF+   +   + G I+  N LV YD   
Sbjct: 374 VDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHN 433

Query: 362 QTVSFKPTDCTK 373
           + + F  T+C++
Sbjct: 434 EKIGFWKTNCSE 445


>gi|222623568|gb|EEE57700.1| hypothetical protein OsJ_08178 [Oryza sativa Japonica Group]
          Length = 441

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 75/149 (50%), Gaps = 24/149 (16%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQN-SSISSSKASQADI-- 84
           G  +EL H  SP SP    ++ P+  +   LT    R++      +   S++A+  D   
Sbjct: 42  GLHLELHHPRSPCSPAPVPADLPFTAV---LTHDDARISSLAARLAKTPSARATSLDADA 98

Query: 85  ----------IPNN-------ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC 127
                     +P +        NY+ R+ +GTP T+ + V DTGS L W QC PC  S C
Sbjct: 99  DAGLAGSLASVPLSPGASVGVGNYVTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVS-C 157

Query: 128 YMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           + Q  P+F+PK SSTY S+ CS+ QC+ L
Sbjct: 158 HRQSGPVFNPKSSSTYASVGCSAQQCSDL 186



 Score = 47.4 bits (111), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 34/115 (29%), Positives = 49/115 (42%), Gaps = 3/115 (2%)

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN-SLSQVPEVTIHFRG-ADVKL 317
           +T LP    S L   +++ ++    A     L+ C+    S    P VT+ F G A +KL
Sbjct: 328 ITRLPTSVYSALSKAVAAAMKGTSRASAYSILDTCFKGQASRVSAPAVTMSFAGGAALKL 387

Query: 318 SRSNFFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           S  N  V V +   C  F     S  I GN  Q  F V YD++   + F    C+
Sbjct: 388 SAQNLLVDVDDSTTCLAF-APARSAAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 441


>gi|361067981|gb|AEW08302.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156226|gb|AFG60348.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156228|gb|AFG60350.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156229|gb|AFG60351.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156230|gb|AFG60352.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156231|gb|AFG60353.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156232|gb|AFG60354.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156233|gb|AFG60355.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156235|gb|AFG60357.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156237|gb|AFG60359.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156238|gb|AFG60360.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156240|gb|AFG60362.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156241|gb|AFG60363.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
          Length = 154

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 12/154 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G ++ L H     SP   ++ + +  L   +L R  +RL      +S   +  S   +  
Sbjct: 3   GVNIRLDHIHGACSPLRPANSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62

Query: 87  NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            N     NY++    GTP  + L + DTGSDL W QC+PC    CY Q  P+F+P  SS+
Sbjct: 63  GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSVS 171
           YKSLPC S+ C  L     N   C    C Y ++
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEIN 154


>gi|357143901|ref|XP_003573095.1| PREDICTED: aspartic proteinase-like protein 1-like [Brachypodium
           distachyon]
          Length = 627

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 99/388 (25%), Positives = 145/388 (37%), Gaps = 74/388 (19%)

Query: 54  LRDALTRSLNRLNHFNQNSSISSSKASQADIIPNNAN----YLIRISIGTPPTERLAVAD 109
           +R  L R   RL    ++  +S SK     IIP   +    Y   + +GTP T  +   D
Sbjct: 170 VRSDLQRQKRRLGG-GKHQLLSFSK--DGGIIPTGNDFGWLYYTWVDVGTPNTSFMVALD 226

Query: 110 TGSDLIWTQCE--PCPPSQCYM----QDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSG 163
           TGSDL W  C+   C P   Y     +D  ++ P  S+T + LPCS   C   +  +   
Sbjct: 227 TGSDLFWIPCDCIECAPLSGYHGSLDRDLGIYKPAESTTSRHLPCSHELCLLGSDCTNQK 286

Query: 164 VNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG--LFNSKTTG 220
             C Y+  Y  + + S+G L  + + L S    A     +  GCG    G  L      G
Sbjct: 287 QPCPYNTKYLQENTTSSGLLVEDILHLDSRESHAPVKASVIIGCGRKQSGSYLDGIAPDG 346

Query: 221 IVGLGGGDISLISQM-RTTIAGNQ----------RL-----GVSTP-------------- 250
           ++GLG  DIS+ S + R  +  N           R+     GVST               
Sbjct: 347 LLGLGMADISVPSFLARAGLVRNSFSMCFTKDSGRIFFGDQGVSTQQSTPFVPLYGKLQT 406

Query: 251 --------------------DIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGS 290
                                 ++DSGT+ T LP      +       + A  +     S
Sbjct: 407 YTVNVDKSCVGHKCFESTSFQAIVDSGTSFTALPLDIYKAVAIEFDKQVNASRLPQEATS 466

Query: 291 LELCYSFNSL--SQVPEVTIHFRG-ADVKLSRSNFFVKVSEDIVCSVFKGITNSVPIYGN 347
            + CYS + L    VP VT+ F G    +     F +   E  V      +  S    G 
Sbjct: 467 FDYCYSASPLVMPDVPTVTLTFAGNKSFQPVNPTFLLHDEEGAVAGFCLAVVQSPEPIGI 526

Query: 348 IMQTNFLVGY----DIEQQTVSFKPTDC 371
           I Q NFL+GY    D E   + +  ++C
Sbjct: 527 IAQ-NFLLGYHVVFDRENMKLGWYRSEC 553


>gi|6580159|emb|CAB62657.2| putative protein [Arabidopsis thaliana]
          Length = 475

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 126/304 (41%), Gaps = 29/304 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM--------QDSP--LFDPKMS 140
           Y   +S+GTPP+  L   DTGSDL W  C  C  + C          Q  P  L+ P  S
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCN-C-GTTCIRDLEDIGVPQSVPLNLYTPNAS 159

Query: 141 STYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           +T  S+ CS  +C     K CS  +  C Y +SY + + + G L  + + L +       
Sbjct: 160 TTSSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTP 217

Query: 199 LPG-ITFGCGTNNGGLF--NSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVID 255
           +   +T GCG    GLF  N+   G++GLG    S+ S +           +    ++ +
Sbjct: 218 VKANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGN 277

Query: 256 SGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFN---SLSQVPEVTIHFRG 312
            G  ++F  +GY     +   S+   +   DP    E CY  +   +  Q P V + F G
Sbjct: 278 VG-RISFGDRGYTDQEETPFISVAPRRRPVDPELPFEFCYDLSPNATTIQFPLVEMTFIG 336

Query: 313 ADVKLSRSNFF-----VKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFK 367
               +  + FF      +  E  V     G+  SV +  N     + + +D E+  + +K
Sbjct: 337 GSKIILNNPFFTARTQARHGEGNVMYCL-GVLKSVGLKINNFVAGYRIVFDRERMILGWK 395

Query: 368 PTDC 371
            + C
Sbjct: 396 QSLC 399


>gi|383156225|gb|AFG60347.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
 gi|383156227|gb|AFG60349.1| Pinus taeda anonymous locus 2_5996_01 genomic sequence
          Length = 154

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/154 (33%), Positives = 73/154 (47%), Gaps = 12/154 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKASQADIIP 86
           G ++ L H     SP   ++ + +  L   +L R  +RL      +S   +  S   +  
Sbjct: 3   GVNIRLDHIHGACSPLRPANSSKWIDLISQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62

Query: 87  NN----ANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
            N     NY++    GTP  + L + DTGSDL W QC+PC    CY Q  P+F+P  SS+
Sbjct: 63  GNKVGTGNYIVTAGFGTPTKKFLLIIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSVS 171
           YKSLPC S+ C  L     N   C    C Y ++
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCFLGGCSYEIN 154


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 77.0 bits (188), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/161 (36%), Positives = 80/161 (49%), Gaps = 16/161 (9%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W  C     CP S     +   FD   SST   +P
Sbjct: 84  YTTKVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVP 143

Query: 148 CSSSQCASLNQKS---CS-GVN-CQYSVSYGDGSFSNGNLATET----VTLGSTTGQAVA 198
           CS   CAS  Q +   CS  VN C Y+  Y DGS ++G   ++     + LG +T   VA
Sbjct: 144 CSDPMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVA 203

Query: 199 LPG-ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
               I FGC T   G     +    GI+G G G++S++SQ+
Sbjct: 204 SSATIVFGCSTYQSGDLTKTDKAVDGILGFGPGELSVVSQL 244


>gi|222613194|gb|EEE51326.1| hypothetical protein OsJ_32296 [Oryza sativa Japonica Group]
          Length = 309

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 41/117 (35%), Positives = 61/117 (52%), Gaps = 7/117 (5%)

Query: 55  RDALTRSLNRLNHFNQNSSISSSKASQADIIP---NNANYLIRISIGTPPTERLAVADTG 111
            D L R L +       +   ++ A  A ++P   +   Y+   +IGTPP    A+ D  
Sbjct: 25  HDDLRRGLEQATRGRLLAD--ATPAGGAAVVPIRWSPPYYVANFTIGTPPQPASAIVDVA 82

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQY 168
            +L+WTQC  C   +C+ QD P+F P  SST+K  PC ++ C S+  +SCSG  C Y
Sbjct: 83  GELVWTQCSAC--RRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPTRSCSGDVCSY 137


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 53/197 (26%), Positives = 90/197 (45%), Gaps = 26/197 (13%)

Query: 25  QTGGFSVELIHRDSPKSPFYNSSETPYQ-RLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           Q  G ++ + H DSP SPF + S   ++ R+   L +   RL +        SS  +   
Sbjct: 31  QDQGSTLRIFHIDSPCSPFKSPSPLSWEARVLQTLAQDQARLQYL-------SSLVAGRS 83

Query: 84  IIP--------NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLF 135
           ++P         +  Y++++ IGTP    L   DT SD+ W  C  C         +  F
Sbjct: 84  VVPIASGRQMLQSTTYIVKVLIGTPAQPLLLAMDTSSDVAWIPCSGC----VGCPSNTAF 139

Query: 136 DPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQ 195
            P  S+++K++ CS+ QC  +   +C    C ++++YG  S +  NL+ +T+ L +   +
Sbjct: 140 SPAKSTSFKNVSCSAPQCKQVPNPACGARACSFNLTYGSSSIA-ANLSQDTIRLAADPIK 198

Query: 196 AVALPGITFGCGTNNGG 212
           A      TFGC     G
Sbjct: 199 A-----FTFGCVNKVAG 210


>gi|42565826|ref|NP_190703.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332645261|gb|AEE78782.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 528

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 85/354 (24%), Positives = 137/354 (38%), Gaps = 76/354 (21%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--------LFDPKMSST 142
           Y   +S+GTPP+  L   DTGSDL W  C         ++D          L+ P  S+T
Sbjct: 102 YYANVSVGTPPSSFLVALDTGSDLFWLPCNCGTTCIRDLEDIGVPQSVPLNLYTPNASTT 161

Query: 143 YKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP 200
             S+ CS  +C     K CS  +  C Y +SY + + + G L  + + L +       + 
Sbjct: 162 SSSIRCSDKRC--FGSKKCSSPSSICPYQISYSNSTGTKGTLLQDVLHLATEDENLTPVK 219

Query: 201 G-ITFGCGTNNGGLF--NSKTTGIVGLG-------------------------------- 225
             +T GCG    GLF  N+   G++GLG                                
Sbjct: 220 ANVTLGCGQKQTGLFQRNNSVNGVLGLGIKGYSVPSLLAKANITANSFSMCFGRVIGNVG 279

Query: 226 ---GGDISLISQMRT---TIAGNQRLGVSTPDIVI-------------DSGTTLTFLPQG 266
               GD     Q  T   ++A +   GV+   + +             D+G++ T L + 
Sbjct: 280 RISFGDRGYTDQEETPFISVAPSTAYGVNISGVSVAGDPVDIRLFAKFDTGSSFTHLREP 339

Query: 267 YNSNLLSVMSSMIE--AQPVADPTGSLELCYSFN---SLSQVPEVTIHFRGADVKLSRSN 321
               L      ++E   +PV DP    E CY  +   +  Q P V + F G    +  + 
Sbjct: 340 AYGVLTKSFDELVEDRRRPV-DPELPFEFCYDLSPNATTIQFPLVEMTFIGGSKIILNNP 398

Query: 322 FFVKVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGY----DIEQQTVSFKPTDC 371
           FF   +++       G+  SV +  N++  NF+ GY    D E+  + +K + C
Sbjct: 399 FFTARTQEGNVMYCLGVLKSVGLKINVIGQNFVAGYRIVFDRERMILGWKQSLC 452


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 97/208 (46%), Gaps = 21/208 (10%)

Query: 31  VELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---ASQADIIPN 87
           + + H +S  SPF  S         D L +   R  + +  + ++ S    AS   I+  
Sbjct: 31  LRVFHINSQCSPFKTSVS-----WADTLLQDKARFLYLSSLAGVTKSSVPIASGRGIV-Q 84

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +  Y++R +IGTP    L   DT +D  W  C  C         S LFDP  SS+ ++L 
Sbjct: 85  SPTYIVRANIGTPAQAMLVALDTSNDAAWIPCSGC----VGCSSSVLFDPSKSSSSRTLQ 140

Query: 148 CSSSQCASLNQKSCS-GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC 206
           C + QC      SC+   +C ++++YG GS     L  +T+TL +       +P  TFGC
Sbjct: 141 CEAPQCKQAPNPSCTVSKSCGFNMTYG-GSAIEAYLTQDTLTLATDV-----IPNYTFGC 194

Query: 207 GTNNGGLFNSKTTGIVGLGGGDISLISQ 234
             N     +    G++GLG G +SLISQ
Sbjct: 195 -INKASGTSLPAQGLMGLGRGPLSLISQ 221


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/163 (33%), Positives = 75/163 (46%), Gaps = 22/163 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IG+PP       DTGSD++W     C+ CP       +   +DP  S T  ++ 
Sbjct: 85  YYTRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVG 142

Query: 148 CSSSQCASLNQKSCSGV---------NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
           C    C +    + SGV          CQ+ ++YGDGS + G   T+ V     +G    
Sbjct: 143 CEQEFCVA--NSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQT 200

Query: 199 LP---GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM 235
            P    ITFGCG   GG   S +    GI+G G  D S++SQ+
Sbjct: 201 TPSNVSITFGCGAQLGGDLGSSSQALDGILGFGQSDASMLSQL 243



 Score = 42.0 bits (97), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 36/131 (27%), Positives = 68/131 (51%), Gaps = 15/131 (11%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP-VADPTGSLELCYSFN-SL-SQVPEVTIH 309
           +IDSGTTL +LP+     LL   +++ +  P +A       +C+ F+ SL  + P +T  
Sbjct: 319 IIDSGTTLAYLPREVYRTLL---TAVFDKHPDLAVRNYEDFICFQFSGSLDEEFPVITFS 375

Query: 310 FRGADVKLSR--SNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQ 361
           F G D+ L+    ++  +   D+ C  F   G+       + + G+++ +N LV YD+E+
Sbjct: 376 FEG-DLTLNVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEK 434

Query: 362 QTVSFKPTDCT 372
           Q + +   +C+
Sbjct: 435 QVIGWTDYNCS 445


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 66/249 (26%), Positives = 104/249 (41%), Gaps = 37/249 (14%)

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDP 137
           D   N   Y++  S+GTPP     V D  SD +W QC  C  + C         +P F  
Sbjct: 89  DPATNTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYA 146

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSFSN--GNLATETVTLGSTT 193
            +SST + + C++  C  L  ++CS  +  C YS  YG G+ +   G LA +     +  
Sbjct: 147 FLSSTIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-- 204

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIV 253
              V   G+ FGC     G       G++GLG G++SL+SQ++      +      PD  
Sbjct: 205 ---VRADGVIFGCAVATEG----DIGGVIGLGRGELSLVSQLQI----GRFSYYLAPDDA 253

Query: 254 IDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGA 313
           +D G+ + FL         +V + ++  +             +  SL  V    I   G 
Sbjct: 254 VDVGSFILFLDDAKPRTSRAVSTPLVANR-------------ASRSLYYVELAGIRVDGE 300

Query: 314 DVKLSRSNF 322
           D+ + R  F
Sbjct: 301 DLAIPRGTF 309


>gi|21717171|gb|AAM76364.1|AC074196_22 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433290|gb|AAP54828.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|125532789|gb|EAY79354.1| hypothetical protein OsI_34483 [Oryza sativa Indica Group]
          Length = 382

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 54/146 (36%), Positives = 78/146 (53%), Gaps = 8/146 (5%)

Query: 96  SIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
           +IGTPP    A  D G  L+WTQC  C  S C+ Q+ P FDP  SSTY+  PC ++ C  
Sbjct: 29  TIGTPPQPASAFIDVGGLLVWTQCSQCSSSSCFNQELPPFDPTKSSTYRPEPCGTALCEF 88

Query: 156 L--NQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGG 212
              + ++CSG  C Y  S      ++G + T+ V +G+ T  +VA     FGC   ++  
Sbjct: 89  FPASIRNCSGDVCAYEASTQLFEHTSGKIGTDAVAIGTATAASVA-----FGCVMASDIK 143

Query: 213 LFNSKTTGIVGLGGGDISLISQMRTT 238
           L +   +G VGL    +SL++QM  T
Sbjct: 144 LMDGGPSGFVGLARTPLSLVAQMNVT 169



 Score = 50.8 bits (120), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 32/133 (24%), Positives = 64/133 (48%), Gaps = 13/133 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLE----LCYSFNSLSQVPEVT 307
           +++ + + ++FL  G   +L   +++ +   P A P    +    LC+    +S  P+V 
Sbjct: 246 VLLQTFSPVSFLVDGVYQDLKKAVTAAV-GGPTATPPEQFQSIFDLCFKRGGVSGAPDVV 304

Query: 308 IHFRGAD-VKLSRSNFFVKVSEDIVCSVFKGITN-------SVPIYGNIMQTNFLVGYDI 359
           + F+GA  + +  +N+ + V +D VC                + I G + Q N    YD+
Sbjct: 305 LTFQGAAALTVPPTNYLLDVGDDTVCVAIASSARLNSTEVAGMSILGGLQQQNVHFLYDL 364

Query: 360 EQQTVSFKPTDCT 372
           E++T+SF+  DC+
Sbjct: 365 EKETLSFEAADCS 377


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/258 (28%), Positives = 107/258 (41%), Gaps = 32/258 (12%)

Query: 2   ATFLSCVFILFFLCFYVVSPIEAQTGGFSVELIHRDSPKS---PFYNSS----------- 47
           A  L  + +  + CFY  S +  Q  G   E   R+  +S   P Y  +           
Sbjct: 83  ALVLGALAVAAYYCFY--SDVAVQFLGMEQEEEQRNETRSFLLPLYPKARQGRALREFGD 140

Query: 48  -ETPYQRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTE 103
            +   +R+ D   ++ NR+      ++ ++S A    + ++ P+   Y   I IG PP  
Sbjct: 141 VKLAARRVDDGGRKARNRMEVAKAATARTNSTALLPIKGNVFPD-GQYYTSIFIGNPPRP 199

Query: 104 RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL--NQKSC 161
                DTGSDL W QC+  P +       PL+ P   +  K +P     C  L  NQ  C
Sbjct: 200 YFLDVDTGSDLTWIQCD-APCTNFAKGPHPLYKP---AKEKIVPPRDLLCQELQGNQNYC 255

Query: 162 SGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---K 217
                C Y + Y D S S G LA + + + +T G    L    FGC  +  G   S   K
Sbjct: 256 ETCKQCDYEIEYADQSSSMGVLARDDMHMIATNGGREKL-DFVFGCAYDQQGQLLSSPAK 314

Query: 218 TTGIVGLGGGDISLISQM 235
           T GI+GL    IS  SQ+
Sbjct: 315 TDGILGLSSAAISFPSQL 332


>gi|359492489|ref|XP_002285867.2| PREDICTED: aspartic proteinase Asp1-like [Vitis vinifera]
          Length = 453

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 140/372 (37%), Gaps = 90/372 (24%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ P    Y + + IG PP       D+GSDL W QC+ PC    C     P + P  
Sbjct: 59  QGNVYPQGF-YSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPC--VSCTKAPHPPYKPNK 115

Query: 140 SSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                 + C+   C++L+  S      S   C Y VSY D   S G L  +  +L  T G
Sbjct: 116 G----PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNG 171

Query: 195 QAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---- 242
             +A P + FGCG +    G        G++GLG G  S+++Q+R+     +I G+    
Sbjct: 172 -TLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSG 230

Query: 243 -------------------------------------------QRLGVSTPDIVIDSGTT 259
                                                      Q  GV    +V DSG++
Sbjct: 231 RGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSS 290

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTIHF----- 310
            T+         LS++   +  +       SL +C+     F S+ +V      F     
Sbjct: 291 YTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFT 350

Query: 311 --RGADVKLSRSNFFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
             + A ++L   ++ +       C        V  G +N   + G+I   + +V YD E+
Sbjct: 351 KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSN---VIGDIAFQDKMVIYDNER 407

Query: 362 QTVSFKPTDCTK 373
           Q + + P DC K
Sbjct: 408 QQIGWVPKDCNK 419


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 65/200 (32%), Positives = 95/200 (47%), Gaps = 43/200 (21%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSL 146
           +N +  + +++GTPP     V DTGS+L W  C           D+P FD   SS+Y  +
Sbjct: 59  HNVSLTVPVAVGTPPQNVTMVLDTGSELSWLLC------NGSRHDAP-FDASASSSYAPV 111

Query: 147 PCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG 201
           PCSS  C  L +       C    C+ S+SY D S ++G LA +T  LGS+   A+    
Sbjct: 112 PCSSPACTWLGRDLPVRPFCDSSACRVSLSYADASSADGLLAADTFLLGSSPMPAL---- 167

Query: 202 ITFGCGTNNGGLFNSKT-------TGIVGLGGGDISLISQMRTT-----IAGNQRLGVST 249
             FGC T+    ++S T       TG++G+  G +S ++Q  T      IA  Q      
Sbjct: 168 --FGCITS----YSSSTDPSETPPTGLLGMNRGGLSFVTQTATRRFAYCIAAGQ-----G 216

Query: 250 PDIVI----DSGTTLTFLPQ 265
           P I++    D+ T LT  PQ
Sbjct: 217 PGILLLGGNDTETPLTSPPQ 236


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 49/153 (32%), Positives = 71/153 (46%), Gaps = 15/153 (9%)

Query: 92  LIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS 151
           ++ + IGTP   +  V DTGS L W QC P    +     +  FDP +SS++  LPCS  
Sbjct: 82  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 152 QCA------SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFG 205
            C       +L     S   C YS  Y DG+F+ GNL  E  T  ++       P +  G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILG 197

Query: 206 CGTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           C        ++   GI+G+  G +S ISQ + +
Sbjct: 198 CAKE-----STDVKGILGMNLGRLSFISQAKIS 225


>gi|125606590|gb|EAZ45626.1| hypothetical protein OsJ_30294 [Oryza sativa Japonica Group]
          Length = 431

 Score = 76.3 bits (186), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 62/123 (50%), Gaps = 24/123 (19%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQ 152
           + + IGTP      V DT SDL+WTQC+PC    C  Q   ++DP  + TY +L  SS  
Sbjct: 90  VFLGIGTPAMNVTLVFDTTSDLLWTQCQPC--LSCVAQAGDMYDPNKTETYANLTSSS-- 145

Query: 153 CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
                          Y+ +Y   SF++G  ATET  LG+ T     +  ITFGCGT N G
Sbjct: 146 ---------------YNYTYSKQSFTSGYFATETFALGNVT-----VANITFGCGTRNQG 185

Query: 213 LFN 215
            ++
Sbjct: 186 YYD 188



 Score = 51.2 bits (121), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 38/133 (28%), Positives = 67/133 (50%), Gaps = 13/133 (9%)

Query: 252 IVIDSGTTLTFLPQG----YNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPE-- 305
           +VIDS + +T L +         L++ ++ + EA   A     L+LC+   +    P   
Sbjct: 291 LVIDSTSPVTVLDEATYGPVRRALVAQLAPLKEANANASAGVGLDLCFELAAGGATPTPP 350

Query: 306 ---VTIHFRG--ADVKLSRSNFFVKVSED-IVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
              +T+HF G  AD+ L  +++  K S   ++C ++    +N VP+ G+    + LV YD
Sbjct: 351 NVTMTLHFDGGAADLVLPPASYLAKDSAGGLICLTMTPSSSNGVPVLGSWALLDTLVLYD 410

Query: 359 IEQQTVSFKPTDC 371
           + +  VSF+P DC
Sbjct: 411 LAKNVVSFQPLDC 423


>gi|302757589|ref|XP_002962218.1| hypothetical protein SELMODRAFT_403844 [Selaginella moellendorffii]
 gi|300170877|gb|EFJ37478.1| hypothetical protein SELMODRAFT_403844 [Selaginella moellendorffii]
          Length = 353

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 84/292 (28%), Positives = 128/292 (43%), Gaps = 42/292 (14%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASL 156
           +GTP  E LA+ DT  DL+W Q E                 + SS++K++ CS S+C  L
Sbjct: 88  LGTPEQEILAIIDTALDLVWAQVE-----------------ERSSSFKNVSCSDSRC-RL 129

Query: 157 NQKSCS-GVN-CQYSVSYGDGSFSN-GNLATETVTLGSTTG---QAVALPGITFGCGTNN 210
               CS G N C Y  S   G     G LATETVTL    G   + + +P   FGC    
Sbjct: 130 TPSHCSDGSNTCIYYPSSAIGHAGRGGRLATETVTLVYARGRWTERIPVPDTLFGCERKT 189

Query: 211 GGLFNSKTTGIVGLGGGDISL-ISQM----RTTIAGNQRLGVSTPDIVIDSGTTLTFLPQ 265
               NS+ +    +     S  +S M    R  I G    GV T  ++   G    +  +
Sbjct: 190 EA-HNSRHSYYSEITENKFSYCLSSMLFLGRARIPGE---GVQTIPMLSSPGHGHYYFAE 245

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGSLELCYS--FNSLSQVPEVTIHFRGADVKLSRSNFF 323
                 ++V  S+I A    D   +LELCYS   +   + P + +H   A + LS+ N+ 
Sbjct: 246 ---LRAITVGFSVI-AIARNDSDANLELCYSTALDPSYKFPSMELHPESARMVLSQKNYI 301

Query: 324 VKVSEDIVCSVFKGITN--SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
           +       C V   + +   V + G++MQ ++ + +D    T+SF P  C++
Sbjct: 302 LSNGSGWAC-VATAMRDPGDVSVIGSLMQRDYHILFDNPGSTISFAPATCSE 352


>gi|302141796|emb|CBI18999.3| unnamed protein product [Vitis vinifera]
          Length = 390

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 140/372 (37%), Gaps = 90/372 (24%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           Q ++ P    Y + + IG PP       D+GSDL W QC+ PC    C     P + P  
Sbjct: 26  QGNVYPQGF-YSVSLRIGNPPKPYTLDIDSGSDLTWLQCDAPC--VSCTKAPHPPYKPNK 82

Query: 140 SSTYKSLPCSSSQCASLNQKS-----CSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTG 194
                 + C+   C++L+  S      S   C Y VSY D   S G L  +  +L  T G
Sbjct: 83  G----PITCNDPMCSALHWPSKPPCKASHEQCDYEVSYADHGSSLGVLVHDIFSLQLTNG 138

Query: 195 QAVALPGITFGCGTNN---GGLFNSKTTGIVGLGGGDISLISQMRT-----TIAGN---- 242
             +A P + FGCG +    G        G++GLG G  S+++Q+R+     +I G+    
Sbjct: 139 -TLAAPRLAFGCGYDQSYPGPNAPPFVDGVLGLGYGKSSIVTQLRSLGLIRSIVGHCLSG 197

Query: 243 -------------------------------------------QRLGVSTPDIVIDSGTT 259
                                                      Q  GV    +V DSG++
Sbjct: 198 RGGGFLFLGDGLSTTPGIIWTPMSRKSGESAYALGPADLLFNGQNSGVKGLRLVFDSGSS 257

Query: 260 LTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYS----FNSLSQVPEVTIHF----- 310
            T+         LS++   +  +       SL +C+     F S+ +V      F     
Sbjct: 258 YTYFNAQAYKTTLSLVRKYLNGKLKETADESLPVCWRGAKPFKSIFEVKNYFKPFALSFT 317

Query: 311 --RGADVKLSRSNFFVKVSEDIVC-------SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
             + A ++L   ++ +       C        V  G +N   + G+I   + +V YD E+
Sbjct: 318 KAKSAQLQLPPESYLIISKHGNACLGILNGSEVGLGDSN---VIGDIAFQDKMVIYDNER 374

Query: 362 QTVSFKPTDCTK 373
           Q + + P DC K
Sbjct: 375 QQIGWVPKDCNK 386


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 84/193 (43%), Gaps = 27/193 (13%)

Query: 64  RLNHFNQNSSISSSKASQADIIPNNANYLIR------------ISIGTPPTERLAVADTG 111
           RL     +SS  +S  S+ +  P ++ Y  R            + IGTP   +  V DTG
Sbjct: 41  RLTPTTNSSSFKTSLLSRRNPSPPSSPYTFRSNIKYSMALILSLPIGTPSQSQELVLDTG 100

Query: 112 SDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCA------SLNQKSCSGVN 165
           S L W QC P    +     +  FDP +SS++  LPCS   C       +L     S   
Sbjct: 101 SQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHPLCKPRIPDFTLPTSCDSNRL 160

Query: 166 CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLG 225
           C YS  Y DG+F+ GNL  E  T  ++       P +  GC        ++   GI+G+ 
Sbjct: 161 CHYSYFYADGTFAEGNLVKEKFTFSNSQ----TTPPLILGCAKE-----STDEKGILGMN 211

Query: 226 GGDISLISQMRTT 238
            G +S ISQ + +
Sbjct: 212 LGRLSFISQAKIS 224


>gi|326517745|dbj|BAK03791.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 556

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 48/160 (30%), Positives = 78/160 (48%), Gaps = 15/160 (9%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP--LFDPKMSSTYK 144
           NN  +L+ I +GTPP   L   DTG+ L + QCEPC   +C+ Q     +FDP  S ++ 
Sbjct: 202 NNFLFLMPIKLGTPPVWNLVAVDTGATLSFVQCEPC-TLRCHKQTDAGEIFDPSKSESFS 260

Query: 145 SLPCSSSQCAS------LNQKSC--SGVNCQYSVSYGD-GSFSNGNLATETVTLGSTTGQ 195
            + CS ++C +      L  K+C     +C YS+++G   S+S G L  + + +G    +
Sbjct: 261 RVGCSENKCRTVQRALHLQSKACMEKEDSCLYSMTFGGTSSYSVGKLVRDRLAIGK-YAK 319

Query: 196 AVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQM 235
             + P   FGC  +    ++    G+VG      S   Q+
Sbjct: 320 GYSFPDFLFGCSLDTE--YHQYEAGLVGFADEPFSFFEQV 357


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 64/245 (26%), Positives = 103/245 (42%), Gaps = 37/245 (15%)

Query: 87  NNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQC-----YMQDSPLFDPKMSS 141
           N   Y++  S+GTPP     V D  SD +W QC  C  + C         +P F   +SS
Sbjct: 93  NTGMYVLSFSVGTPPQVVTGVLDITSDFVWMQCSAC--ATCGADAPAATSAPPFYAFLSS 150

Query: 142 TYKSLPCSSSQCASLNQKSCSGVN--CQYSVSYGDGSF--SNGNLATETVTLGSTTGQAV 197
           T + + C++  C  L  ++CS  +  C YS  YG G+   + G LA +     +     V
Sbjct: 151 TIREVRCANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFAT-----V 205

Query: 198 ALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSG 257
              G+ FGC     G       G++GLG G++S +SQ++      +      PD  +D G
Sbjct: 206 RADGVIFGCAVATEG----DIGGVIGLGRGELSPVSQLQI----GRFSYYLAPDDAVDVG 257

Query: 258 TTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFRGADVKL 317
           + + FL         +V + ++ ++             +  SL  V    I   G D+ +
Sbjct: 258 SFILFLDDAKPRTSRAVSTPLVASR-------------ASRSLYYVELAGIRVDGEDLAI 304

Query: 318 SRSNF 322
            R  F
Sbjct: 305 PRGTF 309


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 50/173 (28%), Positives = 71/173 (41%), Gaps = 20/173 (11%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCP-------------PSQCYMQDSPLFD 136
            Y +R  +GTP    L VADTGSDL W +C                 P+         F 
Sbjct: 86  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFR 145

Query: 137 PKMSSTYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTL 189
           P  S T+  +PCSS+ C      SL   +     C Y   Y DGS + G +  +  T+ L
Sbjct: 146 PDKSRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIAL 205

Query: 190 GSTTGQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
                +   L G+  GC T+  G     + G++ LG  +IS  S+  +   G 
Sbjct: 206 SGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAASRFGGR 258



 Score = 51.2 bits (121), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 32/129 (24%), Positives = 64/129 (49%), Gaps = 13/129 (10%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS------QVP 304
           ++DSGT+LT L +     +++ +S  +   P    DP    + CY++ S S       +P
Sbjct: 357 ILDSGTSLTMLAKPAYRAVVAALSKRLAGLPRVTMDP---FDYCYNWTSPSGSDVAAPLP 413

Query: 305 EVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQ 362
            + +HF G A ++    ++ +  +  + C  + +G    + + GNI+Q   L  YD++ +
Sbjct: 414 MLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGPWPGLSVIGNILQQEHLWEYDLKNR 473

Query: 363 TVSFKPTDC 371
            + FK + C
Sbjct: 474 RLRFKRSRC 482


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 144/384 (37%), Gaps = 93/384 (24%)

Query: 60  RSLNR--LNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWT 117
           RS+N      ++   S         D +  +  +L+ +  GTP  +   + DTGSD  W 
Sbjct: 96  RSINAKIFGQYSTQESKDGWSPESMDTLNEDGLFLVNVGFGTPQQKFNLIIDTGSDTTWI 155

Query: 118 QCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSF 177
           QC  C    C+ + +  F+P +SS+Y +  C  S             +  Y++ Y D S+
Sbjct: 156 QCNSCSLGNCHNKKT--FNPSLSSSYSNRSCIPS------------TDTNYTMKYEDNSY 201

Query: 178 SNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS--------------------- 216
           S G    + VTL     +    P   FGCG + GG F +                     
Sbjct: 202 SKGVFVCDEVTL-----KPDVFPKFQFGCGDSGGGEFGTASGVLGLAKGEQYSLISQTAS 256

Query: 217 --------------KTTGIVGLGGGDISLISQMRTTIAGN------------------QR 244
                          T G +  G   IS    ++ T   N                  +R
Sbjct: 257 KFKKKFSYCFPPKEHTLGSLLFGEKAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKR 316

Query: 245 LGVST-----PDIVIDSGTTLTFLP-QGYNSNLLSVMSSMIEAQPVADPTGS--LELCYS 296
           L VS+     P  +IDSGT +T LP   Y +   +    M+    ++ P     L+ CY+
Sbjct: 317 LNVSSSLFASPGTIIDSGTVITRLPTAAYEALRTAFQQEMLHCPSISPPPQEKLLDTCYN 376

Query: 297 FNSLS----QVPEVTIHFRG-ADVKLSRSNFFVKVSEDI--VCSVFKGITN--SVPIYGN 347
                    ++PE+ +HF G  DV L  S   +  + D+   C  F   +N   V I GN
Sbjct: 377 LKGCGGRNIKLPEIVLHFVGEVDVSLHPSG-ILWANGDLTQACLAFARKSNPSHVTIIGN 435

Query: 348 IMQTNFLVGYDIEQQTVSFKPTDC 371
             Q +  V YDIE   + F   DC
Sbjct: 436 RQQVSLKVVYDIEGGRLGFG-NDC 458


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 49/162 (30%), Positives = 74/162 (45%), Gaps = 17/162 (10%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  R+ +G P  E     DTGSD++W  C P   CP S         F+P  SST   +P
Sbjct: 89  YFTRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIP 148

Query: 148 CSSSQCASLNQKS---CSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQ---A 196
           CS  +C +  Q     C   +     C Y+ +YGDGS ++G   ++T+   +  G    A
Sbjct: 149 CSDDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTA 208

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
            +   + FGC  +  G     +    GI G G   +S++SQ+
Sbjct: 209 NSSASVVFGCSNSQSGDLMKTDRAVDGIFGFGQHQLSVVSQL 250


>gi|51038078|gb|AAT93881.1| hypothetical protein [Oryza sativa Japonica Group]
          Length = 481

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 35/236 (14%)

Query: 24  AQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSKASQAD 83
           A      +EL H D+      N   T  +R+R A  R+ +R    + +++ ++   +   
Sbjct: 18  AGGAALRLELAHVDA------NEHCTMEERVRRATERTHHR-RLLHASTAAAAGGVAAPL 70

Query: 84  IIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC--------PPSQCYMQDSPLF 135
                  Y+    IG PP    AV DTGSDL+WTQC  C            C+ Q+ P +
Sbjct: 71  RWSGKTQYIASYGIGDPPQPAEAVVDTGSDLVWTQCSTCRLPAAAAAGGGGCFPQNLPYY 130

Query: 136 DPKMSSTYKSLPCS---------SSQCASLNQKSCSGVN-CQYSVSYGDGSFSNGNLATE 185
           +  +S T +++PC          + + A   +   SG + C  + SYG G  + G L T+
Sbjct: 131 NFSLSRTARAVPCDDDDGALCGVAPETAGCARGGGSGDDACVVAASYGAG-VALGVLGTD 189

Query: 186 TVTLGSTTGQAVALPGITFGCGTN---NGGLFNSKTTGIVGLGGGDISLISQMRTT 238
             T  S++   +A     FGC +    + G  N   +GI+GLG G +SL+SQ+  T
Sbjct: 190 AFTFPSSSSVTLA-----FGCVSQTRISPGALNG-ASGIIGLGRGALSLVSQLNAT 239



 Score = 43.1 bits (100), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 42/181 (23%), Positives = 71/181 (39%), Gaps = 34/181 (18%)

Query: 221 IVGLGGGDISLISQMRTTIAGNQRLGVSTPDI-----VIDSGTTLTFLPQGYNSNLLSVM 275
           +VGL  G+ ++        AG   L  + P +     +IDSG+  T L    +  L   +
Sbjct: 304 LVGLAAGNATV-----ALPAGAFDLREAAPKVWAGGALIDSGSPFTRLVDPAHRALTKEL 358

Query: 276 SSMIEAQ-----PVADPTGSLELCYSFN------SLSQVPEVTIHFR-----GADVKLSR 319
           +  +        P A   G+LELC          + + VP + + F      G ++ +  
Sbjct: 359 ARQLRGSGSLVPPPAKLGGALELCVEAGDDGDSLAAAAVPPLVLRFDDGVGGGRELVIPA 418

Query: 320 SNFFVKVSEDIVCSVFKG--------ITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
             ++ +V     C              TN   I GN MQ +  V YD+    +SF+P +C
Sbjct: 419 EKYWARVEASTWCMAVVSSASGNATLPTNETTIIGNFMQQDMRVLYDLANGLLSFQPANC 478

Query: 372 T 372
           +
Sbjct: 479 S 479


>gi|376337718|gb|AFB33415.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
 gi|376337720|gb|AFB33416.1| hypothetical protein 2_5996_01, partial [Pinus mugo]
          Length = 154

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 52/153 (33%), Positives = 72/153 (47%), Gaps = 12/153 (7%)

Query: 28  GFSVELIHRDSPKSPFYNSSETPYQRL-RDALTRSLNRLNHFNQNSSISSSKAS----QA 82
           G ++ L H     SP   ++ + +  L   +L R  +RL      +S   +  S    Q+
Sbjct: 3   GVNIRLDHIHGACSPLRPTNSSKWIDLVSQSLERDNDRLKTIRSRNSGPYTTMSNLPLQS 62

Query: 83  DIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSST 142
                  NY++    GTP  + L V DTGSDL W QC+PC    CY Q  P+F+P  SS+
Sbjct: 63  GSEVGTGNYILTAGFGTPTKKFLLVIDTGSDLTWIQCKPC--LGCYSQVDPIFEPSQSSS 120

Query: 143 YKSLPCSSSQCASL-----NQKSCSGVNCQYSV 170
           YKSLPC S+ C  L     N   C    C Y +
Sbjct: 121 YKSLPCLSATCTELLTSESNLTPCLLGGCSYEI 153


>gi|242067691|ref|XP_002449122.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
 gi|241934965|gb|EES08110.1| hypothetical protein SORBIDRAFT_05g005410 [Sorghum bicolor]
          Length = 407

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 85/375 (22%), Positives = 146/375 (38%), Gaps = 94/375 (25%)

Query: 82  ADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKM 139
            D+ P   ++ + ++IG P        DTGS+L W +C   P P   C     PL+ PK 
Sbjct: 32  GDVHPT-GHFYVTMNIGEPAKPYFLDIDTGSNLTWIKCHATPGPCKTCNKVPHPLYRPK- 89

Query: 140 SSTYKSLPCSSSQCASLNQ-----KSCSGV--NCQYSVSYGDGSFSNGNLATETVTLGST 192
               K +PC+   C +L++     K C      C Y ++Y DG+ S G L  +  +L + 
Sbjct: 90  ----KLVPCADPLCDALHKDLGTTKDCREEPDQCHYQINYADGTTSLGVLLLDKFSLPTG 145

Query: 193 TGQAVALPGITFGCGTNNGGLFNSKT------TGIVGLG--------------------- 225
           + + +A     FGCG +       K        GI+GLG                     
Sbjct: 146 SARNIA-----FGCGYDQMQGPKKKAPEKVPVDGILGLGRGSVDLVSQLKHSGAVSKNVI 200

Query: 226 -------GGDISLISQ--------------------------MRTTIAGNQRLGVSTPDI 252
                  GG    I +                            T   G   +G      
Sbjct: 201 GHCLSSKGGGYLFIGEENVPSSHLHIIYIYCISREPNHYSPGQATLHLGRNPIGTKPFKA 260

Query: 253 VIDSGTTLTFLPQGYNSNLLSVM-SSMIEA--QPVADPTGSLELCYS----FNSLSQVPE 305
           + DSG+T T+LP+  ++ L+S + +S+I++  + V+D    L LC+     F ++  +P+
Sbjct: 261 IFDSGSTYTYLPENLHAQLVSALKASLIKSSLKLVSDTDTRLHLCWKGPKPFKTVHDLPK 320

Query: 306 -----VTIHF-RGADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYD 358
                VT+ F  G  + +   N+ +       C  + +     + + G I     LV +D
Sbjct: 321 EFKSLVTLKFDHGVTMTIPPENYLIITGHGNACFGILELPGYDLFVIGGISMQEQLVIHD 380

Query: 359 IEQQTVSFKPTDCTK 373
            E+  +++ P+ C K
Sbjct: 381 NEKGRLAWMPSPCDK 395


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 50/136 (36%), Positives = 73/136 (53%), Gaps = 13/136 (9%)

Query: 107 VADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKS--C-SG 163
           V DT  D+ W +C PC  +QC       +DP  SSTY + PC+SS C  L + +  C + 
Sbjct: 166 VLDTAGDVPWMRCVPCTFAQCAD-----YDPTRSSTYSAFPCNSSACKQLGRYANGCDAN 220

Query: 164 VNCQYSV-SYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNSKTTGIV 222
             CQY V + GD   ++G  +++ +T+ S  G  V   G  FGC  N  G F ++  GI+
Sbjct: 221 GQCQYMVVTAGDSFTTSGTYSSDVLTINS--GDRVE--GFRFGCSQNEQGSFENQADGIM 276

Query: 223 GLGGGDISLISQMRTT 238
            LG G  SL++Q  +T
Sbjct: 277 ALGRGVQSLMAQTSST 292


>gi|357483911|ref|XP_003612242.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355513577|gb|AES95200.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 527

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 144/360 (40%), Gaps = 90/360 (25%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCY--MQDSP-------LFDPKMSS 141
           +   +S+GTP +  L   DTGSDL W    PC  ++C   +Q S        ++D K SS
Sbjct: 113 HFANVSVGTPASSYLVALDTGSDLFWL---PCNCTKCVHGIQLSTGQKIAFNIYDNKESS 169

Query: 142 TYKSLPCSSSQCASLNQ-KSCSGVNCQYSVSY-GDGSFSNGNLATETVTLGSTTGQAV-- 197
           T K++ C+SS C    Q  S SG  C Y V Y  + + + G L  + + L +        
Sbjct: 170 TSKNVACNSSLCEQKTQCSSSSGGTCPYQVEYLSENTSTTGFLVEDVLHLITDNDDQTQH 229

Query: 198 ALPGITFGCGTNNGGLF--NSKTTGIVGLGGGDISLISQM-------------------- 235
           A P ITFGCG    G F   +   G+ GLG  D+S+ S +                    
Sbjct: 230 ANPLITFGCGQVQTGAFLDGAAPNGLFGLGMSDVSVPSILAKQGLTSNSFSMCFAADGLG 289

Query: 236 RTTIAGNQR--------------------------LGVSTPDI----VIDSGTTLTFLPQ 265
           R T   N                            +G ++ D+    + D+GT+ T+L  
Sbjct: 290 RITFGDNNSSLDQGKTPFNIRPSHSTYNITVTQIIVGGNSADLEFNAIFDTGTSFTYLNN 349

Query: 266 GYNSNLLSVMSSMIEAQPVADPTGS---LELCYSF--NSLSQVPEVTIHFRGADVKLSRS 320
                +     S I+ Q  +         E CY    N   +VP + +  +G D      
Sbjct: 350 PAYKQITQSFDSKIKLQRHSFSNSDDLPFEYCYDLRTNQTIEVPNINLTMKGGD------ 403

Query: 321 NFFVKV--------SEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           N+FV          +  ++C +V K  +N+V I G    T + + +D E  T+ +K ++C
Sbjct: 404 NYFVMDPIITSGGGNNGVLCLAVLK--SNNVNIIGQNFMTGYRIVFDRENMTLGWKESNC 461


>gi|47497551|dbj|BAD19623.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
 gi|47847593|dbj|BAD21980.1| nucellin-like aspartic protease-like [Oryza sativa Japonica Group]
          Length = 297

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 50/158 (31%), Positives = 76/158 (48%), Gaps = 13/158 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQ---CEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  RI IGTP        DTGSD++W     C+ CP       +  ++DP+ S + + + 
Sbjct: 90  YFTRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVT 149

Query: 148 CSSSQCASLNQK---SCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG-- 201
           C    C +       SC+  + C+YS+SYGDGS + G   T+ +     +G     P   
Sbjct: 150 CDQQFCVANYGGVLPSCTSTSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPANA 209

Query: 202 -ITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQM 235
            ++FGCG   GG   S      GI+G G  + S++SQ+
Sbjct: 210 SVSFGCGAKLGGDLGSSNLALDGILGFGQSNSSMLSQL 247


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 58/195 (29%), Positives = 86/195 (44%), Gaps = 19/195 (9%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQRLRDALTRSLNRLNHFNQNSSISSSK---- 78
           + Q  G ++E+ H  SP SPF      P       L             +S+ + +    
Sbjct: 28  DTQDHGSTLEVFHVFSPCSPF--RPPKPLSWAESVLQLQAKDQARLQFLASMVAGRSVVP 85

Query: 79  -ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDP 137
            AS   II  +  Y++R  IG+PP   L   DT +D  W  C  C    C    S LF P
Sbjct: 86  IASGRQII-QSPTYIVRAKIGSPPQTLLLAMDTSNDAAWIPCTAC--DGC---TSTLFAP 139

Query: 138 KMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV 197
           + S+T+K++ C S QC  +   SC    C ++++YG  S +  N+  +TVTL +      
Sbjct: 140 EKSTTFKNVSCGSPQCNQVPNPSCGTSACTFNLTYGSSSIA-ANVVQDTVTLATD----- 193

Query: 198 ALPGITFGCGTNNGG 212
            +P  TFGC     G
Sbjct: 194 PIPDYTFGCVAKTTG 208


>gi|356518800|ref|XP_003528065.1| PREDICTED: aspartic proteinase Asp1-like [Glycine max]
          Length = 438

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 95/394 (24%), Positives = 153/394 (38%), Gaps = 103/394 (26%)

Query: 65  LNHFNQNSSISSSKASQADIIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQ 118
           LN F   SS+         + P + N      Y + ++IG PP       DTGSDL W Q
Sbjct: 54  LNRFRAGSSV---------VFPVHGNVYPVGFYNVTLNIGQPPRPYFLDIDTGSDLTWLQ 104

Query: 119 CEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQK---SCSGVN-CQYSVSYGD 174
           C+  P S+C     PL+ P    +   +PC  S CASL+      C   + C Y V Y D
Sbjct: 105 CD-APCSRCSQTPHPLYRP----SNDFVPCRHSLCASLHHSDNYDCEVPHQCDYEVQYAD 159

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLF----NSKTTGIVGLGGGDIS 230
              S G L  +  TL  T G  + +  +  GCG +   +F    +    G++GLG G  S
Sbjct: 160 HYSSLGVLLHDVYTLNFTNGVQLKVR-MALGCGYDQ--IFPDPSHHPLDGMLGLGRGKTS 216

Query: 231 LISQM------RTTIA-------------------------------------------- 240
           L SQ+      R  I                                             
Sbjct: 217 LTSQLNSQGLVRNVIGHCLSAQGGGYIFFGDVYDSSRLTWTPMSSRDYKHYSAAGAAELL 276

Query: 241 -GNQRLGVSTPDIVIDSGTTLTFL-PQGYNSNLLSVMSSMIEAQPV--ADPTGSLELCY- 295
            G ++ G+ +   V D+G++ T+  P  Y + L+S +      +P+  A    +L LC+ 
Sbjct: 277 FGGKKSGIGSLHAVFDTGSSYTYFNPYAYQA-LISWLGKESGGKPLKEAHDDQTLPLCWR 335

Query: 296 SFNSLSQVPEVTIHFRGADVKL-----SRSNFFVKVSEDIVCS----VFKGITN------ 340
                  + EV  +F+   +       S++ F +     ++ S    V  GI N      
Sbjct: 336 GRRPFRSIYEVRKYFKPIVLSFTSNGRSKAQFEMPPEAYLIISNMGNVCLGILNGSEVGM 395

Query: 341 -SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             + + G+I   N ++ +D ++Q + + P DC +
Sbjct: 396 GDLNLIGDISMLNKVMVFDNDKQLIGWTPADCDQ 429


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 60/204 (29%), Positives = 93/204 (45%), Gaps = 35/204 (17%)

Query: 23  EAQTGGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLNRLNHFNQNSSISSSK--- 78
           E    G +++++H  SP SPF       ++  +     +   RL      SS+ + K   
Sbjct: 31  ETPDQGSTLQVLHVYSPCSPFRPKEPLSWEESVLQMQAKDKARLQFL---SSLVARKSVV 87

Query: 79  --ASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFD 136
             AS   I+  N  Y++R  IGTP    L   DT SD+ W  C       C    S LF+
Sbjct: 88  PIASGRQIV-QNPTYIVRAKIGTPAQTMLMAMDTSSDVAWIPCN-----GCLGCSSTLFN 141

Query: 137 PKMSSTYKSLPCSSSQCAS--------------LNQKSCSGVNCQYSVSYGDGSFSNGNL 182
              S+TYKSL C ++QC                + + +C G  C ++++YG  S +  NL
Sbjct: 142 SPASTTYKSLGCQAAQCKQVLHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLA-ANL 200

Query: 183 ATETVTLGSTTGQAVALPGITFGC 206
           + +T+TL +      A+PG +FGC
Sbjct: 201 SQDTITLATD-----AVPGYSFGC 219


>gi|449449906|ref|XP_004142705.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
 gi|449500739|ref|XP_004161182.1| PREDICTED: aspartic proteinase Asp1-like [Cucumis sativus]
          Length = 410

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 81/361 (22%), Positives = 141/361 (39%), Gaps = 83/361 (22%)

Query: 89  ANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
            ++ + ++IG PP       DTGSDL W QC+ PC  + C +    L+ P  +       
Sbjct: 53  GHFTVSVTIGNPPKVFELDIDTGSDLTWVQCDAPC--TGCTLPHDRLYKPHNNVVR---- 106

Query: 148 CSSSQCASL---NQKSCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGI 202
           C    C++L   ++  C   N  C Y V Y D   S G L  + V L  T G  +A P +
Sbjct: 107 CGEPLCSALFSASKSPCKNPNDQCDYEVEYADHGSSIGVLVKDPVPLRLTNGTILA-PNL 165

Query: 203 TFGCG---TNNGGLFNSKTTGIVGLGGGD------ISLISQMRTTIA------------- 240
            FGCG    N G      T G++GLG         +S +S +R  +              
Sbjct: 166 GFGCGYDQHNGGSQLPPLTAGVLGLGNSKATMATQLSALSHVRNVLGHCFSGQGGGFLFF 225

Query: 241 --------------------------------GNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
                                           G   +G+    +  DSG++ T+      
Sbjct: 226 GGDLVPSSGMSWMPILRTPGGKYSAGPAEVYFGGNPVGIRGLILTFDSGSSYTYFNSQVY 285

Query: 269 SNLLSVMSSMIEAQPVADP--TGSLELCYSF-NSLSQVPEVTIHFRGADVKLSRSNFFVK 325
             +L+++ + ++ QP+ D     +L +C+    +   V +V   F+   +    S    +
Sbjct: 286 GAVLNLLRNGLKGQPLRDAPEDKTLPICWKGSKAFKSVADVRNFFKPLALSFGNSKVQFQ 345

Query: 326 VSEDI------VCSVFKGITN-------SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           +  +       + +V  GI N       +V + G+I   + ++ YD E+Q + + P +C+
Sbjct: 346 IPPEAYLIISNLGNVCLGILNGSQVGLGNVNLIGDISMLDKMMVYDNERQQIGWAPANCS 405

Query: 373 K 373
           K
Sbjct: 406 K 406


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 52/158 (32%), Positives = 77/158 (48%), Gaps = 14/158 (8%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYM-QDSPLFDPKMSSTYKSLPC 148
           +Y+ R  +GTPP   L   D  +D  W  C  C    C     SP FDP  SSTY+ + C
Sbjct: 99  SYVARARLGTPPQTLLVAIDPSNDAAWVPCSAC--LGCAPGASSPSFDPTQSSTYRPVRC 156

Query: 149 SSSQCASLNQKSCS-----GVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGIT 203
            + QCA +   + S     G +C +++SY   +  +  L  + ++L  + G AV     T
Sbjct: 157 GAPQCAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALSLSDSNGAAVPDDHYT 215

Query: 204 FGC---GTNNGGLFNSKTTGIVGLGGGDISLISQMRTT 238
           FGC    T +GG  +    G+VG G G +S +SQ + T
Sbjct: 216 FGCLRVVTGSGG--SVPPQGLVGFGRGPLSFLSQTKAT 251



 Score = 41.6 bits (96), Expect = 0.71,   Method: Compositional matrix adjust.
 Identities = 33/127 (25%), Positives = 49/127 (38%), Gaps = 8/127 (6%)

Query: 253 VIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSLSQVPEVTIHFR- 311
           ++D+GT  T L     + L +     + A P A   G  + CY  N    VP V   F  
Sbjct: 331 IVDAGTMFTRLSPPAYAALRNAFRRGVSA-PAAPALGGFDTCYYVNGTKSVPAVAFVFAG 389

Query: 312 GADVKLSRSNFFV-KVSEDIVCSVFK-----GITNSVPIYGNIMQTNFLVGYDIEQQTVS 365
           GA V L   N  +   S  + C         G+   + +  ++ Q N  V +D+    V 
Sbjct: 390 GARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNGRVG 449

Query: 366 FKPTDCT 372
           F    CT
Sbjct: 450 FSRELCT 456


>gi|340810981|gb|AEK75417.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C  L       Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|7413629|emb|CAB85978.1| putative protein [Arabidopsis thaliana]
          Length = 356

 Score = 75.1 bits (183), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 42/109 (38%), Positives = 60/109 (55%), Gaps = 5/109 (4%)

Query: 88  NANYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           +A Y   + IGTPP E   V DTGSDL+W  C  C    C + +   FDP  SS+   L 
Sbjct: 75  SALYYTTVQIGTPPRELDVVIDTGSDLVWVSCNSC--VGCPLHNVTFFDPGASSSAVKLA 132

Query: 148 CSSSQCASLNQKS--CSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTT 193
           CS  +C+S  QK   CS + +C Y V YGDGS ++G   ++ ++  + +
Sbjct: 133 CSDKRCSSDLQKKSRCSLLESCTYKVEYGDGSVTSGYYISDLISFDTMS 181


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/163 (33%), Positives = 77/163 (47%), Gaps = 18/163 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP E     DTGSD++W     C  CP +         FD   SST   + 
Sbjct: 66  YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVR 125

Query: 148 CSSSQCASLNQKS---CSGV--NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAV----- 197
           CS   C S  Q +   CS     C Y+  YGDGS ++G   ++T+   +  GQ++     
Sbjct: 126 CSDPICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSS 185

Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
           AL  I FGC     G     +    GI G G G++S+ISQ+ T
Sbjct: 186 AL--IVFGCSAYQSGDLTKTDKAVDGIFGFGQGELSVISQLST 226


>gi|340810961|gb|AEK75407.1| S5 [Oryza sativa]
 gi|340811037|gb|AEK75445.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C  L       Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 59/195 (30%), Positives = 88/195 (45%), Gaps = 21/195 (10%)

Query: 52  QRLRDALTRSLNRLNHFNQNSSISSSKAS---QADIIPNNANYLIRISIGTPPTERLAVA 108
           Q  R  L R L+RL      SS+ +         +I P+   Y+  + +G+PP       
Sbjct: 3   QIRRTLLERDLSRLG----KSSVGNHSVRFHVGGNIYPDGLYYMALL-LGSPPKLYFLDM 57

Query: 109 DTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVN--- 165
           DTGSDL W QC+  P   C +    L++PK +   K + C    CA + Q      N   
Sbjct: 58  DTGSDLTWAQCD-APCRNCAIGPHGLYNPKKA---KVVDCHLPVCAQIQQGGSYECNSDV 113

Query: 166 --CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFN---SKTTG 220
             C Y V Y DGS + G L  +T+T+  T G  +    I  GCG +  G      + T G
Sbjct: 114 KQCDYEVEYADGSSTMGVLVEDTLTVRLTNGTLIQTKAI-IGCGYDQQGTLAKSPASTDG 172

Query: 221 IVGLGGGDISLISQM 235
           ++GL    ++L +Q+
Sbjct: 173 VIGLSSSKVALPAQL 187


>gi|340811098|gb|AEK75475.1| S5 [Oryza nivara]
          Length = 357

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 67/126 (53%), Gaps = 19/126 (15%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C  L       Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGC 206
            + FGC
Sbjct: 114 DLMFGC 119


>gi|196212948|gb|ACG76110.1| S5 [Oryza sativa Japonica Group]
 gi|340810887|gb|AEK75370.1| S5 [Oryza sativa]
 gi|340810903|gb|AEK75378.1| S5 [Oryza sativa]
 gi|340810921|gb|AEK75387.1| S5 [Oryza sativa]
 gi|340810955|gb|AEK75404.1| S5 [Oryza sativa]
 gi|340811079|gb|AEK75466.1| S5 [Oryza nivara]
 gi|340811090|gb|AEK75471.1| S5 [Oryza rufipogon]
 gi|340811116|gb|AEK75484.1| S5 [Oryza nivara]
          Length = 357

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCASLN------QKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C  L       Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGELRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|340810915|gb|AEK75384.1| S5 [Oryza sativa]
 gi|340810917|gb|AEK75385.1| S5 [Oryza sativa]
 gi|340810919|gb|AEK75386.1| S5 [Oryza sativa]
 gi|340810927|gb|AEK75390.1| S5 [Oryza sativa]
 gi|340810975|gb|AEK75414.1| S5 [Oryza nivara]
 gi|340810979|gb|AEK75416.1| S5 [Oryza nivara]
 gi|340810995|gb|AEK75424.1| S5 [Oryza nivara]
 gi|340811027|gb|AEK75440.1| S5 [Oryza nivara]
 gi|340811063|gb|AEK75458.1| S5 [Oryza nivara]
          Length = 357

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C        L Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|340810945|gb|AEK75399.1| S5 [Oryza sativa]
 gi|340810957|gb|AEK75405.1| S5 [Oryza sativa]
 gi|340811007|gb|AEK75430.1| S5 [Oryza nivara]
 gi|340811073|gb|AEK75463.1| S5 [Oryza rufipogon]
 gi|340811094|gb|AEK75473.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/126 (35%), Positives = 67/126 (53%), Gaps = 19/126 (15%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C        L Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGC 206
            + FGC
Sbjct: 114 DLMFGC 119


>gi|340810977|gb|AEK75415.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 74.7 bits (182), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C        L Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score = 74.7 bits (182), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/169 (30%), Positives = 71/169 (42%), Gaps = 18/169 (10%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
            Y +R  +GTP    L VADTGSDL W +C    P+      SP          F P+ S
Sbjct: 96  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCR--RPASANSSLSPADSGPGPGRAFRPEDS 153

Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATE--TVTLGSTT 193
            T+  + C+S  C      SL      G  C Y   Y DGS + G + TE  T+ L    
Sbjct: 154 RTWAPISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRE 213

Query: 194 GQAVALPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
            +   L G+  GC ++  G     + G++ LG   IS  S   +   G 
Sbjct: 214 ERKAKLKGLVLGCSSSYTGPSFEASDGVLSLGYSGISFASHAASRFGGR 262



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/131 (25%), Positives = 67/131 (51%), Gaps = 13/131 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS------QV 303
           +++DSGT+LT L +     +++ +S  +   P    DP    E CY++ S S       V
Sbjct: 349 VILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTMDP---FEYCYNWTSPSGKDADVAV 405

Query: 304 PEVTIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           P++ +HF GA  ++    ++ +  +  + C  + +G    + + GNI+Q   L  +DI+ 
Sbjct: 406 PKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKN 465

Query: 362 QTVSFKPTDCT 372
           + + F+ + CT
Sbjct: 466 RRLKFQRSRCT 476


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 53/162 (32%), Positives = 75/162 (46%), Gaps = 14/162 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IGTP  +     DTGSD++W    QC  CP +     +   +D + S+T K + 
Sbjct: 87  YYAKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVS 146

Query: 148 CSSSQCASLNQKSCSG----VNCQYSVSYGDGSFSNGNLATETVTLGSTTG--QAVALPG 201
           C    C  +N    SG    ++C Y   YGDGS + G    + V     +G  +  A  G
Sbjct: 147 CDEQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANG 206

Query: 202 -ITFGCGTNNGGLFNS----KTTGIVGLGGGDISLISQMRTT 238
            I FGCG    G   S       GI+G G  + S+ISQ+ +T
Sbjct: 207 SIKFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLAST 248


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 25/149 (16%)

Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
           PP     V DTGS+L W +C     P P +         FDP  SS+Y  +PCSS  C +
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133

Query: 156 -----LNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
                L   SC S   C  ++SY D S S GNLA E    G++T  +     + FGC  +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189

Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQM 235
             G     ++KTTG++G+  G +S ISQM
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQM 218


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 76/176 (43%), Gaps = 24/176 (13%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSP---------LFDPKMS 140
            Y +R  +GTP    L VADTGSDL W +C P   +      S           F P+ S
Sbjct: 94  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKS 153

Query: 141 STYKSLPCSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG----- 190
            T+  +PC+S  C+     SL+     G  C Y   Y DGS + G + TE+ T+      
Sbjct: 154 KTWAPIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSS 213

Query: 191 ---STTGQAVALPGITFGC-GTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
                  +   L G+  GC G+  G  F + + G++ LG  ++S  S   +   G 
Sbjct: 214 SSSKNKVKKAKLQGLVLGCTGSYTGPSFEA-SDGVLSLGYSNVSFASHAASRFGGR 268



 Score = 55.5 bits (132), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/131 (25%), Positives = 67/131 (51%), Gaps = 13/131 (9%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLSQ------V 303
           +++DSGT+LT L +     +++ +   +   P    DP    E CY++ S S+      +
Sbjct: 350 VIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAMDP---FEYCYNWTSPSRKDEGDDL 406

Query: 304 PEVTIHFRG-ADVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQ 361
           P++ +HF G A ++    ++ +  +  + C  V +G    + + GNI+Q   L  +D++ 
Sbjct: 407 PKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWPGISVIGNILQQEHLWEFDLKN 466

Query: 362 QTVSFKPTDCT 372
           + + FK + CT
Sbjct: 467 RRLRFKRSRCT 477


>gi|340810959|gb|AEK75406.1| S5 [Oryza sativa]
 gi|340810971|gb|AEK75412.1| S5 [Oryza rufipogon]
          Length = 357

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 45/129 (34%), Positives = 68/129 (52%), Gaps = 19/129 (14%)

Query: 93  IRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDS---PLFDPKMSSTYKSLPCS 149
           + +S+G PP   L   DTGS L W QC+PC    C+ Q +   P+FDP  S T + + CS
Sbjct: 1   MAVSLGKPPVVNLVAIDTGSTLSWVQCQPC-AVHCHTQSAKAGPIFDPGRSYTSRRVRCS 59

Query: 150 SSQCAS------LNQKSC--SGVNCQYSVSYGDG-SFSNGNLATETVTLGSTTGQAVALP 200
           S +C        L Q +C     +C YSV+YG+G ++S G + T+T+ +G +        
Sbjct: 60  SVKCGEPRYDLRLQQANCMEKEDSCTYSVTYGNGWAYSVGKMVTDTLRIGDS------FM 113

Query: 201 GITFGCGTN 209
            + FGC  +
Sbjct: 114 DLMFGCSMD 122


>gi|413953655|gb|AFW86304.1| hypothetical protein ZEAMMB73_151223 [Zea mays]
          Length = 535

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 84/360 (23%), Positives = 133/360 (36%), Gaps = 83/360 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSS 150
           Y   IS+G+PP       DTGS   W QC+  P + C     PL+ P  + T  +LP S 
Sbjct: 160 YYTAISLGSPPRPYFLDVDTGSHTTWVQCDAPPCASCAKGAHPLYRP--ARTADALPASD 217

Query: 151 SQCASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNN 210
             C     ++ +   C Y +SY DGS S G    +++      G+      I FGCG + 
Sbjct: 218 PLCEGAQHENPN--QCDYEISYADGSSSMGVYVRDSMQFVGEDGERENA-DIVFGCGYDQ 274

Query: 211 GG-LFNSKTT---------------------GIV-------------GLGG--------- 226
            G L N+  T                     GI+             G GG         
Sbjct: 275 QGVLLNALETTDGVLGLTNKALSLPTQLASRGIISNAFGHCMSTDPSGAGGYLFLGDDYI 334

Query: 227 --------------GDISLISQMRTTIAGNQRLGVSTP--DIVIDSGTTLTFLPQGYNSN 270
                          D    +Q++    G+Q+L        +V D+G+T T+ P    + 
Sbjct: 335 PRWGMTWVPIRDGPADDVRRAQVKQINHGDQQLNAQGKLTQVVFDTGSTYTYFPDEALTR 394

Query: 271 LLSVMSSMIEAQPVADPTG-SLELCYSFN-SLSQVPEVTIHFRGADVKLSRSNFFVKV-- 326
           L+S +      + V D +  +L  C   +  +  V +V   F+   ++  +  FF +   
Sbjct: 395 LISSLKEAASPRFVQDDSDKTLPFCMKSDFPVRSVEDVKHFFKPLSLQFEKRFFFSRTFN 454

Query: 327 -----------SEDIVCSVFKGIT---NSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
                        ++   V  G T   +SV I G++     LV YD ++  V +   DCT
Sbjct: 455 IRPEHYLVISDKGNVCLGVLNGTTIGYDSVVIVGDVSLRGKLVAYDNDKNEVGWVDFDCT 514


>gi|255541790|ref|XP_002511959.1| protein with unknown function [Ricinus communis]
 gi|223549139|gb|EEF50628.1| protein with unknown function [Ricinus communis]
          Length = 583

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/206 (29%), Positives = 88/206 (42%), Gaps = 22/206 (10%)

Query: 69  NQNSSISSSKASQAD---IIPNNAN------YLIRISIGTPPTERLAVADTGSDLIWTQC 119
           N+N  ++SS A+  D   + P   N      Y   I +G PP       DT SDL W QC
Sbjct: 177 NRNYKLASSNAAAVDSSSVFPVRGNVYPDGLYFTYILVGNPPRPYYLDIDTASDLTWIQC 236

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGV-----NCQYSVSYGD 174
           +  P + C    + L+ P+  +    +    S C  L++   +G       C Y + Y D
Sbjct: 237 D-APCTSCAKGANALYKPRRDNI---VTPKDSLCVELHRNQKAGYCETCQQCDYEIEYAD 292

Query: 175 GSFSNGNLATETVTLGSTTGQAVALPGITFGCGTNNGGLFNS---KTTGIVGLGGGDISL 231
            S S G LA + + L    G +  L    FGC  +  GL  +   KT GI+GL    +SL
Sbjct: 293 HSSSMGVLARDELHLTMANGSSTNL-KFNFGCAYDQQGLLLNTLVKTDGILGLSKAKVSL 351

Query: 232 ISQMRTTIAGNQRLGVSTPDIVIDSG 257
            SQ+      N  +G    + V+  G
Sbjct: 352 PSQLANRGIINNVVGHCLANDVVGGG 377


>gi|297820902|ref|XP_002878334.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297324172|gb|EFH54593.1| hypothetical protein ARALYDRAFT_907565 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 362

 Score = 74.3 bits (181), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/202 (27%), Positives = 89/202 (44%), Gaps = 36/202 (17%)

Query: 63  NRLNHFNQNSSISSSKASQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCEPC 122
           +R  H + + S+  S+    D +  N  Y  R+ IGTPP     + D+GS + +  C  C
Sbjct: 64  HRKLHKSDSKSLPHSRMRLYDDLLINGYYTTRLWIGTPPQMFALIVDSGSTVTYVPCSDC 123

Query: 123 ------------PPSQ------CYMQ----------DSPLFDPKMSSTYKSLPCSSSQCA 154
                       P  Q      C +Q          + P F P++SSTY+ + C +  C 
Sbjct: 124 EQCGKHQVMLSSPKDQILCLVSCKVQIFKISYGLFDEDPKFQPELSSTYQPVKC-NMDCN 182

Query: 155 SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGT-NNGGL 213
             + K      C Y   Y + S S G L  + ++ G+ +   +      FGC T   G L
Sbjct: 183 CDDDKE----QCVYEREYAEHSSSKGVLGEDLISFGNES--HLTPQRAVFGCKTVETGDL 236

Query: 214 FNSKTTGIVGLGGGDISLISQM 235
           ++ +  GI+GLG GD+SL+ Q+
Sbjct: 237 YSQRADGIIGLGQGDLSLVGQL 258


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/363 (23%), Positives = 142/363 (39%), Gaps = 85/363 (23%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  +I IG+P        DTGSD++W    +C+ CP +     +   +DP  S T  ++ 
Sbjct: 85  YYTQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVG 142

Query: 148 CSSSQCASLNQK----SCSGVN--CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALP- 200
           C    C + +      +C   +  CQ+ ++YGDGS + G   +++V     +G     P 
Sbjct: 143 CDQEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPS 202

Query: 201 --GITFGCGTNNGGLFNSKT---TGIVGLGGGDISLISQMRT-------------TIAGN 242
              ITFGCG   GG   S +    GI+G G  D S++SQ+               T+ G 
Sbjct: 203 NASITFGCGAQLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCLDTVHGG 262

Query: 243 QRLGVST----------------------------------PDIVIDSG----------T 258
               +                                    P    DSG          T
Sbjct: 263 GIFAIGNVVQPKVKTTPLVQNVTHYNVNLQGISVGGATLQLPSSTFDSGDSKGTIIDSGT 322

Query: 259 TLTFLPQGYNSNLLSVMSSMIEAQPVADPTGSLELCYSFNSL--SQVPEVTIHFRGA-DV 315
           TL +LP+     LL+ +    + Q +A       +C+ F+       P VT  F G   +
Sbjct: 323 TLAYLPREVYRTLLTAV--FDKYQDLALHNYQDFVCFQFSGSIDDGFPVVTFSFEGEITL 380

Query: 316 KLSRSNFFVKVSEDIVCSVF--KGIT----NSVPIYGNIMQTNFLVGYDIEQQTVSFKPT 369
            +   ++  +   D+ C  F   G+       + + G+++ +N LV YD+E+Q + +   
Sbjct: 381 NVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMVLLGDLVLSNKLVVYDLEKQVIGWADY 440

Query: 370 DCT 372
           +C+
Sbjct: 441 NCS 443


>gi|222619890|gb|EEE56022.1| hypothetical protein OsJ_04800 [Oryza sativa Japonica Group]
          Length = 423

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 85/332 (25%), Positives = 126/332 (37%), Gaps = 59/332 (17%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
           NY+ R  +GTP    L   D  +D  W  C  C  + C    SP F P  SSTY+++PC 
Sbjct: 101 NYIARAGLGTPAQTLLVAIDPSNDAAWVPCSAC--AGC-AASSPSFSPTQSSTYRTVPCG 157

Query: 150 SSQCASLNQKSCS---GVNCQYSVSY---------GDGSFSNGNLATETVTLG------- 190
           S QCA +   SC    G +C ++++Y         G  S +  N    + T G       
Sbjct: 158 SPQCAQVPSPSCPAGVGSSCGFNLTYAASTFQAVLGQDSLALENNVVVSYTFGCLRVVNG 217

Query: 191 ---STTGQAVALPGITFGCGTNNGGL------FNSKTT--------------GIVGLGGG 227
              +  G     P        + G L         KTT               ++G+  G
Sbjct: 218 NSRAAAGAHRLRPRAALLLVADQGHLGPIGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVG 277

Query: 228 DISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQPVADP 287
              ++   ++ +A N   G  T   +ID+GT  T L     + +       +   PVA P
Sbjct: 278 S-KVVQVPQSALAFNPVTGSGT---IIDAGTMFTRLAAPVYAAVRDAFRGRVR-TPVAPP 332

Query: 288 TGSLELCYSFNSLSQVPEVTIHFRGA-DVKLSRSNFFVKVSE-DIVCSVFK-----GITN 340
            G  + CY  N    VP VT  F GA  V L   N  +  S   + C         G+  
Sbjct: 333 LGGFDTCY--NVTVSVPTVTFMFAGAVAVTLPEENVMIHSSSGGVACLAMAAGPSDGVNA 390

Query: 341 SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
           ++ +  ++ Q N  V +D+    V F    CT
Sbjct: 391 ALNVLASMQQQNQRVLFDVANGRVGFSRELCT 422


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 138/350 (39%), Gaps = 86/350 (24%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
           IGTPP E   + DTGS + +  C  C   QC     P F P +S TY  + C+    C +
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59

Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
            N +      C Y   Y + S S+G L  + V+ G+ +   +      FGC     G LF
Sbjct: 60  ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111

Query: 215 NSKTTGIVGL------------------------------GGGDISL--IS--------- 233
           +    GI+GL                              GGG + L  IS         
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171

Query: 234 -----------QMRTTIAGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVM 275
                      ++R      ++L ++ P +       ++DSGTT  +LP+      +  +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230

Query: 276 SSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNFFVK 325
           +S +   + +  P  +  ++C+S  + S++PE+   F   D+         LS  N+  K
Sbjct: 231 TSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFK 289

Query: 326 VSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            S+        VF+   +   + G I+  N LV YD E   V F  T+C+
Sbjct: 290 HSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 98/213 (46%), Gaps = 26/213 (12%)

Query: 12  FFLCFYVVSPIEAQT-------GGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLN 63
           F LC ++ S ++AQT        G ++++ H  S  SPF  S    ++  + +   +   
Sbjct: 10  FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68

Query: 64  RLNHFNQNSSISSSKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R+ +F   SS+ + K+    + A  I  +  Y+++   GTPP   L   DT SD  W  C
Sbjct: 69  RMQYF---SSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPC 125

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN 179
             C         S  F P  S++++++ C S  C  +   +C G  C ++ +YG  S + 
Sbjct: 126 SGC----VGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA- 180

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
            ++  +T+TL      A  +PG TFGC     G
Sbjct: 181 ASVVQDTLTLA-----ADPIPGYTFGCVNKTTG 208


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/149 (36%), Positives = 74/149 (49%), Gaps = 25/149 (16%)

Query: 100 PPTERLAVADTGSDLIWTQC----EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCAS 155
           PP     V DTGS+L W +C     P P +         FDP  SS+Y  +PCSS  C +
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNN--------FDPTRSSSYSPIPCSSPTCRT 133

Query: 156 -----LNQKSC-SGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGCGTN 209
                L   SC S   C  ++SY D S S GNLA E    G++T  +     + FGC  +
Sbjct: 134 RTRDFLIPASCDSDKLCHATLSYADASSSEGNLAAEIFHFGNSTNDS----NLIFGCMGS 189

Query: 210 NGG---LFNSKTTGIVGLGGGDISLISQM 235
             G     ++KTTG++G+  G +S ISQM
Sbjct: 190 VSGSDPEEDTKTTGLLGMNRGSLSFISQM 218


>gi|357469587|ref|XP_003605078.1| Aspartic proteinase Asp1 [Medicago truncatula]
 gi|355506133|gb|AES87275.1| Aspartic proteinase Asp1 [Medicago truncatula]
          Length = 418

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 76/162 (46%), Gaps = 22/162 (13%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCE----PCPPSQCYMQDSPLFDPKMSSTYKSL 146
           Y + I+IG PP       DTGSDL W QC+    PC    C +    L+ P  +   K  
Sbjct: 62  YTVSINIGNPPNPYELDIDTGSDLTWVQCDGPDAPC--KGCTLPKDKLYKPNGNQLVK-- 117

Query: 147 PCSSSQCASLN-------QKSCSGV-NCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVA 198
            CS   CA++        QK    +  C Y V Y D + S G LA + + +GS +G  V 
Sbjct: 118 -CSDPICAAVQPPFSTFGQKCAKPIPPCVYKVEYADNAESTGALARDYMHIGSPSGSNVP 176

Query: 199 LPGITFGCGTNNG---GLFNSKTTGIVGLGGGDISLISQMRT 237
           L  + FGCG             T G++GLG G IS++SQ+ +
Sbjct: 177 L--VVFGCGYEQKFSGPTPPPSTPGVLGLGNGKISILSQLHS 216


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 86/350 (24%), Positives = 138/350 (39%), Gaps = 86/350 (24%)

Query: 97  IGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSS-QCAS 155
           IGTPP E   + DTGS + +  C  C   QC     P F P +S TY  + C+    C +
Sbjct: 2   IGTPPQEFALIVDTGSTVTYVPCNSC--DQCGNHQDPKFQPDLSDTYHPVKCNPDCTCDT 59

Query: 156 LNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPGITFGC-GTNNGGLF 214
            N +      C Y   Y + S S+G L  + V+ G+ +   +      FGC     G LF
Sbjct: 60  ENDQ------CTYERQYAEMSSSSGILGEDLVSFGNMS--ELKPQRAVFGCENAETGDLF 111

Query: 215 NSKTTGIVGL------------------------------GGGDISL--IS--------- 233
           +    GI+GL                              GGG + L  IS         
Sbjct: 112 SQHADGIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171

Query: 234 -----------QMRTTIAGNQRLGVSTPDI-------VIDSGTTLTFLPQGYNSNLLSVM 275
                      ++R      ++L ++ P +       ++DSGTT  +LP+      +  +
Sbjct: 172 SDPDRSPYYNIELRGLHVAGKKLDIN-PQVFDGKHGTILDSGTTYAYLPEAAFLPFIQAI 230

Query: 276 SSMIEA-QPVADPTGSL-ELCYSFNSLSQVPEVTIHFRGADV--------KLSRSNFFVK 325
           +S +   + +  P  +  ++C+S  + S++PE+   F   D+         LS  N+  K
Sbjct: 231 TSELHGLKQIRGPDPNYNDVCFS-GAGSEIPELYKTFPSVDMVFDNGEKYSLSPENYLFK 289

Query: 326 VSE---DIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCT 372
            S+        VF+   +   + G I+  N LV YD E   V F  T+C+
Sbjct: 290 HSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 97  GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 155

Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C       C ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 156 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 215

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
              + FGC    +       +TGI+ LG G  S ++Q+
Sbjct: 216 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 253



 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
            +G  +  +   +V   GG ++    +   +AG +    +   +++DSGTTL +LP    
Sbjct: 302 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 359

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
             L   +   I      D T     CY  N    +   VT+ F  GAD++L  ++ F   
Sbjct: 360 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 419

Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             ++ED VC        +  I G   Q N  VGYD+    ++F    C +
Sbjct: 420 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 467


>gi|125524351|gb|EAY72465.1| hypothetical protein OsI_00321 [Oryza sativa Indica Group]
          Length = 343

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 29/72 (40%), Positives = 45/72 (62%), Gaps = 2/72 (2%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTYKSLPCS 149
            Y  R+ +G+P  +   V DTGSD+ W QC+PC  + CY Q  P+FDP +S++Y S+ C 
Sbjct: 166 EYFSRVGVGSPARQLYMVLDTGSDVTWVQCQPC--ADCYQQSDPVFDPSLSTSYASVACD 223

Query: 150 SSQCASLNQKSC 161
           + +C  L+  +C
Sbjct: 224 NPRCHDLDAAAC 235


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 119 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 177

Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C       C ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 178 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 237

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
              + FGC    +       +TGI+ LG G  S ++Q+
Sbjct: 238 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 275



 Score = 42.0 bits (97), Expect = 0.48,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
            +G  +  +   +V   GG ++    +   +AG +    +   +++DSGTTL +LP    
Sbjct: 322 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 379

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
             L   +   I      D T     CY  N    +   VT+ F  GAD++L  ++ F   
Sbjct: 380 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 439

Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             ++ED VC        +  I G   Q N  VGYD+    ++F    C +
Sbjct: 440 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 487


>gi|255634819|gb|ACU17770.1| unknown [Glycine max]
          Length = 354

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/163 (33%), Positives = 77/163 (47%), Gaps = 18/163 (11%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWTQCEP---CPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W  C     CP +         FDP  SST   + 
Sbjct: 25  YYTKVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIA 84

Query: 148 CSSSQCASLNQKS---CSGVN--CQYSVSYGDGSFSNGNLATETVTL-----GSTTGQAV 197
           CS  +C +  Q S   CS  N  C Y+  YGDGS ++G   ++ + L     GS T  + 
Sbjct: 85  CSDQRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNST 144

Query: 198 ALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRT 237
           A   + FGC     G     +    GI G G  ++S+ISQ+ +
Sbjct: 145 AP--VVFGCSNQQTGDLTKSDRAVDGIFGFGQQEMSVISQLSS 185


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 118 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 176

Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C       C ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 177 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 236

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
              + FGC    +       +TGI+ LG G  S ++Q+
Sbjct: 237 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 274



 Score = 42.0 bits (97), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
            +G  +  +   +V   GG ++    +   +AG +    +   +++DSGTTL +LP    
Sbjct: 323 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 380

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
             L   +   I      D T     CY  N    +   VT+ F  GAD++L  ++ F   
Sbjct: 381 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 440

Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             ++ED VC        +  I G   Q N  VGYD+    ++F    C +
Sbjct: 441 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 488


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 98  GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 156

Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C       C ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 157 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 216

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
              + FGC    +       +TGI+ LG G  S ++Q+
Sbjct: 217 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 254



 Score = 41.6 bits (96), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
            +G  +  +   +V   GG ++    +   +AG +    +   +++DSGTTL +LP    
Sbjct: 301 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 358

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
             L   +   I      D T     CY  N    +   VT+ F  GAD++L  ++ F   
Sbjct: 359 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 418

Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             ++ED VC        +  I G   Q N  VGYD+    ++F    C +
Sbjct: 419 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 466


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/158 (31%), Positives = 71/158 (44%), Gaps = 10/158 (6%)

Query: 87  NNANYLIRISIGTPPTE---RLAVADTGSDLIWTQCEPCPPSQCYMQDSPLFDPKMSSTY 143
             + YL+++ IGTP      R  + DTGSDL WTQCEPC     +    P  DP  S T+
Sbjct: 100 GGSTYLVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPP-HDPSKSRTF 158

Query: 144 KSLPCSSSQ---CASLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLGST-TGQAVAL 199
           + L C       C ++         C +   YGDG   +G L ++    G+   G    L
Sbjct: 159 RRLSCFDPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQL 218

Query: 200 P-GITFGCG-TNNGGLFNSKTTGIVGLGGGDISLISQM 235
              + FGC    +       +TGI+ LG G  S ++Q+
Sbjct: 219 ERDVAFGCAHVEDSKAVRGYSTGILALGIGKPSFVTQL 256



 Score = 42.0 bits (97), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 43/170 (25%), Positives = 71/170 (41%), Gaps = 9/170 (5%)

Query: 209 NNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGNQRLGVSTPDIVIDSGTTLTFLPQGYN 268
            +G  +  +   +V   GG ++    +   +AG +    +   +++DSGTTL +LP    
Sbjct: 305 QDGSGYAVRLKSVVYQHGGRLNQQQPVPVYVAGEE--AAAAMPMLVDSGTTLLWLPGSVF 362

Query: 269 SNLLSVMSSMIEAQPVADPTGSLELCYSFNSLS-QVPEVTIHF-RGADVKLSRSNFFV-- 324
             L   +   I      D T     CY  N    +   VT+ F  GAD++L  ++ F   
Sbjct: 363 YPLQRRIEEDISLTRRYDLTHPSLYCYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTD 422

Query: 325 -KVSEDIVCSVFKGITNSVPIYGNIMQTNFLVGYDIEQQTVSFKPTDCTK 373
             ++ED VC        +  I G   Q N  VGYD+    ++F    C +
Sbjct: 423 ENLTEDWVCLAVAAGNRA--ILGVYPQRNINVGYDLSTMEIAFDRDQCDR 470


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 49/161 (30%), Positives = 77/161 (47%), Gaps = 14/161 (8%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIWT---QCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +G+PP E     DTGSD++W     C  CP +         FD   SST   + 
Sbjct: 66  YFTKVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVH 125

Query: 148 CSSSQCASLNQKSCSGVN-----CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVALPG- 201
           CS   C S  Q + +  +     C Y+  Y DGS ++G   ++T+   +  G+++ +   
Sbjct: 126 CSDPICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAILGESLVVNSS 185

Query: 202 --ITFGCGTNNGG---LFNSKTTGIVGLGGGDISLISQMRT 237
             I FGC T   G   + +    GI G G G++S+ISQ+ T
Sbjct: 186 ALIVFGCSTFQSGDLTMTDKAVDGIFGFGQGELSVISQLST 226


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/164 (31%), Positives = 73/164 (44%), Gaps = 11/164 (6%)

Query: 90  NYLIRISIGTPPTERLAVADTGSDLIWTQCE--PCPPSQCYMQDSPLFDPKMSSTYKSLP 147
            Y +R  +GTP    L VADTGSDL W +C       S+        F P+ S T+  + 
Sbjct: 93  QYFVRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPIS 152

Query: 148 CSSSQCA-----SLNQKSCSGVNCQYSVSYGDGSFSNGNLATETVTLG-STTG---QAVA 198
           C+S  C      SL      G  C Y   Y DGS + G + TE+ T+  S  G   +   
Sbjct: 153 CASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAK 212

Query: 199 LPGITFGCGTNNGGLFNSKTTGIVGLGGGDISLISQMRTTIAGN 242
           L G+  GC ++  G     + G++ LG  D+S  S   +  AG 
Sbjct: 213 LKGLVLGCTSSYTGPSFEVSDGVLSLGYSDVSFASHAASRFAGR 256



 Score = 54.7 bits (130), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 33/128 (25%), Positives = 67/128 (52%), Gaps = 10/128 (7%)

Query: 252 IVIDSGTTLTFLPQGYNSNLLSVMSSMIEAQP--VADPTGSLELCYSFNSLS---QVPEV 306
           +++DSGT+LT L +     +++ +S  +   P    DP    E CY++ S S    +P++
Sbjct: 351 VILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTMDP---FEYCYNWTSPSGDVTLPKM 407

Query: 307 TIHFRGA-DVKLSRSNFFVKVSEDIVC-SVFKGITNSVPIYGNIMQTNFLVGYDIEQQTV 364
            +HF GA  ++    ++ +  +  + C  + +G    + + GNI+Q   L  +DI+ + +
Sbjct: 408 AVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPGISVIGNILQQEHLWEFDIKNRRL 467

Query: 365 SFKPTDCT 372
            F+ + CT
Sbjct: 468 KFQRSRCT 475


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 98/213 (46%), Gaps = 26/213 (12%)

Query: 12  FFLCFYVVSPIEAQT-------GGFSVELIHRDSPKSPFYNSSETPYQR-LRDALTRSLN 63
           F LC ++ S ++AQT        G ++++ H  S  SPF  S    ++  + +   +   
Sbjct: 10  FLLCLFI-SLVQAQTPKCDIQDDGSTLKVFHIFSQCSPFKPSKPMSWEESVLNLQAKDQA 68

Query: 64  RLNHFNQNSSISSSKA----SQADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQC 119
           R+ +F   SS+ + K+    + A  I  +  Y+++   GTPP   L   DT SD  W  C
Sbjct: 69  RMQYF---SSLVARKSVVPIASARQIIQSPTYIVKAKFGTPPQTLLLALDTSSDAAWIPC 125

Query: 120 EPCPPSQCYMQDSPLFDPKMSSTYKSLPCSSSQCASLNQKSCSGVNCQYSVSYGDGSFSN 179
             C         S  F P  S++++++ C S  C  +   +C G  C ++ +YG  S + 
Sbjct: 126 SGC----VGCSTSKPFAPIKSTSFRNVSCGSPHCKQVPNPTCGGSACAFNFTYGSSSIA- 180

Query: 180 GNLATETVTLGSTTGQAVALPGITFGCGTNNGG 212
            ++  +T+TL +       +PG TFGC     G
Sbjct: 181 ASVVQDTLTLATD-----PIPGYTFGCVNKTTG 208


>gi|18390865|ref|NP_563808.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|11993877|gb|AAG42922.1|AF329505_1 unknown protein [Arabidopsis thaliana]
 gi|20260142|gb|AAM12969.1| unknown protein [Arabidopsis thaliana]
 gi|22136092|gb|AAM91124.1| unknown protein [Arabidopsis thaliana]
 gi|332190140|gb|AEE28261.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 492

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/363 (23%), Positives = 147/363 (40%), Gaps = 82/363 (22%)

Query: 91  YLIRISIGTPPTERLAVADTGSDLIW---TQCEPCPPSQCYMQDSPLFDPKMSSTYKSLP 147
           Y  ++ +GTPP E     DTGSD++W   T C  CP +         FDP +SS+   + 
Sbjct: 84  YYTKVKLGTPPREFNVQIDTGSDVLWVSCTSCNGCPKTSELQIQLSFFDPGVSSSASLVS 143

Query: 148 CSSSQCAS--LNQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQAVAL---PG 201
           CS  +C S    +  CS  N C YS  YGDGS ++G   ++ ++  +     +A+     
Sbjct: 144 CSDRRCYSNFQTESGCSPNNLCSYSFKYGDGSGTSGYYISDFMSFDTVITSTLAINSSAP 203

Query: 202 ITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQMRTT----------IAGNQRLG-- 246
             FGC     G          GI GLG G +S+ISQ+             + G++  G  
Sbjct: 204 FVFGCSNLQSGDLQRPRRAVDGIFGLGQGSLSVISQLAVQGLAPRVFSHCLKGDKSGGGI 263

Query: 247 -----VSTPDIV-----------------------------------------IDSGTTL 260
                +  PD V                                         ID+GTTL
Sbjct: 264 MVLGQIKRPDTVYTPLVPSQPHYNVNLQSIAVNGQILPIDPSVFTIATGDGTIIDTGTTL 323

Query: 261 TFLPQGYNSNLLSVMSSMIE--AQPVADPTGSLELCYSFNS--LSQVPEVTIHFRGADVK 316
            +LP    S  +  +++ +    +P+   T     C+   +  +   P+V++ F G    
Sbjct: 324 AYLPDEAYSPFIQAVANAVSQYGRPI---TYESYQCFEITAGDVDVFPQVSLSFAGGASM 380

Query: 317 LSRSNFFVKV----SEDIVCSVFKGITN-SVPIYGNIMQTNFLVGYDIEQQTVSFKPTDC 371
           +     ++++       I C  F+ +++  + I G+++  + +V YD+ +Q + +   DC
Sbjct: 381 VLGPRAYLQIFSSSGSSIWCIGFQRMSHRRITILGDLVLKDKVVVYDLVRQRIGWAEYDC 440

Query: 372 TKQ 374
           + +
Sbjct: 441 SLE 443


>gi|115448471|ref|NP_001048015.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|46390468|dbj|BAD15929.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|46390864|dbj|BAD16368.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113537546|dbj|BAF09929.1| Os02g0730700 [Oryza sativa Japonica Group]
 gi|215697021|dbj|BAG91015.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222623612|gb|EEE57744.1| hypothetical protein OsJ_08261 [Oryza sativa Japonica Group]
          Length = 573

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 74/162 (45%), Gaps = 14/162 (8%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + ++ P+   Y   I +G PP       DTGSDL W QC+ PC  + C     PL+ P  
Sbjct: 194 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 248

Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +  K +P     C  L  NQ  C     C Y + Y D S S G LA + + + +T G  
Sbjct: 249 -AKEKIVPPKDLLCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGR 307

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
             L    FGC  +  G      +KT GI+GL    ISL SQ+
Sbjct: 308 EKL-DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQL 348


>gi|218191512|gb|EEC73939.1| hypothetical protein OsI_08807 [Oryza sativa Indica Group]
          Length = 574

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 74/162 (45%), Gaps = 14/162 (8%)

Query: 81  QADIIPNNANYLIRISIGTPPTERLAVADTGSDLIWTQCE-PCPPSQCYMQDSPLFDPKM 139
           + ++ P+   Y   I +G PP       DTGSDL W QC+ PC  + C     PL+ P  
Sbjct: 195 KGNVFPD-GQYYTSIFVGNPPRPYFLDVDTGSDLTWIQCDAPC--TNCAKGPHPLYKP-- 249

Query: 140 SSTYKSLPCSSSQCASL--NQKSCSGVN-CQYSVSYGDGSFSNGNLATETVTLGSTTGQA 196
            +  K +P     C  L  NQ  C     C Y + Y D S S G LA + + + +T G  
Sbjct: 250 -AKEKIVPPKDLLCQELQGNQNYCETCKQCDYEIEYADRSSSMGVLARDDMHIITTNGGR 308

Query: 197 VALPGITFGCGTNNGGLF---NSKTTGIVGLGGGDISLISQM 235
             L    FGC  +  G      +KT GI+GL    ISL SQ+
Sbjct: 309 EKL-DFVFGCAYDQQGQLLASPAKTDGILGLSSAGISLPSQL 349


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.132    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,846,866,689
Number of Sequences: 23463169
Number of extensions: 248656532
Number of successful extensions: 685059
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1231
Number of HSP's successfully gapped in prelim test: 3131
Number of HSP's that attempted gapping in prelim test: 675673
Number of HSP's gapped (non-prelim): 7136
length of query: 374
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 230
effective length of database: 8,980,499,031
effective search space: 2065514777130
effective search space used: 2065514777130
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)