BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 036636
         (341 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255563739|ref|XP_002522871.1| DNA binding protein, putative [Ricinus communis]
 gi|223537955|gb|EEF39569.1| DNA binding protein, putative [Ricinus communis]
          Length = 414

 Score =  180 bits (456), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 184/348 (52%), Gaps = 34/348 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
           +V++ IG P   + L+ DTGSALI+ + +       Q I         F+C N +C YT 
Sbjct: 92  LVKVRIGNPGIPLYLVPDTGSALIWTVNN-------QNI---------FQCRNNKCSYTR 135

Query: 61  KYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSR 120
           +Y D S+T G AA + +    +G  +  F+   FGCS DN  F      G   GV+GL+ 
Sbjct: 136 RYDDGSITTGVAAQDILQ--SEGSERIPFY---FGCSRDNQNFSVFEHTGKSGGVMGLNT 190

Query: 121 VTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMGYRRPSTQATKFINHPN 178
             +S + QL  I ++RFSYCL  P  +G     SS L+FG D+   R   Q+T  ++ P+
Sbjct: 191 SPVSLLQQLSHITQRRFSYCLN-PYQHGSEPPPSSLLRFGNDIRKGRRRFQSTPLMSSPD 249

Query: 179 N-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
              Y+L+L D+++  +R++ PP TF +   G GG IIDSG+ LT+     Y +L   F +
Sbjct: 250 RPNYFLNLLDMTVAGQRLHLPPGTFALRQDGTGGTIIDSGTGLTFITQTAYPRLISAFQN 309

Query: 238 YFER--FQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDANLRIDGENVFIIDYEN 293
           YF+   FQ   +   PE   LCY      TF+   SM F+FE A+  +  + V++   ++
Sbjct: 310 YFDHRGFQRVHI---PE-FDLCYSFRGNHTFHDHASMTFHFERADFTVQADYVYLPMEDD 365

Query: 294 HFFLLAVAP-HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
           + F +A+ P       +IG+  Q +TRF+YD     L F+ ENC +D+
Sbjct: 366 NAFCVALQPTPPQQRTVIGAINQGNTRFIYDAAAHQLLFIAENCRNDA 413


>gi|255563741|ref|XP_002522872.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223537956|gb|EEF39570.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 448

 Score =  174 bits (440), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 188/368 (51%), Gaps = 40/368 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V++ IG+P   + L+ DTGS L +               IF+   S +++ + C H  C
Sbjct: 92  LVKVIIGSPGVPLYLVPDTGSGLFWTQCEPCTRRFRQLPPIFNSTASRTYRDLPCQHQFC 151

Query: 47  T----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           T     F+C +++CVY + YA  S T G AA +   ++   E   I     FGCS DN  
Sbjct: 152 TNNQNVFQCRDDKCVYRIAYAGGSATAGVAAQD---ILQSAENDRI--PFYFGCSRDNQN 206

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM 161
           F      G   G++GL+   +S + Q+  I K RFSYCL +  L +  + +S L+FG D+
Sbjct: 207 FSTFESSGKGGGIIGLNMSPVSLLQQMNHITKNRFSYCLNLFDLSSPSHATSLLRFGNDI 266

Query: 162 GYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
              R    +T F++    PN  Y+L+L D+S+   RM  PP TF +   G GG IIDSG+
Sbjct: 267 RKSRRKYLSTPFVSPRGMPN--YFLNLIDVSVAGNRMQIPPGTFALKPDGTGGTIIDSGT 324

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL----CYFLP-ETFNRFPSMAFY 273
            +TY     Y+ +   F +YF++    +++     IQL    CY     TF+ +PSMAF+
Sbjct: 325 AVTYISQTAYFPVITAFKNYFDQHGFQRVN-----IQLSGYICYKQQGHTFHNYPSMAFH 379

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAP-HDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F+ A+  ++ E V++   +   F +A+ P       +IG+  Q +T+F+YD     L F 
Sbjct: 380 FQGADFFVEPEYVYLTVQDRGAFCVALQPISPQQRTIIGALNQANTQFIYDAANRQLLFT 439

Query: 333 KENCSDDS 340
            ENC D +
Sbjct: 440 PENCQDHA 447


>gi|449523529|ref|XP_004168776.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 461

 Score =  165 bits (418), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 182/364 (50%), Gaps = 45/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IG+P +    I+DTGS LI+               IFDP++SSSF KI+C    C
Sbjct: 112 LMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELC 171

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C ++ C Y   Y D S T+G  A ET +     E +    G  FGC NDN+G 
Sbjct: 172 GALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNG- 230

Query: 104 DEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
                DG    AG++GL R  +S +SQL    +++F+YCL       +   S L  G+ +
Sbjct: 231 -----DGFSQGAGLVGLGRGPLSLVSQLK---EQKFAYCLTAI---DDSKPSSLLLGS-L 278

Query: 162 GYRRPST-----QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
               P T     + T  I +P+  +FYYLSL+ IS+   +++ P  TF++   G GG II
Sbjct: 279 ANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVII 338

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
           DSG+ +TY  +  +  L  +F++   +  L         + LC+ LP   N+   P + F
Sbjct: 339 DSGTTITYVENSAFTSLKNEFIA---QMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 395

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F+ A+L + GEN  I D +     LA+     + ++ G+ QQ++   V+DL  + LSF+
Sbjct: 396 HFKGADLELPGENYMIGDSKAGLLCLAIGSSRGM-SIFGNLQQQNFMVVHDLQEETLSFL 454

Query: 333 KENC 336
              C
Sbjct: 455 PTQC 458


>gi|449453902|ref|XP_004144695.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like, partial [Cucumis sativus]
          Length = 716

 Score =  164 bits (416), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 117/364 (32%), Positives = 182/364 (50%), Gaps = 45/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IG+P +    I+DTGS LI+               IFDP++SSSF KI+C    C
Sbjct: 367 LMKLAIGSPPRSFSAIMDTGSDLIWTQCKPCQQCFDQSTPIFDPKQSSSFYKISCSSELC 426

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C ++ C Y   Y D S T+G  A ET +     E +    G  FGC NDN+G 
Sbjct: 427 GALPTSTCSSDGCEYLYTYGDSSSTQGVLAFETFTFGDSTEDQISIPGLGFGCGNDNNG- 485

Query: 104 DEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
                DG    AG++GL R  +S +SQL    +++F+YCL       +   S L  G+ +
Sbjct: 486 -----DGFSQGAGLVGLGRGPLSLVSQLK---EQKFAYCLTAI---DDSKPSSLLLGS-L 533

Query: 162 GYRRPST-----QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
               P T     + T  I +P+  +FYYLSL+ IS+   +++ P  TF++   G GG II
Sbjct: 534 ANITPKTSKDEMKTTPLIKNPSQPSFYYLSLQGISVGGTQLSIPKSTFELHDDGSGGVII 593

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
           DSG+ +TY  +  +  L  +F++   +  L         + LC+ LP   N+   P + F
Sbjct: 594 DSGTTITYVENSAFTSLKNEFIA---QMNLPVDDSGTGGLDLCFNLPAGTNQVEVPKLTF 650

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F+ A+L + GEN  I D +     LA+     + ++ G+ QQ++   V+DL  + LSF+
Sbjct: 651 HFKGADLELPGENYMIGDSKAGLLCLAIGSSRGM-SIFGNLQQQNFMVVHDLQEETLSFL 709

Query: 333 KENC 336
              C
Sbjct: 710 PTQC 713


>gi|357158688|ref|XP_003578209.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 443

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 116/363 (31%), Positives = 169/363 (46%), Gaps = 37/363 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP +    ILDTGS LI+                FDP +S S+ K+ C+ P C
Sbjct: 90  LMSMGIGTPPRYYSAILDTGSDLIWTQCAPCMLCVDQPTPFFDPAQSPSYAKLPCNSPMC 149

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               Y  C    CVY   Y D + T G  ++ET +  G  + +       FGC N N G 
Sbjct: 150 NALYYPLCYRNVCVYQYFYGDSANTAGVLSNETFT-FGTNDTRVTVPRIAFGCGNLNAGS 208

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
             +      +G++G  R  +S +SQLGS    RFSYCL   + P+P+  Y  +Y    + 
Sbjct: 209 LFNG-----SGMVGFGRGPLSLVSQLGS---PRFSYCLTSFMSPVPSRLYFGAYATLNST 260

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
                   Q+T FI +P     YYL++  IS+  E +   P  F I    G GG IIDSG
Sbjct: 261 SASTGEPVQSTPFIVNPGLPTMYYLNMTGISVGGELLPIDPSVFAINDADGTGGVIIDSG 320

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
           S +TY     Y  +H+ F        L   +   + +  C+  P    +    P +AF+F
Sbjct: 321 STITYLARAAYDMVHQAFADQVG-LPLTNATSLADVLDTCFVWPPPPRKIVTMPELAFHF 379

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           E AN+ +  EN  +ID +     LA+A  DD  ++IGS Q ++   +YD    LLSF   
Sbjct: 380 EGANMELPLENYMLIDGDTGNLCLAIAASDD-GSIIGSFQHQNFHVLYDNENSLLSFTPA 438

Query: 335 NCS 337
            C+
Sbjct: 439 TCN 441


>gi|224286173|gb|ACN40797.1| unknown [Picea sitchensis]
          Length = 383

 Score =  155 bits (391), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 109/357 (30%), Positives = 171/357 (47%), Gaps = 40/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDP------------RKSSSFQKINCDHPDC-- 46
           ++++ IGTP+  +  I+DTGS L++   +P              SS++ K+ C    C  
Sbjct: 43  LIQMAIGTPALSLSAIMDTGSDLVWTKCNPCTDCSTSSIYDPSSSSTYSKVLCQSSLCQP 102

Query: 47  -TYFKCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
            + F C N+  C Y   Y D+S T G  + ET S+  +           FGC +DN GFD
Sbjct: 103 PSIFSCNNDGDCEYVYPYGDRSSTSGILSDETFSISSQS-----LPNITFGCGHDNQGFD 157

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
           +      + G++G  R ++S +SQLG  +  +FSYCLV    + +  +S L  G      
Sbjct: 158 K------VGGLVGFGRGSLSLVSQLGPSMGNKFSYCLVSRTDSSK--TSPLFIGNTASLE 209

Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
             +  +T  +   + N YYLSL+ IS+  + +  P  TFDI   G GG IIDSG+ LT+ 
Sbjct: 210 ATTVGSTPLVQSSSTNHYYLSLEGISVGGQSLAIPTGTFDIQSDGSGGLIIDSGTTLTFL 269

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRID 282
               Y  + E  VS         L      + LC+    + N  FPSM F+F+ A+  + 
Sbjct: 270 QQTAYDAVKEAMVSSIN------LPQADGQLDLCFNQQGSSNPGFPSMTFHFKGADYDVP 323

Query: 283 GENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            EN    D  +    LA+ P +     +A+ G+ QQ++ + +YD   ++LSF    C
Sbjct: 324 KENYLFPDSTSDIVCLAMMPTNSNLGNMAIFGNVQQQNYQILYDNENNVLSFAPTAC 380


>gi|225438315|ref|XP_002272802.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 436

 Score =  153 bits (387), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 115/358 (32%), Positives = 174/358 (48%), Gaps = 44/358 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP++    I+DTGS LI+               IFDP+KSSSF K+ C    C
Sbjct: 98  LMKLAIGTPAETYSAIMDTGSDLIWTQCKPCKDCFDQPTPIFDPKKSSSFSKLPCSSDLC 157

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--NDNHG 102
                   ++ C Y   Y D S T+G  A ET +      G A      FGC   ND  G
Sbjct: 158 AALPISSCSDGCEYLYSYGDYSSTQGVLATETFAF-----GDASVSKIGFGCGEDNDGSG 212

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
           F + A      G++GL R  +S ISQLG   + +FSYCL   + + +  SS L  G++  
Sbjct: 213 FSQGA------GLVGLGRGPLSLISQLG---EPKFSYCLT-SMDDSKGISSLL-VGSEAT 261

Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
            +   T  T  I +P+  +FYYLSL+ IS+ +  +     TF I   G GG IIDSG+ +
Sbjct: 262 MKNAIT--TPLIQNPSQPSFYYLSLEGISVGDTLLPIEKSTFSIQNDGSGGLIIDSGTTI 319

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDAN 278
           TY     +  L ++F+S   + +L         + LC+ LP   +    P + F+FE A+
Sbjct: 320 TYLEDSAFAALKKEFIS---QLKLDVDESGSTGLDLCFTLPPDASTVDVPQLVFHFEGAD 376

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L++  EN  I D       L +     + ++ G+ QQ++   ++DL  + +SF    C
Sbjct: 377 LKLPAENYIIADSGLGVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|242079447|ref|XP_002444492.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
 gi|241940842|gb|EES13987.1| hypothetical protein SORBIDRAFT_07g022780 [Sorghum bicolor]
          Length = 441

 Score =  150 bits (380), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 163/361 (45%), Gaps = 37/361 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP      I+DTGS LI+                FD ++S++++ + C    C
Sbjct: 90  LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCAAQPTPYFDVKRSATYRALPCRSSRC 149

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C  + CVY   Y D + T G  A+ET +       K       FGC + N G 
Sbjct: 150 AALSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAASSTKVRAANISFGCGSLNAGE 209

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
             ++     +G++G  R  +S +SQLG     RFSYCL   + P P+  Y   +    + 
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSPTPSRLYFGVFANLNST 261

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                   Q+T F+ +P   N Y+LS+K IS+  +R+   P  F I   G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSVKGISLGTKRLPIDPLVFAINDDGTGGVIIDSGT 321

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFE 275
            +T+   D Y  +     S      L  ++D    +  C+  P   N     P   F+F+
Sbjct: 322 SITWLQQDAYEAVRRGLAS---TIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDFVFHFD 378

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            AN+ +  EN  +I     +  LA+AP   +  +IG+ QQ++   +YD+    LSFV   
Sbjct: 379 GANMTLPPENYMLIASTTGYLCLAMAP-TSVGTIIGNYQQQNLHLLYDIANSFLSFVPAP 437

Query: 336 C 336
           C
Sbjct: 438 C 438


>gi|30678047|ref|NP_565298.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|30102688|gb|AAP21262.1| At2g03200 [Arabidopsis thaliana]
 gi|110736021|dbj|BAE99983.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330250580|gb|AEC05674.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 461

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/369 (30%), Positives = 172/369 (46%), Gaps = 51/369 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IG P+     I+DTGS LI+               IFDP KSSS+ K+ C    C
Sbjct: 108 LMELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 167

Query: 47  TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--ND 99
                 N     + C Y   Y D S T+G  A ET +     E +    G  FGC   N+
Sbjct: 168 NALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENE 223

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----L 155
             GF + +      G++GL R  +S ISQL    + +FSYCL   + + E +SS     L
Sbjct: 224 GDGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSL 273

Query: 156 KFG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             G    T        T+    + +P+  +FYYL L+ I++  +R++    TF++   G 
Sbjct: 274 ASGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGT 333

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-- 267
           GG IIDSG+ +TY     +  L E+F S   R  L         + LC+ LP+       
Sbjct: 334 GGMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPDAAKNIAV 390

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           P M F+F+ A+L + GEN  + D       LA+   + + ++ G+ QQ++   ++DL  +
Sbjct: 391 PKMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKE 449

Query: 328 LLSFVKENC 336
            +SFV   C
Sbjct: 450 TVSFVPTEC 458


>gi|356516413|ref|XP_003526889.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  150 bits (378), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 181/358 (50%), Gaps = 38/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP      +LDTGS LI+               IFDP+KSSSF K++C    C
Sbjct: 109 LIELAIGTPPVSYPAVLDTGSDLIWTQCKPCTRCYKQPTPIFDPKKSSSFSKVSCGSSLC 168

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           +       ++ C Y   Y D S+T+G  A ET +  GK + K   H   FGC  DN G  
Sbjct: 169 SALPSSTCSDGCEYVYSYGDYSMTQGVLATETFT-FGKSKNKVSVHNIGFGCGEDNEG-- 225

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            D  + A +G++GL R  +S +SQL    ++RFSYCL    P  +   S L  G+ +G  
Sbjct: 226 -DGFEQA-SGLVGLGRGPLSLVSQLK---EQRFSYCLT---PIDDTKESVLLLGS-LGKV 276

Query: 165 RPSTQ--ATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           + + +   T  + +P   +FYYLSL+ IS+ + R++    TF++   G GG IIDSG+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEAISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
           TY     Y  L ++F+S   + +LA        + LC+ LP   T    P + F+F+  +
Sbjct: 337 TYVQQKAYEALKKEFIS---QTKLALDKTSSTGLDLCFSLPSGSTQVEIPKLVFHFKGGD 393

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  EN  I D       LA+     + ++ G+ QQ++    +DL  + +SFV  +C
Sbjct: 394 LELPAENYMIGDSNLGVACLAMGASSGM-SIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|242044886|ref|XP_002460314.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
 gi|241923691|gb|EER96835.1| hypothetical protein SORBIDRAFT_02g026340 [Sorghum bicolor]
          Length = 444

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 111/365 (30%), Positives = 169/365 (46%), Gaps = 43/365 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP++    ILDTGS LI+                FDP  SS+++ + C  P C
Sbjct: 93  LMEMGIGTPARFYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPANSSTYRSLGCSAPAC 152

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               Y  C  + CVY   Y D + T G  A+ET +  G  + +       FGC N N G 
Sbjct: 153 NALYYPLCYQKTCVYQYFYGDSASTAGVLANETFT-FGTNDTRVTLPRISFGCGNLNAGS 211

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
             +      +G++G  R ++S +SQLGS    RFSYCL   + P+ +  Y  +Y    + 
Sbjct: 212 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVRSRLYFGAYATLNST 263

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
                 + Q+T FI +P     Y+L++  IS+   R+   P    I    G GG IIDSG
Sbjct: 264 ---NASTVQSTPFIINPALPTMYFLNMTGISVGGNRLPIDPAVLAINDTDGTGGTIIDSG 320

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAF 272
           + +TY     Y+ + E FV Y        L D  E   +  C+  P    +    P +  
Sbjct: 321 TTITYLAEPAYYAVREAFVLYLN--STLPLLDVTETSVLDTCFQWPPPPRQSVTLPQLVL 378

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F+ A+  +  +N  ++D       LA+A   D  ++IGS Q ++   +YDL   LLSFV
Sbjct: 379 HFDGADWELPLQNYMLVDPSTGGLCLAMATSSD-GSIIGSYQHQNFNVLYDLENSLLSFV 437

Query: 333 KENCS 337
              C+
Sbjct: 438 PAPCN 442


>gi|242073260|ref|XP_002446566.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
 gi|241937749|gb|EES10894.1| hypothetical protein SORBIDRAFT_06g018170 [Sorghum bicolor]
          Length = 452

 Score =  149 bits (377), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 177/366 (48%), Gaps = 47/366 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 101 LMDVAIGTPALSYAAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSALC 160

Query: 47  TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
           +      C +  +C YT  Y D S T+G  A ET ++   G+ K    G  FGC  +N+ 
Sbjct: 161 SDLPTSTCTSASKCGYTYTYGDASSTQGVLASETFTL---GKEKKKLPGVAFGCGDTNEG 217

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF + A      G++GL R  +S +SQLG     +FSYCL   L +G+  S  L  G+ 
Sbjct: 218 DGFTQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT-SLDDGDGKSPLLLGGSA 267

Query: 161 MGYRRPS----TQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                 +     Q T  + +P+  +FYY+SL  +++ + R+  P   F I   G GG I+
Sbjct: 268 AAISESAATAPVQTTPLVKNPSQPSFYYVSLTGLTVGSTRITLPASAFAIQDDGTGGVIV 327

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMA 271
           DSG+ +TY     Y  L + FV+   +  L  +      + LC+  P       + P + 
Sbjct: 328 DSGTSITYLELQGYRALKKAFVA---QMALPTVDGSEIGLDLCFQGPAKGVDEVQVPKLV 384

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            +F+  A+L +  EN  ++D  +    L VAP   L ++IG+ QQ++ +FVYD+  D LS
Sbjct: 385 LHFDGGADLDLPAENYMVLDSASGALCLTVAPSRGL-SIIGNFQQQNFQFVYDVAGDTLS 443

Query: 331 FVKENC 336
           F    C
Sbjct: 444 FAPVQC 449


>gi|409179878|gb|AFV26024.1| aspartic proteinase nepenthesin 1 [Nepenthes mirabilis]
          Length = 437

 Score =  149 bits (376), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 107/358 (29%), Positives = 168/358 (46%), Gaps = 41/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP++    I+DTGS LI+               IF+P+ SSSF  + C    C
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
              +   C N  C YT  Y D S T+G    ET++      G        FGC  +N GF
Sbjct: 156 QALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +    G  AG++G+ R  +S  SQL      +FSYC+    P G  TSS L  G+    
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSTSSTLLLGSLANS 260

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
               +  T  I       FYY++L  +S+ +  +   P  F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIESSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
           TYF  + Y  + + F+S   +  L+ ++       LC+ +P  ++  + P+   +F+  +
Sbjct: 321 TYFADNAYQAVRQAFIS---QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD 377

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  EN F I   N    LA+      +++ G+ QQ++   VYD    ++SF+   C
Sbjct: 378 LVLPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLFAQC 434


>gi|20197342|gb|AAC34482.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 353

 Score =  149 bits (375), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 171/368 (46%), Gaps = 51/368 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + L IG P+     I+DTGS LI+               IFDP KSSS+ K+ C    C 
Sbjct: 1   MELSIGNPAVKYSAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLCN 60

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--NDN 100
                N     + C Y   Y D S T+G  A ET +     E +    G  FGC   N+ 
Sbjct: 61  ALPRSNCNEDKDACEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENEG 116

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----LK 156
            GF + +      G++GL R  +S ISQL    + +FSYCL   + + E +SS     L 
Sbjct: 117 DGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSLA 166

Query: 157 FG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            G    T        T+    + +P+  +FYYL L+ I++  +R++    TF++   G G
Sbjct: 167 SGIVNKTGASLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELAEDGTG 226

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--P 268
           G IIDSG+ +TY     +  L E+F S   R  L         + LC+ LP+       P
Sbjct: 227 GMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPDAAKNIAVP 283

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            M F+F+ A+L + GEN  + D       LA+   + + ++ G+ QQ++   ++DL  + 
Sbjct: 284 KMIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKET 342

Query: 329 LSFVKENC 336
           +SFV   C
Sbjct: 343 VSFVPTEC 350


>gi|226503109|ref|NP_001147206.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195608496|gb|ACG26078.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|413921850|gb|AFW61782.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 441

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 109/361 (30%), Positives = 162/361 (44%), Gaps = 37/361 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP      I+DTGS LI+                FD +KS++++ + C    C
Sbjct: 90  LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C  + CVY   Y D + T G  A+ET +       K       FGC + N G 
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTD 160
             ++     +G++G  R  +S +SQLG     RFSYCL   L   P+  Y   Y    + 
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSST 261

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                   Q+T F+ +P   N Y+LSLK IS+  + +   P  F I   G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGT 321

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFE 275
            +T+   D Y  +    VS      L  ++D    +  C+  P   N     P + F+F+
Sbjct: 322 SITWLQQDAYEAVRRGLVS---AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFD 378

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            AN+ +  EN  +I     +  L +AP   +  +IG+ QQ++   +YD+    LSFV   
Sbjct: 379 SANMTLLPENYMLIASTTGYLCLVMAP-TGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAP 437

Query: 336 C 336
           C
Sbjct: 438 C 438


>gi|409179880|gb|AFV26025.1| aspartic proteinase nepenthesin 2 [Nepenthes mirabilis]
          Length = 437

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 105/355 (29%), Positives = 170/355 (47%), Gaps = 36/355 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+  +  I+DTGS LI+               IF+P+ SSSF  + C+   C
Sbjct: 97  LMNVAIGTPASSLSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC 156

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C N+ C YT  Y D S T+G+ A ET +     E  ++ + A FGC  DN GF
Sbjct: 157 QDLPSESCYND-CQYTYGYGDGSSTQGYMATETFTF----ETSSVPNIA-FGCGEDNQGF 210

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +    G  AG++G+    +S  SQLG     +FSYC+     +   T +     + +  
Sbjct: 211 GQ----GNGAGLIGMGWGPLSLPSQLG---VGQFSYCMTSSGSSSPSTLALGSAASGVPE 263

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
             PST       +P  +YY++L+ I++  + +  P  TF +   G GG IIDSG+ LTY 
Sbjct: 264 GSPSTTLIHSSLNPT-YYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLTYL 322

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAFYFEDANLRI 281
             D Y  + + F    ++  L+ + +    +  C+ LP   +  + P ++  F+   L +
Sbjct: 323 PQDAYNAVAQAFT---DQINLSPVDESSSGLSTCFQLPSDGSTVQVPEISMQFDGGVLNL 379

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             ENV I   E    L   +     +++ G+ QQ++T+ +YDL    +SFV   C
Sbjct: 380 GEENVLISPAEGVICLAMGSSSQQGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 434


>gi|165292434|dbj|BAF98915.1| aspartic proteinase nepenthesin I [Nepenthes alata]
          Length = 437

 Score =  149 bits (375), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 106/358 (29%), Positives = 167/358 (46%), Gaps = 41/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP++    I+DTGS LI+               IF+P+ SSSF  + C    C
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
              +   C N  C YT  Y D S T+G    ET++      G        FGC  +N GF
Sbjct: 156 QALQSPTCSNNSCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +    G  AG++G+ R  +S  SQL      +FSYC+    P G   SS L  G+    
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSNSSTLLLGSLANS 260

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
               +  T  I       FYY++L  +S+ +  +   P  F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTPLPIDPSVFKLNSNNGTGGIIIDSGTTL 320

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
           TYF  + Y  + + F+S   +  L+ ++       LC+ +P  ++  + P+   +F+  +
Sbjct: 321 TYFVDNAYQAVRQAFIS---QMNLSVVNGSSSGFDLCFQMPSDQSNLQIPTFVMHFDGGD 377

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  EN F I   N    LA+      +++ G+ QQ++   VYD    ++SF+   C
Sbjct: 378 LVLPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNLLVVYDTGNSVVSFLSAQC 434


>gi|61214233|sp|Q766C3.1|NEP1_NEPGR RecName: Full=Aspartic proteinase nepenthesin-1; AltName:
           Full=Nepenthesin-I; Flags: Precursor
 gi|41016421|dbj|BAD07474.1| aspartic proteinase nepenthesin I [Nepenthes gracilis]
          Length = 437

 Score =  148 bits (374), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 107/358 (29%), Positives = 165/358 (46%), Gaps = 41/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP++    I+DTGS LI+               IF+P+ SSSF  + C    C
Sbjct: 96  LMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQLC 155

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C N  C YT  Y D S T+G    ET++      G        FGC  +N GF
Sbjct: 156 QALSSPTCSNNFCQYTYGYGDGSETQGSMGTETLTF-----GSVSIPNITFGCGENNQGF 210

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +    G  AG++G+ R  +S  SQL      +FSYC+    P G  T S L  G+    
Sbjct: 211 GQ----GNGAGLVGMGRGPLSLPSQLD---VTKFSYCMT---PIGSSTPSNLLLGSLANS 260

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
               +  T  I       FYY++L  +S+ + R+   P  F + + +G GG IIDSG+ L
Sbjct: 261 VTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDSGTTL 320

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDAN 278
           TYF ++ Y  + ++F+S   +  L  ++       LC+  P   +  + P+   +F+  +
Sbjct: 321 TYFVNNAYQSVRQEFIS---QINLPVVNGSSSGFDLCFQTPSDPSNLQIPTFVMHFDGGD 377

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  EN F I   N    LA+      +++ G+ QQ++   VYD    ++SF    C
Sbjct: 378 LELPSENYF-ISPSNGLICLAMGSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASAQC 434


>gi|147862576|emb|CAN79341.1| hypothetical protein VITISV_006338 [Vitis vinifera]
          Length = 436

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 112/356 (31%), Positives = 171/356 (48%), Gaps = 40/356 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP++    I+DTGS LI+               IFDP KSSSF K+ C    C
Sbjct: 98  LMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLC 157

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                   ++ C Y   Y D S T+G  A ET +      G A      FGC  DN G  
Sbjct: 158 VALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRA 212

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
                   AG++GL R  +S ISQLG     +FSYCL   + + +  S+ L  G++   +
Sbjct: 213 YSQG----AGLVGLGRGPLSLISQLG---VPKFSYCLT-SIDDSKGISTLL-VGSEATVK 263

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             S   T  I +P+  +FYYLSL+ IS+ +  +     TF I   G GG IIDSG+ +TY
Sbjct: 264 --SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDANLR 280
              + +  L ++F+S   + +L   +     ++LC+ LP   +    P + F+FE  +L+
Sbjct: 322 LKDNAFAALKKEFIS---QMKLDVDASGSTELELCFTLPPDGSPVEVPQLVFHFEGVDLK 378

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  EN  I D       L +     + ++ G+ QQ++   ++DL  + +SF    C
Sbjct: 379 LPKENYIIEDSALRVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|61214232|sp|Q766C2.1|NEP2_NEPGR RecName: Full=Aspartic proteinase nepenthesin-2; AltName:
           Full=Nepenthesin-II; Flags: Precursor
 gi|41016423|dbj|BAD07475.1| aspartic proteinase nepenthesin II [Nepenthes gracilis]
          Length = 438

 Score =  148 bits (373), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 168/357 (47%), Gaps = 39/357 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP      I+DTGS LI+               IF+P+ SSSF  + C+   C
Sbjct: 97  LMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQYC 156

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C N +C YT  Y D S T+G+ A ET +     E  ++ + A FGC  DN GF
Sbjct: 157 QDLPSETCNNNECQYTYGYGDGSTTQGYMATETFTF----ETSSVPNIA-FGCGEDNQGF 211

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +    G  AG++G+    +S  SQLG     +FSYC+      G  + S L  G+    
Sbjct: 212 GQ----GNGAGLIGMGWGPLSLPSQLG---VGQFSYCMT---SYGSSSPSTLALGSAASG 261

Query: 164 RRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               + +T  I+   N  +YY++L+ I++  + +  P  TF +   G GG IIDSG+ LT
Sbjct: 262 VPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGMIIDSGTTLT 321

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAFYFEDANL 279
           Y   D Y  + + F    ++  L  + +    +  C+  P   +  + P ++  F+   L
Sbjct: 322 YLPQDAYNAVAQAFT---DQINLPTVDESSSGLSTCFQQPSDGSTVQVPEISMQFDGGVL 378

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +  +N+ I   E    L   +     +++ G+ QQ++T+ +YDL    +SFV   C
Sbjct: 379 NLGEQNILISPAEGVICLAMGSSSQLGISIFGNIQQQETQVLYDLQNLAVSFVPTQC 435


>gi|297817972|ref|XP_002876869.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322707|gb|EFH53128.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 172/369 (46%), Gaps = 51/369 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IG P+     I+DTGS LI+               IFDP KSSS+ K+ C    C
Sbjct: 109 LMELSIGNPAVKYAAIVDTGSDLIWTQCKPCTECFDQPTPIFDPEKSSSYSKVGCSSGLC 168

Query: 47  TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS--ND 99
                 N     + C Y   Y D S T+G  A ET +     E +    G  FGC   N+
Sbjct: 169 NALPRSNCNEDKDSCEYLYTYGDYSSTRGLLATETFTF----EDENSISGIGFGCGVENE 224

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY----L 155
             GF + +      G++GL R  +S ISQL    + +FSYCL   + + E +SS     L
Sbjct: 225 GDGFSQGS------GLVGLGRGPLSLISQLK---ETKFSYCLT-SIEDSEASSSLFIGSL 274

Query: 156 KFG----TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             G    T        T+    + +P+  +FYYL L+ I++  +R++    TF+++  G 
Sbjct: 275 ASGIVNKTGANLDGEVTKTMSLLRNPDQPSFYYLELQGITVGAKRLSVEKSTFELSEDGT 334

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-- 267
           GG IIDSG+ +TY     +  L E+F S   R  L         + LC+ LP        
Sbjct: 335 GGMIIDSGTTITYLEETAFKVLKEEFTS---RMSLPVDDSGSTGLDLCFKLPNAAKNIAV 391

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           P + F+F+ A+L + GEN  + D       LA+   + + ++ G+ QQ++   ++DL  +
Sbjct: 392 PKLIFHFKGADLELPGENYMVADSSTGVLCLAMGSSNGM-SIFGNVQQQNFNVLHDLEKE 450

Query: 328 LLSFVKENC 336
            ++FV   C
Sbjct: 451 TVTFVPTEC 459


>gi|356557014|ref|XP_003546813.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 435

 Score =  147 bits (372), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 178/363 (49%), Gaps = 45/363 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++R +IG+P    L ++DTGS+LI+               +F+P KSS+++   CD   C
Sbjct: 90  LMRFYIGSPPVERLAMVDTGSSLIWLQCSPCHNCFPQETPLFEPLKSSTYKYATCDSQPC 149

Query: 47  TYFKCVNE------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
           T  +          QC+Y + Y D+S + G    ET+S    G  + + F   +FGC  D
Sbjct: 150 TLLQPSQRDCGKLGQCIYGIMYGDKSFSVGILGTETLSFGSTGGAQTVSFPNTIFGCGVD 209

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N+ F     +  + G+ GL    +S +SQLG+ I  +FSYCL   LP    ++S LKFG+
Sbjct: 210 NN-FTIYTSNKVM-GIAGLGAGPLSLVSQLGAQIGHKFSYCL---LPYDSTSTSKLKFGS 264

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           +         +T  I  P+   +Y+L+L+ ++I  + ++        T   +G  +IDSG
Sbjct: 265 EAIITTNGVVSTPLIIKPSLPTYYFLNLEAVTIGQKVVS--------TGQTDGNIVIDSG 316

Query: 218 SVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
           + LTY  +  Y      FV+   E   +  L D P P++ C+  P   N   P +AF F 
Sbjct: 317 TPLTYLENTFY----NNFVASLQETLGVKLLQDLPSPLKTCF--PNRANLAIPDIAFQFT 370

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A++ +  +NV I   +++   LAV P   + ++L GS  Q D +  YDL    +SF   
Sbjct: 371 GASVALRPKNVLIPLTDSNILCLAVVPSSGIGISLFGSIAQYDFQVEYDLEGKKVSFAPT 430

Query: 335 NCS 337
           +C+
Sbjct: 431 DCA 433


>gi|168025534|ref|XP_001765289.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683608|gb|EDQ70017.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 372

 Score =  147 bits (371), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 173/366 (47%), Gaps = 50/366 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V +++GTP +  ++I+DTGS L +               IFDP KSS++ KI C    C
Sbjct: 26  LVPIYLGTPPQKAVVIIDTGSDLTWIQSEPCRACFEQADPIFDPSKSSTYNKIACSSSAC 85

Query: 47  -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI-GKGE----GKAIFHGALFGC 96
                T        C+Y   Y D SVT+G+ + ETI+     GE    G ++++   FG 
Sbjct: 86  ADLLGTQTCSAAANCIYAYGYGDGSVTRGYFSKETITATDTAGEEVKFGASVYNTGTFG- 144

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
                       D    G+LGL +  +S  SQLGS++  +FSYCLV  L  G  TS+ + 
Sbjct: 145 ------------DTGGEGILGLGQGPVSMPSQLGSVLGNKFSYCLVDWLSAGSETST-MY 191

Query: 157 FGTDMGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
           FG D        Q T  +   +HP  +YY++++ IS+    ++     ++I   G GG I
Sbjct: 192 FG-DAAVPSGEVQYTPIVPNADHPT-YYYIAVQGISVGGSLLDIDQSVYEIDSGGSGGTI 249

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAF 272
           IDSG+ +TY   +V+  L   + S          +     + LC+    T +  FP+M  
Sbjct: 250 IDSGTTITYLQQEVFNALVAAYTSQVRYPTTTSATG----LDLCFNTRGTGSPVFPAMTI 305

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
           + +  +L +   N F I  E +   LA A   D  +A+ G+ QQ++   VYDL+   + F
Sbjct: 306 HLDGVHLELPTANTF-ISLETNIICLAFASALDFPIAIFGNIQQQNFDIVYDLDNMRIGF 364

Query: 332 VKENCS 337
              +C+
Sbjct: 365 APADCA 370


>gi|115458644|ref|NP_001052922.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|113564493|dbj|BAF14836.1| Os04g0448300 [Oryza sativa Japonica Group]
 gi|215766465|dbj|BAG98773.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767943|dbj|BAH00172.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 454

 Score =  147 bits (370), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 106 LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 165

Query: 47  TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
           +     KC +  +C YT  Y D S T+G  A ET ++      K+   G +FGC  +N+ 
Sbjct: 166 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 220

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF + A      G++GL R  +S +SQLG     +FSYCL       +  +S L  G+ 
Sbjct: 221 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 268

Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            G         S Q T  I +P+  +FYY+SLK I++ + R++ P   F +   G GG I
Sbjct: 269 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 328

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
           +DSG+ +TY     Y  L + F +   +  L         + LC+  P         P +
Sbjct: 329 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 385

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            F+F+  A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD+  D L
Sbjct: 386 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 444

Query: 330 SFVKENC 336
           SF    C
Sbjct: 445 SFAPVQC 451


>gi|38344829|emb|CAD40873.2| OSJNBa0064H22.10 [Oryza sativa Japonica Group]
 gi|116310063|emb|CAH67084.1| H0818E04.1 [Oryza sativa Indica Group]
 gi|116310186|emb|CAH67198.1| OSIGBa0152K17.10 [Oryza sativa Indica Group]
          Length = 444

 Score =  147 bits (370), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 96  LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 155

Query: 47  TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
           +     KC +  +C YT  Y D S T+G  A ET ++      K+   G +FGC  +N+ 
Sbjct: 156 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 210

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF + A      G++GL R  +S +SQLG     +FSYCL       +  +S L  G+ 
Sbjct: 211 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 258

Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            G         S Q T  I +P+  +FYY+SLK I++ + R++ P   F +   G GG I
Sbjct: 259 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 318

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
           +DSG+ +TY     Y  L + F +   +  L         + LC+  P         P +
Sbjct: 319 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 375

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            F+F+  A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD+  D L
Sbjct: 376 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 434

Query: 330 SFVKENC 336
           SF    C
Sbjct: 435 SFAPVQC 441


>gi|225437854|ref|XP_002264056.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
          Length = 436

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 112/356 (31%), Positives = 170/356 (47%), Gaps = 40/356 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP++    I+DTGS LI+               IFDP KSSSF K+ C    C
Sbjct: 98  LMNLAIGTPAETYSAIMDTGSDLIWTQCKPCKVCFDQPTPIFDPEKSSSFSKLPCSSDLC 157

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                   ++ C Y   Y D S T+G  A ET +      G A      FGC  DN G  
Sbjct: 158 VALPISSCSDGCEYRYSYGDHSSTQGVLATETFTF-----GDASVSKIGFGCGEDNRGRA 212

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
                   AG++GL R  +S ISQLG     +FSYCL   + + +  S+ L  G++   +
Sbjct: 213 YSQG----AGLVGLGRGPLSLISQLG---VPKFSYCLT-SIDDSKGISTLL-VGSEATVK 263

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             S   T  I +P+  +FYYLSL+ IS+ +  +     TF I   G GG IIDSG+ +TY
Sbjct: 264 --SAIPTPLIQNPSRPSFYYLSLEGISVGDTLLPIEKSTFSIQDDGSGGLIIDSGTTITY 321

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLR 280
                +  L ++F+S   + +L   +     ++LC+ LP   +    P + F+FE  +L+
Sbjct: 322 LKDSAFAALKKEFIS---QMKLDVDASGSTELELCFTLPPDGSPVDVPQLVFHFEGVDLK 378

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  EN  I D       L +     + ++ G+ QQ++   ++DL  + +SF    C
Sbjct: 379 LPKENYIIEDSALRVICLTMGSSSGM-SIFGNFQQQNIVVLHDLEKETISFAPAQC 433


>gi|255566010|ref|XP_002523993.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536720|gb|EEF38361.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 439

 Score =  147 bits (370), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 175/362 (48%), Gaps = 38/362 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++  +GTP+  +L I DTGS LI+               +FDP+ SS+++ I+C    C
Sbjct: 93  LMKFSLGTPAFDILAIADTGSDLIWTQCKPCDQCYEQDAPLFDPKSSSTYRDISCSTKQC 152

Query: 47  TYFK----CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
              K    C    N+ C Y+  Y D+S T G  A +TI++        +   A+ GC ++
Sbjct: 153 DLLKEGASCSGEGNKTCHYSYSYGDRSFTSGNVAADTITLGSTSGRPVLLPKAIIGCGHN 212

Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           N G F E                 IS ISQLGS I  +FSYCLV PL +    SS L FG
Sbjct: 213 NGGSFTEKGSGIVGL-----GGGPISLISQLGSTIDGKFSYCLV-PLSSNATNSSKLNFG 266

Query: 159 TDMGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           ++        Q+T  I+  P+ FY+L+L+ +S+ +ER+ FP  +F  +   EG  IIDSG
Sbjct: 267 SNGIVSGGGVQSTPLISKDPDTFYFLTLEAVSVGSERIKFPGSSFGTS---EGNIIIDSG 323

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           + LT F  D + +L     +  +      + D    + LCY +     +FPS+  +F+ A
Sbjct: 324 TTLTLFPEDFFSELSS---AVQDAVAGTPVEDPSGILSLCYSIDADL-KFPSITAHFDGA 379

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +++++  N F +   +     A  P +   A+ G+  Q +    YDL    +SF   +C+
Sbjct: 380 DVKLNPLNTF-VQVSDTVLCFAFNPINS-GAIFGNLAQMNFLVGYDLEGKTVSFKPTDCT 437

Query: 338 DD 339
            D
Sbjct: 438 QD 439


>gi|125548488|gb|EAY94310.1| hypothetical protein OsI_16079 [Oryza sativa Indica Group]
          Length = 423

 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 112/367 (30%), Positives = 175/367 (47%), Gaps = 52/367 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 75  LMDVSIGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASC 134

Query: 47  TYF---KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
           +     KC +  +C YT  Y D S T+G  A ET ++      K+   G +FGC  +N+ 
Sbjct: 135 SDLPTSKCTSASKCGYTYTYGDSSSTQGVLATETFTL-----AKSKLPGVVFGCGDTNEG 189

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF + A      G++GL R  +S +SQLG     +FSYCL       +  +S L  G+ 
Sbjct: 190 DGFSQGA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSL 237

Query: 161 MG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            G         S Q T  I +P+  +FYY+SLK I++ + R++ P   F +   G GG I
Sbjct: 238 AGISEASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVI 297

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSM 270
           +DSG+ +TY     Y  L + F +   +  L         + LC+  P         P +
Sbjct: 298 VDSGTSITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRL 354

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            F+F+  A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD+  D L
Sbjct: 355 VFHFDGGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTL 413

Query: 330 SFVKENC 336
           SF    C
Sbjct: 414 SFAPVQC 420


>gi|125590542|gb|EAZ30892.1| hypothetical protein OsJ_14967 [Oryza sativa Japonica Group]
          Length = 516

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 172/362 (47%), Gaps = 52/362 (14%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP+     I+DTGS L++               +FDP  SS++  + C    C+    
Sbjct: 173 IGTPALAYSAIVDTGSDLVWTQCKPCVDCFKQSTPVFDPSSSSTYATVPCSSASCSDLPT 232

Query: 50  -KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNHGFDE 105
            KC +  +C YT  Y D S T+G  A ET ++      K+   G +FGC  +N+  GF +
Sbjct: 233 SKCTSASKCGYTYTYGDSSSTQGVLATETFTLA-----KSKLPGVVFGCGDTNEGDGFSQ 287

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--- 162
            A      G++GL R  +S +SQLG     +FSYCL       +  +S L  G+  G   
Sbjct: 288 GA------GLVGLGRGPLSLVSQLG---LDKFSYCLT---SLDDTNNSPLLLGSLAGISE 335

Query: 163 --YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 S Q T  I +P+  +FYY+SLK I++ + R++ P   F +   G GG I+DSG+
Sbjct: 336 ASAAASSVQTTPLIKNPSQPSFYYVSLKAITVGSTRISLPSSAFAVQDDGTGGVIVDSGT 395

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMAFYFE 275
            +TY     Y  L + F +   +  L         + LC+  P         P + F+F+
Sbjct: 396 SITYLEVQGYRALKKAFAA---QMALPAADGSGVGLDLCFRAPAKGVDQVEVPRLVFHFD 452

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD+  D LSF   
Sbjct: 453 GGADLDLPAENYMVLDGGSGALCLTVMGSRGL-SIIGNFQQQNFQFVYDVGHDTLSFAPV 511

Query: 335 NC 336
            C
Sbjct: 512 QC 513


>gi|356508918|ref|XP_003523200.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/358 (32%), Positives = 180/358 (50%), Gaps = 38/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP      +LDTGS LI+               IFDP+KSSSF K++C    C
Sbjct: 109 LMELAIGTPPVSYPAVLDTGSDLIWTQCKPCTQCYKQPTPIFDPKKSSSFSKVSCGSSLC 168

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           +       ++ C Y   Y D S+T+G  A ET +  GK + K   H   FGC  DN G  
Sbjct: 169 SAVPSSTCSDGCEYVYSYGDYSMTQGVLATETFT-FGKSKNKVSVHNIGFGCGEDNEG-- 225

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            D  + A +G++GL R  +S +SQL    + RFSYCL    P  +   S L  G+ +G  
Sbjct: 226 -DGFEQA-SGLVGLGRGPLSLVSQLK---EPRFSYCLT---PMDDTKESILLLGS-LGKV 276

Query: 165 RPSTQ--ATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           + + +   T  + +P   +FYYLSL+ IS+ + R++    TF++   G GG IIDSG+ +
Sbjct: 277 KDAKEVVTTPLLKNPLQPSFYYLSLEGISVGDTRLSIEKSTFEVGDDGNGGVIIDSGTTI 336

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDAN 278
           TY     +  L ++F+S   +  L + S     + LC+ LP   T    P + F+F+  +
Sbjct: 337 TYIEQKAFEALKKEFISQ-TKLPLDKTSST--GLDLCFSLPSGSTQVEIPKIVFHFKGGD 393

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  EN  I D       LA+     + ++ G+ QQ++    +DL  + +SFV  +C
Sbjct: 394 LELPAENYMIGDSNLGVACLAMGASSGM-SIFGNVQQQNILVNHDLEKETISFVPTSC 450


>gi|302753526|ref|XP_002960187.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
 gi|300171126|gb|EFJ37726.1| hypothetical protein SELMODRAFT_75184 [Selaginella moellendorffii]
          Length = 398

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 36/363 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  + +GTP+K   +I DTGS LI+               IFDP  SSS+  ++C    C
Sbjct: 41  VTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLC 100

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                K  +  C Y+  Y D S T+G  + ET+++      K       FGC + N G  
Sbjct: 101 DSLPRKSCSPNCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSF 160

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            DA     +G++GL R  +SF+SQLG +   +FSYCLV P  +    +S + FG +    
Sbjct: 161 NDA-----SGLVGLGRGNLSFVSQLGDLFGHKFSYCLV-PWRDAPSKTSPMFFGDESSSH 214

Query: 165 RPSTQA----TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               +     T  I++P   +FYY+ LKDISI    +  P  +FDI   G GG I DSG+
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGT 274

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET----FNRFPSMAFYF 274
            LT      Y  +     S   +    ++      + LCY +  +      + P+M F+F
Sbjct: 275 TLTLLPDAPYQIVLRALRS---KVSFPEIDGSSAGLDLCYDVSGSKASYKKKIPAMVFHF 331

Query: 275 EDANLRIDGENVFIIDYE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           E A+ ++  EN FI   +      LA+   +  + + G+  Q++ R +YD+    + +  
Sbjct: 332 EGADHQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAP 391

Query: 334 ENC 336
             C
Sbjct: 392 SQC 394


>gi|302768196|ref|XP_002967518.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
 gi|300165509|gb|EFJ32117.1| hypothetical protein SELMODRAFT_87804 [Selaginella moellendorffii]
          Length = 398

 Score =  145 bits (366), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 167/363 (46%), Gaps = 36/363 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  + +GTP+K   +I DTGS LI+               IFDP  SSS+  ++C    C
Sbjct: 41  VTTISLGTPAKVFSVIADTGSDLIWIQCKPCQACFNQKDPIFDPEGSSSYTTMSCGDTLC 100

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                K  +  C Y+  Y D S T+G  + ET+++      K       FGC + N G  
Sbjct: 101 DSLPRKSCSPDCDYSYGYGDGSGTRGTLSSETVTLTSTQGEKLAAKNIAFGCGHLNRGSF 160

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            DA     +G++GL R  +SF+SQLG +   +FSYCLV P  +    +S + FG +    
Sbjct: 161 NDA-----SGLVGLGRGNLSFVSQLGDLFGHKFSYCLV-PWRDAPSKTSPMFFGDESSSH 214

Query: 165 RPSTQA----TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               +     T  I++P   +FYY+ LKDISI    +  P  +FDI   G GG I DSG+
Sbjct: 215 SSGKKLHYAFTPMIHNPAMESFYYVKLKDISIAGRALRIPAGSFDIKPDGSGGMIFDSGT 274

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYF 274
            LT      Y  +     S   +    ++      + LCY +  +      + P+M F+F
Sbjct: 275 TLTLLPDAPYQIVLRALRS---KISFPKIDGSSAGLDLCYDVSGSKASYKMKIPAMVFHF 331

Query: 275 EDANLRIDGENVFIIDYE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           E A+ ++  EN FI   +      LA+   +  + + G+  Q++ R +YD+    + +  
Sbjct: 332 EGADYQLPVENYFIAANDAGTIVCLAMVSSNMDIGIYGNMMQQNFRVMYDIGSSKIGWAP 391

Query: 334 ENC 336
             C
Sbjct: 392 SQC 394


>gi|115476828|ref|NP_001062010.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|42407407|dbj|BAD09565.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623979|dbj|BAF23924.1| Os08g0469000 [Oryza sativa Japonica Group]
 gi|125603713|gb|EAZ43038.1| hypothetical protein OsJ_27627 [Oryza sativa Japonica Group]
          Length = 448

 Score =  144 bits (362), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 39/364 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP      ++DTGS LI+                F P +S++++ + C  P C
Sbjct: 93  LMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPLC 152

Query: 47  T---YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
               Y  C     CVY   Y D++ T G  A ET +       K +     FGC N N G
Sbjct: 153 AALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSG 212

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKF-G 158
              ++     +G++GL R  +S +SQLG     RFSYCL   + P P+      +    G
Sbjct: 213 QLANS-----SGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264

Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           T+        Q+T  + +    + Y++SLK IS+  +R+   P  F I   G GG  IDS
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
           G+ LT+   D Y  +  + VS      L   +D    ++ C+  P   +     P M  +
Sbjct: 325 GTSLTWLQQDAYDAVRRELVSVLR--PLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELH 382

Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F+  AN+ +  EN  +ID    F  LA+    D   +IG+ QQ++   +YD+   LLSFV
Sbjct: 383 FDGGANMTVPPENYMLIDGATGFLCLAMIRSGD-ATIIGNYQQQNMHILYDIANSLLSFV 441

Query: 333 KENC 336
              C
Sbjct: 442 PAPC 445


>gi|125561848|gb|EAZ07296.1| hypothetical protein OsI_29544 [Oryza sativa Indica Group]
          Length = 448

 Score =  143 bits (361), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 39/364 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP      ++DTGS LI+                F P +S++++ + C  P C
Sbjct: 93  LMDLAIGTPPLRYTAMVDTGSDLIWTQCAPCVLCADQPTPYFRPARSATYRLVPCRSPLC 152

Query: 47  T---YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
               Y  C     CVY   Y D++ T G  A ET +       K +     FGC N N G
Sbjct: 153 AALPYPACFQRSVCVYQYYYGDEASTAGVLASETFTFGAANSSKVMVSDVAFGCGNINSG 212

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKF-G 158
              ++     +G++GL R  +S +SQLG     RFSYCL   + P P+      +    G
Sbjct: 213 QLANS-----SGMVGLGRGPLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNG 264

Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           T+        Q+T  + +    + Y++SLK IS+  +R+   P  F I   G GG  IDS
Sbjct: 265 TNASSSGSPVQSTPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDS 324

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
           G+ LT+   D Y  +  + VS      L   +D    ++ C+  P   +     P M  +
Sbjct: 325 GTSLTWLQQDAYDAVRHELVSVLR--PLPPTNDTEIGLETCFPWPPPPSVAVTVPDMELH 382

Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F+  AN+ +  EN  +ID    F  LA+    D   +IG+ QQ++   +YD+   LLSFV
Sbjct: 383 FDGGANMTVPPENYMLIDGATGFLCLAMIRSGD-ATIIGNYQQQNMHILYDIANSLLSFV 441

Query: 333 KENC 336
              C
Sbjct: 442 PAPC 445


>gi|357167693|ref|XP_003581287.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 468

 Score =  142 bits (359), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 173/371 (46%), Gaps = 57/371 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 119 LMDMSIGTPALAYAAIVDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYSTLPCSSSLC 178

Query: 47  TYF---KCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
           +      C +  + C YT  Y D S T+G  A ET ++      K    G  FGC  +N+
Sbjct: 179 SDLPTSTCTSAAKDCGYTYTYGDASSTQGVLAAETFTL-----AKTKLPGVAFGCGDTNE 233

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI-------PLPNGEYTS 152
             GF + A      G++GL R  +S +SQLG     +FSYCL         PL  G    
Sbjct: 234 GDGFTQGA------GLVGLGRGPLSLVSQLG---LGKFSYCLTSLDDTSKSPLLLG---- 280

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
           S     TD      + Q T  I +P+  +FYY++LK +++ + R+  P   F +   G G
Sbjct: 281 SLAAISTDTA-SAAAIQTTPLIKNPSQPSFYYVTLKALTVGSTRIPLPGSAFAVQDDGTG 339

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRF 267
           G I+DSG+ +TY     Y  L + F +   + +L         + LC+  P +       
Sbjct: 340 GVIVDSGTSITYLELQGYRPLKKAFAA---QMKLPVADGSAVGLDLCFKAPASGVDDVEV 396

Query: 268 PSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
           P +  +F+  A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD++ 
Sbjct: 397 PKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMGSRGL-SIIGNFQQQNIQFVYDVDK 455

Query: 327 DLLSFVKENCS 337
           D LSF    C+
Sbjct: 456 DTLSFAPVQCA 466


>gi|326494754|dbj|BAJ94496.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514480|dbj|BAJ96227.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 449

 Score =  142 bits (357), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 171/366 (46%), Gaps = 50/366 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 103 LMDMSIGTPAVAYAAIIDTGSDLVWTQCKPCVECFNQSTPVFDPSSSSTYAALPCSSTLC 162

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNH 101
           +     KC + +C YT  Y D S T+G  A ET ++      K       FGC  +N+  
Sbjct: 163 SDLPSSKCTSAKCGYTYTYGDSSSTQGVLAAETFTL-----AKTKLPDVAFGCGDTNEGD 217

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-- 159
           GF + A      G++GL R  +S +SQLG     +FSYCL       + + S L  G+  
Sbjct: 218 GFTQGA------GLVGLGRGPLSLVSQLG---LNKFSYCLT---SLDDTSKSPLLLGSLA 265

Query: 160 ---DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
              +      S Q T  I +P+  +FYY++LK +++ +  +  P   F +   G GG I+
Sbjct: 266 TISESAAAASSVQTTPLIRNPSQPSFYYVNLKGLTVGSTHITLPSSAFAVQDDGTGGVIV 325

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF---NRFPSMA 271
           DSG+ +TY     Y  L + F +   + +L         +  C+  P +       P + 
Sbjct: 326 DSGTSITYLELQGYRALKKAFAA---QMKLPAADGSGIGLDTCFEAPASGVDQVEVPKLV 382

Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           F+ + A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FVYD+  + LSF
Sbjct: 383 FHLDGADLDLPAENYMVLDSGSGALCLTVMGSRGL-SIIGNFQQQNIQFVYDVGENTLSF 441

Query: 332 VKENCS 337
               C+
Sbjct: 442 APVQCA 447


>gi|223946005|gb|ACN27086.1| unknown [Zea mays]
          Length = 336

 Score =  140 bits (353), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 102/345 (29%), Positives = 154/345 (44%), Gaps = 37/345 (10%)

Query: 17  LDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK---CVNEQCVYT 59
           +DTGS LI+                FD +KS++++ + C    C       C  + CVY 
Sbjct: 1   MDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRCASLSSPSCFKKMCVYQ 60

Query: 60  MKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLS 119
             Y D + T G  A+ET +       K       FGC + N G   ++     +G++G  
Sbjct: 61  YYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGDLANS-----SGMVGFG 115

Query: 120 RVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTDMGYRRPSTQATKFINH 176
           R  +S +SQLG     RFSYCL   L   P+  Y   Y    +         Q+T F+ +
Sbjct: 116 RGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSSTNTSSGSPVQSTPFVIN 172

Query: 177 PN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
           P   N Y+LSLK IS+  + +   P  F I   G GG IIDSG+ +T+   D Y  +   
Sbjct: 173 PALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGTSITWLQQDAYEAVRRG 232

Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYFEDANLRIDGENVFIIDY 291
            VS      L  ++D    +  C+  P   N     P + F+F+ AN+ +  EN  +I  
Sbjct: 233 LVS---AIPLPAMNDTDIGLDTCFQWPPPPNVTVTVPDLVFHFDSANMTLLPENYMLIAS 289

Query: 292 ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              +  L +AP   +  +IG+ QQ++   +YD+    LSFV   C
Sbjct: 290 TTGYLCLVMAP-TGVGTIIGNYQQQNLHLLYDIGNSFLSFVPAPC 333


>gi|296090291|emb|CBI40110.3| unnamed protein product [Vitis vinifera]
          Length = 408

 Score =  139 bits (349), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V   +G P    L+ +DTGS L++               IFDP KSS++  ++ D P C
Sbjct: 60  LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119

Query: 47  -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  +  +N QC+Y   YAD S + G  A E I      +G       +FGC + N 
Sbjct: 120 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      DG  +G+LGLS    S +S+LGS    RFSYC +  L +  YT + L  G  +
Sbjct: 179 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 229

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ST    F    N FYY++L+ IS+   R++  P+ F  T SG+GG ++DSG+  T
Sbjct: 230 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 285

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
           +   D +  L  + +    R    Q+     P  LCY   + E    FP +AF+F E A+
Sbjct: 286 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 344

Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +D  ++F+   ++ F L  +  +  ++ ++IG   Q+     YDL    + F + +C
Sbjct: 345 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|147786881|emb|CAN62311.1| hypothetical protein VITISV_008929 [Vitis vinifera]
          Length = 408

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V   +G P    L+ +DTGS L++               IFDP KSS++  ++ D P C
Sbjct: 60  LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 119

Query: 47  -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  +  +N QC+Y   YAD S + G  A E I      +G       +FGC + N 
Sbjct: 120 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 178

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      DG  +G+LGLS    S +S+LGS    RFSYC +  L +  YT + L  G  +
Sbjct: 179 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 229

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ST    F    N FYY++L+ IS+   R++  P+ F  T SG+GG ++DSG+  T
Sbjct: 230 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 285

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
           +   D +  L  + +    R    Q+     P  LCY   + E    FP +AF+F E A+
Sbjct: 286 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 344

Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +D  ++F+   ++ F L  +  +  ++ ++IG   Q+     YDL    + F + +C
Sbjct: 345 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 403


>gi|357492389|ref|XP_003616483.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517818|gb|AES99441.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 30/351 (8%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTP   +  I DTGS +++               IF+P KSSS++ I C    C   + 
Sbjct: 93  VGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCSSKLCHSVRD 152

Query: 51  --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C ++  C Y + Y D S ++G  + +T+S+         F   + GC  DN G     
Sbjct: 153 TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKIVIGCGTDNAG----T 208

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRP 166
             GA +G++GL    +S I+QLGS I  +FSYCLV PL N E   SS L FG        
Sbjct: 209 FGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLV-PLLNKESNASSILSFGDAAVVSGD 267

Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
              +T  I     FY+L+L+  S+ N+R+ F   +       EG  IIDSG+ LT   SD
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG--GDDEGNIIIDSGTTLTLIPSD 325

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENV 286
           VY  L    V   +  +L ++ D  +   LCY L      FP +  +F+ A++ +   + 
Sbjct: 326 VYTNLESAVV---DLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITVHFKGADVELHSIST 382

Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           F +   +     A  P   L ++ G+  Q++    YDL    +SF   +C+
Sbjct: 383 F-VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432


>gi|225446261|ref|XP_002265547.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 440

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 171/359 (47%), Gaps = 38/359 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V   +G P    L+ +DTGS L++               IFDP KSS++  ++ D P C
Sbjct: 92  LVNFSVGRPPVPQLVGIDTGSDLLWVQCRPCADCFRQSTPIFDPSKSSTYVDLSYDSPIC 151

Query: 47  -----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  +  +N QC+Y   YAD S + G  A E I      +G       +FGC + N 
Sbjct: 152 PNSPQKKYNHLN-QCIYNASYADGSTSSGNLATEDIVFETSDQGTVTVSSVVFGCGHSNR 210

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      DG  +G+LGLS    S +S+LGS    RFSYC +  L +  YT + L  G  +
Sbjct: 211 G----RFDGQQSGILGLSAGDQSIVSRLGS----RFSYC-IGDLFDPHYTHNQLVLGDGV 261

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ST    F    N FYY++L+ IS+   R++  P+ F  T SG+GG ++DSG+  T
Sbjct: 262 KMEGSSTPFHTF----NGFYYVTLEGISVGETRLDINPEVFQRTESGQGGVVMDSGTTAT 317

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDAN 278
           +   D +  L  + +    R    Q+     P  LCY   + E    FP +AF+F E A+
Sbjct: 318 FLAKDGFDPLSNE-IQRLVRGHFQQVIYRTIPGWLCYKGRVNEDLRGFPELAFHFAEGAD 376

Query: 279 LRIDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +D  ++F+   ++ F L  +  +  ++ ++IG   Q+     YDL    + F + +C
Sbjct: 377 LVLDANSLFVQKNQDVFCLAVLESNLKNIGSVIGIMAQQHYNVAYDLIGKRVYFQRTDC 435


>gi|293335828|ref|NP_001170221.1| uncharacterized protein LOC100384173 precursor [Zea mays]
 gi|224034427|gb|ACN36289.1| unknown [Zea mays]
          Length = 443

 Score =  138 bits (348), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 38/362 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP++    ILDTGS LI+                FDP +S++++ + C  P C
Sbjct: 91  LMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPAC 150

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               Y  C  + CVY   Y D + T G  A+ET +  G  E +    G  FGC N N G 
Sbjct: 151 NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFT-FGTNETRVSLPGISFGCGNLNAGL 209

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
             +      +G++G  R ++S +SQLGS    RFSYCL   + P+P+  Y   Y    + 
Sbjct: 210 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATLNST 261

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
                P  Q+T F+ +P     Y+L++  IS+    +   P  F I    G GG IIDSG
Sbjct: 262 NASSEP-VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSG 320

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
           + +TY     Y  +   F S      L  ++D    +  C+  P    +    P +  +F
Sbjct: 321 TTITYLAEPAYDAVRAAFASQIT-LPLLNVTDA-SVLDTCFQWPPPPRQSVTLPQLVLHF 378

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + A+  +  +N  ++D      L          ++IGS Q ++   +YDL   L+SFV  
Sbjct: 379 DGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPA 438

Query: 335 NC 336
            C
Sbjct: 439 PC 440


>gi|356528671|ref|XP_003532923.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 440

 Score =  138 bits (347), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 107/369 (28%), Positives = 168/369 (45%), Gaps = 51/369 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++R +IGTP      I DTGS LI+               +FDPRKSS+F+ + CD   C
Sbjct: 93  LMRFYIGTPPVERFAIADTGSDLIWVQCAPCEKCVPQNAPLFDPRKSSTFKTVPCDSQPC 152

Query: 47  TYF-----KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS-N 98
           T        CV +  QC Y   Y D ++  G    E+I+  G       F    FGC+ +
Sbjct: 153 TLLPPSQRACVGKSGQCYYQYIYGDHTLVSGILGFESIN-FGSKNNAIKFPKLTFGCTFS 211

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N   DE  R+    G++GL    +S ISQLG  I ++FSYC     P    ++S ++FG
Sbjct: 212 NNDTVDESKRN---MGLVGLGVGPLSLISQLGYQIGRKFSYCFP---PLSSNSTSKMRFG 265

Query: 159 TDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
            D   +    Q    ++ P        ++YYL+L+ +SI N+++       D      G 
Sbjct: 266 NDAIVK----QIKGVVSTPLIIKSIGPSYYYLNLEGVSIGNKKVKTSESQTD------GN 315

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
            +IDSG+  T      Y     KFV+   E + +  +   P     C+       RFP +
Sbjct: 316 ILIDSGTSFTILKQSFY----NKFVALVKEVYGVEAVKIPPLVYNFCFENKGKRKRFPDV 371

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            F F  A +R+D  N+F  +  N   ++A+   D+  ++ G+  Q   +  YDL   ++S
Sbjct: 372 VFLFTGAKVRVDASNLFEAEDNNLLCMVALPTSDEDDSIFGNHAQIGYQVEYDLQGGMVS 431

Query: 331 FVKENCSDD 339
           F   +C+ D
Sbjct: 432 FAPADCAKD 440


>gi|116790042|gb|ABK25480.1| unknown [Picea sitchensis]
          Length = 460

 Score =  138 bits (347), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 106/358 (29%), Positives = 174/358 (48%), Gaps = 38/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++++ IGTPS     ILDTGS L +               I+DP +SS++ K+ C    C
Sbjct: 116 LMKMAIGTPSLSFSAILDTGSDLTWTQCKPCTDCYPQPTPIYDPSQSSTYSKVPCSSSMC 175

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                + C    C Y   Y DQS T+G  ++E+ ++  +    ++ H A FGC  +N   
Sbjct: 176 QALPMYSCSGANCEYLYSYGDQSSTQGILSYESFTLTSQ----SLPHIA-FGCGQEN--- 227

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            E        G++G  R  +S ISQLG  +  +FSYCLV  + +    +S L  G     
Sbjct: 228 -EGGGFSQGGGLVGFGRGPLSLISQLGQSLGNKFSYCLV-SITDSPSKTSPLFIGKTASL 285

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
              +  +T  +   +   FYYLSL+ IS+  + ++    TFD+ + G GG IIDSG+ +T
Sbjct: 286 NAKTVSSTPLVQSRSRPTFYYLSLEGISVGGQLLDIADGTFDLQLDGTGGVIIDSGTTVT 345

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE---TFNRFPSMAFYFEDAN 278
           Y     Y  + +  +S      L Q+      + LC F P+   + + FP++ F+FE A+
Sbjct: 346 YLEQSGYDVVKKAVIS---SINLPQVDGSNIGLDLC-FEPQSGSSTSHFPTITFHFEGAD 401

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             +  EN    D  +    LA+ P + + ++ G+ QQ++ + +YD   ++LSF    C
Sbjct: 402 FNLPKENYIYTD-SSGIACLAMLPSNGM-SIFGNIQQQNYQILYDNERNVLSFAPTVC 457


>gi|224130878|ref|XP_002320947.1| predicted protein [Populus trichocarpa]
 gi|222861720|gb|EEE99262.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 100/356 (28%), Positives = 168/356 (47%), Gaps = 32/356 (8%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L +GTP   ++ I DTGS LI+               +FDP+ S +++  +CD   C
Sbjct: 96  LMSLSLGTPPFKIMGIADTGSDLIWTQCKPCERCYKQVDPLFDPKSSKTYRDFSCDARQC 155

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           +      C    C Y   Y D+S T G  A +TI++         F   + GC ++N G 
Sbjct: 156 SLLDQSTCSGNICQYQYSYGDRSYTMGNVASDTITLDSTTGSPVSFPKTVIGCGHENDGT 215

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
             D      +G++GL    +S ISQ+GS +  +FSYCLV PL +    SS L FG++   
Sbjct: 216 FSDKG----SGIVGLGAGPLSLISQMGSSVGGKFSYCLV-PLSSRAGNSSKLNFGSNAVV 270

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             P  Q+T  ++    ++FY+L+L+ +S+ NER+ F   +     +GEG  IIDSG+ LT
Sbjct: 271 SGPGVQSTPLLSSETMSSFYFLTLEAMSVGNERIKFGDSSLG---TGEGNIIIDSGTTLT 327

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
               D +  L     +  E     +  D    + +CY       + P++  +F  A++++
Sbjct: 328 IVPDDFFSNLSTAVGNQVEG---RRAEDPSGFLSVCYSATSDL-KVPAITAHFTGADVKL 383

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
              N F +   +    LA A     +++ G+  Q +    Y++    LSF   +C+
Sbjct: 384 KPINTF-VQVSDDVVCLAFASTTSGISIYGNVAQMNFLVEYNIQGKSLSFKPTDCT 438


>gi|115479485|ref|NP_001063336.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|51535935|dbj|BAD38017.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631569|dbj|BAF25250.1| Os09g0452400 [Oryza sativa Japonica Group]
 gi|215693279|dbj|BAG88661.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  137 bits (345), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 40/358 (11%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IG+P +    ++DTGS LI+                F+P KS+S+  + C    C     
Sbjct: 94  IGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYS 153

Query: 50  -KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
             C    CVY   Y D + + G  A+ET +  G    +       FGC N N G   +  
Sbjct: 154 PLCFQNACVYQAFYGDSASSAGVLANETFT-FGTNSTRVAVPRVSFGCGNMNAGTLFNG- 211

Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTDMGYRR 165
               +G++G  R  +S +SQLGS    RFSYCL   + P  +  Y  +Y    +      
Sbjct: 212 ----SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 264

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSVLTY 222
              Q+T FI +P     Y+L++  IS+  + +   P  F I    G GG IIDSG+ +T+
Sbjct: 265 GPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTF 324

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNR---FPSMAFYFEDAN 278
                Y  +   FV++     L + +  P +    C+  P    R    P M  +F+ A+
Sbjct: 325 LAQPAYAMVQGAFVAWV---GLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGAD 381

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  EN  ++D       LA+ P DD  ++IGS Q ++   +YDL   LLSFV   C
Sbjct: 382 MELPLENYMVMDGGTGNLCLAMLPSDD-GSIIGSFQHQNFHMLYDLENSLLSFVPAPC 438


>gi|125563957|gb|EAZ09337.1| hypothetical protein OsI_31609 [Oryza sativa Indica Group]
 gi|125605916|gb|EAZ44952.1| hypothetical protein OsJ_29595 [Oryza sativa Japonica Group]
          Length = 438

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 105/358 (29%), Positives = 160/358 (44%), Gaps = 40/358 (11%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IG+P +    ++DTGS LI+                F+P KS+S+  + C    C     
Sbjct: 91  IGSPPRYFSAMIDTGSDLIWTQCAPCLLCVEQPTPYFEPAKSTSYASLPCSSAMCNALYS 150

Query: 50  -KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
             C    CVY   Y D + + G  A+ET +  G    +       FGC N N G   +  
Sbjct: 151 PLCFQNACVYQAFYGDSASSAGVLANETFT-FGTNSTRVAVPRVSFGCGNMNAGTLFNG- 208

Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTDMGYRR 165
               +G++G  R  +S +SQLGS    RFSYCL   + P  +  Y  +Y    +      
Sbjct: 209 ----SGMVGFGRGALSLVSQLGS---PRFSYCLTSFMSPATSRLYFGAYATLNSTNTSSS 261

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSVLTY 222
              Q+T FI +P     Y+L++  IS+  + +   P  F I    G GG IIDSG+ +T+
Sbjct: 262 GPVQSTPFIVNPALPTMYFLNMTGISVAGDLLPIDPSVFAINETDGTGGVIIDSGTTVTF 321

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNR---FPSMAFYFEDAN 278
                Y  +   FV++     L + +  P +    C+  P    R    P M  +F+ A+
Sbjct: 322 LAQPAYAMVQGAFVAWV---GLPRANATPSDTFDTCFKWPPPPRRMVTLPEMVLHFDGAD 378

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  EN  ++D       LA+ P DD  ++IGS Q ++   +YDL   LLSFV   C
Sbjct: 379 MELPLENYMVMDGGTGNLCLAMLPSDD-GSIIGSFQHQNFHMLYDLENSLLSFVPAPC 435


>gi|414589630|tpg|DAA40201.1| TPA: hypothetical protein ZEAMMB73_629620 [Zea mays]
          Length = 443

 Score =  137 bits (345), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 164/362 (45%), Gaps = 38/362 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP++    ILDTGS LI+                FDP +S++++ + C  P C
Sbjct: 91  LMEMGIGTPTRYYSAILDTGSDLIWTQCAPCLLCVDQPTPYFDPARSATYRSLGCASPAC 150

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               Y  C  + CVY   Y D + T G  A+ET +  G  E +    G  FGC N N G 
Sbjct: 151 NALYYPLCYQKVCVYQYFYGDSASTAGVLANETFT-FGTNETRVSLPGISFGCGNLNAGS 209

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL---VIPLPNGEYTSSYLKFGTD 160
             +      +G++G  R ++S +SQLGS    RFSYCL   + P+P+  Y   Y    + 
Sbjct: 210 LANG-----SGMVGFGRGSLSLVSQLGS---PRFSYCLTSFLSPVPSRLYFGVYATLNST 261

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSG 217
                P  Q+T F+ +P     Y+L++  IS+    +   P  F I    G GG IIDSG
Sbjct: 262 NASSEP-VQSTPFVVNPALPTMYFLNMTGISVGGYLLPIDPAVFAINDTDGTGGTIIDSG 320

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---FPSMAFYF 274
           + +TY     Y  +   F S      L  ++D    +  C+  P    +    P +  +F
Sbjct: 321 TTITYLAEPAYDAVRAAFASQIT-LPLLNVTDA-SVLDTCFQWPPPPRQSVTLPQLVLHF 378

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + A+  +  +N  ++D      L          ++IGS Q ++   +YDL   L+SFV  
Sbjct: 379 DGADWELPLQNYMLVDPSTGGGLCLAMASSSDGSIIGSYQHQNFNVLYDLENSLMSFVPA 438

Query: 335 NC 336
            C
Sbjct: 439 PC 440


>gi|357492401|ref|XP_003616489.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517824|gb|AES99447.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  137 bits (344), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 107/351 (30%), Positives = 163/351 (46%), Gaps = 30/351 (8%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTP   +  I DTGS +++               IF+P KSSS++ I C    C   + 
Sbjct: 93  VGTPPTKIYGIADTGSDIVWLQCEPCEQCYNQTTPIFNPSKSSSYKNIPCLSKLCHSVRD 152

Query: 51  --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C ++  C Y + Y D S ++G  + +T+S+         F   + GC  DN G     
Sbjct: 153 TSCSDQNSCQYKISYGDSSHSQGDLSVDTLSLESTSGSPVSFPKTVIGCGTDNAG----T 208

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRP 166
             GA +G++GL    +S I+QLGS I  +FSYCLV PL N E   SS L FG        
Sbjct: 209 FGGASSGIVGLGGGPVSLITQLGSSIGGKFSYCLV-PLLNKESNASSILSFGDAAVVSGD 267

Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
              +T  I     FY+L+L+  S+ N+R+ F   +       EG  IIDSG+ LT   SD
Sbjct: 268 GVVSTPLIKKDPVFYFLTLQAFSVGNKRVEFGGSSEG--GDDEGNIIIDSGTTLTLIPSD 325

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENV 286
           VY  L    V   +  +L ++ D  +   LCY L      FP +  +F+ A++ +   + 
Sbjct: 326 VYTNLESAVV---DLVKLDRVDDPNQQFSLCYSLKSNEYDFPIITAHFKGADIELHSIST 382

Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           F +   +     A  P   L ++ G+  Q++    YDL    +SF   +C+
Sbjct: 383 F-VPITDGIVCFAFQPSPQLGSIFGNLAQQNLLVGYDLQQKTVSFKPTDCT 432


>gi|225455876|ref|XP_002275164.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 496

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 111/362 (30%), Positives = 159/362 (43%), Gaps = 54/362 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +R+ IG PSK   +++DTGS + +               IFDP  SSSF ++ C  P C 
Sbjct: 162 LRVGIGRPSKTFYMVIDTGSDVNWLQCKPCDDCYQQVDPIFDPASSSSFSRLGCQTPQCR 221

Query: 48  ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
               F C N+ C+Y + Y D S T G  A ET+S    G    +      GC +DN G  
Sbjct: 222 NLDVFACRNDSCLYQVSYGDGSYTVGDFATETVSFGNSGSVDKV----AIGCGHDNEGL- 276

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
                    G  GL  +    +S    I    FSYCLV         SS L+F +     
Sbjct: 277 -------FVGAAGLIGLGGGPLSLTSQIKASSFSYCLV---NRDSVDSSTLEFNS----A 322

Query: 165 RPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
           +PS   T  I   +  + FYY+ +  +S+  E++  PP  F++  SG+GG I+D G+ +T
Sbjct: 323 KPSDSVTAPIFKNSKVDTFYYVGITGMSVGGEKLAIPPSIFEVDGSGKGGIIVDCGTAVT 382

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCYFL-PETFNRFPSMAFYFE 275
              +  Y  L + FV   +        D P          CY L   T  R P++AF F+
Sbjct: 383 RLQTQAYNALRDTFVKLTK--------DLPSTSGFALFDTCYNLSSRTSVRVPTVAFLFD 434

Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
              +L +   N  I       F LA AP    +++IG+ QQ+ TR  YDL    +SF   
Sbjct: 435 GGKSLPLPPSNYLIPVDSAGTFCLAFAPTTASLSIIGNVQQQGTRVTYDLANSQVSFSSR 494

Query: 335 NC 336
            C
Sbjct: 495 KC 496


>gi|449454652|ref|XP_004145068.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470630|ref|XP_004153019.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  136 bits (342), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 175/358 (48%), Gaps = 33/358 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   ++ + DTGS +I+               +F+P KS++++K++C  P C
Sbjct: 86  LMKLSVGTPPFPIIAVADTGSDIIWTQCEPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145

Query: 47  TYFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
           ++    N       C Y++ Y D S ++G  A +T++ +G   G+ + F     GC +DN
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAIGCGHDN 204

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    + D  ++G++GL     S I Q+GS +  +FSYCL  P+ N +  S+ L FG++
Sbjct: 205 AG----SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLT-PIGNDDGGSNKLNFGSN 259

Query: 161 MGYRRPSTQATK-FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                    +T  +I+    +FY L LK +S+   R N    T +  + G+   IIDSG+
Sbjct: 260 ANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGT 317

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            LT    D+Y   H    +      L +  D  + ++ C+       + P +A +FE AN
Sbjct: 318 TLTLLPVDLY---HNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGAN 374

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           LR+  ENV I   +N   L      D+ +++ G+  Q +    YD+    LSF   NC
Sbjct: 375 LRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|449499012|ref|XP_004160696.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 435

 Score =  135 bits (341), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/358 (28%), Positives = 175/358 (48%), Gaps = 33/358 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   ++ + DTGS +I+               +F+P KS++++K++C  P C
Sbjct: 86  LMKLSVGTPPFPIIAVADTGSDIIWTQCVPCTNCYQQDLPMFNPSKSTTYRKVSCSSPVC 145

Query: 47  TYFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
           ++    N       C Y++ Y D S ++G  A +T++ +G   G+ + F     GC +DN
Sbjct: 146 SFTGEDNSCSFKPDCTYSISYGDNSHSQGDFAVDTLT-MGSTSGRVVAFPRTAIGCGHDN 204

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    + D  ++G++GL     S I Q+GS +  +FSYCL  P+ N +  S+ L FG++
Sbjct: 205 AG----SFDANVSGIVGLGLGPASLIKQMGSAVGGKFSYCLT-PIGNDDGGSNKLNFGSN 259

Query: 161 MGYRRPSTQATK-FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                    +T  +I+    +FY L LK +S+   R N    T +  + G+   IIDSG+
Sbjct: 260 ANVSGSGAVSTPIYISDKFKSFYSLKLKAVSVG--RNNTFYSTANSILGGKANIIIDSGT 317

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            LT    D+Y   H    +      L +  D  + ++ C+       + P +A +FE AN
Sbjct: 318 TLTLLPVDLY---HNFAKAISNSINLQRTDDPNQFLEYCFETTTDDYKVPFIAMHFEGAN 374

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           LR+  ENV I   +N   L      D+ +++ G+  Q +    YD+    LSF   NC
Sbjct: 375 LRLQRENVLIRVSDNVICLAFAGAQDNDISIYGNIAQINFLVGYDVTNMSLSFKPMNC 432


>gi|226493786|ref|NP_001142400.1| uncharacterized protein LOC100274575 [Zea mays]
 gi|194708650|gb|ACF88409.1| unknown [Zea mays]
          Length = 392

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 160/377 (42%), Gaps = 59/377 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP      I DTGS LI+                +++P  S++F  + C+   
Sbjct: 33  LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 92

Query: 44  ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
                          P C         C Y + Y     T  F   ET +      G A 
Sbjct: 93  SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGHAR 143

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G  FGCS  + GF+  +     +G++GL R  +S +SQLG     +FSYCL  P  + 
Sbjct: 144 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 195

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
             TS+ L   +          +T F+  P     N FYYL+L  IS+    ++ PPD F 
Sbjct: 196 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 255

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
           +   G GG IIDSG+ +T   +  Y ++    VS          +D    + LC+ LP +
Sbjct: 256 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAD--TGLDLCFMLPSS 313

Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
            +     PSM  +F  A++ +  ++  + D    + L      D  V ++G+ QQ++   
Sbjct: 314 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 373

Query: 321 VYDLNIDLLSFVKENCS 337
           +YD+  + LSF    CS
Sbjct: 374 LYDIGQETLSFAPAKCS 390


>gi|125563959|gb|EAZ09339.1| hypothetical protein OsI_31611 [Oryza sativa Indica Group]
          Length = 453

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 167/368 (45%), Gaps = 51/368 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT-- 47
           L +GTP + +  +LDTGS LI+               +F PR SSS++ + C    C   
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161

Query: 48  -YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNHGFD 104
            +  CV  + C Y   Y D + T G+ A E  +     GE +++  G  FGC   N G  
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG--FGCGTMNVGSL 219

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMG 162
            +A     +G++G  R  +S +SQL SI  +RFSYCL    P      S L+FG+  D+G
Sbjct: 220 NNA-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYASSRKSTLQFGSLADVG 268

Query: 163 YRRPST---QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
               +T   Q T  +    N  FYY++   +++   R+  P   F +   G GG IIDSG
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---------FP 268
           + LT F + V  ++   F S   R   A  S   + +  C+  P               P
Sbjct: 329 TALTLFPAAVLAEVVRAFRSQL-RLPFANGSSPDDGV--CFAAPAVAAGGGRMARQVAVP 385

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            M F+F+ A+L +  EN  + D+      + +    D  A IG+  Q+D R VYDL  + 
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445

Query: 329 LSFVKENC 336
           LSF    C
Sbjct: 446 LSFAPVEC 453


>gi|414886964|tpg|DAA62978.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 452

 Score =  135 bits (340), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 103/377 (27%), Positives = 160/377 (42%), Gaps = 59/377 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP      I DTGS LI+                +++P  S++F  + C+   
Sbjct: 93  LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 152

Query: 44  ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
                          P C         C Y + Y     T  F   ET +      G A 
Sbjct: 153 SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGHAR 203

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G  FGCS  + GF+  +     +G++GL R  +S +SQLG     +FSYCL  P  + 
Sbjct: 204 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 255

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
             TS+ L   +          +T F+  P     N FYYL+L  IS+    ++ PPD F 
Sbjct: 256 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFS 315

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
           +   G GG IIDSG+ +T   +  Y ++    VS          +D    + LC+ LP +
Sbjct: 316 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTDGSAD--TGLDLCFMLPSS 373

Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
            +     PSM  +F  A++ +  ++  + D    + L      D  V ++G+ QQ++   
Sbjct: 374 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 433

Query: 321 VYDLNIDLLSFVKENCS 337
           +YD+  + LSF    CS
Sbjct: 434 LYDIGQETLSFAPAKCS 450


>gi|242044888|ref|XP_002460315.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
 gi|241923692|gb|EER96836.1| hypothetical protein SORBIDRAFT_02g026350 [Sorghum bicolor]
          Length = 456

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 165/368 (44%), Gaps = 46/368 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V  +LDTGS LI+               +F P +S+S++ + C    C
Sbjct: 103 VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPGESASYEPMRCAGQLC 162

Query: 47  T---YFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           +   +  C + + C Y   Y D ++T G  A E  +    G  + +     FGC + N G
Sbjct: 163 SDILHHGCEMPDTCTYRYNYGDGTMTMGVYATERFTFTSSGGDRLMTVPLGFGCGSMNVG 222

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +      +G++G  R  +S +SQL SI  +RFSYCL      G    S L FG+  G
Sbjct: 223 SLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT---SYGSGRKSTLLFGSLSG 271

Query: 163 Y----RRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
                     Q T  +    N  FYY+ L  +++   R+  P   F +   G GG I+DS
Sbjct: 272 GVYGDATGPVQTTPLLQSLQNPTFYYVHLAGLTVGARRLRIPESAFALRPDGSGGVIVDS 331

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--------FP 268
           G+ LT     V   L E   ++ ++ +L   +       +C+ +P  + R         P
Sbjct: 332 GTALTLLPGAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQVPVP 388

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            M F+F+DA+L +   N  + D+      L +A   D  + IG+  Q+D R +YDL  + 
Sbjct: 389 RMVFHFQDADLDLPRRNYVLDDHRKGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAET 448

Query: 329 LSFVKENC 336
           LSF    C
Sbjct: 449 LSFAPAQC 456


>gi|242041115|ref|XP_002467952.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
 gi|241921806|gb|EER94950.1| hypothetical protein SORBIDRAFT_01g037070 [Sorghum bicolor]
          Length = 774

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 111/367 (30%), Positives = 166/367 (45%), Gaps = 40/367 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V LILDTGS L++                 DP  SS+F  + C  P C
Sbjct: 416 LVHLAIGTPPQPVQLILDTGSDLVWTQCRPCPVCFSRALGPLDPSNSSTFDVLPCSSPVC 475

Query: 47  ---TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCS 97
              T+  C      N+ CVY   YAD S+T G    ET +     G G+A      FGC 
Sbjct: 476 DNLTWSSCGKHNWGNQTCVYVYAYADGSITTGHLDAETFTFAAADGTGQATVPDLAFGCG 535

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N+G           G+ G  R  +S  SQL       FS+C    +   E +S  L  
Sbjct: 536 LFNNGIFTSNE----TGIAGFGRGALSLPSQLKV---DNFSHCFTA-ITGSEPSSVLLGL 587

Query: 158 GTDM-GYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
             ++      + Q+T  + + ++   YYLSLK I++ + R+  P  TF +   G GG II
Sbjct: 588 PANLYSDADGAVQSTPLVQNFSSLRAYYLSLKGITVGSTRLPIPESTFALKQDGTGGTII 647

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFY 273
           DSG+ +T    D Y  +H+ F +          S     +   + +P       P +  +
Sbjct: 648 DSGTGMTTLPQDAYKLVHDAFTAQVRLPVDNATSSSLSRLCFSFSVPRRAKPDVPKLVLH 707

Query: 274 FEDANLRIDGENVFIIDYEN---HFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           FE A L +  EN ++ ++E+       LA+   DDL  +IG+ QQ++   +YDL  ++LS
Sbjct: 708 FEGATLDLPREN-YMFEFEDAGGSVTCLAINAGDDLT-IIGNYQQQNLHVLYDLVRNMLS 765

Query: 331 FVKENCS 337
           FV   C+
Sbjct: 766 FVPAQCN 772


>gi|343198386|gb|AEM05966.1| nodulin 41 [Phaseolus vulgaris]
          Length = 437

 Score =  135 bits (339), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 168/362 (46%), Gaps = 42/362 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++R +IGTP    L   DTGS LI+               +F P KSS+F    C    C
Sbjct: 91  LMRFYIGTPPVERLATADTGSDLIWVQCSPCASCFPQSTPLFQPLKSSTFMPTTCRSQPC 150

Query: 47  TYFKCVNE------QCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
           T      +      +C+YT KY DQ S ++G  + ET+    +G  + + F  + FGC  
Sbjct: 151 TLLLPEQKGCGKSGECIYTYKYGDQYSFSEGLLSTETLRFDSQGGVQTVAFPNSFFGCGL 210

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N+     +    L G++GL    +S +SQ+G  I  +FSYCL   LP G  ++S LKFG
Sbjct: 211 YNNITVFPSYK--LTGIMGLGAGPLSLVSQIGDQIGHKFSYCL---LPLGSTSTSKLKFG 265

Query: 159 TDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            +         +T  I  P    +Y+L+L+ +++  + +         T S +G  IIDS
Sbjct: 266 NESIITGEGVVSTPMIIKPWLPTYYFLNLEAVTVAQKTVP--------TGSTDGNVIIDS 317

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G++LTY     Y+       S  E   +  + D   P+  C+   + F  FP +AF F  
Sbjct: 318 GTLLTYLGESFYYNFA---ASLQESLAVELVQDVLSPLPFCFPYRDNF-VFPEIAFQFTG 373

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           A + +   N+F++  + +   L +AP     +++ GS  Q D +  YDL    +SF   +
Sbjct: 374 ARVSLKPANLFVMTEDRNTVCLMIAPSSVSGISIFGSFSQIDFQVEYDLEGKKVSFQPTD 433

Query: 336 CS 337
           CS
Sbjct: 434 CS 435


>gi|356569916|ref|XP_003553140.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 560

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 121/370 (32%), Positives = 163/370 (44%), Gaps = 44/370 (11%)

Query: 4   LFIGTPSKGVLLILDTGSAL-------IYAIF-------DPRKSSSFQKINCDHPDCTYF 49
           +F+GTP K   LILDTGS L        YA F       DP+ SSSF+ I C  P C   
Sbjct: 199 VFVGTPPKHFSLILDTGSDLNWIQCVPCYACFEQNGPYYDPKDSSSFKNITCHDPRCQLV 258

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKA---IFHGALFGC 96
                    K   + C Y   Y D S T G  A ET +V +   EGK    I    +FGC
Sbjct: 259 SSPDPPQPCKGETQSCPYFYWYGDSSNTTGDFALETFTVNLTTPEGKPELKIVENVMFGC 318

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF +QL S+    FSYCLV    N    SS L 
Sbjct: 319 GHWNRGLFHGAAGLLGL-----GRGPLSFATQLQSLYGHSFSYCLVDRNSNSS-VSSKLI 372

Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           FG D      P+   T F+    N    FYY+ +K I +  E +  P +T+ ++  G GG
Sbjct: 373 FGEDKELLSHPNLNFTSFVGGKENPVDTFYYVLIKSIMVGGEVLKIPEETWHLSAQGGGG 432

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
            IIDSG+ LTYF    Y  + E F+   + F L +      P++ CY +        P  
Sbjct: 433 TIIDSGTTLTYFAEPAYEIIKEAFMRKIKGFPLVETF---PPLKPCYNVSGVEKMELPEF 489

Query: 271 AFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           A  F D A      EN FI I+ E+   L  +      +++IG+ QQ++   +YDL    
Sbjct: 490 AILFADGAMWDFPVENYFIQIEPEDVVCLAILGTPRSALSIIGNYQQQNFHILYDLKKSR 549

Query: 329 LSFVKENCSD 338
           L +    C+D
Sbjct: 550 LGYAPMKCAD 559


>gi|225427550|ref|XP_002266461.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 439

 Score =  134 bits (338), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 168/359 (46%), Gaps = 34/359 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L+IGTP   V+ I+DTGS L +               +FDP+ SS+++  +C    C
Sbjct: 93  LMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFC 152

Query: 47  TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C  E+ C +   YAD S T G  A ET++V         F G  FGC + + 
Sbjct: 153 LALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSG 212

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G FD+ +     +G++GL    +S ISQL S I   FSYCL +P+      SS + FG  
Sbjct: 213 GIFDKSS-----SGIVGLGGGELSLISQLKSTINGLFSYCL-LPVSTDSSISSRINFGAS 266

Query: 161 MGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                  T +T  +   P+ FYYL+L+ IS+  +R+ +   +    V  EG  I+DSG+ 
Sbjct: 267 GRVSGYGTVSTPLVQKSPDTFYYLTLEGISVGKKRLPYKGYSKKTEVE-EGNIIVDSGTT 325

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
            T+   + Y KL +   S     +  ++ D      LCY      N  P +  +F+DAN+
Sbjct: 326 YTFLPQEFYSKLEK---SVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANV 381

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
            +   N F+   E+      VAP  D + ++G+  Q +    +DL    +SF   +C+ 
Sbjct: 382 ELQPLNTFMRMQED-LVCFTVAPTSD-IGVLGNLAQVNFLVGFDLRKKRVSFKAADCTQ 438


>gi|357119741|ref|XP_003561592.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 410

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 101/355 (28%), Positives = 161/355 (45%), Gaps = 37/355 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           V +  G  ++  +L LDTG++  +               +F P  S +FQ +  D P CT
Sbjct: 72  VSIGTGEGTRRKVLALDTGASTSWLMCEPCQPPLPQVGHLFSPAASPTFQGVRGDGPVCT 131

Query: 48  Y-FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI--FHGALFGCSNDNHGFD 104
             ++  ++ C +   +A      G+ + +T  +     G  +    G +FGC++   GF 
Sbjct: 132 VPYRHTDKGCSFRFPFA-----AGYLSRDTFHLRSGRSGTVMESVPGIMFGCAHSVTGFH 186

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            D   G L+GVL LS   +SF++ LG     RFSYCL  P P      S+L+FG D+   
Sbjct: 187 ND---GTLSGVLSLSHSPLSFLTLLGGRSSGRFSYCL--PKPTTHNPDSFLRFGADVPSL 241

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
            P    T  ++     Y+L++  IS+ N+R++     F    +  GGC I+    +T   
Sbjct: 242 PPHAHTTTLVHAGVPGYHLNIVGISLGNKRLHIDRHVF----AAGGGCSINPAVTITRIM 297

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFED-ANLRI 281
              Y  +    V++ +     ++   P    LC+   +   R   P M+F+FED A LR 
Sbjct: 298 ELAYLAVEHALVAHMKELGSGRVKGMPG-RSLCFDHMDRSVRVQLPGMSFHFEDGAELRF 356

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             E +F +      FL+    H   V  IG+ QQ DTRF +D+    L+FV E C
Sbjct: 357 AAEQLFDVRVMAACFLVVGRGHHQTV--IGAAQQVDTRFTFDIAAGRLAFVPETC 409


>gi|24430421|dbj|BAC22609.1| 41 kD chloroplast nucleoid DNA binding protein (CND41) [Nicotiana
           sylvestris]
          Length = 502

 Score =  134 bits (337), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 104/369 (28%), Positives = 159/369 (43%), Gaps = 53/369 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP K + LI DTGS L +                IFDP  S ++  I+C    
Sbjct: 155 IVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSASKTYSNISCTSTA 214

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C+  K        C +  CVY ++Y D S T GF A +T+++        +F G +FGC 
Sbjct: 215 CSGLKSATGNSPGCSSSNCVYGIQYGDSSFTVGFFAKDTLTL----TQNDVFDGFMFGCG 270

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G       G  AG++GL R  +S + Q      K FSYC    LP    ++ +L F
Sbjct: 271 QNNRGL-----FGKTAGLIGLGRDPLSIVQQTAQKFGKYFSYC----LPTSRGSNGHLTF 321

Query: 158 GTDMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G   G +            P        FY++ +  IS+  + ++  P  F        G
Sbjct: 322 GNGNGVKTSKAVKNGITFTPFASSQGATFYFIDVLGISVGGKALSISPMLFQ-----NAG 376

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSM 270
            IIDSG+V+T   S VY  L   F  +  ++  A        +  CY L   T    P +
Sbjct: 377 TIIDSGTVITRLPSTVYGSLKSTFKQFMSKYPTAPALSL---LDTCYDLSNYTSISIPKI 433

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +F F  +AN+ ++   + I +  +   L  A    DD + + G+ QQ+    VYD+    
Sbjct: 434 SFNFNGNANVDLEPNGILITNGASQVCLAFAGNGDDDTIGIFGNIQQQTLEVVYDVAGGQ 493

Query: 329 LSFVKENCS 337
           L F  + CS
Sbjct: 494 LGFGYKGCS 502


>gi|224060469|ref|XP_002300215.1| predicted protein [Populus trichocarpa]
 gi|222847473|gb|EEE85020.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  134 bits (337), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 115/360 (31%), Positives = 172/360 (47%), Gaps = 45/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP +    ILDTGS LI+               IFDP+KSSSF K++C    C
Sbjct: 98  LMKLAIGTPPETYSAILDTGSDLIWTQCKPCTQCFHQSTPIFDPKKSSSFSKLSCSSQLC 157

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-- 102
                   N  C Y   Y D S T+G  A ET++      GKA      FGC  DN G  
Sbjct: 158 EALPQSSCNNGCEYLYSYGDYSSTQGILASETLTF-----GKASVPNVAFGCGADNEGSG 212

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
           F + A      G++GL R  +S +SQL    + +FSYCL       +  +S L  G+   
Sbjct: 213 FSQGA------GLVGLGRGPLSLVSQLK---EPKFSYCLTT---VDDTKTSTLLMGSLAS 260

Query: 163 YRRPST--QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               S+  + T  I+ P   +FYYLSL+ IS+ + R+     TF +   G GG IIDSG+
Sbjct: 261 VNASSSAIKTTPLIHSPAHPSFYYLSLEGISVGDTRLPIKKSTFSLQDDGSGGLIIDSGT 320

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
            +TY     +  + ++F +   +  L   S     + +C+ LP   T    P + F+F+ 
Sbjct: 321 TITYLEESAFNLVAKEFTA---KINLPVDSSGSTGLDVCFTLPSGSTNIEVPKLVFHFDG 377

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L +  EN  I D       LA+     + ++ G+ QQ++   ++DL  + LSF+   C
Sbjct: 378 ADLELPAENYMIGDSSMGVACLAMGSSSGM-SIFGNVQQQNMLVLHDLEKETLSFLPTQC 436


>gi|125558631|gb|EAZ04167.1| hypothetical protein OsI_26309 [Oryza sativa Indica Group]
          Length = 441

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 168/369 (45%), Gaps = 52/369 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP   +  +LDTGS LI+                ++ P +S+++  ++C  P 
Sbjct: 93  LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152

Query: 46  CT-----YFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           C      + +C   +  C Y   Y D + T G  A ET ++   G   A+  G  FGC  
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N G  +++     +G++G+ R  +S +SQLG     RFSYC     P     +S L  G
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLG---VTRFSYCFT---PFNATAASPLFLG 257

Query: 159 TDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           +       + + T F+  P+       ++YYLSL+ I++ +  +   P  F +T  G+GG
Sbjct: 258 SSARLSS-AAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPS 269
            IIDSG+  T      +  L     S   R +L   S     + LC+    PE     P 
Sbjct: 317 VIIDSGTTFTALEESAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPR 372

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +  +F+ A++ +  E+  + D       L +     + +++GS QQ++T  +YDL   +L
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGIL 431

Query: 330 SFVKENCSD 338
           SF    C +
Sbjct: 432 SFEPAKCGE 440


>gi|115479489|ref|NP_001063338.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|51535938|dbj|BAD38020.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113631571|dbj|BAF25252.1| Os09g0452800 [Oryza sativa Japonica Group]
 gi|125605918|gb|EAZ44954.1| hypothetical protein OsJ_29597 [Oryza sativa Japonica Group]
 gi|215740967|dbj|BAG97462.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 453

 Score =  134 bits (336), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 113/368 (30%), Positives = 166/368 (45%), Gaps = 51/368 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT-- 47
           L +GTP + +  +LDTGS LI+               +F PR SSS++ + C    C   
Sbjct: 102 LAVGTPPQPITALLDTGSDLIWTQCDTCTACLRQPDPLFSPRMSSSYEPMRCAGQLCGDI 161

Query: 48  -YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNHGFD 104
            +  CV  + C Y   Y D + T G+ A E  +     GE +++  G  FGC   N G  
Sbjct: 162 LHHSCVRPDTCTYRYSYGDGTTTLGYYATERFTFASSSGETQSVPLG--FGCGTMNVGSL 219

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMG 162
            +A     +G++G  R  +S +SQL SI  +RFSYCL    P      S L+FG+  D+G
Sbjct: 220 NNA-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYASSRKSTLQFGSLADVG 268

Query: 163 YRRPST---QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
               +T   Q T  +    N  FYY++   +++   R+  P   F +   G GG IIDSG
Sbjct: 269 LYDDATGPVQTTPILQSAQNPTFYYVAFTGVTVGARRLRIPASAFALRPDGSGGVIIDSG 328

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---------FP 268
           + LT F   V  ++   F S   R   A  S   + +  C+  P               P
Sbjct: 329 TALTLFPVAVLAEVVRAFRSQL-RLPFANGSSPDDGV--CFAAPAVAAGGGRMARQVAVP 385

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            M F+F+ A+L +  EN  + D+      + +    D  A IG+  Q+D R VYDL  + 
Sbjct: 386 RMVFHFQGADLDLPRENYVLEDHRRGHLCVLLGDSGDDGATIGNFVQQDMRVVYDLERET 445

Query: 329 LSFVKENC 336
           LSF    C
Sbjct: 446 LSFAPVEC 453


>gi|115472517|ref|NP_001059857.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|22831047|dbj|BAC15910.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508280|dbj|BAD32129.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113611393|dbj|BAF21771.1| Os07g0533600 [Oryza sativa Japonica Group]
 gi|215766673|dbj|BAG98901.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 441

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 100/369 (27%), Positives = 168/369 (45%), Gaps = 52/369 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP   +  +LDTGS LI+                ++ P +S+++  ++C  P 
Sbjct: 93  LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152

Query: 46  CT-----YFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           C      + +C   +  C Y   Y D + T G  A ET ++   G   A+  G  FGC  
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N G  +++     +G++G+ R  +S +SQLG     RFSYC     P     +S L  G
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLG---VTRFSYCFT---PFNATAASPLFLG 257

Query: 159 TDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           +       + + T F+  P+       ++YYLSL+ I++ +  +   P  F +T  G+GG
Sbjct: 258 SSARLSS-AAKTTPFVPSPSGGARRRSSYYYLSLEGITVGDTLLPIDPAVFRLTPMGDGG 316

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPS 269
            IIDSG+  T      +  L     S   R +L   S     + LC+    PE     P 
Sbjct: 317 VIIDSGTTFTALEERAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPR 372

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +  +F+ A++ +  E+  + D       L +     + +++GS QQ++T  +YDL   +L
Sbjct: 373 LVLHFDGADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGIL 431

Query: 330 SFVKENCSD 338
           SF    C +
Sbjct: 432 SFEPAKCGE 440


>gi|413944392|gb|AFW77041.1| hypothetical protein ZEAMMB73_800604 [Zea mays]
          Length = 476

 Score =  133 bits (335), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 158/364 (43%), Gaps = 51/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V +  GTP++   +I DTGS + +                IFDP KS+++  + C HP 
Sbjct: 136 VVTVGFGTPAQTYTVIFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSVVPCGHPQ 195

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C      KC N  C+Y ++Y D S + G  +HET+S+           G  FGC   N G
Sbjct: 196 CAAADGSKCSNGTCLYKVEYGDGSSSAGVLSHETLSLT----STRALPGFAFGCGQTNLG 251

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G + G++GL R  +S  SQ  +     FSYC    LP+   T  YL  G    
Sbjct: 252 -----DFGDVDGLIGLGRGQLSLSSQAAASFGGTFSYC----LPSDNTTHGYLTIGPTTP 302

Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                 Q T  +   +  +FY++ L  I I    +  PP  F      + G  +DSG++L
Sbjct: 303 ASNDDVQYTAMVQKQDYPSFYFVELVSIDIGGYILPVPPTLFT-----DDGTFLDSGTIL 357

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA 277
           TY   + Y  L ++F     +F + Q    P  +P   CY F  ++    P+++F F D 
Sbjct: 358 TYLPPEAYTALRDRF-----KFTMTQYKPAPAYDPFDTCYDFTGQSAIFIPAVSFKFSDG 412

Query: 278 ---NLRIDGENVFIIDYENHFFLLA--VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
              +L   G  +F  D       L     P      ++G+ QQR+T  +YD+  + + F 
Sbjct: 413 SVFDLSFFGILIFPDDTAPAIGCLGFVARPSAMPFTIVGNMQQRNTEVIYDVAAEKIGFA 472

Query: 333 KENC 336
             +C
Sbjct: 473 SASC 476


>gi|168059885|ref|XP_001781930.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666576|gb|EDQ53226.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 355

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 159/357 (44%), Gaps = 30/357 (8%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  + +GTP +   +I+DTGS L +              ++F P  S+SF K+ C    C
Sbjct: 4   LATVRLGTPERVFSVIVDTGSDLTWVQCSPCGTCYSQNDSLFIPNTSTSFTKLACGTELC 63

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               Y  C    CVY   Y D S++ G   ++TI++ G    K       FGC +DN G 
Sbjct: 64  NGLPYPMCNQTTCVYWYSYGDGSLSTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGS 123

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A      G+LGL +  +SF SQL ++   +FSYCLV  L     TS  L FG     
Sbjct: 124 FAGAD-----GILGLGQGPLSFPSQLKTVFNGKFSYCLVDWLAPPTQTSPLL-FGDAAVP 177

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             P  +    + +P    +YY+ L  IS+  + +N     FDI   G  G I DSG+ +T
Sbjct: 178 TFPGVKYISLLTNPKVPTYYYVKLNGISVGGKLLNISSTAFDIDSVGRAGTIFDSGTTVT 237

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFEDANL 279
               +V+ ++     +        + SD    + LC   F        PSM F+FE  ++
Sbjct: 238 QLAGEVHQEVLAAMNA--STMDYPRKSDDSSGLDLCLGGFAEGQLPTVPSMTFHFEGGDM 295

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +   N FI    +  +  ++    D V +IGS QQ++ +  YD     + FV ++C
Sbjct: 296 ELPPSNYFIFLESSQSYCFSMVSSPD-VTIIGSIQQQNFQVYYDTVGRKIGFVPKSC 351


>gi|168000296|ref|XP_001752852.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696015|gb|EDQ82356.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 384

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 161/364 (44%), Gaps = 47/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L +G+P +   +I+DTGS L +                FDP KS SF+K  C    C
Sbjct: 40  LMTLTLGSPPQSFDVIVDTGSDLNWVQCLPCRVCYQQPGPKFDPSKSRSFRKAACTDNLC 99

Query: 47  TYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                    C    C Y   Y DQS T G  A ETIS +  G G        FGC   N 
Sbjct: 100 NVSALPLKACAANVCQYQYTYGDQSNTNGDLAFETIS-LNNGAGTQSVPNFAFGCGTQNL 158

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    A     AG++GL +  +S  SQL      +FSYCLV        ++S L FG+  
Sbjct: 159 GTFAGA-----AGLVGLGQGPLSLNSQLSHTFANKFSYCLV---SLNSLSASPLTFGSIA 210

Query: 162 GYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSG 217
                + Q T  +    HP  +YY+ L  I +  + +N  P  F I  S G GG IIDSG
Sbjct: 211 A--AANIQYTSIVVNARHP-TYYYVQLNSIEVGGQPLNLAPSVFAIDQSTGRGGTIIDSG 267

Query: 218 SVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
           + +T      Y    + +E FV+Y       +L      + LC+ +    N   P M F 
Sbjct: 268 TTITMLTLPAYSAVLRAYESFVNY------PRLDGSAYGLDLCFNIAGVSNPSVPDMVFK 321

Query: 274 FEDANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F+ A+ ++ GEN+F+ +D       LA+       ++IG+ QQ++   VYDL    + F 
Sbjct: 322 FQGADFQMRGENLFVLVDTSATTLCLAMGGSQGF-SIIGNIQQQNHLVVYDLEAKKIGFA 380

Query: 333 KENC 336
             +C
Sbjct: 381 TADC 384


>gi|449454654|ref|XP_004145069.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449470632|ref|XP_004153020.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449499016|ref|XP_004160697.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 169/358 (47%), Gaps = 32/358 (8%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP   ++ + DTGS +I+               +FDP KS++++ + C  P C
Sbjct: 84  LVEISVGTPPFSIVAVADTGSDVIWTQCKPCSNCYQQNAPMFDPSKSTTYKNVACSSPVC 143

Query: 47  TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           +Y      C ++ +C+Y++ Y D S ++G  A +T+++         F   + GC +DN 
Sbjct: 144 SYSGDGSSCSDDSECLYSIAYGDDSHSQGNLAVDTVTMQSTSGRPVAFPRTVIGCGHDNA 203

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTD 160
           G      +  ++G++GL R   S ++QLG     +FSYCL IP+  G    S+ L FG++
Sbjct: 204 G----TFNANVSGIVGLGRGPASLVTQLGPATGGKFSYCL-IPIGTGSTNDSTKLNFGSN 258

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                  T +T   +      FY L L+ +S+ + + NFP     +   GE   IIDSG+
Sbjct: 259 ANVSGSGTVSTPIYSSAQYKTFYSLKLEAVSVGDTKFNFPEGASKL--GGESNIIIDSGT 316

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            LTY  S +   L+    +  +   L    D  E +  C+         P +  +FE A+
Sbjct: 317 TLTYLPSAL---LNSFGSAISQSMSLPHAQDPSEFLDYCFATTTDDYEMPPVTMHFEGAD 373

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  EN+F+   ++   L   +  DD + + G+  Q +    YD+    +SF   +C
Sbjct: 374 VPLQRENLFVRLSDDTICLAFGSFPDDNIFIYGNIAQSNFLVGYDIKNLAVSFQPAHC 431


>gi|15217764|ref|NP_176663.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|5042418|gb|AAD38257.1|AC006193_13 Hypothetical Protein [Arabidopsis thaliana]
 gi|332196174|gb|AEE34295.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 431

 Score =  132 bits (333), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 175/357 (49%), Gaps = 34/357 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP   +L I DTGS LI+               +FDP++SS+++K++C    C
Sbjct: 87  LMNISIGTPPVPILAIADTGSDLIWTQCNPCEDCYQQTSPLFDPKESSTYRKVSCSSSQC 146

Query: 47  TYFK---CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
              +   C  ++  C YT+ Y D S TKG  A +T+++   G         + GC ++N 
Sbjct: 147 RALEDASCSTDENTCSYTITYGDNSYTKGDVAVDTVTMGSSGRRPVSLRNMIIGCGHENT 206

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      D A +G++GL   + S +SQL   I  +FSYCLV P  +    +S + FGT+ 
Sbjct: 207 G----TFDPAGSGIIGLGGGSTSLVSQLRKSINGKFSYCLV-PFTSETGLTSKINFGTNG 261

Query: 162 GYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                   +T  +   P  +Y+L+L+ IS+ ++++ F    F    +GEG  +IDSG+ L
Sbjct: 262 IVSGDGVVSTSMVKKDPATYYFLNLEAISVGSKKIQFTSTIFG---TGEGNIVIDSGTTL 318

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T   S+ Y++L     S     +  ++ D    + LCY    +F + P +  +F+  +++
Sbjct: 319 TLLPSNFYYELESVVAS---TIKAERVQDPDGILSLCYRDSSSF-KVPDITVHFKGGDVK 374

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +   N F+   E+     A A ++ L  + G+  Q +    YD     +SF K +CS
Sbjct: 375 LGNLNTFVAVSED-VSCFAFAANEQLT-IFGNLAQMNFLVGYDTVSGTVSFKKTDCS 429


>gi|356499344|ref|XP_003518501.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 561

 Score =  132 bits (332), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 118/381 (30%), Positives = 166/381 (43%), Gaps = 67/381 (17%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +F+GTP K   LILDTGS L +                +DP+ SSSF+ I+C  P C   
Sbjct: 201 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 260

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
                    K  N+ C Y   Y D S T G  A ET +V      G  E K +    +FG
Sbjct: 261 SAPDPPKPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGTSELKHV-ENVMFG 319

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G    A            +  +SF SQ+ S+  + FSYCLV    N    SS L
Sbjct: 320 CGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYGQSFSYCLVDRNSNAS-VSSKL 373

Query: 156 KFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPPD 200
            FG D           + ++HPN                FYY+ +K + +D+E +  P +
Sbjct: 374 IFGED----------KELLSHPNLNFTSFGGGKDGSVDTFYYVQIKSVMVDDEVLKIPEE 423

Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
           T+ ++  G GG IIDSG+ LTYF    Y  + E FV   + +QL +    P P++ CY +
Sbjct: 424 TWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYQLVE--GLP-PLKPCYNV 480

Query: 261 PETFN-RFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRD 317
                   P     F D A      EN FI ID E     +   P   L ++IG+ QQ++
Sbjct: 481 SGIEKMELPDFGILFADEAVWNFPVENYFIWIDPEVVCLAILGNPRSAL-SIIGNYQQQN 539

Query: 318 TRFVYDLNIDLLSFVKENCSD 338
              +YD+    L +    C+D
Sbjct: 540 FHILYDMKKSRLGYAPMKCAD 560


>gi|255563737|ref|XP_002522870.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537954|gb|EEF39568.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 341

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 79/250 (31%), Positives = 131/250 (52%), Gaps = 5/250 (2%)

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGCS DN  F   +R G   G++GL+   +S + QL ++  +RFSYCL  P  +    +S
Sbjct: 91  FGCSKDNRNFSAFSRTGKTDGIMGLNMSPVSILQQLRNVTNQRFSYCLT-PYGSRPPATS 149

Query: 154 YLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            L+FG D+        +T F++ P+   Y+L+L D+S+  +R+  PP+TF +   G GG 
Sbjct: 150 LLRFGNDISTWGRGFYSTPFVDPPDMPNYFLNLLDLSVAGQRLRLPPETFALKRDGTGGT 209

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFER--FQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
           IIDSG+ LT      Y  L     ++F+   F    + D    ++  +    TF    S+
Sbjct: 210 IIDSGTGLTLVVQPAYRHLLGALQNHFDHHGFHRVHIPDTNLELRYNFAQNRTFQNHASL 269

Query: 271 AFYFEDANLRIDGENVFII-DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            ++F+ A+  ++    +++ + EN F +  +A H +  A+IG+  Q +TRFVY+     L
Sbjct: 270 TYHFQGADFTVEPRYAYVVYNDENAFCVALLASHIEGRAIIGALHQANTRFVYNAAKRRL 329

Query: 330 SFVKENCSDD 339
            F  EN  +D
Sbjct: 330 KFKAENFQND 339


>gi|108707835|gb|ABF95630.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 384

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP + V L LDTGS L++                +D  +SS+F   +CD   C
Sbjct: 36  LLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 95

Query: 47  ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
               +   CVN+    C Y+  Y D+S T GF   ET+S +      A   G +FGC  +
Sbjct: 96  KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 151

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
           N G           G+ G  R  +S  SQL       FS+C      +G   S+ L    
Sbjct: 152 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 202

Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            D+ Y+  R + Q T  I +P +  FYYLSLK I++ + R+  P   F +  +G GG II
Sbjct: 203 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 260

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
           DSG+  T     VY  +H++F ++ +   +      P    LC+  P        P +  
Sbjct: 261 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 317

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +FE A + +  EN      +     + +A  +  + +IG+ QQ++   +YDL    LSFV
Sbjct: 318 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 377

Query: 333 KENC 336
           +  C
Sbjct: 378 RAKC 381


>gi|168041176|ref|XP_001773068.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675615|gb|EDQ62108.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 365

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 163/360 (45%), Gaps = 36/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  + +GTP +   +I+DTGS L +              A+F P  S+SF K+ C    C
Sbjct: 14  LATVRLGTPERVFSVIVDTGSDLTWVQCSPCGKCYSQNDALFLPNTSTSFTKLACGSALC 73

Query: 47  T---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               +  C    CVY   Y D S+T G   ++TI++ G    K       FGC +DN G 
Sbjct: 74  NGLPFPMCNQTTCVYWYSYGDGSLTTGDFVYDTITMDGINGQKQQVPNFAFGCGHDNEGS 133

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A      G+LGL +  +SF SQL S+   +FSYCLV  L     TS  L FG     
Sbjct: 134 FAGAD-----GILGLGQGPLSFHSQLKSVYNGKFSYCLVDWLAPPTQTSPLL-FGDAAVP 187

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             P  +    + +P    +YY+ L  IS+ +  +N     FDI   G  G I DSG+ +T
Sbjct: 188 ILPDVKYLPILANPKVPTYYYVKLNGISVGDNLLNISSTVFDIDSVGGAGTIFDSGTTVT 247

Query: 222 YF----HSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
                 + +V   ++   ++Y  +   +++L  C     L  F  +     P+M F+FE 
Sbjct: 248 QLAEAAYKEVLAAMNASTMAYSRKIDDISRLDLC-----LSGFPKDQLPTVPAMTFHFEG 302

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++ +   N FI    +  +  A+    D V +IGS QQ++ +  YD     L FV ++C
Sbjct: 303 GDMVLPPSNYFIYLESSQSYCFAMTSSPD-VNIIGSVQQQNFQVYYDTAGRKLGFVPKDC 361


>gi|222624819|gb|EEE58951.1| hypothetical protein OsJ_10630 [Oryza sativa Japonica Group]
          Length = 440

 Score =  132 bits (331), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 108/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP + V L LDTGS L++                +D  +SS+F   +CD   C
Sbjct: 92  LLHLAIGTPPQPVQLTLDTGSVLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 151

Query: 47  ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
               +   CVN+    C Y+  Y D+S T GF   ET+S +      A   G +FGC  +
Sbjct: 152 KLDPSVTMCVNQTVQTCAYSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 207

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
           N G           G+ G  R  +S  SQL       FS+C      +G   S+ L    
Sbjct: 208 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 258

Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            D+ Y+  R + Q T  I +P +  FYYLSLK I++ + R+  P   F +  +G GG II
Sbjct: 259 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 316

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
           DSG+  T     VY  +H++F ++ +   +      P    LC+  P        P +  
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 373

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +FE A + +  EN      +     + +A  +  + +IG+ QQ++   +YDL    LSFV
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433

Query: 333 KENC 336
           +  C
Sbjct: 434 RAKC 437


>gi|356528627|ref|XP_003532901.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 430

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 175/365 (47%), Gaps = 53/365 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++R  +GTPS   L I DTGS L +               +FDP +SS++  + C+   C
Sbjct: 89  LMRFSLGTPSVERLAIFDTGSDLSWLQCTPCKTCYPQEAPLFDPTQSSTYVDVPCESQPC 148

Query: 47  TYF-----KC-VNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFHGALFGCS- 97
           T F     +C  ++QC+Y  +Y   S T G   ++TIS    G G+G A F  ++FGC+ 
Sbjct: 149 TLFPQNQRECGSSKQCIYLHQYGTDSFTIGRLGYDTISFSSTGMGQGGATFPKSVFGCAF 208

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N  F    +     G +GL    +S  SQLG  I  +FSYC+V   P    ++  LKF
Sbjct: 209 YSNFTFKISTKAN---GFVGLGPGPLSLASQLGDQIGHKFSYCMV---PFSSTSTGKLKF 262

Query: 158 GTDMGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
           G+      P+ +  +T F+ +P+  ++Y L+L+ I++  +++        +T    G  I
Sbjct: 263 GS----MAPTNEVVSTPFMINPSYPSYYVLNLEGITVGQKKV--------LTGQIGGNII 310

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           IDS  +LT+    +Y      F+S   E   +    D P P + C   P   N FP   F
Sbjct: 311 IDSVPILTHLEQGIY----TDFISSVKEAINVEVAEDAPTPFEYCVRNPTNLN-FPEFVF 365

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F  A++ +  +N+F I  +N+   + V P    +++ G+  Q + +  YDL    +SF 
Sbjct: 366 HFTGADVVLGPKNMF-IALDNNLVCMTVVPSKG-ISIFGNWAQVNFQVEYDLGEKKVSFA 423

Query: 333 KENCS 337
             NCS
Sbjct: 424 PTNCS 428


>gi|357450863|ref|XP_003595708.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484756|gb|AES65959.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 407

 Score =  131 bits (330), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 169/361 (46%), Gaps = 40/361 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP   +    DTGS L++               +FDPR SSS+  I C    C
Sbjct: 61  LMELSIGTPPIKIYAEADTGSDLVWFQCIPCTKCYKQQNPMFDPRSSSSYTNITCGTESC 120

Query: 47  TYFK---CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  C  +Q  C YT  YAD S+T+G  A ET+++         F G +FGC ++N 
Sbjct: 121 NKLDSSLCSTDQKTCNYTYSYADNSITQGVLAQETLTLTSTTGEPVAFQGIIFGCGHNNS 180

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII---KKRFSYCLVIPLPNGEYTSSYLKFG 158
           GF++        G++GL R  +S ISQ+GS +      FS CLV P       +S + FG
Sbjct: 181 GFNDRE-----MGLIGLGRGPLSLISQIGSSLGAGGNMFSQCLV-PFNTDPSITSQMNFG 234

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                    T +T  I+     Y+ +L  IS+++  + F   +   T++ +G  +IDSG+
Sbjct: 235 KGSEVLGNGTVSTPLISKDGTGYFATLLGISVEDINLPFSNGSSLGTIT-KGNILIDSGT 293

Query: 219 VLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
            +TY   + Y +L E+  +    E F++       +  +LCY  P   N  P++  +FE 
Sbjct: 294 TITYLPEEFYHRLIEQVRNKVALEPFRI-------DGYELCYQTPTNLNG-PTLTIHFEG 345

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++ +    +FI   +++F       +++ V   G+  Q +    +DL   ++SF   +C
Sbjct: 346 GDVLLTPAQMFIPVQDDNFCFAVFDTNEEYVTY-GNYAQSNYLIGFDLERQVVSFKATDC 404

Query: 337 S 337
           +
Sbjct: 405 T 405


>gi|226508080|ref|NP_001150678.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
 gi|195641018|gb|ACG39977.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 101/377 (26%), Positives = 159/377 (42%), Gaps = 59/377 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP      I DTGS LI+                +++P  S++F  + C+   
Sbjct: 91  LMALAIGTPPLPYQAIADTGSDLIWTQCAPCTSQCFRQPTPLYNPSSSTTFAVLPCNSSL 150

Query: 44  ---------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
                          P C         C Y + Y     T  F   ET +      G++ 
Sbjct: 151 SVCAAALAGTGTAPPPGC--------ACTYNVTYGS-GWTSVFQGSETFTFGSTPAGQSR 201

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G  FGCS  + GF+  +     +G++GL R  +S +SQLG     +FSYCL  P  + 
Sbjct: 202 VPGIAFGCSTASSGFNASSA----SGLVGLGRGRLSLVSQLG---VPKFSYCLT-PYQDT 253

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFD 203
             TS+ L   +          +T F+  P     N FYYL+L  IS+    ++ PPD F 
Sbjct: 254 NSTSTLLLGPSASLNGTAGVSSTPFVASPSTAPMNTFYYLNLTGISLGTTALSIPPDAFL 313

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
           +   G GG IIDSG+ +T   +  Y ++    VS                + LC+ LP +
Sbjct: 314 LNADGTGGLIIDSGTTITLLGNTAYQQVRAAVVSLVTLPTTD--GSAATGLDLCFMLPSS 371

Query: 264 FN---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
            +     PSM  +F  A++ +  ++  + D    + L      D  V ++G+ QQ++   
Sbjct: 372 TSAPPAMPSMTLHFNGADMVLPADSYMMSDDSGLWCLAMQNQTDGEVNILGNYQQQNMHI 431

Query: 321 VYDLNIDLLSFVKENCS 337
           +YD+  + LSF    CS
Sbjct: 432 LYDIGQETLSFAPAKCS 448


>gi|2541876|dbj|BAA22813.1| CND41, chloroplast nucleoid DNA binding protein [Nicotiana tabacum]
          Length = 502

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 157/369 (42%), Gaps = 53/369 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP K + LI DTGS L +                IFDP  S ++  I+C    
Sbjct: 155 IVNVGLGTPKKDLSLIFDTGSDLTWTQCQPCVKSCYAQQQPIFDPSTSKTYSNISCTSAA 214

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C+  K        C +  CVY ++Y D S T GF A + +++        +F G +FGC 
Sbjct: 215 CSSLKSATGNSPGCSSSNCVYGIQYGDSSFTIGFFAKDKLTLTQN----DVFDGFMFGCG 270

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G       G  AG++GL R  +S + Q      K FSYC    LP    ++ +L F
Sbjct: 271 QNNKGLF-----GKTAGLIGLGRDPLSIVQQTAQKFGKYFSYC----LPTSRGSNGHLTF 321

Query: 158 GTDMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G   G +            P        +Y++ +  IS+  + ++  P  F        G
Sbjct: 322 GNGNGVKASKAVKNGITFTPFASSQGTAYYFIDVLGISVGGKALSISPMLFQ-----NAG 376

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSM 270
            IIDSG+V+T   S  Y  L   F  +  ++  A        +  CY L   T    P +
Sbjct: 377 TIIDSGTVITRLPSTAYGSLKSAFKQFMSKYPTAPALSL---LDTCYDLSNYTSISIPKI 433

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +F F  +AN+ +D   + I +  +   L  A    DD + + G+ QQ+    VYD+    
Sbjct: 434 SFNFNGNANVELDPNGILITNGASQVCLAFAGNGDDDSIGIFGNIQQQTLEVVYDVAGGQ 493

Query: 329 LSFVKENCS 337
           L F  + CS
Sbjct: 494 LGFGYKGCS 502


>gi|224085379|ref|XP_002307559.1| predicted protein [Populus trichocarpa]
 gi|222857008|gb|EEE94555.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 121/371 (32%), Positives = 166/371 (44%), Gaps = 46/371 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +FIGTP K   LILDTGS L +                +DP++SSSF+ I C  P C   
Sbjct: 94  VFIGTPPKHYSLILDTGSDLNWIQCVPCHDCFEQNGPYYDPKESSSFRNIGCHDPRCHLV 153

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFG 95
                    K  N+ C Y   Y D S T G  A ET +V      GK E K +    +FG
Sbjct: 154 SSPDPPLPCKAENQTCPYFYWYGDSSNTTGDFATETFTVNLTSPTGKSEFKRV-ENVMFG 212

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G    A     +G+LGL R  +SF SQL S+    FSYCLV    +    SS L
Sbjct: 213 CGHWNRGLFHGA-----SGLLGLGRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKL 266

Query: 156 KFGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            FG D      P    T  +    N    FYY+ +K I +  E +N P  T+++T  G G
Sbjct: 267 IFGEDKDLLNHPELNFTTLVGGKENPVDTFYYVQIKSIMVGGEVLNIPESTWNMTSDGVG 326

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
           G I+DSG+ L+YF    Y  + + FV   + + + Q     +P   CY +        P 
Sbjct: 327 GTIVDSGTTLSYFTEPAYQIIKDAFVKKVKGYPIVQDFPILDP---CYNVSGVEKIDLPD 383

Query: 270 MAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
               F D A      EN FI +D E    L  +      +++IG+ QQ++   +YD    
Sbjct: 384 FGILFADGAVWNFPVENYFIRLDPEEVVCLAILGTPRSALSIIGNYQQQNFHVLYDTKKS 443

Query: 328 LLSFVKENCSD 338
            L +   NC+D
Sbjct: 444 RLGYAPMNCAD 454


>gi|357494221|ref|XP_003617399.1| 60S ribosomal protein L18a [Medicago truncatula]
 gi|355518734|gb|AET00358.1| 60S ribosomal protein L18a [Medicago truncatula]
          Length = 749

 Score =  131 bits (329), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 117/373 (31%), Positives = 165/373 (44%), Gaps = 45/373 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +FIGTP K   LILDTGS L +                +DP++SSSF+ I C  P C   
Sbjct: 196 VFIGTPPKHYSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKESSSFENITCHDPRCKLV 255

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
                    K  N+ C Y   Y D S T G  A ET +V      GK E K +    +FG
Sbjct: 256 SSPDPPKPCKDENQTCPYFYWYGDSSNTTGDFALETFTVNLTTPNGKSEQKHV-ENVMFG 314

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G    A            R  +SF SQL SI    FSYCLV    +    SS L
Sbjct: 315 CGHWNRGLFHGAAGLLGL-----GRGPLSFASQLQSIYGHSFSYCLV-DRNSDTSVSSKL 368

Query: 156 KFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            FG D      P+   T F+    N  + FYY+ +K I +D E +  P +T+ ++  G G
Sbjct: 369 IFGEDKELLSHPNLNFTSFVGGEENSVDTFYYVGIKSIMVDGEVLKIPEETWHLSKEGGG 428

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
           G IIDSG+ LTYF    Y  + E F+   + ++L +    P P++ CY +        P 
Sbjct: 429 GTIIDSGTTLTYFAEPAYEIIKEAFMKKIKGYELVE--GFP-PLKPCYNVSGIEKMELPD 485

Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
               F D A      EN FI    +   L  +      +++IG+ QQ++   +YD+    
Sbjct: 486 FGILFSDGAMWDFPVENYFIQIEPDLVCLAILGTPKSALSIIGNYQQQNFHILYDMKKSR 545

Query: 329 LSFVKENCSDDSA 341
           L +    C+  ++
Sbjct: 546 LGYAPMKCTATTS 558


>gi|145693992|gb|ABP93696.1| unknown protein isoform 1 [Lemna minor]
          Length = 350

 Score =  130 bits (327), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 148/356 (41%), Gaps = 43/356 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           ++ +  GTP K   +I DTGS + +                +FDP  SS+++ I+C    
Sbjct: 17  VITVGFGTPKKNQTVIFDTGSNVNWIQCKPCVVSCYPQQEPLFDPTLSSTYRNISCTSAA 76

Query: 46  CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           CT      C    CVY + Y D S T GF A ET ++        +F+  +FGC  +N G
Sbjct: 77  CTGLSSRGCSGSTCVYGVTYGDGSSTVGFLATETFTL----AAGNVFNNFIFGCGQNNQG 132

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A     AG++GL R   S  SQL + +   FSYC    LP+    + YL  G  + 
Sbjct: 133 LFTGA-----AGLIGLGRSPYSLNSQLATSLGNIFSYC----LPSTSSATGYLNIGNPL- 182

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
            R P   A    +     Y++ L  IS+   R+      F        G IIDSG+V+T 
Sbjct: 183 -RTPGYTAMLTNSRAPTLYFIDLIGISVGGTRLALSSTVFQSV-----GTIIDSGTVITR 236

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLRI 281
                Y  L   F +   ++  A  +     +  CY F   T   FP++  ++   ++ I
Sbjct: 237 LPPTAYGALRTAFRAAMTQYTRAAAASI---LDTCYDFSRTTTVTFPTIKLHYTGLDVTI 293

Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            G  VF +   +   L      D   + +IG+ QQR     YD  +  + F    C
Sbjct: 294 PGAGVFYVISSSQVCLAFAGNSDSTQIGIIGNVQQRTMEVTYDNALKRIGFAAGAC 349


>gi|224074591|ref|XP_002304395.1| predicted protein [Populus trichocarpa]
 gi|222841827|gb|EEE79374.1| predicted protein [Populus trichocarpa]
          Length = 443

 Score =  130 bits (327), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 105/360 (29%), Positives = 167/360 (46%), Gaps = 38/360 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           +++ IGTP   V++I DTGS L +               +FDP +SSS++ + C    C 
Sbjct: 96  MKMSIGTPLVEVIVIADTGSDLTWVQCLPCDPCYRQKSPLFDPSRSSSYRHMLCGSRFCN 155

Query: 48  YFKCVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
               V+EQ        C Y   Y D+S T G  A E  ++             +FGC   
Sbjct: 156 ALD-VSEQACTMDTNICEYHYSYGDKSYTNGNLATEKFTIGSTSSRPVHLSPIVFGCGTG 214

Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           N G FDE        G   LS      +SQL SIIK +FSYCLV PL      +S +KFG
Sbjct: 215 NGGTFDELGSGIVGLGGGALS-----LVSQLSSIIKGKFSYCLV-PLSEQSNVTSKIKFG 268

Query: 159 TDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           TD     P   +T  ++  P+ +YY++L+ IS+ N+R+ +     +  V  +G  IIDSG
Sbjct: 269 TDSVISGPQVVSTPLVSKQPDTYYYVTLEAISVGNKRLPYTNGLLNGNVE-KGNVIIDSG 327

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           + LT+  S+ + +L        E  +  ++SD      +C+      +  P +A +F DA
Sbjct: 328 TTLTFLDSEFFTELERVLE---ETVKAERVSDPRGLFSVCFRSAGDID-LPVIAVHFNDA 383

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++++   N F+   E+      ++ +   + + G+  Q D    YDL    +SF   +C+
Sbjct: 384 DVKLQPLNTFVKADEDLLCFTMISSNQ--IGIFGNLAQMDFLVGYDLEKRTVSFKPTDCT 441


>gi|225430555|ref|XP_002285593.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 481

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 102/364 (28%), Positives = 160/364 (43%), Gaps = 50/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP + +  I DTGS L +                IF+P KS+S+  I+C  P 
Sbjct: 139 VVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPT 198

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C   K        C    CVY ++Y DQS + GF A + +++        +F+  LFGC 
Sbjct: 199 CDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALTSTD----VFNNFLFGCG 254

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G         +AG++GL R  +S +SQ      K FSYC    LP+   ++ YL F
Sbjct: 255 QNNRGLFV-----GVAGLIGLGRNALSLVSQTAQKYGKLFSYC----LPSTSSSTGYLTF 305

Query: 158 GTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G+  G  +        +N    +FY+L+L  IS+   +++     F        G IIDS
Sbjct: 306 GSGGGTSKAVKFTPSLVNSQGPSFYFLNLIAISVGGRKLSTSASVFST-----AGTIIDS 360

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
           G+V++      Y  L   F     ++  A  +     +  CY F        P +  YF 
Sbjct: 361 GTVISRLPPTAYSDLRASFQQQMSKYPKAAPASI---LDTCYDFSQYDTVDVPKINLYFS 417

Query: 276 D-ANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFV 332
           D A + +D   +F I   +    LA A + D   +A++G+ QQ+    VYD+    + F 
Sbjct: 418 DGAEMDLDPSGIFYILNISQ-VCLAFAGNSDATDIAILGNVQQKTFDVVYDVAGGRIGFA 476

Query: 333 KENC 336
              C
Sbjct: 477 PGGC 480


>gi|255548662|ref|XP_002515387.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545331|gb|EEF46836.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 463

 Score =  130 bits (326), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 104/358 (29%), Positives = 164/358 (45%), Gaps = 50/358 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------IFDPRKSSSFQKINCDHPDCTYF----- 49
           +V + +G+P K ++LI DTGS L +A       FDP KS+S+  ++C  P C+       
Sbjct: 135 IVSIGLGSPKKDLMLIFDTGSDLTWARCSAAETFDPTKSTSYANVSCSTPLCSSVISATG 194

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              +C    CVY ++Y D S + GF   E ++ IG  +   IF+   FGC     G D D
Sbjct: 195 NPSRCAASTCVYGIQYGDGSYSIGFLGKERLT-IGSTD---IFNNFYFGC-----GQDVD 245

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              G  AG+LGL R  +S +SQ      + FSYC    LP+   T  +L FG+    +  
Sbjct: 246 GLFGKAAGLLGLGRDKLSVVSQTAPKYNQLFSYC----LPSSSST-GFLSFGSS---QSK 297

Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
           S + T   + P++FY L L  I++  +++  P   F        G IIDSG+V+T     
Sbjct: 298 SAKFTPLSSGPSSFYNLDLTGITVGGQKLAIPLSVFSTA-----GTIIDSGTVVTRLPPA 352

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLRI 281
            Y  L   F      + + +      P+ +   CY F      + P +   F    ++ +
Sbjct: 353 AYSALRSAFRKAMASYPMGK------PLSILDTCYDFSKYKTIKVPKIVISFSGGVDVDV 406

Query: 282 DGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           D   +F+ +       LA A +      A+ G+ QQR+   VYD++   + F   +CS
Sbjct: 407 DQAGIFVANGLKQ-VCLAFAGNTGARDTAIFGNTQQRNFEVVYDVSGGKVGFAPASCS 463


>gi|356557010|ref|XP_003546811.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 437

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 109/362 (30%), Positives = 164/362 (45%), Gaps = 44/362 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L+IGTP    L I DTGS LI+               +F+P KSS+F+   CD   C
Sbjct: 93  LMTLYIGTPPVERLAIADTGSDLIWVQCSPCQNCFPQDTPLFEPLKSSTFKAATCDSQPC 152

Query: 47  TYFKCVNEQC------VYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
           T       QC      +Y+  Y D+S T G    ET+S    G+ + + F  ++FGC   
Sbjct: 153 TSVPPSQRQCGKVGQCIYSYSYGDKSFTVGVVGTETLSFGSTGDAQTVSFPSSIFGCGVY 212

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N+ F     D     V          +SQLG  I  +FSYCL   LP    ++S LKFG+
Sbjct: 213 NN-FTFHTSDKVTGLVGLGGGPLSL-VSQLGPQIGYKFSYCL---LPFSSNSTSKLKFGS 267

Query: 160 DMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           +         +T  I  P   +FY+L+L+ ++I  + +         T   +G  IIDSG
Sbjct: 268 EAIVTTNGVVSTPLIIKPLFPSFYFLNLEAVTIGQKVVP--------TGRTDGNIIIDSG 319

Query: 218 SVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           +VLTY     Y      FV+   E   +    D P P + C+  P      P +AF F  
Sbjct: 320 TVLTYLEQTFY----NNFVASLQEVLSVESAQDLPFPFKFCF--PYRDMTIPVIAFQFTG 373

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           A++ +  +N+ I   + +   LAV P     +++ G+  Q D + VYDL    +SF   +
Sbjct: 374 ASVALQPKNLLIKLQDRNMLCLAVVPSSLSGISIFGNVAQFDFQVVYDLEGKKVSFAPTD 433

Query: 336 CS 337
           C+
Sbjct: 434 CT 435


>gi|414880708|tpg|DAA57839.1| TPA: hypothetical protein ZEAMMB73_997765 [Zea mays]
          Length = 452

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 160/366 (43%), Gaps = 43/366 (11%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---- 47
           +G P    L+++DTGS LI+               ++DPR S + ++I C  P C     
Sbjct: 98  VGDPPTHALVVIDTGSDLIWLQCLPCRRCYRQVTPLYDPRNSKTHRRIPCASPQCRGVLR 157

Query: 48  YFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
           Y  C      CVY + Y D S + G  A +T+ +          H    GC +DN G   
Sbjct: 158 YPGCDARTGGCVYMVVYGDGSASSGDLATDTLVL----PDDTRVHNVTLGCGHDNEGLLA 213

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A     AG+LG  R  +SF +QL       FSYCL   +     +SSYL FG       
Sbjct: 214 SA-----AGLLGAGRGQLSFPTQLAPAYGHVFSYCLGDRMSRARNSSSYLVFGRTP--EL 266

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDIT-VSGEGGCIIDSGSVLT 221
           PST  T    +P   + YY+ +   S+  ER+  F   +  +   +G GG ++DSG+ ++
Sbjct: 267 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVAGFSNASLALNPATGRGGVVVDSGTAIS 326

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----PETFNRFPSMAFYF-ED 276
            F  D Y  + + FVS+     + +L +       CY +    P T  R PS+  +F   
Sbjct: 327 RFTRDAYAAVRDAFVSHAAAAGMRRLRNKFSVFDTCYDVHGNGPGTGVRVPSIVLHFAAA 386

Query: 277 ANLRIDGENVFIIDY---ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           A++ +   N  I         +F L +   DD + ++G+ QQ+    V+D+    + F  
Sbjct: 387 ADMALPQANYLIPVVGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGVVFDVERGRIGFTP 446

Query: 334 ENCSDD 339
             CS +
Sbjct: 447 NGCSGE 452


>gi|356525748|ref|XP_003531485.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 436

 Score =  129 bits (325), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 166/365 (45%), Gaps = 45/365 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++R +IGTP    L I DT S LI+               +F+P KSS+F  ++CD   C
Sbjct: 91  LMRFYIGTPPVERLAIADTASDLIWVQCSPCETCFPQDTPLFEPHKSSTFANLSCDSQPC 150

Query: 47  T-----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           T     Y   V   C+YT  Y D S TKG    E+I     G     F   +FGC ++N 
Sbjct: 151 TSSNIYYCPLVGNLCLYTNTYGDGSSTKGVLCTESIHF---GSQTVTFPKTIFGCGSNND 207

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
              + +    + G++GL    +S +SQLG  I  +FSYCL   LP    ++  LKFG D 
Sbjct: 208 FMHQISNK--VTGIVGLGAGPLSLVSQLGDQIGHKFSYCL---LPFTSTSTIKLKFGNDT 262

Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                   +T  I  P+  ++Y+L L  I+I  + +     T D T    G  IID G+V
Sbjct: 263 TITGNGVVSTPLIIDPHYPSYYFLHLVGITIGQKMLQV--RTTDHT---NGNIIIDLGTV 317

Query: 220 LTYFHSDVYWKLHEKFVSYF-ERFQLAQLS-DCPEPIQLCYFLPETFN-RFPSMAFYFED 276
           LTY   + Y      FV+   E   +++   D P P   C+  P   N  FP + F F  
Sbjct: 318 LTYLEVNFY----HNFVTLLREALGISETKDDIPYPFDFCF--PNQANITFPKIVFQFTG 371

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           A + +  +N+F    + +   LAV P  +    ++ G+  Q D +  YD     +SF   
Sbjct: 372 AKVFLSPKNLFFRFDDLNMICLAVLPDFYAKGFSVFGNLAQVDFQVEYDRKGKKVSFAPA 431

Query: 335 NCSDD 339
           +CS +
Sbjct: 432 DCSKN 436


>gi|302781726|ref|XP_002972637.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
 gi|300160104|gb|EFJ26723.1| hypothetical protein SELMODRAFT_97698 [Selaginella moellendorffii]
          Length = 393

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 159/356 (44%), Gaps = 47/356 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           + +GTP K    I DTGS L++             IFDPR+SS+F++++C    CT    
Sbjct: 59  ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCTELPG 118

Query: 52  VNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             E     C Y+ +Y     T+G  A +TIS+     G   F     GC   N GFD   
Sbjct: 119 SCEPGSSACSYSYEYGSGE-TEGEFARDTISLGTTSGGSQKFPSFAVGCGMVNSGFD--- 174

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
               + G++GL +  +S  SQL + I  +FSYCLV    N +  SS L FG         
Sbjct: 175 ---GVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVD--INSQSESSPLLFGPSAALHGTG 229

Query: 168 TQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            Q+TK I  P++    +Y L++  I++  + M  P           G  IIDSG+ LTY 
Sbjct: 230 IQSTK-ITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYV 277

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLRID 282
            S VY ++  +  S      L ++      + LCY      N +FP++      A +   
Sbjct: 278 PSGVYGRVLSRMESM---VTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 283 GENVF-IIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             N F ++D       LA+     L V++IG+  Q+    +YD     LSFV+  C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSAGGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|312281631|dbj|BAJ33681.1| unnamed protein product [Thellungiella halophila]
          Length = 502

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 103/356 (28%), Positives = 159/356 (44%), Gaps = 42/356 (11%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +GTP+K + ++LDTGS + +               IFDP  SS+F+ + C  P C  
Sbjct: 167 RIGVGTPAKEMYVVLDTGSDVNWIQCLPCSECYQQSDPIFDPTSSSTFKSLTCSDPKCAS 226

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C + +C+Y + Y D S T G  A +T++    GE   +   AL GC +DN G   
Sbjct: 227 LDVSACRSNKCLYQVSYGDGSFTVGNYATDTVTF---GESGKVNDVAL-GCGHDNEGL-- 280

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    I  K FSYCLV         SS L F +      
Sbjct: 281 ------FTGAAGLLGLGGGALSMTNQIKAKSFSYCLV---DRDSAKSSSLDFNSVQIGAG 331

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            +T      +  + FYY+ L   S+  ++++ P   F++  SG GG I+D G+ +T   +
Sbjct: 332 DATAPLLRNSKMDTFYYVGLSGFSVGGQQVSIPSSLFEVDASGAGGVILDCGTAVTRLQT 391

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLR 280
             Y  L + FV     F+         PI L   CY F   +  + P++ F+F    +L 
Sbjct: 392 QAYNSLRDAFVKLTTDFKKGT-----SPISLFDTCYDFSSLSTVKVPTVTFHFTGGKSLN 446

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N  I   +   F  A AP    +++IG+ QQ+ TR  YDL  +L+      C
Sbjct: 447 LPAKNYLIPIDDAGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANNLIGLSANKC 502


>gi|116787398|gb|ABK24493.1| unknown [Picea sitchensis]
          Length = 479

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 169/363 (46%), Gaps = 49/363 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V    GTP+K  LLI+DTGS + +               IF+P++SSS++ ++C    C
Sbjct: 139 IVTAGFGTPAKNSLLIIDTGSDVTWIQCKPCSDCYSQVDPIFEPQQSSSYKHLSCLSSAC 198

Query: 47  TYFKCVNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           T    +N      CVY + Y D S ++G  + ET+++     G   F    FGC + N G
Sbjct: 199 TELTTMNHCRLGGCVYEINYGDGSRSQGDFSQETLTL-----GSDSFPSFAFGCGHTNTG 253

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
             + +     AG+LGL R  +SF SQ  S    +FSYC    LP+   ++S   F    G
Sbjct: 254 LFKGS-----AGLLGLGRTALSFPSQTKSKYGGQFSYC----LPDFVSSTSTGSFSVGQG 304

Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               +      +++ N  +FY++ L  IS+  ER++ PP      V G GG I+DSG+V+
Sbjct: 305 SIPATATFVPLVSNSNYPSFYFVGLNGISVGGERLSIPP-----AVLGRGGTIVDSGTVI 359

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPE-TFNRFPSMAFYFE-DA 277
           T      Y  L   F S       A+    P  I   CY L   +  R P++ F+F+ +A
Sbjct: 360 TRLVPQAYDALKTSFRSKTRNLPSAK----PFSILDTCYDLSSYSQVRIPTITFHFQNNA 415

Query: 278 NLRIDGENV-FIIDYENHFFLLAVAPHDDLVA--LIGSQQQRDTRFVYDLNIDLLSFVKE 334
           ++ +    + F I  +     LA A     ++  +IG+ QQ+  R  +D     + F   
Sbjct: 416 DVAVSAVGILFTIQSDGSQVCLAFASASQSISTNIIGNFQQQRMRVAFDTGAGRIGFAPG 475

Query: 335 NCS 337
           +C+
Sbjct: 476 SCA 478


>gi|356553775|ref|XP_003545228.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 559

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/380 (29%), Positives = 164/380 (43%), Gaps = 65/380 (17%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +F+GTP K   LILDTGS L +                +DP+ SSSF+ I+C  P C   
Sbjct: 199 VFVGTPPKHFSLILDTGSDLNWIQCVPCIACFEQSGPYYDPKDSSSFRNISCHDPRCQLV 258

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-----GKGEGKAIFHGALFG 95
                    K  N+ C Y   Y D S T G  A ET +V      GK E K +    +FG
Sbjct: 259 SSPDPPNPCKAENQSCPYFYWYGDGSNTTGDFALETFTVNLTTPNGKSELKHV-ENVMFG 317

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G    A            +  +SF SQ+ S+  + FSYCLV    N    SS L
Sbjct: 318 CGHWNRGLFHGAAGLLGL-----GKGPLSFASQMQSLYGQSFSYCLVDRNSNAS-VSSKL 371

Query: 156 KFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPPD 200
            FG D           + ++HPN                FYY+ +  + +D+E +  P +
Sbjct: 372 IFGED----------KELLSHPNLNFTSFGGGKDGSVDTFYYVQINSVMVDDEVLKIPEE 421

Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
           T+ ++  G GG IIDSG+ LTYF    Y  + E FV   + ++L +    P P++ CY +
Sbjct: 422 TWHLSSEGAGGTIIDSGTTLTYFAEPAYEIIKEAFVRKIKGYELVE--GLP-PLKPCYNV 478

Query: 261 PETFN-RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
                   P     F D A      EN FI    +   L  +      +++IG+ QQ++ 
Sbjct: 479 SGIEKMELPDFGILFADGAVWNFPVENYFIQIDPDVVCLAILGNPRSALSIIGNYQQQNF 538

Query: 319 RFVYDLNIDLLSFVKENCSD 338
             +YD+    L +    C+D
Sbjct: 539 HILYDMKKSRLGYAPMKCAD 558


>gi|225423917|ref|XP_002281973.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
          Length = 491

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 107/356 (30%), Positives = 165/356 (46%), Gaps = 44/356 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG+P K V +++DTGS + +               IF+P  SSS+  + C+   C  
Sbjct: 158 RVGIGSPPKHVYMVVDTGSDVNWVQCAPCADCYQQADPIFEPSFSSSYAPLTCETHQCKS 217

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
               +C N+ C+Y + Y D S T G  A ETI++    +G A  +    GC +DN G   
Sbjct: 218 LDVSECRNDSCLYEVSYGDGSYTVGDFATETITL----DGSASLNNVAIGCGHDNEGLFV 273

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A              ++SF SQ+ +     FSYCLV        ++S L+F + +    
Sbjct: 274 GAAGLLGL-----GGGSLSFPSQINA---SSFSYCLV---NRDTDSASTLEFNSPI---- 318

Query: 166 PSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           PS   T  +   N  + FYYL +  I +  + ++ P  +F++  SG GG I+DSG+ +T 
Sbjct: 319 PSHSVTAPLLRNNQLDTFYYLGMTGIGVGGQMLSIPRSSFEVDESGNGGIIVDSGTAVTR 378

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-LR 280
             SDVY  L + FV   +   L   S        CY L    +   P+++F+F D   L 
Sbjct: 379 LQSDVYNSLRDSFVRGTQ--HLPSTSGVAL-FDTCYDLSSRSSVEVPTVSFHFPDGKYLA 435

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N  I       F  A AP    +++IG+ QQ+ TR  YDL+  L+ F    C
Sbjct: 436 LPAKNYLIPVDSAGTFCFAFAPTTSALSIIGNVQQQGTRVSYDLSNSLVGFSPNGC 491


>gi|218192703|gb|EEC75130.1| hypothetical protein OsI_11317 [Oryza sativa Indica Group]
          Length = 440

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 165/364 (45%), Gaps = 46/364 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP + V L LDTGS L++                +D  +SS+F   +CD   C
Sbjct: 92  LLHLAIGTPPQPVQLTLDTGSDLVWTQCQPCAVCFNQSLPYYDASRSSTFALPSCDSTQC 151

Query: 47  ----TYFKCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
               +   CVN+    C ++  Y D+S T GF   ET+S +      A   G +FGC  +
Sbjct: 152 KLDPSVTMCVNQTVQTCAFSYSYGDKSATIGFLDVETVSFVAG----ASVPGVVFGCGLN 207

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL-KFG 158
           N G           G+ G  R  +S  SQL       FS+C      +G   S+ L    
Sbjct: 208 NTGIFRSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--SGRKPSTVLFDLP 258

Query: 159 TDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            D+ Y+  R + Q T  I +P +  FYYLSLK I++ + R+  P   F +  +G GG II
Sbjct: 259 ADL-YKNGRGTVQTTPLIKNPAHPTFYYLSLKGITVGSTRLPVPESAFALK-NGTGGTII 316

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE--TFNRFPSMAF 272
           DSG+  T     VY  +H++F ++ +   +      P    LC+  P        P +  
Sbjct: 317 DSGTAFTSLPPRVYRLVHDEFAAHVKLPVVPSNETGP---LLCFSAPPLGKAPHVPKLVL 373

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +FE A + +  EN      +     + +A  +  + +IG+ QQ++   +YDL    LSFV
Sbjct: 374 HFEGATMHLPRENYVFEAKDGGNCSICLAIIEGEMTIIGNFQQQNMHVLYDLKNSKLSFV 433

Query: 333 KENC 336
           +  C
Sbjct: 434 RAKC 437


>gi|255563827|ref|XP_002522914.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537841|gb|EEF39457.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 113/360 (31%), Positives = 169/360 (46%), Gaps = 45/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP +    I+DTGS LI+               IFDP+KSSSF K++C    C
Sbjct: 101 LMNLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPSPIFDPKKSSSFSKLSCSSQLC 160

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                   ++ C Y   Y D S T+G  A ET +      GK       FGC  DN G  
Sbjct: 161 KALPQSSCSDSCEYLYTYGDYSSTQGTMATETFTF-----GKVSIPNVGFGCGEDNEG-- 213

Query: 105 EDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--D 160
               DG    +G++GL R  +S +SQL    + +FSYCL       +  +S L  G+   
Sbjct: 214 ----DGFTQGSGLVGLGRGPLSLVSQLK---EAKFSYCLT---SIDDTKTSTLLMGSLAS 263

Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
           +     + + T  I +P   +FYYLSL+ IS+   R+     TF +   G GG IIDSG+
Sbjct: 264 VNGTSAAIRTTPLIQNPLQPSFYYLSLEGISVGGTRLPIKESTFQLQDDGTGGLIIDSGT 323

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFED 276
            +TY     +  + ++F S   +  L   +     ++LCY LP   +    P +  +F  
Sbjct: 324 TITYLEESAFDLVKKEFTS---QMGLPVDNSGATGLELCYNLPSDTSELEVPKLVLHFTG 380

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L + GEN  I D       LA+     + ++ G+ QQ++    +DL  + LSF+  NC
Sbjct: 381 ADLELPGENYMIADSSMGVICLAMGSSGGM-SIFGNVQQQNMFVSHDLEKETLSFLPTNC 439


>gi|15222611|ref|NP_173922.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical protein [Arabidopsis thaliana]
 gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis thaliana]
 gi|23198172|gb|AAN15613.1| unknown protein [Arabidopsis thaliana]
 gi|110736960|dbj|BAF00436.1| hypothetical protein [Arabidopsis thaliana]
 gi|332192515|gb|AEE30636.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 162/358 (45%), Gaps = 47/358 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IG P++ V ++LDTGS + +               IF+P  SSS++ ++CD P C 
Sbjct: 150 TRVGIGKPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCN 209

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C N  C+Y + Y D S T G  A ET+++     G  +      GC + N G  
Sbjct: 210 ALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI-----GSTLVQNVAVGCGHSNEGLF 264

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A               ++  SQL +     FSYCLV        ++S + FGT +   
Sbjct: 265 VGAAGLLGL-----GGGLLALPSQLNT---TSFSYCLV---DRDSDSASTVDFGTSLS-- 311

Query: 165 RPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
             +  A    NH  + FYYL L  IS+  E +  P  +F++  SG GG IIDSG+ +T  
Sbjct: 312 PDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 371

Query: 224 HSDVYWKLHEKFVSY---FERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN- 278
            +++Y  L + FV      E+     + D       CY L  +T    P++AF+F     
Sbjct: 372 QTEIYNSLRDSFVKGTLDLEKAAGVAMFDT------CYNLSAKTTVEVPTVAFHFPGGKM 425

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  +N  I       F LA AP    +A+IG+ QQ+ TR  +DL   L+ F    C
Sbjct: 426 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 483


>gi|168043550|ref|XP_001774247.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674374|gb|EDQ60883.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 158/362 (43%), Gaps = 52/362 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ +  G+P +   +I+DTGS LI+               IFDP KSS++  ++C    C
Sbjct: 81  LIDISFGSPPQKASVIVDTGSDLIWTQCLPCETCNAAASVIFDPVKSSTYDTVSCASNFC 140

Query: 47  TY--FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           +   F+     C Y   Y D S T G      +S      G        FGC + N G  
Sbjct: 141 SSLPFQSCTTSCKYDYMYGDGSSTSG-----ALSTETVTVGTGTIPNVAFGCGHTNLGSF 195

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A     AG++GL +  +S ISQ  SI  K+FSYCLV   P G   +S +  G D    
Sbjct: 196 AGA-----AGIVGLGQGPLSLISQASSITSKKFSYCLV---PLGSTKTSPMLIG-DSAAA 246

Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
                     N  N  FYY  L  IS+  + + +P  TF I  SG+GG I+DSG+ LTY 
Sbjct: 247 GGVAYTALLTNTANPTFYYADLTGISVSGKAVTYPVGTFSIDASGQGGFILDSGTTLTYL 306

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFN-RFPSMAFYF 274
            +             F     A  ++ P P        +  C+      N  +P+M F+F
Sbjct: 307 ETGA-----------FNALVAALKAEVPFPEADGSLYGLDYCFSTAGVANPTYPTMTFHF 355

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + A+  +  ENVF+         LA+A      +++G+ QQ++   V+DL    + F + 
Sbjct: 356 KGADYELPPENVFVALDTGGSICLAMAASTGF-SIMGNIQQQNHLIVHDLVNQRVGFKEA 414

Query: 335 NC 336
           NC
Sbjct: 415 NC 416


>gi|302780643|ref|XP_002972096.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
 gi|300160395|gb|EFJ27013.1| hypothetical protein SELMODRAFT_96575 [Selaginella moellendorffii]
          Length = 393

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/356 (29%), Positives = 159/356 (44%), Gaps = 47/356 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           + +GTP K    I DTGS L++             IFDPR+SS+F++++C    C     
Sbjct: 59  ISVGTPGKRFRAIADTGSDLVWVQSEPCTGCSGGTIFDPRQSSTFREMDCSSQLCAELPG 118

Query: 52  VNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             E     C Y+ +Y     T+G  A +TIS+    +G   F     GC   N GFD   
Sbjct: 119 SCEPGSSTCSYSYEYGSGE-TEGEFARDTISLGTTSDGSQKFPSFAVGCGMVNSGFD--- 174

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
               + G++GL +  +S  SQL + I  +FSYCLV    N +  SS L FG         
Sbjct: 175 ---GVDGLVGLGQGPVSLTSQLSAAIDSKFSYCLVD--INSQSESSPLLFGPSAALHGTG 229

Query: 168 TQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            Q+TK I  P++    +Y L++  I++  + M  P           G  IIDSG+ LTY 
Sbjct: 230 IQSTK-ITPPSDTYPTYYLLTVNGIAVAGQTMGSP-----------GTTIIDSGTTLTYV 277

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLRID 282
            S VY ++  +  S      L ++      + LCY      N +FP++      A +   
Sbjct: 278 PSGVYGRVLSRMESM---VTLPRVDGSSMGLDLCYDRSSNRNYKFPALTIRLAGATMTPP 334

Query: 283 GENVF-IIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             N F ++D       LA+     L V++IG+  Q+    +YD     LSFV+  C
Sbjct: 335 SSNYFLVVDDSGDTVCLAMGSASGLPVSIIGNVMQQGYHILYDRGSSELSFVQAKC 390


>gi|224111722|ref|XP_002315953.1| predicted protein [Populus trichocarpa]
 gi|222864993|gb|EEF02124.1| predicted protein [Populus trichocarpa]
          Length = 484

 Score =  129 bits (323), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 163/355 (45%), Gaps = 43/355 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG P     LILDTGS + +               IF+P  S+SF  ++C+   C  
Sbjct: 152 RVGIGKPPSQAYLILDTGSDVNWVQCAPCADCYQQADPIFEPASSASFSTLSCNTRQCRS 211

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
               +C N+ C+Y + Y D S T G    ETI++     G A       GC ++N G   
Sbjct: 212 LDVSECRNDTCLYEVSYGDGSYTVGDFVTETITL-----GSAPVDNVAIGCGHNNEGLFV 266

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A              ++SF SQ+ +     FSYCLV        ++S L+F + +    
Sbjct: 267 GAAGLLGL-----GGGSLSFPSQINA---TSFSYCLV---DRDSESASTLEFNSTL---P 312

Query: 166 PSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           P+  +   +  +H + FYY+ L  +S+  E ++ P   F I  SG GG I+DSG+ +T  
Sbjct: 313 PNAVSAPLLRNHHLDTFYYVGLTGLSVGGELVSIPESAFQIDESGNGGVIVDSGTAITRL 372

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-LRI 281
            +DVY  L + FV   +R +    ++       CY L    N   P+++F+F D   L +
Sbjct: 373 QTDVYNSLRDAFV---KRTRDLPSTNGIALFDTCYDLSSKGNVEVPTVSFHFPDGKELPL 429

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             +N  +       F  A AP    +++IG+ QQ+ TR VYDL   L+ FV   C
Sbjct: 430 PAKNYLVPLDSEGTFCFAFAPTASSLSIIGNVQQQGTRVVYDLVNHLVGFVPNKC 484


>gi|357127293|ref|XP_003565317.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial
           [Brachypodium distachyon]
          Length = 540

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/362 (28%), Positives = 155/362 (42%), Gaps = 48/362 (13%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ IG+P++ + ++LDTGS + +               +FDP  SSS+  + CD P C  
Sbjct: 199 RIGIGSPARQLYMVLDTGSDVTWLQCAPCADCYAQSDPLFDPALSSSYATVPCDSPHCRA 258

Query: 49  FKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                        N  CVY + Y D S T G  A ET+++   G+G A  H    GC +D
Sbjct: 259 LDASACHNNAANGNSSCVYEVAYGDGSYTVGDFATETLTL--GGDGSAAVHDVAIGCGHD 316

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G    A               +SF SQ   I    FSYCLV        ++S L+FG 
Sbjct: 317 NEGLFVGAAGLLAL-----GGGPLSFPSQ---ISATEFSYCLV---DRDSPSASTLQFGA 365

Query: 160 DMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDS 216
                  ST     +  P  N FYY++L  IS+  E + + PP  F +   G GG I+DS
Sbjct: 366 S----DSSTVTAPLMRSPRSNTFYYVALNGISVGGETLSDIPPAAFAMDEQGSGGVIVDS 421

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE 275
           G+ +T   S  Y  L + FV   +    A           CY L   +  + P+++  FE
Sbjct: 422 GTAVTRLQSSAYSALRDAFVRGTQALPRASGVSL---FDTCYDLAGRSSVQVPAVSLRFE 478

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               L++  +N  I       + LA A     V+++G+ QQ+  R  +D   + + F   
Sbjct: 479 GGGELKLPAKNYLIPVDGAGTYCLAFAATGGAVSIVGNVQQQGIRVSFDTAKNTVGFSPN 538

Query: 335 NC 336
            C
Sbjct: 539 KC 540


>gi|225430551|ref|XP_002283470.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 490

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/370 (28%), Positives = 162/370 (43%), Gaps = 62/370 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +G+P + +  I DTGS L +                IFDP  S S+  ++CD P 
Sbjct: 148 VVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPS 207

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C   +        C +  C+Y ++Y D S + GF A E +S+        +F+   FGC 
Sbjct: 208 CEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTSTD----VFNNFQFGCG 263

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G       G  AG+LGL+R  +S +SQ      K FSYC    LP+   ++ YL F
Sbjct: 264 QNNRGLF-----GGTAGLLGLARNPLSLVSQTAQKYGKVFSYC----LPSSSSSTGYLSF 314

Query: 158 GTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           G+  G     ++A KF     N     FY+L +  IS+   ++  P   F        G 
Sbjct: 315 GSGDG----DSKAVKFTPSEVNSDYPSFYFLDMVGISVGERKLPIPKSVFST-----AGT 365

Query: 213 IIDSGSVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
           IIDSG+V++     VY    K+  + +S + R +   + D       CY L +    + P
Sbjct: 366 IIDSGTVISRLPPTVYSSVQKVFRELMSDYPRVKGVSILD------TCYDLSKYKTVKVP 419

Query: 269 SMAFYFE-DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            +  YF   A + +  E  ++++         A    DD VA+IG+ QQ+    VYD   
Sbjct: 420 KIILYFSGGAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAE 479

Query: 327 DLLSFVKENC 336
             + F    C
Sbjct: 480 GRVGFAPSGC 489


>gi|297828001|ref|XP_002881883.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297327722|gb|EFH58142.1| hypothetical protein ARALYDRAFT_903676 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 529

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 114/370 (30%), Positives = 162/370 (43%), Gaps = 42/370 (11%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           + +GTP K   LILDTGS L +              A +DP+ S+SF+ I C+ P C+  
Sbjct: 166 VLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNEAFYDPKTSASFKNITCNDPRCSLI 225

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAI---FHGALFGC 96
                    K  N+ C Y   Y D+S T G  A ET +V +   EG++        +FGC
Sbjct: 226 SSPEPPVQCKSDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGRSSEYKVENMMFGC 285

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF SQL S+    FSYCLV    +    SS L 
Sbjct: 286 GHWNRGLFSGASGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 339

Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           FG D       +   T F+N   N    FYY+ +K I +  E ++ P +T++I+  G GG
Sbjct: 340 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGEALDIPEETWNISPDGAGG 399

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFNRFPS 269
            IIDSG+ L+YF    Y  +  KF    +   L    D P  +P      + E     P 
Sbjct: 400 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYLV-FRDFPVLDPCFNVSGIEENNIHLPE 458

Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +   F D A      EN FI   E+   L  +       ++IG+ QQ++   +YD  +  
Sbjct: 459 LGIAFADGAVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKMSR 518

Query: 329 LSFVKENCSD 338
           L F    C+D
Sbjct: 519 LGFTPTKCAD 528


>gi|297845610|ref|XP_002890686.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336528|gb|EFH66945.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  128 bits (322), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 108/358 (30%), Positives = 163/358 (45%), Gaps = 47/358 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IG P++ V ++LDTGS + +               IF+P  SSS++ ++CD P C 
Sbjct: 153 TRVGIGNPAREVYMVLDTGSDVNWLQCTPCADCYHQTEPIFEPSSSSSYEPLSCDTPQCN 212

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C N  C+Y + Y D S T G  A ET+++     G  +      GC + N G  
Sbjct: 213 ALEVSECRNATCLYEVSYGDGSYTVGDFATETLTI-----GSTLVQNVAVGCGHSNEGLF 267

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A               ++  SQL +     FSYCLV        ++S ++FGT +   
Sbjct: 268 VGAAGLLGL-----GGGLLALPSQLNT---TSFSYCLV---DRDSDSASTVEFGTSL--P 314

Query: 165 RPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
             +  A    NH  + FYYL L  IS+  E +  P  +F++  SG GG IIDSG+ +T  
Sbjct: 315 PDAVVAPLLRNHQLDTFYYLGLTGISVGGELLQIPQSSFEMDESGSGGIIIDSGTAVTRL 374

Query: 224 HSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN- 278
            + +Y  L + F+   S  E+     + D       CY L  +T    P++AF+F     
Sbjct: 375 QTGIYNSLRDSFLKGTSDLEKAAGVAMFD------TCYNLSAKTTIEVPTVAFHFPGGKM 428

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +  +N  I       F LA AP    +A+IG+ QQ+ TR  +DL   L+ F    C
Sbjct: 429 LALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSLIGFSSNKC 486


>gi|125543640|gb|EAY89779.1| hypothetical protein OsI_11321 [Oryza sativa Indica Group]
          Length = 434

 Score =  128 bits (321), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 162/366 (44%), Gaps = 47/366 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FDP  SS+    +CD   C
Sbjct: 83  LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142

Query: 47  TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                          N+ CVYT  Y D+SVT GF   +  + +G G   A   G  FGC 
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
             N+G  +        G+ G  R  +S  SQL       FS+C      NG   S+  L 
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250

Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
              D+ Y+  R + Q+T  I +P N  FYYLSLK I++ + R+  P   F +  +G GG 
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFTLK-NGTGGT 308

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
           IIDSG+ +T   + VY  + + F +   + +L  +S        C   P     + P + 
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            +FE A + +  EN VF ++      L         V  IG+ QQ++   +YDL    LS
Sbjct: 366 LHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLS 425

Query: 331 FVKENC 336
           FV   C
Sbjct: 426 FVPAQC 431


>gi|115452685|ref|NP_001049943.1| Os03g0318400 [Oryza sativa Japonica Group]
 gi|108707841|gb|ABF95636.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548414|dbj|BAF11857.1| Os03g0318400 [Oryza sativa Japonica Group]
          Length = 434

 Score =  128 bits (321), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 115/366 (31%), Positives = 162/366 (44%), Gaps = 47/366 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FDP  SS+    +CD   C
Sbjct: 83  LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142

Query: 47  TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                          N+ CVYT  Y D+SVT GF   +  + +G G   A   G  FGC 
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
             N+G  +        G+ G  R  +S  SQL       FS+C      NG   S+  L 
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250

Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
              D+ Y+  R + Q+T  I +P N  FYYLSLK I++ + R+  P   F +  +G GG 
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGT 308

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
           IIDSG+ +T   + VY  + + F +   + +L  +S        C   P     + P + 
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            +FE A + +  EN VF ++      L         V  IG+ QQ++   +YDL    LS
Sbjct: 366 LHFEGATMDLPRENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLS 425

Query: 331 FVKENC 336
           FV   C
Sbjct: 426 FVPAQC 431


>gi|357158691|ref|XP_003578210.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 459

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 105/367 (28%), Positives = 154/367 (41%), Gaps = 43/367 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L +GTP + V  +LDTGS LI+               IF P  SSS++ + C    C
Sbjct: 105 LVDLAVGTPPQPVSALLDTGSDLIWTQCAPCASCLPQPDPIFSPGASSSYEPMRCAGELC 164

Query: 47  T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGCSND 99
               +  C   + C Y   Y D + T+G  A E  +       GE   +     FGC   
Sbjct: 165 NDILHHSCQRPDTCTYRYSYGDGTTTRGVYATERFTFSSSSSGGETTKLSAPLGFGCGTM 224

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G   +      +G++G  R  +S +SQL     +RFSYCL    P      S L FG+
Sbjct: 225 NKGSLNNG-----SGIVGFGRAPLSLVSQLA---IRRFSYCLT---PYASGRKSTLLFGS 273

Query: 160 DMG----YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G        + Q T+ +    N  FYY+    +++   R+  P   F +   G GG I
Sbjct: 274 LRGGVYDAATATVQTTRLLRSRQNPTFYYVPFTGVTVGARRLRIPISAFALRPDGSGGAI 333

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF----NRFPS 269
           +DSG+ LT F + V  ++   F S       A  S  P+   +C+    +        P 
Sbjct: 334 VDSGTALTLFPAPVLAEVVRAFRSQLRLPFAANGSSGPD-DGVCFAAAASRVPRPAVVPR 392

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           M F+ + A+L +   N  + D       L +A   D    IG+  Q+D R +YDL  D L
Sbjct: 393 MVFHLQGADLDLPRRNYVLDDQRKGNLCLLLADSGDSGTTIGNFVQQDMRVLYDLEADTL 452

Query: 330 SFVKENC 336
           SF    C
Sbjct: 453 SFAPAQC 459


>gi|413918484|gb|AFW58416.1| hypothetical protein ZEAMMB73_998053 [Zea mays]
          Length = 475

 Score =  127 bits (320), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 106/376 (28%), Positives = 166/376 (44%), Gaps = 58/376 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L +GTP+     I+DTGS L++               +FDP  SS++  + C    C
Sbjct: 117 LMDLSVGTPALPYAAIVDTGSDLVWTQCKPCVECFNQTTPVFDPAASSTYAALPCSSALC 176

Query: 47  TYFKCVNEQCV-----------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                                 YT  Y D S T+G  A ET ++      +    G  FG
Sbjct: 177 ADLPTSTCASSSSSSSASSPCGYTYTYGDASSTQGVLATETFTL-----ARQKVPGVAFG 231

Query: 96  C--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           C  +N+  GF + A      G++GL R  +S +SQLG     RFSYCL   L +    S 
Sbjct: 232 CGDTNEGDGFTQGA------GLVGLGRGPLSLVSQLG---IDRFSYCLT-SLDDAAGRSP 281

Query: 154 YL---KFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            L     G          Q T  + +P+  +FYY+SL  +++ + R+  P   F I   G
Sbjct: 282 LLLGSAAGISASAATAPAQTTPLVKNPSQPSFYYVSLTGLTVGSTRLALPSSAFAIQDDG 341

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP------E 262
            GG I+DSG+ +TY     Y  L + FV++     L  +      + LC+  P      +
Sbjct: 342 TGGVIVDSGTSITYLELRAYRALRKAFVAHMS---LPTVDASEIGLDLCFQGPAGAVDQD 398

Query: 263 TFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
              + P +  +F+  A+L +  EN  ++D  +    L V     L ++IG+ QQ++ +FV
Sbjct: 399 VQVQVPKLVLHFDGGADLDLPAENYMVLDSASGALCLTVMASRGL-SIIGNFQQQNFQFV 457

Query: 322 YDLNIDLLSFVKENCS 337
           YD+  D LSF    C+
Sbjct: 458 YDVAGDTLSFAPAECN 473


>gi|226508202|ref|NP_001141111.1| hypothetical protein precursor [Zea mays]
 gi|194702684|gb|ACF85426.1| unknown [Zea mays]
 gi|414590469|tpg|DAA41040.1| TPA: hypothetical protein ZEAMMB73_571218 [Zea mays]
          Length = 439

 Score =  127 bits (320), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 169/371 (45%), Gaps = 53/371 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP    L I DTGS LI+                +++P  S++F  + C+   
Sbjct: 86  LMTLAIGTPPLPFLAIADTGSDLIWTQCAPCSRQCFQQPTPLYNPSSSTTFSALPCNSSL 145

Query: 44  ----PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
               P C         C+Y M Y     T  F   ET +         +   G  FGCSN
Sbjct: 146 GLCAPAC--------ACMYNMTYGS-GWTYVFQGTETFTFGSSTPADQVRVPGIAFGCSN 196

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            + GF+  +     +G++GL R ++S +SQLG+    +FSYCL  P  +   TS+ L   
Sbjct: 197 ASSGFNASSA----SGLVGLGRGSLSLVSQLGA---PKFSYCLT-PYQDTNSTSTLLLGP 248

Query: 159 TDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           +          +T F+  P++ +YYL+L  IS+    +  PP+ F +   G GG IIDSG
Sbjct: 249 SASLNDTGVVSSTPFVASPSSIYYYLNLTGISLGTTALPIPPNAFSLKADGTGGLIIDSG 308

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFYF 274
           + +T   +  Y ++    +S                + LC+ LP + +     PSM  +F
Sbjct: 309 TTITMLGNTAYQQVRAAVLSLVTLPTTD--GSAATGLDLCFELPSSTSAPPSMPSMTLHF 366

Query: 275 EDANLRIDGENVFI----IDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNI 326
           + A++ +  +N  +     D ++  + LA+    D    +V+++G+ QQ++   +YD+  
Sbjct: 367 DGADMVLPADNYMMSLSDPDSDSSLWCLAMQNQTDTDGVVVSILGNYQQQNMHILYDVGK 426

Query: 327 DLLSFVKENCS 337
           + LSF    CS
Sbjct: 427 ETLSFAPAKCS 437


>gi|302776610|ref|XP_002971459.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
 gi|300160591|gb|EFJ27208.1| hypothetical protein SELMODRAFT_64134 [Selaginella moellendorffii]
          Length = 357

 Score =  127 bits (319), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 110/359 (30%), Positives = 169/359 (47%), Gaps = 41/359 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           VR+ IG+P+K   L++DTGS + +              A+FDPR SSSF++++C  P C 
Sbjct: 16  VRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQCK 75

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                     + +C+Y + Y D S T G  A ++ SV  +G    +    +FGC +DN G
Sbjct: 76  LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSFSV-SRGRTSPV----VFGCGHDNEG 130

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A      G   L     SF SQL S   ++FSYCLV    NG   SS L FG    
Sbjct: 131 LFVGAAGLLGLGAGKL-----SFPSQLSS---RKFSYCLV-SRDNGVRASSALLFGDSAL 181

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSV 219
               S   T+ + +P  + FYY  L  ISI    ++ P   F ++ S G GG IIDSG+ 
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DA 277
           +T   +  Y  + + F S  ++  L + +D       CY F   T    P+++F+FE  A
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQK--LPRAADF-SLFDTCYDFSALTSVTIPTVSFHFEGGA 298

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++++   N  +    +  F  A +     +++IG+ QQ+  R   DL+   + F    C
Sbjct: 299 SVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|242056497|ref|XP_002457394.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
 gi|241929369|gb|EES02514.1| hypothetical protein SORBIDRAFT_03g006630 [Sorghum bicolor]
          Length = 509

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 161/361 (44%), Gaps = 50/361 (13%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG+P++ + ++LDTGS + +               +FDP  S+S+  ++CD P C  
Sbjct: 172 RVGIGSPARELYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSPRCRD 231

Query: 49  F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C N    C+Y + Y D S T G  A ET+++   G+   + + A+ GC +DN G 
Sbjct: 232 LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVTNVAI-GCGHDNEGL 287

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A      G   LS     F SQ   I    FSYCLV         +S L+FG D   
Sbjct: 288 FVGAAGLLALGGGPLS-----FPSQ---ISASTFSYCLV---DRDSPAASTLQFGADGA- 335

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
               T     +  P    FYY++L  IS+  + ++ P   F +   SG GG I+DSG+ +
Sbjct: 336 -EADTVTAPLVRSPRTGTFYYVALSGISVGGQALSIPSSAFAMDATSGSGGVIVDSGTAV 394

Query: 221 TYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
           T   S  Y  L + FV       R     L D       CY L + T    P+++  FE 
Sbjct: 395 TRLQSSAYAALRDAFVRGTPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEG 448

Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
              LR+  +N  I       + LA AP +  V++IG+ QQ+ TR  +D    ++ F    
Sbjct: 449 GGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKGVVGFTPNK 508

Query: 336 C 336
           C
Sbjct: 509 C 509


>gi|356527091|ref|XP_003532147.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 482

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 109/363 (30%), Positives = 164/363 (45%), Gaps = 49/363 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
           V + +GTP + + LI DTGS L +                IFDP KSSS+  I C    C
Sbjct: 142 VVVGLGTPKRDLSLIFDTGSYLTWTQCEPCAGSCYKQQDPIFDPSKSSSYTNIKCTSSLC 201

Query: 47  TYFKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           T F+        +  C+Y +KY D S+++GF + E +++        I H  LFGC  DN
Sbjct: 202 TQFRSAGCSSSTDASCIYDVKYGDNSISRGFLSQERLTITATD----IVHDFLFGCGQDN 257

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G          AG++GLSR  ISF+ Q  SI  K FSYC    LP+   +  +L FG  
Sbjct: 258 EGLFR-----GTAGLMGLSRHPISFVQQTSSIYNKIFSYC----LPSTPSSLGHLTFGAS 308

Query: 161 MGYRRPSTQATKF--INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 + + T F  I+  N+FY L +  IS+   ++   P     T S  GG IIDSG+
Sbjct: 309 AA-TNANLKYTPFSTISGENSFYGLDIVGISVGGTKL---PAVSSSTFSA-GGSIIDSGT 363

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA 277
           V+T      Y  L   F  +  ++ +A  +     +  CY F        P + F F   
Sbjct: 364 VITRLPPTAYAALRSAFRQFMMKYPVAYGT---RLLDTCYDFSGYKEISVPRIDFEFA-G 419

Query: 278 NLRIDGENVFIIDYEN-HFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            ++++   V I+  E+     LA A + +   + + G+ QQ+    VYD+    + F   
Sbjct: 420 GVKVELPLVGILYGESAQQLCLAFAANGNGNDITIFGNVQQKTLEVVYDVEGGRIGFGAA 479

Query: 335 NCS 337
            C+
Sbjct: 480 GCN 482


>gi|108707838|gb|ABF95633.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 391

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 111/369 (30%), Positives = 164/369 (44%), Gaps = 49/369 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FDP  SS+    +CD   C
Sbjct: 36  LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 95

Query: 47  TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                          N+ CVYT  Y D+SVT GF   +  + +G G   A   G  FGC 
Sbjct: 96  QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 152

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N+G  +        G+ G  R  +S  SQL       FS+C    +     ++  L  
Sbjct: 153 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLK---VGNFSHCFTT-ITGAIPSTVLLDL 204

Query: 158 GTDM-GYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
             D+    + + Q T  I +  N      YYLSLK I++ + R+  P   F +T +G GG
Sbjct: 205 PADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGG 263

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSM 270
            IIDSG+ +T     VY  + ++F +   + +L  +         C+  P +     P +
Sbjct: 264 TIIDSGTSITSLPPQVYQVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPKL 320

Query: 271 AFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
             +FE A + +  EN VF +  D  N    LA+   D+   +IG+ QQ++   +YDL  +
Sbjct: 321 VLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQNN 379

Query: 328 LLSFVKENC 336
           +LSFV   C
Sbjct: 380 MLSFVAAQC 388


>gi|302765224|ref|XP_002966033.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
 gi|300166847|gb|EFJ33453.1| hypothetical protein SELMODRAFT_64135 [Selaginella moellendorffii]
          Length = 357

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 108/359 (30%), Positives = 169/359 (47%), Gaps = 41/359 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           VR+ IG+P+K   L++DTGS + +              A+FDPR SSSF++++C  P C 
Sbjct: 16  VRVGIGSPTKLQYLVMDTGSDVPWIQCSPCKSCYKQNDAVFDPRASSSFRRLSCSTPQCK 75

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                     + +C+Y + Y D S T G  A ++  ++ +G    +    +FGC +DN G
Sbjct: 76  LLDVKACASTDNRCLYQVSYGDGSFTVGDLASDSF-LVSRGRTSPV----VFGCGHDNEG 130

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A      G   L     SF SQL S   ++FSYCLV    NG   SS L FG    
Sbjct: 131 LFVGAAGLLGLGAGKL-----SFPSQLSS---RKFSYCLV-SRDNGVRASSALLFGDSAL 181

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSV 219
               S   T+ + +P  + FYY  L  ISI    ++ P   F ++ S G GG IIDSG+ 
Sbjct: 182 PTSASFAYTQLLKNPKLDTFYYAGLSGISIGGTLLSIPSTAFKLSSSTGRGGVIIDSGTS 241

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DA 277
           +T   +  Y  + + F S  ++  L + +D       CY F   T    P+++F+FE  A
Sbjct: 242 VTRLPTYAYTVMRDAFRSATQK--LPRAADF-SLFDTCYDFSALTSVTIPTVSFHFEGGA 298

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++++   N  +    +  F  A +     +++IG+ QQ+  R   DL+   + F    C
Sbjct: 299 SVQLPPSNYLVPVDTSGTFCFAFSKTSLDLSIIGNIQQQTMRVAIDLDSSRVGFAPRQC 357


>gi|297603570|ref|NP_001054261.2| Os04g0677100 [Oryza sativa Japonica Group]
 gi|255675885|dbj|BAF16175.2| Os04g0677100 [Oryza sativa Japonica Group]
          Length = 464

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 150/361 (41%), Gaps = 54/361 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P     L++D+GS +I+               +FDP  SSSF  ++C    C 
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191

Query: 48  YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                         +C Y++ Y D S TKG  A ET+++     G     G   GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     AG+LGL    +S + QLG      FSYCL      G   +  L  G  
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA---SRGAGGAGSLVLG-- 296

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                  T+A       ++FYY+ L  I +  ER+      F +T  G GG ++D+G+ +
Sbjct: 297 ------RTEAVPRGRRASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 350

Query: 221 TYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-E 275
           T    + Y  L   F   +    R     L D       CY L    + R P+++FYF +
Sbjct: 351 TRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFDQ 404

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L +   N  +++     F LA AP    ++++G+ QQ   +   D     + F    
Sbjct: 405 GAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNT 463

Query: 336 C 336
           C
Sbjct: 464 C 464


>gi|302806531|ref|XP_002985015.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
 gi|300147225|gb|EFJ13890.1| hypothetical protein SELMODRAFT_121417 [Selaginella moellendorffii]
          Length = 533

 Score =  126 bits (316), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 116/368 (31%), Positives = 172/368 (46%), Gaps = 45/368 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +F+G P +  LLI+DTGS L +               +FDP +S+SF+ I C+   C   
Sbjct: 175 VFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLV 234

Query: 50  ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
                     K   + C Y   Y D S T G  A E++SV       ++     + GC +
Sbjct: 235 VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGH 294

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            N             G+LGL +  +SF SQL  S I + FSYCLV    N    SS + F
Sbjct: 295 SNK-----GLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV-DRTNNLSVSSAISF 348

Query: 158 GTDMGYRRPSTQA--TKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           G      R   Q   T F+   N+   FYYL ++ I ID E +  P + F I  +G GG 
Sbjct: 349 GAGFALSRHFDQMRFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIAPNGSGGT 408

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
           IIDSG+ LTY + D Y  +   F++     +    +D  + + +CY     T   FP+++
Sbjct: 409 IIDSGTTLTYLNRDAYRAVESAFLARISYPR----ADPFDILGICYNATGRTAVPFPTLS 464

Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
             F++ A L +  EN FI  D +     LA+ P D + ++IG+ QQ++  F+YD+    L
Sbjct: 465 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM-SIIGNFQQQNIHFLYDVQHARL 523

Query: 330 SFVKENCS 337
            F   +CS
Sbjct: 524 GFANTDCS 531


>gi|414885970|tpg|DAA61984.1| TPA: hypothetical protein ZEAMMB73_915310 [Zea mays]
          Length = 523

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 154/357 (43%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP + +L++ DTGS L +               +FDP +S+++  + C   +C
Sbjct: 189 IVSVGLGTPRRDLLVVFDTGSDLSWVQCKPCNNCYKQHDPLFDPSQSTTYSAVPCGAQEC 248

Query: 47  T-YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C + +C Y + Y D S T G  A +T+++   G       G +FGC +D+ G   
Sbjct: 249 LDSGTCSSGKCRYEVVYGDMSQTDGNLARDTLTL---GPSSDQLQGFVFGCGDDDTGLF- 304

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
               G   G+ GL R  +S  SQ  +     FSYCL    P+      YL  G+      
Sbjct: 305 ----GRADGLFGLGRDRVSLASQAAARYGAGFSYCL----PSSWRAEGYLSLGSAAA--P 354

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           P  Q T  +   +  +FYYL L  I +    +   P  F        G +IDSG+V+T  
Sbjct: 355 PHAQFTAMVTRSDTPSFYYLDLVGIKVAGRTVRVAPAVFKAP-----GTVIDSGTVITRL 409

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLRI 281
            S  Y  L   F  +  R++ A        +  CY F   T  + PS+A  F+  A L +
Sbjct: 410 PSRAYSALRSSFAGFMRRYKRAPALSI---LDTCYDFTGRTKVQIPSVALLFDGGATLNL 466

Query: 282 D-GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             G  +++ +        A    D  V ++G+ QQ+    VYDL    + F  + CS
Sbjct: 467 GFGGVLYVANRSQACLAFASNGDDTSVGILGNMQQKTFAVVYDLANQKIGFGAKGCS 523


>gi|242050426|ref|XP_002462957.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
 gi|241926334|gb|EER99478.1| hypothetical protein SORBIDRAFT_02g035290 [Sorghum bicolor]
          Length = 452

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 160/366 (43%), Gaps = 45/366 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC-- 46
           L +GTP      I+DTGS L +                ++DP +SS+F K+ C  P C  
Sbjct: 100 LSVGTPPLAFPAIIDTGSDLTWTQCAPCTTACFAQPTPLYDPARSSTFSKLPCASPLCQA 159

Query: 47  ---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDN 100
               +  C    CVY  +YA    T G+ A +T+++          + F G  FGCS  N
Sbjct: 160 LPSAFRACNATGCVYDYRYA-VGFTAGYLAADTLAIGDGDGDGDASSSFAGVAFGCSTAN 218

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G      DGA +G++GL R  +S +SQ+G     RFSYCL     + +  +S + FG  
Sbjct: 219 GG----DMDGA-SGIVGLGRSALSLLSQIG---VGRFSYCL---RSDADAGASPILFGAL 267

Query: 161 MGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                   Q+T  + +P        +YY++L  I++ +  +     TF  T +G GG I+
Sbjct: 268 ANVTGDKVQSTALLRNPVAARRRAPYYYVNLTGIAVGSTDLPVTSSTFGFTAAGAGGVIV 327

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           DSG+  TY     Y  L + F+S      L ++S       LC+         P + F F
Sbjct: 328 DSGTTFTYLAEAGYTMLRQAFLSQTAGL-LTRVSGAQFDFDLCFEAGAADTPVPRLVFRF 386

Query: 275 E-DANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
              A   +  ++ F  +D       L V P    V++IG+  Q D   +YDL+    SF 
Sbjct: 387 AGGAEYAVPRQSYFDAVDEGGRVACLLVLPTRG-VSVIGNVMQMDLHVLYDLDGATFSFA 445

Query: 333 KENCSD 338
             +C+ 
Sbjct: 446 PADCAS 451


>gi|326517992|dbj|BAK07248.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 500

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 105/357 (29%), Positives = 165/357 (46%), Gaps = 45/357 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +G P++ + ++LDTGS + +               ++DP  S+S+  + CD P C  
Sbjct: 166 RVGVGRPARQLYMVLDTGSDVTWLQCQPCADCYAQSDPVYDPSVSTSYATVGCDSPRCRD 225

Query: 49  F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C N    C+Y + Y D S T G  A ET+++   G+   + + A+ GC +DN G 
Sbjct: 226 LDAAACRNSTGSCLYEVAYGDGSYTVGDFATETLTL---GDSAPVSNVAI-GCGHDNEGL 281

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
                    AG+L L    +SF SQ+ +     FSYCLV        +SS L+FG     
Sbjct: 282 FV-----GAAGLLALGGGPLSFPSQISATT---FSYCLV---DRDSPSSSTLQFGDS--- 327

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            +P+  A   I  P  N FYY++L  IS+  E ++ P   F +  +G GG I+DSG+ +T
Sbjct: 328 EQPAVTA-PLIRSPRTNTFYYVALSGISVGGEALSIPSSAFAMDDAGSGGVIVDSGTAVT 386

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDAN-L 279
              S  Y  L E FV   +    A           CY L   +  + P++A +FE    L
Sbjct: 387 RLQSGAYGALREAFVQGTQSLPRASGVSL---FDTCYDLAGRSSVQVPAVALWFEGGGEL 443

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++  +N  I       + LA A     V++IG+ QQ+  R  +D   + + F  + C
Sbjct: 444 KLPAKNYLIPVDAAGTYCLAFAGTSGPVSIIGNVQQQGVRVSFDTAKNTVGFTADKC 500


>gi|326524580|dbj|BAK00673.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 109/376 (28%), Positives = 160/376 (42%), Gaps = 58/376 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP + V  +LDTGS LI+               +F P  SSS+  + C    C
Sbjct: 104 LIDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLAQPDPLFAPAASSSYVPMRCSGQLC 163

Query: 47  T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCSNDNH 101
               +  C   + C Y   Y D + T G  A E  +     GE  ++  G  FGC   N 
Sbjct: 164 NDILHHSCQRPDTCTYRYNYGDGTTTLGVYATERFTFASSSGEKLSVPLG--FGCGTMNV 221

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-- 159
           G   +      +G++G  R  +S +SQL SI  +RFSYCL    P      S L FG+  
Sbjct: 222 GSLNNG-----SGIVGFGRDPLSLVSQL-SI--RRFSYCLT---PYTSTRKSTLMFGSLS 270

Query: 160 -------DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
                  D    +   Q T+ +    N  FYY+    +++   R+  P   F +   G G
Sbjct: 271 DGVFEGDDAATGQ--VQTTRLLQSRQNPTFYYVPFTGVTVGTRRLRIPLSAFALRPDGSG 328

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--------- 261
           G I+DSG+ LT F + V   L E   ++  + +L   S       +C+  P         
Sbjct: 329 GVIVDSGTALTLFPAAV---LTEVLRAFRAQLRLPFTSSSSPDDGVCFATPMAAGGRRAS 385

Query: 262 -ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
             T    P MAF+F+ A+L +   N  + D       + +A   D  A IG+  Q+D R 
Sbjct: 386 AATVVSVPRMAFHFQGADLELPRRNYVLDDPRRGSLCILLADSGDSGATIGNFVQQDMRV 445

Query: 321 VYDLNIDLLSFVKENC 336
           +YDL  + LSF    C
Sbjct: 446 LYDLEAETLSFAPAQC 461


>gi|225427552|ref|XP_002266498.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 441

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 166/359 (46%), Gaps = 34/359 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP   V+ I+DTGS L +                FDP+ SS+++  +C    C
Sbjct: 93  IMNLSIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPFFDPKNSSTYRDSSCGTSFC 152

Query: 47  TYF----KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C N ++C +   YAD S T G  A ET++V         F G  FGC + + 
Sbjct: 153 LALGNDRSCRNGKKCTFMYSYADGSFTGGNLAVETLTVASTAGKPVSFPGFAFGCVHRSG 212

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G FDE +     +G++GL    +S ISQL S I  RFSYCL +P+      SS + FG  
Sbjct: 213 GIFDEHS-----SGIVGLGVAELSMISQLKSTINGRFSYCL-LPVFTDSSMSSRINFGRS 266

Query: 161 MGYRRPSTQATKFI-NHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                  T +T  +   P+ +YYL +L+  S+  +R+++   +    V  EG  I+DSG+
Sbjct: 267 GIVSGAGTVSTPLVMKGPDTYYYLITLEGFSVGKKRLSYKGFSKKAEVE-EGNIIVDSGT 325

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
             TY   + Y KL E   S     +  ++ D      LCY         P +  +F+DAN
Sbjct: 326 TYTYLPLEFYVKLEE---SVAHSIKGKRVRDPNGISSLCYNTTVDQIDAPIITAHFKDAN 382

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           + +   N F+   E+      V P  D + ++G+  Q +    +DL    +SF   +C+
Sbjct: 383 VELQPWNTFLRMQED-LVCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKRVSFKAADCT 439


>gi|302809015|ref|XP_002986201.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
 gi|300146060|gb|EFJ12732.1| hypothetical protein SELMODRAFT_234982 [Selaginella moellendorffii]
          Length = 449

 Score =  125 bits (313), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 115/368 (31%), Positives = 171/368 (46%), Gaps = 45/368 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +F+G P +  LLI+DTGS L +               +FDP +S+SF+ I C+   C   
Sbjct: 91  VFVGNPPRHFLLIIDTGSDLTWLQCKPCKACFDQSGPVFDPSQSTSFKIIPCNAAACDLV 150

Query: 50  ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSN 98
                     K   + C Y   Y D S T G  A E++SV       ++     + GC +
Sbjct: 151 VHDECRDNSSKTSPKTCKYFYWYGDSSRTSGDLALESLSVSLSDHPSSLEIRDMVIGCGH 210

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            N             G+LGL +  +SF SQL  S I + FSYCLV    N    SS + F
Sbjct: 211 SNK-----GLFQGAGGLLGLGQGALSFPSQLRSSPIGQSFSYCLV-DRTNNLSVSSAISF 264

Query: 158 GTDMGYRRPSTQA--TKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           G      R   Q   T F+   N+   FYYL ++ I ID E +  P + F I  +G GG 
Sbjct: 265 GAGFALSRHFDQMKFTPFVRTNNSVETFYYLGIQGIKIDQELLPIPAERFAIATNGSGGT 324

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
           IIDSG+ LTY + D Y  +   F++     +    +D  + + +CY         FP+++
Sbjct: 325 IIDSGTTLTYLNRDAYRAVESAFLARISYPR----ADPFDILGICYNATGRAAVPFPALS 380

Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
             F++ A L +  EN FI  D +     LA+ P D + ++IG+ QQ++  F+YD+    L
Sbjct: 381 IVFQNGAELDLPQENYFIQPDPQEAKHCLAILPTDGM-SIIGNFQQQNIHFLYDVQHARL 439

Query: 330 SFVKENCS 337
            F   +CS
Sbjct: 440 GFANTDCS 447


>gi|357487631|ref|XP_003614103.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515438|gb|AES97061.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 167/351 (47%), Gaps = 35/351 (9%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---Y 48
           +GTP   V  ++DTGS +++               IF+P KSSS++ I C    C    Y
Sbjct: 93  VGTPPFNVYGVVDTGSDIVWLQCKPCEQCYKQTTPIFNPSKSSSYKNIPCSSNLCQSVRY 152

Query: 49  FKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
             C N+Q  C YT+ ++DQS ++G  + ET+++         F   + GC ++N G  + 
Sbjct: 153 TSC-NKQNSCEYTINFSDQSYSQGELSVETLTLDSTTGHSVSFPKTVIGCGHNNRGMFQ- 210

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              G  +G++GL    +S  +QL S I  +FSYCL +PL      +S L FG        
Sbjct: 211 ---GETSGIVGLGIGPVSLTTQLKSSIGGKFSYCL-LPLLVDSNKTSKLNFGDAAVVSGD 266

Query: 167 STQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
              +T F+   P  FYYL+L+  S+ N+R+ F  +  D   S EG  I+DSG+ LT   S
Sbjct: 267 GVVSTPFVKKDPQAFYYLTLEAFSVGNKRIEF--EVLDD--SEEGNIILDSGTTLTLLPS 322

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGEN 285
            VY  L        +  +L ++ D  + + LCY +      FP +  +F+ A+++++  +
Sbjct: 323 HVYTNLESAVA---QLVKLDRVDDPNQLLNLCYSITSDQYDFPIITAHFKGADIKLNPIS 379

Query: 286 VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            F    +    L   +       + G+  Q +    YDL  +++SF   +C
Sbjct: 380 TFAHVADGVVCLAFTSSQTG--PIFGNLAQLNLLVGYDLQQNIVSFKPSDC 428


>gi|414584780|tpg|DAA35351.1| TPA: hypothetical protein ZEAMMB73_696016 [Zea mays]
          Length = 524

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 95/367 (25%), Positives = 159/367 (43%), Gaps = 46/367 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +VR+ +G+P     L++D+GS +++               +FDP  S++F  ++C    C
Sbjct: 172 LVRVSVGSPPTEQYLVVDSGSDVMWVQCKPCLECYVQADPLFDPATSATFSGVSCGSAIC 231

Query: 47  TYF---KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                  C + +   C Y + YAD S TKG  A ET+++     G     G + GC + N
Sbjct: 232 RILPTSACGDGELGGCEYEVSYADGSYTKGALALETLTL-----GGTAVEGVVIGCGHRN 286

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS------- 153
            G    A     AG++GL    +S + QLG  +   FSYCL      G Y S        
Sbjct: 287 RGLFVGA-----AGLMGLGWGPMSLVGQLGGEVGGAFSYCLA---SRGGYGSGAADDDAG 338

Query: 154 YLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           +L  G        +      + +P   +FYY+ L  I + +ER+      F +T  G G 
Sbjct: 339 WLVLGRSEAVPEGAVW-VPLVRNPRAPSFYYVGLSGIEVGDERLPLQAGLFQLTEDGAGD 397

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
            ++D+G+ +T    + Y  L + FV                 +  CY L    + R P++
Sbjct: 398 VVMDTGTTVTRLPQEAYAALRDAFVGALAGAVPRAQGVSSSVLDTCYDLSGYASVRVPTV 457

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +F F+ DA L +   NV +++ +   + LA AP    ++++G+ QQ   +   D     +
Sbjct: 458 SFCFDGDARLILAARNV-LLEVDMGIYCLAFAPSSSGLSIMGNTQQAGIQITVDSANGYI 516

Query: 330 SFVKENC 336
            F   NC
Sbjct: 517 GFGPANC 523


>gi|19424106|gb|AAL87345.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
          Length = 500

 Score =  124 bits (312), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 157/353 (44%), Gaps = 36/353 (10%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +GTP+K + L+LDTGS + +               +F+P  SS+++ + C  P C+ 
Sbjct: 165 RIGVGTPAKDMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C + +C+Y + Y D S T G  A +T++    G+     +    GC +DN G   
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDNEGL-- 278

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    +    FSYCLV         SS L F +      
Sbjct: 279 ------FTGAAGLLGLGGGVLSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGGG 329

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            +T         + FYY+ L   S+  E++  P   FD+  SG GG I+D G+ +T   +
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
             Y  L + F+       L + S        CY F   +  + P++AF+F    +L +  
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447

Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +N  I   ++  F  A AP    +++IG+ QQ+ TR  YDL+ +++      C
Sbjct: 448 KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|293329689|dbj|BAJ04354.1| pollen allergen CPA63 [Cryptomeria japonica]
          Length = 472

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 155/365 (42%), Gaps = 48/365 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +++L  GTP +    +LDTGS + +               F+P KSS++  + C    C 
Sbjct: 125 IIKLGFGTPPQSFYTVLDTGSNIAWIPCNPCSGCSSKQQPFEPSKSSTYNYLTCASQQCQ 184

Query: 48  YFKCVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
             +   +      C  T +Y DQS      + ET+SV     G       +FGCSN   G
Sbjct: 185 LLRVCTKSDNSVNCSLTQRYGDQSEVDEILSSETLSV-----GSQQVENFVFGCSNAARG 239

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
             +         ++G  R  +SF+SQ  ++    FSYCL   L +  +T S L     +G
Sbjct: 240 LIQRT-----PSLVGFGRNPLSFVSQTATLYDSTFSYCLP-SLFSSAFTGSLL-----LG 288

Query: 163 YRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
               S Q  KF    +N     FYY+ L  IS+  E ++ P  T  +  S   G IIDSG
Sbjct: 289 KEALSAQGLKFTPLLSNSRYPSFYYVGLNGISVGEELVSIPAGTLSLDESTGRGTIIDSG 348

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED- 276
           +V+T      Y  + + F S      +A  +D       CY  P     FP +  +F+D 
Sbjct: 349 TVITRLVEPAYNAMRDSFRSQLSNLTMASPTDL---FDTCYNRPSGDVEFPLITLHFDDN 405

Query: 277 ANLRIDGENVFIIDYENHFFL-----LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
            +L +  +N+     ++   L     L     DD+++  G+ QQ+  R V+D+    L  
Sbjct: 406 LDLTLPLDNILYPGNDDGSVLCLAFGLPPGGGDDVLSTFGNYQQQKLRIVHDVAESRLGI 465

Query: 332 VKENC 336
             ENC
Sbjct: 466 ASENC 470


>gi|15229656|ref|NP_188478.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273882|sp|Q9LS40.1|ASPG1_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 1;
           Short=AtASPG1; Flags: Precursor
 gi|11994113|dbj|BAB01116.1| CND41, chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|23297732|gb|AAN13013.1| putative chloroplast nucleoid DNA-binding protein [Arabidopsis
           thaliana]
 gi|332642583|gb|AEE76104.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 500

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 157/353 (44%), Gaps = 36/353 (10%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +GTP+K + L+LDTGS + +               +F+P  SS+++ + C  P C+ 
Sbjct: 165 RIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C + +C+Y + Y D S T G  A +T++    G+     +    GC +DN G   
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INNVALGCGHDNEGL-- 278

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    +    FSYCLV         SS L F +      
Sbjct: 279 ------FTGAAGLLGLGGGVLSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGGG 329

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            +T         + FYY+ L   S+  E++  P   FD+  SG GG I+D G+ +T   +
Sbjct: 330 DATAPLLRNKKIDTFYYVGLSGFSVGGEKVVLPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
             Y  L + F+       L + S        CY F   +  + P++AF+F    +L +  
Sbjct: 390 QAYNSLRDAFLKL--TVNLKKGSSSISLFDTCYDFSSLSTVKVPTVAFHFTGGKSLDLPA 447

Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +N  I   ++  F  A AP    +++IG+ QQ+ TR  YDL+ +++      C
Sbjct: 448 KNYLIPVDDSGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLSKNVIGLSGNKC 500


>gi|449458736|ref|XP_004147103.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449518669|ref|XP_004166359.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 482

 Score =  124 bits (311), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 155/362 (42%), Gaps = 51/362 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  SSSF  ++C    C 
Sbjct: 145 VRIGVGSPPRNQYMVIDSGSDIVWVQCKPCSRCYQQSDPVFDPADSSSFAGVSCGSDVCD 204

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C   +C Y + Y D S TKG  A ET++V     G+ +      GC + N G  
Sbjct: 205 RLENTGCNAGRCRYEVSYGDGSYTKGTLALETLTV-----GQVMIRDVAIGCGHTNQGMF 259

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SFI QLG      FSYCLV     G  ++  L+FG   G  
Sbjct: 260 IGAAGLLGL-----GGGSMSFIGQLGGQTGGAFSYCLV---SRGTGSTGALEFG--RGAL 309

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                    I +P   +FYY+ L  I +   R++ P +TF +T  G  G ++D+G+ +T 
Sbjct: 310 PVGATWISLIRNPRAPSFYYIGLAGIGVGGVRVSVPEETFQLTEYGTNGVVMDTGTAVTR 369

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCYFLPETFN--RFPSMAFYFE 275
           F +  Y    + F         AQ S+ P          CY L   F   R P+++FYF 
Sbjct: 370 FPTAAYVAFRDSFT--------AQTSNLPRAPGVSIFDTCYDL-NGFESVRVPTVSFYFS 420

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           D   L +   N  I       F LA AP    +++IG+ QQ   +  +D     + F   
Sbjct: 421 DGPVLTLPARNFLIPVDGGGTFCLAFAPSPSGLSIIGNIQQEGIQISFDGANGFVGFGPN 480

Query: 335 NC 336
            C
Sbjct: 481 IC 482


>gi|388502342|gb|AFK39237.1| unknown [Lotus japonicus]
          Length = 440

 Score =  124 bits (311), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 164/362 (45%), Gaps = 45/362 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           ++R++IGTPS   L I DTGS L +                 ++DP  SS+F  + CD  
Sbjct: 97  LMRIYIGTPSVERLAIADTGSDLTWVQCSPCDNTKCFAQNTPLYDPLNSSTFTLLPCDSQ 156

Query: 45  DCTY-----FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGCS 97
            CT      + C +   C+Y   Y D S + G  + ++I ++     +  ++  + FGC 
Sbjct: 157 PCTQLPYSQYVCSDYGDCIYAYTYGDNSYSYGGLSSDSIRLMLL---QLHYNSKICFGCG 213

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N  F  D + G   G++GL    +S +SQLG  I  +FSYCL   LP    ++S LKF
Sbjct: 214 FQNK-FTAD-KSGKTTGIVGLGAGPLSLVSQLGDEIGHKFSYCL---LPFSSNSNSKLKF 268

Query: 158 GTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G     +     +T  I  P+  FYYL+L+ I++  + +         T   +G  IIDS
Sbjct: 269 GEAAIVQGNGVVSTPLIIKPDLPFYYLNLEGITVGAKTVK--------TGQTDGNIIIDS 320

Query: 217 GSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           GS LTY     Y     +FVS   E   + +    P P   C+   E  +  P + F+F 
Sbjct: 321 GSTLTYLEESFY----NEFVSLVKETVAVEEDQYIPYPFDFCFTYKEGMSTPPDVVFHFT 376

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
             ++ +   N  ++  +N      V  H D +A+ G+  Q D    YD+    +SF   +
Sbjct: 377 GGDVVLKPMNTLVLIEDNLICSTVVPSHFDGIAIFGNLGQIDFHVGYDIQGGKVSFAPTD 436

Query: 336 CS 337
           CS
Sbjct: 437 CS 438


>gi|125524353|gb|EAY72467.1| hypothetical protein OsI_00323 [Oryza sativa Indica Group]
          Length = 500

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 107/362 (29%), Positives = 161/362 (44%), Gaps = 55/362 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC-- 46
           R+ +G+P++ + ++LDTGS + +               +FDP  S+S+  + CD+P C  
Sbjct: 166 RVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHD 225

Query: 47  -TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C N    C+Y + Y D S T G  A ET+++   G+   +   A+ GC +DN G 
Sbjct: 226 LDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAI-GCGHDNEGL 281

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDM 161
              A      G   LS     F SQ+ +     FSYCLV        +SS L+FG   D 
Sbjct: 282 FVGAAGLLALGGGPLS-----FPSQISATT---FSYCLV---DRDSPSSSTLQFGDAADA 330

Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               P       I  P  + FYY+ L  IS+  + ++ PP  F +  +G GG I+DSG+ 
Sbjct: 331 EVTAP------LIRSPRTSTFYYVGLSGISVGGQILSIPPSAFAMDGTGAGGVIVDSGTA 384

Query: 220 LTYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
           +T   S  Y  L + FV       R     L D       CY L + T    P+++  F 
Sbjct: 385 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFA 438

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               LR+  +N  I       + LA AP +  V++IG+ QQ+ TR  +D     + F   
Sbjct: 439 GGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTSN 498

Query: 335 NC 336
            C
Sbjct: 499 KC 500


>gi|326490862|dbj|BAJ90098.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 447

 Score =  124 bits (310), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 106/363 (29%), Positives = 155/363 (42%), Gaps = 38/363 (10%)

Query: 1   MVRLFIGTP-SKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPD 45
           ++   IGTP  + V L +DTGS +++                FD   S +   + C  P 
Sbjct: 93  LIHFGIGTPRPQQVALEVDTGSDVVWTQCRPCFDCFTQPLPRFDTSASDTVHGVLCTDPI 152

Query: 46  CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C   +   C    C Y + Y D SVT G  A ++ +  GKG GK      +FGC   N G
Sbjct: 153 CRALRPHACFLGGCTYQVNYGDNSVTIGQLAKDSFTFDGKGGGKVTVPDLVFGCGQYNTG 212

Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
            F  +       G+ G  R  +S   QLG      FSYC    +   + T  +L      
Sbjct: 213 NFHSNE-----TGIAGFGRGPLSLPRQLGV---SSFSYCFTT-IFESKSTPVFLGGAPAD 263

Query: 162 GYRRPSTQ---ATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G R  +T    +T F+ NHP  +YYLSLK I++   R+  P   F +   G GG IIDSG
Sbjct: 264 GLRAHATGPILSTPFLPNHPE-YYYLSLKGITVGKTRLAVPESAFVVKADGSGGTIIDSG 322

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFN-RFPSMAFY 273
           + +T F   V+  L E FV+          +D  EP   C+    +P+      P M  +
Sbjct: 323 TAITAFPRAVFRSLWEAFVAQVP-LPHTSYNDTGEPTLQCFSTESVPDASKVPVPKMTLH 381

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
            E A+  +  EN      ++    + V   DD   +IG+ QQ++   V+DL  + L    
Sbjct: 382 LEGADWELPRENYMAEYPDSDQLCVVVLAGDDDRTMIGNFQQQNMHIVHDLAGNKLVIEP 441

Query: 334 ENC 336
             C
Sbjct: 442 AQC 444


>gi|115434442|ref|NP_001041979.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|12328547|dbj|BAB21205.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113531510|dbj|BAF03893.1| Os01g0140100 [Oryza sativa Japonica Group]
 gi|125568961|gb|EAZ10476.1| hypothetical protein OsJ_00309 [Oryza sativa Japonica Group]
 gi|215697206|dbj|BAG91200.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 504

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 161/362 (44%), Gaps = 55/362 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC-- 46
           R+ +G+P++ + ++LDTGS + +               +FDP  S+S+  + CD+P C  
Sbjct: 170 RVGVGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSTSYASVACDNPRCHD 229

Query: 47  -TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C N    C+Y + Y D S T G  A ET+++   G+   +   A+ GC +DN G 
Sbjct: 230 LDAAACRNSTGACLYEVAYGDGSYTVGDFATETLTL---GDSAPVSSVAI-GCGHDNEGL 285

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDM 161
              A      G   LS     F SQ+ +     FSYCLV        +SS L+FG   D 
Sbjct: 286 FVGAAGLLALGGGPLS-----FPSQISATT---FSYCLV---DRDSPSSSTLQFGDAADA 334

Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               P       I  P  + FYY+ L  +S+  + ++ PP  F +  +G GG I+DSG+ 
Sbjct: 335 EVTAP------LIRSPRTSTFYYVGLSGLSVGGQILSIPPSAFAMDSTGAGGVIVDSGTA 388

Query: 220 LTYFHSDVYWKLHEKFV---SYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
           +T   S  Y  L + FV       R     L D       CY L + T    P+++  F 
Sbjct: 389 VTRLQSSAYAALRDAFVRGTQSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFA 442

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               LR+  +N  I       + LA AP +  V++IG+ QQ+ TR  +D     + F   
Sbjct: 443 GGGELRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTAKSTVGFTTN 502

Query: 335 NC 336
            C
Sbjct: 503 KC 504


>gi|125558629|gb|EAZ04165.1| hypothetical protein OsI_26307 [Oryza sativa Indica Group]
          Length = 455

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 167/387 (43%), Gaps = 77/387 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPD 45
           + + +GTP     +I+DTGS LI+A                +  P +SS+F ++ C+   
Sbjct: 93  MNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSF 152

Query: 46  CTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C Y               C Y   Y     T G+ A ET++V     G   F    FGCS
Sbjct: 153 CQYLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATETLTV-----GDGTFPKVAFGCS 206

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G D  +      G++GL R  +S +SQL      RFSYCL   + +G   +S + F
Sbjct: 207 TEN-GVDNSS------GIVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGG--ASPILF 254

Query: 158 GTDMGY-RRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGG 211
           G+      R   Q+T  + +P    +  YY++L  I++D+  +     TF  T +G  GG
Sbjct: 255 GSLAKLTERSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGG 314

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN----- 265
            I+DSG+ LTY   D Y  + + F S      Q    S  P  + LCY  P         
Sbjct: 315 TIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY-KPSAGGGGKAV 373

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFF--------------LLAVAPHDDL-VALI 310
           R P +A       LR  G   + +  +N+F               LL +   DDL +++I
Sbjct: 374 RVPRLA-------LRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISII 426

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCS 337
           G+  Q D   +YD++  + SF   +C+
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453


>gi|224126751|ref|XP_002329464.1| predicted protein [Populus trichocarpa]
 gi|222870144|gb|EEF07275.1| predicted protein [Populus trichocarpa]
          Length = 439

 Score =  123 bits (309), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 108/362 (29%), Positives = 172/362 (47%), Gaps = 45/362 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP +    I+DTGS LI+               IFDP+KSSSF K++C    C
Sbjct: 98  LMKLAIGTPPETYSAIMDTGSDLIWTQCKPCTQCFDQPTPIFDPKKSSSFSKLSCSSKLC 157

Query: 47  TYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-- 102
                   ++ C Y   Y D S T+G  A ET++      GK       FGC  DN G  
Sbjct: 158 EALPQSTCSDGCEYLYGYGDYSSTQGMLASETLTF-----GKVSVPEVAFGCGEDNEGSG 212

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
           F + +      G++GL R  +S +SQL    + +FSYCL       +  +S L  G+   
Sbjct: 213 FSQGS------GLVGLGRGPLSLVSQLK---EPKFSYCLT---SVDDTKASTLLMGSLAS 260

Query: 163 YRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            +   ++   T  I +    +FYYLSL+ IS+ +  +     TF +   G GG IIDSG+
Sbjct: 261 VKASDSEIKTTPLIQNSAQPSFYYLSLEGISVGDTSLPIKKSTFSLQEDGSGGLIIDSGT 320

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
            +TY     +  + ++F S   +  L   +     +++C+ LP   T    P + F+F+ 
Sbjct: 321 TITYLEQSAFDLVAKEFTS---QINLPVDNSGSTGLEVCFTLPSGSTDIEVPKLVFHFDG 377

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L +  EN  I D       LA+     + ++ G+ QQ++   ++DL  + LSF+   C
Sbjct: 378 ADLELPAENYMIADASMGVACLAMGSSSGM-SIFGNIQQQNMLVLHDLEKETLSFLPTQC 436

Query: 337 SD 338
            +
Sbjct: 437 DE 438


>gi|414584783|tpg|DAA35354.1| TPA: hypothetical protein ZEAMMB73_186928 [Zea mays]
          Length = 464

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 153/359 (42%), Gaps = 47/359 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ IG+P     L++D+GS +I+               +FDP  S++F  + C    C 
Sbjct: 129 VRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPATSATFSAVPCGSAVCR 188

Query: 48  YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +      +  C Y + Y D S TKG  A ET+++     G     G   GC + N G 
Sbjct: 189 TLRTSGCGDSGGCDYEVSYGDGSYTKGALALETLTL-----GGTAVEGVAIGCGHRNRGL 243

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A     AG+LGL    +S + QLG      FSYCL          +  L  G     
Sbjct: 244 FVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLA------SRGAGSLVLGRSEAV 292

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
              +      + +P   +FYY+ L  I + +ER+    D F +T  G GG ++D+G+ +T
Sbjct: 293 PEGAVW-VPLVRNPQAPSFYYVGLSGIGVGDERLPLQEDLFQLTEDGAGGVVMDTGTAVT 351

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYFED-A 277
               + Y  L + FV+      +  L   P    +  CY L   T  R P+++FYF+  A
Sbjct: 352 RLPQEAYAALRDAFVA-----AVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFDGAA 406

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            L +   N  +++ +   + LA AP     +++G+ QQ   +   D     + F    C
Sbjct: 407 TLTLPARN-LLLEVDGGIYCLAFAPSSSGPSILGNIQQEGIQITVDSANGYIGFGPTTC 464


>gi|223949775|gb|ACN28971.1| unknown [Zea mays]
 gi|414590177|tpg|DAA40748.1| TPA: hypothetical protein ZEAMMB73_257146 [Zea mays]
          Length = 510

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 112/376 (29%), Positives = 171/376 (45%), Gaps = 54/376 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ +++GTP +   +I+DTGS L +               +FDP  SSS++ + C    C
Sbjct: 150 LIDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDQRC 209

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
                        +   + C Y   Y DQS T G  A E  T+++   G  + +  G +F
Sbjct: 210 GLVAPPEAPRACRRPAEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV-DGVVF 268

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G          AG+LGL R  +SF SQL ++    FSYCLV    +G    S 
Sbjct: 269 GCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV---EHGSDAGSK 320

Query: 155 LKFGTD-MGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
           + FG D +    P  + T F    +  + FYY+ LK + +  + +N   DT+D+   G G
Sbjct: 321 VVFGEDYLVLAHPQLKYTAFAPTSSPADTFYYVKLKGVLVGGDLLNISSDTWDVGKDGSG 380

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-----PETFN 265
           G IIDSG+ L+YF    Y  + + FV    R     + D P  +  CY +     PE   
Sbjct: 381 GTIIDSGTTLSYFVEPAYQVIRQAFVDLMSRL-YPLIPDFPV-LNPCYNVSGVERPEV-- 436

Query: 266 RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVY 322
             P ++  F D A      EN F+    +    LAV   P   + ++IG+ QQ++   VY
Sbjct: 437 --PELSLLFADGAVWDFPAENYFVRLDPDGIMCLAVRGTPRTGM-SIIGNFQQQNFHVVY 493

Query: 323 DLNIDLLSFVKENCSD 338
           DL  + L F    C++
Sbjct: 494 DLQNNRLGFAPRRCAE 509


>gi|224067990|ref|XP_002302634.1| predicted protein [Populus trichocarpa]
 gi|222844360|gb|EEE81907.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/361 (28%), Positives = 161/361 (44%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP++ V ++LDTGS +++               +F+P KS SF  I C  P C 
Sbjct: 149 TRLGVGTPARYVFMVLDTGSDVVWIQCAPCKKCYSQTDPVFNPTKSRSFANIPCGSPLCR 208

Query: 48  YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                 C  ++  C+Y + Y D S T G  + ET++  G   G+        GC +DN G
Sbjct: 209 RLDSPGCSTKKHICLYQVSYGDGSFTYGEFSTETLTFRGTRVGRVAL-----GCGHDNEG 263

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF SQ+G    ++FSYCLV    +     SY+ FG    
Sbjct: 264 LFIGAAGLLGL-----GRGRLSFPSQIGRRFSRKFSYCLVD--RSASSKPSYMVFGDSAI 316

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R  + + T  +++P  + FYY+ L  +S+   R+       F +  +G GG IIDSG+ 
Sbjct: 317 SR--TARFTPLVSNPKLDTFYYVELLGVSVGGTRVPGITASLFKLDSTGNGGVIIDSGTS 374

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
           +T      Y  L + F     R   + L   PE      C+ L  +T  + P++  +F  
Sbjct: 375 VTRLTRPAYVALRDAF-----RVGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVLHFRG 429

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    +  F  A A     ++++G+ QQ+  R VYDL    + F    C
Sbjct: 430 ADVSLPASNYLIPVDNSGSFCFAFAGTMSGLSIVGNIQQQGFRVVYDLAASRVGFAPRGC 489

Query: 337 S 337
           +
Sbjct: 490 A 490


>gi|90399033|emb|CAJ86229.1| H0402C08.5 [Oryza sativa Indica Group]
 gi|125550227|gb|EAY96049.1| hypothetical protein OsI_17922 [Oryza sativa Indica Group]
          Length = 473

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 149/362 (41%), Gaps = 47/362 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P     L++D+GS +I+               +FDP  SSSF  ++C    C 
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191

Query: 48  YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                         +C Y++ Y D S TKG  A ET+++     G     G   GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     AG+LGL    +S I QLG      FSYCL      G   +  L  G  
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLIGQLGGAAGGVFSYCLA---SRGAGGAGSLVLGRT 298

Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 +        N  ++FYY+ L  I +  ER+      F +T  G GG ++D+G+ 
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDGLFQLTEDGAGGVVMDTGTA 358

Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
           +T    + Y  L   F   +    R     L D       CY L    + R P+++FYF 
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFD 412

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + A L +   N  +++     F LA AP    ++++G+ QQ   +   D     + F   
Sbjct: 413 QGAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471

Query: 335 NC 336
            C
Sbjct: 472 TC 473


>gi|242050430|ref|XP_002462959.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
 gi|241926336|gb|EER99480.1| hypothetical protein SORBIDRAFT_02g035310 [Sorghum bicolor]
          Length = 448

 Score =  123 bits (308), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 102/376 (27%), Positives = 162/376 (43%), Gaps = 61/376 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
           ++ L IGTP      I DTGS LI+                 +++P  S++F  + C+  
Sbjct: 93  LMTLSIGTPPLSYPAIADTGSDLIWTQCAPCSGDQCFAQPAPLYNPASSTTFGVLPCNSS 152

Query: 44  --------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF 89
                         P C         C+Y   Y     T G    ET +       +A  
Sbjct: 153 LSMCAGVLAGKAPPPGCA--------CMYNQTYG-TGWTAGVQGSETFTFGSAAADQARV 203

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
            G  FGCSN +   D +      AG++GL R ++S +SQLG+    RFSYCL  P  +  
Sbjct: 204 PGIAFGCSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTN 254

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDI 204
            TS+ L  G          ++T F+  P     + +YYL+L  IS+  + ++  PD F +
Sbjct: 255 STSTLL-LGPSAALNGTGVRSTPFVASPAKAPMSTYYYLNLTGISLGAKALSISPDAFSL 313

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
              G GG IIDSG+ +T   +  Y ++    V           SD    + LCY LP   
Sbjct: 314 KADGTGGLIIDSGTTITSLVNAAYQQVRAA-VQSLVTLPAIDGSDS-TGLDLCYALPTPT 371

Query: 265 N---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           +     PSM  +F+ A++ +  ++ ++I     + L      D  ++  G+ QQ++   +
Sbjct: 372 SAPPAMPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHIL 430

Query: 322 YDLNIDLLSFVKENCS 337
           YD+  ++LSF    CS
Sbjct: 431 YDVRNEMLSFAPAKCS 446


>gi|50508279|dbj|BAD32128.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|125600536|gb|EAZ40112.1| hypothetical protein OsJ_24555 [Oryza sativa Japonica Group]
          Length = 455

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/387 (28%), Positives = 167/387 (43%), Gaps = 77/387 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPD 45
           + + +GTP     +I+DTGS LI+A                +  P +SS+F ++ C+   
Sbjct: 93  MNISLGTPPLDFPVIVDTGSNLIWAQCAPCTRCFPRPTPAPVLQPARSSTFSRLPCNGSF 152

Query: 46  CTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C Y               C Y   Y     T G+ A ET++V     G   F    FGCS
Sbjct: 153 CQYLPTSSRPRTCNATAACAYNYTYGS-GYTAGYLATETLTV-----GDGTFPKVAFGCS 206

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G D  +      G++GL R  +S +SQL      RFSYCL   + +G   +S + F
Sbjct: 207 TEN-GVDNSS------GIVGLGRGPLSLVSQLA---VGRFSYCLRSDMADGG--ASPILF 254

Query: 158 GTDMGYRRPST-QATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGG 211
           G+       S  Q+T  + +P    +  YY++L  I++D+  +     TF  T +G  GG
Sbjct: 255 GSLAKLTEGSVVQSTPLLKNPYLQRSTHYYVNLTGIAVDSTELPVTGSTFGFTQTGLGGG 314

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN----- 265
            I+DSG+ LTY   D Y  + + F S      Q    S  P  + LCY  P         
Sbjct: 315 TIVDSGTTLTYLAKDGYAMVKQAFQSQMANLNQTTPASGAPYDLDLCY-KPSAGGGGKAV 373

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFF--------------LLAVAPHDDL-VALI 310
           R P +A       LR  G   + +  +N+F               LL +   DDL +++I
Sbjct: 374 RVPRLA-------LRFAGGAKYNVPVQNYFAGVEADSQGRVTVACLLVLPATDDLPISII 426

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCS 337
           G+  Q D   +YD++  + SF   +C+
Sbjct: 427 GNLMQMDMHLLYDIDGGMFSFAPADCA 453


>gi|362799904|dbj|BAL41445.1| aspartyl protease 1 [Linum grandiflorum]
          Length = 449

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/381 (28%), Positives = 166/381 (43%), Gaps = 58/381 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           M+ L IGTP   +L I DTGS L +               IFDP  S++F K+ C    C
Sbjct: 81  MMNLSIGTPPFPILAIADTGSDLTWLQSKPCDQCYPQKGPIFDPSNSTTFHKLPCTTAPC 140

Query: 47  TYF-----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                    C +   C YT  Y D S T G+ A +T++V   G          FGC   N
Sbjct: 141 NALDESARSCTDPTTCGYTYSYGDHSYTTGYLASDTVTV---GNASVQIRNVAFGCGTRN 197

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-------PNGEYTS 152
            G FDE        G   LS     F+SQLG  I K+FSYCL +PL       P+    +
Sbjct: 198 GGNFDEQGSGIVGLGGGNLS-----FVSQLGDTIGKKFSYCL-LPLENEISSQPSDSPAT 251

Query: 153 SYLKFGTDMGYRRPSTQA-----TKFIN-HPNNFYYLSLKDISIDNERMNFPP------- 199
           S + FG +  +   ST       T  +N  P+ +YYL+++ I++  +++ +         
Sbjct: 252 SRIVFGDNPVFSSSSTNGVVFATTPLVNKEPSTYYYLTIEAITVGRKKLLYSSSSSKTAS 311

Query: 200 -DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLC 257
            D+   +   EG  IIDSG+ LT+   + Y  L    V   E  ++ +++D    +  LC
Sbjct: 312 YDSGSKSSVEEGNIIIDSGTTLTFLEEEFYGALEAALV---EEIKMERVNDVKNSMFSLC 368

Query: 258 YFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
           +   +     P M  +F   A++ +   N F +  E       + P +D V + G+  Q 
Sbjct: 369 FKSGKEEVELPLMKVHFRGGADVELKPVNTF-VRAEEGLVCFTMLPTND-VGIYGNLAQM 426

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +    YDL    +SF+  +CS
Sbjct: 427 NFVVGYDLGKRTVSFLPADCS 447


>gi|357124468|ref|XP_003563922.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 450

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 159/365 (43%), Gaps = 51/365 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + RL +GTP+   ++++D+GS+L +                ++DPR SS++  + C  P 
Sbjct: 109 ITRLGLGTPTTTYVMVVDSGSSLTWLQCAPCAVSCHPQAGPLYDPRASSTYAAVPCSAPQ 168

Query: 46  CTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C   +              C Y   Y D S + G+ + +T+S+   G     F G  +GC
Sbjct: 169 CAELQAATLNPSSCSGSGVCQYQASYGDGSFSFGYLSKDTVSLSSSGS----FPGFYYGC 224

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S +SQL   +   F+YCL     +   ++ YL 
Sbjct: 225 GQDNVGLF-----GRAAGLIGLARNKLSLLSQLAPSVGNSFAYCLPT---SAAASAGYLS 276

Query: 157 FGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
           FG++   + P   + T  ++     + Y++SL  +S+    +  P   +     G    I
Sbjct: 277 FGSNSDNKNPGKYSYTSMVSSSLDASLYFVSLAGMSVAGSPLAVPSSEY-----GSLPTI 331

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
           IDSG+V+T   + VY  L +   +          S     +Q C+         P++   
Sbjct: 332 IDSGTVITRLPTPVYTALSKAVGAALAAPSAPAYSI----LQTCFKGQVAKLPVPAVNMA 387

Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F   A LR+   NV ++D       LA AP D   A+IG+ QQ+    VYD+    + F 
Sbjct: 388 FAGGATLRLTPGNV-LVDVNETTTCLAFAPTDS-TAIIGNTQQQTFSVVYDVKGSRIGFA 445

Query: 333 KENCS 337
              CS
Sbjct: 446 AGGCS 450


>gi|125592062|gb|EAZ32412.1| hypothetical protein OsJ_16623 [Oryza sativa Japonica Group]
          Length = 473

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 97/362 (26%), Positives = 149/362 (41%), Gaps = 47/362 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P     L++D+GS +I+               +FDP  SSSF  ++C    C 
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191

Query: 48  YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                         +C Y++ Y D S TKG  A ET+++     G     G   GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     AG+LGL    +S + QLG      FSYCL      G   +  L  G  
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA---SRGAGGAGSLVLGRT 298

Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 +        N  ++FYY+ L  I +  ER+      F +T  G GG ++D+G+ 
Sbjct: 299 EAVPVGAVWVPLVRNNQASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTA 358

Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
           +T    + Y  L   F   +    R     L D       CY L    + R P+++FYF 
Sbjct: 359 VTRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFD 412

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + A L +   N  +++     F LA AP    ++++G+ QQ   +   D     + F   
Sbjct: 413 QGAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPN 471

Query: 335 NC 336
            C
Sbjct: 472 TC 473


>gi|357117301|ref|XP_003560410.1| PREDICTED: uncharacterized protein LOC100833752 [Brachypodium
           distachyon]
          Length = 473

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 96/339 (28%), Positives = 154/339 (45%), Gaps = 35/339 (10%)

Query: 27  IFDPRKSSSFQKIN------CDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI 80
           +FDP KS +F+ ++      C  P   Y    + +C + + Y + +   G+ A +T S  
Sbjct: 144 VFDPAKSPTFRPVSGHNAVLCRPP---YHPLQDGRCGFGIAYRNGASAAGYLARDTFSFP 200

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLS-----RVTISFISQLGSIIKK 135
                     G +FGC+N    FD     GALAGVLG+      +    F+ QL      
Sbjct: 201 TGDNNFQHLPGIVFGCANRIARFDTH---GALAGVLGMGMGAEGKPLTGFMRQLYHNGGG 257

Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS-----TQATKFINHPNNFYYLSLKDISI 190
           RFSYC ++P   G    S+L+FG D+  + P+     + A       +  YY+ L  IS+
Sbjct: 258 RFSYCPIVP---GTTAYSFLRFGNDIPSQPPAGVHRQSMAVLAPTTTSEAYYVKLAGISV 314

Query: 191 DNERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
              R+    P+ F+    G GGC ID G+ +T      Y  +      + +R + A+   
Sbjct: 315 GALRVPGVTPEMFERDQHGRGGCAIDIGTKMTAIVQTAYAHVEAAVRGHLQRNR-ARFVQ 373

Query: 250 CPEPIQLCYFLPETFNRFPSMAFYFEDAN-LRIDGENVFII----DYENHFFLLAVAPHD 304
            P      +  P    R PSM  +F     LR+  +++F++         +  L + P D
Sbjct: 374 SPGHHLCVHRTPAIEERLPSMTLHFVGGPWLRVKPQHLFLVVGSPTGGGEYLCLGLVP-D 432

Query: 305 DLVALIGSQQQRDTRFVYDL--NIDLLSFVKENCSDDSA 341
             + +IG+ QQ DTRF++DL  NI ++SF  E+C  D+ 
Sbjct: 433 AEMTVIGAMQQIDTRFIFDLHNNIPIVSFNPEDCHLDAG 471


>gi|317106730|dbj|BAJ53226.1| JHL06P13.6 [Jatropha curcas]
          Length = 445

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 109/368 (29%), Positives = 166/368 (45%), Gaps = 47/368 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           +R+ IGTP   VL+I DTGS LI+               IF+P++SS+++++ C+   C 
Sbjct: 96  MRISIGTPPIEVLVIADTGSDLIWVQCQPCQECYKQKSPIFNPKQSSTYRRVLCETRYCN 155

Query: 48  YFK-----CVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
                   C      + C Y+  Y D S T G+ A E   +   G          FGC N
Sbjct: 156 ALNSDMRACSAHGFFKACGYSYSYGDHSFTMGYLATERFII---GSTNNSIQELAFGCGN 212

Query: 99  DNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            N G FDE                ++S ISQLG+ I  +FSYCLV  L    ++   + F
Sbjct: 213 SNGGNFDEVGSGIVGL-----GGGSLSLISQLGTKIDNKFSYCLVPILEKSNFSLGKIVF 267

Query: 158 GTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G +       T  +  +    P  FYYL+L+ IS+ NER+ +     D  V  +G  IID
Sbjct: 268 GDNSFISGSDTYVSTPLVSKEPETFYYLTLEAISVGNERLAYENSRNDGNVE-KGNIIID 326

Query: 216 SGSVLTYFHSDVYWKLH---EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           SG+ LT+  S +Y KL    EK V      +  ++SD      +C F  +     P +  
Sbjct: 327 SGTTLTFLDSKLYNKLELVLEKAV------EGERVSDPNGIFSIC-FRDKIGIELPIITV 379

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F DA++ +   N F    E       + P +  +A+ G+  Q +    YDL+ + +SF+
Sbjct: 380 HFTDADVELKPINTF-AKAEEDLLCFTMIPSNG-IAIFGNLAQMNFLVGYDLDKNCVSFM 437

Query: 333 KENCSDDS 340
             +CS  S
Sbjct: 438 PTDCSGHS 445


>gi|297814776|ref|XP_002875271.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297321109|gb|EFH51530.1| hypothetical protein ARALYDRAFT_484331 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 451

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 108/378 (28%), Positives = 162/378 (42%), Gaps = 55/378 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V L IG P + +LLI DTGS L++                +F PR SS+F   +C  P C
Sbjct: 85  VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 144

Query: 47  TYF-------KC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                     +C    ++  C Y   YAD S+T G  A ET S+      +A      FG
Sbjct: 145 RLVPKPGRAPRCNHTRIHSTCPYEYGYADGSLTSGLFARETTSLKTSSGKEAKLKSVAFG 204

Query: 96  CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
           C     G      + +GA  GV+GL R  ISF SQLG     +FSYCL+      +YT  
Sbjct: 205 CGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLM------DYTLS 257

Query: 152 ---SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITV 206
              +SYL  G D G        T  + +P    FYY+ LK + ++  ++   P  ++I  
Sbjct: 258 PPPTSYLIIG-DGGDAVSKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDD 316

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-----P 261
           SG GG ++DSG+ L +     Y  +     +  +R +L    +      LC  +     P
Sbjct: 317 SGNGGTVMDSGTTLAFLADPAYRLV---IAAVKQRIKLPNADELTPGFDLCVNVSGVTKP 373

Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTR 319
           E     P + F F    + +     + I+ E     LA+   D  V  ++IG+  Q+   
Sbjct: 374 EKI--LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGFL 431

Query: 320 FVYDLNIDLLSFVKENCS 337
           F +D +   L F +  C+
Sbjct: 432 FEFDRDRSRLGFSRRGCA 449


>gi|115452683|ref|NP_001049942.1| Os03g0317300 [Oryza sativa Japonica Group]
 gi|108707833|gb|ABF95628.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548413|dbj|BAF11856.1| Os03g0317300 [Oryza sativa Japonica Group]
          Length = 448

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
           +V + IGTP + V LILDTGS L +                F+P +S +F  + CD   C
Sbjct: 86  LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 145

Query: 47  ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
              T+  C  +      CVY   YAD S+T G    +T S        G A      FGC
Sbjct: 146 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 205

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
              N+G           G+ G SR  +S  +QL       FSYC    +   E +  +L 
Sbjct: 206 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 257

Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
              ++     G      Q+T  I + ++    YY+SLK +++   R+  P   F +   G
Sbjct: 258 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 317

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
            GG I+DSG+ +T     VY  + + FV+   + +L   +      QLC+ +P       
Sbjct: 318 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 374

Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
           P++  +FE A L +  EN +F I+      L  LA+   +DL ++IG+ QQ++   +YDL
Sbjct: 375 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 433

Query: 325 NIDLLSFVKENCS 337
             D+LSFV   C+
Sbjct: 434 ANDMLSFVPARCN 446


>gi|54290728|dbj|BAD62398.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 486

 Score =  122 bits (306), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 109/364 (29%), Positives = 164/364 (45%), Gaps = 50/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
           +V + +GTP++   LI DTGS L +                  +FDP KSS++  ++C  
Sbjct: 145 VVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCGE 204

Query: 44  PDCTYFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
           P C     +    N  C+Y ++Y D S T G  + +T+++           G  FGC   
Sbjct: 205 PQCAAAGDLCSEDNTTCLYLVRYGDGSSTTGVLSRDTLALTSS----RALTGFPFGCGTR 260

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G + G+LGL R  +S  SQ  +     FSYCL    P+   T+ YL  G 
Sbjct: 261 NLG-----DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL----PSSNSTTGYLTIGA 311

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
                  + Q T  +  P   +FY++ L  I I    +  PP  F       GG ++DSG
Sbjct: 312 TPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYVLPVPPAVFT-----RGGTLLDSG 366

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
           +VLTY  +  Y  L ++F    ER+  A  +D    +  CY F  E+    P+++F F D
Sbjct: 367 TVLTYLPAQAYALLRDRFRLTMERYTPAPPNDV---LDACYDFAGESEVVVPAVSFRFGD 423

Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFV 332
            A   +D   V I   EN    LA A  D     +++IG+ QQR    +YD+  + + FV
Sbjct: 424 GAVFELDFFGVMIFLDEN-VGCLAFAAMDTGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 482

Query: 333 KENC 336
             +C
Sbjct: 483 PASC 486


>gi|125543634|gb|EAY89773.1| hypothetical protein OsI_11315 [Oryza sativa Indica Group]
          Length = 474

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
           +V + IGTP + V LILDTGS L +                F+P +S +F  + CD   C
Sbjct: 112 LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 171

Query: 47  ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
              T+  C  +      CVY   YAD S+T G    +T S        G A      FGC
Sbjct: 172 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 231

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
              N+G           G+ G SR  +S  +QL       FSYC    +   E +  +L 
Sbjct: 232 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 283

Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
              ++     G      Q+T  I + ++    YY+SLK +++   R+  P   F +   G
Sbjct: 284 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 343

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
            GG I+DSG+ +T     VY  + + FV+   + +L   +      QLC+ +P       
Sbjct: 344 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 400

Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
           P++  +FE A L +  EN +F I+      L  LA+   +DL ++IG+ QQ++   +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 459

Query: 325 NIDLLSFVKENCS 337
             D+LSFV   C+
Sbjct: 460 ANDMLSFVPARCN 472


>gi|242058537|ref|XP_002458414.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
 gi|241930389|gb|EES03534.1| hypothetical protein SORBIDRAFT_03g033075 [Sorghum bicolor]
          Length = 448

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 158/366 (43%), Gaps = 45/366 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---- 47
           +G P    L+++DTGS LI+               ++DPR SS+ ++I C  P C     
Sbjct: 94  VGDPPTRALVVIDTGSDLIWLQCVPCRHCYRQVTPLYDPRSSSTHRRIPCASPRCRDVLR 153

Query: 48  YFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
           Y  C      CVY + Y D S + G  A + +            H    GC +DN G  E
Sbjct: 154 YPGCDARTGGCVYMVVYGDGSASSGDLATDRLVF----PDDTHVHNVTLGCGHDNVGLLE 209

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A     AG+LG+ R  +SF +QL       FSYCL   L   +  SSYL FG       
Sbjct: 210 SA-----AGLLGVGRGQLSFPTQLAPAYGHVFSYCLGDRLSRAQNGSSYLVFGRTP--EP 262

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDIT-VSGEGGCIIDSGSVLT 221
           PST  T    +P   + YY+ +   S+  ER+  F   +  +   +G GG ++DSG+ ++
Sbjct: 263 PSTAFTPLRTNPRRPSLYYVDMVGFSVGGERVTGFSNASLALNPATGRGGIVVDSGTAIS 322

Query: 222 YFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFL-----PETFNRFPSMAFYFE 275
            F  D Y  + + F S+      + +L+        CY L     P    R PS+  +F 
Sbjct: 323 RFARDAYAAVRDAFDSHAAAAGTMRKLATKFSVFDACYDLRGNGAPAAAVRVPSIVLHFA 382

Query: 276 -DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
             A++ +   N  I         +F L +   DD + ++G+ QQ+    V+D+    + F
Sbjct: 383 GGADMALPQANYLIPVQGGDRRTYFCLGLQAADDGLNVLGNVQQQGFGLVFDVERGRIGF 442

Query: 332 VKENCS 337
               CS
Sbjct: 443 TPNGCS 448


>gi|115445765|ref|NP_001046662.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|46391044|dbj|BAD15987.1| putative chloroplast nucleoid DNA binding protein [Oryza sativa
           Japonica Group]
 gi|113536193|dbj|BAF08576.1| Os02g0314600 [Oryza sativa Japonica Group]
 gi|125581836|gb|EAZ22767.1| hypothetical protein OsJ_06441 [Oryza sativa Japonica Group]
 gi|215697168|dbj|BAG91162.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 514

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 165/372 (44%), Gaps = 45/372 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L++GTP +   +I+DTGS L +               +FDP  S S++ + C  P C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASLSYRNVTCGDPRC 212

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
                        +  ++ C Y   Y DQS T G  A E  T+++   G  + +    +F
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV-DDVVF 271

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G    A            R  +SF SQL ++    FSYCLV    +G    S 
Sbjct: 272 GCGHSNRGLFHGAAGLLGL-----GRGALSFASQLRAVYGHAFSYCLV---DHGSSVGSK 323

Query: 155 LKFGTD---MGYRRP--STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           + FG D   +G+ R   +  A       + FYY+ LK + +  E++N  P T+D+   G 
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
           GG IIDSG+ L+YF    Y  +   FV   ++     ++D P  +  CY +        P
Sbjct: 384 GGTIIDSGTTLSYFAEPAYEVIRRAFVERMDK-AYPLVADFPV-LSPCYNVSGVERVEVP 441

Query: 269 SMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
             +  F D A      EN F+ +D +    L  +      +++IG+ QQ++   +YDL  
Sbjct: 442 EFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501

Query: 327 DLLSFVKENCSD 338
           + L F    C++
Sbjct: 502 NRLGFAPRRCAE 513


>gi|125586057|gb|EAZ26721.1| hypothetical protein OsJ_10629 [Oryza sativa Japonica Group]
          Length = 474

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 110/373 (29%), Positives = 168/373 (45%), Gaps = 48/373 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
           +V + IGTP + V LILDTGS L +                F+P +S +F  + CD   C
Sbjct: 112 LVHMAIGTPPQPVQLILDTGSDLTWTQCAPCVSCFRQSLPRFNPSRSMTFSVLPCDLRIC 171

Query: 47  ---TYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGC 96
              T+  C  +      CVY   YAD S+T G    +T S        G A      FGC
Sbjct: 172 RDLTWSSCGEQSWGNGICVYAYAYADHSITTGHLDSDTFSFASADHAIGGASVPDLTFGC 231

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
              N+G           G+ G SR  +S  +QL       FSYC    +   E +  +L 
Sbjct: 232 GLFNNGIFVSNE----TGIAGFSRGALSMPAQLK---VDNFSYCFTA-ITGSEPSPVFLG 283

Query: 157 FGTDM-----GYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSG 208
              ++     G      Q+T  I + ++    YY+SLK +++   R+  P   F +   G
Sbjct: 284 VPPNLYSDAAGGGHGVVQSTALIRYHSSQLKAYYISLKGVTVGTTRLPIPESVFALKEDG 343

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
            GG I+DSG+ +T     VY  + + FV+   + +L   +      QLC+ +P       
Sbjct: 344 TGGTIVDSGTGMTMLPEAVYNLVCDAFVA---QTKLTVHNSTSSLSQLCFSVPPGAKPDV 400

Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDL 324
           P++  +FE A L +  EN +F I+      L  LA+   +DL ++IG+ QQ++   +YDL
Sbjct: 401 PALVLHFEGATLDLPRENYMFEIEEAGGIRLTCLAINAGEDL-SVIGNFQQQNMHVLYDL 459

Query: 325 NIDLLSFVKENCS 337
             D+LSFV   C+
Sbjct: 460 ANDMLSFVPARCN 472


>gi|125539168|gb|EAY85563.1| hypothetical protein OsI_06935 [Oryza sativa Indica Group]
          Length = 514

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 104/372 (27%), Positives = 165/372 (44%), Gaps = 45/372 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L++GTP +   +I+DTGS L +               +FDP  S S++ + C  P C
Sbjct: 153 LVDLYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPATSLSYRNVTCGDPRC 212

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
                        +  ++ C Y   Y DQS T G  A E  T+++   G  + +    +F
Sbjct: 213 GLVAPPTAPRACRRPHSDPCPYYYWYGDQSNTTGDLALEAFTVNLTAPGASRRV-DDVVF 271

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G    A            R  +SF SQL ++    FSYCLV    +G    S 
Sbjct: 272 GCGHSNRGLFHGAAGLLGL-----GRGALSFASQLRAVYGHAFSYCLV---DHGSSVGSK 323

Query: 155 LKFGTD---MGYRRP--STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           + FG D   +G+ R   +  A       + FYY+ LK + +  E++N  P T+D+   G 
Sbjct: 324 IVFGDDDALLGHPRLNYTAFAPSAAAAADTFYYVQLKGVLVGGEKLNISPSTWDVGKDGS 383

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
           GG IIDSG+ L+YF    Y  +   FV   ++     ++D P  +  CY +        P
Sbjct: 384 GGTIIDSGTTLSYFAEPAYEVIRRAFVERMDK-AYPLVADFPV-LSPCYNVSGVERVEVP 441

Query: 269 SMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
             +  F D A      EN F+ +D +    L  +      +++IG+ QQ++   +YDL  
Sbjct: 442 EFSLLFADGAVWDFPAENYFVRLDPDGIMCLAVLGTPRSAMSIIGNFQQQNFHVLYDLQN 501

Query: 327 DLLSFVKENCSD 338
           + L F    C++
Sbjct: 502 NRLGFAPRRCAE 513


>gi|414876414|tpg|DAA53545.1| TPA: hypothetical protein ZEAMMB73_483039 [Zea mays]
          Length = 506

 Score =  122 bits (305), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 160/361 (44%), Gaps = 50/361 (13%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG+P++ + ++LDTGS + +               +FDP  S+S+  ++CD   C  
Sbjct: 169 RVGIGSPARQLYMVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRD 228

Query: 49  F---KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C N    C+Y + Y D S T G  A ET+++   G+   + + A+ GC +DN G 
Sbjct: 229 LDTAACRNATGACLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAI-GCGHDNEGL 284

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A      G   LS     F SQ   I    FSYCLV         +S L+FG   G 
Sbjct: 285 FVGAAGLLALGGGPLS-----FPSQ---ISASTFSYCLV---DRDSPAASTLQFGD--GA 331

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
               T     +  P  + FYY++L  IS+  + ++ P   F +   SG GG I+DSG+ +
Sbjct: 332 AEAGTVTAPLVRSPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAV 391

Query: 221 TYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
           T   S  Y  L + FV       R     L D       CY L + T    P+++  FE 
Sbjct: 392 TRLQSAAYAALRDAFVQGAPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEG 445

Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
              LR+  +N  I       + LA AP +  V++IG+ QQ+ TR  +D     + F    
Sbjct: 446 GGALRLPAKNYLIPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNK 505

Query: 336 C 336
           C
Sbjct: 506 C 506


>gi|224130548|ref|XP_002320868.1| predicted protein [Populus trichocarpa]
 gi|222861641|gb|EEE99183.1| predicted protein [Populus trichocarpa]
          Length = 488

 Score =  122 bits (305), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/365 (29%), Positives = 166/365 (45%), Gaps = 52/365 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP++ V ++LDTGS +++               +FDP KS SF  I C  P C 
Sbjct: 147 TRLGVGTPARYVYMVLDTGSDIVWIQCAPCIKCYSQTDPVFDPTKSRSFANIPCGSPLCR 206

Query: 48  ---YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
              Y  C  ++  C+Y + Y D S T G  + ET++  G   G+ +      GC +DN G
Sbjct: 207 RLDYPGCSTKKQICLYQVSYGDGSFTVGEFSTETLTFRGTRVGRVVL-----GCGHDNEG 261

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS----SYLKFG 158
               A            R  +SF SQ+G     +FSYCL      G+ ++    S + FG
Sbjct: 262 LFVGAAGLLGL-----GRGRLSFPSQIGRRFNSKFSYCL------GDRSASSRPSSIVFG 310

Query: 159 TDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIID 215
            D    R +T+ T  +++P  + FYY+ L  IS+   R++      F +  +G GG IID
Sbjct: 311 -DSAISR-TTRFTPLLSNPKLDTFYYVELLGISVGGTRVSGISASLFKLDSTGNGGVIID 368

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAF 272
           SG+ +T      Y  L + F+        + L   PE      C+ L  +T  + P++  
Sbjct: 369 SGTSVTRLTRAAYVALRDAFL-----VGASNLKRAPEFSLFDTCFDLSGKTEVKVPTVVL 423

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F  A++ +   N  I    +  F  A A     +++IG+ QQ+  R VYDL    + F 
Sbjct: 424 HFRGADVPLPASNYLIPVDNSGSFCFAFAGTASGLSIIGNIQQQGFRVVYDLATSRVGFA 483

Query: 333 KENCS 337
              C+
Sbjct: 484 PRGCA 488


>gi|357135412|ref|XP_003569303.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 494

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 46/366 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            ++ +GTP+   L++LDTGS +++               +FDPR+S S+  + C  P C 
Sbjct: 144 TKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYDQSGQVFDPRRSRSYGAVGCSAPLCR 203

Query: 48  YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D SVT G  A ET++  G   G  +   AL GC +DN G
Sbjct: 204 RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFAG---GARVARIAL-GCGHDNEG 259

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKFGTD 160
               A            R ++SF +Q+     + FSYCLV      N    SS + FG+ 
Sbjct: 260 LFVAAAGLLGL-----GRGSLSFPAQISRRYGRSFSYCLVDRTSSANPASHSSTVTFGSG 314

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIID 215
                 +   T  + +P    FYY+ L  IS+   R++   D+ D+ +   SG GG I+D
Sbjct: 315 AVGSTVAASFTPMVKNPRMETFYYVQLVGISVGGARVSGVADS-DLRLDPSSGRGGVIVD 373

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMA 271
           SG+ +T      Y  L + F     R   A L   P    L   CY L      + P+++
Sbjct: 374 SGTSVTRLARPAYSALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVS 428

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R V+D +   + 
Sbjct: 429 MHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVG 488

Query: 331 FVKENC 336
           FV + C
Sbjct: 489 FVPKGC 494


>gi|242050432|ref|XP_002462960.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
 gi|241926337|gb|EER99481.1| hypothetical protein SORBIDRAFT_02g035320 [Sorghum bicolor]
          Length = 445

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 97/376 (25%), Positives = 162/376 (43%), Gaps = 58/376 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP      I DTGS LI+                +++P  S++F  + C+   
Sbjct: 87  LMTLAIGTPPVSYQAIADTGSDLIWTQCAPCSSQCFQQPTPLYNPSSSTTFAVLPCNSSL 146

Query: 44  -------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-F 89
                        P CT        C+Y M Y     T  +   ET +            
Sbjct: 147 SMCAAALAGTTPPPGCT--------CMYNMTYGS-GWTSVYQGSETFTFGSSTPANQTGV 197

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
            G  FGCSN + GF+  +     +G++GL R ++S +SQLG     +FSYCL  P  +  
Sbjct: 198 PGIAFGCSNASGGFNTSSA----SGLVGLGRGSLSLVSQLG---VPKFSYCLT-PYQDTN 249

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDI 204
            TS+ L   +          +T F+  P++     +YYL+L  IS+    ++ P     +
Sbjct: 250 STSTLLLGPSASLNDTGGVSSTPFVASPSDAPMSTYYYLNLTGISLGTTALSIPTTALSL 309

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
              G GG IIDSG+ +T   +  Y ++    VS                + LC+ LP + 
Sbjct: 310 KADGTGGFIIDSGTTITLLGNTAYQQVRAAVVSLVT-LPTTDGGSAATGLDLCFELPSST 368

Query: 265 N---RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           +     PSM  +F+ A++ +  ++  ++D  N + L      D  V+++G+ QQ++   +
Sbjct: 369 SAPPTMPSMTLHFDGADMVLPADSYMMLD-SNLWCLAMQNQTDGGVSILGNYQQQNMHIL 427

Query: 322 YDLNIDLLSFVKENCS 337
           YD+  + L+F    CS
Sbjct: 428 YDVGQETLTFAPAKCS 443


>gi|356531224|ref|XP_003534178.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 492

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 107/359 (29%), Positives = 155/359 (43%), Gaps = 51/359 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +G PSK   ++LDTGS + +               IFDP  SSS+  + CD   C  
Sbjct: 160 RVGVGQPSKPFYMVLDTGSDVNWLQCKPCSDCYQQSDPIFDPTASSSYNPLTCDAQQCQD 219

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C N +C+Y + Y D S T G    ET+S      G    +    GC +DN G   
Sbjct: 220 LEMSACRNGKCLYQVSYGDGSFTVGEYVTETVSF-----GAGSVNRVAIGCGHDNEGL-- 272

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    I    FSYCLV         SS L+F +     R
Sbjct: 273 ------FVGSAGLLGLGGGPLSLTSQIKATSFSYCLV---DRDSGKSSTLEFNSP----R 319

Query: 166 P--STQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           P  S  A    N   N FYY+ L  +S+  E +  PP+TF +  SG GG I+DSG+ +T 
Sbjct: 320 PGDSVVAPLLKNQKVNTFYYVELTGVSVGGEIVTVPPETFAVDQSGAGGVIVDSGTAITR 379

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYFE-DA 277
             +  Y  + + F       + A      E + L   CY L    + R P+++F+F  D 
Sbjct: 380 LRTQAYNSVRDAFKRKTSNLRPA------EGVALFDTCYDLSSLQSVRVPTVSFHFSGDR 433

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              +  +N  I       +  A AP    +++IG+ QQ+ TR  +DL   L+ F    C
Sbjct: 434 AWALPAKNYLIPVDGAGTYCFAFAPTTSSMSIIGNVQQQGTRVSFDLANSLVGFSPNKC 492


>gi|15228044|ref|NP_181826.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|20197868|gb|AAM15292.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197965|gb|AAD21712.2| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330255100|gb|AEC10194.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 527

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 109/369 (29%), Positives = 159/369 (43%), Gaps = 40/369 (10%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           + +GTP K   LILDTGS L +                +DP+ S+SF+ I C+ P C+  
Sbjct: 164 VLVGTPPKHFSLILDTGSDLNWLQCLPCYDCFHQNGMFYDPKTSASFKNITCNDPRCSLI 223

Query: 50  -------KCV--NEQCVYTMKYADQSVTKGFAAHETISV----IGKGEGKAIFHGALFGC 96
                  +C   N+ C Y   Y D+S T G  A ET +V       G  +      +FGC
Sbjct: 224 SSPDPPVQCESDNQSCPYFYWYGDRSNTTGDFAVETFTVNLTTTEGGSSEYKVGNMMFGC 283

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF SQL S+    FSYCLV    N    SS L 
Sbjct: 284 GHWNRGLFSGASGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSNTN-VSSKLI 337

Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           FG D       +   T F+N   N    FYY+ +K I +  + ++ P +T++I+  G+GG
Sbjct: 338 FGEDKDLLNHTNLNFTSFVNGKENSVETFYYIQIKSILVGGKALDIPEETWNISSDGDGG 397

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
            IIDSG+ L+YF    Y  +  KF     E + + +     +P      + E     P +
Sbjct: 398 TIIDSGTTLSYFAEPAYEIIKNKFAEKMKENYPIFRDFPVLDPCFNVSGIEENNIHLPEL 457

Query: 271 AFYFEDANL-RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
              F D  +     EN FI   E+   L  +       ++IG+ QQ++   +YD     L
Sbjct: 458 GIAFVDGTVWNFPAENSFIWLSEDLVCLAILGTPKSTFSIIGNYQQQNFHILYDTKRSRL 517

Query: 330 SFVKENCSD 338
            F    C+D
Sbjct: 518 GFTPTKCAD 526


>gi|255558694|ref|XP_002520371.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223540418|gb|EEF41987.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 557

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 115/372 (30%), Positives = 160/372 (43%), Gaps = 48/372 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +FIGTP +   LILDTGS L +                +DP++SSSF+ I C  P C   
Sbjct: 196 VFIGTPPRHFSLILDTGSDLNWIQCVPCYDCFVQNGPYYDPKESSSFKNIGCHDPRCHLV 255

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFG 95
                    K  N+ C Y   Y D S T G  A ET +V      GK E K +    +FG
Sbjct: 256 SSPDPPQPCKAENQTCPYFYWYGDSSNTTGDFALETFTVNLTSPAGKSEFKRV-ENVMFG 314

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G    A            R  +SF SQL S+    FSYCLV    +    SS L
Sbjct: 315 CGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKL 368

Query: 156 KFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            FG D      P    T  +    N  + FYY+ +K I +  E +  P +T+ ++  G G
Sbjct: 369 IFGEDKDLLNHPEVNFTSLVAGKENPVDTFYYVQIKSIMVGGEVLKIPEETWHLSPEGAG 428

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
           G I+DSG+ L+YF    Y  + + FV   + + +  + D P  +  CY +        P 
Sbjct: 429 GTIVDSGTTLSYFAEPSYEIIKDAFVKKVKGYPV--IKDFPI-LDPCYNVSGVEKMELPE 485

Query: 270 MAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNI 326
               FED A      EN FI         LA+   P   L ++IG+ QQ++   +YD   
Sbjct: 486 FRILFEDGAVWNFPVENYFIKLEPEEIVCLAILGTPRSAL-SIIGNYQQQNFHILYDTKK 544

Query: 327 DLLSFVKENCSD 338
             L +    C+D
Sbjct: 545 SRLGYAPMKCAD 556


>gi|30683732|ref|NP_180371.2| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|28392898|gb|AAO41885.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|56382011|gb|AAV85724.1| At2g28040 [Arabidopsis thaliana]
 gi|330252978|gb|AEC08072.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 395

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 107/355 (30%), Positives = 160/355 (45%), Gaps = 47/355 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP   +  +LDTGS  I+               IFDP KSS+F++I CD  D 
Sbjct: 66  LMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTHD- 124

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y  +S TKG    ET+++        +    + GC  +N GF   
Sbjct: 125 -------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKP- 176

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
                 AGV+GL R   S I+Q+G       SYC       G+ TS  + FG +      
Sbjct: 177 ----GFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSK-INFGANAIVAGD 226

Query: 167 STQATK-FINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
              +T  F+      FYYL+L  +S+ N R+      F    + +G  +IDSGS LTYF 
Sbjct: 227 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH---ALKGNIVIDSGSTLTYF- 282

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
            + Y  L  K V      Q+      P    LCY+  +T + FP +  +F   A+L +D 
Sbjct: 283 PESYCNLVRKAVE-----QVVTAVRFPRSDILCYY-SKTIDIFPVITMHFSGGADLVLDK 336

Query: 284 ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            N+++       F LA+  +  +  A+ G++ Q +    YD +  L+SF   NCS
Sbjct: 337 YNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 391


>gi|413952720|gb|AFW85369.1| hypothetical protein ZEAMMB73_571116 [Zea mays]
          Length = 451

 Score =  121 bits (304), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 103/370 (27%), Positives = 166/370 (44%), Gaps = 43/370 (11%)

Query: 1   MVRLFIGTP-SKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPD 45
           ++   IGTP  + V L +DTGS L++               +FDP  SS+F+ + C  P 
Sbjct: 88  LIHFNIGTPRPQRVALTMDTGSDLVWTQCTPCPVCFDQPFPLFDPSVSSTFRAVACPDPI 147

Query: 46  C------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVI---GKGEGKAIFHGALF 94
           C      +   C  +  +C Y   Y D+S+T G+   +T + +   G+G       G  F
Sbjct: 148 CRPSSGLSVSACALKTFRCFYLCSYGDKSITAGYIFKDTFTFMSPNGEGAPPVAVSGLAF 207

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G          +G+ G  R  +S  SQL      RFSYCL          +S 
Sbjct: 208 GCGDYNTGVFASNE----SGIAGFGRGPLSLPSQL---RVGRFSYCLTSHDETESNKTSA 260

Query: 155 LKFGTDM-GYRRPST---QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
           +  GT   G R  S+   ++T  I+ P+   FYYLSL+ I++   R+      F +   G
Sbjct: 261 VFLGTPPNGLRAHSSGPFRSTPIIHSPSFPTFYYLSLEGITVGKTRLPVDSSVFALKKDG 320

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
            GG +IDSG+ +T F + V+ +L  +FV+     +    S+      LC+  P+   +  
Sbjct: 321 SGGTVIDSGTGVTTFPAAVFEQLKNEFVAQLPLPRYDNTSEVGN--LLCFQRPKGGKQVP 378

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            P + F+   A++ +  EN    D ++    L +   +  + LIG+ QQ++   VYD+  
Sbjct: 379 VPKLIFHLASADMDLPRENYIPEDTDSGVMCLMINGAEVDMVLIGNFQQQNMHIVYDVEN 438

Query: 327 DLLSFVKENC 336
             L F    C
Sbjct: 439 SKLLFASAQC 448


>gi|4063754|gb|AAC98462.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197956|gb|AAM15330.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
          Length = 389

 Score =  121 bits (304), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 107/355 (30%), Positives = 160/355 (45%), Gaps = 47/355 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP   +  +LDTGS  I+               IFDP KSS+F++I CD  D 
Sbjct: 60  LMKLQIGTPPFEIEAVLDTGSEHIWTQCLPCVHCYNQTAPIFDPSKSSTFKEIRCDTHD- 118

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y  +S TKG    ET+++        +    + GC  +N GF   
Sbjct: 119 -------HSCPYELVYGGKSYTKGTLVTETVTIHSTSGQPFVMPETIIGCGRNNSGFKP- 170

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
                 AGV+GL R   S I+Q+G       SYC       G+ TS  + FG +      
Sbjct: 171 ----GFAGVVGLDRGPKSLITQMGGEYPGLMSYCFA-----GKGTSK-INFGANAIVAGD 220

Query: 167 STQATK-FINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
              +T  F+      FYYL+L  +S+ N R+      F    + +G  +IDSGS LTYF 
Sbjct: 221 GVVSTTVFVKTAKPGFYYLNLDAVSVGNTRIETVGTPFH---ALKGNIVIDSGSTLTYF- 276

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
            + Y  L  K V      Q+      P    LCY+  +T + FP +  +F   A+L +D 
Sbjct: 277 PESYCNLVRKAVE-----QVVTAVRFPRSDILCYY-SKTIDIFPVITMHFSGGADLVLDK 330

Query: 284 ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            N+++       F LA+  +  +  A+ G++ Q +    YD +  L+SF   NCS
Sbjct: 331 YNMYVASNTGGVFCLAIICNSPIEEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 385


>gi|225470916|ref|XP_002263964.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147788999|emb|CAN64659.1| hypothetical protein VITISV_009613 [Vitis vinifera]
          Length = 489

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 106/360 (29%), Positives = 159/360 (44%), Gaps = 43/360 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP K V ++LDTGS +++               +FDP+KS SF  I+C  P C 
Sbjct: 149 TRLGVGTPPKYVYMVLDTGSDVVWIQCAPCRKCYSQTDPVFDPKKSGSFSSISCRSPLCL 208

Query: 48  YF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                 C + Q C+Y + Y D S T G  + ET++  G    K        GC +DN G 
Sbjct: 209 RLDSPGCNSRQSCLYQVAYGDGSFTFGEFSTETLTFRGTRVPKVAL-----GCGHDNEGL 263

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            R  +SF +Q G    ++FSYCLV    + + +S  + FG     
Sbjct: 264 FVGAAGLLGL-----GRGRLSFPTQTGLRFGRKFSYCLVDRSASSKPSS--VVFGQSAVS 316

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVL 220
           R  +   T  I +P  + FYYL L  IS+   R+       F +  +G GG IIDSG+ +
Sbjct: 317 R--TAVFTPLITNPKLDTFYYLELTGISVGGARVAGITASLFKLDTAGNGGVIIDSGTSV 374

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
           T      Y  L + F     R   A L   P+      C+ L  +T  + P++  +F  A
Sbjct: 375 TRLTRRAYVSLRDAF-----RAGAADLKRAPDYSLFDTCFDLSGKTEVKVPTVVMHFRGA 429

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++ +   N  I    N  F  A A     +++IG+ QQ+  R V+D+    + F    C+
Sbjct: 430 DVSLPATNYLIPVDTNGVFCFAFAGTMSGLSIIGNIQQQGFRVVFDVAASRIGFAARGCA 489


>gi|414589628|tpg|DAA40199.1| TPA: hypothetical protein ZEAMMB73_627989 [Zea mays]
          Length = 452

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 162/373 (43%), Gaps = 54/373 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V  +LDTGS LI+               +F P +S+S++ + C    C
Sbjct: 97  VVDLAIGTPPQPVSALLDTGSDLIWTQCAPCASCLSQPDPLFAPGQSASYEPMRCAGTLC 156

Query: 47  T---YFKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--FGCSNDN 100
           +   +  C   + C Y   Y D ++T G  A E  +    G G          FGC + N
Sbjct: 157 SDILHHSCERPDTCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCGSVN 216

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKF 157
            G   +      +G++G  R  +S +SQL SI  +RFSYCL        Y S   S L F
Sbjct: 217 VGSLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT------SYASRRQSTLLF 262

Query: 158 GT----DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G+      G      Q T  +  P N  FYY+    +++   R+  P   F +   G GG
Sbjct: 263 GSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGG 322

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR----- 266
            I+DSG+ LT   + V   L E   ++ ++ +L   +       +C+ +P  + R     
Sbjct: 323 VIVDSGTALTLLPAAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTS 379

Query: 267 ---FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
               P M  +F+ A+L +   N  + D+      L +A   D  + IG+  Q+D R +YD
Sbjct: 380 QMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYD 439

Query: 324 LNIDLLSFVKENC 336
           L  + LS     C
Sbjct: 440 LEAETLSIAPARC 452


>gi|242079451|ref|XP_002444494.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
 gi|241940844|gb|EES13989.1| hypothetical protein SORBIDRAFT_07g022800 [Sorghum bicolor]
          Length = 445

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 106/366 (28%), Positives = 161/366 (43%), Gaps = 49/366 (13%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC----- 46
           IGTP +   LILDTGS LI+               ++DP KSSSF    CD   C     
Sbjct: 95  IGTPPQPRTLILDTGSDLIWTQCKLFDTRQHREKPLYDPAKSSSFAAAPCDGRLCETGSF 154

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C   +C+YT  Y   + TKG  A ET +    GE + +     FGC     G    
Sbjct: 155 NTKNCSRNKCIYTYNYGS-ATTKGELASETFTF---GEHRRVSVSLDFGCGKLTSGSLPG 210

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMGYR 164
           A     +G+LG+S   +S +SQL      RFSYCL   L     T+S++ FG   D+   
Sbjct: 211 A-----SGILGISPDRLSLVSQLQ---IPRFSYCLTPFL--DRNTTSHIFFGAMADLSKY 260

Query: 165 RPS--TQATKFINHP---NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           R +   Q T  + +P   N +YY+ L  IS+  +R+N P  +F I   G GG  +DSG  
Sbjct: 261 RTTGPIQTTSLVTNPDGSNYYYYVPLIGISVGTKRLNVPVSSFAIGRDGSGGTFVDSGDT 320

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-------ETFNRFPSMAF 272
                S V   L E  V    +  +   +D     +LC+ LP       ET  + P + +
Sbjct: 321 TGMLPSVVMEALKEAMVEAV-KLPVVNATDHGYEYELCFQLPRNGGGAVETAVQVPPLVY 379

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F+     +   + ++++       L ++      A+IG+ QQ++   ++D+     SF 
Sbjct: 380 HFDGGAAMLLRRDSYMVEVSAGRMCLVIS-SGARGAIIGNYQQQNMHVLFDVENHEFSFA 438

Query: 333 KENCSD 338
              C+ 
Sbjct: 439 PTQCNQ 444


>gi|297834758|ref|XP_002885261.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331101|gb|EFH61520.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 500

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 156/356 (43%), Gaps = 42/356 (11%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +GTP+K + L+LDTGS + +               +F+P  SS+++ + C  P C+ 
Sbjct: 165 RIGVGTPAKEMYLVLDTGSDVNWIQCEPCSDCYQQSDPVFNPTSSSTYKSLTCSAPQCSL 224

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C + +C+Y + Y D S T G  A +T++    G+     +    GC +DN G   
Sbjct: 225 LETSACRSNKCLYQVSYGDGSFTVGELATDTVTFGNSGK----INDVALGCGHDNEGL-- 278

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    +    FSYCLV         SS L F +      
Sbjct: 279 ------FTGAAGLLGLGGGALSITNQMKATSFSYCLV---DRDSGKSSSLDFNSVQLGSG 329

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            +T         + FYY+ L   S+  +++  P   FD+  SG GG I+D G+ +T   +
Sbjct: 330 DATAPLLRNQKIDTFYYVGLSGFSVGGQKVMMPDAIFDVDASGSGGVILDCGTAVTRLQT 389

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFYFEDA-NLR 280
             Y  L + F+      +    S     I L   CY F   +  + P++AF+F    +L 
Sbjct: 390 QAYNSLRDAFLKLTTNLKKGTSS-----ISLFDTCYDFSSLSSVKVPTVAFHFTGGKSLD 444

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N  I   +N  F  A AP    +++IG+ QQ+ TR  YDL   ++      C
Sbjct: 445 LPAKNYLIPVDDNGTFCFAFAPTSSSLSIIGNVQQQGTRITYDLANKIIGLSGNKC 500


>gi|414590468|tpg|DAA41039.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 469

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 159/371 (42%), Gaps = 50/371 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ L IGTP      + DTGS LI+                +++P  S++F  + C+   
Sbjct: 113 LMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS-- 170

Query: 46  CTYFKCVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            +   C              C+Y   Y     T G    ET +       +A   G  FG
Sbjct: 171 -SLSMCAGALAGAAPPPGCACMYNQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAFG 228

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           CSN +   D +      AG++GL R ++S +SQLG+    RFSYCL  P  +   TS+ L
Sbjct: 229 CSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLL 279

Query: 156 KFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
             G          ++T F+  P     + +YYL+L  IS+  + +   P  F +   G G
Sbjct: 280 -LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 338

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR---- 266
           G IIDSG+ +T   +  Y ++     S          SD    + LC+ LP   +     
Sbjct: 339 GLIIDSGTTITSLANAAYQQVRAAVKSLVTTLPTVDGSDS-TGLDLCFALPAPTSAPPAV 397

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            PSM  +F+ A++ +  ++ ++I     + L      D  ++  G+ QQ++   +YD+  
Sbjct: 398 LPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVRE 456

Query: 327 DLLSFVKENCS 337
           + LSF    CS
Sbjct: 457 ETLSFAPAKCS 467


>gi|242077672|ref|XP_002448772.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
 gi|241939955|gb|EES13100.1| hypothetical protein SORBIDRAFT_06g032900 [Sorghum bicolor]
          Length = 471

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 155/362 (42%), Gaps = 44/362 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ IG+P     L++D+GS +I+               +FDP  S++F  ++C    C 
Sbjct: 127 VRVGIGSPPTEQYLVVDSGSDVIWVQCKPCLECYAQADPLFDPASSATFSAVSCGSAICR 186

Query: 48  YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +      +  C Y + Y D S TKG  A ET+++     G     G   GC + N G 
Sbjct: 187 TLRTSGCGDSGGCEYEVSYGDGSYTKGTLALETLTL-----GGTAVEGVAIGCGHRNRGL 241

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKFGTD 160
              A     AG+LGL    +S + QLG      FSYCL     +G   +     L  G  
Sbjct: 242 FVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASRGGSGSGAADAAGSLVLGRS 296

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 +      + +P   +FYY+ +  I + +ER+      F +T  G GG ++D+G+
Sbjct: 297 EAVPEGAVW-VPLVRNPQAPSFYYVGVSGIGVGDERLPLQDGLFQLTEDGGGGVVMDTGT 355

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYFE 275
            +T    + Y  L + FV       +  L   P    +  CY L   T  R P+++FYF+
Sbjct: 356 AVTRLPQEAYAALRDAFVG-----AVGALPRAPGVSLLDTCYDLSGYTSVRVPTVSFYFD 410

Query: 276 D-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A L +   N  +++ +   + LA AP    ++++G+ QQ   +   D     + F   
Sbjct: 411 GAATLTLPARN-LLLEVDGGIYCLAFAPSSSGLSILGNIQQEGIQITVDSANGYIGFGPA 469

Query: 335 NC 336
            C
Sbjct: 470 TC 471


>gi|356558300|ref|XP_003547445.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 447

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 165/360 (45%), Gaps = 43/360 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           M  + IG P    L+++DTGS +++               +FDP KSS+F  + C  P C
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNDLGLLFDPSKSSTFSPL-CKTP-C 159

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
            +  C  +   +T+ YAD S   G    +T+      EG +     LFGC + N G D D
Sbjct: 160 DFEGCRCDPIPFTVTYADNSTASGTFGRDTVVFETTDEGTSRISDVLFGCGH-NIGHDTD 218

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYR 164
                  G+LGL+    S +++LG    ++FSYC+  +  P   Y    L  G D+ GY 
Sbjct: 219 P---GHNGILGLNNGPDSLVTKLG----QKFSYCIGNLADPYYNYHQLILGEGADLEGYS 271

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
            P           N FYY++++ IS+  +R++  P+TF++  +  GG IID+GS +T+  
Sbjct: 272 TP-------FEVYNGFYYVTMEGISVGEKRLDIAPETFEMKENRAGGVIIDTGSTITFLV 324

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRI 281
             V+ KL  K V     +   Q +    P   C++  +      FP + F+F D A+L +
Sbjct: 325 DSVH-KLLSKEVRNLLGWSFRQATIEKSPWMQCFYGSISRDLVGFPVVTFHFSDGADLAL 383

Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           D    F     ++ F + V P   L      +LIG   Q+     YDL    + F + +C
Sbjct: 384 D-SGSFFNQLNDNVFCMTVGPVSSLNIKSKPSLIGLLAQQSYNVGYDLVNQFVYFQRIDC 442


>gi|357143680|ref|XP_003573011.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 510

 Score =  120 bits (302), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 104/369 (28%), Positives = 160/369 (43%), Gaps = 41/369 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V +++GTP +   +I+DTGS L +               +FDP  S+S++ + C    C
Sbjct: 151 LVEVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFDQRGPVFDPMASTSYRNVTCGDTRC 210

Query: 47  ----------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                     T     ++ C Y   Y DQS T G  A E  +V           G + GC
Sbjct: 211 GLVSPPAAPRTCRSSRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTASSSRRVDGVVLGC 270

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF SQL ++    FSYCLV    +G    S + 
Sbjct: 271 GHRNRGLFHGAAGLLGL-----GRGPLSFASQLRAVYGHAFSYCLV---DHGSAVGSKIV 322

Query: 157 FGTD-MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
           FG D +    P    T F      N FYY+ LK I +  E ++ P +T+ ++   G GG 
Sbjct: 323 FGDDNVLLSHPQLNYTAFAPSAAENTFYYVQLKGILVGGEMLDIPSNTWGVSKEDGSGGT 382

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
           IIDSG+ L+YF    Y  + + FV   ++     ++D P  +  CY +        P  +
Sbjct: 383 IIDSGTTLSYFPEPAYKAIRQAFVDRMDK-AYPLIADFPV-LSPCYNVSGVERVEVPEFS 440

Query: 272 FYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
             F D A      EN FI +D E    L  +      +++IG+ QQ++   +YDL+ + L
Sbjct: 441 LLFADGAVWDFPAENYFIRLDTEGIMCLAVLGTPRSAMSIIGNYQQQNFHVLYDLHHNRL 500

Query: 330 SFVKENCSD 338
            F    C++
Sbjct: 501 GFAPRRCAE 509


>gi|224114179|ref|XP_002332420.1| predicted protein [Populus trichocarpa]
 gi|222832373|gb|EEE70850.1| predicted protein [Populus trichocarpa]
          Length = 449

 Score =  120 bits (302), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 89/327 (27%), Positives = 151/327 (46%), Gaps = 27/327 (8%)

Query: 28  FDPRKSSSFQKINCD-HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK 86
           +   +S S++ ++C+ H  C   +C    C Y + Y   S T G  A+ET +        
Sbjct: 134 YTSSQSKSYKPVSCNQHSFCEPNQCKEGLCAYNVTYGPGSYTSGNLANETFTFYSNHGKH 193

Query: 87  AIFHGALFGCSNDN----HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
                  FGCS D+    + F  D     ++GVLG+     SF++QLGSI   +FSYC+ 
Sbjct: 194 TALKSISFGCSTDSRNMIYAFLLDKN--PVSGVLGMGWGPRSFLAQLGSISHGKFSYCIT 251

Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDT 201
                    ++YL+FG  +  +  + Q TK +   P+  Y+++L  IS++  ++N     
Sbjct: 252 A----NNTHNTYLRFGKHV-VKSKNLQTTKIMQVKPSAAYHVNLLGISVNGVKLNITKTD 306

Query: 202 FDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSY------FERFQLAQLSDCPEPIQ 255
             +   G  GCIID+G++ T     ++  LH    ++       +R+ + +L        
Sbjct: 307 LAVRKDGSRGCIIDAGTLATLLVKPIFDTLHTALSNHLSSNQNLKRWVIHKLHK-----D 361

Query: 256 LCYFLPETFNR--FPSMAFYFEDANLRIDGENVFII-DYENHFFLLAVAPHDDLVALIGS 312
           LCY       R   P + F+ E+A+L +  E +F+  ++E           DD   +IG+
Sbjct: 362 LCYEQLSDAGRKNLPVVTFHLENADLEVKPEAIFLFREFEGKNVFCLSMLSDDSKTIIGA 421

Query: 313 QQQRDTRFVYDLNIDLLSFVKENCSDD 339
            QQ   +FVYD    +LSF  E+C  +
Sbjct: 422 YQQMKQKFVYDTKARVLSFGPEDCEKN 448


>gi|414877221|tpg|DAA54352.1| TPA: hypothetical protein ZEAMMB73_214154 [Zea mays]
          Length = 447

 Score =  120 bits (301), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 103/365 (28%), Positives = 164/365 (44%), Gaps = 43/365 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP    + + DTGS L +               I+D   SSSF  + C    C
Sbjct: 94  LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPIYDTAVSSSFSPVPCASATC 153

Query: 47  ----TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
               +   C   +  C Y   Y D + + G    ET++  G   G ++  G  FGC  DN
Sbjct: 154 LPIWSSRNCTASSSPCRYRYAYGDGAYSAGVLGTETLTFPGA-PGVSV-GGIAFGCGVDN 211

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G   ++      G +GL R ++S ++QLG     +FSYCL     N    S  L FG  
Sbjct: 212 GGLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLT-DFFNTSLGSPVL-FGAL 261

Query: 161 MGYRRPST----QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                PST    Q+T  +  P    +YY+SL+ IS+ + R+  P  TFD+   G GG I+
Sbjct: 262 AELAAPSTGAAVQSTPLVQSPYVPTWYYVSLEGISLGDARLPIPNGTFDLRDDGSGGMIV 321

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           DSG+  T+     + ++    V+   R  +   S    P        +     P M  +F
Sbjct: 322 DSGTTFTFLVESAF-RVVVDHVAGVLRQPVVNASSLDSPCFPAATGEQQLPAMPDMVLHF 380

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
              A++R+  +N    + E   F L +A  P  D V+++G+ QQ++ + ++D+ +  LSF
Sbjct: 381 AGGADMRLHRDNYMSFNQEESSFCLNIAGSPSAD-VSILGNFQQQNIQMLFDITVGQLSF 439

Query: 332 VKENC 336
           +  +C
Sbjct: 440 MPTDC 444


>gi|125555056|gb|EAZ00662.1| hypothetical protein OsI_22683 [Oryza sativa Indica Group]
          Length = 491

 Score =  120 bits (301), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 109/364 (29%), Positives = 163/364 (44%), Gaps = 50/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
           +V + +GTP++   LI DTGS L +                  +FDP KSS++  ++C  
Sbjct: 150 VVAVGLGTPAQPSALIFDTGSDLSWVQCQPCGSSGHCHPQQDPLFDPSKSSTYAAVHCGE 209

Query: 44  PDCTYFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
           P C     +    N  C+Y + Y D S T G  + +T+++           G  FGC   
Sbjct: 210 PQCAAAGGLCSEDNTTCLYLVHYGDGSSTTGVLSRDTLALTSS----RALAGFPFGCGTR 265

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G + G+LGL R  +S  SQ  +     FSYCL    P+   T+ YL  G 
Sbjct: 266 NLG-----DFGRVDGLLGLGRGELSLPSQAAASFGAVFSYCL----PSSNSTTGYLTIGA 316

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
                  + Q T  +  P   +FY++ L  I I    +  PP  F       GG ++DSG
Sbjct: 317 TPATDTGAAQYTAMLRKPQFPSFYFVELVSIDIGGYILPVPPAVFT-----RGGTLLDSG 371

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
           +VLTY  +  Y  L ++F    ER+  A  +D    +  CY F  E+    P+++F F D
Sbjct: 372 TVLTYLPAQAYELLRDRFRLTMERYTPAPPNDV---LDACYDFAGESEVIVPAVSFRFGD 428

Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDD---LVALIGSQQQRDTRFVYDLNIDLLSFV 332
            A   +D   V I   EN    LA A  D     +++IG+ QQR    +YD+  + + FV
Sbjct: 429 GAVFELDFFGVMIFLDEN-VGCLAFAAMDAGGLPLSIIGNTQQRSAEVIYDVAAEKIGFV 487

Query: 333 KENC 336
             +C
Sbjct: 488 PASC 491


>gi|357143854|ref|XP_003573079.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 417

 Score =  120 bits (301), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 168/362 (46%), Gaps = 38/362 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP    + + DTGS L +               ++DP  SS+F  + C    C
Sbjct: 67  LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC 126

Query: 47  --TYFK--CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGCSND 99
             T+    C N    C Y   Y+D + + G    ET+++     G+ +  G++ FGC  D
Sbjct: 127 LPTWRSRNCSNPSSPCRYIYSYSDGAYSVGILGTETLTIGSSVPGQTVSVGSVAFGCGTD 186

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G   ++      G +GL R T+S ++QLG     +FSYCL     +   +  +L    
Sbjct: 187 NGGDSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLTDFFNSTMDSPFFLGTLA 238

Query: 160 DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           ++     + Q+T  +  P N   Y+++L+ IS+ + R+  P  TFD+   G GG ++DSG
Sbjct: 239 ELAPGPGTVQSTPLLQSPLNPSRYFVNLQGISLGDVRLPIPNGTFDLRADGNGGMMVDSG 298

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           +  T      + ++ ++      +  +   S D P     C+  P+     P +  +F  
Sbjct: 299 TTFTILAKSGFREVVDRVAQLLGQPPVNASSLDSP-----CFPSPDGEPFMPDLVLHFAG 353

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A++R+  +N    + ++  F L +       + +G+ QQ++ + ++D+ +  LSF+  +
Sbjct: 354 GADMRLHRDNYMSYNEDDSSFCLNIVGSPSTWSRLGNFQQQNIQMLFDMTVGQLSFLPTD 413

Query: 336 CS 337
           CS
Sbjct: 414 CS 415


>gi|297805038|ref|XP_002870403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316239|gb|EFH46662.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 440

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 169/361 (46%), Gaps = 41/361 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +GTP   ++ I DTGS L++               +FDP+ SS+++ ++C    C
Sbjct: 95  LMNISLGTPPFPIMAIADTGSDLLWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQC 154

Query: 47  TYFK----CVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           T  +    C  E   C Y+  Y D+S TKG  A +T+++             + GC ++N
Sbjct: 155 TALENQASCSTEDNTCSYSTSYGDRSYTKGNIAVDTLTLGSTDTRPVQLKNIIIGCGHNN 214

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G      +   +G++GL    +S I+QLG  I  +FSYCLV PL +    +S + FGT+
Sbjct: 215 AG----TFNKKGSGIVGLGGGAVSLITQLGDSIDGKFSYCLV-PLTSENDRTSKINFGTN 269

Query: 161 MGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                    +T  I      FYYL+LK IS+ ++ + +P      + SGEG  IIDSG+ 
Sbjct: 270 AVVSGTGVVSTPLIAKSQETFYYLTLKSISVGSKEVQYPGSD---SGSGEGNIIIDSGTT 326

Query: 220 LTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           LT   ++ Y +L +   S    E+ Q  Q       + LCY       + P++  +F+ A
Sbjct: 327 LTLLPTEFYSELEDAVASSIDAEKKQDPQTG-----LSLCYSATGDL-KVPAITMHFDGA 380

Query: 278 NLRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++ +   N F+   E+   F    +P     ++ G+  Q +    YD     +SF   +C
Sbjct: 381 DVNLKPSNCFVQISEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDC 437

Query: 337 S 337
           +
Sbjct: 438 A 438


>gi|226504334|ref|NP_001141706.1| uncharacterized protein LOC100273835 precursor [Zea mays]
 gi|194705620|gb|ACF86894.1| unknown [Zea mays]
 gi|414885968|tpg|DAA61982.1| TPA: hypothetical protein ZEAMMB73_231717 [Zea mays]
          Length = 477

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 106/362 (29%), Positives = 156/362 (43%), Gaps = 50/362 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           L +GTP+  +L+ LDTGS   +              A+FDP KSS++  I C   +C   
Sbjct: 138 LRLGTPATDLLVELDTGSDQSWIQCKPCPDCYEQHEALFDPSKSSTYSDITCSSRECQEL 197

Query: 50  ------KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  +++C Y + YAD S T G  A +T+++           G +FGC ++N G
Sbjct: 198 GSSHKHNCSSDKKCPYEITYADDSYTVGNLARDTLTL----SPTDAVPGFVFGCGHNNAG 253

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G + G+LGL R   S  SQ+ +     FSYC    LP+    + YL F     
Sbjct: 254 -----SFGEIDGLLGLGRGKASLSSQVAARYGAGFSYC----LPSSPSATGYLSFSGAAA 304

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               + Q T+ +   HP +FYYL+L  I++    +  PP  F        G IIDSG+  
Sbjct: 305 AAPTNAQFTEMVAGQHP-SFYYLNLTGITVAGRAIKVPPSVFATAA----GTIIDSGTAF 359

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED-A 277
           +      Y  L     S   R++ A  S        CY L   ET  R PS+A  F D A
Sbjct: 360 SCLPPSAYAALRSSVRSAMGRYKRAPSSTI---FDTCYDLTGHETV-RIPSVALVFADGA 415

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYDLNIDLLSFVKEN 335
            + +    V           LA  P+ D  +L  +G+ QQR    +YD++   + F    
Sbjct: 416 TVHLHPSGVLYTWSNVSQTCLAFLPNPDDTSLGVLGNTQQRTLAVIYDVDNQKVGFGANG 475

Query: 336 CS 337
           C+
Sbjct: 476 CA 477


>gi|225427554|ref|XP_002266533.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 447

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 107/361 (29%), Positives = 166/361 (45%), Gaps = 36/361 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + +GTP   +  I DTGS L++               IFDP KS ++Q ++C+   C
Sbjct: 96  LMNISLGTPPVSMHGIADTGSDLLWRQCKPCDSCYEQIEPIFDPAKSKTYQILSCEGKSC 155

Query: 47  TYFK----CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDN 100
           +       C ++  C+Y+  Y D S T G  A +T++ IG   G+ +     +FGC ++N
Sbjct: 156 SNLGGQGGCSDDNTCIYSYSYGDGSHTSGDLAVDTLT-IGSTTGRPVSVPKVVFGCGHNN 214

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G  E    G +     L    +S ISQL  +I  RFSYCLV PL N    SS + FG+ 
Sbjct: 215 GGTFELHGSGLVG----LGGGPLSMISQLRPLIGGRFSYCLV-PLGNDPSVSSKMHFGSR 269

Query: 161 MGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIIDS 216
                    +T   +  P+ FYYL+L+ +S+ ++++    F      +  + EG  IIDS
Sbjct: 270 GIVSGAGAVSTPLASRQPDTFYYLTLESMSVGSKKLAYKGFSKVGSPLADADEGNIIIDS 329

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+ LT    D Y  L    VS         + D      LCY       R P++  +F  
Sbjct: 330 GTTLTLLPQDFYGTLESNVVSAIGG---KPVRDPNNVFSLCYSNLSGL-RIPTITAHFVG 385

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L +   N F +  +   F  A+ P  DL A+ G+  Q +    YDL    +SF   +C
Sbjct: 386 ADLELKPLNTF-VQVQEDLFCFAMIPVSDL-AIFGNLAQMNFLVGYDLKSRTVSFKPTDC 443

Query: 337 S 337
           +
Sbjct: 444 T 444


>gi|356527089|ref|XP_003532146.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 488

 Score =  119 bits (299), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/354 (29%), Positives = 156/354 (44%), Gaps = 56/354 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + +GTP + + LI DTGS L +               AIFDP KS+S+  I C    C
Sbjct: 147 VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDAIFDPSKSTSYSNITCTSTLC 206

Query: 47  TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           T                + C+Y ++Y D S + G+ + E +SV        I    LFGC
Sbjct: 207 TQLSTATGNEPGCSASTKACIYGIQYGDSSFSVGYFSRERLSVTATD----IVDNFLFGC 262

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             +N G       G  AG++GL R  ISF+ Q  ++ +K FSYC    LP    ++  L 
Sbjct: 263 GQNNQGLF-----GGSAGLIGLGRHPISFVQQTAAVYRKIFSYC----LPATSSSTGRLS 313

Query: 157 FG-TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           FG T   Y + +  +T  I+  ++FY L +  IS+   ++     TF       GG IID
Sbjct: 314 FGTTTTSYVKYTPFST--ISRGSSFYGLDITGISVGGAKLPVSSSTFST-----GGAIID 366

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLP--ETFNRFPSMAF 272
           SG+V+T      Y  L   F     ++  A +LS     +  CY L   E F+  P + F
Sbjct: 367 SGTVITRLPPTAYTALRSAFRQGMSKYPSAGELS----ILDTCYDLSGYEVFS-IPKIDF 421

Query: 273 YFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDL 324
            F     +++  + +  +       L   A  DD  V + G+ QQ+    VYD+
Sbjct: 422 SFAGGVTVQLPPQGILYVASAKQVCLAFAANGDDSDVTIYGNVQQKTIEVVYDV 475


>gi|302781668|ref|XP_002972608.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
 gi|300160075|gb|EFJ26694.1| hypothetical protein SELMODRAFT_97538 [Selaginella moellendorffii]
          Length = 430

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 104/381 (27%), Positives = 156/381 (40%), Gaps = 53/381 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRK------------SSSFQKI 39
           +V +  GTP + VLLI DTGS LI+           F P+K            S++   +
Sbjct: 55  LVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSVV 114

Query: 40  NCDHPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
            C    C                    C Y   YAD S T GF A +T ++     G A 
Sbjct: 115 PCSAAQCLLVPAPRGHGPSCSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAA 174

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G  FGC   N G           GV+GL +  +SF +Q GS+  + FSYCL + L  G
Sbjct: 175 VRGVAFGCGTRNQGGSFSG----TGGVIGLGQGQLSFPAQSGSLFAQTFSYCL-LDLEGG 229

Query: 149 EY--TSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
               +SS+L  G     RR +   T  +++P    FYY+ +  I + N  +  P   + I
Sbjct: 230 RRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 287

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY------ 258
            V G GG +IDSGS LTY     Y  L   F +     ++   +   + ++LCY      
Sbjct: 288 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 347

Query: 259 FLPETFNRFPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQ 315
            L      FP +   F    +L +   N +++D  +    LA+ P     A  ++G+  Q
Sbjct: 348 SLAPANGGFPRLTIDFAQGLSLELPTGN-YLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 406

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           +     +D     + F +  C
Sbjct: 407 QGYHVEFDRASARIGFARTEC 427


>gi|413953782|gb|AFW86431.1| hypothetical protein ZEAMMB73_038825 [Zea mays]
          Length = 462

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 108/368 (29%), Positives = 166/368 (45%), Gaps = 58/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V +  GTP++   L+ DTGS + +                IFDP KS+++  + C HP 
Sbjct: 121 VVTVGFGTPAQTYTLMFDTGSDVSWIQCLPCSGHCYKQHDPIFDPTKSATYSAVPCGHPQ 180

Query: 46  CTYF--KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C     KC  N  C+Y ++Y D S T G  +HET+S+      +A+  G  FGC   N G
Sbjct: 181 CAAAGGKCSSNGTCLYKVQYGDGSSTAGVLSHETLSLT---SARAL-PGFAFGCGETNLG 236

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM- 161
                  G + G++GL R  +S  SQ  +     FSYC    LP+   +  YL  GT   
Sbjct: 237 -----DFGDVDGLIGLGRGQLSLSSQAAASFGAAFSYC----LPSYNTSHGYLTIGTTTP 287

Query: 162 -----GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
                G R   T   +  ++P +FY++ L  I +    +  PP  F        G ++DS
Sbjct: 288 ASGSDGVRY--TAMIQKQDYP-SFYFVDLVSIVVGGFVLPVPPILFT-----RDGTLLDS 339

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
           G+VLTY   + Y  L ++F     +F + Q    P  +P   CY F  +     P ++F 
Sbjct: 340 GTVLTYLPPEAYTALRDRF-----KFTMTQYKPAPAYDPFDTCYDFAGQNAIFMPLVSFK 394

Query: 274 FEDA---NLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTRFVYDLNIDL 328
           F D    +L   G  +F  D       LA  P    +   ++G+ QQR+T  +YD+  + 
Sbjct: 395 FSDGSSFDLSPFGVLIFPDDTAPATGCLAFVPRPSTMPFTIVGNTQQRNTEMIYDVAAEK 454

Query: 329 LSFVKENC 336
           + FV  +C
Sbjct: 455 IGFVSGSC 462


>gi|168064509|ref|XP_001784204.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664276|gb|EDQ51002.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 367

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 160/360 (44%), Gaps = 43/360 (11%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT---Y 48
           +GTP + + L++DTGS + +              A+F+P  SSSF+ ++C    C     
Sbjct: 22  VGTPRRDMYLVVDTGSDITWLQCAPCTNCYKQKDALFNPSSSSSFKVLDCSSSLCLNLDV 81

Query: 49  FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK-GEGKAIFHGALFGCSNDNHGFDEDA 107
             C++ +C+Y   Y D S T G    + + +    G G+ +      GC +DN G     
Sbjct: 82  MGCLSNKCLYQADYGDGSFTMGELVTDNVVLDDAFGPGQVVLTNIPLGCGHDNEG----- 136

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT---SSYLKFGTDMGYR 164
             G  AG+LGL R  +SF + L +  +  FSYC    LP+ E      S L FG D    
Sbjct: 137 TFGTAAGILGLGRGPLSFPNNLDASTRNIFSYC----LPDRESDPNHKSTLVFG-DAAIP 191

Query: 165 RPSTQATKFINHPNN-----FYYLSLKDISI-DNERMNFPPDTFDITVSGEGGCIIDSGS 218
             +T + KFI    N     +YY+ +  IS+  N   N P   F +   G GG I DSG+
Sbjct: 192 HTATGSVKFIPQLRNPRVATYYYVQITGISVGGNLLTNIPASVFQLDSHGNGGTIFDSGT 251

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
            +T   +  Y  + + F +      L   +D  +    CY F        P++ F+F+ D
Sbjct: 252 TITRLEARAYTAVRDAFRA--ATMHLTSAADF-KIFDTCYDFTGMNSISVPTVTFHFQGD 308

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++R+   N  +    N+ F  A A      ++IG+ QQ+  R +YD     +  + + C
Sbjct: 309 VDMRLPPSNYIVPVSNNNIFCFAFAASMG-PSVIGNVQQQSFRVIYDNVHKQIGLLPDQC 367


>gi|449453872|ref|XP_004144680.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 757

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 115/378 (30%), Positives = 163/378 (43%), Gaps = 49/378 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           + +FIG+P K   LILDTGS L +                +DP+ S SF+ I C+ P C 
Sbjct: 198 IDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQ 257

Query: 48  YF---------KCVNEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFHGA 92
                      K   + C Y   Y D S T G      F  + T S  GK E + +    
Sbjct: 258 LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRV-ENV 316

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           +FGC + N G    A            R  +SF SQL S+    FSYCLV    +    S
Sbjct: 317 MFGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRDSDTSVS 370

Query: 153 SYLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           S L FG D      P    T  I    N  + FYYL +K I +  E++  P + ++++  
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
           G GG IIDSG+ L+YF    Y  + E F+   + ++L +  D P  +  CY +  T    
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVE--DFP-ILHPCYNVSGTDELN 487

Query: 267 FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYD 323
           FP     F D A      EN FI   +     LA+   P   L ++IG+ QQ++   +YD
Sbjct: 488 FPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL-SIIGNYQQQNFHILYD 546

Query: 324 LNIDLLSFVKENCSDDSA 341
                L +    C++  A
Sbjct: 547 TKNSRLGYAPMRCAEIEA 564


>gi|356524289|ref|XP_003530762.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Glycine max]
          Length = 392

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 155/370 (41%), Gaps = 59/370 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V + +GTP + + L+ DTGS L +               AIFDP KSSS+  I C    
Sbjct: 47  VVVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYTNITCTSSL 106

Query: 46  CTYF-------KC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
           CT         +C    +  C+Y  KY D S + GF + E +++        I    LFG
Sbjct: 107 CTQLTSDGIKSECSSSTDASCIYDAKYGDNSTSVGFLSQERLTITAT----DIVDDFLFG 162

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C  DN G    +     AG++GL R  IS + Q  S   K FSYC    LP    +  +L
Sbjct: 163 CGQDNEGLFNGS-----AGLMGLGRHPISIVQQTSSNYNKIFSYC----LPATSSSLGHL 213

Query: 156 KFGTDMGYRRPSTQAT------KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
            FG        +T A+        I+  N+FY L +  IS+   ++   P     T S  
Sbjct: 214 TFGAS-----AATNASLIYTPLSTISGDNSFYGLDIVSISVGGTKL---PAVSSSTFSA- 264

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
           GG IIDSG+V+T     VY  L   F    E++ +A  +     +  CY L        P
Sbjct: 265 GGSIIDSGTVITRLAPTVYAALRSAFRRXMEKYPVANEAGL---LDTCYDLSGYKEISVP 321

Query: 269 SMAFYFEDA-NLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            + F F     + +    +  ++ E    L  A    D+ + + G+ QQ+    VYD+  
Sbjct: 322 RIDFEFSGGVTVELXHRGILXVESEQQVCLAFAANGSDNDITVFGNVQQKTLEVVYDVKG 381

Query: 327 DLLSFVKENC 336
             + F    C
Sbjct: 382 GRIGFGAAGC 391


>gi|302756591|ref|XP_002961719.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
 gi|300170378|gb|EFJ36979.1| hypothetical protein SELMODRAFT_64161 [Selaginella moellendorffii]
          Length = 357

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 155/363 (42%), Gaps = 47/363 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IG P +   L LDTGS + +               I+DP  SSS++++ C    C 
Sbjct: 14  ARMGIGNPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQ 73

Query: 48  ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
              Y  C    C Y + Y D S + G    E+   +G     A+ + A FGC + N G  
Sbjct: 74  ALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIA-FGCGHSNSGLF 131

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGY 163
                            T+SF SQ+ + I   FSYCLV      +  SS L FG T + +
Sbjct: 132 RGEAGLLGM-----GGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPF 186

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
              + + T  + +P  N FYY  L  IS+    +  PP  F +T +G GG I+DSG+ +T
Sbjct: 187 ---AARFTPLLKNPRINTFYYAVLTGISVGGTPLPIPPAQFALTGNGTGGAILDSGTSVT 243

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYF 274
                 Y  L + + +               P    Y L   FN       + PS+  +F
Sbjct: 244 RVVPPAYAVLRDAYRAASRNL---------PPAPGVYLLDTCFNFQGLPTVQIPSLVLHF 294

Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           ++  ++ + G N+ I    +  F LA AP    +++IG+ QQ+  R  +DL   L++   
Sbjct: 295 DNGVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAP 354

Query: 334 ENC 336
             C
Sbjct: 355 REC 357


>gi|449451908|ref|XP_004143702.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
 gi|449529900|ref|XP_004171936.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 459

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 102/377 (27%), Positives = 157/377 (41%), Gaps = 52/377 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + +GTP + +LL+ DTGS L++               + F PR SSSF   +C  P C
Sbjct: 90  VDIRLGTPPQSLLLVADTGSDLVWVKCSACRNCSHHPPSSAFLPRHSSSFSPFHCFDPHC 149

Query: 47  TYFK------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                     C    ++  C +   YAD S++ GF + ET ++      +    G  FGC
Sbjct: 150 RLLPHAPHHLCNHTRLHSPCRFLYSYADGSLSSGFFSKETTTLKSLSGSEIHLKGLSFGC 209

Query: 97  SNDNHGFD-EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT---- 151
                G     A+     GV+GL R +ISF SQLG     +FSYCL+      +YT    
Sbjct: 210 GFRISGPSVSGAQFNGARGVMGLGRGSISFSSQLGRRFGNKFSYCLM------DYTLSPP 263

Query: 152 -SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFD 203
            +S+L  G  + +  P T ATK    P         FYY+++  I+ID  ++   P  ++
Sbjct: 264 PTSFLMIGGGL-HSLPLTNATKISYTPLQINPLSPTFYYITIHSITIDGVKLPINPAVWE 322

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
           I   G GG ++DSG+ LTY     Y    E   S   R +L   ++      LC      
Sbjct: 323 IDEQGNGGTVVDSGTTLTYLTKTAY---EEVLKSVRRRVKLPNAAELTPGFDLCVNASGE 379

Query: 264 FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTR 319
             R   P + F      +       + ++ E     LA+      +  ++IG+  Q+   
Sbjct: 380 SRRPSLPRLRFRLGGGAVFAPPPRNYFLETEEGVMCLAIRAVESGNGFSVIGNLMQQGFL 439

Query: 320 FVYDLNIDLLSFVKENC 336
             +D     L F +  C
Sbjct: 440 LEFDKEESRLGFTRRGC 456


>gi|302774174|ref|XP_002970504.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
 gi|300162020|gb|EFJ28634.1| hypothetical protein SELMODRAFT_93504 [Selaginella moellendorffii]
          Length = 483

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 168/369 (45%), Gaps = 51/369 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHP--- 44
           VRL +GTP++ + +++DTGS L +               IFDPR SSSFQ+I C  P   
Sbjct: 131 VRLGVGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCK 190

Query: 45  -----DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                 C+  +    +C Y + Y D S + G  + +  ++   G G      A FGC   
Sbjct: 191 ALEIHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL---GTGSKAMSVA-FGC--- 243

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL-----GSIIKKRFSYCLVIPLPNGEYTSSY 154
             GFD +      AG+LGL    +SF SQ+      S     FSYCLV        +SS 
Sbjct: 244 --GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSS 301

Query: 155 LKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           L FG       PST A +  + +P  + FYY ++  +S+   ++     +  ++ SG GG
Sbjct: 302 LIFGAAA---IPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 358

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCY-FLPETFNRFP 268
            IIDSG+ +T F + VY  + + F     R     L   P       CY F  +     P
Sbjct: 359 VIIDSGTSVTRFPTSVYATIRDAF-----RNATTNLPSAPRYSLFDTCYNFSGKASVDVP 413

Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           ++  +FE+ A+L++   N  I       F LA AP    + +IG+ QQ+  R  +DL   
Sbjct: 414 ALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKS 473

Query: 328 LLSFVKENC 336
            L+F  + C
Sbjct: 474 HLAFAPQQC 482


>gi|449519146|ref|XP_004166596.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 752

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 168/378 (44%), Gaps = 49/378 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           + +FIG+P K   LILDTGS L +                +DP+ S SF+ I C+ P C 
Sbjct: 198 IDVFIGSPPKHFSLILDTGSDLNWIQCVPCFDCFEQNGPYYDPKDSISFRNITCNDPRCQ 257

Query: 48  YF---------KCVNEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFHGA 92
                      K   + C Y   Y D S T G      F  + T S  GK E + +    
Sbjct: 258 LVSSPDPPRPCKFETQSCPYFYWYGDSSNTTGDFALETFTVNLTSSTTGKSEFRRV-ENV 316

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           +FGC + N G          AG+LGL R  +SF SQL S+    FSYCLV    +    S
Sbjct: 317 MFGCGHWNRGLFH-----GAAGLLGLGRGPLSFSSQLQSLYGHSFSYCLV-DRDSDTSVS 370

Query: 153 SYLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           S L FG D      P    T  I    N  + FYYL +K I +  E++  P + ++++  
Sbjct: 371 SKLIFGEDKDLLTHPELNFTSLIAGKENPVDTFYYLQIKSIFVGGEKLQIPEENWNLSAD 430

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
           G GG IIDSG+ L+YF    Y  + E F+   + ++L +  D P  +  CY +  T    
Sbjct: 431 GAGGTIIDSGTTLSYFSDPAYRIIKEAFLRKVKGYKLVE--DFP-ILHPCYNVSGTDELN 487

Query: 267 FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYD 323
           FP     F D A      EN FI   +     LA+   P   L ++IG+ QQ++   +YD
Sbjct: 488 FPEFLIQFADGAVWNFPVENYFIRIQQLDIVCLAMLGTPKSAL-SIIGNYQQQNFHILYD 546

Query: 324 LNIDLLSFVKENCSDDSA 341
                L +    C++  A
Sbjct: 547 TKNSRLGYAPMRCAEIEA 564


>gi|125543638|gb|EAY89777.1| hypothetical protein OsI_11319 [Oryza sativa Indica Group]
          Length = 390

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 107/370 (28%), Positives = 162/370 (43%), Gaps = 52/370 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FD  +SS+   + C+   C
Sbjct: 36  LVHLAIGTPPQPVQLTLDTGSDLIWTQCKPCVSCFDQPLPYFDTSRSSTNALLPCESTQC 95

Query: 47  ----TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
               T   CV      + C Y   Y D SVT G  A +  + +          G  FGC 
Sbjct: 96  KLDPTVTVCVKLNQTVQTCAYYTSYGDNSVTIGLLAADKFTFVAGTS----LPGVTFGCG 151

Query: 98  NDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            +N G F+ +       G+ G  R  +S  SQL       FS+C    +     ++  L 
Sbjct: 152 LNNTGVFNSNE-----TGIAGFGRGPLSLPSQLK---VGNFSHCFTT-ITGAIPSTVLLD 202

Query: 157 FGTDM-GYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
              D+    + + Q T  I +  N      YYLSLK I++ + R+  P   F +T +G G
Sbjct: 203 LPADLFSNGQGAVQTTPLIQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTG 261

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPS 269
           G IIDSG+ +T     VY  + ++F +   + +L  +         C+  P +     P 
Sbjct: 262 GTIIDSGTSITSLPPQVYQVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPK 318

Query: 270 MAFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
           +  +FE A + +  EN VF +  D  N    LA+   D+   +IG+ QQ++   +YDL  
Sbjct: 319 LVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQN 377

Query: 327 DLLSFVKENC 336
           ++LSFV   C
Sbjct: 378 NMLSFVAAQC 387


>gi|38344196|emb|CAE05761.2| OSJNBa0064G10.12 [Oryza sativa Japonica Group]
          Length = 451

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 95/361 (26%), Positives = 146/361 (40%), Gaps = 67/361 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P     L++D+GS +I+               +FDP  SSSF  ++C    C 
Sbjct: 132 VRVGVGSPPTDQYLVVDSGSDVIWVQCRPCEQCYAQTDPLFDPAASSSFSGVSCGSAICR 191

Query: 48  YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                         +C Y++ Y D S TKG  A ET+++     G     G   GC + N
Sbjct: 192 TLSGTGCGGGGDAGKCDYSVTYGDGSYTKGELALETLTL-----GGTAVQGVAIGCGHRN 246

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     AG+LGL    +S + QLG      FSYCL                   
Sbjct: 247 SGLFVGA-----AGLLGLGWGAMSLVGQLGGAAGGVFSYCLA------------------ 283

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                 S  A    +  ++FYY+ L  I +  ER+      F +T  G GG ++D+G+ +
Sbjct: 284 ------SRGAGGAGSLASSFYYVGLTGIGVGGERLPLQDSLFQLTEDGAGGVVMDTGTAV 337

Query: 221 TYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-E 275
           T    + Y  L   F   +    R     L D       CY L    + R P+++FYF +
Sbjct: 338 TRLPREAYAALRGAFDGAMGALPRSPAVSLLDT------CYDLSGYASVRVPTVSFYFDQ 391

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L +   N  +++     F LA AP    ++++G+ QQ   +   D     + F    
Sbjct: 392 GAVLTLPARN-LLVEVGGAVFCLAFAPSSSGISILGNIQQEGIQITVDSANGYVGFGPNT 450

Query: 336 C 336
           C
Sbjct: 451 C 451


>gi|356569424|ref|XP_003552901.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 536

 Score =  119 bits (297), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 114/383 (29%), Positives = 162/383 (42%), Gaps = 65/383 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           + +F+GTP K V LILDTGS L +                ++P +SSS++ I+C  P C 
Sbjct: 172 IDMFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGPHYNPNESSSYRNISCYDPRCQ 231

Query: 48  ---------YFKCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAIFH---GALF 94
                    + K  N+ C Y   YAD S T G  A ET +V +    GK  F      +F
Sbjct: 232 LVSSPDPLQHCKTENQTCPYFYDYADGSNTTGDFALETFTVNLTWPNGKEKFKHVVDVMF 291

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N GF   A            R  +SF SQL SI    FSYCL     N    SS 
Sbjct: 292 GCGHWNKGFFHGAGGLLGL-----GRGPLSFPSQLQSIYGHSFSYCLTDLFSNTS-VSSK 345

Query: 155 LKFGTDMGYRRPSTQATKFINHPN---------------NFYYLSLKDISIDNERMNFPP 199
           L FG D           + +NH N                FYYL +K I +  E ++ P 
Sbjct: 346 LIFGED----------KELLNHHNLNFTKLLAGEETPDDTFYYLQIKSIVVGGEVLDIPE 395

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
            T+  +  G GG IIDSGS LT+F    Y  + E F    ++ +L Q++     +  CY 
Sbjct: 396 KTWHWSSEGVGGTIIDSGSTLTFFPDSAYDVIKEAFE---KKIKLQQIAADDFIMSPCYN 452

Query: 260 LPETFN-RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQ 315
           +        P    +F D A      EN F     +    LA+   P+   + +IG+  Q
Sbjct: 453 VSGAMQVELPDYGIHFADGAVWNFPAENYFYQYEPDEVICLAILKTPNHSHLTIIGNLLQ 512

Query: 316 RDTRFVYDLNIDLLSFVKENCSD 338
           ++   +YD+    L +    C++
Sbjct: 513 QNFHILYDVKRSRLGYSPRRCAE 535


>gi|302762735|ref|XP_002964789.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
 gi|300167022|gb|EFJ33627.1| hypothetical protein SELMODRAFT_82470 [Selaginella moellendorffii]
          Length = 390

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 47/363 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IG+P +   L LDTGS + +               I+DP  SSS++++ C    C 
Sbjct: 47  ARMGIGSPQRSYYLELDTGSDVTWIQCAPCSSCYSQVDPIYDPSNSSSYRRVYCGSALCQ 106

Query: 48  ---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
              Y  C    C Y + Y D S + G    E+   +G     A+ + A FGC + N G  
Sbjct: 107 ALDYSACQGMGCSYRVVYGDSSASSGDLGIESF-YLGPNSSTAMRNIA-FGCGHSNSGLF 164

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGY 163
                            T+SF SQ+ + I   FSYCLV      +  SS L FG T + +
Sbjct: 165 RGEAGLLGM-----GGGTLSFFSQIAASIGPAFSYCLVDRYSQLQSRSSPLIFGRTAIPF 219

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
              + + T  + +P  + FYY  L  IS+    +  PP  F +T +G GG I+DSG+ +T
Sbjct: 220 ---AARFTPLLKNPRIDTFYYAILTGISVGGTALPIPPAQFALTGNGTGGAILDSGTSVT 276

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYF 274
                 Y  L + + +               P    Y L   FN       + PS+  +F
Sbjct: 277 RVVPAAYAVLRDAYRAASRNL---------PPAPGVYLLDTCFNFQGLPTVQIPSLVLHF 327

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           + D ++ + G N+ I    +  F LA AP    +++IG+ QQ+  R  +DL   L++   
Sbjct: 328 DNDVDMVLPGGNILIPVDRSGTFCLAFAPSSMPISVIGNVQQQTFRIGFDLQRSLIAIAP 387

Query: 334 ENC 336
             C
Sbjct: 388 REC 390


>gi|297817208|ref|XP_002876487.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322325|gb|EFH52746.1| hypothetical protein ARALYDRAFT_486375 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 520

 Score =  118 bits (296), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 110/376 (29%), Positives = 161/376 (42%), Gaps = 56/376 (14%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           + +G+P K   LILDTGS L +              A +DP+ S+S++ I C+ P C   
Sbjct: 159 VLVGSPPKHFSLILDTGSDLNWIQCLPCHDCFQQNGAFYDPKASASYKNITCNDPRCNLV 218

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGC 96
                    K  N+ C Y   Y D S T G  A ET +V     G    +++    +FGC
Sbjct: 219 SPPDPPKPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTSGGSSELYNVENMMFGC 278

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF SQL S+    FSYCLV    +    SS L 
Sbjct: 279 GHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 332

Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           FG D      P+   T F+    N    FYY+ +K I +  E +N P +T++I+  G GG
Sbjct: 333 FGEDKDLLSHPNLNFTSFVARKENLVDTFYYVQIKSIIVAGEVLNIPEETWNISSDGAGG 392

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN----- 265
            IIDSG+ L+YF    Y          F + ++A+ +    P+   +  L   FN     
Sbjct: 393 TIIDSGTTLSYFAEPAY---------EFIKNKIAEKAKGKYPVYRDFPILDPCFNVSGID 443

Query: 266 --RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
             + P +   F D A      EN FI   E+   L  +       ++IG+ QQ++   +Y
Sbjct: 444 SIQLPELGIAFADGAVWNFPTENSFIWLNEDLVCLAILGTPKSAFSIIGNYQQQNFHILY 503

Query: 323 DLNIDLLSFVKENCSD 338
           D     L +    C+D
Sbjct: 504 DTKRSRLGYAPTKCAD 519


>gi|326506192|dbj|BAJ86414.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516358|dbj|BAJ92334.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 492

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 109/366 (29%), Positives = 166/366 (45%), Gaps = 46/366 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            ++ +GTP+   L++LDTGS +++               +FDPR+S S+  + C  P C 
Sbjct: 142 TKIGVGTPATPALMVLDTGSDVVWLQCAPCRRCYEQSGQVFDPRRSRSYNAVGCAAPLCR 201

Query: 48  YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                        C+Y + Y D SVT G  A ET++  G   G  +   AL GC +DN G
Sbjct: 202 RLDSGGCDLRRSACLYQVAYGDGSVTAGDFATETLTFAG---GARVARVAL-GCGHDNEG 257

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKFGTD 160
                   A AG+LGL R ++SF +Q+     + FSYCLV      N    SS + FG+ 
Sbjct: 258 LFV-----AAAGLLGLGRGSLSFPTQISRRYGRSFSYCLVDRTSSANTASRSSTVTFGSG 312

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIID 215
                 ++  T  + +P    FYY+ L  IS+   R+    ++ D+ +   SG GG I+D
Sbjct: 313 AVGSTVASSFTPMVKNPRMETFYYVQLIGISVGGARVPGVANS-DLRLDPSSGRGGVIVD 371

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMA 271
           SG+ +T      Y  L + F     R   A L   P    L   CY L      + P+++
Sbjct: 372 SGTSVTRLARPAYSALRDAF-----RGAAAGLRLSPGGFSLFDTCYDLSGRKVVKVPTVS 426

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
            +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R V+D +   ++
Sbjct: 427 MHFAGGAEAALPPENYLIPVDSKGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVA 486

Query: 331 FVKENC 336
           F  + C
Sbjct: 487 FTPKGC 492


>gi|242050428|ref|XP_002462958.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
 gi|241926335|gb|EER99479.1| hypothetical protein SORBIDRAFT_02g035300 [Sorghum bicolor]
          Length = 460

 Score =  118 bits (296), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 100/378 (26%), Positives = 163/378 (43%), Gaps = 59/378 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V   IGTP   +  +LDTGS LI+                ++ P +S ++  ++C    
Sbjct: 101 LVDFAIGTPPLALSAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSVTYANVSCGSRL 160

Query: 46  CTYFKCVNEQ----------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF 89
           C     +                   C Y   Y D S T G  A ET +    G G  + 
Sbjct: 161 CDALPSLRPSSRCSASASAPAPERGGCTYYYSYGDGSSTDGVLATETFTF---GAGTTV- 216

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
           H   FGC  DN G  +++     +G++G+ R  +S +SQLG     +FSYC   P  N  
Sbjct: 217 HDLAFGCGTDNLGGTDNS-----SGLVGMGRGPLSLVSQLG---VTKFSYCFT-PF-NDT 266

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDI 204
            TSS L  G+      P+ ++T F+  P+     ++YYLSL+ I++ +  +   P  F +
Sbjct: 267 TTSSPLFLGSSASLS-PAAKSTPFVPSPSGPRRSSYYYLSLEGITVGDTLLPIDPAVFRL 325

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
           T SG GG IIDSG+  T      +  L     +      LA  S     + +C+  P+  
Sbjct: 326 TASGRGGLIIDSGTTFTALEERAFVVLARAVAARVA-LPLA--SGAHLGLSVCFAAPQGR 382

Query: 265 NR----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
                  P +  +F+ A++ +   +  + D       L +     + +++GS QQ++   
Sbjct: 383 GPEAVDVPRLVLHFDGADMELPRSSAVVEDRVAGVACLGIVSARGM-SVLGSMQQQNMHV 441

Query: 321 VYDLNIDLLSFVKENCSD 338
            YD+  D+LSF   NC +
Sbjct: 442 RYDVGRDVLSFEPANCGE 459


>gi|302793638|ref|XP_002978584.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
 gi|300153933|gb|EFJ20570.1| hypothetical protein SELMODRAFT_54048 [Selaginella moellendorffii]
          Length = 407

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 112/369 (30%), Positives = 168/369 (45%), Gaps = 51/369 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHP--- 44
           VRL +GTP++ + +++DTGS L +               IFDPR SSSFQ+I C  P   
Sbjct: 56  VRLGLGTPARSLFMVVDTGSDLPWLQCQPCKSCYKQADPIFDPRNSSSFQRIPCLSPLCK 115

Query: 45  -----DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                 C+  +    +C Y + Y D S + G  + +  ++   G G      A FGC   
Sbjct: 116 ALEVHSCSGSRGATSRCSYQVAYGDGSFSVGDFSSDLFTL---GTGSKAMSVA-FGC--- 168

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL-----GSIIKKRFSYCLVIPLPNGEYTSSY 154
             GFD +      AG+LGL    +SF SQ+      S     FSYCLV        +SS 
Sbjct: 169 --GFDNEGLFAGAAGLLGLGAGKLSFPSQIFASSTNSSTANSFSYCLVDRSNPMTRSSSS 226

Query: 155 LKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           L FG       PST A +  + +P  + FYY ++  +S+   ++     +  ++ SG GG
Sbjct: 227 LIFGVAA---IPSTAALSPLLKNPKLDTFYYAAMIGVSVGGAQLPISLKSLQLSQSGSGG 283

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCY-FLPETFNRFP 268
            IIDSG+ +T F + VY  + + F     R     L   P       CY F  +     P
Sbjct: 284 VIIDSGTSVTRFPTSVYATIRDAF-----RNATINLPSAPRYSLFDTCYNFSGKASVDVP 338

Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           ++  +FE+ A+L++   N  I       F LA AP    + +IG+ QQ+  R  +DL   
Sbjct: 339 ALVLHFENGADLQLPPTNYLIPINTAGSFCLAFAPTSMELGIIGNIQQQSFRIGFDLQKS 398

Query: 328 LLSFVKENC 336
            L+F  + C
Sbjct: 399 HLAFAPQQC 407


>gi|224126551|ref|XP_002329582.1| predicted protein [Populus trichocarpa]
 gi|222870291|gb|EEF07422.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 169/362 (46%), Gaps = 42/362 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L +GTP   +L I DTGS LI+               +FDP+ S +++ ++CD   C
Sbjct: 94  LMSLSLGTPPFEILAIADTGSDLIWTQCTPCDKCYKQIAPLFDPKSSKTYRDLSCDTRQC 153

Query: 47  TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C +EQ C Y+  Y D+S T G  A +T+++     G   F   + GC   N+
Sbjct: 154 QNLGESSSCSSEQLCQYSYYYGDRSFTNGNLAVDTVTLPSTNGGPVYFPKTVIGCGRRNN 213

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G   D +D   +G++GL    +S ISQ+GS +  +FSYCLV         SS L FG + 
Sbjct: 214 G-TFDKKD---SGIIGLGGGPMSLISQMGSSVGGKFSYCLVPFSSESAGNSSKLHFGRNA 269

Query: 162 GYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                  Q+T  I+ +P+ FYYL+L+ +S+ ++++             EG  IIDSG+ L
Sbjct: 270 VVSGSGVQSTPLISKNPDTFYYLTLEAMSVGDKKIE---FGGSSFGGSEGNIIIDSGTSL 326

Query: 221 TYFHSDVYWKLH---EKFVSYFERFQLAQ--LSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           T F  + + +     E  V   ER Q A   LS C  P       P+   + P +  +F 
Sbjct: 327 TLFPVNFFTEFATAVENAVINGERTQDASGLLSHCYRPT------PDL--KVPVITAHFN 378

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A++ +   N FI+  ++   L   +      A+ G+  Q +    YD+    +SF   +
Sbjct: 379 GADVVLQTLNTFILISDDVLCLAFNSTQSG--AIFGNVAQMNFLIGYDIQGKSVSFKPTD 436

Query: 336 CS 337
           C+
Sbjct: 437 CT 438


>gi|8979711|emb|CAB96832.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
          Length = 446

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 47/363 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP   + LI DTGS L +                IF+P KS+S+  ++C    
Sbjct: 105 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 164

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C    C+Y ++Y DQS + GF A E  ++        +F G  FGC 
Sbjct: 165 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTL----TNSDVFDGVYFGCG 220

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G         +AG+LGL R  +SF SQ  +   K FSYC    LP+    + +L F
Sbjct: 221 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 271

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G+    R         I    +FY L++  I++  +++  P   F        G +IDSG
Sbjct: 272 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 326

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           +V+T      Y  L   F +   ++   + +S       L  F   T    P +AF F  
Sbjct: 327 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 383

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A + +  + +F +   +   L      DD   A+ G+ QQ+    VYD     + F   
Sbjct: 384 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 443

Query: 335 NCS 337
            CS
Sbjct: 444 GCS 446


>gi|22326716|ref|NP_196638.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|18700103|gb|AAL77663.1| AT5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|24111269|gb|AAN46758.1| At5g10770/T30N20_40 [Arabidopsis thaliana]
 gi|332004211|gb|AED91594.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 474

 Score =  118 bits (295), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 151/363 (41%), Gaps = 47/363 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP   + LI DTGS L +                IF+P KS+S+  ++C    
Sbjct: 133 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 192

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C    C+Y ++Y DQS + GF A E  ++        +F G  FGC 
Sbjct: 193 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKEKFTLTNS----DVFDGVYFGCG 248

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G         +AG+LGL R  +SF SQ  +   K FSYC    LP+    + +L F
Sbjct: 249 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 299

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G+    R         I    +FY L++  I++  +++  P   F        G +IDSG
Sbjct: 300 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 354

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           +V+T      Y  L   F +   ++   + +S       L  F   T    P +AF F  
Sbjct: 355 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 411

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A + +  + +F +   +   L      DD   A+ G+ QQ+    VYD     + F   
Sbjct: 412 GAVVELGSKGIFYVFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 471

Query: 335 NCS 337
            CS
Sbjct: 472 GCS 474


>gi|326512608|dbj|BAJ99659.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 484

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 98/359 (27%), Positives = 153/359 (42%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP++ + ++ DTGS L +               +FDP +SS++  + C  P+C
Sbjct: 147 VVSMGLGTPARDMTVVFDTGSDLSWVQCTPCSDCYEQKDPLFDPARSSTYSAVPCASPEC 206

Query: 47  TYF---KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  +++C Y + Y DQS T G  A +T+++        +  G +FGC   + G
Sbjct: 207 QGLDSRSCSRDKKCRYEVVYGDQSQTDGALARDTLTLTQSD----VLPGFVFGCGEQDTG 262

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G   G++GL R  +S  SQ  S     FSYC    LP+    + YL  G    
Sbjct: 263 LF-----GRADGLVGLGREKVSLSSQAASKYGAGFSYC----LPSSPSAAGYLSLGGPAP 313

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                T      + P +FYY+ L  + +    +   P  F        G +IDSG+V+T 
Sbjct: 314 ANARFTAMETRHDSP-SFYYVRLVGVKVAGRTVRVSPIVFS-----AAGTVIDSGTVITR 367

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLR 280
               VY  L   F     R+   + +     +  CY F   T  R PS+A  F   A + 
Sbjct: 368 LPPRVYAALRSAFARSMGRYGYKR-APALSILDTCYDFTGHTTVRIPSVALVFAGGAAVG 426

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +D   V  +   +    LA AP+ D     +IG+ QQ+    VYD+    + F    CS
Sbjct: 427 LDFSGVLYVAKVSQ-ACLAFAPNGDGADAGIIGNTQQKTLAVVYDVARQKIGFGANGCS 484


>gi|302780575|ref|XP_002972062.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
 gi|300160361|gb|EFJ26979.1| hypothetical protein SELMODRAFT_96804 [Selaginella moellendorffii]
          Length = 429

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 103/381 (27%), Positives = 156/381 (40%), Gaps = 53/381 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRK------------SSSFQKI 39
           +V +  GTP + VLLI DTGS LI+           F P+K            S++   +
Sbjct: 54  LVSMAFGTPPQEVLLIADTGSDLIWLQCSTTAAPPAFCPKKACSRRPAFVASKSATLSVV 113

Query: 40  NCDHPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
            C    C                    C Y   YAD S T GF A +T ++     G A 
Sbjct: 114 PCSAAQCLLVPAPRGHGPACSPAAPVPCGYAYDYADGSSTTGFLARDTATISNGTSGGAA 173

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G  FGC   N G           GV+GL +  +SF +Q GS+  + FSYCL + L  G
Sbjct: 174 VRGVAFGCGTRNQGGSFSG----TGGVIGLGQGQLSFPAQSGSLFAQTFSYCL-LDLEGG 228

Query: 149 EY--TSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
               +SS+L  G     RR +   T  +++P    FYY+ +  I + N  +  P   + I
Sbjct: 229 RRGRSSSFLFLGRPE--RRAAFAYTPLVSNPLAPTFYYVGVVAIRVGNRVLPVPGSEWAI 286

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---- 260
            V G GG +IDSGS LTY     Y  L   F +     ++   +   + ++LCY +    
Sbjct: 287 DVLGNGGTVIDSGSTLTYLRLGAYLHLVSAFAASVHLPRIPSSATFFQGLELCYNVSSSS 346

Query: 261 --PETFNRFPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQ 315
                   FP +   F    +L +   N +++D  +    LA+ P     A  ++G+  Q
Sbjct: 347 SSAPANGGFPRLTIDFAQGLSLELPTGN-YLVDVADDVKCLAIRPTLSPFAFNVLGNLMQ 405

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           +     +D     + F +  C
Sbjct: 406 QGYHVEFDRASARIGFARTEC 426


>gi|449434646|ref|XP_004135107.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 486

 Score =  117 bits (294), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 155/355 (43%), Gaps = 43/355 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG P   V ++LDTGS + +               IF+P  S+SF  ++C+   C  
Sbjct: 154 RVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPIFEPTSSASFTSLSCETEQCKS 213

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
               +C N  C+Y + Y D S T G    ET+++     G         GC ++N G   
Sbjct: 214 LDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL-----GSTSLGNIAIGCGHNNEGLFI 268

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A              ++SF SQL +     FSYCLV        ++S L F + +    
Sbjct: 269 GAAGLLGL-----GGGSLSFPSQLNA---SSFSYCLV---DRDSDSTSTLDFNSPI---T 314

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           P         +PN   F+YL L  +S+    +  P  +F ++  G GG I+DSG+ +T  
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN-LRI 281
            + VY  L + FV      Q A+          CY L  ++    P+++F+F + N L +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRVEVPTVSFHFANGNELPL 431

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             +N  I       F  A AP D  ++++G+ QQ+ TR  +DL   L+ F    C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|15230868|ref|NP_189198.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994761|dbj|BAB03090.1| chloroplast nucleoid DNA binding protein-like; nucellin-like
           protein [Arabidopsis thaliana]
 gi|189339286|gb|ACD89063.1| At3g25700 [Arabidopsis thaliana]
 gi|332643533|gb|AEE77054.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 452

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 108/379 (28%), Positives = 161/379 (42%), Gaps = 57/379 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V L IG P + +LLI DTGS L++                +F PR SS+F   +C  P C
Sbjct: 86  VDLRIGQPPQSLLLIADTGSDLVWVKCSACRNCSHHSPATVFFPRHSSTFSPAHCYDPVC 145

Query: 47  TYFK-------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                      C    ++  C Y   YAD S+T G  A ET S+      +A      FG
Sbjct: 146 RLVPKPDRAPICNHTRIHSTCHYEYGYADGSLTSGLFARETTSLKTSSGKEARLKSVAFG 205

Query: 96  CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
           C     G      + +GA  GV+GL R  ISF SQLG     +FSYCL+      +YT  
Sbjct: 206 CGFRISGQSVSGTSFNGA-NGVMGLGRGPISFASQLGRRFGNKFSYCLM------DYTLS 258

Query: 152 ---SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITV 206
              +SYL  G + G        T  + +P    FYY+ LK + ++  ++   P  ++I  
Sbjct: 259 PPPTSYLIIG-NGGDGISKLFFTPLLTNPLSPTFYYVKLKSVFVNGAKLRIDPSIWEIDD 317

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFL----- 260
           SG GG ++DSG+ L +     Y       ++   R     ++D   P   LC  +     
Sbjct: 318 SGNGGTVVDSGTTLAFLAEPAY----RSVIAAVRRRVKLPIADALTPGFDLCVNVSGVTK 373

Query: 261 PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDT 318
           PE     P + F F    + +     + I+ E     LA+   D  V  ++IG+  Q+  
Sbjct: 374 PEKI--LPRLKFEFSGGAVFVPPPRNYFIETEEQIQCLAIQSVDPKVGFSVIGNLMQQGF 431

Query: 319 RFVYDLNIDLLSFVKENCS 337
            F +D +   L F +  C+
Sbjct: 432 LFEFDRDRSRLGFSRRGCA 450


>gi|168005153|ref|XP_001755275.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693403|gb|EDQ79755.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 429

 Score =  117 bits (294), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 105/362 (29%), Positives = 162/362 (44%), Gaps = 52/362 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSAL--------------IYAIFDPRKSSSFQKINCDHPDC 46
           ++ +  G P +    I+DTGS L              + A FDP KS+S++ + C    C
Sbjct: 91  LIDISYGNPPQKSTAIVDTGSDLNWVQCLPCKSCYETLSAKFDPSKSASYKTLGCGSNFC 150

Query: 47  T--YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
               F+     C Y   Y D S T G  + + +++   G GK       FGC N N G  
Sbjct: 151 QDLPFQSCAASCQYDYMYGDGSSTSGALSTDDVTI---GTGK--IPNVAFGCGNSNLGTF 205

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A            +  +S +SQLG    K+FSYCLV PL + + +  Y+   T  G  
Sbjct: 206 AGAGGLVGL-----GKGPLSLVSQLGGTATKKFSYCLV-PLGSTKTSPLYIGDSTLAGGV 259

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             +   T   N+P  FYY  L+ IS++ + +N+P +TFDI  +G GG I+DSG+ LTY  
Sbjct: 260 AYTPMLTNN-NYPT-FYYAELQGISVEGKAVNYPANTFDIAATGRGGLILDSGTTLTYLD 317

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFN-RFPSMAFYFE 275
            D            F     A  +  P P        ++ C+      N  +P++ F+F 
Sbjct: 318 VDA-----------FNPMVAALKAALPYPEADGSFYGLEYCFSTAGVANPTYPTVVFHFN 366

Query: 276 DANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A++ +  +N FI +D+E     LA+A      ++ G+ QQ +   V+DL    + F   
Sbjct: 367 GADVALAPDNTFIALDFEGT-TCLAMASSTGF-SIFGNIQQLNHVIVHDLVNKRIGFKSA 424

Query: 335 NC 336
           NC
Sbjct: 425 NC 426


>gi|297823357|ref|XP_002879561.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297325400|gb|EFH55820.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 447

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 113/376 (30%), Positives = 166/376 (44%), Gaps = 58/376 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + IGTP   V  I DTGS L +               IFD +KSS+++   CD  +C 
Sbjct: 87  MSITIGTPPMKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCH 146

Query: 48  YFKCVNEQC-------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                   C        Y   Y DQS +KG  A ETIS+         F G +FGC  +N
Sbjct: 147 ALSSSERGCDESKNVCKYRYSYGDQSFSKGDVATETISIDSASGSPVSFPGTVFGCGYNN 206

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKF 157
            G FDE        G   L     S ISQLGS I K+FSYCL       NG   +S +  
Sbjct: 207 GGTFDETGSGIIGLGGGHL-----SLISQLGSSISKKFSYCLSHKSATTNG---TSVINL 258

Query: 158 GTDMGYRRPSTQ-------ATKFIN-HPNNFYYLSLKDISIDNERM-----NFPPDTFDI 204
           GT+     PS+        +T  ++  P  +YYL+L+ IS+  +++     ++ P+   I
Sbjct: 259 GTN---SIPSSLSKDSGVISTPLVDKEPRTYYYLTLEAISVGKKKIPYTGSSYNPNDGGI 315

Query: 205 TVSGEGGCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP 261
                G  IIDSG+ LT   S   D +    E+ V+  +R     +SD    +  C+   
Sbjct: 316 FSETSGNIIIDSGTTLTLLDSGFFDKFGAAVEELVTGAKR-----VSDPQGLLSHCFKSG 370

Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
                 P +  +F  A++R+   N F+   E+    L++ P  + VA+ G+  Q D    
Sbjct: 371 SAEIGLPEITVHFTGADVRLSPINAFVKVSED-MVCLSMVPTTE-VAIYGNFAQMDFLVG 428

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    +SF + +CS
Sbjct: 429 YDLETRTVSFQRMDCS 444


>gi|242095586|ref|XP_002438283.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
 gi|241916506|gb|EER89650.1| hypothetical protein SORBIDRAFT_10g011110 [Sorghum bicolor]
          Length = 470

 Score =  117 bits (293), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 157/363 (43%), Gaps = 53/363 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +  L +GTP+    +++DTGS+L +                ++DPR SS++  + C    
Sbjct: 135 VTELGLGTPATSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLYDPRASSTYATVPCSASQ 194

Query: 46  CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C   +          V   C+Y   Y D S + G+ + +T+S      G   +    +GC
Sbjct: 195 CDELQAATLNPSACSVRNVCIYQASYGDSSFSVGYLSRDTVSF-----GSGSYPNFYYGC 249

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL  P   G     YL 
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTPASTG-----YLS 299

Query: 157 FGT-DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            G    G+   +  A+  ++   + Y+++L  +S+    +   P  +    +     IID
Sbjct: 300 IGPYTSGHYSYTPMASSSLDA--SLYFVTLSGMSVGGSPLAVSPAEYSSLPT-----IID 352

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+V+T   + VY  L +   +     Q A        +  C+    +  R P++A  F 
Sbjct: 353 SGTVITRLPTAVYTALSKAVAAAMVGVQSAPAFSI---LDTCFQGQASQLRVPAVAMAFA 409

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A L++  +NV +ID ++    LA AP D    +IG+ QQ+    VYD+    + F   
Sbjct: 410 GGATLKLATQNV-LIDVDDSTTCLAFAPTDS-TTIIGNTQQQTFSVVYDVAQSRIGFAAG 467

Query: 335 NCS 337
            CS
Sbjct: 468 GCS 470


>gi|359485189|ref|XP_002279141.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 546

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 157/373 (42%), Gaps = 46/373 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           + +F+GTP K   LILDTGS L +                +DP +SSS++ I C    C 
Sbjct: 183 IDVFVGTPPKHFSLILDTGSDLNWIQCVPCYECFEQNGPHYDPGQSSSYRNIGCHDSRCH 242

Query: 48  YF---------KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGAL 93
                      K  N+ C Y   Y D S T G  A ET +V      GK E + +    +
Sbjct: 243 LVSSPDPPQPCKAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV-ENVM 301

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC + N G    A            R  +SF SQL S+    FSYCLV    +    SS
Sbjct: 302 FGCGHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSDAN-VSS 355

Query: 154 YLKFGTDMGY-RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            L FG D      P    T  +    N  + FYY+ +K I +  E +N P + + I   G
Sbjct: 356 KLIFGEDKDLLSHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDG 415

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
            GG IIDSG+ L+YF    Y  + E F++  + + + +     EP   CY +        
Sbjct: 416 SGGTIIDSGTTLSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP---CYNVTGVEQPDL 472

Query: 268 PSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
           P     F D A      EN FI I+      L  +      +++IG+ QQ++   +YD  
Sbjct: 473 PDFGIVFSDGAVWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTK 532

Query: 326 IDLLSFVKENCSD 338
              L F    C+D
Sbjct: 533 KSRLGFAPTKCAD 545


>gi|224126221|ref|XP_002329620.1| predicted protein [Populus trichocarpa]
 gi|222870359|gb|EEF07490.1| predicted protein [Populus trichocarpa]
          Length = 357

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 153/357 (42%), Gaps = 43/357 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +G P++   ++LDTGS + +               IFDP  SS++  + C    C+
Sbjct: 22  TRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCS 81

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C + QC+Y + Y D S T G  A E++S    G  K +      GC +DN G  
Sbjct: 82  SLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNV----ALGCGHDNEGL- 136

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY--LKFGTDMG 162
                    G  GL  +    +S    +    FSYCLV     G  T  +   + G D  
Sbjct: 137 -------FVGAAGLLGLGGGPLSLTNQLKATSFSYCLVNRDSAGSSTLDFNSAQLGVD-- 187

Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               S  A    N   + FYY+ L  +S+  + ++ P  TF +  SG GG I+D G+ +T
Sbjct: 188 ----SVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAIT 243

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
              +  Y  L + FV   +     +L+        CY L  +   R P+++F+F D  + 
Sbjct: 244 RLQTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 300

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +   N  I       +  A AP    +++IG+ QQ+ TR  +DL  + + F    C
Sbjct: 301 NLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 357


>gi|224143371|ref|XP_002324933.1| predicted protein [Populus trichocarpa]
 gi|222866367|gb|EEF03498.1| predicted protein [Populus trichocarpa]
          Length = 463

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 101/353 (28%), Positives = 154/353 (43%), Gaps = 40/353 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDCT 47
           +V + IGTP K + LI DTGS LI+              +FDP KS+SF+ + C    C 
Sbjct: 133 IVNVGIGTPKKEMPLIFDTGSGLIWTQCKPCKACYPKVPVFDPTKSASFKGLPCSSKLCQ 192

Query: 48  YFK--CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
             +  C + +C Y   Y D S + G  A ETIS       K  F   L GCS+   G   
Sbjct: 193 SIRQGCSSPKCTYLTAYVDNSSSTGTLATETISF---SHLKYDFKNILIGCSDQVSG--- 246

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                  +G++GL+R  IS  SQ  +I  K FSYC    +P+   ++ +L FG  +    
Sbjct: 247 --ESLGESGIMGLNRSPISLASQTANIYDKLFSYC----IPSTPGSTGHLTFGGKVPNDV 300

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
             +  +K    P++ Y + +  IS+   ++      F I  +      IDSG+VLT    
Sbjct: 301 RFSPVSK--TAPSSDYDIKMTGISVGGRKLLIDASAFKIAST------IDSGAVLTRLPP 352

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA-NLRIDG 283
             Y  L   F    + + L    D    +  CY F   +    PS++ +FE    + ID 
Sbjct: 353 KAYSALRSVFREMMKGYPLLDQDDF---LDTCYDFSNYSTVAIPSISVFFEGGVEMDIDV 409

Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             +      +  + LA A  DD V++ G+ QQ+    V+D   + + F    C
Sbjct: 410 SGIMWQVPGSKVYCLAFAELDDEVSIFGNFQQKTYTVVFDGAKERIGFAPGGC 462


>gi|118484458|gb|ABK94105.1| unknown [Populus trichocarpa]
          Length = 499

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 98/357 (27%), Positives = 154/357 (43%), Gaps = 43/357 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +G P++   ++LDTGS + +               IFDP  SS++  + C    C+
Sbjct: 163 TRVGVGNPARQFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPTASSTYAPVTCQSQQCS 222

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C + QC+Y + Y D S T G  A E++S    G  K +      GC +DN G  
Sbjct: 223 SLEMSSCRSGQCLYQVNYGDGSYTFGDFATESVSFGNSGSVKNV----ALGCGHDNEGLF 278

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY--LKFGTDMG 162
             A      G   LS       +QL +     FSYCLV     G  T  +   + G D  
Sbjct: 279 VGAAGLLGLGGGPLS-----LTNQLKAT---SFSYCLVNRDSAGSSTLDFNSAQLGVD-- 328

Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               S  A    N   + FYY+ L  +S+  + ++ P  TF +  SG GG I+D G+ +T
Sbjct: 329 ----SVTAPLMKNRKIDTFYYVGLSGMSVGGQMVSIPESTFRLDESGNGGIIVDCGTAIT 384

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
              +  Y  L + FV   +     +L+        CY L  +   R P+++F+F D  + 
Sbjct: 385 RLQTQAYNPLRDAFVRMTQNL---KLTSAVALFDTCYDLSGQASVRVPTVSFHFADGKSW 441

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +   N  I       +  A AP    +++IG+ QQ+ TR  +DL  + + F    C
Sbjct: 442 NLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVTFDLANNRMGFSPNKC 498


>gi|255553149|ref|XP_002517617.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223543249|gb|EEF44781.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 449

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 160/368 (43%), Gaps = 45/368 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++R+ IG P   +L I DTGS LI+               IFDPR+SSS++ + C +  C
Sbjct: 94  LMRISIGNPQVEILAIADTGSDLIWVQCQPCEMCYKQNSPIFDPRRSSSYRNVLCGNEFC 153

Query: 47  TYF---------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK----AIFHGAL 93
                       +   + C YT  Y DQS + G  A E   +           A F    
Sbjct: 154 NKLDGEARSCDARGFVKTCGYTYSYGDQSFSDGHLAIERFGIGSTNSNTSAAIAYFQEVA 213

Query: 94  FGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           FGC   N G FDE                ++S +SQLG  +  +FSYCLV       YTS
Sbjct: 214 FGCGTKNGGTFDELGSGIIGL-----GGGSMSLVSQLGPKLSGKFSYCLVPTSEQSNYTS 268

Query: 153 SYLKFGTDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             + FG D+   G              P  +YYL+L+ IS++N+R+ +  + ++  V  +
Sbjct: 269 K-INFGNDINISGSNYNVVSTPLLPKKPETYYYLTLEAISVENKRLPY-TNLWNGEVE-K 325

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
           G  IIDSG+ LT+  S+ +  L     +  E  +  ++SD      +C F  E     P 
Sbjct: 326 GNIIIDSGTTLTFLDSEFFNNLDS---AVEEAVKGERVSDPHGLFNIC-FKDEKAIELPI 381

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +  +F  A++ +   N F    E       + P +D +A+ G+  Q +    YDL    +
Sbjct: 382 ITAHFTGADVELQPVNTF-AKVEEDLLCFTMIPSND-IAIFGNLAQMNFLVGYDLEKKAV 439

Query: 330 SFVKENCS 337
           SF+  +C+
Sbjct: 440 SFLPTDCT 447


>gi|449443786|ref|XP_004139658.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449475416|ref|XP_004154449.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 453

 Score =  117 bits (293), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 157/361 (43%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP + + ++LDTGS +++               IF+P KS SF  I C  P C 
Sbjct: 112 TRLGVGTPPRYLYMVLDTGSDVVWLQCSPCRKCYSQSDPIFNPYKSKSFAGIPCSSPLCR 171

Query: 48  YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                 C   +  C+Y + Y D S T G  A ET++  G    K        GC + N G
Sbjct: 172 RLDSSGCSTRRHTCLYQVSYGDGSFTTGDFATETLTFRGNKIAKVAL-----GCGHHNEG 226

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF SQ G     +FSYCLV    + + +S  + FG D  
Sbjct: 227 LFVGAAGLLGL-----GRGRLSFPSQTGIRFNHKFSYCLVDRSASSKPSS--MVFG-DAA 278

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
             R + + T  I +P  + FYY+ L  IS+   R+    P  F +  +G GG IIDSG+ 
Sbjct: 279 ISRLA-RFTPLIRNPKLDTFYYVGLIGISVGGVRVRGVSPSLFKLDSAGNGGVIIDSGTS 337

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
           +T      Y  L + F     R     L   PE      CY L  ++  + P++  +F  
Sbjct: 338 VTRLTRPAYTALRDAF-----RVGARHLKRGPEFSLFDTCYDLSGQSSVKVPTVVLHFRG 392

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I   EN  F  A A     +++IG+ QQ+  R VYDL    + F    C
Sbjct: 393 ADMALPATNYLIPVDENGSFCFAFAGTISGLSIIGNIQQQGFRVVYDLAGSRIGFAPRGC 452

Query: 337 S 337
           +
Sbjct: 453 T 453


>gi|326507654|dbj|BAK03220.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 104/375 (27%), Positives = 163/375 (43%), Gaps = 61/375 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDH-- 43
           ++ L IGTP      I DTGS LI+                 ++P  S++F  + C+   
Sbjct: 89  IMTLAIGTPPLSYPAIADTGSDLIWTQCAPCGSQCFKQAGQPYNPSSSTTFGVLPCNSSV 148

Query: 44  ------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
                       P C+        C+Y   Y     T G  + ET +       +    G
Sbjct: 149 SMCAALAGPSPPPGCS--------CMYNQTYG-TGWTAGIQSVETFTFGSTPADQTRVPG 199

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGCSN +     D  +G+ AG++GL R ++S +SQLG+     FSYCL  P  +   T
Sbjct: 200 IAFGCSNAS----SDDWNGS-AGLVGLGRGSMSLVSQLGA---GMFSYCLT-PFQDANST 250

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
           S+ L  G            T F+  P+      +YYL+L  ISI    ++ PP+ F +  
Sbjct: 251 STLL-LGPSAALNGTGVLTTPFVASPSKAPMSTYYYLNLTGISIGTTALSIPPNAFALRT 309

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET 263
            G GG IIDSG+ +T      Y ++     S      +A  SD    + LC+ L     T
Sbjct: 310 DGTGGLIIDSGTTITSLVDAAYQQVRAAIESLVT-LPVADGSDS-TGLDLCFALTSETST 367

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVY 322
               PSM F+F+ A++ +  +N  I+   +  + LA+       ++  G+ QQ++   +Y
Sbjct: 368 PPSMPSMTFHFDGADMVLPVDNYMILG--SGVWCLAMRNQTVGAMSTFGNYQQQNVHLLY 425

Query: 323 DLNIDLLSFVKENCS 337
           D++ + LSF    CS
Sbjct: 426 DIHEETLSFAPAKCS 440


>gi|356546370|ref|XP_003541599.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 434

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 108/365 (29%), Positives = 171/365 (46%), Gaps = 47/365 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSS-----FQKINC 41
           ++   +G P   +  I+DTGS +I+               IFDP KS++     F    C
Sbjct: 87  LISYSVGIPPFQLYGIIDTGSDMIWLQCKPCEKCYNQTTRIFDPSKSNTYKILPFSSTTC 146

Query: 42  DHPDCTYFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
              + T     N + C YT+ Y D S ++G  + ET++ +G   G ++ F   + GC  +
Sbjct: 147 QSVEDTSCSSDNRKMCEYTIYYGDGSYSQGDLSVETLT-LGSTNGSSVKFRRTVIGCGRN 205

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           N      + +G  +G++GL    +S I+QL    S I ++FSYCL   + N    SS L 
Sbjct: 206 N----TVSFEGKSSGIVGLGNGPVSLINQLRRRSSSIGRKFSYCLA-SMSN---ISSKLN 257

Query: 157 FGTDMGYRRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGCII 214
           FG         T +T  + H P  FYYL+L+  S+ N R+ F   +F     GE G  II
Sbjct: 258 FGDAAVVSGDGTVSTPIVTHDPKVFYYLTLEAFSVGNNRIEFTSSSFRF---GEKGNIII 314

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAF 272
           DSG+ LT   +D+Y KL        E   L ++ D  + + LCY    TF+    P +  
Sbjct: 315 DSGTTLTLLPNDIYSKLESAVADLVE---LDRVKDPLKQLSLCY--RSTFDELNAPVIMA 369

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +F  A+++++  N F I+ E     LA      +  + G+  Q++    YDL   ++SF 
Sbjct: 370 HFSGADVKLNAVNTF-IEVEQGVTCLAFI-SSKIGPIFGNMAQQNFLVGYDLQKKIVSFK 427

Query: 333 KENCS 337
             +CS
Sbjct: 428 PTDCS 432


>gi|125527523|gb|EAY75637.1| hypothetical protein OsI_03542 [Oryza sativa Indica Group]
          Length = 446

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 161/374 (43%), Gaps = 62/374 (16%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTPS   +L++DTGS L++               +FDPR+SS+++++ C  P C   + 
Sbjct: 92  VGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRF 151

Query: 51  -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                       C Y + Y D S + G  A + ++           +    GC  DN G 
Sbjct: 152 PGCDSGGAAGGGCRYMVAYGDGSSSTGELATDKLAF----ANDTYVNNVTLGCGRDNEGL 207

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-----SYLKFG 158
            + A     AG+LG++R  IS  +Q+       F YCL      G+ TS     SYL FG
Sbjct: 208 FDSA-----AGLLGVARGKISISTQVAPAYGSVFEYCL------GDRTSRSTRSSYLVFG 256

Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCII 214
                  PST  T  +++P   + YY+ +   S+  ER+  F   +  + T +G GG ++
Sbjct: 257 RTP--EPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET-------- 263
           DSG+ ++ F  D Y  L + F +      + +L+        CY L   P          
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           F     MA   E+  L +DG       Y      L     DD +++IG+ QQ+  R V+D
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRR---CLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 324 LNIDLLSFVKENCS 337
           +  + + F  + C+
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|168013126|ref|XP_001759252.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689565|gb|EDQ75936.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 385

 Score =  117 bits (292), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 156/360 (43%), Gaps = 39/360 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +R+ +GTP +G+ L++DTGS +++               +FDP KSS++  + C+   C 
Sbjct: 39  IRVSVGTPPRGMYLVMDTGSDILWLQCAPCVSCYHQCDEVFDPYKSSTYSTLGCNSRQCL 98

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGALFGCSNDNHGF 103
                 CV  +C+Y + Y D S + G  A + +S+    G G+ + +    GC +DN G+
Sbjct: 99  NLDVGGCVGNKCLYQVDYGDGSFSTGEFATDAVSLNSTSGGGQVVLNKIPLGCGHDNEGY 158

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            +  +SF +Q+ S    RFSYCL      G  T S  +     G 
Sbjct: 159 FVGAAGLLGL-----GKGPLSFPNQINSENGGRFSYCLT-----GRDTDSTERSSLIFGD 208

Query: 164 RRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                   +F    +N     FYYL +  IS+    +  P   F +   G GG IIDSG+
Sbjct: 209 AAVPPAGVRFTPQASNLRVSTFYYLKMTGISVGGSILTIPTSAFQLDSLGNGGVIIDSGT 268

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
            +T   +  Y  L E F +      L            CY L +  +   P++  +F+  
Sbjct: 269 SVTRLQNAAYASLREAFRAGTSDLVLTTEFSL---FDTCYNLSDLSSVDVPTVTLHFQGG 325

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L++   N  +    +  F LA A      ++IG+ QQ+  R +YD   + + FV   C
Sbjct: 326 ADLKLPASNYLVPVDNSSTFCLAFAGTTG-PSIIGNIQQQGFRVIYDNLHNQVGFVPSQC 384


>gi|242045120|ref|XP_002460431.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
 gi|241923808|gb|EER96952.1| hypothetical protein SORBIDRAFT_02g028000 [Sorghum bicolor]
          Length = 481

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 93/361 (25%), Positives = 155/361 (42%), Gaps = 42/361 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP + +L++ DTGS L +               +FDP +S+++  + C   +C
Sbjct: 139 IVSVGLGTPKRDLLVVFDTGSDLSWVQCKPCDGCYQQHDPLFDPSQSTTYSAVPCGAQEC 198

Query: 47  TYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI--FHGALFGCSNDNH 101
                  C + +C Y + Y D S T G  A +T+++       +       +FGC +D+ 
Sbjct: 199 RRLDSGSCSSGKCRYEVVYGDMSQTDGNLARDTLTLGPSSSSSSSDQLQEFVFGCGDDDT 258

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G       G   G+ GL R  +S  SQ  +     FSYCL    P+      YL  G+  
Sbjct: 259 GLF-----GKADGLFGLGRDRVSLASQAAAKYGAGFSYCL----PSSSTAEGYLSLGSAA 309

Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               P+ + T  +   +  +FYYL+L  I +    +   P  F        G +IDSG+V
Sbjct: 310 ---PPNARFTAMVTRSDTPSFYYLNLVGIKVAGRTVRVSPAVFRTP-----GTVIDSGTV 361

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED-A 277
           +T   S  Y  L   F     R+   + +     +  CY F      + PS+A  F+  A
Sbjct: 362 ITRLPSRAYAALRSSFAGLMRRYSYKR-APALSILDTCYDFTGRNKVQIPSVALLFDGGA 420

Query: 278 NLRID-GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            L +  GE +++ +        A    D  +A++G+ QQ+    VYD+    + F  + C
Sbjct: 421 TLNLGFGEVLYVANKSQACLAFASNGDDTSIAILGNMQQKTFAVVYDVANQKIGFGAKGC 480

Query: 337 S 337
           S
Sbjct: 481 S 481


>gi|224072755|ref|XP_002303865.1| predicted protein [Populus trichocarpa]
 gi|222841297|gb|EEE78844.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 159/360 (44%), Gaps = 57/360 (15%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
           + + +I+DTGS L +               +F+P KS S++ + C+   C   +      
Sbjct: 75  RKMTVIVDTGSDLSWVQCQPCNRCYNQQDPVFNPSKSPSYRTVLCNSLTCRSLQLATGNS 134

Query: 54  -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y D S T G    E +++     G    +  +FGC   N G    
Sbjct: 135 GVCGSNPPTCNYVVNYGDGSYTSGEVGMEHLNL-----GNTTVNNFIFGCGRKNQGLF-- 187

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              G  +G++GL R  +S ISQ+  +    FSYCL  P    E + S +  G    Y+  
Sbjct: 188 ---GGASGLVGLGRTDLSLISQISPMFGGVFSYCL--PTTEAEASGSLVMGGNSSVYKNT 242

Query: 167 STQA-TKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
           +  + T+ I++P   FY+L+L  I++    +  P  +F     G+   IIDSG+V++   
Sbjct: 243 TPISYTRMIHNPLLPFYFLNLTGITVGGVEVQAP--SF-----GKDRMIIDSGTVISRLP 295

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL--CYFLPETFN-RFPSMAFYFE-DANLR 280
             +Y  L  +FV  F  +  A     P  + L  C+ L      + P +  YFE  A L 
Sbjct: 296 PSIYQALKAEFVKQFSGYPSA-----PSFMILDSCFNLSGYQEVKIPDIKMYFEGSAELN 350

Query: 281 IDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +D   VF     D       +A  P++D V +IG+ QQ++ R +YD    +L F +E CS
Sbjct: 351 VDVTGVFYSVKTDASQVCLAIASLPYEDEVGIIGNYQQKNQRIIYDTKGSMLGFAEEACS 410


>gi|115472515|ref|NP_001059856.1| Os07g0532800 [Oryza sativa Japonica Group]
 gi|50508274|dbj|BAD32123.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113611392|dbj|BAF21770.1| Os07g0532800 [Oryza sativa Japonica Group]
          Length = 436

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/374 (28%), Positives = 161/374 (43%), Gaps = 65/374 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + +GTP     ++ DTGS LI+                F P  SS+F K+ C    C 
Sbjct: 88  MNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           +       C    CVY  KY     T G+ A ET+ V     G A F    FGCS +N  
Sbjct: 148 FLPNSIRTCNATGCVYNYKYGS-GYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                   + +G+ GL R  +S I QLG     RFSYCL      G   +S + FG+   
Sbjct: 200 ----GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAG---ASPILFGSLAN 249

Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
               + Q+T F+N+P    ++YY++L  I++    +     TF  T +G  GG I+DSG+
Sbjct: 250 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 309

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY----------FLPETFNR 266
            LTY   D Y  + + F+S     Q A ++  +    + LC+           +P    R
Sbjct: 310 TLTYLAKDGYEMVKQAFLS-----QTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLR 364

Query: 267 FPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           F   A Y      A +  D +    +       ++  A  D  +++IG+  Q D   +YD
Sbjct: 365 FDGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYD 420

Query: 324 LNIDLLSFVKENCS 337
           L+  + SF   +C+
Sbjct: 421 LDGGIFSFAPADCA 434


>gi|224053042|ref|XP_002297678.1| predicted protein [Populus trichocarpa]
 gi|222844936|gb|EEE82483.1| predicted protein [Populus trichocarpa]
          Length = 482

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 99/367 (26%), Positives = 168/367 (45%), Gaps = 66/367 (17%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
           + + +I+DTGS L +               +F+P  S S++ + C  P C   +      
Sbjct: 144 RKMTVIVDTGSDLSWVQCQPCKRCYNQQDPVFNPSTSPSYRTVLCSSPTCQSLQSATGNL 203

Query: 54  -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y D S T+G    E + +   G   A+ +  +FGC  +N G    
Sbjct: 204 GVCGSNPPSCNYVVNYGDGSYTRGELGTEHLDL---GNSTAV-NNFIFGCGRNNQGLF-- 257

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              G  +G++GL R ++S ISQ  ++    FSYCL  P+   E + S +  G    Y+  
Sbjct: 258 ---GGASGLVGLGRSSLSLISQTSAMFGGVFSYCL--PITETEASGSLVMGGNSSVYKNT 312

Query: 167 STQA-TKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
           +  + T+ I +P   FY+L+L  I++ +  +  P  +F     G+ G +IDSG+V+T   
Sbjct: 313 TPISYTRMIPNPQLPFYFLNLTGITVGSVAVQAP--SF-----GKDGMMIDSGTVITRLP 365

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FN-------RFPSMAFYFE- 275
             +Y  L ++FV  F  F          P    + + +T FN         P++  +FE 
Sbjct: 366 PSIYQALKDEFVKQFSGF----------PSAPAFMILDTCFNLSGYQEVEIPNIKMHFEG 415

Query: 276 DANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +A L +D   VF  +  +     LA+A   +++ V +IG+ QQ++ R +YD    +L F 
Sbjct: 416 NAELNVDVTGVFYFVKTDASQVCLAIASLSYENEVGIIGNYQQKNQRVIYDTKGSMLGFA 475

Query: 333 KENCSDD 339
            E C+ D
Sbjct: 476 AEACTFD 482


>gi|125558622|gb|EAZ04158.1| hypothetical protein OsI_26300 [Oryza sativa Indica Group]
          Length = 435

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 105/373 (28%), Positives = 161/373 (43%), Gaps = 64/373 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + +GTP     ++ DTGS LI+                F P  SS+F K+ C    C 
Sbjct: 88  MNISVGTPLLTFPVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           +       C    CVY  KY     T G+ A ET+ V     G A F    FGCS +N  
Sbjct: 148 FLPNSIRTCNATGCVYNYKYGS-GYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                   + +G+ GL R  +S I QLG     RFSYCL      G   +S + FG+   
Sbjct: 200 ----GVGNSTSGIAGLGRGALSLIPQLG---VGRFSYCLRSGSAAG---ASPILFGSLAN 249

Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
               + Q+T F+N+P    ++YY++L  I++    +     TF  T +G  GG I+DSG+
Sbjct: 250 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 309

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY---------FLPETFNRF 267
            LTY   D Y  + + F+S     Q A ++  +    + LC+          +P    RF
Sbjct: 310 TLTYLAKDGYEMVKQAFLS-----QTANVTTVNGTRGLDLCFKSTGGGGGIAVPSLVLRF 364

Query: 268 PSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
              A Y      A +  D +    +       ++  A  D  +++IG+  Q D   +YDL
Sbjct: 365 DGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYDL 420

Query: 325 NIDLLSFVKENCS 337
           +  + SF   +C+
Sbjct: 421 DGGIFSFSPADCA 433


>gi|356575389|ref|XP_003555824.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 160/362 (44%), Gaps = 51/362 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +I+               +F+P  SSS+  ++C    C+
Sbjct: 136 VRIGVGSPPRNQYVVIDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSYAGVSCASTVCS 195

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           +     C   +C Y + Y D S TKG  A ET++      G+ +      GC + N G  
Sbjct: 196 HVDNAGCHEGRCRYEVSYGDGSYTKGTLALETLTF-----GRTLIRNVAIGCGHHNQGMF 250

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A     AG+LGL    +SF+ QLG      FSYCLV     G  +S  L+FG +    
Sbjct: 251 VGA-----AGLLGLGSGPMSFVGQLGGQAGGTFSYCLV---SRGIQSSGLLQFGREA--- 299

Query: 165 RPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            P   A    I++P   +FYY+ L  + +   R+    D F ++  G+GG ++D+G+ +T
Sbjct: 300 VPVGAAWVPLIHNPRAQSFYYVGLSGLGVGGLRVPISEDVFKLSELGDGGVVMDTGTAVT 359

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFE 275
              +  Y    + F+        AQ ++ P    +     CY L    + R P+++FYF 
Sbjct: 360 RLPTAAYEAFRDAFI--------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFS 411

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               L +   N  I   +   F  A AP    +++IG+ QQ       D     + F   
Sbjct: 412 GGPILTLPARNFLIPVDDVGSFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGFVGFGPN 471

Query: 335 NC 336
            C
Sbjct: 472 VC 473


>gi|356557203|ref|XP_003546907.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 470

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 104/366 (28%), Positives = 157/366 (42%), Gaps = 57/366 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +G  S+ + +I+DTGS L +               +F P  S S+Q I C+   C   + 
Sbjct: 126 MGLGSQNMSVIVDTGSDLTWVQCEPCRSCYNQNGPLFKPSTSPSYQPILCNSTTCQSLEL 185

Query: 51  -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                    +  C Y + Y D S T G    E +     G G       +FGC  +N G 
Sbjct: 186 GACGSDPSTSATCDYVVNYGDGSYTSGELGIEKL-----GFGGISVSNFVFGCGRNNKGL 240

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
                 G  +G++GL R  +S ISQ  +     FSYCL  P  +    S  L  G   G 
Sbjct: 241 F-----GGASGLMGLGRSELSMISQTNATFGGVFSYCL--PSTDQAGASGSLVMGNQSGV 293

Query: 164 RRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            +  T        PN    NFY L+L  I +    ++    +F     G GG I+DSG+V
Sbjct: 294 FKNVTPIAYTRMLPNLQLSNFYILNLTGIDVGGVSLHVQASSF-----GNGGVILDSGTV 348

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMAFYFE 275
           ++     VY  L  KF+  F  F  A     P    +  C+ L   +  N  P+++ YFE
Sbjct: 349 ISRLAPSVYKALKAKFLEQFSGFPSA-----PGFSILDTCFNLTGYDQVN-IPTISMYFE 402

Query: 276 -DANLRIDGENVF-IIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSF 331
            +A L +D   +F ++  +     LA+A   D   + +IG+ QQR+ R +YD  +  + F
Sbjct: 403 GNAELNVDATGIFYLVKEDASRVCLALASLSDEYEMGIIGNYQQRNQRVLYDAKLSQVGF 462

Query: 332 VKENCS 337
            KE C+
Sbjct: 463 AKEPCT 468


>gi|30686482|ref|NP_850251.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|122215044|sp|Q3EBM5.1|ASPR1_ARATH RecName: Full=Probable aspartic protease At2g35615; Flags:
           Precursor
 gi|330254036|gb|AEC09130.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 447

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 110/372 (29%), Positives = 163/372 (43%), Gaps = 50/372 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + IGTP   V  I DTGS L +               IFD +KSS+++   CD  +C 
Sbjct: 87  MSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSRNCQ 146

Query: 48  YFKCV-------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       N  C Y   Y DQS +KG  A ET+S+         F G +FGC  +N
Sbjct: 147 ALSSTERGCDESNNICKYRYSYGDQSFSKGDVATETVSIDSASGSPVSFPGTVFGCGYNN 206

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSSYLKF 157
            G FDE        G   L     S ISQLGS I K+FSYCL       NG   +S +  
Sbjct: 207 GGTFDETGSGIIGLGGGHL-----SLISQLGSSISKKFSYCLSHKSATTNG---TSVINL 258

Query: 158 GTD----MGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM-----NFPPDTFDITVS 207
           GT+       +     +T  ++  P  +YYL+L+ IS+  +++     ++ P+   I   
Sbjct: 259 GTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSSYNPNDDGILSE 318

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA--QLSDCPEPIQLCYFLPETFN 265
             G  IIDSG+ LT   +  +    +KF S  E       ++SD    +  C+       
Sbjct: 319 TSGNIIIDSGTTLTLLEAGFF----DKFSSAVEESVTGAKRVSDPQGLLSHCFKSGSAEI 374

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
             P +  +F  A++R+   N F+   E+    L++ P  + VA+ G+  Q D    YDL 
Sbjct: 375 GLPEITVHFTGADVRLSPINAFVKLSED-MVCLSMVPTTE-VAIYGNFAQMDFLVGYDLE 432

Query: 326 IDLLSFVKENCS 337
              +SF   +CS
Sbjct: 433 TRTVSFQHMDCS 444


>gi|357481205|ref|XP_003610888.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512223|gb|AES93846.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 413

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 167/362 (46%), Gaps = 40/362 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L+IGTP   +   +DTGS LI+               +FDP KSS++  I+CD P C
Sbjct: 65  LMELYIGTPPIKISGTVDTGSDLIWVQCVPCLGCYNQINPMFDPLKSSTYTNISCDSPLC 124

Query: 47  --TYF-KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
              Y  +C  E+ C YT  YAD S+TKG  A ET+++           G LFGC ++N G
Sbjct: 125 YKPYIGECSPEKRCDYTYGYADSSLTKGVLAQETVTLTSNTGKPISLQGILFGCGHNNTG 184

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
              D       G++GL     S +SQ+G +   K+FS CLV P       SS + FG   
Sbjct: 185 NFNDHE----MGLIGLGGGPTSLVSQIGPLFGGKKFSQCLV-PFLTDITISSQMSFGKGS 239

Query: 162 GYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                    T  +    +   YY++L  IS+++  +       + T+  +G  ++DSG+ 
Sbjct: 240 EVLGEGVVTTPLVQREQDMTSYYVTLLGISVEDTYL-----PMNSTIE-KGNMLVDSGTP 293

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
                  +Y ++   +V    +  L  ++D P    QLCY   +T  + P++ ++FE AN
Sbjct: 294 PNILPQQLYDRV---YVEVKNKVPLEPITDDPSLGPQLCYRT-QTNLKGPTLTYHFEGAN 349

Query: 279 LRIDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           L +     FI    + +  F L      +    + G+  Q +    +DL+  ++SF   +
Sbjct: 350 LLLTPIQTFIPPTPETKGVFCLAITNCANSDPGIYGNFAQTNYLIGFDLDRQIVSFKPTD 409

Query: 336 CS 337
           C+
Sbjct: 410 CT 411


>gi|356567196|ref|XP_003551807.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 490

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 98/353 (27%), Positives = 152/353 (43%), Gaps = 53/353 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
           V + +GTP + + LI DTGS L +                IFDP KS+S+  I C    C
Sbjct: 148 VVVGLGTPKRDLSLIFDTGSDLTWTQCEPCARSCYKQQDVIFDPSKSTSYSNITCTSALC 207

Query: 47  TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           T                + C+Y ++Y D S + G+ + E ++V        +    LFGC
Sbjct: 208 TQLSTATGNDPGCSASTKACIYGIQYGDSSFSVGYFSRERLTVTATD----VVDNFLFGC 263

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             +N G       G  AG++GL R  ISF+ Q  +  +K FSYC    LP+   ++ +L 
Sbjct: 264 GQNNQGLF-----GGSAGLIGLGRHPISFVQQTAAKYRKIFSYC----LPSTSSSTGHLS 314

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           FG     R         I+  ++FY L +  I++   ++     TF       GG IIDS
Sbjct: 315 FGPAATGRYLKYTPFSTISRGSSFYGLDITAIAVGGVKLPVSSSTFST-----GGAIIDS 369

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLP--ETFNRFPSMAFY 273
           G+V+T      Y  L   F     ++  A +LS     +  CY L   + F+  P++ F 
Sbjct: 370 GTVITRLPPTAYGALRSAFRQGMSKYPSAGELS----ILDTCYDLSGYKVFS-IPTIEFS 424

Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDL 324
           F     +++  + +  +       L   A  DD  V + G+ QQR    VYD+
Sbjct: 425 FAGGVTVKLPPQGILFVASTKQVCLAFAANGDDSDVTIYGNVQQRTIEVVYDV 477


>gi|225427558|ref|XP_002266614.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 444

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 170/364 (46%), Gaps = 40/364 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + +GTP   +L I DTGS LI+               +FDP++S +++ ++CD+  C
Sbjct: 95  LMNISLGTPPVPMLGIADTGSDLIWRQCLPCPNCYEQVEPLFDPKESETYKTLDCDNEFC 154

Query: 47  TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEG-KAIFHGALFGCSNDN 100
                   C ++  C Y+  Y D+S T+G  + +T++ IG  EG  A F G  FGC +DN
Sbjct: 155 QDLGQQGSCDDDNTCTYSYSYGDRSYTRGDLSSDTLT-IGSTEGDPASFPGIAFGCGHDN 213

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            G F+E        G   LS V      QL S +  +FSYCLV PL +    SS + FG 
Sbjct: 214 GGTFNEKDGGLIGLGGGPLSLVM-----QLSSEVGGQFSYCLV-PLSSDSTVSSKINFGK 267

Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIID 215
                   T +T  I   P+ FYYL+L+ +S+ +E +    F  +        EG  IID
Sbjct: 268 SGVVSGSGTVSTPLIKGTPDTFYYLTLEGLSVGSETVAFKGFSENKSSPAAVEEGNIIID 327

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPETFNRFPSMAFYF 274
           SG+ LT    D Y  +     +        Q +  P  I  LCY         P++  +F
Sbjct: 328 SGTTLTLLPQDFYTDVESALTNAIG----GQTTTDPNGIFSLCYSSVNNL-EIPTITAHF 382

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A++++   N F +  +      ++ P  +L A+ G+  Q +    YDL  + +SF + 
Sbjct: 383 TGADVQLPPLNTF-VQVQEDLVCFSMIPSSNL-AIFGNLAQINFLVGYDLKNNKVSFKQT 440

Query: 335 NCSD 338
           +C++
Sbjct: 441 DCTE 444


>gi|18461217|dbj|BAB84414.1| chloroplast nucleoid DNA-binding protein cnd41-like [Oryza sativa
           Japonica Group]
          Length = 446

 Score =  116 bits (290), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 160/374 (42%), Gaps = 62/374 (16%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTPS   +L++DTGS L++               +FDPR+SS+++++ C  P C   + 
Sbjct: 92  VGTPSTKAMLVIDTGSDLVWLQCSPCRRCYAQRGQVFDPRRSSTYRRVPCSSPQCRALRF 151

Query: 51  -------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                       C Y + Y D S + G  A + ++           +    GC  DN G 
Sbjct: 152 PGCDSGGAAGGGCRYMVAYGDGSSSTGDLATDKLAF----ANDTYVNNVTLGCGRDNEGL 207

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-----SYLKFG 158
            + A     AG+LG+ R  IS  +Q+       F YCL      G+ TS     SYL FG
Sbjct: 208 FDSA-----AGLLGVGRGKISISTQVAPAYGSVFEYCL------GDRTSRSTRSSYLVFG 256

Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCII 214
                  PST  T  +++P   + YY+ +   S+  ER+  F   +  + T +G GG ++
Sbjct: 257 RTP--EPPSTAFTALLSNPRRPSLYYVDMAGFSVGGERVTGFSNASLALDTATGRGGVVV 314

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PET-------- 263
           DSG+ ++ F  D Y  L + F +      + +L+        CY L   P          
Sbjct: 315 DSGTAISRFARDAYAALRDAFDARARAAGMRRLAGEHSVFDACYDLRGRPAASAPLIVLH 374

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           F     MA   E+  L +DG       Y      L     DD +++IG+ QQ+  R V+D
Sbjct: 375 FAGGADMALPPENYFLPVDGGRRRAASYRR---CLGFEAADDGLSVIGNVQQQGFRVVFD 431

Query: 324 LNIDLLSFVKENCS 337
           +  + + F  + C+
Sbjct: 432 VEKERIGFAPKGCT 445


>gi|167997964|ref|XP_001751688.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162696786|gb|EDQ83123.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 418

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 109/371 (29%), Positives = 165/371 (44%), Gaps = 50/371 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           V  F+GTP +   LI+D+GS L++               ++ P  SS+F  + C   DC 
Sbjct: 66  VDFFLGTPPQKFSLIVDSGSDLLWVQCSPCRQCYAQDSPLYVPSNSSTFSPVPCLSSDCL 125

Query: 48  Y------FKC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
                  F C       C Y   YAD S +KG  A+E+ +V G    K  F     GC +
Sbjct: 126 LIPATEGFPCDFRYPGACAYEYLYADTSSSKGVFAYESATVDGVRIDKVAF-----GCGS 180

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           DN G        A  GVLGL +  +SF SQ+G     +F+YCLV  L +    SS L FG
Sbjct: 181 DNQG-----SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYL-DPTSVSSSLIFG 234

Query: 159 TDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            ++       Q T  +++P +   YY+ ++ +++  + +      ++I + G GG I DS
Sbjct: 235 DELISTIHDMQYTPIVSNPKSPTLYYVQIEKVTVGGKSLPISDSAWEIDLLGNGGSIFDS 294

Query: 217 GSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
           G+ LTY+    Y  +   F S  ++ R +  Q  D      LC  L       FPS    
Sbjct: 295 GTTLTYWFPSAYSHILAAFDSGVHYPRAESVQGLD------LCVELTGVDQPSFPSFTIE 348

Query: 274 FED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNIDLL 329
           F+D A  + + EN F +D   +   LA+A     +     IG+  Q++    YD   +L+
Sbjct: 349 FDDGAVFQPEAENYF-VDVAPNVRCLAMAGLASPLGGFNTIGNLLQQNFFVQYDREENLI 407

Query: 330 SFVKENCSDDS 340
            F    CS  S
Sbjct: 408 GFAPAKCSSHS 418


>gi|15231625|ref|NP_191467.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|15983376|gb|AAL11556.1|AF424562_1 AT3g59080/F17J16_130 [Arabidopsis thaliana]
 gi|7529751|emb|CAB86936.1| putative protein [Arabidopsis thaliana]
 gi|20466704|gb|AAM20669.1| putative protein [Arabidopsis thaliana]
 gi|23198236|gb|AAN15645.1| putative protein [Arabidopsis thaliana]
 gi|332646352|gb|AEE79873.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 535

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 108/369 (29%), Positives = 160/369 (43%), Gaps = 42/369 (11%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF 49
           + +G+P K   LILDTGS L +              A +DP+ S+S++ I C+   C   
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNWIQCLPCYDCFQQNGAFYDPKASASYKNITCNDQRCNLV 233

Query: 50  ---------KCVNEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGC 96
                    K  N+ C Y   Y D S T G  A ET +V     G    +++    +FGC
Sbjct: 234 SSPDPPMPCKSDNQSCPYYYWYGDSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGC 293

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G    A            R  +SF SQL S+    FSYCLV    +    SS L 
Sbjct: 294 GHWNRGLFHGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLI 347

Query: 157 FGTDMGY-RRPSTQATKFINHPNN----FYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           FG D      P+   T F+    N    FYY+ +K I +  E +N P +T++I+  G GG
Sbjct: 348 FGEDKDLLSHPNLNFTSFVAGKENLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGG 407

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSM 270
            IIDSG+ L+YF    Y  +  K ++   + +     D P  +  C+ +    N + P +
Sbjct: 408 TIIDSGTTLSYFAEPAYEFIKNK-IAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQLPEL 465

Query: 271 AFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
              F D A      EN FI   E+   L  +       ++IG+ QQ++   +YD     L
Sbjct: 466 GIAFADGAVWNFPTENSFIWLNEDLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRL 525

Query: 330 SFVKENCSD 338
            +    C+D
Sbjct: 526 GYAPTKCAD 534


>gi|255541796|ref|XP_002511962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549142|gb|EEF50631.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 495

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 154/357 (43%), Gaps = 44/357 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +G P+K   ++LDTGS + +               IF P  SSS+  + CD   C 
Sbjct: 161 TRVGVGNPAKSYYMVLDTGSDINWIQCQPCSDCYQQSDPIFTPAASSSYSPLTCDSQQCN 220

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C N QC Y + Y D S T G    ET+S  G G   +I      GC +DN G  
Sbjct: 221 SLQMSSCRNGQCRYQVNYGDGSFTFGDFVTETMSFGGSGTVNSI----ALGCGHDNEGLF 276

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--- 161
             A      G   LS       SQL +     FSYCLV         SS L F +     
Sbjct: 277 VGAAGLLGLGGGPLS-----LTSQLKA---TSFSYCLV---NRDSAASSTLDFNSAPVGD 325

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               P  +++K     + FYY+ L  +S+  E +  P + F +  SG+GG I+D G+ +T
Sbjct: 326 SVIAPLLKSSKI----DTFYYVGLSGMSVGGELLRIPQEVFKLDDSGDGGVIVDCGTAIT 381

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDA-NL 279
              S+ Y  L + FVS        + +        CY L  ++  + P+++F+F+   + 
Sbjct: 382 RLQSEAYNSLRDSFVSMSRHL---RSTSGVALFDTCYDLSGQSSVKVPTVSFHFDGGKSW 438

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +   N  I       +  A AP    +++IG+ QQ+ TR  +DL  + + F    C
Sbjct: 439 DLPAANYLIPVDSAGTYCFAFAPTTSSLSIIGNVQQQGTRVSFDLANNRVGFSTNKC 495


>gi|297811185|ref|XP_002873476.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319313|gb|EFH49735.1| hypothetical protein ARALYDRAFT_325615 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 475

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 149/363 (41%), Gaps = 47/363 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP   + LI DTGS L +                IF+P KS+S+  ++C    
Sbjct: 134 IVTVGLGTPKNDLSLIFDTGSDLTWTQCQPCVRTCYDQKEPIFNPSKSTSYYNVSCSSAA 193

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C    C+Y ++Y DQS + GF A +  ++        +F G  FGC 
Sbjct: 194 CGSLSSATGNAGSCSASNCIYGIQYGDQSFSVGFLAKDKFTL----TSSDVFDGVYFGCG 249

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G         +AG+LGL R  +SF SQ  +   K FSYC    LP+    + +L F
Sbjct: 250 ENNQGLFT-----GVAGLLGLGRDKLSFPSQTATAYNKIFSYC----LPSSASYTGHLTF 300

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G+    R         I    +FY L++  I++  +++  P   F        G +IDSG
Sbjct: 301 GSAGISRSVKFTPISTITDGTSFYGLNIVAITVGGQKLPIPSTVFSTP-----GALIDSG 355

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           +V+T      Y  L   F +   ++   + +S       L  F   T    P +AF F  
Sbjct: 356 TVITRLPPKAYAALRSSFKAKMSKYPTTSGVSILDTCFDLSGFKTVT---IPKVAFSFSG 412

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             +   G       ++     LA A +  D   A+ G+ QQ+    VYD     + F   
Sbjct: 413 GAVVELGSKGIFYAFKISQVCLAFAGNSDDSNAAIFGNVQQQTLEVVYDGAGGRVGFAPN 472

Query: 335 NCS 337
            CS
Sbjct: 473 GCS 475


>gi|356546372|ref|XP_003541600.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 433

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 166/363 (45%), Gaps = 43/363 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++   +GTPS  V  ILDTGS +I+               IFD  KS +++ + C    C
Sbjct: 90  LISYSVGTPSLQVFGILDTGSDIIWLQCQPCKKCYEQTTPIFDSSKSQTYKTLPCPSNTC 149

Query: 47  T----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
                 F    + C+Y++ Y D S + G  + ET++ +G   G  + F G + GC   N 
Sbjct: 150 QSVQGTFCSSRKHCLYSIHYVDGSQSLGDLSVETLT-LGSTNGSPVQFPGTVIGCGRYNA 208

Query: 102 -GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G +E       +G++GL R  +S I+QL      +FSYCLV   P     SS L FG  
Sbjct: 209 IGIEEKN-----SGIVGLGRGPMSLITQLSPSTGGKFSYCLV---PGLSTASSKLNFGNA 260

Query: 161 MGYRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNF-PPDTFDITVSGEGGCIIDSGS 218
                  T +T  F  +   FY+L+L+  S+   R+ F  P +      G+G  IIDSG+
Sbjct: 261 AVVSGRGTVSTPLFSKNGLVFYFLTLEAFSVGRNRIEFGSPGS-----GGKGNIIIDSGT 315

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN-RFPSMAFYFED 276
            LT   + VY KL        +   L ++ D  + + LCY   P+  +   P +  +F  
Sbjct: 316 TLTALPNGVYSKLEAAVA---KTVILQRVRDPNQVLGLCYKVTPDKLDASVPVITAHFSG 372

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ ++  N F +   +     A  P  +  A+ G+  Q++    YDL ++ +SF   +C
Sbjct: 373 ADVTLNAINTF-VQVADDVVCFAFQP-TETGAVFGNLAQQNLLVGYDLQMNTVSFKHTDC 430

Query: 337 SDD 339
           +  
Sbjct: 431 TKQ 433


>gi|226492465|ref|NP_001150925.1| aspartic proteinase nepenthesin-1 [Zea mays]
 gi|195642996|gb|ACG40966.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 472

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 98/372 (26%), Positives = 159/372 (42%), Gaps = 51/372 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ L IGTP      + DTGS LI+                +++P  S++F  + C+   
Sbjct: 115 LMTLAIGTPPLPYAAVADTGSDLIWTQCAPCGTQCFEQPAPLYNPASSTTFSVLPCNS-- 172

Query: 46  CTYFKCVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            +   C              C+Y   Y     T G    ET +       +A   G  FG
Sbjct: 173 -SLSMCAGALAGAAPPPGCACMYYQTYG-TGWTAGVQGSETFTFGSSAADQARVPGVAFG 230

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           CSN +   D +      AG++GL R ++S +SQLG+    RFSYCL  P  +   TS+ L
Sbjct: 231 CSNASSS-DWNGS----AGLVGLGRGSLSLVSQLGA---GRFSYCLT-PFQDTNSTSTLL 281

Query: 156 KFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
             G          ++T F+  P     + +YYL+L  IS+  + +   P  F +   G G
Sbjct: 282 -LGPSAALNGTGVRSTPFVASPARAPMSTYYYLNLTGISLGAKALPISPGAFSLKPDGTG 340

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNR--- 266
           G IIDSG+ +T   +  Y ++     S           SD    + LC+ LP   +    
Sbjct: 341 GLIIDSGTTITSLANAAYQQVRAAVKSQLVTTLPTVDGSDS-TGLDLCFALPAPTSAPPA 399

Query: 267 -FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
             PSM  +F+ A++ +  ++ ++I     + L      D  ++  G+ QQ++   +YD+ 
Sbjct: 400 VLPSMTLHFDGADMVLPADS-YMISGSGVWCLAMRNQTDGAMSTFGNYQQQNMHILYDVR 458

Query: 326 IDLLSFVKENCS 337
            + LSF    CS
Sbjct: 459 EETLSFAPAKCS 470


>gi|242053503|ref|XP_002455897.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
 gi|241927872|gb|EES01017.1| hypothetical protein SORBIDRAFT_03g026970 [Sorghum bicolor]
          Length = 493

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 104/367 (28%), Positives = 161/367 (43%), Gaps = 47/367 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            ++ +GTPS   L++LDTGS +++               +FDPR+SSS+  ++C  P C 
Sbjct: 142 TKIGVGTPSTPALMVLDTGSDVVWLQCAPCRRCYDQSGPVFDPRRSSSYGAVDCAAPLCR 201

Query: 48  YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                        C+Y + Y D SVT G  A ET++  G   G  +   AL GC +DN G
Sbjct: 202 RLDSGGCDLRRRACLYQVAYGDGSVTAGDFATETLTFAG---GARVARVAL-GCGHDNEG 257

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R ++SF +Q+     K FSYCLV    +    ++     + + 
Sbjct: 258 LFVAAAGLLGL-----GRGSLSFPTQISRRYGKSFSYCLVDRTSSSSSGAASRSRSSTVT 312

Query: 163 YRRPSTQATKF---INHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCII 214
           +  PS  A  F   + +P    FYY+ L  IS+   R+    ++ D+ +   +G GG I+
Sbjct: 313 FGPPSASAASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGGVIV 371

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSM 270
           DSG+ +T      Y  L + F     R   A L   P    L   CY L      + P++
Sbjct: 372 DSGTSVTRLARPSYSALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLGGRKVVKVPTV 426

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           + +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R V+D +   +
Sbjct: 427 SMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRV 486

Query: 330 SFVKENC 336
            F  + C
Sbjct: 487 GFAPKGC 493


>gi|255566835|ref|XP_002524401.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
 gi|223536362|gb|EEF38012.1| basic 7S globulin 2 precursor small subunit, putative [Ricinus
           communis]
          Length = 455

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 101/376 (26%), Positives = 162/376 (43%), Gaps = 50/376 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFDP---------------RKSSSFQKINCDHPDC 46
           V L IGTP + +LL+ DTGS LI+    P               R S+++  I+C  P C
Sbjct: 88  VSLRIGTPPQTLLLVADTGSDLIWVKCSPCRNCSHRSPGSAFFARHSTTYSAIHCYSPQC 147

Query: 47  TYFK------C----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                     C    ++  C Y   YAD S T GF + E +++          +G  FGC
Sbjct: 148 QLVPHPHPNPCNRTRLHSPCRYQYTYADSSTTTGFFSKEALTLNTSTGKVKKLNGLSFGC 207

Query: 97  SNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
                G      + +GA  GV+GL R  ISF SQLG     +FSYCL+      +YT   
Sbjct: 208 GFRISGPSLTGASFEGA-QGVMGLGRAPISFSSQLGRRFGSKFSYCLM------DYTLSP 260

Query: 152 --SSYLKFGTDMGY---RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI 204
             +S+L  G        ++     T  + +P    FYY+++K + ++  ++   P  + I
Sbjct: 261 PPTSFLTIGGAQNVAVSKKGIMSFTPLLINPLSPTFYYIAIKGVYVNGVKLPINPSVWSI 320

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-T 263
              G GG IIDSG+ LT+     Y ++ + F    +R +L   ++      LC  +   T
Sbjct: 321 DDLGNGGTIIDSGTTLTFITEPAYTEILKAFK---KRVKLPSPAEPTPGFDLCMNVSGVT 377

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFV 321
               P M+F     ++       + I+  +    LAV P   D   +++G+  Q+     
Sbjct: 378 RPALPRMSFNLAGGSVFSPPPRNYFIETGDQIKCLAVQPVSQDGGFSVLGNLMQQGFLLE 437

Query: 322 YDLNIDLLSFVKENCS 337
           +D +   L F +  C+
Sbjct: 438 FDRDKSRLGFTRRGCA 453


>gi|297846526|ref|XP_002891144.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297336986|gb|EFH67403.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 445

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/372 (28%), Positives = 167/372 (44%), Gaps = 48/372 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + IGTP    L I DTGS L +               +FD +KSS+++  +CD   C 
Sbjct: 87  MSISIGTPPSKFLAIADTGSDLTWVQCKPCQQCYKQNTPLFDKKKSSTYKTESCDSITCN 146

Query: 48  YFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 E        C Y   Y D+S TKG  A ETIS+         F G  FGC  +N
Sbjct: 147 ALSEHEEGCDESRNACKYRYSYGDESFTKGEVATETISIDSSSGSPVSFPGTAFGCGYNN 206

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL--VIPLPNGEYTSSYLKFG 158
            G  E+   G +     L    +S +SQLGS I K+FSYCL       NG   +S +  G
Sbjct: 207 GGTFEETGSGIIG----LGGGPLSLVSQLGSSIGKKFSYCLSHTSATTNG---TSVINLG 259

Query: 159 TDMGYRRPSTQA----TKFINH-PNNFYYLSLKDISIDNERMNFPPD---TFDITVSGEG 210
           T+    +PS  +    T  I   P  +Y+L+L+ I++   ++ +      + +      G
Sbjct: 260 TNSMTSKPSKDSAILTTPLIQKDPETYYFLTLEAITVGKTKLPYTGGGGYSLNRKSKKTG 319

Query: 211 GCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
             IIDSG+ LT   S   D +  + E+ V+  +R     +SD    +  C+   +     
Sbjct: 320 NIIIDSGTTLTLLDSGFYDDFGAVVEESVTGAKR-----VSDPQGILTHCFKSGDKEIGL 374

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           P++  +F  A++++   N F+   E+    L++ P  + VA+ G+  Q D    YDL   
Sbjct: 375 PTITMHFTGADVKLSPINSFVKLSED-IVCLSMIPTTE-VAIYGNMVQMDFLVGYDLETK 432

Query: 328 LLSFVKENCSDD 339
            +SF + +CS +
Sbjct: 433 TVSFQRMDCSGN 444


>gi|54290717|dbj|BAD62387.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|215734915|dbj|BAG95637.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 469

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 156/363 (42%), Gaps = 51/363 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + RL +GTP+   ++++DTGS+L +                +FDPR S ++  + C   +
Sbjct: 132 VTRLGLGTPATSYVMVVDTGSSLTWLQCSPCSVSCHRQAGPVFDPRASGTYAAVQCSSSE 191

Query: 46  CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C   +          V+  C+Y   Y D S + G+ + +T+S      G   F G  +GC
Sbjct: 192 CGELQAATLNPSACSVSNVCIYQASYGDSSYSVGYLSKDTVSF-----GSGSFPGFYYGC 246

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL++  +S + QL   +   FSYC    LP     + YL 
Sbjct: 247 GQDNEGLF-----GRSAGLIGLAKNKLSLLYQLAPSLGYAFSYC----LPTSSAAAGYLS 297

Query: 157 FGT-DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            G+ + G    +  A+  ++   + Y+++L  IS+    +  PP  +    +     IID
Sbjct: 298 IGSYNPGQYSYTPMASSSLD--ASLYFVTLSGISVAGAPLAVPPSEYRSLPT-----IID 350

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+V+T    +VY  L     +          +     +  C+       R P +   F 
Sbjct: 351 SGTVITRLPPNVYTALSRAVAAAMASAAPRAPTY--SILDTCFRGSAAGLRVPRVDMAFA 408

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A L +   NV +ID ++    LA AP     A+IG+ QQ+    VYD+    + F   
Sbjct: 409 GGATLALSPGNV-LIDVDDSTTCLAFAPTGG-TAIIGNTQQQTFSVVYDVAQSRIGFAAG 466

Query: 335 NCS 337
            CS
Sbjct: 467 GCS 469


>gi|302811785|ref|XP_002987581.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
 gi|300144735|gb|EFJ11417.1| hypothetical protein SELMODRAFT_426333 [Selaginella moellendorffii]
          Length = 511

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/381 (27%), Positives = 164/381 (43%), Gaps = 53/381 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           V L +GTP+  V+LI+DTGS + +                F+PR SSSF K+ C    CT
Sbjct: 141 VPLQVGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASSTCT 200

Query: 48  --------YFKCVNEQCVYTMKYADQSVTKGFAAHETIS--VIGKGEGKAI-FHGALFGC 96
                   +       C+++++Y D S++ G  A ETI+      G+G+ +       GC
Sbjct: 201 NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITLGC 260

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           ++     D +      +G+LG+ R  ISF SQL S   ++FS+C   P       SS L 
Sbjct: 261 AD----IDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGLV 314

Query: 157 FGTDMGYRRPSTQATKFINHPN------NFYYLSLKDISIDNERMNFPPDTFDI-TVSGE 209
           F  +     P  + T  + +P       ++YY+ L  IS+D  R+      FDI  V+G 
Sbjct: 315 FFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGS 374

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD------CPEPIQLCYFLPET 263
           GG IIDSG+  TY     +  +  +F++      LA++ D      C         L  T
Sbjct: 375 GGTIIDSGTAFTYLKKPAFQAMRREFLARTS--HLAKVDDNSGFTPCYNITSGTAALEST 432

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDL-VALIGSQQQRDT 318
               PS+  +F      +  +N  +I      E     LA     D+   +IG+ QQ++ 
Sbjct: 433 I--LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFLMSGDIPFNIIGNYQQQNL 490

Query: 319 RFVYDLNIDLLSFVKENCSDD 339
              YDL    L      C+ D
Sbjct: 491 WVEYDLEKLRLGIAPAQCATD 511


>gi|449530542|ref|XP_004172253.1| PREDICTED: LOW QUALITY PROTEIN: protein ASPARTIC PROTEASE IN GUARD
           CELL 1-like [Cucumis sativus]
          Length = 486

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 98/355 (27%), Positives = 154/355 (43%), Gaps = 43/355 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTY 48
           R+ IG P   V ++LDTGS + +                F+P  S+SF  ++C+   C  
Sbjct: 154 RVGIGRPPSPVYMVLDTGSDVSWVQCAPCAECYEQTDPXFEPTSSASFTSLSCETEQCKS 213

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
               +C N  C+Y + Y D S T G    ET+++     G         GC ++N G   
Sbjct: 214 LDVSECRNGTCLYEVSYGDGSYTVGDFVTETVTL-----GSTSLGNIAIGCGHNNEGLFI 268

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A              ++SF SQL +     FSYCLV        ++S L F + +    
Sbjct: 269 GAAGLLGL-----GGGSLSFPSQLNA---SSFSYCLV---DRDSDSTSTLDFNSPI---T 314

Query: 166 PSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           P         +PN   F+YL L  +S+    +  P  +F ++  G GG I+DSG+ +T  
Sbjct: 315 PDAVTAPLHRNPNLDTFFYLGLTGMSVGGAVLPIPETSFQMSEDGNGGIIVDSGTAVTRL 374

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYFEDAN-LRI 281
            + VY  L + FV      Q A+          CY L  ++    P+++F+F + N L +
Sbjct: 375 QTTVYNVLRDAFVKSTHDLQTARGVAL---FDTCYDLSSKSRVEVPTVSFHFANGNELPL 431

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             +N  I       F  A AP D  ++++G+ QQ+ TR  +DL   L+ F    C
Sbjct: 432 PAKNYLIPVDSEGTFCFAFAPTDSTLSILGNAQQQGTRVGFDLANSLVGFSPNKC 486


>gi|302822373|ref|XP_002992845.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
 gi|300139393|gb|EFJ06135.1| hypothetical protein SELMODRAFT_136051 [Selaginella moellendorffii]
          Length = 510

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 106/381 (27%), Positives = 164/381 (43%), Gaps = 53/381 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           V L +GTP+  V+LI+DTGS + +                F+PR SSSF K+ C    CT
Sbjct: 140 VPLQLGTPAVEVVLIMDTGSDVSWIQCVPCKDCVPALRPPFNPRHSSSFFKLPCASSTCT 199

Query: 48  --------YFKCVNEQCVYTMKYADQSVTKGFAAHETIS--VIGKGEGKAI-FHGALFGC 96
                   +       C+++++Y D S++ G  A ETI+      G+G+ +       GC
Sbjct: 200 NVYQGVKPFCSPSGRTCLFSIQYGDGSLSSGLLAMETIAGNTPNFGDGEPVKLSNITLGC 259

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           ++     D +      +G+LG+ R  ISF SQL S   ++FS+C   P       SS L 
Sbjct: 260 AD----IDREGLPTGASGLLGMDRRPISFPSQLSSRYARKFSHCF--PDKIAHLNSSGLV 313

Query: 157 FGTDMGYRRPSTQATKFINHPN------NFYYLSLKDISIDNERMNFPPDTFDI-TVSGE 209
           F  +     P  + T  + +P       ++YY+ L  IS+D  R+      FDI  V+G 
Sbjct: 314 FFGESDIISPYLRYTPLVQNPAVPSASLDYYYVGLVGISVDESRLPLSHKNFDIDKVTGS 373

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD------CPEPIQLCYFLPET 263
           GG IIDSG+  TY     +  +  +F++      LA++ D      C         L  T
Sbjct: 374 GGTIIDSGTAFTYLKKPAFQAMRREFLARTS--HLAKVDDNSGFTPCYNITSGTAALEST 431

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDL-VALIGSQQQRDT 318
               PS+  +F      +  +N  +I      E     LA     D+   +IG+ QQ++ 
Sbjct: 432 I--LPSITLHFRGGLDVVLPKNSILIPVSSSEEQTTLCLAFQMSGDIPFNIIGNYQQQNL 489

Query: 319 RFVYDLNIDLLSFVKENCSDD 339
              YDL    L      C+ D
Sbjct: 490 WVEYDLEKLRLGIAPAQCATD 510


>gi|357163818|ref|XP_003579856.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 467

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 100/394 (25%), Positives = 172/394 (43%), Gaps = 77/394 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V+L +GTP       +DT S LI+               +F+P  S+S+  + C+   C
Sbjct: 89  LVKLGLGTPQHCFTAAIDTASDLIWTQCQPCVKCYKQLDPVFNPVASTSYAVVPCNSDTC 148

Query: 47  TYF---KCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                 +C  +        C YT  Y   + T+G  A + +++     G  +F G +FGC
Sbjct: 149 DELDTHRCARDGDSDDEDACQYTYSYGGNATTRGILAVDRLAI-----GDDVFRGVVFGC 203

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           S+ + G         ++GV+GL R  +S +SQL     +RF YCL  P+     ++  L 
Sbjct: 204 SSSSVGGPPPQ----VSGVVGLGRGALSLVSQLSV---RRFMYCLPPPV---SRSAGRLV 253

Query: 157 FGTDMG--YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPP-DTFDITVSGEG 210
            G D     R  S +    ++  +   ++YYL+L  ISI +  M+F   +  + T  G  
Sbjct: 254 LGADAAATVRNASERVVVPMSTGSRYPSYYYLNLDGISIGDRAMSFRSRNRMNATTPGTA 313

Query: 211 ------------------------GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
                                   G IID  S +T+    +Y ++ +      E  +L +
Sbjct: 314 AGAPASPVSGSGDGDGSGTGPDAYGMIIDIASTITFLEESLYEEMVDDL---EEEIRLPR 370

Query: 247 LSDCPEPIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP 302
            S     + LC+ LPE    +R   P ++  FE   LR+D E +F+ D  +    L V  
Sbjct: 371 GSGSDLGLDLCFILPEGVPMSRVYAPPVSLAFEGVWLRLDKEQMFVEDRASGMMCLMVGK 430

Query: 303 HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            D  V+++G+ QQ++ + +Y+L    ++F+K  C
Sbjct: 431 TDG-VSILGNYQQQNMQVMYNLRRGRITFIKTAC 463


>gi|413921976|gb|AFW61908.1| hypothetical protein ZEAMMB73_608282 [Zea mays]
          Length = 459

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 99/373 (26%), Positives = 156/373 (41%), Gaps = 47/373 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP    + + DTGS L +               I+D   S+SF  + C    C
Sbjct: 96  LMELAIGTPPVPFVALADTGSDLTWTQCKPCKLCFPQDTPIYDTAASASFSPVPCASATC 155

Query: 47  TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK----AIFHGALF 94
                            C Y   Y D + + G    ET++  G   G         G  F
Sbjct: 156 LPIWRSSRNCTATTTSPCRYRYAYDDGAYSAGVLGTETLTFAGSSPGAPGPGVSVGGVAF 215

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC  DN G   ++      G +GL R ++S ++QLG     +FSYCL     N    S  
Sbjct: 216 GCGVDNGGLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLT-DFFNTSLGSPV 266

Query: 155 LKFGTDMGYRRPST------QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITV 206
           L FG+      PST      Q+T  +  P N   YY+SL+ IS+ + R+  P  TFD+  
Sbjct: 267 L-FGSLAELAAPSTIGGAAVQSTPLVQGPYNPSRYYVSLEGISLGDARLPIPNGTFDLRD 325

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR 266
            G GG I+DSG++ T      +  +         +  +   S    P        +    
Sbjct: 326 DGSGGMIVDSGTIFTVLVESAFRVVVNHVAGVLNQ-PVVNASSLDSPCFPATAGEQQLPD 384

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDL 324
            P M  +F   A++R+  +N    + E+  F L +A       +++G+ QQ++ + ++D+
Sbjct: 385 MPDMLLHFAGGADMRLHRDNYMSFNQESSSFCLNIAGAPSAYGSILGNFQQQNIQMLFDI 444

Query: 325 NIDLLSFVKENCS 337
            +  LSFV  +CS
Sbjct: 445 TVGQLSFVPTDCS 457


>gi|357143847|ref|XP_003573077.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 420

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 162/363 (44%), Gaps = 42/363 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IG P    + + DTGS L +               ++DP  SS+F  + C    C
Sbjct: 72  LMELAIGKPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPLPCSSATC 131

Query: 47  TYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                 N      C Y   Y D + + G    ET++ +G         G  FGC  DN G
Sbjct: 132 LPIWSRNCTPSSLCRYRYAYGDGAYSAGILGTETLT-LGPSSAPVSVGGVAFGCGTDNGG 190

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--D 160
              ++      G +GL R T+S ++QLG     +FSYCL     N    S +L  GT  +
Sbjct: 191 DSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLTDFF-NSALDSPFL-LGTLAE 240

Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
           +     + Q+T  +  P N   Y++SL+ IS+ + R+  P  TFD+   G GG I+DSG+
Sbjct: 241 LAPGPSTVQSTPLLQSPQNPSRYFVSLQGISLGDVRLPIPNGTFDLRGDGTGGMIVDSGT 300

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQL-AQLSDCPEPIQLCYFLPETFNRF-PSMAFYFE- 275
             T      + ++  +      +  + A   D P     C+  P     + P +  +F  
Sbjct: 301 TFTILAESGFREVVGRVARVLGQPPVNASSLDAP-----CFPAPAGEPPYMPDLVLHFAG 355

Query: 276 DANLRIDGENVFIIDYENHFFLLAVA-PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A++R+  +N    + E+  F L +A    +  +++G+ QQ++ + ++D  +  LSF+  
Sbjct: 356 GADMRLYRDNYMSYNEEDSSFCLNIAGTTPESTSVLGNFQQQNIQMLFDTTVGQLSFLPT 415

Query: 335 NCS 337
           +CS
Sbjct: 416 DCS 418


>gi|148908573|gb|ABR17396.1| unknown [Picea sitchensis]
          Length = 350

 Score =  115 bits (287), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 157/359 (43%), Gaps = 43/359 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IGTP++   ++LDTGS +++               IF+P  S SF  + CD   C+
Sbjct: 10  TRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAVCS 69

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C    C+Y + Y D S T G  A ET++      G         GC +DN G  
Sbjct: 70  QLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTF-----GTTSIQNVAIGCGHDNVGLF 124

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF +QLG+   + FSYCLV        +S  L+FG +    
Sbjct: 125 VGAAGLLGL-----GAGSLSFPAQLGTQTGRAFSYCLV---DRDSESSGTLEFGPE---S 173

Query: 165 RP-STQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSV 219
            P  +  T  + +P    FYYLS+  IS+    ++  P + F I   +G GG IIDSG+ 
Sbjct: 174 VPIGSIFTPLVANPFLPTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 233

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-A 277
           +T   +  Y  L + F++  +    A   D       CY L    +   P++ F+F + A
Sbjct: 234 VTRLQTSAYDALRDAFIAGTQHLPRA---DGISIFDTCYDLSALQSVSIPAVGFHFSNGA 290

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              +  +N  I       F  A AP D  ++++G+ QQ+  R  +D    L+ F  + C
Sbjct: 291 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 349


>gi|116787367|gb|ABK24480.1| unknown [Picea sitchensis]
          Length = 496

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 101/359 (28%), Positives = 156/359 (43%), Gaps = 43/359 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ IGTP++   ++LDTGS +++               IF+P  S SF  + CD   C+
Sbjct: 156 TRIGIGTPTREQYMVLDTGSDVVWIQCEPCRECYSQADPIFNPSSSVSFSTVGCDSAVCS 215

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C    C+Y + Y D S T G  A ET++      G         GC +DN G  
Sbjct: 216 QLDANDCHGGGCLYEVSYGDGSYTVGSYATETLTF-----GTTSIQNVAIGCGHDNVGLF 270

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---M 161
             A              ++SF +QLG+   + FSYCLV        +S  L+FG +   +
Sbjct: 271 VGAAGLLGL-----GAGSLSFPAQLGTQTGRAFSYCLV---DRDSESSGTLEFGPESVPI 322

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSV 219
           G       A  F+     FYYLS+  IS+    ++  P + F I   +G GG IIDSG+ 
Sbjct: 323 GSIFTPLVANPFL---PTFYYLSMVAISVGGVILDSVPSEAFRIDETTGRGGIIIDSGTA 379

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-A 277
           +T   +  Y  L + F++  +    A   D       CY L    +   P++ F+F + A
Sbjct: 380 VTRLQTSAYDALRDAFIAGTQHLPRA---DGISIFDTCYDLSALQSVSIPAVGFHFSNGA 436

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              +  +N  I       F  A AP D  ++++G+ QQ+  R  +D    L+ F  + C
Sbjct: 437 GFILPAKNCLIPMDSMGTFCFAFAPADSNLSIMGNIQQQGIRVSFDSANSLVGFAIDQC 495


>gi|356524287|ref|XP_003530761.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 481

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 106/368 (28%), Positives = 155/368 (42%), Gaps = 58/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + +GTP + + L+ DTGS L +               AIFDP KSSS+  I C    C
Sbjct: 138 VVVGLGTPKRDLSLVFDTGSDLTWTQCEPCAGSCYKQQDAIFDPSKSSSYINITCTSSLC 197

Query: 47  TYF-------KCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           T         +C +    C+Y ++Y D+S + GF + E +++        I    LFGC 
Sbjct: 198 TQLTSAGIKSRCSSSTTACIYGIQYGDKSTSVGFLSQERLTITATD----IVDDFLFGCG 253

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            DN G    +     AG++GL R  ISF+ Q  SI  K FSYC    LP+   +  +L F
Sbjct: 254 QDNEGLFSGS-----AGLIGLGRHPISFVQQTSSIYNKIFSYC----LPSTSSSLGHLTF 304

Query: 158 G------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G       ++ Y   ST     I+  N FY L +  IS+   ++   P     T S  GG
Sbjct: 305 GASAATNANLKYTPLST-----ISGDNTFYGLDIVGISVGGTKL---PAVSSSTFSA-GG 355

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSM 270
            IIDSG+V+T      Y  L   F    E++ +A           CY F        P +
Sbjct: 356 SIIDSGTVITRLAPTAYAALRSAFRQGMEKYPVANEDGL---FDTCYDFSGYKEISVPKI 412

Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            F F     + +    + I        L  A   +D+ + + G+ QQ+    VYD+    
Sbjct: 413 DFEFAGGVTVELPLVGILIGRSAQQVCLAFAANGNDNDITIFGNVQQKTLEVVYDVEGGR 472

Query: 329 LSFVKENC 336
           + F    C
Sbjct: 473 IGFGAAGC 480


>gi|449466304|ref|XP_004150866.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449485213|ref|XP_004157102.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 473

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 98/365 (26%), Positives = 157/365 (43%), Gaps = 52/365 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP K + LI DTGS L +                +F P +S+++  I+C  PD
Sbjct: 132 IVSVGLGTPKKYLSLIFDTGSDLTWTQCQPCARYCYNQKDPVFVPSQSTTYSNISCSSPD 191

Query: 46  CTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+  +              C+Y ++Y DQS + G+ A ET+++        +    LFGC
Sbjct: 192 CSQLESGTGNQPGCSAARACIYGIQYGDQSFSVGYFAKETLTLT----STDVIENFLFGC 247

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             +N G       G+ AG++GL +  IS + Q      + FSYC    LP    ++ YL 
Sbjct: 248 GQNNRGL-----FGSAAGLIGLGQDKISIVKQTAQKYGQVFSYC----LPKTSSSTGYLT 298

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           FG   G            +   NFY + +  + +   ++      F  +     G IIDS
Sbjct: 299 FGGGGGGGALKYTPITKAHGVANFYGVDIVGMKVGGTQIPISSSVFSTS-----GAIIDS 353

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNRFPSMAFY 273
           G+V+T    D Y  L     S FE+  +A+    PE   +  CY L + +  + P + F 
Sbjct: 354 GTVITRLPPDAYSALK----SAFEK-GMAKYPKAPELSILDTCYDLSKYSTIQIPKVGFV 408

Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSF 331
           F+    L +DG  +      +   L      D   VA+IG+ QQ+  + VYD+    + F
Sbjct: 409 FKGGEELDLDGIGIMYGASTSQVCLAFAGNQDPSTVAIIGNVQQKTLQVVYDVGGGKIGF 468

Query: 332 VKENC 336
               C
Sbjct: 469 GYNGC 473


>gi|357450869|ref|XP_003595711.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355484759|gb|AES65962.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 443

 Score =  114 bits (286), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 100/367 (27%), Positives = 162/367 (44%), Gaps = 45/367 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP       +DTGS LI+               +FDP+ SS++  I      C
Sbjct: 60  LMELSIGTPPVKTYAQVDTGSDLIWLQCIPCTNCYKQLNPMFDPQSSSTYSNIAYGSESC 119

Query: 47  TYF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           +      C  +Q  C YT  Y D S+T+G  A ET+++           G +FGC ++N+
Sbjct: 120 SKLYSTSCSPDQNNCNYTYSYEDDSITEGVLAQETLTLTSTTGKPVALKGVIFGCGHNNN 179

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G   D       G++GL R  +S +SQ+GS    K FS CLV P       +S + FG  
Sbjct: 180 GVFNDKE----MGIIGLGRGPLSLVSQIGSSFGGKMFSQCLV-PFHTNPSITSPMSFGKG 234

Query: 161 MGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGEGGCIIDSG 217
                    +T  +  N    FY+++L  IS+  E +N P  D   +    +G  +IDSG
Sbjct: 235 SEVLGNGVVSTPLVSKNTHQAFYFVTLLGISV--EDINLPFNDGSSLEPITKGNMVIDSG 292

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI------QLCYFLPETFNRFPSMA 271
           +  T    D Y +L E+  +        +++  P PI      QLCY  P    +  ++ 
Sbjct: 293 TPTTLLPEDFYHRLVEEVRN--------KVALDPIPIDPTLGYQLCYRTPTNL-KGTTLT 343

Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
            +FE A++ +    +FI   +  F     +   +   + G+  Q +    +DL   L+SF
Sbjct: 344 AHFEGADVLLTPTQIFIPVQDGIFCFAFTSTFSNEYGIYGNHAQSNYLIGFDLEKQLVSF 403

Query: 332 VKENCSD 338
              +C++
Sbjct: 404 KATDCTN 410


>gi|357122568|ref|XP_003562987.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 455

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 166/380 (43%), Gaps = 55/380 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQK 38
           ++ L IGTP      I DTGS LI+                       +++P  S++F  
Sbjct: 88  IMTLSIGTPPLSYRAIADTGSDLIWTQCAPCGDTVTDTDNQCFKQSGCLYNPSSSTTFGV 147

Query: 39  INCDHP--DCTYFKCVNE----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHG 91
           + C+ P   C      +      C+Y   Y     T G  + ET +        A+    
Sbjct: 148 LPCNSPLSMCAAMAGPSPPPGCACMYNQTYG-TGWTAGVQSVETFTFGSSSTPPAVRVPN 206

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGCSN +     +  +G+ AG++GL R ++S +SQLG+     FSYCL  P  +   T
Sbjct: 207 IAFGCSNAS----SNDWNGS-AGLVGLGRGSMSLVSQLGA---GAFSYCLT-PFQDANST 257

Query: 152 SSYLKFGTDMGYRRPST---QATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFD 203
           S+ L  G         T   ++T F+  P+      +YYL+L  IS+    +  PPD F 
Sbjct: 258 STLL-LGPSAAAALKGTGPVRSTPFVAGPSKAPMSTYYYLNLTGISVGETALAIPPDAFS 316

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPE 262
           +   G GG IIDSG+ +T      Y ++     S    R  LA   D    + LC+ L  
Sbjct: 317 LRADGTGGLIIDSGTTITTLVDSAYQQVRAAVRSLLVTRLPLAHGPDHSTGLDLCFALKA 376

Query: 263 TF--NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDT 318
           +      PSM  +FE  A++ +  EN  I+   +  + LA+       ++++G+ QQ++ 
Sbjct: 377 STPPPAMPSMTLHFEGGADMVLPVENYMILG--SGVWCLAMRNQTVGAMSMVGNYQQQNI 434

Query: 319 RFVYDLNIDLLSFVKENCSD 338
             +YD+  + LSF    CS 
Sbjct: 435 HVLYDVRKETLSFAPAVCSS 454


>gi|224142011|ref|XP_002324354.1| predicted protein [Populus trichocarpa]
 gi|222865788|gb|EEF02919.1| predicted protein [Populus trichocarpa]
          Length = 471

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 102/366 (27%), Positives = 151/366 (41%), Gaps = 59/366 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
           V + +GTP K   L+ DTGS L +                 FDP KS+S++ ++C    C
Sbjct: 134 VTVGLGTPKKDFSLLFDTGSDLTWTQCEPCSGGCFPQNDEKFDPTKSTSYKNLSCSSEPC 193

Query: 47  TYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                 + Q       C+Y +KY     T GF A ET+++        +F   + GC   
Sbjct: 194 KSIGKESAQGCSSSNSCLYGVKYG-TGYTVGFLATETLTITPS----DVFENFVIGCGER 248

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G     R    AG+LGL R  ++  SQ  S  K  FSYC    LP    ++ +L FG 
Sbjct: 249 NGG-----RFSGTAGLLGLGRSPVALPSQTSSTYKNLFSYC----LPASSSSTGHLSFGG 299

Query: 160 DMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            +      +QA KF    +     Y L +  IS+   ++   P  F        G IIDS
Sbjct: 300 GV------SQAAKFTPITSKIPELYGLDVSGISVGGRKLPIDPSVFRTA-----GTIIDS 348

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
           G+ LTY  S  +  L   F      + L + +     +Q CY   +  N     P ++ +
Sbjct: 349 GTTLTYLPSTAHSALSSAFQEMMTNYTLTKGT---SGLQPCYDFSKHANDNITIPQISIF 405

Query: 274 FEDA-NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           FE    + ID   +FI         LA     +D  VA+ G+ QQ+    VYD+   ++ 
Sbjct: 406 FEGGVEVDIDDSGIFIAANGLEEVCLAFKDNGNDTDVAIFGNVQQKTYEVVYDVAKGMVG 465

Query: 331 FVKENC 336
           F    C
Sbjct: 466 FAPGGC 471


>gi|357124861|ref|XP_003564115.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 477

 Score =  114 bits (286), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 110/394 (27%), Positives = 157/394 (39%), Gaps = 72/394 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V L +GTP + V L LDTGS L++                + DP  SS+   + CD P 
Sbjct: 95  LVHLSVGTPPRPVALTLDTGSDLVWTQCAPCLNCFDQGAIPVLDPAASSTHAAVRCDAPV 154

Query: 46  CTYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGA 92
           C      +            CVY   Y D+S+T G  A +  +  G G+   G  +    
Sbjct: 155 CRALPFTSCGRGGSSWGERSCVYVYHYGDKSITVGKLASDRFT-FGPGDNADGGGVSERR 213

Query: 93  L-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
           L FGC + N G  +        G+ G  R   S  SQLG      FSYC        E T
Sbjct: 214 LTFGCGHFNKGIFQANE----TGIAGFGRGRWSLPSQLGVT---SFSYCFTSMF---EST 263

Query: 152 SSYLKFGTDMG--YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVS 207
           SS +  G      +     Q+T  +  P+  + Y+LSLK I++   R+  P     +   
Sbjct: 264 SSLVTLGVAPAELHLTGQVQSTPLLRDPSQPSLYFLSLKAITVGATRIPIPERRQRLR-- 321

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
            E   IIDSG+ +T    DVY  +  +FV+   +  L   +     + LC+ LP      
Sbjct: 322 -EASAIIDSGASITTLPEDVYEAVKAEFVA---QVGLPVSAVEGSALDLCFALPSAAAPK 377

Query: 266 ----------------RFPSMAFYFED-ANLRIDGENVFIIDYENHFFLL---AVAPHDD 305
                           R P + F+    A+  +  EN    DY      L   A     D
Sbjct: 378 SAFGWRWRGRGRAMPVRVPRLVFHLGGGADWELPRENYVFEDYGARVMCLVLDAATGGGD 437

Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
              +IG+ QQ++T  VYDL  D+LSF    C  D
Sbjct: 438 QTVVIGNYQQQNTHVVYDLENDVLSFAPARCECD 471


>gi|356532672|ref|XP_003534895.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 449

 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 165/363 (45%), Gaps = 48/363 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQ---KINCDH 43
           M  + IG P    L+++DTGS +++               +FDP  SS+F    K  CD 
Sbjct: 102 MANISIGQPPIPQLVVMDTGSDILWVMCTPCTNCDNHLGLLFDPSMSSTFSPLCKTPCDF 161

Query: 44  PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             C+  +C  +   +T+ YAD S   G    +T+      EG +     LFGC    H  
Sbjct: 162 KGCS--RC--DPIPFTVTYADNSTASGMFGRDTVVFETTDEGTSRIPDVLFGC---GHNI 214

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM- 161
            +D  D    G+LGL+    S  +++G    ++FSYC+  +  P   Y    L  G D+ 
Sbjct: 215 GQDT-DPGHNGILGLNNGPDSLATKIG----QKFSYCIGDLADPYYNYHQLILGEGADLE 269

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
           GY  P      F  H N FYY++++ IS+  +R++  P+TF++  +  GG IID+GS +T
Sbjct: 270 GYSTP------FEVH-NGFYYVTMEGISVGEKRLDIAPETFEMKKNRTGGVIIDTGSTIT 322

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-AN 278
           +    V+ +L  K V     +   Q +    P   C++  +      FP + F+F D A+
Sbjct: 323 FLVDSVH-RLLSKEVRNLLGWSFRQTTIEKSPWMQCFYGSISRDLVGFPVVTFHFADGAD 381

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVK 333
           L +D    F     ++ F + V P   L      +LIG   Q+     YDL    + F +
Sbjct: 382 LALD-SGSFFNQLNDNVFCMTVGPVSSLNLKSKPSLIGLLAQQSYSVGYDLVNQFVYFQR 440

Query: 334 ENC 336
            +C
Sbjct: 441 IDC 443


>gi|15226358|ref|NP_180389.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4803959|gb|AAD29831.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252998|gb|AEC08092.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 756

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 98/354 (27%), Positives = 164/354 (46%), Gaps = 40/354 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   ++  +DTGS +I+               IFDP KSS+F++  C+    
Sbjct: 422 LMKLQVGTPPFEIVAEIDTGSDIIWTQCMPCPNCYSQFAPIFDPSKSSTFREQRCN---- 477

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + YAD++ +KG  A ET+++        +      GC  DN      
Sbjct: 478 ------GNSCHYEIIYADKTYSKGILATETVTIPSTSGEPFVMAETKIGCGLDNTNLQYS 531

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
               + +G++GL+   +S ISQ+        SYC      +G+ TS  + FGT+      
Sbjct: 532 GFASSSSGIVGLNMGPLSLISQMDLPYPGLISYCF-----SGQGTSK-INFGTNAIVAGD 585

Query: 167 STQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            T A   FI   N FYYL+L  +S+++  +      F    + +G   IDSG+ LTYF  
Sbjct: 586 GTVAADMFIKKDNPFYYLNLDAVSVEDNLIATLGTPFH---AEDGNIFIDSGTTLTYFPM 642

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDGE 284
             Y  L  + V   +     ++ D      LCY+  +T + FP +  +F   A+L +D  
Sbjct: 643 S-YCNLVREAVE--QVVTAVKVPDMGSDNLLCYY-SDTIDIFPVITMHFSGGADLVLDKY 698

Query: 285 NVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           N+++       F LA+  +D  + A+ G++ Q +    YD + +++SF   NCS
Sbjct: 699 NMYLETITGGIFCLAIGCNDPSMPAVFGNRAQNNFLVGYDPSSNVISFSPTNCS 752



 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 101/351 (28%), Positives = 157/351 (44%), Gaps = 50/351 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   +   +DTGS LI+               IFDP KSS+F +        
Sbjct: 83  LMKLQVGTPPFEIAAEIDTGSDLIWTQCMPCPDCYSQFDPIFDPSKSSTFNEQ------- 135

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              +C  + C Y + Y D + +KG  A ET+++        +      GC   N   D  
Sbjct: 136 ---RCHGKSCHYEIIYEDNTYSKGILATETVTIHSTSGEPFVMAETTIGCGLHNTDLDNS 192

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
               + +G++GL+    S ISQ+        SYC      +G+ TS  + FGT+      
Sbjct: 193 GFASSSSGIVGLNMGPRSLISQMDLPYPGLISYCF-----SGQGTSK-INFGTNAIVAGD 246

Query: 167 STQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            T A   FI   N FYYL+L  +S+++ R+      F    + +G  +IDSGS +TYF  
Sbjct: 247 GTVAADMFIKKDNPFYYLNLDAVSVEDNRIETLGTPFH---AEDGNIVIDSGSTVTYFPV 303

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPI---QLCYFLPETFNRFPSMAFYFE-DANLRI 281
             Y  L  K V      Q+      P+P     LCYF  ET + FP +  +F   A+L +
Sbjct: 304 S-YCNLVRKAVE-----QVVTAVRVPDPSGNDMLCYF-SETIDIFPVITMHFSGGADLVL 356

Query: 282 DGENVFIIDYENHFFLLAV---APHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           D  N+++       F LA+   +P  +  A+ G++ Q +    YD +  LL
Sbjct: 357 DKYNMYMESNSGGLFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLL 405


>gi|449529638|ref|XP_004171805.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like, partial
           [Cucumis sativus]
          Length = 384

 Score =  114 bits (285), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 157/360 (43%), Gaps = 43/360 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP K V ++LDTGS +++               +F+P KS SF K+ C  P C 
Sbjct: 44  TRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 103

Query: 48  YFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +   C   Q C+Y + Y D S T G    ET++       +        GC +DN G 
Sbjct: 104 RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDNEGL 158

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            R  +SF SQ G    ++FSYCLV    + + +S  + FG     
Sbjct: 159 FVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSS--VVFGNSAVS 211

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
           R  + + T  + +P  + FYY+ L  IS+    ++      F +  +G GG IID G+ +
Sbjct: 212 R--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 269

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
           T  +   Y  L + F     R   + L   PE      CY L  +T  + P++  +F  A
Sbjct: 270 TRLNKPAYIALRDAF-----RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 324

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++ +   N  I    +  F  A A     +++IG+ QQ+  R VYDL    + F    C+
Sbjct: 325 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 384


>gi|148905906|gb|ABR16115.1| unknown [Picea sitchensis]
          Length = 482

 Score =  114 bits (284), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 102/368 (27%), Positives = 166/368 (45%), Gaps = 55/368 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V    GTP+K  LLI+DTGS L +              AIF+P++SSS++ + C    C
Sbjct: 138 IVTAGFGTPAKNSLLIIDTGSDLTWIQCKPCADCYSQVDAIFEPKQSSSYKTLPCLSATC 197

Query: 47  TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           T           C+   CVY + Y D S ++G  + ET+++     G   F    FGC +
Sbjct: 198 TELITSESNPTPCLLGGCVYEINYGDGSSSQGDFSQETLTL-----GSDSFQNFAFGCGH 252

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N G  + +     +G+LGL + ++SF SQ  S    +F+YC    LP+   ++S   F 
Sbjct: 253 TNTGLFKGS-----SGLLGLGQNSLSFPSQSKSKYGGQFAYC----LPDFGSSTSTGSFS 303

Query: 159 TDMGYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
              G    S   T  ++   +P  FY++ L  IS+  +R++ PP      V G G  I+D
Sbjct: 304 VGKGSIPASAVFTPLVSNFMYP-TFYFVGLNGISVGGDRLSIPP-----AVLGRGSTIVD 357

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF 274
           SG+V+T      Y  L   F S       A+       +  CY L      R P++ F+F
Sbjct: 358 SGTVITRLLPQAYNALKTSFRSKTRDLPSAKPFSI---LDTCYDLSRHSQVRIPTITFHF 414

Query: 275 E-DANLRIDGENVFIIDYENH----FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           + +A++ +    + ++  +N         A A   D   +IG+ QQ+  R  +D     +
Sbjct: 415 QNNADVAVSDVGI-LVPVQNGGSQVCLAFASASQMDGFNIIGNFQQQRMRVAFDTGAGRI 473

Query: 330 SFVKENCS 337
            F   +C+
Sbjct: 474 GFASGSCA 481


>gi|148906646|gb|ABR16474.1| unknown [Picea sitchensis]
          Length = 538

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 102/362 (28%), Positives = 162/362 (44%), Gaps = 49/362 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP +   ++LDTGS +++               IF+P  S+SF  + C+   C+
Sbjct: 199 TRIGVGTPMREQYMVLDTGSDVVWIQCEPCSKCYSQVDPIFNPSLSASFSTLGCNSAVCS 258

Query: 48  Y---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           Y   + C    C+Y + Y D S T G  A E ++      G         GC +DN G  
Sbjct: 259 YLDAYNCHGGGCLYKVSYGDGSYTIGSFATEMLTF-----GTTSVRNVAIGCGHDNAGLF 313

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---M 161
             A               +SF SQLG+   + FSYCLV        +S  L+FG +   +
Sbjct: 314 VGAAGLLGL-----GAGLLSFPSQLGTQTGRAFSYCLVDRF---SESSGTLEFGPESVPL 365

Query: 162 GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSG 217
           G     +  T  + +P+   FYY+ L  IS+    ++  PPD F I   SG GG I+DSG
Sbjct: 366 G-----SILTPLLTNPSLPTFYYVPLISISVGGALLDSVPPDVFRIDETSGRGGFIVDSG 420

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED 276
           + +T   + VY  + + FV+   +   A+          CY L        P++ F+F +
Sbjct: 421 TAVTRLQTPVYDAVRDAFVAGTRQLPKAEGVSI---FDTCYDLSGLPLVNVPTVVFHFSN 477

Query: 277 -ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A+L +  +N  I +D+    F  A AP    ++++G+ QQ+  R  +D    L+ F   
Sbjct: 478 GASLILPAKNYMIPMDFMGT-FCFAFAPATSDLSIMGNIQQQGIRVSFDTANSLVGFALR 536

Query: 335 NC 336
            C
Sbjct: 537 QC 538


>gi|356498306|ref|XP_003517994.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 161/358 (44%), Gaps = 47/358 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +R+ IG P     ++LDTGS + +               IFDP  S+S+  I CD P C 
Sbjct: 151 LRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPISSNSYSPIRCDEPQCK 210

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                +C N  C+Y + Y D S T G  A ET+++     G A       GC ++N G  
Sbjct: 211 SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL-----GSAAVENVAIGCGHNNEGLF 265

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGY 163
             A      G   LS     F +Q+ +     FSYCLV    N +  + S L+F + +  
Sbjct: 266 VGAAGLLGLGGGKLS-----FPAQVNAT---SFSYCLV----NRDSDAVSTLEFNSPL-- 311

Query: 164 RRPSTQATK-FINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             P   AT   + +P  + FYYL LK IS+  E +  P  +F++   G GG IIDSG+ +
Sbjct: 312 --PRNAATAPLMRNPELDTFYYLGLKGISVGGEALPIPESSFEVDAIGGGGIIIDSGTAV 369

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDAN 278
           T   S+VY  L + FV   +    A           CY L    +   P+++F F E   
Sbjct: 370 TRLRSEVYDALRDAFVKGAKGIPKANGVSL---FDTCYDLSSRESVEIPTVSFRFPEGRE 426

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +   N  I       F  A AP    +++IG+ QQ+ TR  +D+   L+ F  ++C
Sbjct: 427 LPLPARNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVGFDIANSLVGFSVDSC 484


>gi|168040957|ref|XP_001772959.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675692|gb|EDQ62184.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 351

 Score =  114 bits (284), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 163/365 (44%), Gaps = 50/365 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++++ +GTP +    I+DTGS L +               +F P  SSS+   +C    C
Sbjct: 9   VLQISLGTPPQQFSAIVDTGSDLCWVQCAPCARCFEQPDPLFIPLASSSYSNASCTDSLC 68

Query: 47  TYFK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                    +   C Y+  Y D S T+G  A ET+++ G    +       FGC ++  G
Sbjct: 69  DALPRPTCSMRNTCTYSYSYGDGSNTRGDFAFETVTLNGSTLARI-----GFGCGHNQEG 123

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A      G++GL +  +S  SQL S     FSYCLV     G +  S + FG    
Sbjct: 124 TFAGAD-----GLIGLGQGPLSLPSQLNSSFTHIFSYCLVDQSTTGTF--SPITFGNAAE 176

Query: 163 YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             R S   T  + + +N  +YY+ ++ IS+ N R+  PP  F I  +G GG I+DSG+ +
Sbjct: 177 NSRAS--FTPLLQNEDNPSYYYVGVESISVGNRRVPTPPSAFRIDANGVGGVILDSGTTI 234

Query: 221 TYFHSDVYWKLHEKFVSYFE--RFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAFY 273
           T      YW+L   F+      R Q++     P P  + LCY +          PSM  +
Sbjct: 235 T------YWRL-AAFIPILAELRRQISYPEADPTPYGLNLCYDISSVSASSLTLPSMTVH 287

Query: 274 FEDANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
             + +  I   N+++ +D        A++  D   ++IG+ QQ++   V D+    + F+
Sbjct: 288 LTNVDFEIPVSNLWVLVDNFGETVCTAMSTSDQF-SIIGNVQQQNNLIVTDVANSRVGFL 346

Query: 333 KENCS 337
             +CS
Sbjct: 347 ATDCS 351


>gi|255576511|ref|XP_002529147.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223531426|gb|EEF33260.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 479

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/355 (27%), Positives = 153/355 (43%), Gaps = 43/355 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ IG PS  V ++LDTGS + +               IF+P  S+S+  ++CD   C  
Sbjct: 147 RVGIGKPSSPVYMVLDTGSDVNWIQCAPCADCYHQADPIFEPASSTSYSPLSCDTKQCQS 206

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
               +C N  C+Y + Y D S T G    ETI++     G A       GC ++N G   
Sbjct: 207 LDVSECRNNTCLYEVSYGDGSYTVGDFVTETITL-----GSASVDNVAIGCGHNNEGLFI 261

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
            A      G   LS     F SQ+ +     FSYCLV        ++S L+F + +    
Sbjct: 262 GAAGLLGLGGGKLS-----FPSQINA---SSFSYCLV---DRDSDSASTLEFNSAL---L 307

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           P       + +   + FYY+ +  +S+  E ++ P   F++  SG GG IIDSG+ +T  
Sbjct: 308 PHAITAPLLRNRELDTFYYVGMTGLSVGGELLSIPESMFEMDESGNGGIIIDSGTAVTRL 367

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDAN-LRI 281
            +  Y  L + FV   +   +            CY L  +T    P++ F+      L +
Sbjct: 368 QTAAYNALRDAFVKGTKDLPVTSEVAL---FDTCYDLSRKTSVEVPTVTFHLAGGKVLPL 424

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              N  I    +  F  A AP    +++IG+ QQ+ TR  +DL   L+ F    C
Sbjct: 425 PATNYLIPVDSDGTFCFAFAPTSSALSIIGNVQQQGTRVGFDLANSLVGFEPRQC 479


>gi|413921849|gb|AFW61781.1| hypothetical protein ZEAMMB73_702843 [Zea mays]
          Length = 379

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 87/287 (30%), Positives = 125/287 (43%), Gaps = 33/287 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP      I+DTGS LI+                FD +KS++++ + C    C
Sbjct: 90  LVDLAIGTPPLYYTAIMDTGSDLIWTQCAPCLLCADQPTPYFDVKKSATYRALPCRSSRC 149

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C  + CVY   Y D + T G  A+ET +       K       FGC + N G 
Sbjct: 150 ASLSSPSCFKKMCVYQYYYGDTASTAGVLANETFTFGAANSTKVRATNIAFGCGSLNAGD 209

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEYTSSYLKFGTD 160
             ++     +G++G  R  +S +SQLG     RFSYCL   L   P+  Y   Y    + 
Sbjct: 210 LANS-----SGMVGFGRGPLSLVSQLG---PSRFSYCLTSYLSATPSRLYFGVYANLSST 261

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                   Q+T F+ +P   N Y+LSLK IS+  + +   P  F I   G GG IIDSG+
Sbjct: 262 NTSSGSPVQSTPFVINPALPNMYFLSLKAISLGTKLLPIDPLVFAINDDGTGGVIIDSGT 321

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            +T+   D Y  +    VS      L  ++D    +  C+  P   N
Sbjct: 322 SITWLQQDAYEAVRRGLVSAIP---LTAMNDTDIGLDTCFQWPPPPN 365


>gi|356537728|ref|XP_003537377.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 543

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 114/377 (30%), Positives = 164/377 (43%), Gaps = 51/377 (13%)

Query: 4   LFIGTPSKGVLLILDTGSALIYAIFDP--------------RKSSSFQKINCDHPDCT-- 47
           +F+GTP K V LILDTGS L +   DP              + SS+++ I+C  P C   
Sbjct: 175 MFVGTPPKHVWLILDTGSDLSWIQCDPCYDCFEQNGSHYYPKDSSTYRNISCYDPRCQLV 234

Query: 48  -------YFKCVNEQCVYTMKYADQSVTKGFAAHETISV-IGKGEGKAIFH---GALFGC 96
                  + K  N+ C Y   YAD S T G  A ET +V +    GK  F      +FGC
Sbjct: 235 SSSDPLQHCKAENQTCPYFYDYADGSNTTGDFASETFTVNLTWPNGKEKFKQVVDVMFGC 294

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N GF   A     +G+LGL R  ISF SQ+ SI    FSYCL     N    SS L 
Sbjct: 295 GHWNKGFFYGA-----SGLLGLGRGPISFPSQIQSIYGHSFSYCLTDLFSNTS-VSSKLI 348

Query: 157 FGTDM----GYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDIT-----V 206
           FG D      +    T        P+  FYYL +K I +  E ++    T+  +      
Sbjct: 349 FGEDKELLNNHNLNFTTLLAGEETPDETFYYLQIKSIMVGGEVLDISEQTWHWSSEGAAA 408

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR 266
              GG IIDSGS LT+F    Y  + E F    ++ +L Q++     +  CY +     +
Sbjct: 409 DAGGGTIIDSGSTLTFFPDSAYDIIKEAFE---KKIKLQQIAADDFVMSPCYNVSGAMMQ 465

Query: 267 --FPSMAFYFEDANL-RIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFV 321
              P    +F D  +     EN F     +    LA+   P+   + +IG+  Q++   +
Sbjct: 466 VELPDFGIHFADGGVWNFPAENYFYQYEPDEVICLAIMKTPNHSHLTIIGNLLQQNFHIL 525

Query: 322 YDLNIDLLSFVKENCSD 338
           YD+    L +    C++
Sbjct: 526 YDVKRSRLGYSPRRCAE 542


>gi|356495754|ref|XP_003516738.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 420

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 99/359 (27%), Positives = 170/359 (47%), Gaps = 36/359 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP   +  I DTGS L +               +FDP+KS++++ I+CD   C
Sbjct: 73  LMELSIGTPPFKIYGIADTGSDLTWTSCVPCNNCYKQRNPMFDPQKSTTYRNISCDSKLC 132

Query: 47  ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
               T      ++C YT  YA  ++T+G  A ETI+ +   +GK++   G +FGC ++N 
Sbjct: 133 HKLDTGVCSPQKRCNYTYAYASAAITRGVLAQETIT-LSSTKGKSVPLKGIVFGCGHNNT 191

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G   D       G++GL    +S ISQ+GS    KRFS CLV P       SS + FG  
Sbjct: 192 GGFNDHE----MGIIGLGGGPVSLISQMGSSFGGKRFSQCLV-PFHTDVSVSSKMSFGKG 246

Query: 161 MGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                    +T  +   +   Y+++L  IS++N  ++F   + ++    +G   +DSG+ 
Sbjct: 247 SKVSGKGVVSTPLVAKQDKTPYFVTLLGISVENTYLHFNGSSQNVE---KGNMFLDSGTP 303

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
            T   + +Y ++  +  S      +  ++D P+   QLCY       R P +  +FE A+
Sbjct: 304 PTILPTQLYDQVVAQVRS---EVAMKPVTDDPDLGPQLCYRTKNNL-RGPVLTAHFEGAD 359

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +++     FI   +  F L       D   + G+  Q +    +DL+  ++SF  ++C+
Sbjct: 360 VKLSPTQTFISPKDGVFCLGFTNTSSD-GGVYGNFAQSNYLIGFDLDRQVVSFKPKDCT 417


>gi|357500973|ref|XP_003620775.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|357500991|ref|XP_003620784.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495790|gb|AES76993.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355495799|gb|AES77002.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 104/357 (29%), Positives = 173/357 (48%), Gaps = 37/357 (10%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTP      I+DTGS +++                F+P KSSS++ I+C    C   + 
Sbjct: 93  VGTPPIKSYGIVDTGSDIVWLQCEPCEQCYNQTTPKFNPSKSSSYKNISCSSKLCQSVRD 152

Query: 51  --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C +++ C Y++ Y +QS ++G  + ET+++         F   + GC  +N G    +
Sbjct: 153 TSCNDKKNCEYSINYGNQSHSQGDLSLETLTLESTTGRPVSFPKTVIGCGTNNIG----S 208

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGYR 164
                +GV+GL     S I+QLG  I  +FSYCLV   I L N    SS L FG      
Sbjct: 209 FKRVSSGVVGLGGGPASLITQLGPSIGGKFSYCLVRMSITLKNMSMGSSKLNFGDVAIVS 268

Query: 165 RPSTQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
             +  +T  +   ++ FYYL+++  S+ ++R+ F   +  +    EG  IIDS +++T+ 
Sbjct: 269 GHNVLSTPIVKKDHSFFYYLTIEAFSVGDKRVEFAGSSKGVE---EGNIIIDSSTIVTFV 325

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFEDANLRI 281
            SDVY KL+   V   +   L ++ D  +   LCY +   E ++ FP M  +F+ A++ +
Sbjct: 326 PSDVYTKLNSAIV---DLVTLERVDDPNQQFSLCYNVSSDEEYD-FPYMTAHFKGADILL 381

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
              N F ++        A AP +   A+ GS  Q+D    YDL    +SF   +C++
Sbjct: 382 YATNTF-VEVARDVLCFAFAPSNG-GAIFGSFSQQDFMVGYDLQQKTVSFKSVDCTE 436


>gi|449432044|ref|XP_004133810.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 471

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 157/360 (43%), Gaps = 43/360 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP K V ++LDTGS +++               +F+P KS SF K+ C  P C 
Sbjct: 131 TRIGVGTPPKYVYMVLDTGSDIVWLQCAPCKNCYSQTDPVFNPVKSGSFAKVLCRTPLCR 190

Query: 48  YFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +   C   Q C+Y + Y D S T G    ET++       +        GC +DN G 
Sbjct: 191 RLESPGCNQRQTCLYQVSYGDGSYTTGEFVTETLTFR-----RTKVEQVALGCGHDNEGL 245

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            R  +SF SQ G    ++FSYCLV    + + +S  + FG     
Sbjct: 246 FVGAAGLLGL-----GRGGLSFPSQAGRTFNQKFSYCLVDRSASSKPSS--VVFGNSAVS 298

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
           R  + + T  + +P  + FYY+ L  IS+    ++      F +  +G GG IID G+ +
Sbjct: 299 R--TARFTPLLTNPRLDTFYYVELLGISVGGTPVSGITASHFKLDRTGNGGVIIDCGTSV 356

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFEDA 277
           T  +   Y  L + F     R   + L   PE      CY L  +T  + P++  +F  A
Sbjct: 357 TRLNKPAYIALRDAF-----RAGASSLKSAPEFSLFDTCYDLSGKTTVKVPTVVLHFRGA 411

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++ +   N  I    +  F  A A     +++IG+ QQ+  R VYDL    + F    C+
Sbjct: 412 DVSLPASNYLIPVDGSGRFCFAFAGTTSGLSIIGNIQQQGFRVVYDLASSRVGFSPRGCA 471


>gi|357118074|ref|XP_003560784.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial
           [Brachypodium distachyon]
          Length = 452

 Score =  113 bits (283), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 154/363 (42%), Gaps = 50/363 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V +  G+P++    + DTGS L +                +FDP KSSS+  + C   +
Sbjct: 113 VVVVGFGSPAQTSATMFDTGSDLSWIQCQPCSGHCYKQHDPVFDPAKSSSYAVVPCGTTE 172

Query: 46  CTYF--KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           C     +C    CVY ++Y D S T G  A ET++     E    F G +FGC   N G 
Sbjct: 173 CAAAGGECNGTTCVYGVEYGDGSSTTGVLARETLTFSSSSE----FTGFIFGCGETNLG- 227

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D    DG L    G   ++       G I    FSYC    LP+   T  YL  G     
Sbjct: 228 DFGEVDGLLGLGRGSLSLSSQAAPAFGGI----FSYC----LPSYNTTPGYLSIGATPVT 279

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            +   Q T  +N P+  +FY++ L  I+I    +  PP  F  T     G ++DSG++LT
Sbjct: 280 GQIPVQYTAMVNKPDYPSFYFIELVSINIGGYVLPVPPSEFTKT-----GTLLDSGTILT 334

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA- 277
           Y     Y  L ++F     +F +      P  + +  CY F  ++    P ++F F D  
Sbjct: 335 YLPPPAYTALRDRF-----KFTMQGSKPAPPYDELDTCYDFTGQSGILIPGVSFNFSDGA 389

Query: 278 --NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
             NL   G   F  D +     LA    P D   +++GS  QR    +YD+    + F+ 
Sbjct: 390 VFNLNFFGIMTFPDDTKPAVGCLAFVSRPADMPFSVVGSTTQRSAEVIYDVPAQKIGFIP 449

Query: 334 ENC 336
            +C
Sbjct: 450 ASC 452


>gi|357143657|ref|XP_003573000.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Brachypodium distachyon]
          Length = 464

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 96/359 (26%), Positives = 145/359 (40%), Gaps = 49/359 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----AIFDPRKSSSFQKINCDHPDCTYFK----- 50
           ++ + IG+P+    + +DTGS + +      ++DP  SS++   +C  P C         
Sbjct: 132 VITVSIGSPAVAXTMFIDTGSDVSWLRCKSRLYDPGTSSTYAPFSCSAPACAQLGRRGTG 191

Query: 51  -CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
                 CVY++KY D S T G    +T+++ G  E   +  G  FGCS   HGF+ED  D
Sbjct: 192 CSSGSTCVYSVKYGDGSNTTGTYGSDTLTLAGTSE--PLISGFQFGCSAVEHGFEEDNTD 249

Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
               G++GL     SF+SQ  +     FSYC    LP    +S +L  G        +  
Sbjct: 250 ----GLMGLGGDAQSFVSQTAATYGSAFSYC----LPPTWNSSGFLTLGAPSSSTSAAFS 301

Query: 170 ATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
            T  +       FY L L+ IS+  + +  P   F        G I+DSG+V+T      
Sbjct: 302 TTPMLRSKQAATFYGLLLRGISVGGKTLEIPSSVF------SAGSIVDSGTVITRLPPTA 355

Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF--------NRF--PSMAFYFEDA 277
           Y  L   F     R+Q        +P      L   F        N F  PS+A    D 
Sbjct: 356 YGALSAAFRDGMARYQY-------QPAAPRGLLDTCFDFTGHGEGNNFTVPSVALVL-DG 407

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              +D     I+  ++     A    D    +IG+ QQR    +YD+   +  F    C
Sbjct: 408 GAVVDLHPNGIV--QDGCLAFAATDDDGRTGIIGNVQQRTFEVLYDVGQSVFGFRPGAC 464


>gi|224093400|ref|XP_002309912.1| predicted protein [Populus trichocarpa]
 gi|222852815|gb|EEE90362.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/357 (25%), Positives = 156/357 (43%), Gaps = 41/357 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  S+SF  ++C    C 
Sbjct: 45  VRIGLGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCD 104

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C + +C Y + Y D S TKG  A ET++      G+ +      GC + N G  
Sbjct: 105 RVENAGCNSGRCRYEVSYGDGSYTKGTLALETLTF-----GRTVVRNVAIGCGHSNRGMF 159

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QL       FSYCLV     G  T+ +L+FG++    
Sbjct: 160 VGAAGLLGL-----GGGSMSFMGQLSGQTGNAFSYCLV---SRGTNTNGFLEFGSEA--- 208

Query: 165 RPSTQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            P   A    + +P   +FYY+ L  + + + R+    D F +   G GG ++D+G+ +T
Sbjct: 209 MPVGAAWIPLVRNPRAPSFYYIRLLGLGVGDTRVPVSEDVFQLNELGSGGVVMDTGTAVT 268

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLR 280
            F +  Y      F+   E+ Q    +        CY L    + R P+++FYF    + 
Sbjct: 269 RFPTVAYEAFRNAFI---EQTQNLPRASGVSIFDTCYNLFGFLSVRVPTVSFYFSGGPIL 325

Query: 281 IDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
               N F+I  ++   F  A AP    ++++G+ QQ   +   D   + + F    C
Sbjct: 326 TIPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDEANEFVGFGPNIC 382


>gi|357514989|ref|XP_003627783.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521805|gb|AET02259.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  113 bits (282), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 98/352 (27%), Positives = 158/352 (44%), Gaps = 50/352 (14%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP   +  I DTGS +++                F P KSS+++ I C    C     
Sbjct: 93  VGTPPFKLYGIADTGSDIVWLQCEPCKECYNQTTPKFKPSKSSTYKNIPCSSDLC----- 147

Query: 52  VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
                        +S  +G  + +T+++         F   + GC  DN      + +GA
Sbjct: 148 -------------KSGQQGNLSVDTLTLESSTGHPISFPKTVIGCGTDN----TVSFEGA 190

Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
            +G++GL     S I+QLGS I  +FSYCL +P P    T+S L FG           +T
Sbjct: 191 SSGIVGLGGGPASLITQLGSSIDAKFSYCL-LPNPVESNTTSKLNFGDTAVVSGDGVVST 249

Query: 172 KFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
             +   P  FYYL+L+  S+ N+R+ F   +       EG  IIDSG+ LT   +DVY  
Sbjct: 250 PIVKKDPIVFYYLTLEAFSVGNKRIEFEGSS---NGGHEGNIIIDSGTTLTVIPTDVYNN 306

Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
           L     +  E  +L +++D      LCY +      FP +  +F+ A++++   + F +D
Sbjct: 307 LES---AVLELVKLKRVNDPTRLFNLCYSVTSDGYDFPIITTHFKGADVKLHPISTF-VD 362

Query: 291 YENHFFLLAVAPH-----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             +    LA A        D+V++ G+  Q++    YDL   ++SF   +CS
Sbjct: 363 VADGIVCLAFATTSAFIPSDVVSIFGNLAQQNLLVGYDLQQKIVSFKPTDCS 414


>gi|225463766|ref|XP_002267930.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 92/362 (25%), Positives = 154/362 (42%), Gaps = 51/362 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  S+SF  ++C    C 
Sbjct: 142 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCD 201

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C   +C Y + Y D S TKG  A ET++      G+ +      GC + N G  
Sbjct: 202 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSVAIGCGHRNRGMF 256

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QLG      FSYCLV     G  +S  L FG +    
Sbjct: 257 VGAAGLLGL-----GGGSMSFVGQLGGQTGGAFSYCLV---SRGTDSSGSLVFGREA--- 305

Query: 165 RPSTQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            P+  A    + +P   +FYY+ L  + +   R+    + F +T  G+GG ++D+G+ +T
Sbjct: 306 LPAGAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVT 365

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CY-FLPETFNRFPSMAFYFE 275
              +  Y    + F        LAQ ++ P    +     CY  L     R P+++FYF 
Sbjct: 366 RLPTLAYQAFRDAF--------LAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFS 417

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               L +   N  I   +   F  A AP    ++++G+ QQ   +  +D     + F   
Sbjct: 418 GGPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFDGANGYVGFGPN 477

Query: 335 NC 336
            C
Sbjct: 478 IC 479


>gi|293336306|ref|NP_001168599.1| uncharacterized protein LOC100382383 [Zea mays]
 gi|223949441|gb|ACN28804.1| unknown [Zea mays]
          Length = 326

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 104/349 (29%), Positives = 154/349 (44%), Gaps = 50/349 (14%)

Query: 15  LILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF---KCVNE--Q 55
           ++LDTGS + +               +FDP  S+S+  ++CD   C       C N    
Sbjct: 1   MVLDTGSDVTWVQCQPCADCYQQSDPVFDPSLSASYAAVSCDSQRCRDLDTAACRNATGA 60

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           C+Y + Y D S T G  A ET+++   G+   + + A+ GC +DN G          AG+
Sbjct: 61  CLYEVAYGDGSYTVGDFATETLTL---GDSTPVGNVAI-GCGHDNEGLFV-----GAAGL 111

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN 175
           L L    +SF SQ   I    FSYCLV         +S L+FG   G     T     + 
Sbjct: 112 LALGGGPLSFPSQ---ISASTFSYCLV---DRDSPAASTLQFGD--GAAEAGTVTAPLVR 163

Query: 176 HP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
            P  + FYY++L  IS+  + ++ P   F +   SG GG I+DSG+ +T   S  Y  L 
Sbjct: 164 SPRTSTFYYVALSGISVGGQPLSIPASAFAMDATSGSGGVIVDSGTAVTRLQSAAYAALR 223

Query: 233 EKFVS---YFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFEDAN-LRIDGENVF 287
           + FV       R     L D       CY L + T    P+++  FE    LR+  +N  
Sbjct: 224 DAFVQGAPSLPRTSGVSLFD------TCYDLSDRTSVEVPAVSLRFEGGGALRLPAKNYL 277

Query: 288 IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           I       + LA AP +  V++IG+ QQ+ TR  +D     + F    C
Sbjct: 278 IPVDGAGTYCLAFAPTNAAVSIIGNVQQQGTRVSFDTARGAVGFTPNKC 326


>gi|297821064|ref|XP_002878415.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324253|gb|EFH54674.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 486

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 106/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +RL +GTP+  V ++LDTGS +++               IFDP+KS +F  + C    C 
Sbjct: 140 MRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQSDVIFDPKKSKTFATVPCGSRLCR 199

Query: 48  YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-----FG 95
                 +CV  +   C+Y + Y D S T+G  + ET++          FHGA       G
Sbjct: 200 RLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT----------FHGARVDHVPLG 249

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSS 153
           C +DN G    A            R  +SF SQ  S    +FSYCLV      +     S
Sbjct: 250 CGHDNEGLFVGAAGLLGL-----GRGGLSFPSQTKSRYNGKFSYCLVDRTSSGSSSKPPS 304

Query: 154 YLKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGE 209
            + FG D     P T   T  + +P  + FYYL L  IS+   R+       F +  +G 
Sbjct: 305 TIVFGNDA---VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 361

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNR 266
           GG IIDSG+ +T      Y  L + F     R    +L   P       C+ L   T  +
Sbjct: 362 GGVIIDSGTSVTRLTQSAYVALRDAF-----RLGATKLKRAPSYSLFDTCFDLSGMTTVK 416

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            P++ F+F    + +   N  I       F  A A     +++IG+ QQ+  R  YDL  
Sbjct: 417 VPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 476

Query: 327 DLLSFVKENC 336
             + F+   C
Sbjct: 477 SRVGFLSRAC 486


>gi|449486856|ref|XP_004157423.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 58/363 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +G P+K   ++LDTGS + +               IFDPR SSSF  + C+   C  
Sbjct: 158 RVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQA 217

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C   +C+Y + Y D S T G    ET++    G    + +    GC +DN G   
Sbjct: 218 LETSGCRASKCLYQVSYGDGSFTVGEFVTETLTFGNSG----MINDVAVGCGHDNEGL-- 271

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    +    FSYCLV        +SS L+F        
Sbjct: 272 ------FVGSAGLLGLGGGPLSLTSQMKASSFSYCLV---DRDSSSSSDLEFN------- 315

Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            S   +  +N P       + FYY+ L  +S+  + ++ PP+ F +  SG GG I+DSG+
Sbjct: 316 -SAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGT 374

Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF 274
            +T   +  Y  L + FVS   Y ++     L D       CY L  ++    P+++F F
Sbjct: 375 AITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDT------CYDLSSQSRVTIPTVSFEF 428

Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
               +L++  +N  I       F  A AP    +++IG+ QQ+ TR  YDL   ++ F  
Sbjct: 429 AGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSP 488

Query: 334 ENC 336
             C
Sbjct: 489 HKC 491


>gi|449439383|ref|XP_004137465.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 491

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 159/363 (43%), Gaps = 58/363 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +G P+K   ++LDTGS + +               IFDPR SSSF  + C+   C  
Sbjct: 158 RVGVGQPAKPFYMVLDTGSDINWLQCQPCTDCYQQTDPIFDPRSSSSFASLPCESQQCQA 217

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C   +C+Y + Y D S T G    ET++    G    + +    GC +DN G   
Sbjct: 218 LETSGCRASKCLYQVSYGDGSFTVGEFVIETLTFGNSG----MINNVAVGCGHDNEGL-- 271

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                   G  GL  +    +S    +    FSYCLV        +SS L+F        
Sbjct: 272 ------FVGSAGLLGLGGGSLSLTSQMKASSFSYCLV---DRDSSSSSDLEFN------- 315

Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            S   +  +N P       + FYY+ L  +S+  + ++ PP+ F +  SG GG I+DSG+
Sbjct: 316 -SAAPSDSVNAPLLKSGKVDTFYYVGLTGMSVGGQLLSIPPNLFQMDDSGYGGIIVDSGT 374

Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF 274
            +T   +  Y  L + FVS   Y ++     L D       CY L  ++    P+++F F
Sbjct: 375 AITRLQTQAYNTLRDAFVSRTPYLKKTNGFALFDT------CYDLSSQSRVTIPTVSFEF 428

Query: 275 EDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
               +L++  +N  I       F  A AP    +++IG+ QQ+ TR  YDL   ++ F  
Sbjct: 429 AGGKSLQLPPKNYLIPVDSVGTFCFAFAPTTSSLSIIGNVQQQGTRVHYDLANSVVGFSP 488

Query: 334 ENC 336
             C
Sbjct: 489 HKC 491


>gi|125586059|gb|EAZ26723.1| hypothetical protein OsJ_10631 [Oryza sativa Japonica Group]
          Length = 339

 Score =  112 bits (279), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 101/352 (28%), Positives = 155/352 (44%), Gaps = 37/352 (10%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHP-----DCTYFKC------VNE 54
           +GTP   V L L+ G+ LI+   +P      Q      P        +  C       N+
Sbjct: 1   MGTPPNPVKLKLENGNELIWNHSNPSPECFEQAFPYFEPLTFSRGLPFASCGSPKFWPNQ 60

Query: 55  QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAG 114
            CVYT  Y D+SVT GF   +  + +G G   A   G  FGC   N+G  +        G
Sbjct: 61  TCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCGLFNNGVFKSNE----TG 113

Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYRRPSTQATKF 173
           + G  R  +S  SQL       FS+C    +     ++  L    D+    + + Q T  
Sbjct: 114 IAGFGRGPLSLPSQLKV---GNFSHCFTT-ITGAIPSTVLLDLPADLFSNGQGAVQTTPL 169

Query: 174 INHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
           I +  N      YYLSLK I++ + R+  P   F +T +G GG IIDSG+ +T     VY
Sbjct: 170 IQYAKNEANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVY 228

Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN-V 286
             + ++F +   + +L  +         C+  P +     P +  +FE A + +  EN V
Sbjct: 229 QVVRDEFAA---QIKLPVVPGNATGHYTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYV 285

Query: 287 FII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           F +  D  N    LA+   D+   +IG+ QQ++   +YDL  ++LSFV   C
Sbjct: 286 FEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDLQNNMLSFVAAQC 336


>gi|116787333|gb|ABK24467.1| unknown [Picea sitchensis]
          Length = 497

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 102/359 (28%), Positives = 163/359 (45%), Gaps = 44/359 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           RL +GTP +   ++LDTGS +++               +F+P  SS+++K+ C  P C  
Sbjct: 156 RLGVGTPPRYTYMVLDTGSDIMWIQCLPCAKCYGQTDPLFNPAASSTYRKVPCATPLCKK 215

Query: 49  F---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                C N++ C Y + Y D S T G  + ET++  G+     +      GC +DN G  
Sbjct: 216 LDISGCRNKRYCEYQVSYGDGSFTVGDFSTETLTFRGQ-----VIRRVALGCGHDNEGLF 270

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A            R ++SF SQ G+   KRFSYCLV    +G  T+S L FG     +
Sbjct: 271 IGAAGLLGL-----GRGSLSFPSQTGAQFSKRFSYCLVDRSASG--TASSLIFGKAAIPK 323

Query: 165 RPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVLT 221
             S   T  +++P  + FYY+ L  IS+   R+ + P   F +  +G GG IIDSG+ +T
Sbjct: 324 --SAIFTPLLSNPKLDTFYYVELVGISVGGRRLTSIPASVFRMDATGNGGVIIDSGTSVT 381

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFE-DA 277
                 Y  + + F     R     L           CY L      + P++ F+F+  A
Sbjct: 382 RLVDSAYSTMRDAF-----RVGTGNLKSAGGFSLFDTCYDLSGLKTVKVPTLVFHFQGGA 436

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++ +   N  I    +  F  A A +   +++IG+ QQ+  R V+D   + + F   +C
Sbjct: 437 HISLPATNYLIPVDSSATFCFAFAGNTGGLSIIGNIQQQGYRVVFDSLANRVGFKAGSC 495


>gi|326525377|dbj|BAK07958.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 95/344 (27%), Positives = 153/344 (44%), Gaps = 48/344 (13%)

Query: 27  IFDPRKSSSFQKIN------CDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI 80
           +FDP KS +F  I       C  P   Y    N  C + + Y D +   G+ A +T S  
Sbjct: 139 VFDPTKSPTFSNIPAHNTVWCRPP---YQPLANGACGFDIAYRDNTHASGYLARDTFSFP 195

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGL-----SRVTISFISQLGSIIKK 135
              +        +FGC++    F       A+AG+LGL      +   +F  Q+      
Sbjct: 196 AGNDDFVPLSAIVFGCAHQTEHFKNQR---AVAGILGLGMGPAGKPPTAFTKQVLPAHGG 252

Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST---QATKFI--NHPNNFYYLSLKDISI 190
           RFSYC  +P   G    SYL+FG+D+    P     Q+T  +   H +  Y++ L  +S+
Sbjct: 253 RFSYCPFVP---GMSMYSYLRFGSDIPSHPPPNVHRQSTPVLAPAHNSEAYFVKLAGVSV 309

Query: 191 DNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER----FQLA 245
              R++   P  F     G GGC++D G+ +T F    Y  +      + +R      + 
Sbjct: 310 GANRLSGVTPAMFRRNAHGAGGCVVDIGTRMTAFIHSAYVHIDHAVRQHLQRRGAHIVVV 369

Query: 246 QLSDC---PEPIQLCYFLPETFNRFPSMAFYFED-ANLRIDGENVFI--IDYENHFFLLA 299
           + + C   P P           +  PSM  +FE+ A LR+  E+VF+  +   +H+    
Sbjct: 370 RGNTCVQQPAPHH---------DVLPSMTLHFENGAWLRVMPEHVFMPFVVGGHHYQCFG 420

Query: 300 VAPHDDLVALIGSQQQRDTRFVYDLN--IDLLSFVKENCSDDSA 341
                DL  +IG++QQ + RF++DL+  I ++SF  E+C  D A
Sbjct: 421 FVSSTDLT-VIGARQQVNHRFIFDLHDTIPIMSFNPEDCHLDGA 463


>gi|15222357|ref|NP_174430.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|12322538|gb|AAG51267.1|AC027135_8 chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|67633408|gb|AAY78629.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332193236|gb|AEE31357.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 445

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 165/372 (44%), Gaps = 48/372 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + IGTP   V  I DTGS L +               +FD +KSS+++  +CD   C 
Sbjct: 87  MSISIGTPPSKVFAIADTGSDLTWVQCKPCQQCYKQNSPLFDKKKSSTYKTESCDSKTCQ 146

Query: 48  YFKCVNEQC-------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 E C        Y   Y D S TKG  A ETIS+         F G +FGC  +N
Sbjct: 147 ALSEHEEGCDESKDICKYRYSYGDNSFTKGDVATETISIDSSSGSSVSFPGTVFGCGYNN 206

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL--VIPLPNGEYTSSYLKFG 158
            G  E+   G +     L    +S +SQLGS I K+FSYCL       NG   +S +  G
Sbjct: 207 GGTFEETGSGIIG----LGGGPLSLVSQLGSSIGKKFSYCLSHTAATTNG---TSVINLG 259

Query: 159 TDMGYRRPS----TQATKFINH-PNNFYYLSLKDISIDNERMNFPPDTFDI---TVSGEG 210
           T+     PS    T  T  I   P  +Y+L+L+ +++   ++ +    + +   +    G
Sbjct: 260 TNSIPSNPSKDSATLTTPLIQKDPETYYFLTLEAVTVGKTKLPYTGGGYGLNGKSSKRTG 319

Query: 211 GCIIDSGSVLTYFHS---DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
             IIDSG+ LT   S   D +    E+ V+  +R     +SD    +  C+   +     
Sbjct: 320 NIIIDSGTTLTLLDSGFYDDFGTAVEESVTGAKR-----VSDPQGLLTHCFKSGDKEIGL 374

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           P++  +F +A++++   N F+   E+   L  +   +  VA+ G+  Q D    YDL   
Sbjct: 375 PAITMHFTNADVKLSPINAFVKLNEDTVCLSMIPTTE--VAIYGNMVQMDFLVGYDLETK 432

Query: 328 LLSFVKENCSDD 339
            +SF + +CS +
Sbjct: 433 TVSFQRMDCSGN 444


>gi|255564685|ref|XP_002523337.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223537425|gb|EEF39053.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 469

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 156/361 (43%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP + V ++LDTGS +++               +FDPRKS SF  I C  P C 
Sbjct: 128 TRIGVGTPPRYVYMVLDTGSDIVWIQCAPCKRCYAQSDPVFDPRKSRSFASIACRSPLCH 187

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D S T G  + ET++       +        GC +DN G
Sbjct: 188 RLDSPGCNTQKQTCMYQVSYGDGSFTFGDFSTETLTFR-----RTRVARVALGCGHDNEG 242

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF SQ G     +FSYCLV    + + +S  + FG    
Sbjct: 243 LFVGAAGLLGL-----GRGRLSFPSQTGRRFNHKFSYCLVDRSASSKPSS--MVFGDSAV 295

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R  + + T  +++P  + FYY+ L  IS+   R+       F +  +G GG IIDSG+ 
Sbjct: 296 SR--TARFTPLVSNPKLDTFYYVELLGISVGGTRVPGITASLFKLDQTGNGGVIIDSGTS 353

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLP-ETFNRFPSMAFYFED 276
           +T      Y    + F     R   + L   P+      C+ L  +T  + P++  +F  
Sbjct: 354 VTRLTRPAYIAFRDAF-----RAGASNLKRAPQFSLFDTCFDLSGKTEVKVPTVVLHFRG 408

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    +  F LA A     +++IG+ QQ+  R VYDL    + F    C
Sbjct: 409 ADVSLPASNYLIPVDTSGNFCLAFAGTMGGLSIIGNIQQQGFRVVYDLAGSRVGFAPHGC 468

Query: 337 S 337
           +
Sbjct: 469 A 469


>gi|242095588|ref|XP_002438284.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
 gi|241916507|gb|EER89651.1| hypothetical protein SORBIDRAFT_10g011120 [Sorghum bicolor]
          Length = 487

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 100/383 (26%), Positives = 161/383 (42%), Gaps = 64/383 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + IGTP +   ++ DTGS L +                 +FDP KSS++  + C  P
Sbjct: 123 VVTIGIGTPPRNFTVLFDTGSDLTWVQCLPCPDSSCYPQQEPLFDPSKSSTYVDVPCSAP 182

Query: 45  DC-----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
           +C        +C    C Y++KY D+S T G  A ET ++           G +FGCS++
Sbjct: 183 ECHIGGVQQTRCGATSCEYSVKYGDESETHGSLAEETFTLSPPSPLAPAATGVVFGCSHE 242

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR---FSYCLVIPLPNGEYTSSYLK 156
                 D   G +AG+LGL R   S +SQ    I      FSYCL    P G  T  YL 
Sbjct: 243 YISVFNDTGMG-VAGLLGLGRGDSSILSQTRRSINSGGGVFSYCLP---PRGSST-GYLT 297

Query: 157 FGTDMGYRRPSTQATKF--------INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            G   G   P  Q +          I+   + Y ++L  +S++   ++ P   F +    
Sbjct: 298 IGG--GAAAPQQQYSNLSFTPLITTISQLRSAYVVNLAGVSVNGAAVDIPASAFSL---- 351

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCY-FLPET 263
             G +IDSG+V+T+  +  Y+ L ++F     R  +      PE     +  CY    + 
Sbjct: 352 --GAVIDSGTVVTHMPAAAYYPLRDEF-----RLHMGSYKMLPEGSMKLLDTCYDVTGQD 404

Query: 264 FNRFPSMAFYF-EDANLRIDGENVFII----DYENHFFLLA----VAPHDDLVALIGSQQ 314
               P +A  F   A + +D   + ++    D       LA    +  +   + ++G+ Q
Sbjct: 405 VVTAPRVALEFGGGARIDVDASGILLVLPAEDGSGQSLTLACLAFLPTNSAGLVIVGNMQ 464

Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
           QR    V+D++   + F    CS
Sbjct: 465 QRAYNVVFDVDGGRIGFGPNGCS 487


>gi|356502456|ref|XP_003520035.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 484

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 106/355 (29%), Positives = 159/355 (44%), Gaps = 41/355 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +R+ IG P     ++LDTGS + +               IFDP  S+S+  I CD P C 
Sbjct: 151 LRVGIGKPPSQAYVVLDTGSDVSWIQCAPCSECYQQSDPIFDPVSSNSYSPIRCDAPQCK 210

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                +C N  C+Y + Y D S T G  A ET+++     G A       GC ++N G  
Sbjct: 211 SLDLSECRNGTCLYEVSYGDGSYTVGEFATETVTL-----GTAAVENVAIGCGHNNEGLF 265

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGY 163
             A      G   LS     F +Q+ +     FSYCLV    N +  + S L+F + +  
Sbjct: 266 VGAAGLLGLGGGKLS-----FPAQVNAT---SFSYCLV----NRDSDAVSTLEFNSPLP- 312

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           R   T   +     + FYYL LK IS+  E +  P   F++   G GG IIDSG+ +T  
Sbjct: 313 RNVVTAPLRRNPELDTFYYLGLKGISVGGEALPIPESIFEVDAIGGGGIIIDSGTAVTRL 372

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRI 281
            S+VY  L + FV   +    A           CY L    + + P+++F+F E   L +
Sbjct: 373 RSEVYDALRDAFVKGAKGIPKANGVSL---FDTCYDLSSRESVQVPTVSFHFPEGRELPL 429

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              N  I       F  A AP    ++++G+ QQ+ TR  +D+   L+ F  ++C
Sbjct: 430 PARNYLIPVDSVGTFCFAFAPTTSSLSIMGNVQQQGTRVGFDIANSLVGFSADSC 484


>gi|242066176|ref|XP_002454377.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
 gi|241934208|gb|EES07353.1| hypothetical protein SORBIDRAFT_04g029680 [Sorghum bicolor]
          Length = 474

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 149/365 (40%), Gaps = 60/365 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP     L +DTGS L +                 +FDP +SSS+  + C  P
Sbjct: 141 VVTVSLGTPGVAQTLEVDTGSDLSWVQCTPCAAPACYSQKDPLFDPAQSSSYAAVPCGGP 200

Query: 45  DCTYF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C         C   QC Y + Y D S T G  + +T+++      +  F    FGC + 
Sbjct: 201 VCGGLGIYASSCSAAQCGYVVSYGDGSKTTGVYSSDTLTLSPNDAVRGFF----FGCGHA 256

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             GF  +       G+LGL R   S + Q        FSYC    LP    T+ YL  G 
Sbjct: 257 QSGFTGN------DGLLGLGREEASLVEQTAGTYGGVFSYC----LPTRPSTTGYLTLGG 306

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
             G   P    T+ ++ PN   +Y + L  IS+  ++++ P   F       GG ++D+G
Sbjct: 307 PSGAAPPGFSTTQLLSSPNAATYYVVMLTGISVGGQQLSVPSSVF------AGGTVVDTG 360

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
           +V+T      Y  L   F S    +     +     +  CY F        P++A  F  
Sbjct: 361 TVITRLPPTAYAALRSAFRSGMASYGYPS-APATGILDTCYNFSGYGTVTLPNVALTFSG 419

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS--F 331
            A + +  + +        F  LA AP   D  +A++G+ QQR     +++ ID  S  F
Sbjct: 420 GATVTLGADGIL------SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRIDGTSVGF 469

Query: 332 VKENC 336
              +C
Sbjct: 470 KPSSC 474


>gi|218192707|gb|EEC75134.1| hypothetical protein OsI_11325 [Oryza sativa Indica Group]
          Length = 401

 Score =  111 bits (278), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 101/327 (30%), Positives = 145/327 (44%), Gaps = 47/327 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FDP  SS+    +CD   C
Sbjct: 83  LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 142

Query: 47  TYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                          N+ CVYT  Y D+SVT GF   +  + +G G   A   G  FGC 
Sbjct: 143 QGLPVASCGSPKFWPNQTCVYTYSYGDKSVTTGFLEVDKFTFVGAG---ASVPGVAFGCG 199

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
             N+G  +        G+ G  R  +S  SQL       FS+C      NG   S+  L 
Sbjct: 200 LFNNGVFKSNE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTA--VNGLKPSTVLLD 250

Query: 157 FGTDMGYR--RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
              D+ Y+  R + Q+T  I +P N  FYYLSLK I++ + R+  P   F +  +G GG 
Sbjct: 251 LPADL-YKSGRGAVQSTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGT 308

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMA 271
           IIDSG+ +T   + VY  + + F +   + +L  +S        C   P     + P + 
Sbjct: 309 IIDSGTAMTSLPTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLV 365

Query: 272 FYFEDANLRIDGEN-VFIIDYENHFFL 297
            +FE A + +  EN V++  Y     +
Sbjct: 366 LHFEGATMDLPRENYVWLKHYPKRLLI 392


>gi|148910443|gb|ABR18297.1| unknown [Picea sitchensis]
          Length = 452

 Score =  111 bits (277), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 95/357 (26%), Positives = 160/357 (44%), Gaps = 43/357 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           ++++  GTP + +  ++DTGS + +              IFDP KSSS++   CD   C 
Sbjct: 116 IIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQ 175

Query: 48  YFK--C-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                C  N +C + + Y D +   G  A + I++     G        FGC+       
Sbjct: 176 EISGNCGGNSKCQFEVSYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCA---ESLS 227

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
           ED         LG   +++   +    +    FSYC    LP+   +S  L  G +    
Sbjct: 228 EDTSPSPGLMGLGGGSLSLLTQAPTAELFGGTFSYC----LPSSSTSSGSLVLGKEAAVS 283

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             S + T  I  P+   FY+++LK IS+ N R++ P       ++  GG IIDSG+ +T+
Sbjct: 284 SSSLKFTTLIKDPSIPTFYFVTLKAISVGNTRISVP----GTNIASGGGTIIDSGTTITH 339

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFE-DANLR 280
                Y  L + F     R QL+ L   P E +  CY L  +    P++  + + + +L 
Sbjct: 340 LVPSAYTALRDAF-----RQQLSSLQPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLV 394

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +  EN+ I   E+    LA +  D   ++IG+ QQ++ R V+D+    + F +E C+
Sbjct: 395 LPKENILITQ-ESGLACLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|413944032|gb|AFW76681.1| hypothetical protein ZEAMMB73_606599 [Zea mays]
          Length = 315

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 94/324 (29%), Positives = 145/324 (44%), Gaps = 33/324 (10%)

Query: 33  SSSFQKINCDHPDC------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGE 84
           SS+F+ + C  P C      +   C  E  QC Y   Y D+S+T G    +T + +    
Sbjct: 2   SSTFKAVACPDPICRPSSGVSVSACAMENFQCFYLCSYGDRSITAGHIFKDTFTFMSPNG 61

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                    FGC + N G          +G+ G  R   S  SQL      RFSYCL + 
Sbjct: 62  VPVAVSELAFGCGDYNTGLFVSNE----SGIAGFGRGPQSLPSQLKV---GRFSYCLTL- 113

Query: 145 LPNGEYTSSYLKFGTDM---GYRRPST---QATKFINHP--NNFYYLSLKDISIDNERMN 196
               E  SS +  GT     G R  +T   Q+T  I +P    FYYLSL+ I++   R+ 
Sbjct: 114 --VTESKSSVVILGTPPDPDGLRAHTTGPFQSTPIIYNPLIPTFYYLSLEGITVGKTRLP 171

Query: 197 FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQ 255
           F    F +   G GG +IDSG+ LT     V+  L E+ V+   +F L +  + PE   +
Sbjct: 172 FDKSVFALKKDGSGGTVIDSGTSLTTLPEAVFELLQEELVA---QFPLPRYDNTPEVGDR 228

Query: 256 LCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV-APHDDLVALIGS 312
           LC+  P+   +   P +  +   A++ +  +N F+ + ++    L +    D  + LIG+
Sbjct: 229 LCFRRPKGGKQVPVPKLILHLAGADMDLPRDNYFVEEPDSGVMCLQINGAEDTTMVLIGN 288

Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
            QQ++   VYD+  + L F    C
Sbjct: 289 FQQQNMHVVYDVENNKLLFAPAQC 312


>gi|357130848|ref|XP_003567056.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 448

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 94/356 (26%), Positives = 157/356 (44%), Gaps = 37/356 (10%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP    LL++DTGS +++               ++DPR SS++ +  C  P C   + 
Sbjct: 105 VGTPPTPALLVIDTGSDVVWLQCKPCVHCYRQLSPLYDPRGSSTYAQTPCSPPQCRNPQT 164

Query: 52  VNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
            +     C Y + Y D S T G  A + +                 GC +DN G      
Sbjct: 165 CDGTTGGCGYRIVYGDASSTSGNLATDRLVF----SNDTSVGNVTLGCGHDNEGLF---- 216

Query: 109 DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST 168
            G+ AG+LG++R   SF +Q+     + F+YCL     +G  +SSYL FG       PS+
Sbjct: 217 -GSAAGLLGVARGNNSFATQVADSYGRYFAYCLGDRTRSGS-SSSYLVFGR-TAPEPPSS 273

Query: 169 QATKFINHPN--NFYYLSLKDISIDNERM-NFPPDTFDI-TVSGEGGCIIDSGSVLTYFH 224
             T   ++P   + YY+ +   S+  E +  F   +  +   +G GG ++DSG+ +T F 
Sbjct: 274 VFTPLRSNPRRPSLYYVDMVGFSVGGEPVTGFSNASLSLDPATGRGGVVVDSGTSITRFA 333

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE-DANLRID 282
            D Y  L + F +   +  + ++         CY L        P +  +F   A++ + 
Sbjct: 334 RDAYGALRDAFDARAAKVGMRKVGRGISVFDACYDLRGVAVADAPGVVLHFAGGADVALP 393

Query: 283 GENVFIIDYEN--HFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            EN  + +     H F L  A HD L ++IG+  Q+  R V+D+  + + F    C
Sbjct: 394 PENYLVPEESGRYHCFALEAAGHDGL-SVIGNVLQQRFRVVFDVENERVGFEPNGC 448


>gi|297842769|ref|XP_002889266.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297335107|gb|EFH65525.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 489

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 157/363 (43%), Gaps = 61/363 (16%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT--------- 47
           K + LI+DTGS L +               ++DP  SSS++ + C+   C          
Sbjct: 147 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATGNS 206

Query: 48  -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                +   V   C Y + Y D S T+G  A E+I +     G       +FGC  +N G
Sbjct: 207 GPCGGFNGVVKTTCEYVVSYGDGSYTRGDLASESIVL-----GDTKLENLVFGCGRNNKG 261

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  +G++GL R ++S +SQ        FSYCL   L +G   S  L FG D  
Sbjct: 262 LF-----GGASGLMGLGRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGTLSFGNDFS 313

Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             + ST    T  + +P   +FY L+L   SI    +         T+S   G +IDSG+
Sbjct: 314 VYKNSTSVFYTPLVQNPQLRSFYILNLTGASIGGVELK--------TLSFGRGILIDSGT 365

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
           V+T     +Y  +  +F+  F  F  A        +  C+ L    +   P++   FE +
Sbjct: 366 VITRLPPSIYKAVKTEFLKQFSGFPSAPGYSI---LDTCFNLTSYEDISIPTIKMIFEGN 422

Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           A L +D   VF  +  +     LA+A   +++ V +IG+ QQ++ R +YD   + L    
Sbjct: 423 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIAG 482

Query: 334 ENC 336
           ENC
Sbjct: 483 ENC 485


>gi|242081123|ref|XP_002445330.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
 gi|241941680|gb|EES14825.1| hypothetical protein SORBIDRAFT_07g009580 [Sorghum bicolor]
          Length = 543

 Score =  110 bits (276), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 98/366 (26%), Positives = 154/366 (42%), Gaps = 58/366 (15%)

Query: 7   GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
           G+P+  + +I+DTGS L +               +FDP  S+++  + C+   C      
Sbjct: 197 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACAASLKA 256

Query: 47  ---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
              T   C   NE+C Y + Y D S ++G  A +T+++     G A   G +FGC   N 
Sbjct: 257 ATGTPGSCGGGNERCYYALAYGDGSFSRGVLATDTVAL-----GGASLDGFVFGCGLSNR 311

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G       G  AG++GL R  +S +SQ        FSYCL  P       S  L  G D 
Sbjct: 312 GLF-----GGTAGLMGLGRTELSLVSQTALRYGGVFSYCL--PATTSGDASGSLSLGGDA 364

Query: 162 GYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
              R +T    T+ I  P    FY+L++   ++    +            G    +IDSG
Sbjct: 365 SSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA-------AQGLGASNVLIDSG 417

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYF 274
           +V+T     VY  +  +F     +F  A     P    +  CY L      + P +    
Sbjct: 418 TVITRLAPSVYRGVRAEFT---RQFAAAGYPTAPGFSILDTCYDLTGHDEVKVPLLTLRL 474

Query: 275 ED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           E  A + +D   + F++  +     LA+A   ++D   +IG+ QQ++ R VYD     L 
Sbjct: 475 EGGAEVTVDAAGMLFVVRKDGSQVCLAMASLSYEDQTPIIGNYQQKNKRVVYDTVGSRLG 534

Query: 331 FVKENC 336
           F  E+C
Sbjct: 535 FADEDC 540


>gi|357143850|ref|XP_003573078.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 431

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 92/367 (25%), Positives = 162/367 (44%), Gaps = 45/367 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP    + + DTGS L +               ++DP  SS+F  + C    C
Sbjct: 78  LMELAIGTPPVPFVALADTGSDLTWTQCQPCKLCFPQDTPVYDPSASSTFSPVPCSSATC 137

Query: 47  TYF----KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSND 99
                   C      C Y   Y+D + + G    ET+++     G+A+      FGC  D
Sbjct: 138 LPVLRSRNCSTPSSLCRYGYSYSDGAYSAGILGTETLTLGSSVPGQAVSVSDVAFGCGTD 197

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G   ++      G +GL R T+S ++QLG     +FSYCL     N    S +L  GT
Sbjct: 198 NGGDSLNS-----TGTVGLGRGTLSLLAQLG---VGKFSYCLT-DFFNSTLDSPFL-LGT 247

Query: 160 --DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
             ++     + Q+T  +  P N   Y +SL+ I++ + R+  P  TFD+  +  GG ++D
Sbjct: 248 LAELAPGPGAVQSTPLLQSPLNPSRYVVSLQGITLGDVRLPIPNKTFDLHANSTGGMVVD 307

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQL-AQLSDCPEPIQLCYFLPETFNRFPSMA--- 271
           SG+  +      +  + +       +  + A   D P     C+  P    + P M    
Sbjct: 308 SGTTFSILPESGFRVVVDHVAQVLGQPPVNASSLDSP-----CFPAPAGERQLPFMPDLV 362

Query: 272 -FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             +   A++R+  +N    + E+  F L +       +++G+ QQ++ + ++D+ +  LS
Sbjct: 363 LHFAGGADMRLHRDNYMSYNQEDSSFCLNIVGTTSTWSMLGNFQQQNIQMLFDMTVGQLS 422

Query: 331 FVKENCS 337
           F+  +CS
Sbjct: 423 FLPTDCS 429


>gi|356555042|ref|XP_003545848.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 431

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 99/356 (27%), Positives = 163/356 (45%), Gaps = 44/356 (12%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTP   V  I+DT S +I+               +FDP  S +++ + C    C   + 
Sbjct: 94  LGTPPFPVYGIVDTASDIIWVQCQLCETCYNDTSPMFDPSYSKTYKNLPCSSTTCKSVQG 153

Query: 51  --CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-SNDNHGFD 104
             C +++   C +T+ Y D S ++G    ET+++    +    F   + GC  N N  FD
Sbjct: 154 TSCSSDERKICEHTVNYKDGSHSQGDLIVETVTLGSYNDPFVHFPRTVIGCIRNTNVSFD 213

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
                    G++GL    +S + QL S I K+FSYCL  P+ +    SS LKFG      
Sbjct: 214 S-------IGIVGLGGGPVSLVPQLSSSISKKFSYCLA-PISD---RSSKLKFGDAAMVS 262

Query: 165 RPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              T +T+ +      FYYL+L+  S+ N R+ F   +     SG+G  IIDSG+  T  
Sbjct: 263 GDGTVSTRIVFKDWKKFYYLTLEAFSVGNNRIEF--RSSSSRSSGKGNIIIDSGTTFTVL 320

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLRI 281
             DVY KL        +  +L +  D  +   LCY    T+++   P +  +F  A++++
Sbjct: 321 PDDVYSKLESAVA---DVVKLERAEDPLKQFSLCY--KSTYDKVDVPVITAHFSGADVKL 375

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +  N FI+       L  ++      A+ G+  Q++    YDL   ++SF   +C+
Sbjct: 376 NALNTFIVASHRVVCLAFLSSQSG--AIFGNLAQQNFLVGYDLQRKIVSFKPTDCT 429


>gi|356499109|ref|XP_003518386.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 428

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 165/374 (44%), Gaps = 57/374 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G+P + V ++LDTGS L +          + F+P  SSS+    C+   CT    
Sbjct: 62  VSLTVGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTPTPCNSSICTTRTR 121

Query: 49  -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  C   N+ C   + YAD S  +G  A ET S+ G  +      G LFGC  D+ 
Sbjct: 122 DLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-----PGTLFGCM-DSA 175

Query: 102 GFDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSS 153
           G+  D   D    G++G++R ++S ++Q+      +FSYC+       V+ L +G    S
Sbjct: 176 GYTSDINEDSKTTGLMGMNRGSLSLVTQMS---LPKFSYCISGEDALGVLLLGDGTDAPS 232

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            L++        P   AT    + N   Y + L+ I +  + +  P   F    +G G  
Sbjct: 233 PLQY-------TPLVTATTSSPYFNRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 285

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRF 267
           ++DSG+  T+    VY  L ++F+   +   L ++ D P       + LCY  P +F   
Sbjct: 286 MVDSGTQFTFLLGSVYSSLKDEFLEQTKGV-LTRIED-PNFVFEGAMDLCYHAPASFAAV 343

Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
           P++   F  A +R+ GE  ++ +   + +       + DL+ +    IG   Q++    +
Sbjct: 344 PAVTLVFSGAEMRVSGERLLYRVSKGSDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEF 403

Query: 323 DLNIDLLSFVKENC 336
           DL    + F +  C
Sbjct: 404 DLLKSRVGFTQTTC 417


>gi|224080963|ref|XP_002306246.1| predicted protein [Populus trichocarpa]
 gi|222855695|gb|EEE93242.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score =  110 bits (276), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 89/357 (24%), Positives = 161/357 (45%), Gaps = 41/357 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  S+SF  ++C    C 
Sbjct: 45  VRIGVGSPPRSQYMVIDSGSDIVWVQCKPCTQCYHQTDPLFDPADSASFMGVSCSSAVCD 104

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C + +C Y + Y D S TKG  A ET+++     G+ +      GC + N G  
Sbjct: 105 QVDNAGCNSGRCRYEVSYGDGSSTKGTLALETLTL-----GRTVVQNVAIGCGHMNQGMF 159

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QL       FSYCLV  + N   ++ +L+FG++    
Sbjct: 160 VGAAGLLGL-----GGGSMSFVGQLSRERGNAFSYCLVSRVTN---SNGFLEFGSEA--- 208

Query: 165 RPSTQA-TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            P   A    I +P++  +YY+ L  + + + ++    D F++T  G GG ++D+G+ +T
Sbjct: 209 MPVGAAWIPLIRNPHSPSYYYIGLSGLGVGDMKVPISEDIFELTELGNGGVVMDTGTAVT 268

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANLR 280
            F +  Y    + F+   +   L + S        CY L    + R P+++FYF    + 
Sbjct: 269 RFPTVAYEAFRDAFID--QTGNLPRASGV-SIFDTCYNLFGFLSVRVPTVSFYFSGGPIL 325

Query: 281 IDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
               N F+I  ++   F  A AP    ++++G+ QQ   +   D   + + F    C
Sbjct: 326 TLPANNFLIPVDDAGTFCFAFAPSPSGLSILGNIQQEGIQISVDGANEFVGFGPNVC 382


>gi|168014386|ref|XP_001759733.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689272|gb|EDQ75645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 392

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 107/373 (28%), Positives = 157/373 (42%), Gaps = 50/373 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           V   +GTP +   LI+DTGS L +               ++ P  SS+F  + CD  +C 
Sbjct: 36  VDFSLGTPEQKFHLIVDTGSDLAFVQCAPCDLCYEQDGPLYQPSNSSTFTPVPCDSAECL 95

Query: 48  YFK------CVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                    C +          C Y  +Y D S T G  A+ET +V     G  + H A 
Sbjct: 96  LIPAPVGAPCSSSYPESPPQGACSYEYRYGDNSSTVGVFAYETATV----GGIRVNHVA- 150

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-PNGEYTS 152
           FGC N N G    A      GVLGL +  +SF SQ G   + +F+YCL   L P   ++S
Sbjct: 151 FGCGNRNQGSFVSA-----GGVLGLGQGALSFTSQAGYAFENKFAYCLTSYLSPTSVFSS 205

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
             L FG DM       Q T  +++P N   YY+ +  I    E +  P   + I   G G
Sbjct: 206 --LIFGDDMMSTIHDLQFTPLVSNPLNPSVYYVQIVRICFGGETLLIPDSAWKIDSVGNG 263

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNR-FP 268
           G I DSG+ +TY+    Y ++    ++ FE+     +    P+ + LC  +    +  +P
Sbjct: 264 GTIFDSGTTVTYWSPQAYARI----IAAFEKSVPYPRAPPSPQGLPLCVNVSGIDHPIYP 319

Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           S    F + A  R +  N FI    N   L  +    D   +IG+  Q++    YD    
Sbjct: 320 SFTIEFDQGATYRPNQGNYFIEVSPNIDCLAMLESSSDGFNVIGNIIQQNYLVQYDREEH 379

Query: 328 LLSFVKENCSDDS 340
            + F   NC   S
Sbjct: 380 RIGFAHANCDAPS 392


>gi|224286159|gb|ACN40790.1| unknown [Picea sitchensis]
          Length = 452

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 160/357 (44%), Gaps = 43/357 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           ++++  GTP + +  ++DTGS + +              IFDP KSSS++   CD   C 
Sbjct: 116 IIQVDFGTPKQSMYTLIDTGSDVAWIPCKQCQGCHSTAPIFDPAKSSSYKPFACDSQPCQ 175

Query: 48  YFK--CV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                C  N +C + + Y D +   G  A + I++     G        FGC+       
Sbjct: 176 EISGNCGGNSKCQFEVLYGDGTQVDGTLASDAITL-----GSQYLPNFSFGCA---ESLS 227

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
           ED         LG   +++   +    +    FSYCL    P+   +S  L  G +    
Sbjct: 228 EDTYSSPGLMGLGGGSLSLLTQAPTAELFGGTFSYCL----PSSSTSSGSLVLGKEAAVS 283

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             S + T  I  P+   FY+++LK IS+ N R++ P       ++  GG IIDSG+ +TY
Sbjct: 284 SSSLKFTTLIKDPSFPTFYFVTLKAISVGNTRISVPA----TNIASGGGTIIDSGTTITY 339

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFE-DANLR 280
                Y  L + F     R QL+ L   P E +  CY L  +    P++  + + + +L 
Sbjct: 340 LVPSAYKDLRDAF-----RQQLSSLQPTPVEDMDTCYDLSSSSVDVPTITLHLDRNVDLV 394

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +  EN+ I   E+    LA +  D   ++IG+ QQ++ R V+D+    + F +E C+
Sbjct: 395 LPKENILITQ-ESGLSCLAFSSTDSR-SIIGNVQQQNWRIVFDVPNSQVGFAQEQCA 449


>gi|449436215|ref|XP_004135889.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 496

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +G   +   LI+DTGS L +               +F+P  SSSF  + C+ P C   + 
Sbjct: 149 VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 208

Query: 51  -------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                  C N+    C Y + Y D S ++G    E +++     GK      +FGC  +N
Sbjct: 209 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL-----GKTEIDNFIFGCGRNN 263

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G       G  +G++GL+R  +S +SQ  S+    FSYCL  P      + S    G D
Sbjct: 264 KGLF-----GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLTLGGAD 316

Query: 161 MG-YRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIID 215
              ++  S  + T+ I +P  +NFY+L+L  ISI    +N P  +     S EG   ++D
Sbjct: 317 FSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-----SNEGVLSLLD 371

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMA 271
           SG+V+T     +Y    + F + FE+ Q +     P    +  C+ L   E  N  P++ 
Sbjct: 372 SGTVITRLSPSIY----KAFKAEFEK-QFSGYRTTPGFSILNTCFNLTGYEEVN-IPTVK 425

Query: 272 FYFE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           F FE +A + +D E VF     D        A   ++D   +IG+ QQ++ R +Y+    
Sbjct: 426 FIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKES 485

Query: 328 LLSFVKENCS 337
            + F  E CS
Sbjct: 486 KVGFAGEPCS 495


>gi|224092220|ref|XP_002309515.1| predicted protein [Populus trichocarpa]
 gi|222855491|gb|EEE93038.1| predicted protein [Populus trichocarpa]
          Length = 473

 Score =  110 bits (275), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 149/361 (41%), Gaps = 49/361 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPDC 46
           V + +GTP K   LI DTGS L +                  DP KS+S++ I+C    C
Sbjct: 135 VTVGLGTPKKEFTLIFDTGSDLTWTQCEPCAKTCYKQKEPRLDPTKSTSYKNISCSSAFC 194

Query: 47  TYF------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                     C +  C+Y ++Y D S + GF A ET+++        +F   LFGC   N
Sbjct: 195 KLLDTEGGESCSSPTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCGQQN 250

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     AG+LGL R  +S  SQ     KK FSYC    LP    +  YL FG  
Sbjct: 251 SGLFRGA-----AGLLGLGRTKLSLPSQTAQKYKKLFSYC----LPASSSSKGYLSFGGQ 301

Query: 161 MGYRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           +      T  ++ F + P  FY L + ++S+   ++     + D ++    G +IDSG+V
Sbjct: 302 VSKTVKFTPLSEDFKSTP--FYGLDITELSVGGNKL-----SIDASIFSTSGTVIDSGTV 354

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA- 277
           +T   S  Y  L   F      +     +D       CY F      + P +   F+   
Sbjct: 355 ITRLPSTAYSALSSAFQKLMTDY---PSTDGYSIFDTCYDFSKNETIKIPKVGVSFKGGV 411

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLV--ALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            + ID   +           LA A + D V  A+ G+ QQ+  + VYD     + F    
Sbjct: 412 EMDIDVSGILYPVNGLKKVCLAFAGNGDDVKAAIFGNTQQKTYQVVYDDAKGRVGFAPSG 471

Query: 336 C 336
           C
Sbjct: 472 C 472


>gi|449509162|ref|XP_004163513.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 417

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 166/370 (44%), Gaps = 61/370 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +G   +   LI+DTGS L +               +F+P  SSSF  + C+ P C   + 
Sbjct: 70  VGIGGQNSTLIVDTGSDLTWVQCLPCRLCYNQQEPLFNPSNSSSFLSLPCNSPTCVALQP 129

Query: 51  -------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                  C N+    C Y + Y D S ++G    E +++     GK      +FGC  +N
Sbjct: 130 TAGSSGLCSNKNSTSCDYQIDYGDGSYSRGELGFEKLTL-----GKTEIDNFIFGCGRNN 184

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G       G  +G++GL+R  +S +SQ  S+    FSYCL  P      + S    G D
Sbjct: 185 KGLF-----GGASGLMGLARSELSLVSQTSSLFGSVFSYCL--PTTGVGSSGSLTLGGAD 237

Query: 161 MG-YRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIID 215
              ++  S  + T+ I +P  +NFY+L+L  ISI    +N P  +     S EG   ++D
Sbjct: 238 FSNFKNISPISYTRMIQNPQMSNFYFLNLTGISIGGVNLNVPRLS-----SNEGVLSLLD 292

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP--ETFNRFPSMA 271
           SG+V+T     +Y    + F + FE+ Q +     P    +  C+ L   E  N  P++ 
Sbjct: 293 SGTVITRLSPSIY----KAFKAEFEK-QFSGYRTTPGFSILNTCFNLTGYEEVN-IPTVK 346

Query: 272 FYFE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           F FE +A + +D E VF     D        A   ++D   +IG+ QQ++ R +Y+    
Sbjct: 347 FIFEGNAEMIVDVEGVFYFVKSDASQICLAFASLGYEDQTMIIGNYQQKNQRVIYNSKES 406

Query: 328 LLSFVKENCS 337
            + F  E CS
Sbjct: 407 KVGFAGEPCS 416


>gi|356532674|ref|XP_003534896.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 446

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 99/360 (27%), Positives = 167/360 (46%), Gaps = 44/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IG PS   L+++DTGS +++               +FDP  SS+F  + C  P C
Sbjct: 102 LVNLSIGQPSIPQLVVMDTGSDILWIMCNPCTNCDNHLGLLFDPSMSSTFSPL-CKTP-C 159

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
            +  C  +   +T+ Y D S   G    + +      EG +     + GC + N GF+ D
Sbjct: 160 GFKGCKCDPIPFTISYVDNSSASGTFGRDILVFETTDEGTSQISDVIIGCGH-NIGFNSD 218

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYR 164
                  G+LGL+    S  +Q+G    ++FSYC+  +  P   Y    L  G D+ GY 
Sbjct: 219 P---GYNGILGLNNGPNSLATQIG----RKFSYCIGNLADPYYNYNQLRLGEGADLEGYS 271

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
            P       + H   FYY++++ IS+  +R++   +TF++  +G GG I+DSG+ +TY  
Sbjct: 272 TPFE-----VYH--GFYYVTMEGISVGEKRLDIALETFEMKRNGTGGVILDSGTTITYLV 324

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRI 281
              +  L+ + V    ++   Q+     P +LCY+  +      FP + F+F D A+L +
Sbjct: 325 DSAHKLLYNE-VRNLLKWSFRQVIFENAPWKLCYYGIISRDLVGFPVVTFHFVDGADLAL 383

Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           D  + F     +  F + V+P   L      ++IG   Q+     YDL    + F + +C
Sbjct: 384 DTGSFF--SQRDDIFCMTVSPASILNTTISPSVIGLLAQQSYNVGYDLVNQFVYFQRIDC 441


>gi|357114697|ref|XP_003559132.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 416

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 91/321 (28%), Positives = 143/321 (44%), Gaps = 26/321 (8%)

Query: 27  IFDPRKSSSFQKINCDHPDCTY-FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKG-E 84
           +F P  S +F  ++ + P CT  ++     C +   +A      G+ + +T  +   G  
Sbjct: 110 LFSPAASPTFHGVHSNDPVCTAPYRPTANGCSFRFPFAS-----GYLSRDTFHLRNGGLS 164

Query: 85  GKAIFH---GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
           G A      G +FGC++   GF  D   G L GVL LS + +S ++QL +    RFSYCL
Sbjct: 165 GGAPIESVPGIMFGCAHSVAGFHND---GTLGGVLSLSHLRLSLLTQLSARAGGRFSYCL 221

Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPP 199
             P P       +L+ G D+    P +  T       +   YYLSL  I++  +R+   P
Sbjct: 222 --PKPTQGNPHGFLRLGADVLPPLPHSHMTALTVRSGSAPDYYLSLVGITLAEKRLRIDP 279

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
             F    +G GGC I+  + +T      Y  +    V+Y +     ++   P      +F
Sbjct: 280 RVF---AAGRGGCSINPAATITAIMEPAYLVVERALVAYMKELGSDRVKKGPPGGGALFF 336

Query: 260 ---LPETFNRFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
                    R PSMAF+F+D A L    E +F +     +F++    +   V  IG+ QQ
Sbjct: 337 DRMYKSVQARLPSMAFHFKDGAELWFTPEQLFEVHGMVAWFMMVGKGYRRTV--IGAPQQ 394

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
            +TRF +D+    LSF  E C
Sbjct: 395 VNTRFTFDVAAGRLSFASELC 415


>gi|15242803|ref|NP_198319.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75130158|sp|Q6XBF8.1|CDR1_ARATH RecName: Full=Aspartic proteinase CDR1; AltName: Full=Protein
           CONSTITUTIVE DISEASE RESISTANCE 1; Flags: Precursor
 gi|37935737|gb|AAP72988.1| CDR1 [Arabidopsis thaliana]
 gi|91806924|gb|ABE66189.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|109946613|gb|ABG48485.1| At5g33340 [Arabidopsis thaliana]
 gi|332006513|gb|AED93896.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 437

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 38/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP   ++ I DTGS L++               +FDP+ SS+++ ++C    C
Sbjct: 91  LMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQC 150

Query: 47  TYFK----CV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           T  +    C   +  C Y++ Y D S TKG  A +T+++             + GC ++N
Sbjct: 151 TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNN 210

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G      +   +G++GL    +S I QLG  I  +FSYCLV PL + +  +S + FGT+
Sbjct: 211 AG----TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLV-PLTSKKDQTSKINFGTN 265

Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                    +T  I   +   FYYL+LK IS+ ++++ +   +   + S EG  IIDSG+
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            LT   ++ Y +L +   S  +     +  D    + LCY       + P +  +F+ A+
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDA---EKKQDPQSGLSLCYSATGDL-KVPVITMHFDGAD 378

Query: 279 LRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +++D  N F+   E+   F    +P     ++ G+  Q +    YD     +SF   +C+
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435


>gi|242092878|ref|XP_002436929.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
 gi|241915152|gb|EER88296.1| hypothetical protein SORBIDRAFT_10g011180 [Sorghum bicolor]
          Length = 505

 Score =  110 bits (275), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/372 (28%), Positives = 160/372 (43%), Gaps = 64/372 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V +  G+P++   L +DTGS + +                +FDP KS+++  + C HP 
Sbjct: 162 VVTVGFGSPAQNYTLSIDTGSDVSWIQCLPCSGHCYKQHDPVFDPTKSATYSAVPCGHPQ 221

Query: 46  CTYF--KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C     KC N   C+Y + Y D S T G  +HET+S+    +      G  FGC   N G
Sbjct: 222 CAAAGGKCSNSGTCLYKVTYGDGSSTAGVLSHETLSLSSTRD----LPGFAFGCGQTNLG 277

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G + G++GL R  +S  SQ  +     FSYC    LP+ + T  YL     MG
Sbjct: 278 -----EFGGVDGLVGLGRGALSLPSQAAATFGATFSYC----LPSYDTTHGYLT----MG 324

Query: 163 YRRPST-------QATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
              P+        Q T  I   +  + Y++ +  I I    +  PP  F        G +
Sbjct: 325 STTPAASNDDDDVQYTAMIQKEDYPSLYFVEVVSIDIGGYILPVPPTVFT-----RDGTL 379

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSM 270
            DSG++LTY   + Y  L ++F     +F + Q    P  +P   CY F        P++
Sbjct: 380 FDSGTILTYLPPEAYASLRDRF-----KFTMTQYKPAPAYDPFDTCYDFTGHNAIFMPAV 434

Query: 271 AFYFEDANLRIDGENVFIIDYENHFF----LLAVAPHDDLVA--LIGSQQQRDTRFVYDL 324
           AF F D  +  D   V I+ Y +        LA  P    +   +IG+ QQR T  +YD+
Sbjct: 435 AFKFSDGAV-FDLSPVAILIYPDDTAPATGCLAFVPRPSTMPFNIIGNTQQRGTEVIYDV 493

Query: 325 NIDLLSFVKENC 336
             + + F +  C
Sbjct: 494 AAEKIGFGQFTC 505


>gi|226492391|ref|NP_001140482.1| uncharacterized protein LOC100272542 precursor [Zea mays]
 gi|224030447|gb|ACN34299.1| unknown [Zea mays]
 gi|414887506|tpg|DAA63520.1| TPA: hypothetical protein ZEAMMB73_432695 [Zea mays]
          Length = 512

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 169/380 (44%), Gaps = 57/380 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ +++GTP +   +I+DTGS L +               +FDP  SSS++ + C  P C
Sbjct: 147 LMDVYVGTPPRRFQMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNLTCGDPRC 206

Query: 47  TYF------------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGA 92
            +             +   + C Y   Y DQS + G  A E  T+++   G    +  G 
Sbjct: 207 GHVAPPEAPAPRACRRPGEDPCPYYYWYGDQSNSTGDLALESFTVNLTAPGASSRV-DGV 265

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYT 151
           +FGC + N G          AG+LGL R  +SF SQL ++     FSYCLV    +G   
Sbjct: 266 VFGCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGGHTFSYCLV---DHGSDV 317

Query: 152 SSYLKFGTDMGYR---RPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
           +S + FG D        P  + T F    +  + FYY+ L  + +  E +N   DT+D +
Sbjct: 318 ASKVVFGEDDALALAAHPRLKYTAFAPASSPADTFYYVRLTGVLVGGELLNISSDTWDAS 377

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----- 260
             G GG IIDSG+ L+YF    Y  +   F+          + D P  +  CY +     
Sbjct: 378 EGGSGGTIIDSGTTLSYFVEPAYQVIRRAFIDRMSG-SYPPVPDFPV-LSPCYNVSGVER 435

Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
           PE     P ++  F D A      EN FI +D +    L  +      +++IG+ QQ++ 
Sbjct: 436 PEV----PELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 491

Query: 319 RFVYDLNIDLLSFVKENCSD 338
              YDL+ + L F    C++
Sbjct: 492 HVAYDLHNNRLGFAPRRCAE 511


>gi|116831531|gb|ABK28718.1| unknown [Arabidopsis thaliana]
          Length = 438

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 170/360 (47%), Gaps = 38/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP   ++ I DTGS L++               +FDP+ SS+++ ++C    C
Sbjct: 91  LMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSSQC 150

Query: 47  TYFK----CV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           T  +    C   +  C Y++ Y D S TKG  A +T+++             + GC ++N
Sbjct: 151 TALENQASCSTNDNTCSYSLSYGDNSYTKGNIAVDTLTLGSSDTRPMQLKNIIIGCGHNN 210

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G      +   +G++GL    +S I QLG  I  +FSYCLV PL + +  +S + FGT+
Sbjct: 211 AG----TFNKKGSGIVGLGGGPVSLIKQLGDSIDGKFSYCLV-PLTSKKDQTSKINFGTN 265

Query: 161 MGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                    +T  I   +   FYYL+LK IS+ ++++ +   +   + S EG  IIDSG+
Sbjct: 266 AIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQY---SGSDSESSEGNIIIDSGT 322

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            LT   ++ Y +L +   S  +     +  D    + LCY       + P +  +F+ A+
Sbjct: 323 TLTLLPTEFYSELEDAVASSIDA---EKKQDPQSGLSLCYSATGDL-KVPVITMHFDGAD 378

Query: 279 LRIDGENVFIIDYENHF-FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +++D  N F+   E+   F    +P     ++ G+  Q +    YD     +SF   +C+
Sbjct: 379 VKLDSSNAFVQVSEDLVCFAFRGSPS---FSIYGNVAQMNFLVGYDTVSKTVSFKPTDCA 435


>gi|226494448|ref|NP_001141341.1| uncharacterized protein LOC100273432 precursor [Zea mays]
 gi|194704078|gb|ACF86123.1| unknown [Zea mays]
 gi|413953775|gb|AFW86424.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 471

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 159/364 (43%), Gaps = 54/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + +L +GTPS    +++DTGS+L +                +FDPR SS++  + C    
Sbjct: 135 VTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYTSVRCSASQ 194

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C   +           +  C+Y   Y D S + G+ + +T+S      G   +    +GC
Sbjct: 195 CDELQAATLNPSACSASNVCIYQASYGDSSFSVGYLSTDTVSF-----GSTSYPSFYYGC 249

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL      G     YL 
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTG-----YLS 299

Query: 157 FGT-DMG-YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G  + G Y   +  A+  ++   + Y+++L  +S+    +   P  +    +     II
Sbjct: 300 IGPYNTGHYYSYTPMASSSLD--ASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----II 352

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           DSG+V+T   + V+  L +      +    AQ +     +  C+    +  R P++   F
Sbjct: 353 DSGTVITRLPTAVHTALSKAVA---QAMAGAQRAPAFSILDTCFEGQASQLRVPTVVMAF 409

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
              A++++   NV +ID ++    LA AP D   A+IG+ QQ+    +YD+    + F  
Sbjct: 410 AGGASMKLTTRNV-LIDVDDSTTCLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSA 467

Query: 334 ENCS 337
             CS
Sbjct: 468 GGCS 471


>gi|17979392|gb|AAL49921.1| unknown protein [Arabidopsis thaliana]
          Length = 439

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 101/362 (27%), Positives = 157/362 (43%), Gaps = 42/362 (11%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC------ 46
           +GTP+K   +++DTGS L +              +F   +S SF+ + C    C      
Sbjct: 90  VGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMN 149

Query: 47  ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNH 101
               T     +  C Y  +YAD S  +G  A ETI+V G   G+ A   G L GCS+   
Sbjct: 150 LFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV-GLTNGRMARLPGHLIGCSSSFT 208

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      DG    VLGL+    SF S   S+   +FSYCLV  L N +  S+YL FG+  
Sbjct: 209 GQSFQGADG----VLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSN-KNVSNYLIFGSSR 263

Query: 162 GYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             +    + T   +     FY +++  IS+  + ++ P   +D T SG GG I+DSG+ L
Sbjct: 264 STKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDAT-SG-GGTILDSGTSL 321

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN--RFPSMAFYFED 276
           T      Y ++      Y    +  +    PE  PI+ C+     FN  + P + F+ + 
Sbjct: 322 TLLADAAYKQVVTGLARYLVELKRVK----PEGVPIEYCFSFTSGFNVSKLPQLTFHLKG 377

Query: 277 ANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
                     +++D       L  V+       +IG+  Q++  + +DL    LSF    
Sbjct: 378 GARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSA 437

Query: 336 CS 337
           C+
Sbjct: 438 CT 439


>gi|15223368|ref|NP_171637.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein [Arabidopsis thaliana]
 gi|22135930|gb|AAM91547.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
 gi|30387595|gb|AAP31963.1| At1g01300 [Arabidopsis thaliana]
 gi|332189147|gb|AEE27268.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP++ V ++LDTGS +++               IFDPRKS ++  I C  P C 
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D S T G  + ET++       +    G   GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            +  +SF  Q G    ++FSYCLV    + + +S  + FG    
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R    + T  +++P  + FYY+ L  IS+   R+       F +   G GG IIDSG+ 
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 369

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
           +T      Y  + + F     R     L   P+      C+ L      + P++  +F  
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG 424

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    N  F  A A     +++IG+ QQ+  R VYDL    + F    C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

Query: 337 S 337
           +
Sbjct: 485 A 485


>gi|115438214|ref|NP_001043485.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|15290061|dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|113533016|dbj|BAF05399.1| Os01g0598600 [Oryza sativa Japonica Group]
 gi|125526702|gb|EAY74816.1| hypothetical protein OsI_02707 [Oryza sativa Indica Group]
          Length = 500

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 109/373 (29%), Positives = 157/373 (42%), Gaps = 61/373 (16%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           ++ +GTP    L++LDTGS +++               +FDPR S S+  ++C  P C  
Sbjct: 150 KIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCRR 209

Query: 49  FKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                     + C+Y + Y D SVT G  A ET++      G  +   AL GC +DN G 
Sbjct: 210 LDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---SGARVPRVAL-GCGHDNEGL 265

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTD 160
              A            R ++SF SQ+     + FSYCLV       +    SS + FG+ 
Sbjct: 266 FVAAAGLLGL-----GRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGS- 319

Query: 161 MGYRRPSTQA--TKFINHPN--NFYYLSLKDISIDNER--------MNFPPDTFDITVSG 208
            G   PS  A  T  + +P    FYY+ L  IS+   R        +   P T      G
Sbjct: 320 -GAVGPSAAASFTPMVKNPRMETFYYVQLMGISVGGARVPGVAVSDLRLDPST------G 372

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPE-TF 264
            GG I+DSG+ +T      Y  L + F     R   A L   P    L   CY L     
Sbjct: 373 RGGVIVDSGTSVTRLARPAYAALRDAF-----RAAAAGLRLSPGGFSLFDTCYDLSGLKV 427

Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
            + P+++ +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R V+D
Sbjct: 428 VKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFD 487

Query: 324 LNIDLLSFVKENC 336
            +   L FV + C
Sbjct: 488 GDGQRLGFVPKGC 500


>gi|195625122|gb|ACG34391.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 471

 Score =  110 bits (274), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 159/364 (43%), Gaps = 54/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + +L +GTPS    +++DTGS+L +                +FDPR SS++  + C    
Sbjct: 135 VTQLGLGTPSTSYAMVVDTGSSLTWLQCSPCVVSCHRQVGPLFDPRASSTYASVRCSASQ 194

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C   +           +  C+Y   Y D S + G  + +T+S      G   +    +GC
Sbjct: 195 CDELQAATLNPSACSASNVCIYQASYGDSSFSVGSLSTDTVSF-----GSTRYPSFYYGC 249

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL      G     YL 
Sbjct: 250 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFSYCLPTAASTG-----YLS 299

Query: 157 FGT-DMG-YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G  + G Y   +  A+  ++   + Y+++L  +S+    +   P  +    +     II
Sbjct: 300 IGPYNTGHYYSYTPMASSSLD--ASLYFITLSGMSVGGSPLAVSPSEYSSLPT-----II 352

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           DSG+V+T   + V+  L +      +    AQ +     +  C+    +  R P++A  F
Sbjct: 353 DSGTVITRLPTAVHTALSKAVA---QAMAGAQRAPAFSILDTCFEGQASQLRVPTVAMAF 409

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
              A++++   NV +ID ++    LA AP D   A+IG+ QQ+    +YD+    + F  
Sbjct: 410 AGGASMKLTTRNV-LIDVDDSTTCLAFAPTDS-TAIIGNTQQQTFSVIYDVAQSRIGFSA 467

Query: 334 ENCS 337
             CS
Sbjct: 468 GGCS 471


>gi|357118076|ref|XP_003560785.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 477

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 155/363 (42%), Gaps = 50/363 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V +  GTP++   +ILDTGS L +                 FDP KSSS+  + C  P 
Sbjct: 138 VVVVGFGTPAQTAAIILDTGSDLSWIQCKPCSGHCYRQHDPDFDPAKSSSYAAVPCGTPV 197

Query: 46  CTYFK--CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           C      C    C+Y ++Y D S T G  + +T++        + F G  FGC   N G 
Sbjct: 198 CAAAGGMCNGTTCLYGVQYGDGSSTTGVLSRDTLTF----NSSSKFTGFTFGCGEKNIG- 252

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
                 G + G+LGL R  +S  SQ        FSYCL    P+   T  YL  G     
Sbjct: 253 ----DFGEVDGLLGLGRGKLSLPSQAAPSFGGVFSYCL----PSYNTTPGYLNIGATKPT 304

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                Q T  I  P   +FY++ L  I+I    +  PP  F  T     G ++DSG++LT
Sbjct: 305 STVPVQYTAMIKKPQYPSFYFIELVSINIGGYILPVPPSVFTKT-----GTLLDSGTILT 359

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFEDA- 277
           Y     Y  L ++F     +F +      P  EP+  CY F  +     P+++F F D  
Sbjct: 360 YLPPPAYTSLRDRF-----KFTMQGNKPAPPYEPLDTCYDFTGQGAIVIPAVSFNFSDGA 414

Query: 278 --NLRIDGENVFIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
             +L   G  +F  D +     LA    P     +++G+ QQR    +YD+    + F+ 
Sbjct: 415 VFDLDFYGIMIFPDDAKPLIGCLAFVSRPAAMPFSIVGNTQQRAAEVIYDVPSQKIGFIP 474

Query: 334 ENC 336
            +C
Sbjct: 475 ISC 477


>gi|30682289|ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis
           thaliana]
 gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 461

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 157/360 (43%), Gaps = 38/360 (10%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC------ 46
           +GTP+K   +++DTGS L +              +F   +S SF+ + C    C      
Sbjct: 112 VGTPAKKFRVVVDTGSELTWVNCRYRARGKDNRRVFRADESKSFKTVGCLTQTCKVDLMN 171

Query: 47  ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNH 101
               T     +  C Y  +YAD S  +G  A ETI+V G   G+ A   G L GCS+   
Sbjct: 172 LFSLTTCPTPSTPCSYDYRYADGSAAQGVFAKETITV-GLTNGRMARLPGHLIGCSSSFT 230

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      DG    VLGL+    SF S   S+   +FSYCLV  L N +  S+YL FG+  
Sbjct: 231 GQSFQGADG----VLGLAFSDFSFTSTATSLYGAKFSYCLVDHLSN-KNVSNYLIFGSSR 285

Query: 162 GYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             +    + T   +     FY +++  IS+  + ++ P   +D T SG GG I+DSG+ L
Sbjct: 286 STKTAFRRTTPLDLTRIPPFYAINVIGISLGYDMLDIPSQVWDAT-SG-GGTILDSGTSL 343

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--RFPSMAFYFEDAN 278
           T      Y ++      Y    +L ++     PI+ C+     FN  + P + F+ +   
Sbjct: 344 TLLADAAYKQVVTGLARYL--VELKRVKPEGVPIEYCFSFTSGFNVSKLPQLTFHLKGGA 401

Query: 279 LRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                   +++D       L  V+       +IG+  Q++  + +DL    LSF    C+
Sbjct: 402 RFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIGNIMQQNYLWEFDLMASTLSFAPSACT 461


>gi|297848386|ref|XP_002892074.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297337916|gb|EFH68333.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 485

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP++ V ++LDTGS +++               IFDPRKS ++  I C  P C 
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D S T G  + ET++       +    G   GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            +  +SF  Q G    ++FSYCLV    + + +S  + FG    
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R    + T  +++P  + FYY+ L  IS+   R+       F +   G GG IIDSG+ 
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVELLGISVGGTRVPGVAASLFKLDQIGNGGVIIDSGTS 369

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
           +T      Y  + + F     R     L   P+      C+ L      + P++  +F  
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKALKRAPDFSLFDTCFDLSNMNEVKVPTVVLHFRG 424

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    N  F  A A     +++IG+ QQ+  R VYDL    + F    C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

Query: 337 S 337
           +
Sbjct: 485 A 485


>gi|413952718|gb|AFW85367.1| hypothetical protein ZEAMMB73_231535 [Zea mays]
          Length = 443

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 106/379 (27%), Positives = 153/379 (40%), Gaps = 70/379 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +VRL +GTP + V L LDTGS L++               + DP  SS++  + C    C
Sbjct: 85  LVRLAVGTPRRPVALTLDTGSDLVWTQCAPCRDCFDQDLPVLDPAASSTYAALPCGAARC 144

Query: 47  ---TYFKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA--LFG 95
               +  C       +  C+Y   Y D+S+T G  A +  +    G      H     FG
Sbjct: 145 RALPFTSCGVRTLGNHRSCIYAYHYGDKSLTVGEIATDRFTFGDSGGSGESLHTRRLTFG 204

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C + N G  +        G+ G  R   S  SQL       FSYC        E  SS +
Sbjct: 205 CGHLNKGVFQSNE----TGIAGFGRGRWSLPSQLNVT---SFSYCFTSMF---ESKSSLV 254

Query: 156 KFGTDMG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
             G               + T  + +P+  + Y+LSLK IS+   R+  P   F  T   
Sbjct: 255 TLGGSPAALYSHAHSGEVRTTPILKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST--- 311

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPET 263
               IIDSG+ +T    +VY  +  +F         AQ+   P  ++     LC+ LP T
Sbjct: 312 ----IIDSGASITTLPEEVYEAVKAEFA--------AQVGLPPSGVEGSALDLCFALPVT 359

Query: 264 --FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLVALIGSQQQRD 317
             + R   PS+  + E A+  +   N    D        +L  AP +  V  IG+ QQ++
Sbjct: 360 ALWRRPAVPSLTLHLEGADWELPRSNYVFEDLGARVMCIVLDAAPGEQTV--IGNFQQQN 417

Query: 318 TRFVYDLNIDLLSFVKENC 336
           T  VYDL  D LSF    C
Sbjct: 418 THVVYDLENDRLSFAPARC 436


>gi|50508275|dbj|BAD32124.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 451

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/380 (27%), Positives = 159/380 (41%), Gaps = 64/380 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + L IGTP     ++ DTGS+LI+                F P  SS+F K+ C    C 
Sbjct: 92  MNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLCQ 151

Query: 48  -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                Y  C    CVY   Y     T G+ A ET+ V     G A F G  FGCS +N  
Sbjct: 152 FLTSPYLTCNATGCVYYYPYG-MGFTAGYLATETLHV-----GGASFPGVAFGCSTENGV 205

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            +  +      G++GL R  +S +SQ+G     RFSYCL      G+   S + FG+   
Sbjct: 206 GNSSS------GIVGLGRSPLSLVSQVG---VGRFSYCLRSDADAGD---SPILFGSLAK 253

Query: 163 YRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSGE----GGCII 214
               + Q+T  + +P    +++YY++L  I++    +     TF  T        GG I+
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITVGATDLPVTSTTFGFTRGAGAGLVGGTIV 313

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCY-----------FLPE 262
           DSG+ LTY   + Y  +   F+S      L    +       LC+            +P 
Sbjct: 314 DSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFGFDLCFDATAAGGGSGVPVPT 373

Query: 263 TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLAVAPHDDL-VALIGSQQQRDT 318
              RF   A Y   A  R     V  +D +       LL +   + L +++IG+  Q D 
Sbjct: 374 LVLRFAGGAEY---AVRRRSYVGVVAVDSQGRAAVECLLVLPASEKLSISIIGNVMQMDL 430

Query: 319 RFVYDLNIDLLSFVKENCSD 338
             +YDL+  + SF   +C++
Sbjct: 431 HVLYDLDGGMFSFAPADCAN 450


>gi|116310064|emb|CAH67085.1| H0818E04.2 [Oryza sativa Indica Group]
 gi|116310187|emb|CAH67199.1| OSIGBa0152K17.11 [Oryza sativa Indica Group]
          Length = 464

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 170/389 (43%), Gaps = 71/389 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V+L IGTP       +DT S LI+               +F+PR SS++  + C    C
Sbjct: 90  LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149

Query: 47  TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 +C    +E C YT  Y+  + T+G  A + + +     G+  F G  FGCS  +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     +GV+GL R  +S +SQL     +RF+YCL    P        L  G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255

Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
               R +T   A      P   ++YYL+L  + I +  M+                    
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATAPAPAP 315

Query: 198 --PPDTFDITV--SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
              P+   + V  +   G IID  S +T+  + +Y    ++ V+  E   +L + +    
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371

Query: 253 PIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLV 307
            + LC+ LP+   F+R   P++A  F+   LR+D   +F  D E+    L V   +   V
Sbjct: 372 GLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSV 431

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +++G+ QQ++ + +Y+L    ++FV+  C
Sbjct: 432 SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|115458646|ref|NP_001052923.1| Os04g0448500 [Oryza sativa Japonica Group]
 gi|38344830|emb|CAD40872.2| OSJNBa0064H22.11 [Oryza sativa Japonica Group]
 gi|113564494|dbj|BAF14837.1| Os04g0448500 [Oryza sativa Japonica Group]
          Length = 464

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 102/389 (26%), Positives = 170/389 (43%), Gaps = 71/389 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V+L IGTP       +DT S LI+               +F+PR SS++  + C    C
Sbjct: 90  LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149

Query: 47  TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 +C    +E C YT  Y+  + T+G  A + + +     G+  F G  FGCS  +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     +GV+GL R  +S +SQL     +RF+YCL    P        L  G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255

Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
               R +T   A      P   ++YYL+L  + I +  M+                    
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRAMSLPPTTTTTATATATAPAPAP 315

Query: 198 --PPDTFDITV--SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
              P+   + V  +   G IID  S +T+  + +Y    ++ V+  E   +L + +    
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371

Query: 253 PIQLCYFLPE--TFNRF--PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLV 307
            + LC+ LP+   F+R   P++A  F+   LR+D   +F  D E+    L V   +   V
Sbjct: 372 GLDLCFILPDGVAFDRVYVPAVALAFDGRWLRLDKARLFAEDRESGMMCLMVGRAEAGSV 431

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +++G+ QQ++ + +Y+L    ++FV+  C
Sbjct: 432 SILGNFQQQNMQVLYNLRRGRVTFVQSPC 460


>gi|413953792|gb|AFW86441.1| hypothetical protein ZEAMMB73_342504 [Zea mays]
          Length = 459

 Score =  109 bits (273), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 152/371 (40%), Gaps = 67/371 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCD-- 42
           +V + +GTP+   +L++DTGS L +                 +FDP +SS++  I C+  
Sbjct: 121 VVTVGLGTPAVSQVLLIDTGSDLSWVQCAPCNSTTCYPQKDPLFDPSRSSTYAPIPCNTD 180

Query: 43  ----------HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
                       DCT       QC Y + Y D S T G  ++ET++ +  G     FH  
Sbjct: 181 ACRDLTRDGYGSDCTSGSGGGAQCGYAITYGDGSQTTGVYSNETLT-MAPGVTVKDFH-- 237

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
            FGC     G D+D  +    G+LGL     S + Q  S+    FSYC    LP     +
Sbjct: 238 -FGC-----GHDQDGPNDKYDGLLGLGGAPESLVVQTSSVYGGAFSYC----LPAANDQA 287

Query: 153 SYLKFGTDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
            +L  G       P   A+ F+  P       FY +++  I++  E ++ PP  F     
Sbjct: 288 GFLALGA------PVNDASGFVFTPMVREQQTFYVVNMTGITVGGEPIDVPPSAF----- 336

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
             GG IIDSG+V+T      Y  L   F      + L    +    +  CY F   +   
Sbjct: 337 -SGGMIIDSGTVVTELQHTAYAALQAAFRKAMAAYPLLPNGE----LDTCYNFTGHSNVT 391

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            P +A  F   A + +D  +  ++D  N        P D+   ++G+  QR    +YD+ 
Sbjct: 392 VPRVALTFSGGATVDLDVPDGILLD--NCLAFQEAGP-DNQPGILGNVNQRTLEVLYDVG 448

Query: 326 IDLLSFVKENC 336
              + F  + C
Sbjct: 449 HGRVGFGADAC 459


>gi|15228618|ref|NP_191741.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6850873|emb|CAB71112.1| putative protein [Arabidopsis thaliana]
 gi|332646739|gb|AEE80260.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 483

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 105/370 (28%), Positives = 155/370 (41%), Gaps = 58/370 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +RL +GTP+  V ++LDTGS +++              AIFDP+KS +F  + C    C 
Sbjct: 137 MRLGVGTPATNVYMVLDTGSDVVWLQCSPCKACYNQTDAIFDPKKSKTFATVPCGSRLCR 196

Query: 48  YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-----FG 95
                 +CV  +   C+Y + Y D S T+G  + ET++          FHGA       G
Sbjct: 197 RLDDSSECVTRRSKTCLYQVSYGDGSFTEGDFSTETLT----------FHGARVDHVPLG 246

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV--IPLPNGEYTSS 153
           C +DN G    A            R  +SF SQ  +    +FSYCLV      +     S
Sbjct: 247 CGHDNEGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPS 301

Query: 154 YLKFGTDMGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGE 209
            + FG       P T   T  + +P  + FYYL L  IS+   R+       F +  +G 
Sbjct: 302 TIVFGNAA---VPKTSVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGN 358

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNR 266
           GG IIDSG+ +T      Y  L + F     R    +L   P       C+ L   T  +
Sbjct: 359 GGVIIDSGTSVTRLTQPAYVALRDAF-----RLGATKLKRAPSYSLFDTCFDLSGMTTVK 413

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
            P++ F+F    + +   N  I       F  A A     +++IG+ QQ+  R  YDL  
Sbjct: 414 VPTVVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVG 473

Query: 327 DLLSFVKENC 336
             + F+   C
Sbjct: 474 SRVGFLSRAC 483


>gi|242045564|ref|XP_002460653.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
 gi|241924030|gb|EER97174.1| hypothetical protein SORBIDRAFT_02g032590 [Sorghum bicolor]
          Length = 525

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 113/390 (28%), Positives = 168/390 (43%), Gaps = 69/390 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ +++GTP +   +I+DTGS L +               +FDP  SSS++ + C    C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQRGPVFDPAASSSYRNVTCGDHRC 211

Query: 47  ---------------TYFKCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIF 89
                          T  +   + C Y   Y DQS T G  A E  T+++   G  + + 
Sbjct: 212 GHVAPPPEPEASSPRTCRRPGEDPCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV- 270

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
            G +FGC + N G          AG+LGL R  +SF SQL ++    FSYCLV    +G 
Sbjct: 271 DGVVFGCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLV---DHGS 322

Query: 150 YTSSYLKFGTD----MGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPP 199
              S + FG D         P  + T F          + FYY+ LK + +  E +N   
Sbjct: 323 DVGSKVVFGEDDDALALAAHPQLKYTAFAPASSSSSPADTFYYVKLKGVLVGGELLNISS 382

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PI-QLC 257
           DT+D+   G GG IIDSG+ L+YF    Y  +   F+    R         PE P+   C
Sbjct: 383 DTWDVGKDGSGGTIIDSGTTLSYFVEPAYQVIRHAFMDRMSR----SYPLVPEFPVLSPC 438

Query: 258 YFL-----PETFNRFPSMAFYFED-ANLRIDGENVFI---IDYENHFFLLAVAPHDDLVA 308
           Y +     PE     P ++  F D A      EN FI    D  +   L  +      ++
Sbjct: 439 YNVSGVERPEV----PELSLLFADGAVWDFPAENYFIRLDPDGGSIMCLAVLGTPRTGMS 494

Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           +IG+ QQ++   VYDL  + L F    C++
Sbjct: 495 IIGNFQQQNFHVVYDLQNNRLGFAPRRCAE 524


>gi|297820186|ref|XP_002877976.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297323814|gb|EFH54235.1| hypothetical protein ARALYDRAFT_906847 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 425

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +L+ LDT +   +             +FDP KSSS + + C+ P C  
Sbjct: 89  IVRANIGTPAQAMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148

Query: 49  FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                  V++ C + M Y   ++ + +   +T+++        +     FGC N   G  
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSAI-EAYLTQDTLTL-----ATDVIPNYTFGCINKASGTS 202

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
             A+     G++GL R  +S ISQ  ++ +  FSYCL    PN + +  S  L+ G    
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYCL----PNSKSSNFSGSLRLGPKNQ 253

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             R   + T  + +P  ++ YY++L  I + N+ ++ P        +   G I DSG+V 
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T      Y  +  +F    +      L         CY        FPS+ F F   N+ 
Sbjct: 312 TRLVEPAYVAMRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364

Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N+ I     +   LA+A      + ++ +I S QQ++ R + D+    L   +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPTNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424

Query: 337 S 337
           +
Sbjct: 425 T 425


>gi|21594980|gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
           thaliana]
          Length = 485

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 152/361 (42%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP++ V ++LDTGS +++               IFDPRKS ++  I C  P C 
Sbjct: 144 TRLGVGTPARYVYMVLDTGSDIVWLQCAPCRRCYSQSDPIFDPRKSKTYATIPCSSPHCR 203

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D S T G  + ET++       +    G   GC +DN G
Sbjct: 204 RLDSAGCNTRRKTCLYQVSYGDGSFTVGDFSTETLTFR-----RNRVKGVALGCGHDNEG 258

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            +  +SF  Q G    ++FSYCLV    + + +S  + FG    
Sbjct: 259 LFVGAAGLLGL-----GKGKLSFPGQTGHRFNQKFSYCLVDRSASSKPSS--VVFGNAAV 311

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R    + T  +++P  + FYY+ L  IS+   R+       F +   G GG IIDSG+ 
Sbjct: 312 SR--IARFTPLLSNPKLDTFYYVGLLGISVGGTRVPGVTASLFKLDQIGNGGVIIDSGTS 369

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFED 276
           +T      Y  + + F     R     L   P       C+ L      + P++  +F  
Sbjct: 370 VTRLIRPAYIAMRDAF-----RVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPTVVLHFRR 424

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    N  F  A A     +++IG+ QQ+  R VYDL    + F    C
Sbjct: 425 ADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRVGFAPGGC 484

Query: 337 S 337
           +
Sbjct: 485 A 485


>gi|326530426|dbj|BAJ97639.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 479

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 97/361 (26%), Positives = 152/361 (42%), Gaps = 42/361 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P     L++D+GS +I+               +FDP  S+SF  + CD   C 
Sbjct: 135 VRVGVGSPPTEQYLVVDSGSDVIWIQCRPCAECYQQADPLFDPAASASFTAVPCDSGVCR 194

Query: 48  YFK-----CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C +   C Y + Y D S T+G  A ET++    G+   +  G   GC + N 
Sbjct: 195 TLPGGSSGCADSGACRYQVSYGDGSYTQGVLAMETLTF---GDSTPV-QGVAIGCGHRNR 250

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    A     AG+LGL    +S + QLG      FSYCL       +  +  L FG D 
Sbjct: 251 GLFVGA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLASR--GADAGAGSLVFGRDD 303

Query: 162 GYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                +       N    +FYY+ L  + +  ER+      FD+T  G GG ++D+G+ +
Sbjct: 304 AMPVGAVWVPLLRNAQQPSFYYVGLTGLGVGGERLPLQDGLFDLTEDGGGGVVMDTGTAV 363

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYF--E 275
           T    D Y  L + F S         L   P    +  CY L    + R P++A YF  +
Sbjct: 364 TRLPPDAYAALRDAFASTIG----GDLPRAPGVSLLDTCYDLSGYASVRVPTVALYFGRD 419

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L +   N  +++     + LA A     ++++G+ QQ+  +   D     + F    
Sbjct: 420 GAALTLPARN-LLVEMGGGVYCLAFAASASGLSILGNIQQQGIQITVDSANGYVGFGPST 478

Query: 336 C 336
           C
Sbjct: 479 C 479


>gi|326516344|dbj|BAJ92327.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 96/367 (26%), Positives = 150/367 (40%), Gaps = 58/367 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G       +I+DT S L +               +FDP  S S+  + C+   C   + 
Sbjct: 117 VGIGGGEATVIVDTASELTWVQCEPCDACHDQQEPLFDPSSSPSYAAVPCNSSSCDALRV 176

Query: 52  VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                          C YT+ Y D S ++G  AH+ +S+ G+        G +FGC   N
Sbjct: 177 ATGMSGQACDDQPAACSYTLSYRDGSYSRGVLAHDRLSLAGED-----IQGFVFGCGTSN 231

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            G F      G  +G++GL R  +S ISQ        FSYCL    P    +S  L  G 
Sbjct: 232 QGPF------GGTSGLMGLGRSQLSLISQTMDQFGGVFSYCLP---PKESGSSGSLVLGD 282

Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           D    R ST    T  ++ P    FY  +L  I++  E +  P      +  G G  I+D
Sbjct: 283 DASVYRNSTPIVYTAMVSDPLQGPFYLANLTGITVGGEDVQSP----GFSAGGGGKAIVD 338

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERF-QLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
           SG+++T     VY  +  +FVS    + Q A  S     +  C+ L      + PS+   
Sbjct: 339 SGTIITSLVPSVYAAVRAEFVSQLAEYPQAAPFS----ILDTCFDLTGLREVQVPSLKLV 394

Query: 274 FE-DANLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           F+  A + +D + V  +   D       LA    +    +IG+ QQ++ R ++D     +
Sbjct: 395 FDGGAEVEVDSKGVLYVVTGDASQVCLALASLKSEYDTPIIGNYQQKNLRVIFDTVGSQI 454

Query: 330 SFVKENC 336
            F +E C
Sbjct: 455 GFAQETC 461


>gi|297803034|ref|XP_002869401.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
 gi|297315237|gb|EFH45660.1| aspartyl protease family [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 162/369 (43%), Gaps = 55/369 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V + IG+P    LL +DT S L++               IFDP +S + +  +C     
Sbjct: 86  LVNISIGSPPVTQLLHMDTASDLLWLQCRPCINCYAQSLPIFDPSRSYTHRNESCRTSQY 145

Query: 47  TY----FKCVNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSNDN 100
           +     F      C Y+M+Y D + +KG  A E +  + I      A  H  +FGC +DN
Sbjct: 146 SMPSLRFNAKTRSCEYSMRYMDGTGSKGILAKEMLMFNTIYDESSSAALHDVVFGCGHDN 205

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           +G           G+LGL     S + + G+    +FSYC    L +  Y  + L  G D
Sbjct: 206 YG-----EPLVGTGILGLGYGEFSLVHRFGT----KFSYCFG-SLDDPSYPHNVLVLGDD 255

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSV 219
                  T   +  N    FYY++++ IS+D   +   P  F+    +G GG IID+G+ 
Sbjct: 256 GANILGDTTPLEIYN---GFYYVTIEAISVDGIILPIDPWVFNRNHQTGLGGTIIDTGNS 312

Query: 220 LTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFN----------RFP 268
           LT    + Y  L  K   YFE RF  A ++      Q   F  E +N           FP
Sbjct: 313 LTSLVEEAYKPLKNKIEDYFEGRFTAADVN------QDDMFKVECYNGNLERDLVESGFP 366

Query: 269 SMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
            + F+F D A L +D ++VF +    + F LAV P +  +  IG+  Q+     YDL   
Sbjct: 367 IVTFHFSDGAELSLDVKSVF-MKLSPNVFCLAVTPGN--MNSIGATAQQSYNIGYDLEAK 423

Query: 328 LLSFVKENC 336
            +SF + +C
Sbjct: 424 KISFERIDC 432


>gi|242062704|ref|XP_002452641.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
 gi|241932472|gb|EES05617.1| hypothetical protein SORBIDRAFT_04g029670 [Sorghum bicolor]
          Length = 466

 Score =  108 bits (271), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 152/361 (42%), Gaps = 51/361 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + IG+P+    + +DTGS + +              ++FDP  SS++   +C    C
Sbjct: 132 VITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSASSTYSPFSCSSAAC 191

Query: 47  TYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                      C + QC Y + Y D S T G  + +T+++     G     G  FGCS  
Sbjct: 192 VQLSQSQQGNGCSSSQCQYIVSYVDGSSTTGTYSSDTLTL-----GSNAIKGFQFGCSQS 246

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
             G   D  D    G++GL     S +SQ      K FSYCL  P P    +S +L  G 
Sbjct: 247 ESGGFSDQTD----GLMGLGGDAQSLVSQTAGTFGKAFSYCLP-PTPG---SSGFLTLGA 298

Query: 159 -TDMGY-RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            +  G+ + P  ++T+       +Y + L+ I +  +++N P   F        G ++DS
Sbjct: 299 ASRSGFVKTPMLRSTQI----PTYYGVLLEAIRVGGQQLNIPTSVF------SAGSVMDS 348

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
           G+V+T      Y  L   F +  +++  AQ S     +  C+ F  ++    PS+A  F 
Sbjct: 349 GTVITRLPPTAYSALSSAFKAGMKKYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFS 405

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
              +     N  +++ +N     A    D  +  IG+ QQR    +YD+    + F    
Sbjct: 406 GGAVVNLDFNGIMLELDNWCLAFAANSDDSSLGFIGNVQQRTFEVLYDVGGGAVGFRAGA 465

Query: 336 C 336
           C
Sbjct: 466 C 466


>gi|125561849|gb|EAZ07297.1| hypothetical protein OsI_29545 [Oryza sativa Indica Group]
          Length = 451

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 102/370 (27%), Positives = 160/370 (43%), Gaps = 57/370 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDC- 46
           IGTP +   LI+DTGS LI+                   ++DP +SS+F  + C    C 
Sbjct: 97  IGTPPQPRKLIVDTGSDLIWTQCKLSSSTAVAARHGSPPVYDPGESSTFAFLPCSDRLCQ 156

Query: 47  ----TYFKCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
               ++  C ++ +CVY   Y   +   G  A ET +    G  +A+     FGC   + 
Sbjct: 157 EGQFSFKNCTSKNRCVYEDVYGSAAAV-GVLASETFTF---GARRAVSLRLGFGCGALSA 212

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    A      G+LGLS  ++S I+QL     +RFSYCL    P  +  +S L FG   
Sbjct: 213 GSLIGA-----TGILGLSPESLSLITQLK---IQRFSYCLT---PFADKKTSPLLFGAMA 261

Query: 162 GYRRPST----QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
              R  T    Q T  +++P    +YY+ L  IS+ ++R+  P  +  +   G GG I+D
Sbjct: 262 DLSRHKTTRPIQTTAIVSNPVKTVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVD 321

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SGS + Y     +  + E   +  +  +L   +   E  +LC+ LP         A    
Sbjct: 322 SGSTVAYLVEAAFEAVKE---AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVP 378

Query: 276 DANLRIDGENVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNI 326
              L  DG    ++  +N+F         LAV    D   V++IG+ QQ++   ++D+  
Sbjct: 379 PLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQH 438

Query: 327 DLLSFVKENC 336
              SF    C
Sbjct: 439 HKFSFAPTQC 448


>gi|147821993|emb|CAN70318.1| hypothetical protein VITISV_016757 [Vitis vinifera]
          Length = 429

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 93/379 (24%), Positives = 162/379 (42%), Gaps = 62/379 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G+P + V ++LDTGS L          ++++FDP +SSS+  I C  P C     
Sbjct: 58  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTCRTRTR 117

Query: 49  -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  ++ C   + YAD S  +G  A +T  +     G +     +FGC +    
Sbjct: 118 DFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI-----GNSAIPATIFGCMDSGFS 172

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            + D  D    G++G++R ++SF++Q+G    ++FSYC+     +G+ +S  L FG    
Sbjct: 173 SNSD-EDSKTTGLIGMNRGSLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSF 223

Query: 163 YRRPSTQATKF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
               + + T    I+ P  +     Y + L+ I + N  +  P   +    +G G  ++D
Sbjct: 224 SWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 283

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFN 265
           SG+  T+    VY  L  +FV    R   A L    +P       + LCY +P    T  
Sbjct: 284 SGTQFTFLLGPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLP 339

Query: 266 RFPSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRD 317
             P++   F  A + +  E +      +I   +  +       + L     +IG   Q++
Sbjct: 340 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 399

Query: 318 TRFVYDLNIDLLSFVKENC 336
               +DL    + F +  C
Sbjct: 400 VWMEFDLAKSRVGFAEVRC 418


>gi|357137788|ref|XP_003570481.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 455

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 88/365 (24%), Positives = 157/365 (43%), Gaps = 55/365 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +FDP+ SSS+  ++C  P 
Sbjct: 118 VTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSSPQ 177

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C               +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 178 CDGLSTATLNPAVCSPSNVCIYQASYGDSSFSVGYLSKDTVSF-----GANSVPNFYYGC 232

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL         +S YL 
Sbjct: 233 GQDNEGLF-----GRSAGLMGLARNKLSLLYQLAPTLGYSFSYCL-----PSTSSSGYLS 282

Query: 157 FGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G+   Y       T  +++   ++ Y++SL  +++  + +      +    +     II
Sbjct: 283 IGS---YNPGGYSYTPMVSNTLDDSLYFISLSGMTVAGKPLAVSSSEYTSLPT-----II 334

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
           DSG+V+T   + VY  L +   +  +     + +     +  C+          P+++  
Sbjct: 335 DSGTVITRLPTSVYTALSKAVAAAMKGST--KRAAAYSILDTCFEGQASKLRAVPAVSMA 392

Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F   A L++   N  ++D +     LA AP     A+IG+ QQ+    VYD+  + + F 
Sbjct: 393 FSGGATLKLSAGN-LLVDVDGATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSNRIGFA 450

Query: 333 KENCS 337
              CS
Sbjct: 451 AAGCS 455


>gi|326506682|dbj|BAJ91382.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525815|dbj|BAJ88954.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 85/362 (23%), Positives = 159/362 (43%), Gaps = 51/362 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +FDP+ SSS+  ++C  P 
Sbjct: 138 VTRMGLGTPAKPYIMVVDTGSSLTWLQCSPCRVSCHRQSGPVFDPKTSSSYAAVSCSTPQ 197

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C               ++ C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 198 CNDLSTATLNPAACSSSDVCIYQASYGDSSFSVGYLSKDTVSF-----GSNSVPNFYYGC 252

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYC    LP+   +     
Sbjct: 253 GQDNEGLF-----GRSAGLMGLARNKLSLLYQLAPTLGYSFSYC----LPSSSSSGYLSI 303

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +   +  ++  ++ Y++ L  +++  + +      +    +     IIDS
Sbjct: 304 GSYNPGQYSYTPMVSSTLD--DSLYFIKLSGMTVAGKPLAVSSSEYSSLPT-----IIDS 356

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           G+V+T   + VY  L +      +  + A   D    +  C+    +  R P+++  F  
Sbjct: 357 GTVITRLPTTVYDALSKAVAGAMKGTKRA---DAYSILDTCFVGQASSLRVPAVSMAFSG 413

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L++  +N  ++D ++    LA AP     A+IG+ QQ+    VYD+  + + F    
Sbjct: 414 GAALKLSAQN-LLVDVDSSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSNRIGFAAGG 471

Query: 336 CS 337
           C+
Sbjct: 472 CT 473


>gi|225449446|ref|XP_002283126.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 436

 Score =  108 bits (270), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 93/379 (24%), Positives = 162/379 (42%), Gaps = 62/379 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G+P + V ++LDTGS L          ++++FDP +SSS+  I C  P C     
Sbjct: 65  VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTCRTRTR 124

Query: 49  -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  ++ C   + YAD S  +G  A +T  +     G +     +FGC +    
Sbjct: 125 DFSIPVSCDKKKLCHAIISYADASSIEGNLASDTFHI-----GNSAIPATIFGCMDSGFS 179

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            + D  D    G++G++R ++SF++Q+G    ++FSYC+     +G+ +S  L FG    
Sbjct: 180 SNSD-EDSKTTGLIGMNRGSLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSF 230

Query: 163 YRRPSTQATKF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
               + + T    I+ P  +     Y + L+ I + N  +  P   +    +G G  ++D
Sbjct: 231 SWLKALKYTPLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVD 290

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFN 265
           SG+  T+    VY  L  +FV    R   A L    +P       + LCY +P    T  
Sbjct: 291 SGTQFTFLLGPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLP 346

Query: 266 RFPSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRD 317
             P++   F  A + +  E +      +I   +  +       + L     +IG   Q++
Sbjct: 347 PLPTVTLMFRGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQN 406

Query: 318 TRFVYDLNIDLLSFVKENC 336
               +DL    + F +  C
Sbjct: 407 VWMEFDLAKSRVGFAEVRC 425


>gi|302763741|ref|XP_002965292.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
 gi|300167525|gb|EFJ34130.1| hypothetical protein SELMODRAFT_83230 [Selaginella moellendorffii]
          Length = 423

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 156/358 (43%), Gaps = 41/358 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           V L +GTP + V ++ DTGS +++               +F+P  SS+FQ I C    C 
Sbjct: 83  VSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQ 142

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C   QC+Y + Y D S T G  + ET+S      G    +    GC ++N G  
Sbjct: 143 QLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVNSVAIGCGHNNQGLF 197

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY-LKFGTDMGY 163
             A            +  +SF SQ+G +    FSYC    LP  E T S  L FG     
Sbjct: 198 TGAAGLLGL-----GKGLLSFPSQVGQLYGSVFSYC----LPTRESTGSVPLIFGNQA-- 246

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
              + Q T  + +P  + FYY+ +  I +    +N P  +  + + +G GG I+DSG+ +
Sbjct: 247 VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVNIPAGSLSLDSSTGNGGVILDSGTAV 306

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE-DAN 278
           T   +  Y  + + F +       A+++        CY L   +    P+++F F   A 
Sbjct: 307 TRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGAT 364

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  +N+ +    +  + LA AP+ +  ++IG+ QQ+  R  +D   + +      C
Sbjct: 365 MALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|357514995|ref|XP_003627786.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521808|gb|AET02262.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 436

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 160/357 (44%), Gaps = 40/357 (11%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           +GTP   +  I+DTGS +++               +F+P KSSS++ I C    C   + 
Sbjct: 93  VGTPPFKLYGIVDTGSDIVWLQCEPCQECYNQTTPMFNPSKSSSYKNIPCPSKLCQSMED 152

Query: 51  --CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C ++  C Y+  Y D S + G  + +T+++         F   + GC  +N      +
Sbjct: 153 TSCNDKNYCEYSTYYGDNSHSGGDLSVDTLTLESTNGLTVSFPNIVIGCGTNNI----LS 208

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN----GEYTSSYLKFGTDMGY 163
            +GA +G++G      SFI+QLGS    +FSYCL  PL +        +S L FG     
Sbjct: 209 YEGASSGIVGFGSGPASFITQLGSSTGGKFSYCLT-PLFSVTNIQSNATSKLNFGDAATV 267

Query: 164 RRPSTQATKFINH-PNNFYYLSLKDISIDNERMNFP--PDTFDITVSGEGGCIIDSGSVL 220
                  T  +   P  FYYL+L+  S+ N R+     P+        EG  IIDSG+ L
Sbjct: 268 SGDGVVTTPILKKDPETFYYLTLEAFSVGNRRVEIGGVPNG-----DNEGNIIIDSGTTL 322

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T    D Y  L    V   +  +L ++ D  + + LCY +      FP +  +F+ A++ 
Sbjct: 323 TSLTKDDYSFLESAVV---DLVKLERVDDPTQTLNLCYSVKAEGYDFPIITMHFKGADVD 379

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +   + F+   +  F L   +  D   A+ G+  Q++    YDL   ++SF   +C+
Sbjct: 380 LHPISTFVSVADGVFCLAFESSQDH--AIFGNLAQQNLMVGYDLQQKIVSFKPSDCT 434


>gi|15232503|ref|NP_191008.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|7288018|emb|CAB81805.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|17979257|gb|AAL49945.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|21700851|gb|AAM70549.1| AT3g54400/T12E18_90 [Arabidopsis thaliana]
 gi|332645705|gb|AEE79226.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 425

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +L+ LDT +   +             +FDP KSSS + + C+ P C  
Sbjct: 89  IVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148

Query: 49  FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                  V++ C + M Y   ++ + +   +T+++        +     FGC N   G  
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTLASD-----VIPNYTFGCINKASGTS 202

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
             A+     G++GL R  +S ISQ  ++ +  FSYC    LPN + +  S  L+ G    
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYC----LPNSKSSNFSGSLRLGPKNQ 253

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             R   + T  + +P  ++ YY++L  I + N+ ++ P        +   G I DSG+V 
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T      Y  +  +F    +      L         CY        FPS+ F F   N+ 
Sbjct: 312 TRLVEPAYVAVRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364

Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N+ I     +   LA+A      + ++ +I S QQ++ R + D+    L   +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424

Query: 337 S 337
           +
Sbjct: 425 T 425


>gi|21617933|gb|AAM66983.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
          Length = 425

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 156/361 (43%), Gaps = 48/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +L+ LDT +   +             +FDP KSSS + + C+ P C  
Sbjct: 89  IVRANIGTPAQPMLVALDTSNDAAWIPCSGCVGCSSSVLFDPSKSSSSRTLQCEAPQCKQ 148

Query: 49  FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                  V++ C + M Y   ++ + +   +T+++        +     FGC N   G  
Sbjct: 149 APNPSCTVSKSCGFNMTYGGSTI-EAYLTQDTLTLASD-----VIPNYTFGCINKASGTS 202

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
             A+     G++GL R  +S ISQ  ++ +  FSYC    LPN + +  S  L+ G    
Sbjct: 203 LPAQ-----GLMGLGRGPLSLISQSQNLYQSTFSYC----LPNSKSSNFSGSLRLGPKNQ 253

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             R   + T  + +P  ++ YY++L  I + N+ ++ P        +   G I DSG+V 
Sbjct: 254 PIR--IKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDPATGAGTIFDSGTVY 311

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T      Y  +  +F    +      L         CY        FPS+ F F   N+ 
Sbjct: 312 TRLVEPAYVAVRNEFRRRVKNANATSLGG----FDTCY---SGSVVFPSVTFMFAGMNVT 364

Query: 281 IDGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N+ I     +   LA+A      + ++ +I S QQ++ R + D+    L   +E C
Sbjct: 365 LPPDNLLIHSSAGNLSCLAMAAAPVNVNSVLNVIASMQQQNHRVLIDVPNSRLGISRETC 424

Query: 337 S 337
           +
Sbjct: 425 T 425


>gi|225427556|ref|XP_002266575.1| PREDICTED: probable aspartic protease At2g35615 [Vitis vinifera]
          Length = 445

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 105/363 (28%), Positives = 166/363 (45%), Gaps = 40/363 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + +GTP   +L I DTGS LI+               +FDP+KS +++ + C++  C
Sbjct: 95  LMNISLGTPPVSMLGIADTGSDLIWRQCLPCDDCYKQVEPLFDPKKSKTYKTLGCNNDFC 154

Query: 47  TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEG-KAIFHGALFGCSNDN 100
                   C ++  C  +  Y DQS T+   + ET + IG  EG  A F G  FGC + N
Sbjct: 155 QDLGQQGSCGDDNTCTSSYSYGDQSYTRRDLSSETFT-IGSTEGDPASFPGLAFGCGHSN 213

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            G F+E        G   LS V      QL S +  +FSYCLV PL +    SS + FG 
Sbjct: 214 GGTFNEKDSGLIGLGGGPLSLVM-----QLSSKVGGQFSYCLV-PLSSDSTASSKINFGK 267

Query: 160 DMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERM---NFPPDTFDITVSGEGGCIID 215
                   T +T  I   P+ FYYL+L+ +S+ +E++    F  +      + E   IID
Sbjct: 268 SAVVSGSGTVSTPLIKGTPDTFYYLTLEGMSLGSEKVAFKGFSKNKSSPAAAEESNIIID 327

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYF 274
           SG+ LT    D Y  +     S   +    Q +  P     LCY   +     P++  +F
Sbjct: 328 SGTTLTLLPRDFYTDME----SALTKVIGGQTTTDPRGTFSLCYSGVKKL-EIPTITAHF 382

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A++++   N F+   E+     ++ P  +L A+ G+  Q +    YDL  + +SF   
Sbjct: 383 IGADVQLPPLNTFVQAQED-LVCFSMIPSSNL-AIFGNLSQMNFLVGYDLKNNKVSFKPT 440

Query: 335 NCS 337
           +C+
Sbjct: 441 DCT 443


>gi|357481191|ref|XP_003610881.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512216|gb|AES93839.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 427

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 100/359 (27%), Positives = 168/359 (46%), Gaps = 36/359 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +G+P   +  ++DTGS L++A              +F+P +S ++  I C+   C
Sbjct: 83  LMKLTLGSPPVDIYGLVDTGSDLVWAQCTPCGGCYRQKSPMFEPLRSKTYSPIPCESEQC 142

Query: 47  TYF--KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
           ++F   C  ++ C Y+  YAD SVTKG  A E I+         +    +FGC + N G 
Sbjct: 143 SFFGYSCSPQKMCAYSYSYADSSVTKGVLAREAITFSSTDGDPVVVGDIIFGCGHSNSGT 202

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           F+E+       G   L     S +SQ+G++   KRFS CLV P     +TS  + FG + 
Sbjct: 203 FNENDMGIIGMGGGPL-----SLVSQIGTLYGSKRFSQCLV-PFHTDAHTSGTINFGEES 256

Query: 162 GYRRPSTQATKFINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                    T   +      YL +L+ IS+ +  + F       T+S +G  +IDSG+  
Sbjct: 257 DVSGEGVVTTPLASEEGQTSYLVTLEGISVGDTFVRF---NSSETLS-KGNIMIDSGTPA 312

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANL 279
           TY   + Y +L E+      +  L  + D P+   QLCY   ET    P +  +FE A++
Sbjct: 313 TYIPQEFYERLVEELKV---QSSLLPIEDDPDLGTQLCY-RSETNLEGPILTAHFEGADV 368

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           ++     FI   ++  F  A+A   D   + G+  Q +    +DL+   +SF   +C++
Sbjct: 369 QLLPIQTFIPP-KDGVFCFAMAGSTDGDYIFGNFAQSNILMGFDLDRKTISFKPTDCTN 426


>gi|293335955|ref|NP_001168399.1| uncharacterized protein LOC100382168 precursor [Zea mays]
 gi|223948009|gb|ACN28088.1| unknown [Zea mays]
 gi|413922066|gb|AFW61998.1| hypothetical protein ZEAMMB73_694403 [Zea mays]
          Length = 507

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 157/370 (42%), Gaps = 60/370 (16%)

Query: 7   GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
           G+P+  + +I+DTGS L +               +FDP  S+++  + C+   C      
Sbjct: 155 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 214

Query: 47  ---TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
              T   C +     E+C Y + Y D S ++G  A +T+++     G A   G +FGC  
Sbjct: 215 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL-----GGASLGGFVFGCGL 269

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N G       G  AG++GL R  +S +SQ  S     FSYCL          S  L  G
Sbjct: 270 SNRGLF-----GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGG 324

Query: 159 TDMG--YRRPSTQA-TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            D    YR  +  A T+ I  P    FY+L++   ++    +            G    +
Sbjct: 325 DDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTALA-------AQGLGASNVL 377

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSM 270
           IDSG+V+T     VY  +  +F+    +F  A     P    +  CY L      + P +
Sbjct: 378 IDSGTVITRLAPSVYRAVRAEFM---RQFGAAGYPAAPGFSILDTCYDLTGHDEVKVPLL 434

Query: 271 AFYFE-DANLRIDGENV-FIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNI 326
               E  A++ +D   + F++  +     LA+A   ++D   +IG+ QQ++ R VYD   
Sbjct: 435 TLRLEGGADVTVDAAGMLFVVRKDGSQVCLAMASLSYEDETPIIGNYQQKNKRVVYDTLG 494

Query: 327 DLLSFVKENC 336
             L F  E+C
Sbjct: 495 SRLGFADEDC 504


>gi|145693994|gb|ABP93697.1| unknown protein isoform 2 [Lemna minor]
          Length = 351

 Score =  108 bits (269), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 148/358 (41%), Gaps = 46/358 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           ++ +  GTP++   ++ DTGS + +                +FDP  SS+++ ++C  P 
Sbjct: 17  VITVGFGTPTRTQTVVFDTGSDVNWLQCKPCAVRCYAQQEPLFDPSLSSTYRNVSCTEPA 76

Query: 46  CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C       C +  C+Y + Y D S T GF A +T  +    +    F   +FGC  +N G
Sbjct: 77  CVGLSTRGCSSSTCLYGVFYGDGSSTIGFLAMDTFMLTPAQK----FKNFIFGCGQNNTG 132

Query: 103 FDEDARDGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
             +       AG++GL R  T S  SQ+   +   FSYC    LP+    + YL  G   
Sbjct: 133 LFQGT-----AGLVGLGRSSTYSLNSQVAPSLGNVFSYC----LPSTSSATGYLNIGNPQ 183

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               P   A          Y++ L  IS+   R++     F        G IIDSG+V+T
Sbjct: 184 --NTPGYTAMLTDTRVPTLYFIDLIGISVGGTRLSLSSTVFQ-----SVGTIIDSGTVIT 236

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLR 280
                 Y  L     +   ++ LA        +  CY F   T   +P +  +F   ++R
Sbjct: 237 RLPPTAYSALKTAVRAAMTQYTLAPAVTI---LDTCYDFSRTTSVVYPVIVLHFAGLDVR 293

Query: 281 IDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           I    VF + + +    LA A + D  ++ +IG+ QQ      YD  +  + F    C
Sbjct: 294 IPATGVFFV-FNSSQVCLAFAGNTDSTMIGIIGNVQQLTMEVTYDNELKRIGFSAGAC 350


>gi|312283333|dbj|BAJ34532.1| unnamed protein product [Thellungiella halophila]
          Length = 428

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 152/361 (42%), Gaps = 48/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +L+ LDT +   +             +FDP KSSS + + CD P C  
Sbjct: 92  IVRANIGTPAQPMLVALDTSNDAAWVPCSGCVGCASSVLFDPSKSSSSRNLQCDAPQCKQ 151

Query: 49  FK----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                    + C + M Y   ++       +T+++        +     FGC +   G  
Sbjct: 152 APNPTCTAGKSCGFNMTYGGSTIEASLT-QDTLTL-----ANDVIKSYTFGCISKATGTS 205

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGTDMG 162
             A+     G++GL R  +S ISQ  ++    FSYCL    PN + +  S  L+ G    
Sbjct: 206 LPAQ-----GLMGLGRGPLSLISQTQNLYMSTFSYCL----PNSKSSNFSGSLRLGPK-- 254

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           Y+    + T  + +P  ++ YY++L  I + N+ ++ P        S   G I DSG+V 
Sbjct: 255 YQPVRIKTTPLLKNPRRSSLYYVNLVGIRVGNKIVDIPTSALAFDASTGAGTIFDSGTVF 314

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T      Y  +  +F    +      L         CY        +PS+ F F   N+ 
Sbjct: 315 TRLVEPAYVAVRNEFRRRIKNANATSLGG----FDTCY---SGSVVYPSVTFMFAGMNVT 367

Query: 281 IDGENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N+ I           +A AP+  + ++ +I S QQ++ R + DL    L   +E C
Sbjct: 368 LPPDNLLIHSSSGSTSCLAMAAAPNNVNSVLNVIASMQQQNHRVLIDLPNSRLGISRETC 427

Query: 337 S 337
           +
Sbjct: 428 T 428


>gi|242081367|ref|XP_002445452.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
 gi|241941802|gb|EES14947.1| hypothetical protein SORBIDRAFT_07g019450 [Sorghum bicolor]
          Length = 459

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 162/363 (44%), Gaps = 51/363 (14%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC----- 46
           +GTP +   +ILD GS L++               +FD  +SSSF  + CD   C     
Sbjct: 113 VGTPPQPSKVILDLGSDLLWTQCSLVGPTAKQLEPVFDAARSSSFSVLPCDSKLCEAGTF 172

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
           T   C + +C Y   Y   + T G  A ET +    G    +     FGC    +G   +
Sbjct: 173 TNKTCTDRKCAYENDYGIMTAT-GVLATETFTF---GAHHGVSANLTFGCGKLANGTIAE 228

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT--DMGYR 164
           A     +G+LGLS   +S + QL      +FSYCL    P  +  +S + FG   D+G  
Sbjct: 229 A-----SGILGLSPGPLSMLKQLAIT---KFSYCLT---PFADRKTSPVMFGAMADLGKY 277

Query: 165 RPS--TQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           + +   Q    + +P  + +YY+ +  +S+ ++R++ P +T  I   G GG ++DS + L
Sbjct: 278 KTTGKVQTIPLLKNPVEDIYYYVPMVGMSVGSKRLDVPQETLAIKPDGTGGTVLDSATTL 337

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYFE- 275
            Y     + +L +   +  E  +L   +   +   +C+ LP   +    + P +  +F+ 
Sbjct: 338 AYLVEPAFTELKK---AVMEGIKLPVANRSVDDYPVCFELPRGMSMEGVQVPPLVLHFDG 394

Query: 276 DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           DA + +  +N F  +       LAV  AP +    +IG+ QQ++   +YD+     S+  
Sbjct: 395 DAEMSLPRDNYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHVLYDVGNRKFSYAP 453

Query: 334 ENC 336
             C
Sbjct: 454 TKC 456


>gi|125542690|gb|EAY88829.1| hypothetical protein OsI_10302 [Oryza sativa Indica Group]
          Length = 440

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 164/375 (43%), Gaps = 56/375 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  +G+PS+ +LL LDT +   +A            +F P  SSS+  + C    C  
Sbjct: 82  VVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPL 141

Query: 49  FK---CVNEQ--------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
           F+   C   Q              C ++  +AD S     A+ +T+ +     GK     
Sbjct: 142 FQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL-----GKDAIPN 195

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGC +   G   +       G+LGL R  ++ +SQ GS+    FSYCL  P     Y 
Sbjct: 196 YTFGCVSSVTGPTTNMPR---QGLLGLGRGPMALLSQAGSLYNGVFSYCL--PSYRSYYF 250

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           S  L+ G   G  R S + T  + +P  ++ YY+++  +S+    +  P  +F    +  
Sbjct: 251 SGSLRLGAGGGQPR-SVRYTPMLRNPHRSSLYYVNVTGLSVGRAWVKVPAGSFAFDAATG 309

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNR 266
            G ++DSG+V+T + + VY  L E+F     R Q+A  S          C+   E     
Sbjct: 310 AGTVVDSGTVITRWTAPVYAALREEF-----RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 364

Query: 267 FPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFV 321
            P++  + +   +L +  EN  I         LA+  AP   + +V +I + QQ++ R V
Sbjct: 365 APAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVV 424

Query: 322 YDLNIDLLSFVKENC 336
           +D+    + F KE+C
Sbjct: 425 FDVANSRIGFAKESC 439


>gi|242056193|ref|XP_002457242.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
 gi|241929217|gb|EES02362.1| hypothetical protein SORBIDRAFT_03g003930 [Sorghum bicolor]
          Length = 457

 Score =  107 bits (268), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 101/373 (27%), Positives = 165/373 (44%), Gaps = 63/373 (16%)

Query: 6   IGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDHPDCT 47
           +GTP   +L I DTGS L++                   +F P +SS++ +++C    C 
Sbjct: 109 VGTPPTQLLAIADTGSDLVWVNCSSSGGGLADADAGGNVVFQPTRSSTYSQLSCQSNACQ 168

Query: 48  YF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVI-GKGEGKAIFHGALFGCSNDNHG 102
                 C  + +C Y   Y D S T G  + ET S + G G+G+       FGCS  + G
Sbjct: 169 ALSQASCDADSECQYQYSYGDGSRTIGVLSTETFSFVDGGGKGQVRVPRVNFGCSTASAG 228

Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            F  D       G++GL     S +SQLG+   I ++ SYCL IP  +   +SS L FG+
Sbjct: 229 TFRSD-------GLVGLGAGAFSLVSQLGATTHIDRKLSYCL-IPSYDAN-SSSTLNFGS 279

Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 P   +T  + +  +++Y ++L+ +++  + +     T D  +      I+DSG+
Sbjct: 280 RAVVSEPGAASTPLVPSDVDSYYTVALESVAVGGQEVA----THDSRI------IVDSGT 329

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFLPETFNRFPSMAFYFEDA 277
            LT+    +   L    V+  ER    Q    PE  +QLCY   +   +  +  F   D 
Sbjct: 330 TLTFLDPALLGPL----VTELERRIKLQRVQPPEQLLQLCY---DVQGKSETDNFGIPDV 382

Query: 278 NLRIDGENVFIIDYENHFFL-------LAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
            LR  G     +  EN F L       L + P      V+++G+  Q++    YDL+   
Sbjct: 383 TLRFGGGAAVTLRPENTFSLLQEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDART 442

Query: 329 LSFVKENCSDDSA 341
           ++F   +C+  SA
Sbjct: 443 VTFAAADCARSSA 455


>gi|115451209|ref|NP_001049205.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|49532749|dbj|BAD26705.1| Radc1 [Oryza sativa Japonica Group]
 gi|108706569|gb|ABF94364.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113547676|dbj|BAF11119.1| Os03g0186900 [Oryza sativa Japonica Group]
 gi|215692805|dbj|BAG88249.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767626|dbj|BAG99854.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 438

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 165/375 (44%), Gaps = 56/375 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  +G+PS+ +LL LDT +   +A            +F P  SSS+  + C    C  
Sbjct: 80  VVRAGLGSPSQQLLLALDTSADATWAHCSPCGTCPSSSLFAPANSSSYASLPCSSSWCPL 139

Query: 49  FK---CVNEQ--------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
           F+   C   Q              C ++  +AD S     A+ +T+ +     GK     
Sbjct: 140 FQGQACPAPQGGGDAAPPPATLPTCAFSKPFADASFQAALAS-DTLRL-----GKDAIPN 193

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGC +   G   +       G+LGL R  ++ +SQ GS+    FSYCL  P     Y 
Sbjct: 194 YTFGCVSSVTGPTTNMPR---QGLLGLGRGPMALLSQAGSLYNGVFSYCL--PSYRSYYF 248

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           S  L+ G   G  R S + T  + +P  ++ YY+++  +S+ +  +  P  +F    +  
Sbjct: 249 SGSLRLGAGGGQPR-SVRYTPMLRNPHRSSLYYVNVTGLSVGHAWVKVPAGSFAFDAATG 307

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNR 266
            G ++DSG+V+T + + VY  L E+F     R Q+A  S          C+   E     
Sbjct: 308 AGTVVDSGTVITRWTAPVYAALREEF-----RRQVAAPSGYTSLGAFDTCFNTDEVAAGG 362

Query: 267 FPSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFV 321
            P++  + +   +L +  EN  I         LA+  AP   + +V +I + QQ++ R V
Sbjct: 363 APAVTVHMDGGVDLALPMENTLIHSSATPLACLAMAEAPQNVNSVVNVIANLQQQNIRVV 422

Query: 322 YDLNIDLLSFVKENC 336
           +D+    + F KE+C
Sbjct: 423 FDVANSRVGFAKESC 437


>gi|255543383|ref|XP_002512754.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547765|gb|EEF49257.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 414

 Score =  107 bits (268), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 152/360 (42%), Gaps = 56/360 (15%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN--- 53
           + + +I+DTGS L +               +F+P  S S+Q I C+   C   +      
Sbjct: 76  RNMTVIVDTGSDLTWVQCQPCRLCYNQQDPLFNPSGSPSYQTILCNSSTCQSLQYATGNL 135

Query: 54  -------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y D S T+G    E +++     G       +FGC  +N G    
Sbjct: 136 GVCGSNTPTCNYVVNYGDGSYTRGDLGMEQLNL-----GTTHVSNFIFGCGRNNKGLF-- 188

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              G  +G++GL +  +S +SQ  +I +  FSYCL  P    + + S +  G    Y+  
Sbjct: 189 ---GGASGLMGLGKSDLSLVSQTSAIFEGVFSYCL--PTTAADASGSLILGGNSSVYKNT 243

Query: 167 STQA-TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           +  + T+ I +P    FY+L+L  ISI    +  P          + G +IDSG+V+T  
Sbjct: 244 TPISYTRMIANPQLPTFYFLNLTGISIGGVALQAPNYR-------QSGILIDSGTVITRL 296

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFE-DANLR 280
              VY  L  +F+  F  F  A     P  I    F    ++    P++   FE +A L 
Sbjct: 297 PPPVYRDLKAEFLKQFSGFPSAP----PFSILDTCFNLNGYDEVDIPTIRMQFEGNAELT 352

Query: 281 IDGENVFI---IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +D   +F     D       LA    DD + +IG+ QQR+ R +Y+     L F  E CS
Sbjct: 353 VDVTGIFYFVKTDASQVCLALASLSFDDEIPIIGNYQQRNQRVIYNTKESKLGFAAEACS 412


>gi|326518194|dbj|BAK07349.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 435

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 97/336 (28%), Positives = 147/336 (43%), Gaps = 34/336 (10%)

Query: 26  AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGK 82
           A+F    S  ++      P CT  Y   V  +C  YT  +       G+   +     G 
Sbjct: 109 AVFKSAVSPRYKDTKATDPKCTPPYTPSVGNRCSFYTTSW--NVAAHGYLGSDMFGFAGS 166

Query: 83  -GEGKAIFHGA-----LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIK 134
            G G    HG       FGC++   GF E    G LAG L LSR   SF+SQL +  +  
Sbjct: 167 PGTGG---HGTDVDKLTFGCAHTTDGF-ERLNHGVLAGALSLSRHPTSFLSQLTARRLAD 222

Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFI---NHPNNFYYLSLKDISID 191
            RFSYCL     +      +L+FG D+  R     +T  +       + YY+ +  IS++
Sbjct: 223 SRFSYCLFPGQSHPNARHGFLRFGRDI-PRHDHAHSTSLLFTGRGSGSMYYIGVTSISLN 281

Query: 192 NERM-NFPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
            +R+    P  F     +  GG ++D G+ LT    + Y  +  + V+Y    Q      
Sbjct: 282 GKRIIGLQPAFFRRNPQTRRGGSVVDPGTPLTRLVREAYNIVEAELVAYM---QTQGSRR 338

Query: 250 CPEPIQ---LCYFLPETFNRFPSMAFYFED--ANLRIDGENVFIIDYENHFFLLAVAPHD 304
            P P+Q   LC F+       PSM     +  A L I  E +F+     H   L V   D
Sbjct: 339 APAPVQGHRLC-FVSWGHAHLPSMTINMNEDRAKLFIKPELLFLKVTHEHLCFLVVP--D 395

Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
           + + ++G+ QQ DTRF +DL+ + L F +E+C+ D+
Sbjct: 396 EEMTVLGAAQQVDTRFTFDLHANRLYFAQEHCTADT 431


>gi|224092218|ref|XP_002309514.1| predicted protein [Populus trichocarpa]
 gi|222855490|gb|EEE93037.1| predicted protein [Populus trichocarpa]
          Length = 474

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 101/363 (27%), Positives = 156/363 (42%), Gaps = 50/363 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP +   L+ DTGS + +                 FDP KS+S+  ++C    
Sbjct: 136 VVTVGLGTPKEDFTLVFDTGSGITWTQCQPCLGSCYPQKEQKFDPTKSTSYNNVSCSSAS 195

Query: 46  CTYF-------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           C             N  C+Y + Y DQS ++GF A ET+++        +F   LFGC  
Sbjct: 196 CNLLPTSERGCSASNSTCLYQIIYGDQSYSQGFFATETLTI----SSSDVFTNFLFGCGQ 251

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N+G       G  AG+LGLS  ++S  SQ     +K+FSYC    LP+   ++ YL FG
Sbjct: 252 SNNGLF-----GQAAGLLGLSSSSVSLPSQTAEKYQKQFSYC----LPSTPSSTGYLNFG 302

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
              G    +   T      ++FY + +  IS+   ++   P  F  +     G IIDSG+
Sbjct: 303 ---GKVSQTAGFTPISPAFSSFYGIDIVGISVAGSQLPIDPSIFTTS-----GAIIDSGT 354

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDA 277
           V+T      Y  L E F    E+      ++  E +  CY F   T   FP ++  F+  
Sbjct: 355 VITRLPPTAYKALKEAFD---EKMSNYPKTNGDELLDTCYDFSNYTTVSFPKVSVSFKGG 411

Query: 278 -NLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             + ID   +  +        LA A +  D    + G+ QQ+    VYD    ++ F   
Sbjct: 412 VEVDIDASGILYLVNGVKMVCLAFAANKDDSEFGIFGNHQQKTYEVVYDGAKGMIGFAAG 471

Query: 335 NCS 337
            CS
Sbjct: 472 ACS 474


>gi|359473000|ref|XP_002278677.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 99/382 (25%), Positives = 159/382 (41%), Gaps = 62/382 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
           V L +GTP + +LL+ DTGS L++               + F  R S++F   +C     
Sbjct: 91  VDLRLGTPPQKLLLVADTGSDLVWVKCSACRNCTRHTPGSAFLARHSTTFSPNHCYDSAC 150

Query: 42  ------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                  H  C + + ++  C Y   Y D S T GF + ET ++      +A   G  FG
Sbjct: 151 QLVPLPKHHRCNHAR-LHSPCRYEYSYGDGSKTSGFFSKETTTLNTSSGREAKLKGIAFG 209

Query: 96  CSNDNHGFD-EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL----VIPLPNGEY 150
           C+    G     A      GV+GL R  IS  SQLG     +FSYCL    + P P    
Sbjct: 210 CAFRISGPSVSGASFNGAHGVMGLGRGPISLSSQLGHRFGNKFSYCLMDHDISPSP---- 265

Query: 151 TSSYLKFGTDMGYRRPSTQATKFIN-HPN----NFYYLSLKDISIDNERMNFPPDTFDIT 205
            +SYL  G+      P  +  +F   H N     FYY+ ++ +S+D  ++   P  + + 
Sbjct: 266 -TSYLLIGSTQNDVAPGKRRMRFTPLHINPLSPTFYYIGIESVSVDGIKLPINPSVWALD 324

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
             G GG I+DSG+ LT+     Y ++         R +L   ++      LC  + E  +
Sbjct: 325 ELGNGGTIVDSGTTLTFLPEPAYLQI---LTVIKRRVRLPSPAEPTPGFDLCVNVSEIEH 381

Query: 266 -RFPSMAFYFEDANLRIDGENVFIIDYENHF---------FLLAVAPHDDLVALIGSQQQ 315
            R P ++F       ++ G++VF     N+F           L         ++IG+  Q
Sbjct: 382 PRLPKLSF-------KLGGDSVFSPPPRNYFVDTDEDVKCLALQAVMTPSGFSVIGNLMQ 434

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
           +     +D +   L F +  C+
Sbjct: 435 QGFLLEFDKDRTRLGFSRHGCA 456


>gi|148907930|gb|ABR17085.1| unknown [Picea sitchensis]
          Length = 498

 Score =  107 bits (267), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 101/356 (28%), Positives = 155/356 (43%), Gaps = 37/356 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP++   ++LDTGS + +               IF+P  S+SF  + CD   C+
Sbjct: 159 TRIGVGTPTREQYMVLDTGSDVAWIQCEPCRECYSQADPIFNPSYSASFSTVGCDSAVCS 218

Query: 48  Y---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
               + C +  C+Y   Y D S + G  A ET++      G         GC + N G  
Sbjct: 219 QLDAYDCHSGGCLYEASYGDGSYSTGSFATETLTF-----GTTSVANVAIGCGHKNVGLF 273

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A               +SF +Q+G+     FSYCLV        +S  L+FG      
Sbjct: 274 IGAAGLLGL-----GAGALSFPNQIGTQTGHTFSYCLV---DRESDSSGPLQFGPKSVPV 325

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMN-FPPDTFDI-TVSGEGGCIIDSGSVLTY 222
                  +   H   FYYLS+  IS+    ++  PP+ F I   SG GG IIDSG+V+T 
Sbjct: 326 GSIFTPLEKNPHLPTFYYLSVTAISVGGALLDSIPPEVFRIDETSGHGGFIIDSGTVVTR 385

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFED-ANLR 280
             +  Y  + + FV+     QL + +D       CY L    F   P++ F+F + A+L 
Sbjct: 386 LVTSAYDAVRDAFVA--GTGQLPR-TDAVSIFDTCYDLSGLQFVSVPTVGFHFSNGASLI 442

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N  I       F  A AP    V+++G+ QQ+  R  +D    L+ F  + C
Sbjct: 443 LPAKNYLIPMDTVGTFCFAFAPAASSVSIMGNTQQQHIRVSFDSANSLVGFAFDQC 498


>gi|242092368|ref|XP_002436674.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
 gi|241914897|gb|EER88041.1| hypothetical protein SORBIDRAFT_10g006870 [Sorghum bicolor]
          Length = 461

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 103/383 (26%), Positives = 151/383 (39%), Gaps = 68/383 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L +GTP + V L LDTGS L++               + DP  SS++  + C  P C
Sbjct: 93  LVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFHQGLPLLDPAASSTYAALPCGAPRC 152

Query: 47  -------------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK---GEGKAIFH 90
                        + +   N  C Y   Y D+SVT G  A +  +  G    G+ +    
Sbjct: 153 RALPFTSCGGGGRSSWGNGNRSCAYIYHYGDKSVTVGEIATDRFTFGGDNGDGDSRLPTR 212

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
              FGC + N G  +        G+ G  R   S  SQL       FSYC        E 
Sbjct: 213 RLTFGCGHFNKGVFQSNE----TGIAGFGRGRWSLPSQLNVTT---FSYCFTSMF---ES 262

Query: 151 TSSYLKFG---------TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPP 199
            SS +  G         +   +     + T  + +P+  + Y+LSLK IS+   R+  P 
Sbjct: 263 KSSLVTLGGAPAAALLYSHAAHISGEVRTTPLLKNPSQPSLYFLSLKGISVGKTRLAVPE 322

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
                T       IIDSG+ +T     VY  +  +F +         +      + LC+ 
Sbjct: 323 AKLRST-------IIDSGASITTLPEAVYEAVKAEFAAQVGLPPTGVVEG--SALDLCFA 373

Query: 260 LPET--FNR--FPSMAFYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLVALIGSQ 313
           LP T  + R   PS+  + + A+  +   N    D        +L  AP D  V  IG+ 
Sbjct: 374 LPVTALWRRPPVPSLTLHLDGADWELPRGNYVFEDLAARVMCVVLDAAPGDQTV--IGNF 431

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
           QQ++T  VYDL  D LSF    C
Sbjct: 432 QQQNTHVVYDLENDWLSFAPARC 454


>gi|449440933|ref|XP_004138238.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 107/364 (29%), Positives = 162/364 (44%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           + ++ +G P K   L+ DTGS + +                  IFDP+ SSS+  ++C+ 
Sbjct: 149 LAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNS 208

Query: 44  PDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
             C       C ++ C+Y + Y D S T G  A ET+S    G   +I +  + GC +DN
Sbjct: 209 QQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSF---GNSNSIPNLPI-GCGHDN 264

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGT 159
            G                    IS  SQL +     FSYCLV    N +  +SS L+F +
Sbjct: 265 EGLFAGGAGLIGL-----GGGAISLSSQLKA---SSFSYCLV----NLDSDSSSTLEFNS 312

Query: 160 DMGYRRPSTQATKFINHPNNFY---YLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           +M    PS   T  +   + F+   Y+ +  IS+  + +   P  F+I  SG GG I+DS
Sbjct: 313 NM----PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
           G++++   SDVY  L E FV        + LS  P       CY F  ++    P++AF 
Sbjct: 369 GTIISRLPSDVYESLREAFVKL-----TSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423

Query: 274 F-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
             E  +LR+   N  I+      + LA       +++IGS QQ+  R  YDL   L+ F 
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSLVGFS 483

Query: 333 KENC 336
              C
Sbjct: 484 TNKC 487


>gi|357457681|ref|XP_003599121.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355488169|gb|AES69372.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 439

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 162/360 (45%), Gaps = 46/360 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY--- 48
           IGTP   +  ++DTGS  I+               IF+P KSS+++ I C  P C     
Sbjct: 96  IGTPPFQLYGVVDTGSDGIWFQCKPCKPCLNQTSPIFNPSKSSTYKNIRCSSPICKRGEK 155

Query: 49  FKCVN---EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +C +    +C Y + Y D+S ++G  + +T+++         F   + GC + N    E
Sbjct: 156 TRCSSNRKRKCEYEITYLDRSGSQGDISKDTLTLNSNDGSPISFPKIVIGCGHKNSLTTE 215

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--- 162
               G  +G++G  R   S +SQLGS I  +FSYCL   L +    SS L FG DM    
Sbjct: 216 ----GLASGIIGFGRGNFSIVSQLGSSIGGKFSYCLA-SLFSKANISSKLYFG-DMAVVS 269

Query: 163 ----YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                  P  Q+    N+  N    S+ D  I  +  +  PD        EG  +IDSGS
Sbjct: 270 GHGVVSTPLIQSFYVGNYFTNLEAFSVGDHIIKLKDSSLIPDN-------EGNAVIDSGS 322

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            +T   +DVY +L    +S     +L ++ D  + + LCY         P +  +F  A+
Sbjct: 323 TITQLPNDVYSQLETAVISM---VKLKRVKDPTQQLSLCYKTTLKKYEVPIITAHFRGAD 379

Query: 279 LRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++++  N FI +++E   F    +    +V   G+  Q++    YD   +++SF   NC+
Sbjct: 380 VKLNAFNTFIQMNHEVMCFAFNSSAFPWVV--YGNIAQQNFLVGYDTLKNIISFKPTNCT 437


>gi|326532056|dbj|BAK01404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 506

 Score =  107 bits (266), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 112/380 (29%), Positives = 155/380 (40%), Gaps = 66/380 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V +++GTP +   +I+DTGS L +               IFDP  S S++ + C    C
Sbjct: 150 LVDVYLGTPPRRFRMIMDTGSDLNWLQCAPCLDCFEQSGPIFDPAASISYRNVTCGDDRC 209

Query: 47  TYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
                  E             C Y   Y DQS T G  A E  +V     G     G  F
Sbjct: 210 RLVSPPAESAPRECRRPRSDPCPYYYWYGDQSNTTGDLALEAFTVNLTQSGTRRVDGVAF 269

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYTSS 153
           GC + N G          AG+LGL R  +SF SQL  +     FSYCLV    +G    S
Sbjct: 270 GCGHRNRGLFH-----GAAGLLGLGRGPLSFASQLRGVYGGHAFSYCLV---EHGSAAGS 321

Query: 154 YLKFGTDMG-YRRPSTQATKF--INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            + FG D      P    T F      + FYYL LK I +  E +N   DT        G
Sbjct: 322 KIIFGHDDALLAHPQLNYTAFAPTTDADTFYYLQLKSILVGGEAVNISSDTLS-----AG 376

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFV-----SYFERFQLAQLSDC-----PEPIQLCYFL 260
           G IIDSG+ L+YF    Y  + + F+     SY        LS C      E +++    
Sbjct: 377 GTIIDSGTTLSYFPEPAYQAIRQAFIDRMSPSYPLILGFPVLSPCYNVSGAEKVEV---- 432

Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
                  P ++  F D A      EN FI ++ E    L  +      +++IG+ QQ++ 
Sbjct: 433 -------PELSLVFADGAAWEFPAENYFIRLEPEGIMCLAVLGTPRSGMSIIGNYQQQNF 485

Query: 319 RFVYDLNIDLLSFVKENCSD 338
             +YDL  + L F    C+D
Sbjct: 486 HVLYDLEHNRLGFAPRRCAD 505


>gi|242079449|ref|XP_002444493.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
 gi|241940843|gb|EES13988.1| hypothetical protein SORBIDRAFT_07g022790 [Sorghum bicolor]
          Length = 449

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 106/382 (27%), Positives = 161/382 (42%), Gaps = 72/382 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
           IGTP +   LI+DTGS LI+                      +++PR+SSSF  + C   
Sbjct: 90  IGTPPQPRTLIVDTGSDLIWTQCSMLSRRTRTAASASRQREPLYEPRRSSSFAYLPCSDR 149

Query: 45  DC-----TYFKCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
            C     +Y  C  N +C+Y   Y       G  A ET +    G    +     FGC  
Sbjct: 150 LCQEGQFSYKNCARNNRCMYDELYGSAEA-GGVLASETFTF---GVNAKVSLPLGFGCGA 205

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            + G    A     +G++GLS   +S +SQL      RFSYCL    P  E  +S L FG
Sbjct: 206 LSAGDLVGA-----SGLMGLSPGIMSLVSQLS---VPRFSYCLT---PFAERKTSPLLFG 254

Query: 159 TDMGYRRPST----QATKFINHP---NNFYYLSLKDISIDNERMNFPPDTFD-ITVSGEG 210
                RR  T    Q T  + +P     +YY+ L  +S+  +R++ P  +   I   G G
Sbjct: 255 AMADLRRYRTTGTVQTTSILRNPAMETAYYYVPLVGLSLGTKRLDVPATSLGMIKPDGSG 314

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFN---- 265
           G I+DSGS ++Y     +  + +  V    R  +A  +D   +  +LC+ LP        
Sbjct: 315 GTIVDSGSTMSYLEETAFRAVKKAVVEAV-RLPVANGTDEDYDDYELCFALPTGVAMEAV 373

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQR 316
           + P +  +F       DG     +  +N+F         LAV    D   V++IG+ QQ+
Sbjct: 374 KTPPLVLHF-------DGGAAMTLPRDNYFQEPRAGLMCLAVGTSPDGFGVSIIGNVQQQ 426

Query: 317 DTRFVYDLNIDLLSFVKENCSD 338
           +   ++D+     SF    C D
Sbjct: 427 NMHVLFDVRNQKFSFAPTKCDD 448


>gi|357140068|ref|XP_003571594.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 533

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 151/371 (40%), Gaps = 65/371 (17%)

Query: 7   GTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF- 49
           G  +K + +I+DTGS L +                 +FDP  S +F  + C  P C    
Sbjct: 188 GGGAKNLTVIVDTGSDLTWVQCEPCPGSSCYAQRDPLFDPAASPTFAAVPCGSPACAASL 247

Query: 50  --------KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALF 94
                    C        ++C Y + Y D S ++G  A +T+     G G      G +F
Sbjct: 248 KDATGAPGSCARSAGNSEQRCYYALSYGDGSFSRGVLAQDTL-----GLGTTTKLDGFVF 302

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC   N G       G  AG++GL R  +S +SQ  +     FSYCL    P    ++  
Sbjct: 303 GCGLSNRGLF-----GGTAGLMGLGRTDLSLVSQTAARFGGVFSYCL----PATTTSTGS 353

Query: 155 LKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           L  G       P+   T+ I  P    FY++++   ++        P        G G  
Sbjct: 354 LSLGPGPSSSFPNMAYTRMIADPTQPPFYFINITGAAVGGGAALTAPGF------GAGNV 407

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
           ++DSG+V+T     VY  +  +F   FE       S     +  CY L   +  N  P +
Sbjct: 408 LVDSGTVITRLAPSVYKAVRAEFARRFEYPAAPGFSI----LDACYDLTGRDEVN-VPLL 462

Query: 271 AFYFED-ANLRIDGENV-FIIDYENHFFLLAVA--PHDDLVALIGSQQQRDTRFVYDLNI 326
               E  A + +D   + F++  +     LA+A  P++D   +IG+ QQR+ R VYD   
Sbjct: 463 TLTLEGGAQVTVDAAGMLFVVRKDGSQVCLAMASLPYEDQTPIIGNYQQRNKRVVYDTVG 522

Query: 327 DLLSFVKENCS 337
             L F  E+C+
Sbjct: 523 SRLGFADEDCT 533


>gi|326515330|dbj|BAK03578.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 445

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 157/361 (43%), Gaps = 48/361 (13%)

Query: 15  LILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKCV---NEQCV 57
           L+LDT S+L +               +FDP  SSS++ ++   P C     V    ++C 
Sbjct: 91  LVLDTASSLPWMRCAHCLPVQRQRSPVFDPSDSSSYRPLHPTSPLCRAPNPVLPAGDKCS 150

Query: 58  YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLG 117
           + +         G+   +TI +   G      H   FGC+    GFD     G  AG LG
Sbjct: 151 FHLP----GEAHGYVGTDTIIL---GNPTLPIHSVAFGCAQSTEGFDTK---GTFAGTLG 200

Query: 118 LSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG-------YRRPSTQA 170
           + ++  S I Q+   +  RFSYCL I L +    + +++FG D+        +R      
Sbjct: 201 MGKLPTSLIMQIKDRVGSRFSYCL-IGLGHSPGRNGFIRFGADIPDPTLLVHHRIKILPT 259

Query: 171 TKFINH--PNNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
              + H   ++ YY+ L  IS++   +       F+    G GGC +D+G+ +T+     
Sbjct: 260 PPHLPHGVADSAYYVKLLGISLNGTPIPGIRQAMFERRSDGSGGCFVDAGTQVTHLVPAA 319

Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-LPETFNRFPSMAFYFED------ANLR 280
           Y  + E      +++   ++ D      LC+   P  ++  P +   FE       A+L 
Sbjct: 320 YAVVEEAVAHMVQQWGYKRVRD--PNFSLCFREHPGIWSHIPKLTLDFEGPASRTVAHLE 377

Query: 281 IDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
           I   N+F+ +D +                ++G+ QQ DTRF++DL+ + ++F +E+C  D
Sbjct: 378 IVSRNLFLKVDNQPLVCFGVYRTSRGSPTVVGAMQQVDTRFIFDLHANTITFHRESCEAD 437

Query: 340 S 340
           +
Sbjct: 438 T 438


>gi|125600538|gb|EAZ40114.1| hypothetical protein OsJ_24557 [Oryza sativa Japonica Group]
          Length = 412

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 152/362 (41%), Gaps = 67/362 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP   +  +LDTGS LI+                ++ P +S+++  ++C  P 
Sbjct: 93  LVDIAIGTPPLPLTAVLDTGSDLIWTQCDAPCRRCFPQPAPLYAPARSATYANVSCRSPM 152

Query: 46  C-----TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           C      + +C   +  C Y   Y D + T G  A ET ++   G   A+  G  FGC  
Sbjct: 153 CQALQSPWSRCSPPDTGCAYYFSYGDGTSTDGVLATETFTL---GSDTAV-RGVAFGCGT 208

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N G  +++     +G++G+ R  +S +SQLG    +R                S     
Sbjct: 209 ENLGSTDNS-----SGLVGMGRGPLSLVSQLGVTRPRR----------------SCRARA 247

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
              G   P+T +              L+ I++ +  +   P  F +T  G+GG IIDSG+
Sbjct: 248 AARGGGAPTTTS-------------PLEGITVGDTLLPIDPAVFRLTPMGDGGVIIDSGT 294

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFPSMAFYFED 276
             T      +  L     S   R +L   S     + LC+    PE     P +  +F+ 
Sbjct: 295 TFTALEERAFVALARALAS---RVRLPLASGAHLGLSLCFAAASPEAVE-VPRLVLHFDG 350

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +  E+  + D       L +     + +++GS QQ++T  +YDL   +LSF    C
Sbjct: 351 ADMELRRESYVVEDRSAGVACLGMVSARGM-SVLGSMQQQNTHILYDLERGILSFEPAKC 409

Query: 337 SD 338
            +
Sbjct: 410 GE 411


>gi|356536463|ref|XP_003536757.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 475

 Score =  106 bits (265), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/362 (26%), Positives = 156/362 (43%), Gaps = 51/362 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +I+               +F+P  SSSF  ++C    C+
Sbjct: 138 VRIGVGSPPRNQYVVMDSGSDIIWVQCEPCTQCYHQSDPVFNPADSSSFSGVSCASTVCS 197

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           +     C   +C Y + Y D S TKG  A ETI+      G+ +      GC + N G  
Sbjct: 198 HVDNAACHEGRCRYEVSYGDGSYTKGTLALETITF-----GRTLIRNVAIGCGHHNQGMF 252

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A               +SF+ QLG      FSYCLV     G  +S  L+FG +    
Sbjct: 253 VGAAGLLGL-----GGGPMSFVGQLGGQTGGAFSYCLV---SRGIESSGLLEFGREA--- 301

Query: 165 RPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            P   A    I++P   +FYY+ L  + +   R++   D F ++  G+GG ++D+G+ +T
Sbjct: 302 MPVGAAWVPLIHNPRAQSFYYIGLSGLGVGGLRVSISEDVFKLSELGDGGVVMDTGTAVT 361

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFE 275
              +  Y    + F+        AQ ++ P    +     CY L    + R P+++FYF 
Sbjct: 362 RLPTVAYEAFRDGFI--------AQTTNLPRASGVSIFDTCYDLFGFVSVRVPTVSFYFS 413

Query: 276 DAN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               L +   N  I   +   F  A AP    +++IG+ QQ   +   D     + F   
Sbjct: 414 GGPILTLPARNFLIPVDDVGTFCFAFAPSSSGLSIIGNIQQEGIQISVDGANGFVGFGPN 473

Query: 335 NC 336
            C
Sbjct: 474 VC 475


>gi|357130715|ref|XP_003566992.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 150/393 (38%), Gaps = 69/393 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------------IFDPRKSSSF 36
           VR  +GTP++  LL+ DTGS L +                           F P KS ++
Sbjct: 97  VRFRVGTPAQPFLLVADTGSDLTWVKCRPAKAAAASTNSSSSASASSPRRAFRPEKSKTW 156

Query: 37  QKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVI-------- 80
             I C    C+                C Y  +Y D S  +G    E+ ++         
Sbjct: 157 APIPCASDTCSKSLPFSLSTCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSSSSSSS 216

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                KA   G + GC+    G   +A D    GVL L    +SF S   S    RFSYC
Sbjct: 217 KNKVKKAKLQGLVLGCTGSYTGPSFEASD----GVLSLGYSNVSFASHAASRFGGRFSYC 272

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-------TKFI--NHPNNFYYLSLKDISID 191
           LV  L +    +SYL FG +     P   A       T  +  +    FY +S+K IS+D
Sbjct: 273 LVDHL-SPRNATSYLTFGPNSALSGPCPAAAGPGARQTPLVLDSRMRPFYDVSIKAISVD 331

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
            E +  P D ++  V G GG I+DSG+ LT      Y  +         RF    +    
Sbjct: 332 GELLKIPRDVWE--VDGGGGVIVDSGTSLTVLAKPAYRAVVAALGKKLARFPRVAM---- 385

Query: 252 EPIQLCYFLPETFNR-----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHD 304
           +P + CY       +      P +A +F  +         ++ID       + V   P  
Sbjct: 386 DPFEYCYNWTSPSRKDEGDDLPKLAVHFAGSARLEPPSKSYVIDAAPGVKCIGVQEGPWP 445

Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             +++IG+  Q++  + +DL    L F +  C+
Sbjct: 446 G-ISVIGNILQQEHLWEFDLKNRRLRFKRSRCT 477


>gi|302809855|ref|XP_002986620.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
 gi|300145803|gb|EFJ12477.1| hypothetical protein SELMODRAFT_124369 [Selaginella moellendorffii]
          Length = 423

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 156/358 (43%), Gaps = 41/358 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           V L +GTP + V ++ DTGS +++               +F+P  SS+FQ I C    C 
Sbjct: 83  VSLGVGTPPRTVNMVADTGSDVLWLQCLPCQSCYGQTDPLFNPSFSSTFQSITCGSSLCQ 142

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C   QC+Y + Y D S T G  + ET+S      G    +    GC ++N G  
Sbjct: 143 QLLIRGCRRNQCLYQVSYGDGSFTVGEFSTETLSF-----GSNAVNSVAIGCGHNNQGLF 197

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY-LKFGTDMGY 163
             A            +  +SF SQ+G +    FSYC    LP  E T S  L FG     
Sbjct: 198 TGAAGLLGL-----GKGLLSFPSQVGQLYGSVFSYC----LPTRESTGSVPLIFGNQA-- 246

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVL 220
              + Q T  + +P  + FYY+ +  I +    ++ P  +  + + +G GG I+DSG+ +
Sbjct: 247 VASNAQFTTLLTNPKLDTFYYVEMVGIKVGGTSVSIPAGSLSLDSSTGNGGVILDSGTAV 306

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE-DAN 278
           T   +  Y  + + F +       A+++        CY L   +    P+++F F   A 
Sbjct: 307 TRLVTSAYNPMRDAFRAGMP--SDAKMTSGFSLFDTCYDLSGRSSIMLPAVSFVFNGGAT 364

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  +N+ +    +  + LA AP+ +  ++IG+ QQ+  R  +D   + +      C
Sbjct: 365 MALPAQNIMVPVDNSGTYCLAFAPNSENFSIIGNIQQQSFRMSFDSTGNRVGIGANQC 422


>gi|356553832|ref|XP_003545255.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 427

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 161/374 (43%), Gaps = 57/374 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           + L IG+P + V ++LDTGS L +          + F+P  SSS+    C+   C     
Sbjct: 61  ISLTIGSPPQNVTMVLDTGSELSWLHCKKLPNLNSTFNPLLSSSYTPTPCNSSVCMTRTR 120

Query: 49  -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  C   N+ C   + YAD S  +G  A ET S+ G  +      G LFGC  D+ 
Sbjct: 121 DLTIPASCDPNNKLCHVIVSYADASSAEGTLAAETFSLAGAAQ-----PGTLFGCM-DSA 174

Query: 102 GFDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G+  D   D    G++G++R ++S ++Q+   +  +FSYC+     +GE     L  G  
Sbjct: 175 GYTSDINEDAKTTGLMGMNRGSLSLVTQM---VLPKFSYCI-----SGEDAFGVLLLGD- 225

Query: 161 MGYRRPST-QATKFINHPNN-------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            G   PS  Q T  +    +        Y + L+ I +  + +  P   F    +G G  
Sbjct: 226 -GPSAPSPLQYTPLVTATTSSPYFDRVAYTVQLEGIKVSEKLLQLPKSVFVPDHTGAGQT 284

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRF 267
           ++DSG+  T+    VY  L ++F+   +   L ++ D P       + LCY  P +    
Sbjct: 285 MVDSGTQFTFLLGPVYNSLKDEFLEQTKGV-LTRIED-PNFVFEGAMDLCYHAPASLAAV 342

Query: 268 PSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
           P++   F  A +R+ GE  ++ +     +       + DL+ +    IG   Q++    +
Sbjct: 343 PAVTLVFSGAEMRVSGERLLYRVSKGRDWVYCFTFGNSDLLGIEAYVIGHHHQQNVWMEF 402

Query: 323 DLNIDLLSFVKENC 336
           DL    + F +  C
Sbjct: 403 DLVKSRVGFTETTC 416


>gi|356523155|ref|XP_003530207.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 412

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 107/368 (29%), Positives = 158/368 (42%), Gaps = 62/368 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G  S  + +I+DTGS L +               IF P  SSS+Q ++C+   C   + 
Sbjct: 69  MGLGSTNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQF 128

Query: 52  VN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                         C Y + Y D S T G    E +S      G       +FGC  +N 
Sbjct: 129 ATGNTGACGSNPSTCNYVVNYGDGSYTNGELGVEQLSF-----GGVSVSDFVFGCGRNNK 183

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTD 160
           G       G ++G++GL R  +S +SQ  +     FSYCL    P  E   S  L  G +
Sbjct: 184 GLF-----GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL----PTTESGASGSLVMGNE 234

Query: 161 MGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
               +  T  T     PN    NFY L+L  I +D   +  P  +F     G GG +IDS
Sbjct: 235 SSVFKNVTPITYTRMLPNPQLSNFYILNLTGIDVDGVALQVP--SF-----GNGGVLIDS 287

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFY 273
           G+V+T   S VY  L   F+  F  F  A     P    +  C+ L        P+++ +
Sbjct: 288 GTVITRLPSSVYKALKALFLKQFTGFPSA-----PGFSILDTCFNLTGYDEVSIPTISMH 342

Query: 274 FE-DANLRIDGENVF-IIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLL 329
           FE +A L++D    F ++  +     LA+A   D    A+IG+ QQR+ R +YD     +
Sbjct: 343 FEGNAELKVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSKV 402

Query: 330 SFVKENCS 337
            F +E+CS
Sbjct: 403 GFAEESCS 410


>gi|413953772|gb|AFW86421.1| hypothetical protein ZEAMMB73_098827 [Zea mays]
          Length = 482

 Score =  105 bits (263), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/379 (26%), Positives = 157/379 (41%), Gaps = 65/379 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP++   ++ DTGS L +                +FDP KSS++  + C  P 
Sbjct: 127 VVTIGIGTPARNFTVLFDTGSDLTWVQCKPCTDSCYQQQEPLFDPSKSSTYVDVPCGTPQ 186

Query: 46  CTY-----FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND- 99
           C         C    C Y++KY DQSVT+G  A E  ++       A   G +FGCS++ 
Sbjct: 187 CKIGGGQDLTCGGTTCEYSVKYGDQSVTRGNLAQEAFTLSPSAPPAA---GVVFGCSHEY 243

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR------FSYCLVIPLPNGEYTSS 153
           + G      + ++AG+LGL R   S +SQ      +R      FSYC    LP    ++ 
Sbjct: 244 SSGVKGAEEEMSVAGLLGLGRGDSSILSQ-----TRRGNSGDVFSYC----LPPRGSSAG 294

Query: 154 YLKFGTDMGYRRPSTQATKFI------NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           YL  G       P      F       +  ++ Y ++L  IS+    +      F I   
Sbjct: 295 YLTIGAAA----PPQSNLSFTPLVTDNSQLSSVYVVNLVGISVSGAALPIDASAFYI--- 347

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
              G +IDSG+V+T+  +  Y+ L ++F  +   + +       E +  CY         
Sbjct: 348 ---GTVIDSGTVITHMPAAAYYVLRDEFRRHMGGYTMLPEGHV-ESLDTCYDVTGHDVVT 403

Query: 267 FPSMAFYF-EDANLRIDGEN---VFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDT 318
            P +A  F   A + +D      VF +D       LA    V  +     +IG+ QQR  
Sbjct: 404 APPVALEFGGGARIDVDASGILLVFAVDASGQSLTLACLAFVPTNLPGFVIIGNMQQRAY 463

Query: 319 RFVYDLNIDLLSFVKENCS 337
             V+D+    + F    CS
Sbjct: 464 NVVFDVEGRRIGFGANGCS 482


>gi|42568291|ref|NP_199124.3| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332007527|gb|AED94910.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 631

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 154/373 (41%), Gaps = 67/373 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  S+S+Q + C+ PDC 
Sbjct: 78  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-PDCN 136

Query: 48  YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E   CVY  +YA+ S + G  + + IS     E +     A+FGC N+  G   
Sbjct: 137 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETG--- 188

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D       G++GL R  +S + QL    +I+  FS C         Y    +  G  M  
Sbjct: 189 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 238

Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            + S       +H + F    Y + LK + +  + +   P  F+    G+ G ++DSG+ 
Sbjct: 239 GKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 294

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
             YF  + +  + +  +      +     D P    +C+      + E  N FP +A  F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIAMEF 353

Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            +           I+  EN+ F          L + P  D   L+G    R+T   YD  
Sbjct: 354 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406

Query: 326 IDLLSFVKENCSD 338
            D L F+K NCSD
Sbjct: 407 NDKLGFLKTNCSD 419


>gi|9757837|dbj|BAB08274.1| unnamed protein product [Arabidopsis thaliana]
          Length = 586

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 154/373 (41%), Gaps = 67/373 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  S+S+Q + C+ PDC 
Sbjct: 78  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSTSYQALKCN-PDCN 136

Query: 48  YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E   CVY  +YA+ S + G  + + IS     E +     A+FGC N+  G   
Sbjct: 137 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLSPQRAVFGCENEETG--- 188

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D       G++GL R  +S + QL    +I+  FS C         Y    +  G  M  
Sbjct: 189 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 238

Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            + S       +H + F    Y + LK + +  + +   P  F+    G+ G ++DSG+ 
Sbjct: 239 GKISPPPGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 294

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
             YF  + +  + +  +      +     D P    +C+      + E  N FP +A  F
Sbjct: 295 YAYFPKEAFIAIKDAVIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIAMEF 353

Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            +           I+  EN+ F          L + P  D   L+G    R+T   YD  
Sbjct: 354 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 406

Query: 326 IDLLSFVKENCSD 338
            D L F+K NCSD
Sbjct: 407 NDKLGFLKTNCSD 419


>gi|242041575|ref|XP_002468182.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
 gi|241922036|gb|EER95180.1| hypothetical protein SORBIDRAFT_01g041210 [Sorghum bicolor]
          Length = 490

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 155/365 (42%), Gaps = 42/365 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           + ++ +GTP    LL LDT S L +               +FDPR S+S+++++ +  DC
Sbjct: 139 IAKIAVGTPGVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYREMSFNAADC 198

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                          CVYT+ Y D S T G    ET++  G      I      GC +DN
Sbjct: 199 QALGRSGGGDAKRGTCVYTVGYGDGSTTVGDFIEETLTFAGGVRLPRIS----IGCGHDN 254

Query: 101 HG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            G F   A     AG+LGL R  +SF +Q+       FSYCLV  L      SS L FG 
Sbjct: 255 KGLFGAPA-----AGILGLGRGLMSFPNQIDH--NGTFSYCLVDFLSGPGSLSSTLTFGA 307

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCII 214
                 P    T  + + N   FYY+ L  IS+   R+    +  D+ +   +G GG I+
Sbjct: 308 GAVDTSPPVSFTPTVLNLNMPTFYYVRLTGISVGGVRVPGVTER-DLQLDPYTGRGGVIV 366

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFY 273
           DSG+ +T      Y    + F +         +         CY +      + P+++ +
Sbjct: 367 DSGTAVTRLARPAYTAFRDAFRAVAVDLGQVSIGGPSGFFDTCYTVGGRGMKKVPTVSMH 426

Query: 274 FEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           F  +  +++  +N  I +D          A  D  V++IG+ QQ+  R VYD+    + F
Sbjct: 427 FAGSVEVKLQPKNYLIPVDSMGTVCFAFAATGDHSVSIIGNIQQQGFRIVYDIG-GRVGF 485

Query: 332 VKENC 336
              +C
Sbjct: 486 APNSC 490


>gi|255543963|ref|XP_002513044.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548055|gb|EEF49547.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 431

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 87/357 (24%), Positives = 151/357 (42%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP++ +LL +DT +   +            +F+  KS++F+ + C+ P C   
Sbjct: 97  IVRAKIGTPAQTMLLAMDTSNDAAWIPCSGCVGCSSTVFNNVKSTTFKTVGCEAPQCKQV 156

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              KC    C + M Y   S+    AA+ +  V+             FGC  +  G    
Sbjct: 157 PNSKCGGSACAFNMTYGSSSI----AANLSQDVVTLATDS--IPSYTFGCLTEATGSSIP 210

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +     G+LGL R  +S +SQ  ++ +  FSYCL  P       S  L+ G     +R 
Sbjct: 211 PQ-----GLLGLGRGPMSLLSQTQNLYQSTFSYCL--PSFRSLNFSGSLRLGPVGQPKR- 262

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L  I +    ++ PP       +   G I DSG+V T   
Sbjct: 263 -IKTTPLLKNPRRSSLYYVNLMAIRVGRRVVDIPPSALAFNPTTGAGTIFDSGTVFTRLV 321

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
           +  Y  + + F        +  L         CY  P      P++ F F   N+ +  +
Sbjct: 322 APAYTAVRDAFRKRVGNATVTSLGG----FDTCYTSPIV---APTITFMFSGMNVTLPPD 374

Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           N+ I    +    LA+A   D    ++ +I + QQ++ R ++D+    L   +E C+
Sbjct: 375 NLLIHSTASSITCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRLGVAREPCT 431


>gi|168054484|ref|XP_001779661.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162668975|gb|EDQ55572.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 419

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 153/368 (41%), Gaps = 44/368 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           V  F+GTP +   LI+D+GS L++               ++ P  SS+F  + C  P+C 
Sbjct: 67  VDFFLGTPPQKFSLIVDSGSDLLWVQCAPCLQCYAQDTPLYAPSNSSTFNPVPCLSPECL 126

Query: 48  Y------FKC---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
                  F C       C Y  +YAD S++KG  A+E+ +V              FGC  
Sbjct: 127 LIPATEGFPCDFHYPGACAYEYRYADTSLSKGVFAYESATV-----DDVRIDKVAFGCGR 181

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           DN G        A  GVLGL +  +SF SQ+G     +F+YCLV  L +    SS+L FG
Sbjct: 182 DNQG-----SFAAAGGVLGLGQGPLSFGSQVGYAYGNKFAYCLVNYL-DPTSVSSWLIFG 235

Query: 159 TDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            ++       Q T  +++  N   YY+ ++ + +  E +      + +   G GG I DS
Sbjct: 236 DELISTIHDLQFTPIVSNSRNPTLYYVQIEKVMVGGESLPISHSAWSLDFLGNGGSIFDS 295

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
           G+ +TY+    Y  +   F       + A +    + + LC  +       FPS      
Sbjct: 296 GTTVTYWLPPAYRNILAAFDKNVRYPRAASV----QGLDLCVDVTGVDQPSFPSFTIVLG 351

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFV 332
              +    +  + +D   +   LA+A     V     IG+  Q++    YD   + + F 
Sbjct: 352 GGAVFQPQQGNYFVDVAPNVQCLAMAGLPSSVGGFNTIGNLLQQNFLVQYDREENRIGFA 411

Query: 333 KENCSDDS 340
              CS  S
Sbjct: 412 PAKCSSHS 419


>gi|356498789|ref|XP_003518231.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 446

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 96/360 (26%), Positives = 158/360 (43%), Gaps = 36/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++   IG P    L ++DTGS+L +               IFDP KSS++  ++C   +C
Sbjct: 94  LMNFSIGEPPIPQLAVMDTGSSLTWVMCHPCSSCSQQSVPIFDPSKSSTYSNLSCS--EC 151

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                VN +C Y+++Y     ++G  A E +++    E        +FGC          
Sbjct: 152 NKCDVVNGECPYSVEYVGSGSSQGIYAREQLTLETIDESIIKVPSLIFGCGRKFSISSNG 211

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
                + GV GL     S +   G    K+FSYC +  L N  Y  + L  G     +  
Sbjct: 212 YPYQGINGVFGLGSGRFSLLPSFG----KKFSYC-IGNLRNTNYKFNRLVLGDKANMQGD 266

Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSVLTYFHS 225
           ST     +N  N  YY++L+ ISI   +++  P  F+ +++    G IIDSG+  T+   
Sbjct: 267 STT----LNVINGLYYVNLEAISIGGRKLDIDPTLFERSITDNNSGVIIDSGADHTWLTK 322

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-EDANLRID 282
             +  L  +  +  E   +    D   P  LCY   + +  + FP + F+F E A L +D
Sbjct: 323 YGFEVLSFEVENLLEGVLVLAQQDKHNPYTLCYSGVVSQDLSGFPLVTFHFAEGAVLDLD 382

Query: 283 GENVFIIDYENHFFLLAVAPHD------DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             ++FI   EN  F +A+ P +      +  + IG   Q++    YDLN   + F + +C
Sbjct: 383 VTSMFIQTTENE-FCMAMLPGNYFGDDYESFSSIGMLAQQNYNVGYDLNRMRVYFQRIDC 441


>gi|357486591|ref|XP_003613583.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355514918|gb|AES96541.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 437

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 166/358 (46%), Gaps = 33/358 (9%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++   IGTP   +  ++DT +  I+               +FDP KSS+++ I C  P C
Sbjct: 90  IISFLIGTPPFQLYGVMDTANDNIWFQCNPCKPCFNTTSPMFDPSKSSTYKTIPCSSPKC 149

Query: 47  TYFK---CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
              +   C ++    C Y+  Y  ++ ++G  + +T+++    +    F   + GC + N
Sbjct: 150 KNVENTHCSSDDKKVCEYSFTYGGEAYSQGDLSIDTLTLNSNNDTPISFKNIVIGCGHRN 209

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G      +G ++G +GL R  +SFISQL S I  +FSYCLV PL + E  S  L FG  
Sbjct: 210 KG----PLEGYVSGNIGLGRGPLSFISQLNSSIGGKFSYCLV-PLFSNEGISGKLHFGDK 264

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                  T +T  I      Y  +L  +S+ +  + F   T        G  IIDSG+ L
Sbjct: 265 SVVSGVGTVSTP-ITAGEIGYSTTLNALSVGDHIIKFENSTSK--NDNLGNTIIDSGTTL 321

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T    +VY +L E  V+   + + A+  +  +  +LCY         P +  +F  A++ 
Sbjct: 322 TILPENVYSRL-ESIVTSMVKLERAKSPN--QQFKLCYKATLKNLDVPIITAHFNGADVH 378

Query: 281 IDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++  N F  ID+E   F   V+  +    +IG+  Q++    +DL  +++SF   +C+
Sbjct: 379 LNSLNTFYPIDHEVVCFAF-VSVGNFPGTIIGNIAQQNFLVGFDLQKNIISFKPTDCT 435


>gi|307136234|gb|ADN34070.1| aspartic proteinase nepenthesin-1 precursor [Cucumis melo subsp.
           melo]
          Length = 412

 Score =  105 bits (261), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 164/378 (43%), Gaps = 61/378 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G+P + V ++LDTGS L +          ++F+P  SSS+  I C  P C     
Sbjct: 42  VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPVCRTRTR 101

Query: 49  -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  ++ C   + YAD S  +G  A +   +     G +   G LFGC +   G
Sbjct: 102 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDS--G 154

Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--- 158
           F  ++  D    G++G++R ++SF++QLG     +FSYC+     +G  +S  L FG   
Sbjct: 155 FSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCI-----SGRDSSGVLLFGDSH 206

Query: 159 ----TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                ++ Y      +T         Y + L  I + N+ +  P   F    +G G  ++
Sbjct: 207 LSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMV 266

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
           DSG+  T+    VY  L  +F+    +  LA L D P       + LCY +P        
Sbjct: 267 DSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGD-PNFVFQGAMDLCYRVPAGGKLPEL 324

Query: 268 PSMAFYFEDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVAL----IGSQQQRDT 318
           P+++  F  A + + GE +      ++  +   + L    + DL+ +    IG   Q++ 
Sbjct: 325 PAVSLMFRGAEMVVGGEVLLYKVPGMMKGKEWVYCLTFG-NSDLLGIEAFVIGHHHQQNV 383

Query: 319 RFVYDLNIDLLSFVKENC 336
              +DL    + FV+  C
Sbjct: 384 WMEFDLVKSRVGFVETRC 401


>gi|297822477|ref|XP_002879121.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297324960|gb|EFH55380.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 711

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 101/357 (28%), Positives = 159/357 (44%), Gaps = 50/357 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINC-DHPD 45
           +++L +GTP   +  ++DTGS + +               IFDP KSS+F++  C DH  
Sbjct: 381 LMKLQVGTPPFEIEAVIDTGSEITWTQCLPCVHCYKQNAPIFDPSKSSTFKEKRCHDH-- 438

Query: 46  CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                     C Y + Y D++ TKG  A +T+++        +    + GC  +N  F  
Sbjct: 439 ---------SCPYEVDYFDKTYTKGTLATDTVTIHSTSGEPFVMAETIIGCGRNNSWFRP 489

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM---G 162
                +  G +GL+   +S I+Q+G       SYC      NG   +S + FGT+    G
Sbjct: 490 -----SFEGFVGLNWGPLSLITQMGGEYPGLMSYCFA---GNG---TSKINFGTNAIVGG 538

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
               ST        P  FYYL+L  +S+ + R+      F      EG  +IDSG+ LTY
Sbjct: 539 GGVVSTTMFVTTARP-GFYYLNLDAVSVGDTRIETLGTPFHAL---EGNIVIDSGTTLTY 594

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRI 281
           F  + Y  L  + V +         +D      LCY+   T   FP +  +F   A+L +
Sbjct: 595 F-PESYCNLVRQAVEHV--VPAVPAADPTGNDLLCYY-SNTTEIFPVITMHFSGGADLVL 650

Query: 282 DGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           D  N+F+  Y    F LA+  ++    A+ G++ Q +    YD +  L+SF   NCS
Sbjct: 651 DKYNMFMESYSGGLFCLAIICNNPTQEAIFGNRAQNNFLVGYDSSSLLVSFKPTNCS 707



 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 97/340 (28%), Positives = 147/340 (43%), Gaps = 64/340 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L IGTP   V  +LDTGS LI+               IFDP KSS+F++  C+ PD 
Sbjct: 66  LMKLQIGTPPFEVEAVLDTGSELIWTQCLPCLHCYDQKAPIFDPSKSSTFKETRCNTPD- 124

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y D+S T+G  A ET+++        +    + GCS +N G    
Sbjct: 125 -------HSCPYKLVYDDKSYTQGTLATETVTIHSTSGVPFVMPETIIGCSRNNSG---S 174

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
               + +G++GLSR ++S ISQ+G                 G Y    +   T       
Sbjct: 175 GFRPSSSGIVGLSRGSLSLISQMG-----------------GAYPGDGVVSTTMFAKTAK 217

Query: 167 STQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
             Q           YYL+L  +S+ + R+      F       G  +IDSG+ LTYF   
Sbjct: 218 RGQ-----------YYLNLDAVSVGDTRIETVGTPFHAL---NGNIVIDSGTPLTYFPVS 263

Query: 227 VYWKLHEKFVSYFERFQLA-QLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDGE 284
            Y  L  K V   ER   A ++ D      LCY+   T   FP +  +F   A+L +D  
Sbjct: 264 -YCNLVRKAV---ERVVTADRVVDPSRNDMLCYY-SNTIEIFPVITVHFSGGADLVLDKY 318

Query: 285 NVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYD 323
           N+++       F LA+  ++   VA+ G++ Q +    YD
Sbjct: 319 NMYMELNRGGVFCLAIICNNPTQVAIFGNRAQNNFLVGYD 358


>gi|125543639|gb|EAY89778.1| hypothetical protein OsI_11320 [Oryza sativa Indica Group]
          Length = 488

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 91/293 (31%), Positives = 138/293 (47%), Gaps = 27/293 (9%)

Query: 53  NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
           N+ CVYT  Y D+SVT G    +  +    G G ++  G  FGC   N+G  +       
Sbjct: 211 NQTCVYTYYYNDKSVTTGLLEVDKFTF---GAGASV-PGVAFGCGLFNNGVFKSNE---- 262

Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG-EYTSSYLKFGTDMGYR--RPSTQ 169
            G+ G  R  +S  SQL       FS+C      NG + ++  L    D+ Y+  R + Q
Sbjct: 263 TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--NGLKQSTVLLDLLADL-YKNGRGAVQ 316

Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
           +T  I +  N   YYLSLK I++ + R+  P   F +T +G GG IIDSG+ +T     V
Sbjct: 317 STPLIQNSANPTLYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQV 375

Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGEN- 285
           Y  + ++F +  +   +   +  P     C+  P       P +  +FE A + +  EN 
Sbjct: 376 YQVVRDEFAAQIKLPVVPGNATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENY 432

Query: 286 VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           VF +  D  N    LA+    D  A IG+ QQ++   +YDL  ++LSFV   C
Sbjct: 433 VFEVPDDAGNSMICLAINELGDERATIGNFQQQNMHVLYDLQNNMLSFVAAQC 485



 Score = 47.8 bits (112), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 37/134 (27%), Positives = 65/134 (48%), Gaps = 9/134 (6%)

Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
           I++ + R+  P   F +T +G GG IIDSG+ +T     VY  + ++F +  +   +   
Sbjct: 42  ITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQVYQVVRDEFAAQIKLPVVPGN 100

Query: 248 SDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN-VFII--DYENHFFLLAVAPH 303
           +  P     C+  P +     P +  +FE A + +  EN VF +  D  N    LA+   
Sbjct: 101 ATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENYVFEVPDDAGNSIICLAINKG 157

Query: 304 DDLVALIGSQQQRD 317
           D+   +IG+ QQ++
Sbjct: 158 DE-TTIIGNFQQQN 170


>gi|15234607|ref|NP_194733.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938480|emb|CAB43839.1| putative protein [Arabidopsis thaliana]
 gi|7269904|emb|CAB80997.1| putative protein [Arabidopsis thaliana]
 gi|67633776|gb|AAY78812.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660311|gb|AEE85711.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 427

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 104/360 (28%), Positives = 156/360 (43%), Gaps = 47/360 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V + IG+P    LL +DT S L++               IFDP +S + +   C     
Sbjct: 86  LVNISIGSPPITQLLHMDTASDLLWIQCLPCINCYAQSLPIFDPSRSYTHRNETCRTSQY 145

Query: 47  TY----FKCVNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSNDN 100
           +     F      C Y+M+Y D + +KG  A E +  + I      A  H  +FGC +DN
Sbjct: 146 SMPSLKFNANTRSCEYSMRYVDDTGSKGILAREMLLFNTIYDESSSAALHDVVFGCGHDN 205

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           +G           G+LGL     S + + G    K+FSYC    L +  Y  + L  G D
Sbjct: 206 YG-----EPLVGTGILGLGYGEFSLVHRFG----KKFSYCFG-SLDDPSYPHNVLVLGDD 255

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT-VSGEGGCIIDSGSV 219
                  T   +     N FYY++++ IS+D   +   P  F+    +G GG IID+G+ 
Sbjct: 256 GANILGDTTPLEI---HNGFYYVTIEAISVDGIILPIDPRVFNRNHQTGLGGTIIDTGNS 312

Query: 220 LTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFNR------FPSMAF 272
           LT    + Y  L  +    FE RF  A +S        CY     F R      FP + F
Sbjct: 313 LTSLVEEAYKPLKNRIEDIFEGRFTAADVSQDDMIKMECY--NGNFERDLVESGFPIVTF 370

Query: 273 YF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           +F E A L +D +++F +    + F LAV P +  +  IG+  Q+     YDL    +SF
Sbjct: 371 HFSEGAELSLDVKSLF-MKLSPNVFCLAVTPGN--LNSIGATAQQSYNIGYDLEAMEVSF 427


>gi|449441139|ref|XP_004138341.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
 gi|449477464|ref|XP_004155031.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 336

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 106/356 (29%), Positives = 146/356 (41%), Gaps = 47/356 (13%)

Query: 6   IGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHPDCTY 48
           +G P +    +LDTGS +                 I  IFDP  SSS+  ++CD   C  
Sbjct: 3   VGQPQQPSFFVLDTGSDVTWLQCLPCAGKNGCYEQITPIFDPELSSSYNPVSCDSEQCQL 62

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C+Y ++Y D S T G  A ET++ +       I      GC +DN G   
Sbjct: 63  LDEAGCNVNSCIYKVEYGDGSFTIGELATETLTFVHSNSIPNIS----IGCGHDNEGLFV 118

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDMGYR 164
            A               IS  SQL +     FSYCLV I  P    + S L F TD    
Sbjct: 119 GADGLIGL-----GGGAISISSQLKA---SSFSYCLVDIDSP----SFSTLDFNTDPPSD 166

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
              +   K    P+ F Y+ +  +S+  + +      F+I  SG GG I+DSG+ +T   
Sbjct: 167 SLISPLVKNDRFPS-FRYVKVIGMSVGGKPLPISSSRFEIDESGLGGIIVDSGTTITQLP 225

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFEDAN-LR 280
           SDVY  L E F+          L   PE  P   CY L    N   P++AF     N L+
Sbjct: 226 SDVYEVLREAFLGL-----TTNLPPAPEISPFDTCYDLSSQSNVEVPTIAFILPGENSLQ 280

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N  I       F LA       +++IG+ QQ+  R  YDL   L+ F    C
Sbjct: 281 LPAKNCLIQVDSAGTFCLAFVSATFPLSIIGNFQQQGIRVSYDLTNSLVGFSTNKC 336


>gi|356540371|ref|XP_003538663.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 374

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 96/357 (26%), Positives = 156/357 (43%), Gaps = 31/357 (8%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP   +  I DTGS L +               IFDP+KS+S++ I+CD   C
Sbjct: 26  LMEVSIGTPPFKIYGIADTGSDLTWTSCVPCNKCYKQRNPIFDPQKSTSYRNISCDSKLC 85

Query: 47  ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
               T      + C YT  YA  ++T+G  A ETI++           G +FGC ++N G
Sbjct: 86  HKLDTGVCSPQKHCNYTYAYASAAITQGVLAQETITLSSTKGESVPLKGIVFGCGHNNTG 145

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
              D       G++GL    +SFISQ+GS    KRFS CLV P       SS +  G   
Sbjct: 146 GFNDRE----MGIIGLGGGPVSFISQIGSSFGGKRFSQCLV-PFHTDVSVSSKMSLGKGS 200

Query: 162 GYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                   +T  +   +   Y+++L  IS+ N  ++F   +       +G   +DSG+  
Sbjct: 201 EVSGKGVVSTPLVAKQDKTPYFVTLLGISVGNTYLHFNGSSSQSVE--KGNVFLDSGTPP 258

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T   + +Y +L  +  S  E       +D     QLCY       R P +  +FE  +++
Sbjct: 259 TILPTQLYDRLVAQVRS--EVAMKPVTNDLDLGPQLCYRTKNNL-RGPVLTAHFEGGDVK 315

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +     F +  ++  F L          + G+  Q +    +DL+  ++SF   +C+
Sbjct: 316 LLPTQTF-VSPKDGVFCLGFTNTSSDGGVYGNFAQSNYLIGFDLDRQVVSFKPMDCT 371


>gi|242092898|ref|XP_002436939.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
 gi|241915162|gb|EER88306.1| hypothetical protein SORBIDRAFT_10g011740 [Sorghum bicolor]
          Length = 469

 Score =  104 bits (260), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 102/369 (27%), Positives = 153/369 (41%), Gaps = 64/369 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V L  GTPS   +L++DTGS + +                 +FDP KSS++  I C+  
Sbjct: 132 VVTLGFGTPSVPQVLLMDTGSDVSWVQCTPCNSTKCYPQKDPLFDPSKSSTYAPIACNTD 191

Query: 45  DCTYFK------CVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
            C          C +   QC Y+++YAD S ++G  ++ET++ +  G     FH   FGC
Sbjct: 192 ACRKLGDHYHNGCTSGGTQCGYSVEYADGSHSRGVYSNETLT-LAPGITVEDFH---FGC 247

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             D  G   D  DG    +LGL    +S + Q  S+    FSYCL  P  N E  + +L 
Sbjct: 248 GRDQRG-PSDKYDG----LLGLGGAPVSLVVQTSSVYGGAFSYCL--PALNSE--AGFLV 298

Query: 157 FGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
            G+      PS   + F+  P         FY +++  IS+  + ++ P   F       
Sbjct: 299 LGSP-----PSGNKSAFVFTPMRHLPGYATFYMVTMTGISVGGKPLHIPQSAF------R 347

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFP 268
           GG IIDSG+V T      Y  L        + + L    D       CY      N   P
Sbjct: 348 GGMIIDSGTVDTELPETAYNALEAALRKALKAYPLVPSDD----FDTCYNFTGYSNITVP 403

Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
            +AF F   A + +D  N  ++   N       +  DD + +IG+  QR    +YD    
Sbjct: 404 RVAFTFSGGATIDLDVPNGILV---NDCLAFQESGPDDGLGIIGNVNQRTLEVLYDAGRG 460

Query: 328 LLSFVKENC 336
            + F    C
Sbjct: 461 NVGFRAGAC 469


>gi|413947545|gb|AFW80194.1| hypothetical protein ZEAMMB73_386053 [Zea mays]
          Length = 456

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 162/373 (43%), Gaps = 61/373 (16%)

Query: 6   IGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCTYF 49
           +GTP   +L I DTGS L++                 +F P +S+++  ++C    C   
Sbjct: 106 VGTPPAQMLAIADTGSDLVWVNCSSNGGGGGASDGAVVFHPSRSTTYSLLSCQSAACQAL 165

Query: 50  ---KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGCSNDNHG 102
               C  + +C Y   Y D S T G  + ET S     G GEG+       FGCS  + G
Sbjct: 166 SQASCDADSECQYQYAYGDGSRTIGVLSTETFSFAAAGGGGEGQVRVPRVSFGCSTGSAG 225

Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            F  D       G++GL    +S +SQLG+   I +RFSYCLV P      +SS L FG 
Sbjct: 226 SFRSD-------GLVGLGAGALSLVSQLGAAARIARRFSYCLVPPYAAAN-SSSTLSFGA 277

Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 P   +T  + +  +++Y ++L+ +++  +         D+  +     I+DSG+
Sbjct: 278 RAVVSDPGAASTPLVPSEVDSYYTVALESVAVAGQ---------DVASANSSRIIVDSGT 328

Query: 219 VLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
            LT+        L    V+  E R +L +     + +QLCY   +   +  +  F   D 
Sbjct: 329 TLTFLDP----ALLRPLVAELERRIRLPRAQPPEQLLQLCY---DVQGKSQAEDFGIPDV 381

Query: 278 NLRIDGENVFIIDYENHFFL-------LAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
            LR  G     +  EN F L       L + P      V+++G+  Q++    YDL+   
Sbjct: 382 TLRFGGGASVTLRPENTFSLLEEGTLCLVLVPVSESQPVSILGNIAQQNFHVGYDLDART 441

Query: 329 LSFVKENCSDDSA 341
           ++F   +C+  SA
Sbjct: 442 VTFAAVDCTRSSA 454


>gi|242091325|ref|XP_002441495.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
 gi|241946780|gb|EES19925.1| hypothetical protein SORBIDRAFT_09g028050 [Sorghum bicolor]
          Length = 466

 Score =  104 bits (260), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 148/360 (41%), Gaps = 36/360 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------IFDPRKSSSFQKINCDHPDC----- 46
           V+L +GTP +   L+ DTGS L +           +F P+ S S+  I C    C     
Sbjct: 118 VKLRVGTPVQEFTLVADTGSDLTWVKCAGASPPGRVFRPKTSRSWAPIPCSSDTCKLDVP 177

Query: 47  -TYFKCVN--EQCVYTMKYADQSV-TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            T   C +    C Y  +Y + S   +G    E+ ++   G   A     + GCS+ + G
Sbjct: 178 FTLANCSSPASPCTYDYRYKEGSAGARGIVGTESATIALPGGKVAQLKDVVLGCSSSHDG 237

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               + DG    VL L    ISF +Q  +     FSYCLV  L     T  YL FG    
Sbjct: 238 QSFRSADG----VLSLGNAKISFATQAAARFGGSFSYCLVDHLAPRNATG-YLAFGPGQV 292

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
            R P+TQ   F++    FY + +  I +  + ++ P + +D   +  GG I+DSG+ LT 
Sbjct: 293 PRTPATQTKLFLDPEMPFYGVKVDAIHVAGKALDIPAEVWD---AKSGGVILDSGNTLTV 349

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----PETFNRFPSMAFYFEDAN 278
             +  Y  +      + +            P + CY      P      P +A  F  + 
Sbjct: 350 LAAPAYKAVVAALSKHLDGVPKVSF----PPFEHCYNWTARRPGAPEIIPKLAVQFAGSA 405

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                   ++ID +     + V   +   +++IG+  Q++  + +DL    + F + NC+
Sbjct: 406 RLEPPAKSYVIDVKPGVKCIGVQEGEWPGLSVIGNIMQQEHLWEFDLKNMQVRFKQSNCT 465


>gi|297742733|emb|CBI35367.3| unnamed protein product [Vitis vinifera]
          Length = 521

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 142/348 (40%), Gaps = 68/348 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  S+SF  ++C    C 
Sbjct: 203 VRIGVGSPPRSQYMVIDSGSDIVWVQCQPCTQCYHQSDPVFDPADSASFTGVSCSSSVCD 262

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C   +C Y + Y D S TKG  A ET++      G+ +      GC + N G  
Sbjct: 263 RLENAGCHAGRCRYEVSYGDGSYTKGTLALETLTF-----GRTMVRSVAIGCGHRNRGMF 317

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QLG      FSYCLV                      
Sbjct: 318 VGAAGLLGL-----GGGSMSFVGQLGGQTGGAFSYCLV---------------------- 350

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             S      + +P   +FYY+ L  + +   R+    + F +T  G+GG ++D+G+ +T 
Sbjct: 351 --SAAWVPLVRNPRAPSFYYIGLAGLGVGGIRVPISEEVFRLTELGDGGVVMDTGTAVTR 408

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CY-FLPETFNRFPSMAFYFED 276
             +  Y    + F        LAQ ++ P    +     CY  L     R P+++FYF  
Sbjct: 409 LPTLAYQAFRDAF--------LAQTANLPRATGVAIFDTCYDLLGFVSVRVPTVSFYFSG 460

Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
              L +   N  I   +   F  A AP    ++++G+ QQ   +  +D
Sbjct: 461 GPILTLPARNFLIPMDDAGTFCFAFAPSTSGLSILGNIQQEGIQISFD 508


>gi|115479237|ref|NP_001063212.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|50725896|dbj|BAD33424.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|50726136|dbj|BAD33657.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113631445|dbj|BAF25126.1| Os09g0423500 [Oryza sativa Japonica Group]
 gi|215767614|dbj|BAG99842.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222641596|gb|EEE69728.1| hypothetical protein OsJ_29412 [Oryza sativa Japonica Group]
          Length = 473

 Score =  104 bits (259), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 159/368 (43%), Gaps = 65/368 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G       +I+DT S L +               +FDP  S S+  + C+   C   + 
Sbjct: 131 VGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQV 190

Query: 52  VN----------EQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                       EQ  C YT+ Y D S ++G  AH+ +S+ G+     +  G +FGC   
Sbjct: 191 ATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTS 245

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G  +G++GL R  +S ISQ        FSYCL  PL   E +S  L  G 
Sbjct: 246 NQG-----PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESE-SSGSLVLGD 297

Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           D    R ST    T  ++ P    FY+++L  I+I  + +           S  G  I+D
Sbjct: 298 DTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE----------SSAGKVIVD 347

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
           SG+++T     VY  +  +F+S F  +  A     P    +  C+ L      + PS+ F
Sbjct: 348 SGTIITSLVPSVYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNLTGFREVQIPSLKF 402

Query: 273 YFE-DANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
            FE +  + +D   V + +  ++    LA+A    +   ++IG+ QQ++ R ++D     
Sbjct: 403 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQ 462

Query: 329 LSFVKENC 336
           + F +E C
Sbjct: 463 IGFAQETC 470


>gi|326520291|dbj|BAK07404.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 464

 Score =  104 bits (259), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 96/380 (25%), Positives = 162/380 (42%), Gaps = 57/380 (15%)

Query: 1   MVRLFIGTP-SKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP  + V+L LDTGS L++              +F    S +F ++ C  P C
Sbjct: 95  LIHLGIGTPRPQRVVLHLDTGSDLVWTQCACTVCFDQPVPVFRASVSHTFSRVPCSDPLC 154

Query: 47  TYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIG--KGEGKAIFHGALFGC 96
            +            +  C Y   Y D S+T G  A +T +     + +  A      FGC
Sbjct: 155 GHAVYLPLSGCAARDRSCFYAYGYMDHSITTGKMAEDTFTFKAPDRADTAAAVPNIRFGC 214

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
              N+G     +    +G+ G     +S  SQL     +RFSYC        E   S + 
Sbjct: 215 GMMNYGLFTPNQ----SGIAGFGTGPLSLPSQLKV---RRFSYCFTA---MEESRVSPVI 264

Query: 157 FGTD----MGYRRPSTQATKFINHPNN-------FYYLSLKDISIDNERMNFPPDTFDIT 205
            G +      +     Q+T F   P         FY+LSL+ +++   R+ F   TF + 
Sbjct: 265 LGGEPENIEAHATGPIQSTPFAPGPAGAPVGSQPFYFLSLRGVTVGETRLPFNASTFALK 324

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ET 263
             G GG  IDSG+ +T+F   V+  L E FV+      +A+    P+ + LC+ +P  + 
Sbjct: 325 GDGSGGTFIDSGTAITFFPQAVFRSLREAFVAQVP-LPVAKGYTDPDNL-LCFSVPAKKK 382

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYEN-------HFFLLAVAPHDDLVALIGSQQQR 316
               P +  + E A+  +  EN +++D ++          ++ ++  +    +IG+ QQ+
Sbjct: 383 APAVPKLILHLEGADWELPREN-YVLDNDDDGSGAGRKLCVVILSAGNSNGTIIGNFQQQ 441

Query: 317 DTRFVYDLNIDLLSFVKENC 336
           +   VYDL  + + F    C
Sbjct: 442 NMHIVYDLESNKMVFAPARC 461


>gi|326532334|dbj|BAK05096.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 437

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 89/328 (27%), Positives = 146/328 (44%), Gaps = 19/328 (5%)

Query: 26  AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGK 82
           A+FD  +S  ++ +    P CT  Y   V  +C  YT  +       G+   +  +  G 
Sbjct: 110 AVFDSAESPRYKHMKATDPMCTPPYTPSVGNRCSFYTTTW--NVAAHGYLGSDMFAFAGT 167

Query: 83  GEG--KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFS 138
           G G         +FGC++   G  E    G LAG L LSR  +SF+SQL +  +   RFS
Sbjct: 168 GAGGHSTDVDQLIFGCAHTTDGL-ERLSHGVLAGALSLSRHPMSFLSQLTARGLADSRFS 226

Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNER-M 195
           YCL     +      +L+FG D+     +   +     P +   Y++ +  IS++  R M
Sbjct: 227 YCLFPEQSHPIAKHGFLRFGRDIPRHDHAHSTSLLFTGPGSGGMYHIRVVGISLNGRRIM 286

Query: 196 NFPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
              P  F   + +  GG ++D G+ LT      Y  +  + V+  ++    +     +  
Sbjct: 287 RLQPAMFTRNLQTRRGGSVVDPGTPLTRLVRQAYDIVEAEVVANMQKQGARRAKAQVQGH 346

Query: 255 QLCYFLPETFNRFPSMA--FYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
           +LC F+       PS+    Y + A L I  E +F            V P D+ + ++G+
Sbjct: 347 RLC-FVSWGHVHLPSLTINMYEDTAKLFIKPELLFR-KVTARLLCFTVMP-DEEMTVLGA 403

Query: 313 QQQRDTRFVYDLNIDLLSFVKENCSDDS 340
            QQ DTRF +DL+ + L F +ENC+ D+
Sbjct: 404 AQQMDTRFTFDLHANRLYFAQENCNADT 431


>gi|326503794|dbj|BAK02683.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 91/362 (25%), Positives = 152/362 (41%), Gaps = 61/362 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------AIFDPRKSSSFQKINCDHPDCTYFK- 50
           ++ + IG+P+    +++DTGS + +          +FDP KS+++   +C    C     
Sbjct: 130 VITVGIGSPAVTQTMMIDTGSDVSWVRCNSTDGLTLFDPSKSTTYAPFSCSSAACAQLGN 189

Query: 51  ----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C N  C Y ++Y D S T G  + +T++ +   +    FH   FGCS+    FD +
Sbjct: 190 NGDGCSNSGCQYRVQYGDGSNTTGTYSSDTLA-LSASDTVTDFH---FGCSHHEEDFDGE 245

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
             DG    ++GL     S +SQ  +   K FSYC    LP    TS +L FG       P
Sbjct: 246 KIDG----LMGLGGDAQSLVSQTAATYGKSFSYC----LPPTNRTSGFLTFGA------P 291

Query: 167 STQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           +  +  F+  P          Y + L+DIS+    +   P           G ++DSG+V
Sbjct: 292 NGTSGGFVTTPMLRWPKAPTLYGVLLQDISVGGTPLGIQPSVL------SNGSVMDSGTV 345

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYFE 275
           +T+     Y  L   F S   R +  + +    P+ +   CY      N   P+++   +
Sbjct: 346 ITWLPRRAYSALSSAFRSSMTRLRHQRAA----PLGILDTCYDFTGLVNVSIPAVSLVLD 401

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             A + +DG  + I D        A    D   ++IG+ QQR    ++D+   +  F   
Sbjct: 402 GGAVVDLDGNGIMIQD----CLAFAATSGD---SIIGNVQQRTFEVLHDVGQGVFGFRSG 454

Query: 335 NC 336
            C
Sbjct: 455 AC 456


>gi|255554715|ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 489

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 98/371 (26%), Positives = 160/371 (43%), Gaps = 48/371 (12%)

Query: 2   VRLFIGTPS-KGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCD 42
           V + IGTP  +  +L+ DTGS L +                   +F    SSSF+ I C 
Sbjct: 121 VSIRIGTPRPQKFILVTDTGSDLTWMNCEYWCKSCPKPNPHPGRVFRANDSSSFRTIPCS 180

Query: 43  HPDCT-----YFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
             DC      YF        N  C++  +Y +     G  A+ET++V      K      
Sbjct: 181 SDDCKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDV 240

Query: 93  LFGCS---NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
           L GC+   N+ +GF +        GV+GL     S   +L  I   +FSYCLV  L +  
Sbjct: 241 LIGCTESFNETNGFPD--------GVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSN 292

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
           +  ++L FG     + P  Q T+ +  + N FY +++  IS+    ++   D +++T  G
Sbjct: 293 H-KNFLSFGDIPEMKLPKMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVT--G 349

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
            GG I+DSG+ LT    + Y K+ +     F++ +     + PE    C F  + F+R  
Sbjct: 350 VGGMIVDSGTSLTMLAGEAYDKVVDALKPIFDKHKKVVPIELPELNNFC-FEDKGFDRAA 408

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLN 325
            P +  +F D  +       +IID       L +   D    +++G+  Q++  + YDL 
Sbjct: 409 VPRLLIHFADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYDLG 468

Query: 326 IDLLSFVKENC 336
              L F   +C
Sbjct: 469 RGKLGFGPSSC 479


>gi|116786826|gb|ABK24255.1| unknown [Picea sitchensis]
 gi|148906052|gb|ABR16185.1| unknown [Picea sitchensis]
          Length = 485

 Score =  104 bits (259), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 93/358 (25%), Positives = 152/358 (42%), Gaps = 44/358 (12%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTY 48
           R+ +G P +  L++LDTGS + +               I++P  SSS++ + C    C  
Sbjct: 148 RIGVGAPRRDQLMVLDTGSDVTWIQCEPCSDCYQQSDPIYNPALSSSYKLVGCQANLCQQ 207

Query: 49  FKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                   N  C+Y + Y D S T+G  A ET+++     G A       GC +DN G  
Sbjct: 208 LDVSGCSRNGSCLYQVSYGDGSYTQGNFATETLTL-----GGAPLQNVAIGCGHDNEGLF 262

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT----D 160
             A              ++SF SQL     K FSYCLV        +SS L+FG     +
Sbjct: 263 VGAAGLLGL-----GGGSLSFPSQLTDENGKIFSYCLV---DRDSESSSTLQFGRAAVPN 314

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                P  + ++     + FYY+SL  IS+  + ++     F I  SG GG I+DSG+ +
Sbjct: 315 GAVLAPMLKNSRL----DTFYYVSLSGISVGGKMLSISDSVFGIDASGNGGVIVDSGTAV 370

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DAN 278
           T   +  Y  L + F +  +       +D       CY L    +   P++ F+F    +
Sbjct: 371 TRLQTAAYDSLRDAFRAGTKNL---PSTDGVSLFDTCYDLSSKESVDVPTVVFHFSGGGS 427

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  +N  +       F  A AP    ++++G+ QQ+  R  +D   + + F    C
Sbjct: 428 MSLPAKNYLVPVDSMGTFCFAFAPTSSSLSIVGNIQQQGIRVSFDRANNQVGFAVNKC 485


>gi|449527149|ref|XP_004170575.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 487

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 106/364 (29%), Positives = 161/364 (44%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           + ++ +G P K   L+ DTGS + +                  IFDP+ SSS+  ++C+ 
Sbjct: 149 LAQIGVGQPVKLFYLVPDTGSDVTWLQCQPCASENTCYKQFDPIFDPKSSSSYSPLSCNS 208

Query: 44  PDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
             C       C ++ C+Y + Y D S T G  A ET+S    G   +I +  + GC +DN
Sbjct: 209 QQCKLLDKANCNSDTCIYQVHYGDGSFTTGELATETLSF---GNSNSIPNLPI-GCGHDN 264

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGT 159
            G                    IS  SQL +     FSYCLV    N +  +SS L+F +
Sbjct: 265 EGLFAGGAGLIGL-----GGGAISLSSQLKA---SSFSYCLV----NLDSDSSSTLEFNS 312

Query: 160 DMGYRRPSTQATKFINHPNNFY---YLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            M    PS   T  +   + F+   Y+ +  IS+  + +   P  F+I  SG GG I+DS
Sbjct: 313 YM----PSDSLTSPLVKNDRFHSYRYVKVVGISVGGKTLPISPTRFEIDESGLGGIIVDS 368

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFY 273
           G++++   SDVY  L E FV        + LS  P       CY F  ++    P++AF 
Sbjct: 369 GTIISRLPSDVYESLREAFVKL-----TSSLSPAPGISVFDTCYNFSGQSNVEVPTIAFV 423

Query: 274 F-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
             E  +LR+   N  I+      + LA       +++IGS QQ+  R  YDL   ++ F 
Sbjct: 424 LSEGTSLRLPARNYLIMLDTAGTYCLAFIKTKSSLSIIGSFQQQGIRVSYDLTNSIVGFS 483

Query: 333 KENC 336
              C
Sbjct: 484 TNKC 487


>gi|125563764|gb|EAZ09144.1| hypothetical protein OsI_31414 [Oryza sativa Indica Group]
          Length = 472

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 98/368 (26%), Positives = 159/368 (43%), Gaps = 65/368 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G       +I+DT S L +               +FDP  S S+  + C+   C   + 
Sbjct: 130 VGLGGGEATVIVDTASELTWVQCAPCASCHDQQGPLFDPASSPSYAVLPCNSSSCDALQV 189

Query: 52  VN----------EQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                       EQ  C YT+ Y D S ++G  AH+ +S+ G+     +  G +FGC   
Sbjct: 190 ATGSAAGACGGGEQPSCSYTLSYRDGSYSQGVLAHDKLSLAGE-----VIDGFVFGCGTS 244

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G  +G++GL R  +S ISQ        FSYCL  PL   E +S  L  G 
Sbjct: 245 NQG-----PFGGTSGLMGLGRSQLSLISQTMDQFGGVFSYCL--PLKESE-SSGSLVLGD 296

Query: 160 DMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           D    R ST    T  ++ P    FY+++L  I+I  + +           S  G  I+D
Sbjct: 297 DTSVYRNSTPIVYTTMVSDPVQGPFYFVNLTGITIGGQEVE----------SSAGKVIVD 346

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
           SG+++T     VY  +  +F+S F  +  A     P    +  C+ L      + PS+ F
Sbjct: 347 SGTIITSLVPSVYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNLTGFREVQIPSLKF 401

Query: 273 YFE-DANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
            FE +  + +D   V + +  ++    LA+A    +   ++IG+ QQ++ R ++D     
Sbjct: 402 VFEGNVEVEVDSSGVLYFVSSDSSQVCLALASLKSEYETSIIGNYQQKNLRVIFDTLGSQ 461

Query: 329 LSFVKENC 336
           + F +E C
Sbjct: 462 IGFAQETC 469


>gi|224090744|ref|XP_002309070.1| predicted protein [Populus trichocarpa]
 gi|222855046|gb|EEE92593.1| predicted protein [Populus trichocarpa]
          Length = 404

 Score =  103 bits (258), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 149/323 (46%), Gaps = 54/323 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
           +V L +GTP + V +++DTGS L +            FDP +S+S+Q I C  P CT   
Sbjct: 32  IVSLTVGTPPQNVSMVIDTGSELSWLHCNKTLSYPTTFDPTRSTSYQTIPCSSPTCTNRT 91

Query: 49  ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND-- 99
                   C  N  C  T+ YAD S + G  A +   +     G +   G +FGC +   
Sbjct: 92  QDFPIPASCDSNNLCHATLSYADASSSDGNLASDVFHI-----GSSDISGLVFGCMDSVF 146

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
           +   DED++     G++G++R ++SF+SQLG     +FSYC+     +G   S  L  G 
Sbjct: 147 SSNSDEDSKS---TGLMGMNRGSLSFVSQLG---FPKFSYCI-----SGTDFSGLLLLGE 195

Query: 159 TDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           +++ +  P        I+ P  +     Y + L+ I + ++ +  P  TF+   +G G  
Sbjct: 196 SNLTWSVPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVLDKLLPIPKSTFEPDHTGAGQT 255

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETF 264
           ++DSG+  T+    VY  L   F++  +   + ++ + P+      + LCY +P      
Sbjct: 256 MVDSGTQFTFLLGPVYNALRSAFLN--QTSSVLRVLEDPDFVFQGAMDLCYLVPLSQRVL 313

Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
              P++   F  A + + G+ V 
Sbjct: 314 PLLPTVTLVFRGAEMTVSGDRVL 336


>gi|356522504|ref|XP_003529886.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 473

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 151/361 (41%), Gaps = 44/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            RL +GTP K + ++LDTGS +++               IFDP KS SF  I C  P C 
Sbjct: 132 TRLGVGTPPKYLYMVLDTGSDVVWLQCKPCTKCYSQTDQIFDPSKSKSFAGIPCYSPLCR 191

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                     N  C Y + Y D S T G  + ET++       +A       GC +DN G
Sbjct: 192 RLDSPGCSLKNNLCQYQVSYGDGSFTFGDFSTETLTF-----RRAAVPRVAIGCGHDNEG 246

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF +Q G+    +FSYCL     + + +S  + FG    
Sbjct: 247 LFVGAAGLLGL-----GRGGLSFPTQTGTRFNNKFSYCLTDRTASAKPSS--IVFGDSAV 299

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSV 219
            R  + + T  + +P  + FYY+ L  IS+    +       F +  +G GG IIDSG+ 
Sbjct: 300 SR--TARFTPLVKNPKLDTFYYVELLGISVGGAPVRGISASFFRLDSTGNGGVIIDSGTS 357

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN-RFPSMAFYFED 276
           +T      Y  L + F     R   + L   PE      CY L      + P++  +F  
Sbjct: 358 VTRLTRPAYVSLRDAF-----RVGASHLKRAPEFSLFDTCYDLSGLSEVKVPTVVLHFRG 412

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  +    +  F  A A     +++IG+ QQ+  R V+DL    + F    C
Sbjct: 413 ADVSLPAANYLVPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVVFDLAGSRVGFAPRGC 472

Query: 337 S 337
           +
Sbjct: 473 A 473


>gi|300681506|emb|CBH32600.1| pepsin A, putative, expressed [Triticum aestivum]
          Length = 477

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 102/393 (25%), Positives = 153/393 (38%), Gaps = 72/393 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQK 38
           VR  +GTP++  LL+ DTGS L +                         F P  S ++  
Sbjct: 99  VRFRVGTPAQPFLLVADTGSDLTWVKCRRPASANSSLSPADSGPGPGRAFRPEDSRTWAP 158

Query: 39  INCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAI 88
           I+C    CT                C Y  +Y D S  +G    E  TI++ G+ E KA 
Sbjct: 159 ISCASDTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGREERKAK 218

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
             G + GCS+   G   +A D    GVL L    ISF S   S    RFSYCLV  L + 
Sbjct: 219 LKGLVLGCSSSYTGPSFEASD----GVLSLGYSGISFASHAASRFGGRFSYCLVDHL-SP 273

Query: 149 EYTSSYLKFGTDMGYRRP------------STQATKFI--NHPNNFYYLSLKDISIDNER 194
              +SYL FG +     P              + T  +       FY +SLK IS+  E 
Sbjct: 274 RNATSYLTFGPNPAVSSPRASPSSCAAAAPRARQTPLLLDRRMRPFYDVSLKAISVAGEF 333

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL---HEKFVSYFERFQLAQLSDCP 251
           +  P   +D+     GG I+DSG+ LT      Y  +     K ++   R  +       
Sbjct: 334 LKIPRAVWDVEAG--GGVILDSGTSLTVLAKPAYRAVVAALSKGLAGLPRVTM------- 384

Query: 252 EPIQLCYFLPETFNR-----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHD 304
           +P + CY       +      P MA +F  A         ++ID       + +   P  
Sbjct: 385 DPFEYCYNWTSPSGKDADVAVPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWP 444

Query: 305 DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             +++IG+  Q++  + +D+    L F +  C+
Sbjct: 445 G-ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 476


>gi|414881704|tpg|DAA58835.1| TPA: hypothetical protein ZEAMMB73_701358 [Zea mays]
          Length = 485

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 101/370 (27%), Positives = 155/370 (41%), Gaps = 50/370 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
            ++ +GTP+   L++LDTGS +++               +FDPR+SSS+  + C    C 
Sbjct: 131 TKIGVGTPATQALMVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCR 190

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                        C+Y + Y D SVT G    ET++  G   G  +   AL GC +DN G
Sbjct: 191 RLDSGGCDLRRGACMYQVAYGDGSVTAGDFVTETLTFAG---GARVARVAL-GCGHDNEG 246

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV------IPLPNGEYTSSYLK 156
               A            R  +SF +Q+     + FSYCLV           G + SS + 
Sbjct: 247 LFVAAAGLLGL-----GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVS 301

Query: 157 FGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGG 211
           FG        S   T  + +P    FYY+ L  IS+   R+    ++ D+ +   +G GG
Sbjct: 302 FGAGS-VGASSASFTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGG 359

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRF 267
            I+DSG+ +T      Y  L + F +         L   P    L   CY L      + 
Sbjct: 360 VIVDSGTSVTRLARASYSALRDAFRAA----AAGGLRLSPGGFSLFDTCYDLGGRRVVKV 415

Query: 268 PSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
           P+++ +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R V+D + 
Sbjct: 416 PTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDG 475

Query: 327 DLLSFVKENC 336
             + F  + C
Sbjct: 476 QRVGFAPKGC 485


>gi|357481199|ref|XP_003610885.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512220|gb|AES93843.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 416

 Score =  103 bits (258), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 99/361 (27%), Positives = 159/361 (44%), Gaps = 39/361 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ ++IGTP   +  ++DTGS LI+               +FDP KSS++  I+CD P C
Sbjct: 69  LMEIYIGTPPIKITGLVDTGSDLIWIQCAPCLGCYKQIKPMFDPLKSSTYNNISCDSPLC 128

Query: 47  ----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
               T      ++C YT  Y D S+TKG  A +T +              LFGC ++N G
Sbjct: 129 HKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKPVSLSRFLFGCGHNNTG 188

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
              D       G++GL     S ISQ+G +   K+FS CLV P       SS + FG   
Sbjct: 189 GFNDHE----MGLIGLGGGPTSLISQIGPLFGGKKFSQCLV-PFLTDIKISSRMSFGKGS 243

Query: 162 GYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                    T  +    +  Y+++L  IS+  E   FP +    +  G+   ++DSG+  
Sbjct: 244 QVLGNGVVTTPLVPREKDTSYFVTLLGISV--EDTYFPMN----STIGKANMLVDSGTPP 297

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANL 279
                 +Y K+   F     +  L  ++D P    QLCY   +T  + P++ F+F  AN+
Sbjct: 298 ILLPQQLYDKV---FAEVRNKVALKPITDDPSLGTQLCYRT-QTNLKGPTLTFHFVGANV 353

Query: 280 RIDGENVFI--IDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +     FI         F LA+    +    + G+  Q +    +DL+  ++SF   +C
Sbjct: 354 LLTPIQTFIPPTPQTKGIFCLAIYNRTNSDPGVYGNFAQSNYLIGFDLDRQVVSFKPTDC 413

Query: 337 S 337
           +
Sbjct: 414 T 414


>gi|297795137|ref|XP_002865453.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297311288|gb|EFH41712.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 100/373 (26%), Positives = 153/373 (41%), Gaps = 67/373 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  SSS++ + C+ PDC 
Sbjct: 82  TRLWIGTPPQEFALIVDTGSTVTYVPCSTCKQCGKHQDPKFQPELSSSYKALKCN-PDCN 140

Query: 48  YFKCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E   CVY  +YA+ S + G  + + IS     E +     A+FGC N   G   
Sbjct: 141 ---CDDEGKLCVYERRYAEMSSSSGVLSEDLISF--GNESQLTPQRAVFGCENVETG--- 192

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D       G++GL R  +S + QL    +I+  FS C         Y    +  G  M  
Sbjct: 193 DLFSQRADGIMGLGRGKLSVVDQLVDKGVIEDVFSLC---------YGGMEVGGGA-MVL 242

Query: 164 RRPSTQATKFINHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            + S  A    +H + F    Y + LK + +  + +   P  F+    G+ G ++DSG+ 
Sbjct: 243 GKISPPAGMVFSHSDPFRSPYYNIDLKQMHVAGKSLKLNPKVFN----GKHGTVLDSGTT 298

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYF 274
             YF  + +  + +  +      +     D P    +C+      + E  N FP +   F
Sbjct: 299 YAYFPKEAFIAIKDAIIKEIPSLKRIHGPD-PNYDDVCFSGAGRDVAEIHNFFPEIDMEF 357

Query: 275 EDANLRIDGENVFIIDYENHFF---------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            +           I+  EN+ F          L + P  D   L+G    R+T   YD  
Sbjct: 358 GNGQ-------KLILSPENYLFRHTKVRGAYCLGIFPDRDSTTLLGGIVVRNTLVTYDRE 410

Query: 326 IDLLSFVKENCSD 338
            D L F+K NCSD
Sbjct: 411 NDKLGFLKTNCSD 423


>gi|297736090|emb|CBI24128.3| unnamed protein product [Vitis vinifera]
          Length = 378

 Score =  103 bits (258), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)

Query: 6   IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
           +GTPS+  +L+ DTGS L +                        +F    SSSF+ I C 
Sbjct: 18  VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 77

Query: 43  HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
              C       F   N       C Y  +Y+D S   GF A+ET++V  K   K   H  
Sbjct: 78  TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 137

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           L GCS    G    A D    GV+GL     SF  +       +FSYCLV  L + +  S
Sbjct: 138 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 192

Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           +YL FG+           T     +   N+FY +++  ISI    +  P + +D  V G 
Sbjct: 193 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 250

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
           GG I+DSGS LT+     Y  +         +F+  ++     P++ C F    F     
Sbjct: 251 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 307

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
           P + F+F D          ++I   +    L  V+      +++G+  Q++  + +DL +
Sbjct: 308 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 367

Query: 327 DLLSFVKENCS 337
             L F   +C+
Sbjct: 368 KKLGFAPSSCT 378


>gi|357132618|ref|XP_003567926.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 468

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 146/375 (38%), Gaps = 52/375 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
           VRL +GTP++  +L+ DTGS L +                    +F P  S S+  + CD
Sbjct: 106 VRLRVGTPAQPFVLVADTGSDLTWVKCSSPSSSSSSPAASPPQRVFRPAGSKSWSPLPCD 165

Query: 43  HPDCTY---FKCVN-----EQCVYTMKYADQSVTKGFAA--HETISVIGK-GEGKAIFHG 91
              C     F   N     + C Y  +Y D S  +G       T+S+ G  G  KA    
Sbjct: 166 SDTCKSYVPFSLANCSSPPDPCSYDYRYKDNSSARGVVGLDSATVSLSGNDGTRKAKLQE 225

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
            + GC+    G    + DG    VL L    ISF S+  S    RFSYCLV  L     T
Sbjct: 226 VVLGCTTSYDGQSFKSSDG----VLSLGNSNISFASRAASRFGGRFSYCLVDHLAPRNAT 281

Query: 152 SSYLKFGTDMGY--------RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
            S+L FG             R P          P  FY++S+  +++  ER+   PD +D
Sbjct: 282 -SFLTFGNGDSSPGDDSSSRRTPLVLLEDARTRP--FYFVSVDAVTVAGERLEILPDVWD 338

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
                 GG I+DSG+ LT   +  Y  + +     F       +    +P + CY     
Sbjct: 339 F--RKNGGAILDSGTSLTILATPAYDAVVKAISKQFAGVPRVNM----DPFEYCYNWTGV 392

Query: 264 FNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVY 322
               P M   F  A         ++ID       + V       V++IG+  Q++  + +
Sbjct: 393 SAEIPRMELRFAGAATLAPPGKSYVIDTAPGVKCIGVVEGAWPGVSVIGNILQQEHLWEF 452

Query: 323 DLNIDLLSFVKENCS 337
           DL    L F +  C+
Sbjct: 453 DLANRWLRFKQSRCA 467


>gi|302817380|ref|XP_002990366.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
 gi|300141928|gb|EFJ08635.1| hypothetical protein SELMODRAFT_43971 [Selaginella moellendorffii]
          Length = 420

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 53/363 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP++ V ++ DTGS + +               IF+P  SSSF+ + C    C 
Sbjct: 83  ARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSICG 142

Query: 48  YFK---CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             K   C  + +C+Y + Y D S T G  + ET+S      G+        GC  +N G 
Sbjct: 143 KLKIKGCSRKNECMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNNQGL 197

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTDMG 162
              A            R  +SF SQ G+     FSYCL    P  E   ++ L FG    
Sbjct: 198 FHGAAGLLGL-----GRGPLSFPSQTGTSYASVFSYCL----PRRESAIAASLVFGPSAV 248

Query: 163 YRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             +   + TK +  PN     +YY+ L  I +    +N PPD F +   G GG I+DSG+
Sbjct: 249 PEK--ARFTKLL--PNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGT 304

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYF 274
            ++   +  Y  L + F S      L      P  I L   CY L        P++   F
Sbjct: 305 AISRLTTPAYTALRDAFRS------LVTFPSAPG-ISLFDTCYDLSSMKTATLPAVVLDF 357

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           +  A++ +  + + +   +   + LA AP ++  ++IG+ QQ+  R   D   + +    
Sbjct: 358 DGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAP 417

Query: 334 ENC 336
           + C
Sbjct: 418 DQC 420


>gi|212274713|ref|NP_001130791.1| uncharacterized protein LOC100191895 precursor [Zea mays]
 gi|194690124|gb|ACF79146.1| unknown [Zea mays]
 gi|194708040|gb|ACF88104.1| unknown [Zea mays]
 gi|223950469|gb|ACN29318.1| unknown [Zea mays]
 gi|414885521|tpg|DAA61535.1| TPA: hypothetical protein ZEAMMB73_650724 [Zea mays]
          Length = 500

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 95/375 (25%), Positives = 158/375 (42%), Gaps = 68/375 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +G       +I+DT S L +               +FDP  S S+  + CD P C   + 
Sbjct: 147 VGLGGGEATVIVDTASELTWVQCAPCESCHDQQGPLFDPSSSPSYAAVPCDSPSCDALQQ 206

Query: 51  ------------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                       C       C Y + Y D S ++G  AH+ +S+ G+     +  G +FG
Sbjct: 207 QLATGAGAGAPPCDAGRPAACSYALSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVFG 261

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C   N G       G  +G++GL R  +S +SQ        FSYCL  PL      S  L
Sbjct: 262 CGTSNQG----PPFGGTSGLMGLGRSQLSLVSQTVDQFGGVFSYCL--PLSRESDASGSL 315

Query: 156 KFGTDMGYRRPSTQA--TKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
             G D    R ST    T  +++ +      FY ++L  I++  + +         +   
Sbjct: 316 VLGDDPSAYRNSTPVVYTSMVSNSDPLLQGPFYLVNLTGITVGGQEVE--------STGF 367

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN- 265
               I+DSG+V+T     VY  +  +F+S     QLA+    P    +  C+ +      
Sbjct: 368 SARAIVDSGTVITSLVPSVYNAVRAEFMS-----QLAEYPQAPGFSILDTCFNMTGLKEV 422

Query: 266 RFPSMAFYFED-ANLRID-GENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFV 321
           + PS+   F+  A + +D G  ++ +  ++    LAVA    +D  ++IG+ QQ++ R V
Sbjct: 423 QVPSLTLVFDGGAEVEVDSGGVLYFVSSDSSQVCLAVASLKSEDETSIIGNYQQKNLRVV 482

Query: 322 YDLNIDLLSFVKENC 336
           +D +   + F +E C
Sbjct: 483 FDTSASQVGFAQETC 497


>gi|15226315|ref|NP_180368.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4510415|gb|AAD21501.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252975|gb|AEC08069.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 396

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 101/358 (28%), Positives = 159/358 (44%), Gaps = 52/358 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   +  I+DTGS + +               IFDP KSS+F++  CD    
Sbjct: 66  LMKLQVGTPPFEIQAIIDTGSEITWTQCLPCVHCYEQNAPIFDPSKSSTFKEKRCD---- 121

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + Y D + T G  A ETI++        +    + GC ++N  F   
Sbjct: 122 ------GHSCPYEVDYFDHTYTMGTLATETITLHSTSGEPFVMPETIIGCGHNNSWFKP- 174

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---MGY 163
               + +G++GL+    S I+Q+G       SYC      +G+ TS  + FG +    G 
Sbjct: 175 ----SFSGMVGLNWGPSSLITQMGGEYPGLMSYCF-----SGQGTSK-INFGANAIVAGD 224

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              ST        P  FYYL+L  +S+ N R+     TF      EG  +IDSG+ LTYF
Sbjct: 225 GVVSTTMFMTTAKP-GFYYLNLDAVSVGNTRIETMGTTFHAL---EGNIVIDSGTTLTYF 280

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-NLRID 282
               Y  L  + V +       + +D      LCY   +T + FP +  +F    +L +D
Sbjct: 281 PVS-YCNLVRQAVEHV--VTAVRAADPTGNDMLCYN-SDTIDIFPVITMHFSGGVDLVLD 336

Query: 283 GENVFIIDYENHFFLLAV---APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             N+++       F LA+   +P  +  A+ G++ Q +    YD +  L+SF   NCS
Sbjct: 337 KYNMYMESNNGGVFCLAIICNSPTQE--AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 392


>gi|47777372|gb|AAT38006.1| unknow protein [Oryza sativa Japonica Group]
          Length = 475

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP+   L++LDTGS +++               +FDPR+S S+  ++C  P C     
Sbjct: 128 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 187

Query: 52  VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C+Y + Y D SVT G  A ET++   +G   A       GC +DN G    
Sbjct: 188 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 242

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
               A +G+LGL R  +SF SQ+     + FSYCLV     +      SS + FG     
Sbjct: 243 ----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
                  T    +P    FYY+ L   S+   R+     + D+ +   +G GG I+DSG+
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 357

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
            +T     VY  + + F     R     L   P    L   CY L      + P+++ + 
Sbjct: 358 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 412

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
              A++ +  EN  I    +  F  A+A  D  V++IG+ QQ+  R V+D +   + FV 
Sbjct: 413 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 472

Query: 334 ENC 336
           ++C
Sbjct: 473 KSC 475


>gi|312282359|dbj|BAJ34045.1| unnamed protein product [Thellungiella halophila]
          Length = 484

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/362 (27%), Positives = 152/362 (41%), Gaps = 42/362 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +RL +GTP+  + ++LDTGS +++               +F+P KS +F  + C    C 
Sbjct: 138 MRLGVGTPATNMYMVLDTGSDVVWLQCSPCKVCYNQSDPVFNPAKSKTFATVPCGSRLCR 197

Query: 48  YF----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 +CV+ +   C+Y + Y D S T G  + ET++      G  + H AL GC +DN
Sbjct: 198 RLDDSSECVSRRSKACLYQVSYGDGSFTVGDFSTETLTF----HGARVDHVAL-GCGHDN 252

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A            R  +SF SQ  +    +FSYCLV    +G  +         
Sbjct: 253 EGLFVGAAGLLGL-----GRGGLSFPSQTKNRYNGKFSYCLVDRTSSGSSSKPPSTIVFG 307

Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSG 217
            G    +   T  + +P  + FYYL L  IS+   R+       F +  +G GG IIDSG
Sbjct: 308 NGAVPKTAVFTPLLTNPKLDTFYYLQLLGISVGGSRVPGVSESQFKLDATGNGGVIIDSG 367

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPE-TFNRFPSMAFYF 274
           + +T      Y  L + F     R    +L   P       C+ L   T  + P++ F+F
Sbjct: 368 TSVTRLTQSAYVALRDAF-----RLGATRLKRAPSYSLFDTCFDLSGMTTVKVPTVVFHF 422

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
               + +   N  I       F  A A     +++IG+ QQ+  R  YDL    + F+  
Sbjct: 423 TGGEVSLPASNYLIPVNNQGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRVGFLSR 482

Query: 335 NC 336
            C
Sbjct: 483 AC 484


>gi|356526294|ref|XP_003531753.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 414

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 106/369 (28%), Positives = 161/369 (43%), Gaps = 62/369 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G  SK + +I+DTGS L +               IF P  SSS+Q ++C+   C   + 
Sbjct: 69  MGLGSKNMTVIIDTGSDLTWVQCEPCMSCYNQQGPIFKPSTSSSYQSVSCNSSTCQSLQF 128

Query: 52  VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                          C Y + Y D S T G    E +S      G       +FGC  +N
Sbjct: 129 ATGNTGACGSSNPSTCNYVVNYGDGSYTNGELGVEALSF-----GGVSVSDFVFGCGRNN 183

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLKFGT 159
            G       G ++G++GL R  +S +SQ  +     FSYCL    P  E  SS  L  G 
Sbjct: 184 KGLF-----GGVSGLMGLGRSYLSLVSQTNATFGGVFSYCL----PTTEAGSSGSLVMGN 234

Query: 160 DMGYRRPSTQAT--KFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           +    + +   T  + +++P  +NFY L+L  I +    +   P +F     G GG +ID
Sbjct: 235 ESSVFKNANPITYTRMLSNPQLSNFYILNLTGIDVGGVALK-APLSF-----GNGGILID 288

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAF 272
           SG+V+T   S VY  L  +F+  F  F  A     P    +  C+ L        P+++ 
Sbjct: 289 SGTVITRLPSSVYKALKAEFLKKFTGFPSA-----PGFSILDTCFNLTGYDEVSIPTISL 343

Query: 273 YFE-DANLRIDGENVF-IIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDL 328
            FE +A L +D    F ++  +     LA+A   D    A+IG+ QQR+ R +YD     
Sbjct: 344 RFEGNAQLNVDATGTFYVVKEDASQVCLALASLSDAYDTAIIGNYQQRNQRVIYDTKQSK 403

Query: 329 LSFVKENCS 337
           + F +E CS
Sbjct: 404 VGFAEEPCS 412


>gi|115465771|ref|NP_001056485.1| Os05g0590000 [Oryza sativa Japonica Group]
 gi|49328116|gb|AAT58814.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113580036|dbj|BAF18399.1| Os05g0590000 [Oryza sativa Japonica Group]
          Length = 481

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 103/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP+   L++LDTGS +++               +FDPR+S S+  ++C  P C     
Sbjct: 134 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 193

Query: 52  VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C+Y + Y D SVT G  A ET++   +G   A       GC +DN G    
Sbjct: 194 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 248

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
               A +G+LGL R  +SF SQ+     + FSYCLV     +      SS + FG     
Sbjct: 249 ----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 304

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
                  T    +P    FYY+ L   S+   R+     + D+ +   +G GG I+DSG+
Sbjct: 305 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 363

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
            +T     VY  + + F     R     L   P    L   CY L      + P+++ + 
Sbjct: 364 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 418

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
              A++ +  EN  I    +  F  A+A  D  V++IG+ QQ+  R V+D +   + FV 
Sbjct: 419 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 478

Query: 334 ENC 336
           ++C
Sbjct: 479 KSC 481


>gi|302795261|ref|XP_002979394.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
 gi|300153162|gb|EFJ19802.1| hypothetical protein SELMODRAFT_53966 [Selaginella moellendorffii]
          Length = 353

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/363 (26%), Positives = 154/363 (42%), Gaps = 53/363 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP++ V ++ DTGS + +               IF+P  SSSF+ + C    C 
Sbjct: 16  ARIGVGTPARSVYMVADTGSDVSWLQCSPCRKCYRQQDPIFNPSLSSSFKPLACASSICG 75

Query: 48  YFK---CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             K   C  + +C+Y + Y D S T G  + ET+S      G+        GC  +N G 
Sbjct: 76  KLKIKGCSRKNKCMYQVSYGDGSFTVGDFSTETLSF-----GEHAVRSVAMGCGRNNQGL 130

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-SSYLKFGTDMG 162
              A            R  +SF SQ G+     FSYCL    P  E   ++ L FG    
Sbjct: 131 FHGAAGLLGL-----GRGPLSFPSQTGTSYASVFSYCL----PRRESAIAASLVFGPSAV 181

Query: 163 YRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             +   + TK +  PN     +YY+ L  I +    +N PPD F +   G GG I+DSG+
Sbjct: 182 PEK--ARFTKLL--PNRRLDTYYYVGLARIRVAGSPVNIPPDAFAMGSRGTGGVIVDSGT 237

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLPETFN-RFPSMAFYF 274
            ++   +  Y  L + F S      L      P  I L   CY L        P++   F
Sbjct: 238 AISRLTTPAYTALRDAFRS------LVTFPSAPG-ISLFDTCYDLSSMKTATLPAVVLDF 290

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           +  A++ +  + + +   +   + LA AP ++  ++IG+ QQ+  R   D   + +    
Sbjct: 291 DGGASMPLPADGILVNVDDEGTYCLAFAPEEEAFSIIGNVQQQTFRISIDNQKEQMGIAP 350

Query: 334 ENC 336
           + C
Sbjct: 351 DQC 353


>gi|168008816|ref|XP_001757102.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691600|gb|EDQ77961.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 406

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 94/363 (25%), Positives = 154/363 (42%), Gaps = 43/363 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           +R+ +GTP + + L++DTGS +++              AIFDP KSS++  + C    C 
Sbjct: 60  IRISVGTPPRRMYLVMDTGSDILWLQCAPCVNCYHQSDAIFDPYKSSTYSTLGCSTRQCL 119

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGALFGCSNDNHGF 103
                 C   +C+Y + Y D S T G    + +S+    G G+ + +    GC +DN G+
Sbjct: 120 NLDIGTCQANKCLYQVDYGDGSFTTGEFGTDDVSLNSTSGVGQVVLNKIPLGCGHDNEGY 179

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            +  +SF +Q+      RFSYCL     +    SS +      G 
Sbjct: 180 FVGAAGLLGL-----GKGPLSFPNQVDPQNGGRFSYCLTDRETDSTEGSSLV-----FGE 229

Query: 164 RRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                   +F    +N     FYYL +  IS+    +  P   F +   G GG IIDSG+
Sbjct: 230 AAVPPAGARFTPQDSNMRVPTFYYLKMTGISVGGTILTIPTSAFQLDSLGNGGVIIDSGT 289

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFE 275
            +T   +  Y  L + F     R   + L+          CY L    +   P++  +F+
Sbjct: 290 SVTRLQNAAYASLRDAF-----RAGTSDLAPTAGFSLFDTCYDLSGLASVDVPTVTLHFQ 344

Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
              +L++   N  I    ++ F LA A      ++IG+ QQ+  R +YD   + + FV  
Sbjct: 345 GGTDLKLPASNYLIPVDNSNTFCLAFAGTTG-PSIIGNIQQQGFRVIYDNLHNQVGFVPS 403

Query: 335 NCS 337
            C+
Sbjct: 404 QCN 406


>gi|359483137|ref|XP_002272278.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 402

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 86/294 (29%), Positives = 128/294 (43%), Gaps = 38/294 (12%)

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           C Y + Y D S T+G   HE +       G  +    +FGC  +N G       G ++G+
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 182

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-TKFI 174
           +GL R  +S ISQ   I    FSYCL  P    + + S +  G    YR  S  +  K I
Sbjct: 183 MGLGRSDLSLISQTSGIFGGVFSYCL--PSTERKGSGSLILGGNSSVYRNSSPISYAKMI 240

Query: 175 NHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
            +P   NFY+++L  ISI    +  P         G    ++DSG+V+T     +Y  L 
Sbjct: 241 ENPQLYNFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALK 293

Query: 233 EKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
            +F+  F  F        P P    +  C+ L        P++  +FE +A L +D   V
Sbjct: 294 AEFLKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGV 346

Query: 287 FII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           F     D       LA   + D VA++G+ QQ++ R +YD     + F  E CS
Sbjct: 347 FYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 400


>gi|356495752|ref|XP_003516737.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 396

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 165/360 (45%), Gaps = 37/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   V  ++DTGS L++A              +F+P +S+++  I CD  +C
Sbjct: 51  LMKLTLGTPPVDVYGLVDTGSDLVWAQCTPCQGCYRQKSPMFEPLRSNTYTPIPCDSEEC 110

Query: 47  TYF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  ++ C Y+  YAD SVTKG  A ET++         +    +FGC + N G
Sbjct: 111 NSLFGHSCSPQKLCAYSYAYADSSVTKGVLARETVTFSSTDGEPVVVGDIVFGCGHSNSG 170

Query: 103 -FDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            F+E+                +S +SQ G++   KRFS CLV P     +T   + FG  
Sbjct: 171 TFNENDMGIIGL-----GGGPLSLVSQFGNLYGSKRFSQCLV-PFHADPHTLGTISFGDA 224

Query: 161 MGYRRPSTQATKFINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                    AT  ++      YL +L+ IS+ +  ++F        +  +G  +IDSG+ 
Sbjct: 225 SDVSGEGVAATPLVSEEGQTPYLVTLEGISVGDTFVSFNSS----EMLSKGNIMIDSGTP 280

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDAN 278
            TY   + Y +L ++      +  +  + D P+   QLCY   ET    P +  +FE A+
Sbjct: 281 ATYLPQEFYDRLVKELKV---QSNMLPIDDDPDLGTQLCY-RSETNLEGPILIAHFEGAD 336

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           +++     FI   ++  F  A+A   D   + G+  Q +    +DL+   +SF   +CS+
Sbjct: 337 VQLMPIQTFIPP-KDGVFCFAMAGTTDGEYIFGNFAQSNVLIGFDLDRKTVSFKATDCSN 395


>gi|115467014|ref|NP_001057106.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|51091210|dbj|BAD35903.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
 gi|113595146|dbj|BAF19020.1| Os06g0209100 [Oryza sativa Japonica Group]
 gi|125554496|gb|EAZ00102.1| hypothetical protein OsI_22105 [Oryza sativa Indica Group]
          Length = 454

 Score =  103 bits (256), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 98/380 (25%), Positives = 154/380 (40%), Gaps = 57/380 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ + +GTP + V L LDTGS L++                + DP  SS+   + CD P 
Sbjct: 91  LMHVSVGTPPRPVALTLDTGSDLVWTQCAPCLDCFEQGAAPVLDPAASSTHAALPCDAPL 150

Query: 46  C---TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGK-GEGKAIFHGALFGC 96
           C    +  C      +  CVY   Y D+S+T G  A ++ +  G    G        FGC
Sbjct: 151 CRALPFTSCGGRSWGDRSCVYVYHYGDRSLTVGQLATDSFTFGGDDNAGGLAARRVTFGC 210

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            + N G  +        G+ G  R   S  SQL       FSYC          +SS + 
Sbjct: 211 GHINKGIFQANE----TGIAGFGRGRWSLPSQLN---VTSFSYCFTSMFDT--KSSSVVT 261

Query: 157 FG--------TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV 206
            G        T         + T+ I +P+  + Y++ L+ IS+   R+  P      + 
Sbjct: 262 LGAAAAELLHTHHAAHTGDVRTTRLIKNPSQPSLYFVPLRGISVGGARVAVPESRLRSST 321

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETF 264
                 IIDSG+ +T    DVY  +  +FVS   +  L   +     + LC+ LP    +
Sbjct: 322 ------IIDSGASITTLPEDVYEAVKAEFVS---QVGLPAAAAGSAALDLCFALPVAALW 372

Query: 265 NR--FPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
            R   P++  + +  A+  +   N    DY      + +        +IG+ QQ++T  V
Sbjct: 373 RRPAVPALTLHLDGGADWELPRGNYVFEDYAARVLCVVLDAAAGEQVVIGNYQQQNTHVV 432

Query: 322 YDLNIDLLSFVKENCSDDSA 341
           YDL  D+LSF    C   +A
Sbjct: 433 YDLENDVLSFAPARCDKLAA 452


>gi|296085498|emb|CBI29230.3| unnamed protein product [Vitis vinifera]
          Length = 542

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/344 (28%), Positives = 147/344 (42%), Gaps = 53/344 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L+IGTP   V+ I+DTGS L +               +FDP+ SS+++  +C    C
Sbjct: 93  LMNLYIGTPPVPVIAIVDTGSDLTWTQCRPCTHCYKQVVPLFDPKNSSTYRDSSCGTSFC 152

Query: 47  TYF----KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C  E+ C +   YAD S T G  A ET++V         F G  FGC + + 
Sbjct: 153 LALGKDRSCSKEKKCTFRYSYADGSFTGGNLASETLTVDSTAGKPVSFPGFAFGCGHSSG 212

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G FD+ +     +G++GL    +S ISQL S I   FSYCL +P+      SS + FG  
Sbjct: 213 GIFDKSS-----SGIVGLGGGELSLISQLKSTINGLFSYCL-LPVSTDSSISSRINFGAS 266

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                  T +T           L  K  S   E               EG  I+DSG+  
Sbjct: 267 GRVSGYGTVSTPL--------RLPYKGYSKKTEVE-------------EGNIIVDSGTTY 305

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
           T+   + Y KL +   S     +  ++ D      LCY      N  P +  +F+DAN+ 
Sbjct: 306 TFLPQEFYSKLEK---SVANSIKGKRVRDPNGIFSLCYNTTAEINA-PIITAHFKDANVE 361

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
           +   N F+   E+      VAP  D + ++G+  Q +    +DL
Sbjct: 362 LQPLNTFMRMQED-LVCFTVAPTSD-IGVLGNLAQVNFLVGFDL 403



 Score = 43.9 bits (102), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 35/129 (27%), Positives = 57/129 (44%), Gaps = 5/129 (3%)

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
           EG  I+DSG+  TY   + Y KL E   S     +  ++ D      LCY         P
Sbjct: 417 EGNIIVDSGTTYTYLPLEFYVKLEE---SVAHSIKGKRVRDPNGISSLCYNTTVDQIDAP 473

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
            +  +F+DAN+ +   N F+   E+      V P  D + ++G+  Q +    +DL    
Sbjct: 474 IITAHFKDANVELQPWNTFLRMQED-LVCFTVLPTSD-IGILGNLAQVNFLVGFDLRKKR 531

Query: 329 LSFVKENCS 337
           +SF   +C+
Sbjct: 532 VSFKAADCT 540


>gi|147814824|emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]
          Length = 449

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)

Query: 6   IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
           +GTPS+  +L+ DTGS L +                        +F    SSSF+ I C 
Sbjct: 89  VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 148

Query: 43  HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
              C       F   N       C Y  +Y+D S   GF A+ET++V  K   K   H  
Sbjct: 149 TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           L GCS    G    A D    GV+GL     SF  +       +FSYCLV  L + +  S
Sbjct: 209 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 263

Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           +YL FG+           T     +   N+FY +++  ISI    +  P + +D  V G 
Sbjct: 264 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 321

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
           GG I+DSGS LT+     Y  +         +F+  ++     P++ C F    F     
Sbjct: 322 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 378

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
           P + F+F D          ++I   +    L  V+      +++G+  Q++  + +DL +
Sbjct: 379 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 438

Query: 327 DLLSFVKENCS 337
             L F   +C+
Sbjct: 439 KKLGFAPSSCT 449


>gi|357521081|ref|XP_003630829.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355524851|gb|AET05305.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 526

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 96/352 (27%), Positives = 152/352 (43%), Gaps = 54/352 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V++ +G P +   +I D  +   +              +IFDP +SSS+  ++C+   C
Sbjct: 188 LVQIGVGGPPQKFYMIFDLQTDFTWLQCQPCIKCYDQPDSIFDPSQSSSYTLLSCETKHC 247

Query: 47  TYF---KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C ++  C Y + Y D + T+G   +ET+S     E          GCSN N G
Sbjct: 248 NLLPNSSCSDDGYCRYNITYKDGTNTEGVLINETVSF----ESSGWVDRVSLGCSNKNQG 303

Query: 103 --FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
                D       G  GL R ++SF S++ +      SYCLV       Y+SS L+F + 
Sbjct: 304 PFVGSD-------GTFGLGRGSLSFPSRINA---SSMSYCLV--ESKDGYSSSTLEFNSP 351

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                  +   K + +P   N YY+ LK I +  E+++ P  TF I   G GG I+ S S
Sbjct: 352 ---PCSGSVKAKLLQNPKAENLYYVGLKGIKVGGEKIDVPNSTFTIDPYGNGGMIVSSSS 408

Query: 219 VLTYFHSDVYWKLHEKFVS---YFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF 274
           ++T   +D Y  + + FV+   + ER +     D       CY L        P + F  
Sbjct: 409 LITMLENDTYNVVRDAFVAKTQHLERLKAFLQFD------TCYNLSSNNTVELPILEFEV 462

Query: 275 EDAN--LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
            D    L      ++ +D +N  F  A AP     +++G+ QQ  TR  +DL
Sbjct: 463 NDGKSWLLPKESYLYAVD-KNGTFCFAFAPSKGSFSILGTLQQYGTRVTFDL 513


>gi|115448349|ref|NP_001047954.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|45735841|dbj|BAD12876.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|45735967|dbj|BAD12996.1| putative 41 kD chloroplast nucleoid DNA binding protein (CND41)
           [Oryza sativa Japonica Group]
 gi|113537485|dbj|BAF09868.1| Os02g0720600 [Oryza sativa Japonica Group]
 gi|125583492|gb|EAZ24423.1| hypothetical protein OsJ_08176 [Oryza sativa Japonica Group]
 gi|215740989|dbj|BAG97484.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 463

 Score =  103 bits (256), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 50/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGS----------------ALIYAIFDPRKSSSFQKINCDHP 44
           ++ + +GTP+    + +DTGS                A   A+FDP KSS+++ ++C   
Sbjct: 128 VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCYAQTGALFDPAKSSTYRAVSCAAA 187

Query: 45  DCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           +C   +         N +C Y ++Y D S T G  + +T+++ G  +    F    FGCS
Sbjct: 188 ECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ---FGCS 244

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
           +   GF  D  D    G++GL     S +SQ  +     FSYCL    P    +      
Sbjct: 245 HVESGF-SDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLP---PTSGSSGFLTLG 296

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G        +T+  +    P  FY   L+DI++  +++   P  F        G ++DSG
Sbjct: 297 GGGGVSGFVTTRMLRSRQIP-TFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSG 349

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
           +++T      Y  L   F +  ++++ A        +  C+ F  +T    P++A  F  
Sbjct: 350 TIITRLPPTAYSALSSAFKAGMKQYRSAPARSI---LDTCFDFAGQTQISIPTVALVFSG 406

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A + +D   +    Y N     A    D    +IG+ QQR    +YD+    L F    
Sbjct: 407 GAAIDLDPNGIM---YGNCLAFAATG-DDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462

Query: 336 C 336
           C
Sbjct: 463 C 463


>gi|125540928|gb|EAY87323.1| hypothetical protein OsI_08727 [Oryza sativa Indica Group]
          Length = 463

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 150/361 (41%), Gaps = 50/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGS----------------ALIYAIFDPRKSSSFQKINCDHP 44
           ++ + +GTP+    + +DTGS                A   A+FDP KSS+++ ++C   
Sbjct: 128 VISVGLGTPAVTQTVTIDTGSDVSWVQCNPCPNPPCHAQTGALFDPAKSSTYRAVSCAAA 187

Query: 45  DCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           +C   +         N +C Y ++Y D S T G  + +T+++ G  +    F    FGCS
Sbjct: 188 ECAQLEQQGNGCGATNYECQYGVQYGDGSTTNGTYSRDTLTLSGASDAVKGFQ---FGCS 244

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
           +   GF  D  D    G++GL     S +SQ  +     FSYCL    P    +      
Sbjct: 245 HLESGF-SDQTD----GLMGLGGGAQSLVSQTAAAYGNSFSYCLP---PTSGSSGFLTLG 296

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G        +T+  +    P  FY   L+DI++  +++   P  F        G ++DSG
Sbjct: 297 GGGGASGFVTTRMLRSKQIP-TFYGARLQDIAVGGKQLGLSPSVF------AAGSVVDSG 349

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE- 275
           +++T      Y  L   F +  ++++ A        +  C+ F  +T    P++A  F  
Sbjct: 350 TIITRLPPTAYSALSSAFKAGMKQYRSAPARSI---LDTCFDFAGQTQISIPTVALVFSG 406

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A + +D   +    Y N     A    D    +IG+ QQR    +YD+    L F    
Sbjct: 407 GAAIDLDPNGIM---YGNCLAFAATG-DDGTTGIIGNVQQRTFEVLYDVGSSTLGFRSGA 462

Query: 336 C 336
           C
Sbjct: 463 C 463


>gi|125564143|gb|EAZ09523.1| hypothetical protein OsI_31798 [Oryza sativa Indica Group]
          Length = 485

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP+K   +I DTGS L +               +FDP  SS++  + C  P+C
Sbjct: 150 VVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPEC 209

Query: 47  TYFKC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                     + +C Y ++Y DQS T G    +T+++           G +FGC + N G
Sbjct: 210 QELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTL----SASDTLPGFVFGCGDQNAG 265

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G + G+ GL R  +S  SQ        F+YC    LP+      YL  G   G
Sbjct: 266 L-----FGQVDGLFGLGREKVSLPSQGAPSYGPGFTYC----LPSSSSGRGYLSLG---G 313

Query: 163 YRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               + Q T   +    +FYY+ L  I +    +  P   F          +IDSG+V+T
Sbjct: 314 APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVIT 369

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANL 279
                 Y  L   F     +++ A        +  CY F      + P++   F   A +
Sbjct: 370 RLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDFTGHRTAQIPTVELAFAGGATV 426

Query: 280 RIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +D   V  +   +    LA AP+  D  +A++G+ QQ+     YD+    + F  + CS
Sbjct: 427 SLDFTGVLYVSKVSQ-ACLAFAPNADDSSIAILGNTQQKTFAVTYDVANQRIGFGAKGCS 485


>gi|115479815|ref|NP_001063501.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|50725878|dbj|BAD33407.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50725881|dbj|BAD33410.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113631734|dbj|BAF25415.1| Os09g0482200 [Oryza sativa Japonica Group]
 gi|125606112|gb|EAZ45148.1| hypothetical protein OsJ_29786 [Oryza sativa Japonica Group]
          Length = 485

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/360 (25%), Positives = 147/360 (40%), Gaps = 47/360 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V + +GTP+K   +I DTGS L +               +FDP  SS++  + C  P+C
Sbjct: 150 VVSVGLGTPAKQYAVIFDTGSDLSWVQCKPCADCYEQQDPLFDPSLSSTYAAVACGAPEC 209

Query: 47  TYFKC----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                     + +C Y ++Y DQS T G    +T+++           G +FGC + N G
Sbjct: 210 QELDASGCSSDSRCRYEVQYGDQSQTDGNLVRDTLTL----SASDTLPGFVFGCGDQNAG 265

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G + G+ GL R  +S  SQ        F+YC    LP+      YL  G   G
Sbjct: 266 L-----FGQVDGLFGLGREKVSLPSQGAPSYGPGFTYC----LPSSSSGRGYLSLG---G 313

Query: 163 YRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               + Q T   +    +FYY+ L  I +    +  P   F          +IDSG+V+T
Sbjct: 314 APPANAQFTALADGATPSFYYIDLVGIKVGGRAIRIPATAFAAAGG----TVIDSGTVIT 369

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANL 279
                 Y  L   F     +++ A        +  CY F      + P++   F   A +
Sbjct: 370 RLPPRAYAPLRAAFARSMAQYKKAPALSI---LDTCYDFTGHRTAQIPTVELAFAGGATV 426

Query: 280 RIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +D   V  +   +    LA AP+  D  +A++G+ QQ+     YD+    + F  + CS
Sbjct: 427 SLDFTGVLYVSKVSQ-ACLAFAPNADDSSIAILGNTQQKTFAVAYDVANQRIGFGAKGCS 485


>gi|359494621|ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 449

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 152/371 (40%), Gaps = 49/371 (13%)

Query: 6   IGTPSKGVLLILDTGSALIYA-----------------------IFDPRKSSSFQKINCD 42
           +GTPS+  +L+ DTGS L +                        +F    SSSF+ I C 
Sbjct: 89  VGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCL 148

Query: 43  HPDCT-----YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
              C       F   N       C Y  +Y+D S   GF A+ET++V  K   K   H  
Sbjct: 149 TDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNV 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           L GCS    G    A D    GV+GL     SF  +       +FSYCLV  L + +  S
Sbjct: 209 LIGCSESFQGQSFQAAD----GVMGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSH-KNVS 263

Query: 153 SYLKFGTDMGYR---RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
           +YL FG+           T     +   N+FY +++  ISI    +  P + +D  V G 
Sbjct: 264 NYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKIPSEVWD--VKGA 321

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--F 267
           GG I+DSGS LT+     Y  +         +F+  ++     P++ C F    F     
Sbjct: 322 GGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDI--GPLEYC-FNSTGFEESLV 378

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLA-VAPHDDLVALIGSQQQRDTRFVYDLNI 326
           P + F+F D          ++I   +    L  V+      +++G+  Q++  + +DL +
Sbjct: 379 PRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHLWEFDLGL 438

Query: 327 DLLSFVKENCS 337
             L F   +C+
Sbjct: 439 KKLGFAPSSCT 449


>gi|115476830|ref|NP_001062011.1| Os08g0469100 [Oryza sativa Japonica Group]
 gi|42407408|dbj|BAD09566.1| putative nucleoid DNA-binding protein [Oryza sativa Japonica Group]
 gi|113623980|dbj|BAF23925.1| Os08g0469100 [Oryza sativa Japonica Group]
          Length = 373

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 155/361 (42%), Gaps = 57/361 (15%)

Query: 15  LILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDC-----TYFKC 51
           LI+DTGS LI+                   ++DP +SS+F  + C    C     ++  C
Sbjct: 28  LIVDTGSDLIWTQCKLSSSTAAAARHGSPPVYDPGESSTFAFLPCSDRLCQEGQFSFKNC 87

Query: 52  VNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDG 110
            ++ +CVY   Y   +   G  A ET +    G  +A+     FGC   + G    A   
Sbjct: 88  TSKNRCVYEDVYGSAAAV-GVLASETFTF---GARRAVSLRLGFGCGALSAGSLIGA--- 140

Query: 111 ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST-- 168
              G+LGLS  ++S I+QL     +RFSYCL    P  +  +S L FG      R  T  
Sbjct: 141 --TGILGLSPESLSLITQLK---IQRFSYCLT---PFADKKTSPLLFGAMADLSRHKTTR 192

Query: 169 --QATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             Q T  +++P    +YY+ L  IS+ ++R+  P  +  +   G GG I+DSGS + Y  
Sbjct: 193 PIQTTAIVSNPVETVYYYVPLVGISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLV 252

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
              +  + E   +  +  +L   +   E  +LC+ LP         A       L  DG 
Sbjct: 253 EAAFEAVKE---AVMDVVRLPVANRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGG 309

Query: 285 NVFIIDYENHF-------FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
              ++  +N+F         LAV    D   V++IG+ QQ++   ++D+     SF    
Sbjct: 310 AAMVLPRDNYFQEPRAGLMCLAVGKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQ 369

Query: 336 C 336
           C
Sbjct: 370 C 370


>gi|356504173|ref|XP_003520873.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 461

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 152/360 (42%), Gaps = 44/360 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP++ V ++LDTGS +++               +FDP KS ++  I C  P C 
Sbjct: 120 TRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQTDHVFDPTKSRTYAGIPCGAPLCR 179

Query: 48  YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                 C N+   C Y + Y D S T G  + ET++         +   AL GC +DN G
Sbjct: 180 RLDSPGCSNKNKVCQYQVSYGDGSFTFGDFSTETLTF----RRNRVTRVAL-GCGHDNEG 234

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF  Q G     +FSYCLV    + + +S  + FG    
Sbjct: 235 LFTGAAGLLGL-----GRGRLSFPVQTGRRFNHKFSYCLVDRSASAKPSS--VIFGDSAV 287

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSV 219
            R  +   T  I +P  + FYYL L  IS+    +       F +  +G GG IIDSG+ 
Sbjct: 288 SR--TAHFTPLIKNPKLDTFYYLELLGISVGGAPVRGLSASLFRLDAAGNGGVIIDSGTS 345

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPE-TFNRFPSMAFYFED 276
           +T      Y  L + F     R   + L   PE      C+ L   T  + P++  +F  
Sbjct: 346 VTRLTRPAYIALRDAF-----RIGASHLKRAPEFSLFDTCFDLSGLTEVKVPTVVLHFRG 400

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++ +   N  I    +  F  A A     +++IG+ QQ+  R  YDL    + F    C
Sbjct: 401 ADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRISYDLTGSRVGFAPRGC 460


>gi|255537017|ref|XP_002509575.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223549474|gb|EEF50962.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 459

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 101/361 (27%), Positives = 150/361 (41%), Gaps = 42/361 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK---------------INCDHPD 45
           +V   IG P      ++DTGS+L +   +P  +   QK                + D  D
Sbjct: 111 LVNFSIGQPPVPQYAVMDTGSSLTWIQCEPCINCHQQKGPLYNPSSSSTYVSCSDFDRTD 170

Query: 46  CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            T+       C Y+  YAD++ T+G  A E +      +G  I H  +FGC ++N     
Sbjct: 171 TTFTATHGSDCNYSQTYADKTTTRGTYAREQLLFETPDDGITIMHDVIFGCGHNNTQL-- 228

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM---G 162
               G  +GV GL     S IS+LG      FSYC +  + +  Y    L  G  +   G
Sbjct: 229 PGPTGYASGVFGLGDSGSSIISKLGF----GFSYC-IGNIGDPLYGFHRLTLGNKLKIEG 283

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD-ITVSG-EGGCIIDSGSVL 220
           Y  P          P   YY++L  ISI  ER++  P  F  + ++G     +IDSG+ L
Sbjct: 284 YSTPLV--------PRGLYYITLVGISIGQERLDIDPIVFQRVDLNGISSRIVIDSGATL 335

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-A 277
           +Y     Y  + +K  S    F L++       + LCY   L +    FP   F+  D A
Sbjct: 336 SYIPRQAYNVVRDKVSSILSGF-LSRYRYIARHLSLCYIGKLNQDLQGFPDATFHLADGA 394

Query: 278 NLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           +L    E +F   Y ++   LA+ P   D+   LIG   Q+     YDL    L F +  
Sbjct: 395 DLVFQVEGLF-FQYTDNVLCLALVPTESDEETCLIGLLAQQYYNVAYDLKQQKLYFQRIE 453

Query: 336 C 336
           C
Sbjct: 454 C 454


>gi|125553531|gb|EAY99240.1| hypothetical protein OsI_21202 [Oryza sativa Indica Group]
          Length = 475

 Score =  102 bits (255), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 102/363 (28%), Positives = 160/363 (44%), Gaps = 47/363 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP+   L++LDTGS +++               +FDPR+S S+  ++C  P C     
Sbjct: 128 VGTPATTALMVLDTGSDVVWLQCAPCRHCYAQSGRVFDPRRSRSYAAVDCVAPICRRLDS 187

Query: 52  VN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C+Y + Y D SVT G  A ET++   +G   A       GC +DN G    
Sbjct: 188 AGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI- 242

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGTDMGY 163
               A +G+LGL R  +SF +Q+     + FSYCLV     +      SS + FG     
Sbjct: 243 ----AASGLLGLGRGRLSFPTQIARSFGRSFSYCLVDRTSSVRPSSTRSSTVTFGAGAVA 298

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGS 218
                  T    +P    FYY+ L   S+   R+     + D+ +   +G GG I+DSG+
Sbjct: 299 AAAGASFTPMGRNPRMATFYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGT 357

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYF 274
            +T     VY  + + F     R     L   P    L   CY L      + P+++ + 
Sbjct: 358 SVTRLARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHL 412

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
              A++ +  EN  I    +  F  A+A  D  V++IG+ QQ+  R V+D +   + FV 
Sbjct: 413 AGGASVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVP 472

Query: 334 ENC 336
           ++C
Sbjct: 473 KSC 475


>gi|326491519|dbj|BAJ94237.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524456|dbj|BAK00611.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 151/361 (41%), Gaps = 50/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP KSS++  ++C    
Sbjct: 164 VVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKEPLFDPAKSSTYANVSCTDSA 223

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C       C    C+Y ++Y D S T GF A +T+++           G  FGC   N+G
Sbjct: 224 CADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIA-----HDAIKGFRFGCGEKNNG 278

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG++GL R   S   Q  +     F+YC    LP     + YL FG   G
Sbjct: 279 L-----FGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYC----LPALTTGTGYLDFGP--G 327

Query: 163 YRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               + + T  + +    FYY+ +  I +  +++      F        G ++DSG+V+T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVIT 382

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQ-LSDCP--EPIQLCY-FLPETFNRFPSMAFYFE-D 276
              +  Y  L     S F++  LA+     P    +  CY F   +    P+++  F+  
Sbjct: 383 RLPATAYTALS----SAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGG 438

Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           A L +D    V+ I         A    D+ VA++G+ QQ+    +YDL    + F   +
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498

Query: 336 C 336
           C
Sbjct: 499 C 499


>gi|326520736|dbj|BAJ92731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 499

 Score =  102 bits (254), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 151/361 (41%), Gaps = 50/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP KSS++  ++C    
Sbjct: 164 VVTVGLGTPASKYTVVFDTGSDTTWVQCRPCVVKCYKQKGPLFDPAKSSTYANVSCTDSA 223

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C       C    C+Y ++Y D S T GF A +T+++           G  FGC   N+G
Sbjct: 224 CADLDTNGCTGGHCLYAVQYGDGSYTVGFFAQDTLTIA-----HDAIKGFRFGCGEKNNG 278

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG++GL R   S   Q  +     F+YC    LP     + YL FG   G
Sbjct: 279 L-----FGKTAGLMGLGRGKTSLTVQAYNKYGGAFAYC----LPALTTGTGYLDFGP--G 327

Query: 163 YRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               + + T  + +    FYY+ +  I +  +++      F        G ++DSG+V+T
Sbjct: 328 SAGNNARLTPMLTDKGQTFYYVGMTGIRVGGQQVPVAESVFSTA-----GTLVDSGTVIT 382

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQ-LSDCP--EPIQLCY-FLPETFNRFPSMAFYFE-D 276
              +  Y  L     S F++  LA+     P    +  CY F   +    P+++  F+  
Sbjct: 383 RLPATAYTALS----SAFDKVMLARGYKKAPGYSILDTCYDFTGLSDVELPTVSLVFQGG 438

Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           A L +D    V+ I         A    D+ VA++G+ QQ+    +YDL    + F   +
Sbjct: 439 ACLDVDVSGIVYAISEAQVCLAFASNGDDESVAIVGNTQQKTYGVLYDLGKKTVGFAPGS 498

Query: 336 C 336
           C
Sbjct: 499 C 499


>gi|224091849|ref|XP_002309371.1| predicted protein [Populus trichocarpa]
 gi|222855347|gb|EEE92894.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 101/360 (28%), Positives = 156/360 (43%), Gaps = 53/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V   +G P    L I+DTGS+L++                +FDP  SS++  ++C +  
Sbjct: 103 LVNFSMGQPPVPQLAIMDTGSSLLWIQCAPCKSCSQQIIGPMFDPSISSTYDSLSCKNII 162

Query: 46  CTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           C Y    +C  + QCVY   Y +   + G  A E +      EG+   +  LFGCS+ N 
Sbjct: 163 CRYAPSGECDSSSQCVYNQTYVEGLPSVGVIATEQLIFGSSDEGRNAVNNVLFGCSHRNG 222

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTD 160
            +    +D    GV GL     S ++Q+GS    +FSYC+  I  P+  Y    L  G +
Sbjct: 223 NY----KDRRFTGVFGLGSGITSVVNQMGS----KFSYCIGNIADPDYSYNQLVLSEGVN 274

Query: 161 M-GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           M GY  P       ++  +  Y + L+ IS+   R+   P  F  T   +   IIDSG+ 
Sbjct: 275 MEGYSTP-------LDVVDGHYQVILEGISVGETRLVIDPSAFKRT-EKQRRVIIDSGTA 326

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYF-ED 276
            T+   + Y  L  +  +  +RF    L+       LCY   + +    FP++ F+F E 
Sbjct: 327 PTWLAENEYRALEREVRNLLDRF----LTPFMRESFLCYKGKVGQDLVGFPAVTFHFAEG 382

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A+L +D E      Y   F   +V         IG   Q+     YDLN   L F + +C
Sbjct: 383 ADLVVDTEMRQASVYGKDFKDFSV---------IGLMAQQYYNVAYDLNKHKLFFQRIDC 433


>gi|242084330|ref|XP_002442590.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
 gi|241943283|gb|EES16428.1| hypothetical protein SORBIDRAFT_08g022570 [Sorghum bicolor]
          Length = 494

 Score =  102 bits (254), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 104/374 (27%), Positives = 157/374 (41%), Gaps = 52/374 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           M ++ +GTP+   LL LDT S L +               +FDPR S+S+ ++N D PDC
Sbjct: 135 MAKIAVGTPAVQALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDC 194

Query: 47  TYFK------CVNEQCVYTMKYAD----QSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                          C+YT++Y D     S + G    ET++  G G  +A       GC
Sbjct: 195 QALGRSGGGDAKRGTCIYTVQYGDGHGSTSTSVGDLVEETLTFAG-GVRQAYLS---IGC 250

Query: 97  SNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFSYCLVIPLPNGEYTSSY 154
            +DN G F   A     AG+LGL R  IS   Q+  +     FSYCLV  +      SS 
Sbjct: 251 GHDNKGLFGAPA-----AGILGLGRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPSST 305

Query: 155 LKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV-----S 207
           L FG       P    T  + + N   FYY+ L  +S+   R+   P   +  +     +
Sbjct: 306 LTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRV---PGVTERDLQLDPYT 362

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEPI-QLCYFLPETFN 265
           G GG I+DSG+ +T      Y  +  +         L Q+S   P  +   CY +     
Sbjct: 363 GRGGVILDSGTTVTRLARPAY--VAFRDAFRAAATSLGQVSTGGPSGLFDTCYTVGGRAG 420

Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
            + P+++ +F     + +  +N  I +D             D  V++IG+  Q+  R VY
Sbjct: 421 VKVPAVSMHFAGGVEVSLQPKNYLIPVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRVVY 480

Query: 323 DLNIDLLSFVKENC 336
           DL    + F   NC
Sbjct: 481 DLAGQRVGFAPNNC 494


>gi|79315693|ref|NP_001030891.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332646353|gb|AEE79874.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 499

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 103/346 (29%), Positives = 152/346 (43%), Gaps = 32/346 (9%)

Query: 4   LFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYA 63
           + +G+P K   LILDTGS L +           Q + C   DC + +  N+ C Y   Y 
Sbjct: 174 VLVGSPPKHFSLILDTGSDLNW----------IQCLPC--YDC-FQQNDNQSCPYYYWYG 220

Query: 64  DQSVTKGFAAHETISV--IGKGEGKAIFH--GALFGCSNDNHGFDEDARDGALAGVLGLS 119
           D S T G  A ET +V     G    +++    +FGC + N G    A            
Sbjct: 221 DSSNTTGDFAVETFTVNLTTNGGSSELYNVENMMFGCGHWNRGLFHGAAGLLGL-----G 275

Query: 120 RVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY-RRPSTQATKFINHPN 178
           R  +SF SQL S+    FSYCLV    +    SS L FG D      P+   T F+    
Sbjct: 276 RGPLSFSSQLQSLYGHSFSYCLV-DRNSDTNVSSKLIFGEDKDLLSHPNLNFTSFVAGKE 334

Query: 179 N----FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
           N    FYY+ +K I +  E +N P +T++I+  G GG IIDSG+ L+YF    Y  +  K
Sbjct: 335 NLVDTFYYVQIKSILVAGEVLNIPEETWNISSDGAGGTIIDSGTTLSYFAEPAYEFIKNK 394

Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED-ANLRIDGENVFIIDYE 292
            ++   + +     D P  +  C+ +    N + P +   F D A      EN FI   E
Sbjct: 395 -IAEKAKGKYPVYRDFP-ILDPCFNVSGIHNVQLPELGIAFADGAVWNFPTENSFIWLNE 452

Query: 293 NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           +   L  +       ++IG+ QQ++   +YD     L +    C+D
Sbjct: 453 DLVCLAMLGTPKSAFSIIGNYQQQNFHILYDTKRSRLGYAPTKCAD 498


>gi|224067042|ref|XP_002302336.1| predicted protein [Populus trichocarpa]
 gi|222844062|gb|EEE81609.1| predicted protein [Populus trichocarpa]
          Length = 438

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 92/359 (25%), Positives = 154/359 (42%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
           +VR+ +GTP + + ++LDT +   +             F P  S++   ++C    C+  
Sbjct: 99  VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSGAQCSQV 158

Query: 49  --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             F C    +  C++   Y   S        + I++        +  G  FGC N   G 
Sbjct: 159 RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITL-----ANDVIPGFTFGCINAVSGG 213

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +     G+LGL R  IS ISQ G++    FSYCL  P     Y S  LK G  +G 
Sbjct: 214 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 265

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P+  + YY++L  +S+   ++  P +      +   G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 324

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY+ + ++F     R Q+            C F        P++  +FE  NL +
Sbjct: 325 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAITLHFEGLNLVL 378

Query: 282 DGENVFIIDYENHFFLL--AVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         L  A AP+  + ++ +I + QQ++ R ++D     L   +E C
Sbjct: 379 PMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|242066142|ref|XP_002454360.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
 gi|241934191|gb|EES07336.1| hypothetical protein SORBIDRAFT_04g029400 [Sorghum bicolor]
          Length = 466

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 149/360 (41%), Gaps = 51/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P K   +++DTGS + +               +FDP  SS++   +C    C
Sbjct: 134 LITVRLGSPGKSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCSSAAC 193

Query: 47  TYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                    C + QC YT+ Y D S T G  + +T+++     G        FGCSN   
Sbjct: 194 AQLGQEGNGCSSSQCQYTVTYGDGSSTTGTYSSDTLAL-----GSNAVRKFQFGCSNVES 248

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           GF++        G++GL     S +SQ        FSYC    LP    +S +L  G   
Sbjct: 249 GFNDQTD-----GLMGLGGGAQSLVSQTAGTFGAAFSYC----LPATSSSSGFLTLGAGT 299

Query: 162 G--YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               + P  ++++       FY + ++ I +   +++ P   F        G I+DSG+V
Sbjct: 300 SGFVKTPMLRSSQV----PTFYGVRIQAIRVGGRQLSIPTSVF------SAGTIMDSGTV 349

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN 278
           LT      Y  L   F +  +++  A  S     +  C+ F  ++    P++A  F    
Sbjct: 350 LTRLPPTAYSALSSAFKAGMKQYPSAPPSGI---LDTCFDFSGQSSVSIPTVALVFSGGA 406

Query: 279 LRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +     +  ++   N    LA A +  D  + +IG+ QQR    +YD+    + F    C
Sbjct: 407 VVDIASDGIMLQTSNSILCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 466


>gi|118486912|gb|ABK95290.1| unknown [Populus trichocarpa]
          Length = 438

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 92/359 (25%), Positives = 154/359 (42%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
           +VR+ +GTP + + ++LDT +   +             F P  S++   ++C    C+  
Sbjct: 99  VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGFSSTTFLPNASTTLGSLDCSGAQCSQV 158

Query: 49  --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             F C    +  C++   Y   S        + I++        +  G  FGC N   G 
Sbjct: 159 RGFSCPATGSSACLFNQSYGGDSSLTATLVQDAITL-----ANDVIPGFTFGCINAVSGG 213

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +     G+LGL R  IS ISQ G++    FSYCL  P     Y S  LK G  +G 
Sbjct: 214 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 265

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P+  + YY++L  +S+   ++  P +      +   G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 324

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY+ + ++F     R Q+            C F        P++  +FE  NL +
Sbjct: 325 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAITLHFEGLNLVL 378

Query: 282 DGENVFIIDYENHFFLL--AVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         L  A AP+  + ++ +I + QQ++ R ++D     L   +E C
Sbjct: 379 PMENSLIHSSSGSLACLSMAAAPNNVNSVLNVIANLQQQNLRIMFDTTNSRLGIARELC 437


>gi|297826117|ref|XP_002880941.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297326780|gb|EFH57200.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 397

 Score =  102 bits (253), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 153/357 (42%), Gaps = 45/357 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++RL +GTP   ++  +DTGS LI+               IFDP KSS+F++        
Sbjct: 62  LMRLQLGTPPFEIVAEIDTGSDLIWTQCMPCPNCYTQFAPIFDPSKSSTFKEK------- 114

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              +C    C Y + YAD+S + G  A ET+++        +      GC  +N      
Sbjct: 115 ---RCHGNSCPYEIIYADESYSTGILATETVTIQSTSGEPFVMAETSIGCGLNNSNLMTP 171

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
               + +G++GL+    S ISQ+   I    SYC           +S + FGT+      
Sbjct: 172 GYAASSSGIVGLNMGPSSLISQMDLPIPGLISYCF------SSQGTSKINFGTNAVVAGD 225

Query: 167 STQAT-KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
            T A   FI     FYYL+L  +S+ ++R+      F    + +G   IDSG+  TY  +
Sbjct: 226 GTVAADMFIKKDQPFYYLNLDAVSVGDKRIETLGTPFH---AQDGNIFIDSGTTYTYLPT 282

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQ---LCYFLPETFNRFPSMAFYFE-DANLRI 281
                 +   V       +   +  P+P     LCY   +T   FP +  +F   A+L +
Sbjct: 283 S-----YCNLVREAVAASVVAANQVPDPSSENLLCYNW-DTMEIFPVITLHFAGGADLVL 336

Query: 282 DGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           D  N+++       F LA+   D  + A+ G++   +    YD +  ++SF   NCS
Sbjct: 337 DKYNMYVETITGGTFCLAIGCVDPSMPAIFGNRAHNNLLVGYDSSTLVISFSPTNCS 393


>gi|297805036|ref|XP_002870402.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316238|gb|EFH46661.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  101 bits (252), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 152/358 (42%), Gaps = 40/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L +GTP   ++ + DTGS LI+               +FDP+ SS+++ ++C    C
Sbjct: 95  LMNLSLGTPPSPIMAVADTGSNLIWTQCKPCDDCYTQVDPLFDPKASSTYKDVSCSSSQC 154

Query: 47  TYFK----CVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           T  +    C  E   C Y + YAD S T G  A +T+++             + GC  +N
Sbjct: 155 TALENQASCSTEDKTCSYLVSYADGSYTMGKFAVDTLTLGSTDNRPVQLKNIIIGCGQNN 214

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
                +   G +     L    +S I QLG  I  +FSYCLV   P  + TS  + FGT+
Sbjct: 215 AVTFRNKSSGVVG----LGGGAVSLIKQLGDSIDGKFSYCLV---PENDQTSK-INFGTN 266

Query: 161 MGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                P T +T   +   + FYYL+LK IS+ ++ M  P          +G  +IDSG+ 
Sbjct: 267 AVVSGPGTVSTPLVVKSRDTFYYLTLKSISVGSKNMQTPDSNI------KGNMVIDSGTT 320

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           LT      Y ++     S        +  D      LCY      N  P +  +FE A++
Sbjct: 321 LTLLPVKYYIEIENAVASLINA---DKSKDERIGSSLCYNATADLN-IPVITMHFEGADV 376

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++   N F    E+    LA         + G+  Q++    YD     +SF   +C+
Sbjct: 377 KLYPYNSFFKVTED-LVCLAFGMSFYRNGIYGNVAQKNFLVGYDTASKTMSFKPTDCA 433


>gi|297829808|ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 98/370 (26%), Positives = 152/370 (41%), Gaps = 52/370 (14%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDC--- 46
           +GTP+K   +++DTGS L +                 +F   +S SF+ + C    C   
Sbjct: 94  VGTPAKKFRVVVDTGSELTWVNCRYRGRGKGKVKNRRVFRAEESKSFKTVGCFTQTCKVD 153

Query: 47  -------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                  +     +  C Y  +YAD S  +G  A ETI+V      KA   G L GCS+ 
Sbjct: 154 LMNLFSLSTCPTPSTPCSYDYRYADGSAAQGVFAKETITVGLTNGRKARLRGLLVGCSSS 213

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G      D    GVLGL+    SF S   S+   + SYCLV  L N +  S+YL FG 
Sbjct: 214 FSGQSFQGAD----GVLGLAFSDFSFTSTATSLFGAKLSYCLVDHLSN-KNISNYLIFGY 268

Query: 160 DMGYRRPSTQATKF----INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
                   T   +     +     FY +++  ISI ++ ++ P   +D T    GG I+D
Sbjct: 269 SSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGISIGDDMLDIPTQVWDATTG--GGTILD 326

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFN--RFPSMA 271
           SG+ LT      Y  +      Y    +  +    PE  PI+ C+     FN  + P + 
Sbjct: 327 SGTSLTLLAEAAYKPVVTGLARYLVELKRVK----PEGIPIEYCFSSTSGFNESKLPQLT 382

Query: 272 FYFEDANLRIDGENVFIIDYENHF----FLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           F+ +           +++D         F+ A  P  ++V   G+  Q++  + +DL   
Sbjct: 383 FHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVV---GNIMQQNYLWEFDLMAS 439

Query: 328 LLSFVKENCS 337
            LSF    C+
Sbjct: 440 TLSFAPSTCT 449


>gi|255548664|ref|XP_002515388.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545332|gb|EEF46837.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 494

 Score =  101 bits (251), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 147/365 (40%), Gaps = 54/365 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + +GTP K   LI DTGS L +               AIF+P +S+S+  I+C    C
Sbjct: 155 VTVGLGTPKKDFSLIFDTGSDLTWTQCEPCVKSCYNQKEAIFNPSQSTSYANISCGSTLC 214

Query: 47  --------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
                     F C +  CVY ++Y D S + GF   E +S+        +F+   FGC  
Sbjct: 215 DSLASATGNIFNCASSTCVYGIQYGDSSFSIGFFGKEKLSLTATD----VFNDFYFGCGQ 270

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N G    A            R  +S +SQ      K FSYC    LP+   ++ +L FG
Sbjct: 271 NNKGLFGGAAGLLGL-----GRDKLSLVSQTAQRYNKIFSYC----LPSSSSSTGFLTFG 321

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                +  S      I+  ++FY L L  IS+   ++   P  F        G IIDSG+
Sbjct: 322 GSTS-KSASFTPLATISGGSSFYGLDLTGISVGGRKLAISPSVFST-----AGTIIDSGT 375

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCY-FLPETFNRFPSMAFYFE 275
           V+T      Y  L   F     R  ++Q    P    +  C+ F        P +  +F 
Sbjct: 376 VITRLPPAAYSALSSTF-----RKLMSQYPAAPALSILDTCFDFSNHDTISVPKIGLFFS 430

Query: 276 DA-NLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFV 332
               + ID   +F ++       LA A + D   VA+ G+ QQ+    VYD     + F 
Sbjct: 431 GGVVVDIDKTGIFYVNDLTQ-VCLAFAGNSDASDVAIFGNVQQKTLEVVYDGAAGRVGFA 489

Query: 333 KENCS 337
              CS
Sbjct: 490 PAGCS 494


>gi|225217022|gb|ACN85307.1| aspartic proteinase nepenthesin-1 precursor [Oryza ridleyi]
          Length = 525

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 92/363 (25%), Positives = 148/363 (40%), Gaps = 51/363 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS+   I+C  P 
Sbjct: 187 VVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYEQQEKLFDPARSSTDANISCAAPA 246

Query: 46  CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+  Y K C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 247 CSDLYTKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAIKGFRFGCGERNEG 302

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C     P     + YL FG    
Sbjct: 303 LFGEA-----AGLLGLGRGKTSLPVQAYDKYGGVFAHC----FPARSSGTGYLDFGPGSS 353

Query: 163 YRRPSTQATK-FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
               +   T   +++   FYY+ L  I +  + ++ PP  F        G I+DSG+V+T
Sbjct: 354 PAVSTKLTTPMLVDNGLTFYYVGLTGIRVGGKLLSIPPSVFTTA-----GTIVDSGTVIT 408

Query: 222 YFHSDVYWKLHEKFVS------YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
                 Y  L   F S      Y +   L+ L  C +      F   +    P+++  F+
Sbjct: 409 RLPPAAYSSLRSAFASAIAARGYKKAPALSLLDTCYD------FTGMSQVAIPTVSLLFQ 462

Query: 276 -DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
             A+L +D    ++           A    DD V ++G+ Q +    VYD+   ++ F  
Sbjct: 463 GGASLDVDASGIIYAASVSQACLGFAANEEDDDVGIVGNTQLKTFGVVYDIGKKVVGFSP 522

Query: 334 ENC 336
             C
Sbjct: 523 GAC 525


>gi|357137345|ref|XP_003570261.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 458

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/368 (25%), Positives = 153/368 (41%), Gaps = 67/368 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           ++ + IGTP+    +++DTGS + +              FDP KSS++   +C    CT 
Sbjct: 126 VITVSIGTPAMTQAVMIDTGSDVSWVHCHARAGAGSSLFFDPGKSSTYTPFSCSSAACTR 185

Query: 49  FK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC---SND 99
            +       +N  C YT++Y D S T G    +T++ +   E    F    FGC   S+ 
Sbjct: 186 LEGRDNGCSLNSTCQYTVRYGDGSNTTGTYGSDTLA-LNSTEKVENFQ---FGCSETSDP 241

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G DED  D    G++GL     S +SQ  +     FSYC    LP    +S +L  G 
Sbjct: 242 GEGLDEDQTD----GLMGLGGGAPSLVSQTAATYGSAFSYC----LPATTRSSGFLTLGA 293

Query: 160 DMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
                  ST  + F+  P         FY++ L+ I++  + +   P  F        G 
Sbjct: 294 -------STGTSGFVTTPMFRSRRAPTFYFVILQGINVGGDPVAISPTVF------AAGS 340

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
           I+DSG+++T      Y  L   F +   R+  A+       +  C+ F  +     P++ 
Sbjct: 341 IMDSGTIITRLPPRAYSALSAAFRAGMRRYPRARAFSI---LDTCFDFTGQDNVSIPAVE 397

Query: 272 FYFEDANLRIDGENVFIIDYENHFF--LLAVAPHDDLV-ALIGSQQQRDTRFVYDLNIDL 328
             F        G  V  +D +   +   LA AP    + ++IG+ QQR    ++D+   +
Sbjct: 398 LVFS-------GGAVVDLDADGIMYGSCLAFAPATGGIGSIIGNVQQRTFEVLHDVGQSV 450

Query: 329 LSFVKENC 336
           L F    C
Sbjct: 451 LGFRPGAC 458


>gi|225216930|gb|ACN85225.1| aspartic proteinase nepenthesin-1 precursor [Oryza punctata]
 gi|225216938|gb|ACN85232.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 516

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 89/357 (24%), Positives = 142/357 (39%), Gaps = 41/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 239

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 240 CSDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 295

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSP 346

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
             R +T      N P  FYY+ L  I +    +  P   F        G I+DSG+V+T 
Sbjct: 347 AARLTTTPMLVDNGP-TFYYVGLTGIRVGGRLLYIPQSVFATA-----GTIVDSGTVITR 400

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DANLR 280
                Y  L   F +        + +     +  CY F   +    P+++  F+  A L 
Sbjct: 401 LPPAAYSSLRSAFAAAMSARGYKK-APAVSLLDTCYDFAGMSQVAIPTVSLLFQGGARLD 459

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +D   +      +   L   A  D   V ++G+ Q +     YD+   ++SF    C
Sbjct: 460 VDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVSFSPGAC 516


>gi|302758750|ref|XP_002962798.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
 gi|300169659|gb|EFJ36261.1| hypothetical protein SELMODRAFT_78156 [Selaginella moellendorffii]
          Length = 427

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 106/383 (27%), Positives = 160/383 (41%), Gaps = 65/383 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
           V L +GTP+K   LI+DTGS L +                   +D   SSS+++I C   
Sbjct: 61  VELRVGTPAKKFPLIVDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDD 120

Query: 45  DCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEG----- 85
           +C +                C YT  Y+DQS T G  A+ETIS+      GK  G     
Sbjct: 121 ECQFLPAPIGSSCSITSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTR 180

Query: 86  KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ-----LGSIIKKRFSYC 140
           +        GCS ++ G    A     +GVLGL +  IS  +Q     LG I    FSYC
Sbjct: 181 RIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI----FSYC 232

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-F 197
           LV  L  G   SS+L  G    +R+     T  + +P   +FYY+++  +++D + ++  
Sbjct: 233 LVDYL-RGSNASSFLVMGRTH-WRK--LAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 288

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQ 255
               + I   G  G I DSG+ L+Y     Y K+     +  Y  R Q     + PE  +
Sbjct: 289 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQ-----EIPEGFE 343

Query: 256 LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN-HFFLLAVAPHDDLVALIGSQ 313
           LCY +       P +   F+  A + +   N  ++  EN     L      +   ++G+ 
Sbjct: 344 LCYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNL 403

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
            Q+D    YDL    + F    C
Sbjct: 404 LQQDHHIEYDLAKARIGFKWSPC 426


>gi|224101015|ref|XP_002312106.1| predicted protein [Populus trichocarpa]
 gi|222851926|gb|EEE89473.1| predicted protein [Populus trichocarpa]
          Length = 440

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 164/378 (43%), Gaps = 60/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L  GTP + + ++LDTGS L +          +IF+P  S ++ KI C  P C     
Sbjct: 69  VSLTAGTPLQNITMVLDTGSELSWLHCKKEPNFNSIFNPLASKTYTKIPCSSPTCETRTR 128

Query: 49  -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C   + C + + YAD S  +G  A ET  V     G       +FGC +   G
Sbjct: 129 DLPLPVSCDPAKLCHFIISYADASSVEGNLAFETFRV-----GSVTGPATVFGCMDS--G 181

Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSSY 154
           F  ++  D    G++G++R ++SF++Q+G    ++FSYC+       V+ L  GE + S+
Sbjct: 182 FSSNSEEDAKTTGLMGMNRGSLSFVNQMGF---RKFSYCISDRDSSGVLLL--GEASFSW 236

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           LK    + Y      +T         Y + L+ I + ++ ++ P   F    +G G  ++
Sbjct: 237 LK---PLNYTPLVEMSTPLPYFDRVAYSVQLEGIRVSDKVLSLPKSVFVPDHTGAGQTMV 293

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRFPS 269
           DSG+  T+    VY  L ++F+   +   + ++ + P       + LCY +  T    P+
Sbjct: 294 DSGTQFTFLLGPVYSALKQEFL--LQTKGVLRVLNEPRYVFQGAMDLCYLIEPTRAALPN 351

Query: 270 MA---FYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA---LIGSQQQRDT 318
           +      F  A + + G+ +       +  ++  +       D L     +IG  QQ++ 
Sbjct: 352 LPVVNLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDSLGIESFVIGHHQQQNV 411

Query: 319 RFVYDLNIDLLSFVKENC 336
              YDL    + F +  C
Sbjct: 412 WMEYDLEKSRIGFAEVRC 429


>gi|302758122|ref|XP_002962484.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
 gi|300169345|gb|EFJ35947.1| hypothetical protein SELMODRAFT_78458 [Selaginella moellendorffii]
          Length = 395

 Score =  101 bits (251), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 106/383 (27%), Positives = 157/383 (40%), Gaps = 65/383 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
           V L +GTP+K   LI+DTGS L +                   +D   SSS+++I C   
Sbjct: 29  VELRVGTPAKKFPLIIDTGSDLTWIQCNPPNTTANSSSPPAPWYDKSSSSSYREIPCTDD 88

Query: 45  DCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAI------ 88
           +C +                C YT  Y+DQS T G  A+ETIS+   K  GK        
Sbjct: 89  ECLFLPAPIGSSCSIKSPSPCDYTYGYSDQSRTTGILAYETISMKSRKRSGKRAGNHKTR 148

Query: 89  ---FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ-----LGSIIKKRFSYC 140
                    GCS ++ G    A     +GVLGL +  IS  +Q     LG I    FSYC
Sbjct: 149 TIRIKNVALGCSRESVG----ASFLGASGVLGLGQGPISLATQTRHTALGGI----FSYC 200

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMN-F 197
           LV  L  G   SS+L  G     R      T  + +P   +FYY+++  +++D + ++  
Sbjct: 201 LVDYL-RGSNASSFLVMGRT---RWRKLAHTPIVRNPAAQSFYYVNVTGVAVDGKPVDGI 256

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS--YFERFQLAQLSDCPEPIQ 255
               + I   G  G I DSG+ L+Y     Y K+     +  Y  R Q     + PE  +
Sbjct: 257 ASSDWGIDGDGNKGTIFDSGTTLSYLREPAYSKVLGALNASIYLPRAQ-----EIPEGFE 311

Query: 256 LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN-HFFLLAVAPHDDLVALIGSQ 313
           LCY +       P +   F+  A + +   N  ++  EN     L      +   ++G+ 
Sbjct: 312 LCYNVTRMEKGMPKLGVEFQGGAVMELPWNNYMVLVAENVQCVALQKVTTTNGSNILGNL 371

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
            Q+D    YDL    + F    C
Sbjct: 372 LQQDHHIEYDLAKARIGFKWSPC 394


>gi|302757235|ref|XP_002962041.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
 gi|300170700|gb|EFJ37301.1| hypothetical protein SELMODRAFT_64201 [Selaginella moellendorffii]
          Length = 367

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 99/371 (26%), Positives = 154/371 (41%), Gaps = 45/371 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           + + +G+P K    I+DTGS L++               I+DP  SS+F K +C    C 
Sbjct: 6   MEIELGSPPKKFNAIVDTGSDLVWIQCKPCSQCYSQSDPIYDPSASSTFAKTSCSTSSCQ 65

Query: 48  YFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y  +Y D S T+G  A ET+++   G     F    FGC   N G
Sbjct: 66  SLPASGCSSSAKTCIYGYQYGDSSSTQGDFALETLTLRSSGGSSKAFPNFQFGCGRLNSG 125

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG++GL +  IS  +QLGS I  +FSYCLV    +    +S L FG+   
Sbjct: 126 -----SFGGAAGIVGLGQGKISLSTQLGSAINNKFSYCLV-DFDDDSSKTSPLIFGSSAS 179

Query: 163 YRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFD-ITVSGE----------- 209
               +       N   + +Y++ L+ IS+  ++++      D ++V  +           
Sbjct: 180 TGSGAISTPIIPNSGRSTYYFVGLEGISVGGKQLSLATRAIDFLSVRSKKKLRVRALEVN 239

Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RF 267
            GG I DSG+ LT     VY K+   F S      L  +        LCY + ++ N +F
Sbjct: 240 SGGTIFDSGTTLTLLDDAVYSKVKSAFAS---SVSLPTVDASSSGFDLCYDVSKSKNFKF 296

Query: 268 PSMAFYFEDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQ-QQRDTRFVYDLN 325
           P++   F+        +N F I+D       LA+     L   I     Q++   VYD  
Sbjct: 297 PALTLAFKGTKFSPPQKNYFVIVDTAETVACLAMGGSGSLGLGIIGNLMQQNYHVVYDRG 356

Query: 326 IDLLSFVKENC 336
              +S     C
Sbjct: 357 TSTISMSPAQC 367


>gi|449516339|ref|XP_004165204.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 456

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 165/360 (45%), Gaps = 37/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IG+P    L+++DTGS+L++              + FDP KS SF+ + C  P  
Sbjct: 105 LVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY 164

Query: 47  TY---FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            Y   +KC    Q  Y ++Y     ++G  A E++      EGK       FGC + N  
Sbjct: 165 NYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGKIKKSNITFGCGHMN-- 222

Query: 103 FDEDARDGALAGVLGLSRVT-ISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
             +   D A  GV GL     I+  +QLG+    +FSYC +  + N  YT ++L  G   
Sbjct: 223 -IKTNNDDAYNGVFGLGAYPHITMATQLGN----KFSYC-IGDINNPLYTHNHLVLGQGS 276

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ST       H    YY++L+ IS+ ++ +   P+ F I+  G GG +IDSG   T
Sbjct: 277 YIEGDSTPLQIHFGH----YYVTLQSISVGSKTLKIDPNAFKISSDGSGGVLIDSGMTYT 332

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DAN 278
              +  +  L+++ V   +   L ++    +   LC+   +      FP++ F+F   A+
Sbjct: 333 KLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDLVGFPAVTFHFAGGAD 391

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L ++  ++F     + F L  +  + +L+ L  IG   Q++    +DL    + F + +C
Sbjct: 392 LVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFDLEQMKVFFRRIDC 451


>gi|293333012|ref|NP_001168410.1| uncharacterized protein LOC100382179 precursor [Zea mays]
 gi|223948083|gb|ACN28125.1| unknown [Zea mays]
 gi|413953783|gb|AFW86432.1| hypothetical protein ZEAMMB73_737575 [Zea mays]
          Length = 466

 Score =  100 bits (250), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 91/368 (24%), Positives = 148/368 (40%), Gaps = 64/368 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           ++ + +GTP+   ++ +DTGS + +                 +FDP KS+++   +C   
Sbjct: 131 VITVSLGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAKSATYSAFSCSSA 190

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C         C+N  C Y +KY D S T G    +T+ +      K       FGCS+ 
Sbjct: 191 QCAQLGGEGNGCLNSHCQYIVKYVDHSNTTGTYGSDTLGLTTSDAVKNF----QFGCSHR 246

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            +GF      G L G++GL   T S +SQ  +   K FSYCL    P+      +L  G 
Sbjct: 247 ANGFV-----GQLDGLMGLGGDTESLVSQTAATYGKAFSYCLP---PSSSSAGGFLTLGA 298

Query: 160 DMGYRRPSTQATKFINHP------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G     T ++++   P        FY + L+ I++   ++N P   F       G  +
Sbjct: 299 AAG----GTSSSRYSRTPLVRFNVPTFYGVFLQAITVAGTKLNVPASVF------SGASV 348

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPS 269
           +DSG+V+T      Y  L   F    + +  A       P+ +   C+ F      R P 
Sbjct: 349 VDSGTVITQLPPTAYQALRTAFKKEMKAYPSA------APVGILDTCFDFSGIKTVRVPV 402

Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +   F   A + +D   +F   Y       A A  D    ++G+ QQR    ++D+    
Sbjct: 403 VTLTFSRGAVMDLDVSGIF---YAGCLAFTATA-QDGDTGILGNVQQRTFEMLFDVGGST 458

Query: 329 LSFVKENC 336
           L F    C
Sbjct: 459 LGFRPGAC 466


>gi|242058093|ref|XP_002458192.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
 gi|241930167|gb|EES03312.1| hypothetical protein SORBIDRAFT_03g028480 [Sorghum bicolor]
          Length = 468

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 147/368 (39%), Gaps = 56/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTPS   +L++DTGS L +                 +FDP KSS++  I C+  
Sbjct: 125 VVTVGLGTPSVSQVLLIDTGSDLSWVQCQPCNSTTCYPQKDPLFDPSKSSTYAPIPCNTD 184

Query: 45  DCTYFK-------CVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
            C           C +     QC + + Y D S T+G  ++ET++ +  G     F    
Sbjct: 185 ACRDLTDDGYGGGCASGDGAAQCGFAITYGDGSQTRGVYSNETLA-LAPGVAVKDFR--- 240

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC     G D+D  +    G+LGL     S + Q  S+    FSYCL  P  N +    
Sbjct: 241 FGC-----GHDQDGANDKYDGLLGLGGAPESLVVQTASVYGGAFSYCL--PALNNQVGFL 293

Query: 154 YLKFGTDMGYRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            L  G        +T     T  I     FY +++  I++  E ++ PP  F       G
Sbjct: 294 ALGGGGAPSGGVVNTSGFVFTPMIREEETFYVVNMTGITVGGEPIDVPPSAF------SG 347

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPS 269
           G IIDSG+V+T      Y  L   F      + L +  +    +  CY      N   P 
Sbjct: 348 GMIIDSGTVVTELQHTAYNALQAAFRKAMAAYPLVRNGE----LDTCYDFSGYSNVTLPK 403

Query: 270 MAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +A  F   A + +D  N  ++D          +  DD   ++G+  QR    +YD     
Sbjct: 404 VALTFSGGATIDLDVPNGILLD---DCLAFQESGPDDQPGILGNVNQRTLEVLYDAGRGR 460

Query: 329 LSFVKENC 336
           + F    C
Sbjct: 461 VGFRAAVC 468


>gi|297741705|emb|CBI32837.3| unnamed protein product [Vitis vinifera]
          Length = 455

 Score =  100 bits (250), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 156/356 (43%), Gaps = 40/356 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  L IG P   V ++LDTGS L +               I++  KS S+ ++ C+ P C
Sbjct: 107 LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPC 166

Query: 47  TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  +C +   C+Y   YAD S T G  ++E ++       +       FGC   N 
Sbjct: 167 LSLGREGQCSDSGSCLYQTSYADGSRTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNL 226

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTD 160
            F   +RDG + G+       +S +S +G +  K F+YC   +  PN      +L FG D
Sbjct: 227 NFVTSSRDGGVLGLGPGLVSLVSQLSAIGKV-SKSFAYCFGNLSNPNA---GGFLVFG-D 281

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             Y                FYY++L  I   ++  R++    +F+    G GG IIDSGS
Sbjct: 282 ATYLNGDMTPMVIAE----FYYVNLLGIGLGVEEPRLDINSSSFERKPDGSGGVIIDSGS 337

Query: 219 VLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE 275
            L+ F  +VY  +    V   ++ + ++ L+  P+    C+   +      FP++  Y E
Sbjct: 338 TLSIFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----CFEGKIGRDLPLFPTLVLYLE 393

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
              +  D  ++F+  Y+   F L     + L ++IG+  Q+  +F Y+L +  LS 
Sbjct: 394 STGILNDRWSIFLQRYD-ELFCLGFTSGEGL-SIIGTLAQQSYKFGYNLELSTLSI 447


>gi|449464952|ref|XP_004150193.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
 gi|449526850|ref|XP_004170426.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 2-like [Cucumis
           sativus]
          Length = 476

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 92/358 (25%), Positives = 150/358 (41%), Gaps = 43/358 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP  S+++  I+CD   C 
Sbjct: 139 VRIGVGSPPRSQYVVIDSGSDIVWVQCQPCSECYQQSDPVFDPAGSATYAGISCDSSVCD 198

Query: 48  YF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C + +C Y + Y D S T+G  A ET++      G+ +      GC + N G  
Sbjct: 199 RLDNAGCNDGRCRYEVSYGDGSYTRGTLALETLTF-----GRVLIRNIAIGCGHMNRGMF 253

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A               +SF+ QLG      FSYCLV     G  ++  L+FG   G  
Sbjct: 254 IGAAGLLGL-----GGGAMSFVGQLGGQTGGAFSYCLV---SRGTESTGTLEFG--RGAM 303

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                    I +P   +FYY+ L  + +   R+  P   F++T  G GG ++D+G+ +T 
Sbjct: 304 PVGAAWVPLIRNPRAPSFYYVGLSGLGVGGIRVPIPEQIFELTDLGYGGVVMDTGTAVTR 363

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQL--SDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN- 278
             +  Y    + F+      Q A L  SD       CY L    + R P+++FYF     
Sbjct: 364 LPAPAYEAFRDTFIG-----QTANLPRSDRVSIFDTCYNLNGFVSVRVPTVSFYFSGGPI 418

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +   N  I       F  A A     +++IG+ QQ   +   D +   + F    C
Sbjct: 419 LTLPARNFLIPVDGEGTFCFAFAASASGLSIIGNIQQEGIQISIDGSNGFVGFGPTIC 476


>gi|358346443|ref|XP_003637277.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355503212|gb|AES84415.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 92/357 (25%), Positives = 164/357 (45%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++   +GTP   V   +DTGS +++               IF+P KSSS++ I C    C
Sbjct: 90  LISYSVGTPPFKVYGFMDTGSNIVWLQCQPCNTCFNQTSPIFNPSKSSSYKNIPCTSSTC 149

Query: 47  -----TYFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                T+  C N  + C Y++ Y   + ++G  +++++++        +F   + GC + 
Sbjct: 150 KDTNDTHISCSNGGDVCEYSITYGGDAKSQGDLSNDSLTLDSTSGSSVLFPNIVIGCGHI 209

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           N   D        +GV+G+ R  +S I Q+G S +  +FSYCL IP  +   +SS L FG
Sbjct: 210 NVLQDNSQS----SGVVGMGRGPMSLIKQVGSSSVGSKFSYCL-IPYNSDSNSSSKLIFG 264

Query: 159 TDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            D+   G    ST   K +N   N+Y+L+L+  S+ N R+ +     + + +     +ID
Sbjct: 265 EDVVVSGEIVVSTPMVK-VNGQENYYFLTLEAFSVGNNRIEYG----ERSNASTQNILID 319

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           SG+ LT   +    KL    VSY  +  +L ++      + LCY         P +  +F
Sbjct: 320 SGTPLTMLPNLFLSKL----VSYVAQEVKLPRIEPPDHHLSLCYNTTGKQLNVPDITAHF 375

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
             A+++++    F   +E+          + L  + G+  Q +    YDL  +++SF
Sbjct: 376 NGADVKLNSNGTF-FPFEDGIMCFGFISSNGL-EIFGNIAQNNLLIDYDLEKEIISF 430


>gi|15226317|ref|NP_180370.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|4063755|gb|AAC98463.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|20197953|gb|AAM15327.1| putative chloroplast nucleoid DNA binding protein [Arabidopsis
           thaliana]
 gi|330252977|gb|AEC08071.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 392

 Score =  100 bits (249), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 97/358 (27%), Positives = 155/358 (43%), Gaps = 52/358 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   +   +DTGS LI+               IFDP  SS+F++  C+    
Sbjct: 62  LMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCN---- 117

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + YAD + +KG  A ET+++        +      GC +++  F   
Sbjct: 118 ------GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKP- 170

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
                 +G++GLS    S I+Q+G       SYC           +S + FGT+      
Sbjct: 171 ----TFSGMVGLSWGPSSLITQMGGEYPGLMSYCF------ASQGTSKINFGTNAIVAGD 220

Query: 167 STQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
              +T           YYL+L  +S+ +  +     TF    + EG  IIDSG+ LTYF 
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH---ALEGNIIIDSGTTLTYFP 277

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
              Y  L  + V ++      + +D      LCY+  +T + FP +  +F   A+L +D 
Sbjct: 278 VS-YCNLVREAVDHY--VTAVRTADPTGNDMLCYYT-DTIDIFPVITMHFSGGADLVLDK 333

Query: 284 ENVFIIDYENHFFLLAV----APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            N++I       F LA+     P D   A+ G++ Q +    YD +  L+SF   NCS
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVSFSPTNCS 388


>gi|326524806|dbj|BAK04339.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 460

 Score =  100 bits (249), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 94/342 (27%), Positives = 161/342 (47%), Gaps = 40/342 (11%)

Query: 28  FDPRKSSSFQKINCDHPDCT-----YFKCVNEQCVYTMKYADQSV-TKGFAAHETISVIG 81
           F+P KS SF+++  ++  C      + + V + C +     D S   +G  ++ET++   
Sbjct: 128 FEPAKSPSFRRLPGNNAFCLPAPRGHRRTVQDPCKFHSIRLDGSADARGVLSNETLAFAA 187

Query: 82  KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG-----SIIKKR 136
            G+ +    G + GC++++ GF+ ++  G LAGVLGL R   S I  LG     ++   R
Sbjct: 188 SGQQQTEVTGVVIGCTHNSKGFNFNSH-GVLAGVLGLGRQAPSLIWTLGQHRHGTVQVHR 246

Query: 137 FSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ---ATKFI----NHPNNF--YYLSLKD 187
           FSYCL     +     ++L+F  D+    P+TQ   +TK +        +F  Y++SL  
Sbjct: 247 FSYCLPSHGSSSSDHHTFLRFDDDV----PNTQHMVSTKIMYMDSTTSRDFRAYFVSLTG 302

Query: 188 ISIDNERMNFPPDTFDITVSGE---GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQL 244
           IS+  + +    + F   V G+    GC  D+G+         Y KL +  V + +   L
Sbjct: 303 ISVAGKPLQDVKELFKRHVHGQVWTSGCAFDAGTPTMVMIMPAYNKLKDAVVRHLKPLGL 362

Query: 245 AQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED--ANLRIDGENVFI-IDYENHFFLLAV 300
             +S       LC+    + +   P++   F +  A L +  + +F+ + Y+     LAV
Sbjct: 363 QIVSG---QYHLCFRATSQLWQHLPTVMLQFAETEARLVLPPQRLFVAVGYD---ICLAV 416

Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN-CSDDSA 341
               D + +IG+ QQ D RFVYD+    + FV EN C  D+ 
Sbjct: 417 VRSYD-ITIIGAMQQVDKRFVYDVRHGRIYFVPENACHADAG 457


>gi|357118064|ref|XP_003560779.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 472

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 100/374 (26%), Positives = 152/374 (40%), Gaps = 65/374 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V L IGTP+    +++DTGS L +                 +FDP KSS+F  I C   
Sbjct: 126 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNASDCYPQKDPLFDPSKSSTFATIPCASD 185

Query: 45  DCTYFK-------CVNE------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
            C           C N       QC Y ++Y + ++T+G  + ET+++       A+   
Sbjct: 186 ACKQLPVDGYDNGCTNNTSGMPPQCGYAIEYGNGAITEGVYSTETLAL----GSSAVVKS 241

Query: 92  ALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
             FGC +D HG +D+        G+LGL     S +SQ  S+    FSYCL  PL +G  
Sbjct: 242 FRFGCGSDQHGPYDK------FDGLLGLGGAPESLVSQTASVYGGAFSYCLP-PLNSG-- 292

Query: 151 TSSYLKFGTDMGYRRPS-----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
            + +L  G        +     T    F      FY ++L  IS+  + ++ PP  F   
Sbjct: 293 -AGFLTLGAPNSTNNSNSGFVFTPMHAFSPKIATFYVVTLTGISVGGKALDIPPAVF--- 348

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
                G I+DSG+V+T   +  Y  L   F S    + L   +D    +  CY F     
Sbjct: 349 ---AKGNIVDSGTVITGIPTTAYKALRTAFRSAMAEYPLLPPAD--SALDTCYNFTGHGT 403

Query: 265 NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVA-PHDDLVALIGSQQQRDTRFVY 322
              P +A  F   A + +D  +  +++       LA A   D    +IG+   R    +Y
Sbjct: 404 VTVPKVALTFVGGATVDLDVPSGVLVED-----CLAFADAGDGSFGIIGNVNTRTIEVLY 458

Query: 323 DLNIDLLSFVKENC 336
           D     L F    C
Sbjct: 459 DSGKGHLGFRAGAC 472


>gi|224099307|ref|XP_002311432.1| predicted protein [Populus trichocarpa]
 gi|222851252|gb|EEE88799.1| predicted protein [Populus trichocarpa]
          Length = 458

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 98/384 (25%), Positives = 161/384 (41%), Gaps = 60/384 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINC---- 41
           V + +G+P + +LL+ DTGS L +                + F  R S++F   +C    
Sbjct: 85  VSIRLGSPPQTLLLVADTGSDLTWVRCSACKTNCSIHPPGSTFLARHSTTFSPTHCFSSL 144

Query: 42  ------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                  +P+      ++  C Y   Y+D S T GF + ET ++      +       FG
Sbjct: 145 CQLVPQPNPNPCNHTRLHSTCRYEYVYSDGSKTSGFFSKETTTLNTSSGREMKLKSIAFG 204

Query: 96  CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT-- 151
           C     G      + +GA +GV+GL R  ISF SQLG    + FSYCL+      +YT  
Sbjct: 205 CGFHASGPSLIGSSFNGA-SGVMGLGRGPISFASQLGRRFGRSFSYCLL------DYTLS 257

Query: 152 ---SSYLKFGTDMGYRRPSTQATKF----IN-HPNNFYYLSLKDISIDNERMNFPPDTFD 203
              +SYL  G  +  ++ +     F    IN     FYY+S+K + +D  +++  P  + 
Sbjct: 258 PPPTSYLMIGDVVSTKKDNKSMMSFTPLLINPEAPTFYYISIKGVFVDGVKLHIDPSVWS 317

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP------EPIQLC 257
           +   G GG +IDSG+ LT+     Y     + +S F+R ++   S  P          LC
Sbjct: 318 LDELGNGGTVIDSGTTLTFLTEPAY----REILSAFKR-EVKLPSPTPGGASTRSGFDLC 372

Query: 258 YFLPE-TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAP---HDDLVALIGSQ 313
             +   +  RFP ++      +L       + ID       LA+ P        ++IG+ 
Sbjct: 373 VNVTGVSRPRFPRLSLELGGESLYSPPPRNYFIDISEGIKCLAIQPVEAESGRFSVIGNL 432

Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
            Q+     +D     L F +  C+
Sbjct: 433 MQQGFLLEFDRGKSRLGFSRRGCA 456


>gi|212274314|ref|NP_001130524.1| uncharacterized protein LOC100191623 [Zea mays]
 gi|194689376|gb|ACF78772.1| unknown [Zea mays]
 gi|224031455|gb|ACN34803.1| unknown [Zea mays]
 gi|238011528|gb|ACR36799.1| unknown [Zea mays]
 gi|238015454|gb|ACR38762.1| unknown [Zea mays]
          Length = 304

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 87/316 (27%), Positives = 133/316 (42%), Gaps = 45/316 (14%)

Query: 40  NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--FGCS 97
           +C+ PD          C Y   Y D ++T G  A E  +    G G          FGC 
Sbjct: 15  SCERPD---------TCTYRYNYGDGTMTVGVYATERFTFASSGGGGLTTTTVPLGFGCG 65

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SY 154
           + N G   +      +G++G  R  +S +SQL SI  +RFSYCL        Y S   S 
Sbjct: 66  SVNVGSLNNG-----SGIVGFGRNPLSLVSQL-SI--RRFSYCLT------SYASRRQST 111

Query: 155 LKFGT----DMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG 208
           L FG+      G      Q T  +  P N  FYY+    +++   R+  P   F +   G
Sbjct: 112 LLFGSLSDGVYGDATGRVQTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDG 171

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-- 266
            GG I+DSG+ LT   + V   L E   ++ ++ +L   +       +C+ +P  + R  
Sbjct: 172 SGGVIVDSGTALTLLPAAV---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSS 228

Query: 267 ------FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
                  P M  +F+ A+L +   N  + D+      L +A   D  + IG+  Q+D R 
Sbjct: 229 STSQMPVPRMVLHFQGADLDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRV 288

Query: 321 VYDLNIDLLSFVKENC 336
           +YDL  + LS     C
Sbjct: 289 LYDLEAETLSIAPARC 304


>gi|168014188|ref|XP_001759635.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162689174|gb|EDQ75547.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 485

 Score =  100 bits (248), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 97/370 (26%), Positives = 148/370 (40%), Gaps = 56/370 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------AIFDPR----KSSSFQKINCDHPD 45
           R+FIGTP++   LI+DTGS + Y             A FDPR     SSS+Q ++C+ PD
Sbjct: 102 RVFIGTPAQEFALIVDTGSTVTYVPCSSCTHCGHHQACFDPRFKPDNSSSYQTVSCNSPD 161

Query: 46  CTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF-HGALFGCSNDNHG 102
           C    C     QC Y   YA+ S +KG    +   ++G G G  +  H  LFGC     G
Sbjct: 162 CITKMCDARVHQCKYERVYAEMSSSKGVLGKD---LLGFGNGSRLQPHPLLFGCETAETG 218

Query: 103 FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
              D       G++GL R  +S + QL     ++  FS C         Y       G+ 
Sbjct: 219 ---DLYLQHADGIMGLGRGPLSIVDQLVGTGAMEDSFSLC---------YGGMDEGGGSM 266

Query: 161 MGYRRPSTQATKFINH-PN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           +    P   A  F    PN  N+Y L L +I +    +N P + F+    G  G ++DSG
Sbjct: 267 VLGAIPPPPAMVFAKSDPNRSNYYNLELSEIQVQGVSLNVPSEVFN----GRLGTVLDSG 322

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           +   Y     +    +         Q     D P    +C+    + ++  ++  +F   
Sbjct: 323 TTYAYLPDKAFDAFKDAITQQLGSLQAVPGPD-PSYPDVCFAGAGSDSK--ALGKHFPPV 379

Query: 278 NLRIDGENVFIIDYENHFFLLAVAP---------HDDLVALIGSQQQRDTRFVYDLNIDL 328
           +    G     +  EN+ F     P         + D   L+G    R+T   YD     
Sbjct: 380 DFVFSGNQKVFLAPENYLFKHTKVPGAYCLGFFKNQDATTLLGGIVVRNTLVTYDRANHQ 439

Query: 329 LSFVKENCSD 338
           + F K NC++
Sbjct: 440 IGFFKTNCTN 449


>gi|255544139|ref|XP_002513132.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223548143|gb|EEF49635.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 481

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 86/361 (23%), Positives = 145/361 (40%), Gaps = 49/361 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           +R+ +G+P +   +++D+GS +++               +FDP  S+SF  + C    C 
Sbjct: 144 IRIGVGSPPREQYVVIDSGSDIVWVQCQPCTQCYHQTDPVFDPADSASFMGVPCSSSVCE 203

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C    C Y + Y D S TKG  A ET++      G+ +      GC + N G  
Sbjct: 204 RIENAGCHAGGCRYEVMYGDGSYTKGTLALETLTF-----GRTVVRNVAIGCGHRNRGMF 258

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++S + QLG      FSYCLV     G  ++  L+FG   G  
Sbjct: 259 VGAAGLLGL-----GGGSMSLVGQLGGQTGGAFSYCLV---SRGTDSAGSLEFG--RGAM 308

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                    I +P   +FYY+ L  + +   ++    D F +   G GG ++D+G+ +T 
Sbjct: 309 PVGAAWIPLIRNPRAPSFYYIRLSGVGVGGMKVPISEDVFQLNEMGNGGVVMDTGTAVTR 368

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFED 276
             +  Y    + F+         Q  + P    +     CY L    + R P+++FYF  
Sbjct: 369 IPTVAYVAFRDAFI--------GQTGNLPRASGVSIFDTCYNLNGFVSVRVPTVSFYFAG 420

Query: 277 AN-LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
              L +   N  I   +   F  A A     +++IG+ QQ   +  +D     + F    
Sbjct: 421 GPILTLPARNFLIPVDDVGTFCFAFAASPSGLSIIGNIQQEGIQISFDGANGFVGFGPNV 480

Query: 336 C 336
           C
Sbjct: 481 C 481


>gi|356537928|ref|XP_003537458.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 445

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 99/380 (26%), Positives = 162/380 (42%), Gaps = 62/380 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +GTP + V ++LDTGS L          I ++F+P  SSS+  I C  P C     
Sbjct: 72  VSLTVGTPPQSVTMVLDTGSELSWLHCKKQQNINSVFNPHLSSSYTPIPCMSPICKTRTR 131

Query: 49  -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C  N  C  T+ YAD +  +G  A +T ++ G G+   IF G++      + G
Sbjct: 132 DFLIPVSCDSNNLCHVTVSYADFTSLEGNLASDTFAISGSGQPGIIF-GSM------DSG 184

Query: 103 FDEDA-RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           F  +A  D    G++G++R ++SF++Q+G     +FSYC+     +G+  S  L FG   
Sbjct: 185 FSSNANEDSKTTGLMGMNRGSLSFVTQMGF---PKFSYCI-----SGKDASGVLLFGDAT 236

Query: 162 GYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                  + T  +  N P  +     Y + L  I + ++ +  P + F    +G G  ++
Sbjct: 237 FKWLGPLKYTPLVKMNTPLPYFDRVAYTVRLMGIRVGSKPLQVPKEIFAPDHTGAGQTMV 296

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
           DSG+  T+    VY  L  +FV+   R  L  L D P       + LC+ +         
Sbjct: 297 DSGTRFTFLLGSVYTALRNEFVAQ-TRGVLTLLED-PNFVFEGAMDLCFRVRRGGVVPAV 354

Query: 268 PSMAFYFEDANLRIDGENVF--------IIDYENHFFLLAVAPHDDL---VALIGSQQQR 316
           P++   FE A + + GE +         +       + L     D L     +IG   Q+
Sbjct: 355 PAVTMVFEGAEMSVSGERLLYRVGGDGDVAKGNGDVYCLTFGNSDLLGIEAYVIGHHHQQ 414

Query: 317 DTRFVYDLNIDLLSFVKENC 336
           +    +DL    + F    C
Sbjct: 415 NVWMEFDLVNSRVGFADTKC 434


>gi|388504358|gb|AFK40245.1| unknown [Medicago truncatula]
          Length = 480

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/364 (25%), Positives = 156/364 (42%), Gaps = 54/364 (14%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK--------------INCDHPDCTYFKC 51
           IG  ++ + +I+DTGS L +   DP  S   Q+              + C+   C   + 
Sbjct: 137 IGLGNQNMTVIIDTGSDLTWVQCDPCMSCYSQQGPVFNPSNSSSYNSLLCNSSTCQNLQF 196

Query: 52  VN-----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                          C +T+ Y D S T G    E +S      G       +FGC  +N
Sbjct: 197 TTGNTEACESNNPSSCNHTVSYGDGSFTDGELGVEHLSF-----GGISVSNFVFGCGRNN 251

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G       G ++G++GL R  +S ISQ  +     FSYCL  P  +   + S +     
Sbjct: 252 KGLF-----GGVSGIMGLGRSNLSMISQTNTTFGGVFSYCL--PTTDSGASGSLVIGNES 304

Query: 161 MGYRRPSTQA-TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
             ++  +  A T  +++P  +NFY L+L  I +    +         T  G GG +IDSG
Sbjct: 305 SLFKNLTPIAYTSMVSNPQLSNFYVLNLTGIDVGGVAI-------QDTSFGNGGILIDSG 357

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFED 276
           +V+T     +Y  L  +F+  F  + +A        +  C+ L        P+++ +FE+
Sbjct: 358 TVITRLAPSLYNALKAEFLKQFSGYPIA---PALSILDTCFNLTGIEEVSIPTLSMHFEN 414

Query: 277 -ANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVK 333
             +L +D   +  +  +     LA+A   D   +A+IG+ QQR+ R +YD     + F +
Sbjct: 415 NVDLNVDAVGILYMPKDGSQVCLALASLSDENDMAIIGNYQQRNQRVIYDAKQSKIGFAR 474

Query: 334 ENCS 337
           E+CS
Sbjct: 475 EDCS 478


>gi|224109494|ref|XP_002315215.1| predicted protein [Populus trichocarpa]
 gi|222864255|gb|EEF01386.1| predicted protein [Populus trichocarpa]
          Length = 444

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/375 (24%), Positives = 163/375 (43%), Gaps = 58/375 (15%)

Query: 4   LFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY----- 48
           L IGTP + + ++LDTGS L +          +IF+P  S ++ KI C    C       
Sbjct: 71  LTIGTPPQNITMVLDTGSELSWLRCKKEPNFTSIFNPLASKTYTKIPCSSQTCKTRTSDL 130

Query: 49  ---FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDNHG 102
                C   + C + + YAD S  +G  A ET        G       +FGC  S  +  
Sbjct: 131 TLPVTCDPAKLCHFIISYADASSVEGHLAFETFRF-----GSLTRPATVFGCMDSGSSSN 185

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-----VIPLPNGEYTSSYLKF 157
            +EDA+     G++G++R ++SF++Q+G    ++FSYC+        L  GE   S+LK 
Sbjct: 186 TEEDAKT---TGLMGMNRGSLSFVNQMGF---RKFSYCISGLDSTGFLLLGEARYSWLK- 238

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
              + Y      +T         Y + L+ I ++N+ +  P   F    +G G  ++DSG
Sbjct: 239 --PLNYTPLVQISTPLPYFDRVAYSVQLEGIKVNNKVLPLPKSVFVPDHTGAGQTMVDSG 296

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPETFNRFPSM-- 270
           +  T+    VY  L ++F+   +   + ++ + P+      + LCY +  T +  P++  
Sbjct: 297 TQFTFLLGPVYSALRKEFL--LQTAGVLRVLNEPQYVFQGAMDLCYLIDSTSSTLPNLPV 354

Query: 271 -AFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFV 321
               F  A + + G+ +       +  ++  +       D+L     LIG  QQ++    
Sbjct: 355 VKLMFRGAEMSVSGQRLLYRVPGEVRGKDSVWCFTFGNSDELGISSFLIGHHQQQNVWME 414

Query: 322 YDLNIDLLSFVKENC 336
           YDL    + F +  C
Sbjct: 415 YDLENSRIGFAELRC 429


>gi|357133002|ref|XP_003568117.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 497

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 102/388 (26%), Positives = 171/388 (44%), Gaps = 81/388 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP K   + +DTGS +++                   A++DP+ SSS   ++CD
Sbjct: 89  TKIEIGTPPKPFHVQVDTGSDILWVNCVSCDKCPTKSGLGIDLALYDPKGSSSGSAVSCD 148

Query: 43  H-------------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGK 86
           +             P CT      + C Y  +Y D S T G    +++    + G  + +
Sbjct: 149 NKFCAATYGSGEKLPGCT----AGKPCEYRAEYGDGSSTAGSFVSDSLQYNQLSGNAQTR 204

Query: 87  AIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIP 144
                 +FGC     G D ++ + AL G++G  +   S +SQL S   +KK FS+CL   
Sbjct: 205 HAKANVIFGCGA-QQGGDLESTNQALDGIIGFGQSNTSTLSQLASAGEVKKIFSHCLDTI 263

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFD 203
              G +    +         +P  ++T  +  PN + Y ++L+ I +    +  PP  F+
Sbjct: 264 KGGGIFAIGEV--------VQPKVKSTPLL--PNMSHYNVNLQSIDVAGNALQLPPHIFE 313

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ--LCYFLP 261
              S + G IIDSG+ LTY    VY  +     + F++ Q          IQ  LC+   
Sbjct: 314 --TSEKRGTIIDSGTTLTYLPELVYKDI---LAAVFQKHQDITF----RTIQGFLCFEYS 364

Query: 262 ETF-NRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHD--DLVAL 309
           E+  + FP + F+FE D  L +        +G+N++ + ++N  F     P D  D+V L
Sbjct: 365 ESVDDGFPKITFHFEDDLGLNVYPHDYFFQNGDNLYCLGFQNGGF----QPKDAKDMV-L 419

Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +G     +   VYDL   ++ +   NCS
Sbjct: 420 LGDLVLSNKVVVYDLEKQVIGWTDYNCS 447


>gi|356573235|ref|XP_003554768.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 472

 Score = 99.8 bits (247), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 153/361 (42%), Gaps = 46/361 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            R+ +GTP++ V ++LDTGS +++               +FDP KS ++  I C  P C 
Sbjct: 131 TRIGVGTPARYVYMVLDTGSDVVWLQCAPCRKCYTQADPVFDPTKSRTYAGIPCGAPLCR 190

Query: 48  YF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                 C N+   C Y + Y D S T G  + ET++       +        GC +DN G
Sbjct: 191 RLDSPGCNNKNKVCQYQVSYGDGSFTFGDFSTETLTFR-----RTRVTRVALGCGHDNEG 245

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               A            R  +SF  Q G    ++FSYCLV    + + +S  + FG    
Sbjct: 246 LFIGAAGLLGL-----GRGRLSFPVQTGRRFNQKFSYCLVDRSASAKPSS--VVFGDSAV 298

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERM-NFPPDTFDITVSGEGGCIIDSGSV 219
            R  + + T  I +P  + FYYL L  IS+    +       F +  +G GG IIDSG+ 
Sbjct: 299 SR--TARFTPLIKNPKLDTFYYLELLGISVGGSPVRGLSASLFRLDAAGNGGVIIDSGTS 356

Query: 220 LTYFHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
           +T      Y  L + F    S+ +R     L D       C+ L   T  + P++  +F 
Sbjct: 357 VTRLTRPAYIALRDAFRVGASHLKRAAEFSLFD------TCFDLSGLTEVKVPTVVLHFR 410

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A++ +   N  I    +  F  A A     +++IG+ QQ+  R  +DL    + F    
Sbjct: 411 GADVSLPATNYLIPVDNSGSFCFAFAGTMSGLSIIGNIQQQGFRVSFDLAGSRVGFAPRG 470

Query: 336 C 336
           C
Sbjct: 471 C 471


>gi|225217008|gb|ACN85295.1| aspartic proteinase nepenthesin-1 precursor [Oryza coarctata]
          Length = 500

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 89/361 (24%), Positives = 147/361 (40%), Gaps = 47/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  I+C  P 
Sbjct: 162 VVTIGLGTPAGRYTVVFDTGSDTTWVQCEPCVVVCYKQQEKLFDPARSSTYANISCAAPA 221

Query: 46  CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+  Y K C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 222 CSDLYIKGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAIKGFRFGCGERNEG 277

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C     P     + YL FG    
Sbjct: 278 LYGEA-----AGLLGLGRGKTSLPVQAYDKYGGVFAHC----FPARSSGTGYLDFGPG-- 326

Query: 163 YRRPSTQAT----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
              P+  A       +++   FYY+ L  I +  + ++ P   F  +     G I+DSG+
Sbjct: 327 -SLPAVSAKLTTPMLVDNGPTFYYVGLTGIRVGGKLLSIPQSVFTTS-----GTIVDSGT 380

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
           V+T      Y  L   F S        + +     +  CY F   +    P+++  F+  
Sbjct: 381 VITRLPPAAYSSLRSAFASAMAERGYKK-APALSLLDTCYDFTGMSEVAIPTVSLLFQGG 439

Query: 277 ANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           A+L +     ++           A    DD V ++G+ Q +    VYD+   ++ F    
Sbjct: 440 ASLDVHASGIIYAASVSQACLGFAGNKEDDDVGIVGNTQLKTFGVVYDIGKKVVGFCPGA 499

Query: 336 C 336
           C
Sbjct: 500 C 500


>gi|225217056|gb|ACN85339.1| aspartic proteinase nepenthesin-1 precursor [Oryza granulata]
          Length = 521

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 183 VVTIGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYKQQEKLFDPARSSTYANVSCAAPA 242

Query: 46  CT--YFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+  Y + C    C+Y+++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 243 CSDLYTRGCSGGHCLYSVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 298

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 299 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 349

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               + Q T  +  N P  FYY+ +  I +  + ++ P   F        G I+DSG+V+
Sbjct: 350 AAVGARQTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFSTA-----GTIVDSGTVI 403

Query: 221 TYFHSDVYWKLHEKFVS------YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYF 274
           T      Y  L   F S      Y +   L+ L  C +      F   +    P ++  F
Sbjct: 404 TRLPPAAYSSLRSAFASAMAARGYKKAPALSLLDTCYD------FTGMSEVAIPKVSLLF 457

Query: 275 E-DANLRIDGENVFIIDYENHFFL-LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L ++   +      +   L  A    DD V ++G+ Q +    VYD+    + F 
Sbjct: 458 QGGAYLDVNASGIMYAASLSQVCLGFAANEDDDDVGIVGNTQLKTFGVVYDIGKKTVGFS 517

Query: 333 KENC 336
              C
Sbjct: 518 PGAC 521


>gi|222624820|gb|EEE58952.1| hypothetical protein OsJ_10633 [Oryza sativa Japonica Group]
          Length = 415

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 100/355 (28%), Positives = 149/355 (41%), Gaps = 51/355 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L IGTP + V L LDTGS LI+                FDP  SS+    +CD   C
Sbjct: 90  LVHLAIGTPPQPVQLTLDTGSDLIWTQCQPCPACFDQALPYFDPSTSSTLSLTSCDSTLC 149

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                     V ++  +D+            + +G G   A   G  FGC   N+G  + 
Sbjct: 150 QGLP------VASLPRSDK-----------FTFVGAG---ASVPGVAFGCGLFNNGVFKS 189

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYRR 165
                  G+ G  R  +S  SQL       FS+C    +     ++  L    D+    +
Sbjct: 190 NE----TGIAGFGRGPLSLPSQLKV---GNFSHCFTT-ITGAIPSTVLLDLPADLFSNGQ 241

Query: 166 PSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            + Q T  I +P N  FYYLSLK I++ + R+  P   F +  +G GG IIDSG+ +T  
Sbjct: 242 GAVQTTPLIQNPANPTFYYLSLKGITVGSTRLPVPESEFALK-NGTGGTIIDSGTAMTSL 300

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLRID 282
            + VY  + + F +   + +L  +S        C   P     + P +  +FE A + + 
Sbjct: 301 PTRVYRLVRDAFAA---QVKLPVVSGNTTDPYFCLSAPLRAKPYVPKLVLHFEGATMDLP 357

Query: 283 GEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            EN VF ++      L         V  IG+ QQ++   +YDL    LSFV   C
Sbjct: 358 RENYVFEVEDAGSSILCLAIIEGGEVTTIGNFQQQNMHVLYDLQNSKLSFVPAQC 412


>gi|357515001|ref|XP_003627789.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355521811|gb|AET02265.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 415

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/347 (26%), Positives = 153/347 (44%), Gaps = 43/347 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           IGTP   V   +DTGS L++               IFDP  SSS+Q I           C
Sbjct: 94  IGTPPFKVFGFVDTGSDLVWLQCEPCKQCYPQITPIFDPSLSSSYQNI----------PC 143

Query: 52  VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
           +++ C ++M+     V +G+ + ET+++         F   + GC   N G       G 
Sbjct: 144 LSDTC-HSMRTTSCDV-RGYLSVETLTLDSTTGYSVSFPKTMIGCGYRNTG----TFHGP 197

Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
            +G++GL    +S  SQLG+ I  +FSYCL   LPN   ++S L FG            T
Sbjct: 198 SSGIVGLGSGPMSLPSQLGTSIGGKFSYCLGPWLPN---STSKLNFGDAAIVYGDGAMTT 254

Query: 172 KFINH-PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
             +     + YYL+L+  S+ N+ + F   T+      EG  +IDSG+  T+   DVY++
Sbjct: 255 PIVKKDAQSGYYLTLEAFSVGNKLIEFGGPTYG---GNEGNILIDSGTTFTFLPYDVYYR 311

Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
                  Y     L  + D     +LCY +       P +  +F+ A++++   + F I 
Sbjct: 312 FESAVAEY---INLEHVEDPNGTFKLCYNVAYHGFEAPLITAHFKGADIKLYYISTF-IK 367

Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             +    LA  P     A+ G+  Q++    Y+L  + ++F   +C+
Sbjct: 368 VSDGIACLAFIPSQ--TAIFGNVAQQNLLVGYNLVQNTVTFKPVDCT 412


>gi|147858841|emb|CAN78694.1| hypothetical protein VITISV_037475 [Vitis vinifera]
          Length = 442

 Score = 99.4 bits (246), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 93/356 (26%), Positives = 154/356 (43%), Gaps = 40/356 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +  L IG P   V ++LDTGS L +               I++  KS S+ ++ C+ P C
Sbjct: 94  LANLSIGNPPTNVYVVLDTGSDLFWIQCEPCDVCYKQKDPIYNRTKSDSYTEMLCNEPPC 153

Query: 47  TYF----KCVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                  +C +   C+Y   YAD + T G  ++E ++       +       FGC   N 
Sbjct: 154 VSLGREGQCSDSGSCLYQTAYADGARTSGLLSYEKVAFTSHYSDEDKTAQVGFGCGLQNL 213

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTD 160
            F    RDG + G+       +S +S +G +  K F+YC   I  PN      +L FG D
Sbjct: 214 NFITSNRDGGVLGLGPGLVSLVSQLSAIGKV-SKSFAYCFGNISNPNA---GGFLVFG-D 268

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             Y                FYY++L  I   +   R++    +F+    G GG IIDSGS
Sbjct: 269 ATYLNGDMTPMVIAE----FYYVNLLGIGLGVGEPRLDINSSSFERKPDGSGGVIIDSGS 324

Query: 219 VLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE 275
            L+ F  +VY  +    V   ++ + ++ L+  P+    C+   +      FP++  Y E
Sbjct: 325 TLSVFPPEVYEVVRNAVVDKLKKGYNISPLTSSPD----CFEGKIERDLPLFPTLVLYLE 380

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
              +  D  ++F+  Y+   F L     + L ++IG+  Q+  +F Y+L +  LS 
Sbjct: 381 STGILNDRWSIFLQRYD-ELFCLGFTSGEGL-SIIGTLAQQSYKFGYNLELSTLSI 434


>gi|359476193|ref|XP_003631802.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 496

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 103/361 (28%), Positives = 159/361 (44%), Gaps = 52/361 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
           +V +  GTP +   LILDTGS++ +                FDP  S ++   +C     
Sbjct: 163 LVDVAFGTPPQKFTLILDTGSSITWTQCKPCVRCLKASRRHFDPSASLTYSLGSCIPSTV 222

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                      Y M Y D+S + G    +T+++    E   +F    FGC  +N G   D
Sbjct: 223 GN--------TYNMTYGDKSTSVGNYGCDTMTL----EHSDVFPKFQFGCGRNNEG---D 267

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
              GA  G+LGL +  +S +SQ  S  KK FSYC    LP  +   S L FG     +  
Sbjct: 268 FGSGA-DGMLGLGQGQLSTVSQTASKFKKVFSYC----LPEEDSIGSLL-FGEKATSQSS 321

Query: 167 STQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           S + T  +N P       + +Y++ L DIS+ N+R+N P   F        G IIDSG+V
Sbjct: 322 SLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSGTV 376

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-ED 276
           +T      Y  L   F     ++ L+       + +  CY L    +   P +  +F E 
Sbjct: 377 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEG 436

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++R++G+ V I   +     LA A + +L  +IG++QQ     +YD+    + F    C
Sbjct: 437 ADVRLNGKRV-IWGNDASRLCLAFAGNSELT-IIGNRQQVSLTVLYDIQGGRIGFGGNGC 494

Query: 337 S 337
           S
Sbjct: 495 S 495


>gi|125527257|gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group]
          Length = 484

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 106/407 (26%), Positives = 155/407 (38%), Gaps = 85/407 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----------------------------IFDPRK 32
           VR  +GTP++  LL+ DTGS L +                               F P K
Sbjct: 89  VRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAPAPASPRRTFRPDK 148

Query: 33  SSSFQKINCDHPDC------TYFKCVNEQ--CVYTMKYADQSVTKGFAA--HETISVIGK 82
           S ++  I C    C      +   C      C Y  +Y D S  +G       TI++ G+
Sbjct: 149 SRTWAPIPCSSATCRESLPFSLAACATPANPCAYDYRYKDGSAARGTVGVDSATIALSGR 208

Query: 83  GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
              KA   G + GC+   +G    A DG    VL L    ISF S+  S    RFSYCLV
Sbjct: 209 AARKAKLRGVVLGCTTSYNGQSFLASDG----VLSLGYSNISFASRAASRFGGRFSYCLV 264

Query: 143 IPLPNGEYTSSYLKFGTDMGY--RRPS----------------------TQATKFINHPN 178
             L     T SYL FG +  +  RRPS                       Q    ++H  
Sbjct: 265 DHLAPRNAT-SYLTFGPNPAFSSRRPSEGIASCKPAPAPTPAPAGAPGARQTPLVLDHRT 323

Query: 179 N-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
             FY +++K +S+  E +  P   +D  V   GG I+DSG+ LT      Y       V+
Sbjct: 324 RPFYAVTVKGVSVAGELLKIPRAVWD--VEQGGGAILDSGTSLTMLAKPAY----RAVVA 377

Query: 238 YFERFQLAQLSDCP-EPIQLCYFL-----PETFNRFPSMAFYFEDANLRIDGENVFIIDY 291
              + +LA L     +P   CY        +     P +A +F  +         ++ID 
Sbjct: 378 ALSK-RLAGLPRVTMDPFDYCYNWTSPSGSDVAAPLPMLAVHFAGSARLEPPAKSYVIDA 436

Query: 292 ENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                 + +   P   L ++IG+  Q++  + YDL    L F +  C
Sbjct: 437 APGVKCIGLQEGPWPGL-SVIGNILQQEHLWEYDLKNRRLRFKRSRC 482


>gi|242073262|ref|XP_002446567.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
 gi|241937750|gb|EES10895.1| hypothetical protein SORBIDRAFT_06g018180 [Sorghum bicolor]
          Length = 453

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 91/382 (23%), Positives = 167/382 (43%), Gaps = 67/382 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V+L IGTP       +DT S L++               IF+PR SSS+  + C    C
Sbjct: 89  LVKLGIGTPQHYFSAAIDTASDLVWLQCQPCVSCYRQLDPIFNPRLSSSYAVVPCSSDTC 148

Query: 47  TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           +     +C    ++ C Y  KY+  +VT G  A + ++V     G  +FH  + GCS+ +
Sbjct: 149 SQLDGHRCDEDDDQACRYNYKYSGNAVTNGTLAIDKLAV-----GGNVFHAVVLGCSDSS 203

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G          +G++GL+R  +S +SQL     +RF YCL  P+     T   L  G  
Sbjct: 204 VGGPPPQ----ASGLVGLARGPLSLLSQLSV---RRFMYCLPPPM---SRTPGKLVLGAG 253

Query: 161 MG---YRRPSTQATKFINHPN---NFYYLSLKDISIDNE------RMNFPPDTFDITV-- 206
            G    R  S + T  ++      ++YYL+   +++ ++      R   PP T       
Sbjct: 254 AGADAVRNVSDRVTVTMSSSTRYPSYYYLNFDGLAVGDQTPGTIRRPTSPPATGGGVGGG 313

Query: 207 -------SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCY 258
                  +   G I+D  S +++  + +Y +L +      E  +L + +      + LC+
Sbjct: 314 GGDGGSGANAYGMIVDVASTISFLEASLYDELADDL---EEEIRLPRATPSTRLGLDLCF 370

Query: 259 FLPETFN----RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
            LPE         P+++  F+   L ++ + +F+ D      ++        V+++G+ Q
Sbjct: 371 ILPEGVGIDRVYVPTVSMSFDGRWLELERDRLFLEDGRMMCLMIG---RTSGVSILGNYQ 427

Query: 315 QRDTRFVYDLNIDLLSFVKENC 336
           Q++   +Y+L    ++F K +C
Sbjct: 428 QQNMHVLYNLRRGKITFAKASC 449


>gi|297834938|ref|XP_002885351.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
 gi|297331191|gb|EFH61610.1| pepsin A [Arabidopsis lyrata subsp. lyrata]
          Length = 471

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 148/356 (41%), Gaps = 39/356 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP KS S+  ++C    C 
Sbjct: 134 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 193

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C +  C Y + Y D S TKG  A ET++       K +      GC + N G  
Sbjct: 194 RIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF-----AKTVVRNVAMGCGHRNRGMF 248

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QL       F YCLV     G  ++  L FG +    
Sbjct: 249 IGAAGLLGI-----GGGSMSFVGQLSGQTGGAFGYCLV---SRGTDSTGSLVFGREA--L 298

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                    + +P   +FYY+ LK + +   R+  P   FD+T +G+GG ++D+G+ +T 
Sbjct: 299 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 358

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLR 280
             +  Y    + F S       A           CY L    + R P+++FYF E   L 
Sbjct: 359 LPTGAYAAFRDGFKSQTANLPRASGVSI---FDTCYDLSGFVSVRVPTVSFYFTEGPVLT 415

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +   N  +   ++  +  A A     +++IG+ QQ   +  +D     + F    C
Sbjct: 416 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 471


>gi|86438622|emb|CAJ26370.1| chloroplast nucleoid binding protein [Brachypodium sylvaticum]
          Length = 443

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 97/381 (25%), Positives = 158/381 (41%), Gaps = 67/381 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP++ + ++ DTGS L +                 +F P  SS+F  + C  P
Sbjct: 86  VVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYHQQDPLFAPSSSSTFSAVRCGEP 145

Query: 45  DCTYFK--CV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI------FHGA 92
           +C   +  C     +++C Y + Y D+S T G   ++T+++       A         G 
Sbjct: 146 ECPRARQSCSSSPGDDRCPYEVVYGDKSRTVGHLGNDTLTLGTTPSTNASENNSNKLPGF 205

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
           +FGC  +N G       G   G+ GL R  +S  SQ      + FSYCL     N     
Sbjct: 206 VFGCGENNTGLF-----GKADGLFGLGRGKVSLSSQAAGKYGEGFSYCLPSSSSNAH--- 257

Query: 153 SYLKFGTDMGYRRPSTQATKF---INHPN--NFYYLSLKDISIDNE--RMNFPPDTFDIT 205
            YL  GT      P+    +F   +N  N  +FYY+ L  I +     +++  P  +   
Sbjct: 258 GYLSLGTPA----PAPAHARFTPMLNRSNTPSFYYVKLVGIRVAGRAIKVSSRPALW--- 310

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERF---QLAQLSDCPEPIQLCYFLPE 262
                G I+DSG+V+T      Y  L   F+S   ++   +  +LS     +  CY    
Sbjct: 311 ---PAGLIVDSGTVITRLAPRAYSALRTAFLSAMGKYGYKRAPRLSI----LDTCYDFTA 363

Query: 263 TFN---RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQR 316
             N     P++A  F   A + +D   V  +        LA AP+ +     ++G+ QQR
Sbjct: 364 HANATVSIPAVALVFAGGATISVDFSGVLYVAKVAQ-ACLAFAPNGNGRSAGILGNTQQR 422

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
               VYD+    + F  + CS
Sbjct: 423 TVAVVYDVGRQKIGFAAKGCS 443


>gi|413923780|gb|AFW63712.1| hypothetical protein ZEAMMB73_689747 [Zea mays]
          Length = 470

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 94/357 (26%), Positives = 139/357 (38%), Gaps = 65/357 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V   +GTP     L +DTGS L +                 +FDP +SSS+  + C   
Sbjct: 138 VVTASLGTPGMAQTLEVDTGSDLSWVQCKPCAAPSCYRQKDPLFDPAQSSSYAAVPCGRS 197

Query: 45  DCTYF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C         C   QC Y + Y D S T G  + +T+++       A   G LFGC + 
Sbjct: 198 ACAGLGIYASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----AANATVQGFLFGCGHA 253

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G      D    G+LG  R   S + Q        FSYC    LP    T+ YL  G 
Sbjct: 254 QSGGLFTGID----GLLGFGREQPSLVQQTAGAYGGVFSYC----LPTKSSTTGYLTLGG 305

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
             G   P    T+ +  PN   +Y + L  IS+  + ++ P   F        G ++D+G
Sbjct: 306 PSGV-APGFSTTQLLPSPNAPTYYVVMLTGISVGGQPLSVPASAF------AAGTVVDTG 358

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CY-FLPETFNRFPSMAFY 273
           +V+T      Y  L   F S    +  A       PI +   CY F         S+A  
Sbjct: 359 TVITRLPPAAYAALRSAFRSGMASYPSA------PPIGILDTCYSFAGYGTVNLTSVALT 412

Query: 274 FED-ANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNID 327
           F   A + +  + +        F  LA A    D  +A++G+ QQR     +++ ID
Sbjct: 413 FSSGATMTLGADGIM------SFGCLAFASSGSDGSMAILGNVQQRS----FEVRID 459


>gi|449515031|ref|XP_004164553.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 103/366 (28%), Positives = 161/366 (43%), Gaps = 57/366 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP    L I DTGS L +A              IF+P KS+SF  + C+   C
Sbjct: 93  LMSVSIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTC 152

Query: 47  TYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C V   C Y+  Y D++ +KG    E I+ IG    K++      GC + + G
Sbjct: 153 HAVDDGHCGVQGVCDYSYTYGDRTYSKGDLGFEKIT-IGSSSVKSV-----IGCGHASSG 206

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
                  G  +GV+GL    +S +SQ+   S I +RFSYCL   L    + +  + FG +
Sbjct: 207 -----GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINFGEN 258

Query: 161 MGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                P   +T  I+     +YY++L+ ISI NER         +  + +G  IIDSG+ 
Sbjct: 259 AVVSGPGVVSTPLISKNTVTYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTT 310

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM------AFY 273
           LT    ++Y  +     S  +  +  ++ D    + LC+   +  N   S+      A +
Sbjct: 311 LTILPKELYDGV---VSSLLKVVKAKRVKDPHGSLDLCF--DDGINAAASLGIPVITAHF 365

Query: 274 FEDANLRIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
              AN+ +   N F  + D  N   L A +P  +   +IG+  Q +    YDL    LSF
Sbjct: 366 SGGANVNLLPINTFRKVADNVNCLTLKAASPTTEF-GIIGNLAQANFLIGYDLEAKRLSF 424

Query: 332 VKENCS 337
               C+
Sbjct: 425 KPTVCA 430


>gi|242095602|ref|XP_002438291.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
 gi|241916514|gb|EER89658.1| hypothetical protein SORBIDRAFT_10g011210 [Sorghum bicolor]
          Length = 464

 Score = 99.0 bits (245), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 143/361 (39%), Gaps = 51/361 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           ++ + IGTP+   ++ +DTGS + +                 +FDP  S+++   +C   
Sbjct: 130 VITVTIGTPAVTQVMSIDTGSDVSWVQCAPCAAQSCSSQKDKLFDPAMSATYSAFSCGSA 189

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C         C+  QC Y +KY D S T G    +T+S+      K+      FGCS+ 
Sbjct: 190 QCAQLGDEGNGCLKSQCQYIVKYGDGSNTAGTYGSDTLSLTSSDAVKSF----QFGCSHR 245

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             GF      G L G++GL   T S +SQ  +   K FSYCL  P  +G     +L  G 
Sbjct: 246 AAGFV-----GELDGLMGLGGDTESLVSQTAATYGKAFSYCLPPPSSSG---GGFLTLGA 297

Query: 160 DMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             G        T  +      FY + L+ I++    +N P   F       G  ++DSG+
Sbjct: 298 AGGASSSRYSHTPMVRFSVPTFYGVFLQGITVAGTMLNVPASVF------SGASVVDSGT 351

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYF-E 275
           V+T      Y  L   F    + +  A        +  C+     FN    P++   F  
Sbjct: 352 VITQLPPTAYQALRTAFKKEMKAYPSAAPVGS---LDTCFDF-SGFNTITVPTVTLTFSR 407

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A + +D   +    Y       A A HD    ++G+ QQR    ++D+    + F    
Sbjct: 408 GAAMDLDISGIL---YAGCLAFTATA-HDGDTGILGNVQQRTFEMLFDVGGRTIGFRSGA 463

Query: 336 C 336
           C
Sbjct: 464 C 464


>gi|326523839|dbj|BAJ96930.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 473

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 147/378 (38%), Gaps = 59/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
           V+  +GTP++  +L+ DTGS L +                    +F P  S S+  I C 
Sbjct: 112 VQFRVGTPAQPFVLVADTGSDLTWVKCRGRRASSPDASPLASPRVFRPANSKSWAPIPCS 171

Query: 43  HPDCTY---FKCVN--------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAI 88
              C     F   N          C Y  +Y D+S  +G    +  ++   G G   KA 
Sbjct: 172 SDTCKSYVPFSLANCSAGTTPPAPCGYDYRYKDKSSARGVVGTDAATIALSGSGSDRKAK 231

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG 148
               + GC+    G    + DG    VL L    ISF S+  +    RFSYCLV  L   
Sbjct: 232 LQEVVLGCTTSYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAPR 287

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
             T SYL FG       PS            FY +++  +S+  + +N P + +D  V  
Sbjct: 288 NAT-SYLTFGPVGAAHSPSRTPLLLDAQVAPFYAVTVDAVSVAGKALNIPAEVWD--VKK 344

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
            GG I+DSG+ LT   +  Y  +         R     +    +P + CY    T  R P
Sbjct: 345 NGGAILDSGTSLTILATPAYKAVVAALSKQLARVPRVTM----DPFEYCYNW--TATRRP 398

Query: 269 SMAFYFE-----DANLRIDGENVFIIDYENHFFLL----AVAPHDDLVALIGSQQQRDTR 319
                 E      A LR   ++ ++ID       +     V P    V++IG+  Q++  
Sbjct: 399 PAVPRLEVRFAGSARLRPPTKS-YVIDAAPGVKCIGLQEGVWPG---VSVIGNILQQEHL 454

Query: 320 FVYDLNIDLLSFVKENCS 337
           + +DL    L F +  C+
Sbjct: 455 WEFDLANRWLRFQESRCA 472


>gi|18405138|ref|NP_565911.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
 gi|13877759|gb|AAK43957.1|AF370142_1 unknown protein [Arabidopsis thaliana]
 gi|15293231|gb|AAK93726.1| unknown protein [Arabidopsis thaliana]
 gi|20196976|gb|AAB87120.2| expressed protein [Arabidopsis thaliana]
 gi|20197046|gb|AAM14894.1| expressed protein [Arabidopsis thaliana]
 gi|330254616|gb|AEC09710.1| Eukaryotic aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 98.6 bits (244), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 145/324 (44%), Gaps = 56/324 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G P + + ++LDTGS L +          ++F+P  SS++  + C  P C     
Sbjct: 67  VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 126

Query: 49  -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
                  C      C   + YAD +  +G  AHET  +     G     G LFGC  S  
Sbjct: 127 DLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-----GSVTRPGTLFGCMDSGL 181

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
           +   +EDA+     G++G++R ++SF++QLG     +FSYC+     +G  +S +L  G 
Sbjct: 182 SSNSEEDAKS---TGLMGMNRGSLSFVNQLGF---SKFSYCI-----SGSDSSGFLLLGD 230

Query: 159 ------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
                   + Y     Q+T         Y + L+ I + ++ ++ P   F    +G G  
Sbjct: 231 ASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQT 290

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET---- 263
           ++DSG+  T+    VY  L  +F++  +   + +L D P+      + LCY +  T    
Sbjct: 291 MVDSGTQFTFLMGPVYTALKNEFIT--QTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPN 348

Query: 264 FNRFPSMAFYFEDANLRIDGENVF 287
           F+  P ++  F  A + + G+ + 
Sbjct: 349 FSGLPMVSLMFRGAEMSVSGQKLL 372


>gi|217074470|gb|ACJ85595.1| unknown [Medicago truncatula]
 gi|388505166|gb|AFK40649.1| unknown [Medicago truncatula]
          Length = 452

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 158/371 (42%), Gaps = 60/371 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
           V++ +G+P+K   +I+DTGS+  +                +F+P  S +++ + C     
Sbjct: 105 VKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQC 164

Query: 42  --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                   + P C+     +  CVY   Y D S + G+ + + +++             +
Sbjct: 165 SSLKSATLNEPTCSK---QSNACVYKASYGDSSFSLGYLSQDVLTLTPS----QTLSSFV 217

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--PNGEYT 151
           +GC  DN G       G   G++GL+   +S +SQL       FSYCL      PN    
Sbjct: 218 YGCGQDNQGLF-----GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSP-K 271

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             +L  GT       S + T  + +PNN   Y++ L+ I++    +     ++ +     
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT--- 328

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCY--FLPETFNR 266
              IIDSG+V+T   + VY  L   +V+   +++Q A        +  C+   L      
Sbjct: 329 ---IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISL---LDTCFKGSLAGISEV 382

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            P +   F+  A+L++ G N  +++ E     LA+A     +A+IG+ QQ+  +  YD+ 
Sbjct: 383 APDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSS-IAIIGNYQQQTVKVAYDVG 440

Query: 326 IDLLSFVKENC 336
              + F    C
Sbjct: 441 NSRVGFAPGGC 451


>gi|357448247|ref|XP_003594399.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355483447|gb|AES64650.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 452

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/371 (23%), Positives = 158/371 (42%), Gaps = 60/371 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
           V++ +G+P+K   +I+DTGS+  +                +F+P  S +++ + C     
Sbjct: 105 VKMGLGSPTKYYTMIVDTGSSFSWLQCQPCTIYCHIQEDPVFNPSASKTYKTVPCSSSQC 164

Query: 42  --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                   + P C+     +  CVY   Y D S + G+ + + +++             +
Sbjct: 165 SSLKSATLNEPTCSK---QSNACVYKASYGDSSFSLGYLSQDVLTLTPS----QTLSSFV 217

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--PNGEYT 151
           +GC  DN G       G   G++GL+   +S +SQL       FSYCL      PN    
Sbjct: 218 YGCGQDNQGLF-----GRTDGIIGLANNELSMLSQLSGKYGNAFSYCLPTSFSTPNSP-K 271

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             +L  GT       S + T  + +PNN   Y++ L+ I++    +     ++ +     
Sbjct: 272 EGFLSIGTSSLTPSSSYKFTPLLKNPNNPSLYFIDLESITVAGRPLGVAASSYKVPT--- 328

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCY--FLPETFNR 266
              IIDSG+V+T   + VY  L   +V+   +++Q A        +  C+   L      
Sbjct: 329 ---IIDSGTVITRLPTPVYTTLKNAYVTILSKKYQQAPGISL---LDTCFKGSLAGISEV 382

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            P +   F+  A+L++ G N  +++ E     LA+A     +A+IG+ QQ+  +  YD+ 
Sbjct: 383 APDIRIIFKGGADLQLKGHNS-LVELETGITCLAMAGSSS-IAIIGNYQQQTVKVAYDVG 440

Query: 326 IDLLSFVKENC 336
              + F    C
Sbjct: 441 NSRVGFAPGGC 451


>gi|222637180|gb|EEE67312.1| hypothetical protein OsJ_24552 [Oryza sativa Japonica Group]
          Length = 420

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 153/374 (40%), Gaps = 81/374 (21%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + + +GTP     ++ DTGS LI+                F P  SS+F K+ C    C 
Sbjct: 88  MNISVGTPLLTFSVVADTGSDLIWTQCAPCTKCFQQPAPPFQPASSSTFSKLPCTSSFCQ 147

Query: 48  YF-----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           +       C    CVY  KY     T G+ A ET+ V     G A F    FGCS +N  
Sbjct: 148 FLPNSIRTCNATGCVYNYKYG-SGYTAGYLATETLKV-----GDASFPSVAFGCSTEN-- 199

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                         GL ++ +            RFSYCL      G   +S + FG+   
Sbjct: 200 --------------GLGQLDLGV---------GRFSYCLRSGSAAG---ASPILFGSLAN 233

Query: 163 YRRPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGCIIDSGS 218
               + Q+T F+N+P    ++YY++L  I++    +     TF  T +G  GG I+DSG+
Sbjct: 234 LTDGNVQSTPFVNNPAVHPSYYYVNLTGITVGETDLPVTTSTFGFTQNGLGGGTIVDSGT 293

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLS--DCPEPIQLCY----------FLPETFNR 266
            LTY   D Y  + + F+S     Q A ++  +    + LC+           +P    R
Sbjct: 294 TLTYLAKDGYEMVKQAFLS-----QTADVTTVNGTRGLDLCFKSTGGGGGGIAVPSLVLR 348

Query: 267 FPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           F   A Y      A +  D +    +       ++  A  D  +++IG+  Q D   +YD
Sbjct: 349 FDGGAEYAVPTYFAGVETDSQGSVTV----ACLMMLPAKGDQPMSVIGNVMQMDMHLLYD 404

Query: 324 LNIDLLSFVKENCS 337
           L+  + SF   +C+
Sbjct: 405 LDGGIFSFAPADCA 418


>gi|79407941|ref|NP_188636.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75273243|sp|Q9LHE3.1|ASPG2_ARATH RecName: Full=Protein ASPARTIC PROTEASE IN GUARD CELL 2;
           Short=AtASPG2; Flags: Precursor
 gi|11994777|dbj|BAB03167.1| nucleoid chloroplast DNA-binding protein-like [Arabidopsis
           thaliana]
 gi|28392860|gb|AAO41867.1| unknown protein [Arabidopsis thaliana]
 gi|332642798|gb|AEE76319.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 470

 Score = 98.6 bits (244), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 87/356 (24%), Positives = 148/356 (41%), Gaps = 39/356 (10%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           VR+ +G+P +   +++D+GS +++               +FDP KS S+  ++C    C 
Sbjct: 133 VRIGVGSPPRDQYMVIDSGSDMVWVQCQPCKLCYKQSDPVFDPAKSGSYTGVSCGSSVCD 192

Query: 48  YFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
             +   C +  C Y + Y D S TKG  A ET++       K +      GC + N G  
Sbjct: 193 RIENSGCHSGGCRYEVMYGDGSYTKGTLALETLTF-----AKTVVRNVAMGCGHRNRGMF 247

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             A              ++SF+ QL       F YCLV     G  ++  L FG +    
Sbjct: 248 IGAAGLLGI-----GGGSMSFVGQLSGQTGGAFGYCLV---SRGTDSTGSLVFGREA--L 297

Query: 165 RPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                    + +P   +FYY+ LK + +   R+  P   FD+T +G+GG ++D+G+ +T 
Sbjct: 298 PVGASWVPLVRNPRAPSFYYVGLKGLGVGGVRIPLPDGVFDLTETGDGGVVMDTGTAVTR 357

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLR 280
             +  Y    + F S       A           CY L    + R P+++FYF E   L 
Sbjct: 358 LPTAAYVAFRDGFKSQTANLPRASGVSI---FDTCYDLSGFVSVRVPTVSFYFTEGPVLT 414

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +   N  +   ++  +  A A     +++IG+ QQ   +  +D     + F    C
Sbjct: 415 LPARNFLMPVDDSGTYCFAFAASPTGLSIIGNIQQEGIQVSFDGANGFVGFGPNVC 470


>gi|293332735|ref|NP_001168472.1| uncharacterized protein LOC100382248 [Zea mays]
 gi|223948487|gb|ACN28327.1| unknown [Zea mays]
          Length = 434

 Score = 98.2 bits (243), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 54/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP++   ++ DTGS   +                +FDP KS+++  I+C    
Sbjct: 97  VVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSY 156

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S T GF A +T+++              FGC   N G
Sbjct: 157 CSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLA-----YDTIKNFRFGCGEKNRG 211

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG+LGL R   S   Q        F+YC    LP     + +L    D+G
Sbjct: 212 L-----FGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYC----LPATSAGTGFL----DLG 258

Query: 163 YRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
              P+  A      ++    FYY+ +  I +    +  P   F        G ++DSG+V
Sbjct: 259 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTV 313

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN---RFPSMAFYF 274
           +T      Y  L   F    +  Q    S  P    +  CY L          P+++  F
Sbjct: 314 ITRLPPSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 370

Query: 275 E-DANLRIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L +D   + ++ D        A    D  VA++G+ QQ+    +YD+   ++ F 
Sbjct: 371 QGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 430

Query: 333 KENC 336
              C
Sbjct: 431 PGAC 434


>gi|357166728|ref|XP_003580821.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 479

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 154/368 (41%), Gaps = 44/368 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +VR+ IG+P     L+ DTGS +I+               +FDP  S+SF  + C+   C
Sbjct: 124 LVRVGIGSPPLEQHLVADTGSDVIWVQCSPCSDCYAQGDPLFDPANSASFSPVPCNSGVC 183

Query: 47  --------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
                   +       +C Y + Y D+S T G  A ET+++ G  E      G   GC +
Sbjct: 184 RAAARYSSSSCGGGGGECEYKVSYGDKSYTNGVLALETLTLDGGTE----VQGVAMGCGH 239

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
           +N G   +A     AG+LGL    +S + QLG      FSYCL          S  L  G
Sbjct: 240 ENRGLFAEA-----AGLLGLGWGPMSLVGQLGGAAGGAFSYCLAGYYSGEGSGSGSLVLG 294

Query: 159 TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            +      +      + +P+  +FYY+ +  + +  ER+      FD+   G GG ++D+
Sbjct: 295 REDAAPTGAVW-VPLVRNPDAPSFYYVGVNGLGVAGERLQLQDGLFDLGDDGGGGVVMDT 353

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF- 274
           G+ +T   ++ Y  L   F   FE  + A  +        CY L    + R P++A YF 
Sbjct: 354 GTAVTRLPAEAYAALRGAFAGAFE--EGAPRAPGVSLFDTCYDLSGYASVRVPTVALYFG 411

Query: 275 ------EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
                 E A+L +   N+ +   +   + LA A      +++G+ QQ+      D     
Sbjct: 412 GGGQGQEAASLTLPARNLLVPVDDGGTYCLAFAAVASGPSILGNIQQQGIEITVDSASGY 471

Query: 329 LSFVKENC 336
           + F    C
Sbjct: 472 VGFGPATC 479


>gi|449461377|ref|XP_004148418.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449518059|ref|XP_004166061.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 436

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 90/362 (24%), Positives = 158/362 (43%), Gaps = 48/362 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR+ +GTP + + ++LDT +   +A             F  + SS+F  ++C  P+CT 
Sbjct: 96  VVRVQLGTPGQTMYMVLDTSNDAAWAPCSGCIGCSSTTTFSAQNSSTFATLDCSKPECTQ 155

Query: 49  FKCV------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            + +      N  C++   Y   S        +++ +     G  +     FGC +   G
Sbjct: 156 ARGLSCPTTGNVDCLFNQTYGGDSTFSATLVQDSLHL-----GPNVIPNFSFGCISSASG 210

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                +     G++GL R  +S ISQ GS+    FSYCL  P     Y S  LK G  +G
Sbjct: 211 SSIPPQ-----GLMGLGRGPLSLISQSGSLYSGLFSYCL--PSFKSYYFSGSLKLG-PVG 262

Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             + + + T  +++P+  + YY++L  IS+    +   P+      +   G IIDSG+V+
Sbjct: 263 QPK-AIRTTPLLHNPHRPSLYYVNLTGISVGRVLVPISPELLAFDPNTGAGTIIDSGTVI 321

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDAN 278
           T F   +Y  + ++F     R Q+            C+    T N    P++  +    +
Sbjct: 322 TRFVPAIYTAVRDEF-----RKQVGGSFSPLGAFDTCF---ATNNEVSAPAITLHLSGLD 373

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           L++  EN  I         LA+A        +V +I + QQ++ R ++D+N   L   +E
Sbjct: 374 LKLPMENSLIHSSAGSLACLAMAAAPNNVNSVVNVIANLQQQNHRILFDINNSKLGIARE 433

Query: 335 NC 336
            C
Sbjct: 434 LC 435


>gi|26451756|dbj|BAC42973.1| unknown protein [Arabidopsis thaliana]
          Length = 442

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 83/324 (25%), Positives = 145/324 (44%), Gaps = 56/324 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G P + + ++LDTGS L +          ++F+P  SS++  + C  P C     
Sbjct: 67  VTLAVGDPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 126

Query: 49  -----FKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SND 99
                  C      C   + YAD +  +G  AHET  +     G     G LFGC  S  
Sbjct: 127 DLPIPASCDPKTHLCHVAISYADATSIEGNLAHETFVI-----GSVTRPGTLFGCMDSGL 181

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG- 158
           +   +EDA+     G++G++R ++SF++QLG     +FSYC+     +G  +S +L  G 
Sbjct: 182 SSNSEEDAKS---TGLMGMNRGSLSFVNQLGF---SKFSYCI-----SGSDSSVFLLLGD 230

Query: 159 ------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
                   + Y     Q+T         Y + L+ I + ++ ++ P   F    +G G  
Sbjct: 231 ASYSWLGPIQYTPLVLQSTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTGAGQT 290

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET---- 263
           ++DSG+  T+    VY  L  +F++  +   + +L D P+      + LCY +  T    
Sbjct: 291 MVDSGTQFTFLMGPVYTALKNEFIT--QTKSVLRLVDDPDFVFQGTMDLCYKVGSTTRPN 348

Query: 264 FNRFPSMAFYFEDANLRIDGENVF 287
           F+  P ++  F  A + + G+ + 
Sbjct: 349 FSGLPMVSLMFRGAEMSVSGQKLL 372


>gi|224142013|ref|XP_002324355.1| predicted protein [Populus trichocarpa]
 gi|222865789|gb|EEF02920.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 91/366 (24%), Positives = 149/366 (40%), Gaps = 59/366 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP K   L  DTGS L +                 FDP  S+S++ ++C    
Sbjct: 141 VVTVGLGTPKKDFTLSFDTGSDLTWTQCEPCLGGCFPQNQPKFDPTTSTSYKNVSCSSEF 200

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C++  C+Y ++Y     T GF A ET+++        +F   LFGCS
Sbjct: 201 CKLIAEGNYPAQDCISNTCLYGIQYGS-GYTIGFLATETLAI----ASSDVFKNFLFGCS 255

Query: 98  NDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            ++ G F+         G+LGL R  I+  SQ  +  K  FSYC    LP    ++ +L 
Sbjct: 256 EESRGTFN------GTTGLLGLGRSPIALPSQTTNKYKNLFSYC----LPASPSSTGHLS 305

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           FG ++     ST  +  +      Y L+   IS+    +   P    I+ +     IIDS
Sbjct: 306 FGVEVSQAAKSTPISPKLKQ---LYGLNTVGISVRGREL---PINGSISRT-----IIDS 354

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN---RFPSMAFY 273
           G+  T+  S  Y  L   F      + L   +   +P   CY      N     P ++ +
Sbjct: 355 GTTFTFLPSPTYSALGSAFREMMANYTLTNGTSSFQP---CYDFSNIGNGTLTIPGISIF 411

Query: 274 FE---DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           FE   +  + + G  + +   +      A    D   A+ G+ QQ+    +YD+   ++ 
Sbjct: 412 FEGGVEVEIDVSGIMIPVNGLKEVCLAFADTGSDSDFAIFGNYQQKTYEVIYDVAKGMVG 471

Query: 331 FVKENC 336
           F  + C
Sbjct: 472 FAPKGC 477


>gi|24417232|gb|AAN60226.1| unknown [Arabidopsis thaliana]
          Length = 464

 Score = 98.2 bits (243), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 154/356 (43%), Gaps = 44/356 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP   + L+ DTGS L +                 F+P  SS++Q ++C  P 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 192

Query: 46  CTYFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-F 103
           C   + C    CVY++ Y D+S T+GF A E  ++        +     FGC  +N G F
Sbjct: 193 CEDAESCSASNCVYSIGYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQGLF 248

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D  A    L          +S  +Q  +     FSYCL     N   ++ +L FG+  G 
Sbjct: 249 DGVAGLLGLG------PGKLSLPAQTTTTYNNIFSYCLPSFTSN---STGHLTFGS-AGI 298

Query: 164 RRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
              S + T   + P+ F Y + +  IS+ ++ +   P++F        G IIDSG+V T 
Sbjct: 299 SE-SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTR 352

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN-LR 280
             + VY +L   F    E+    + +        CY F       +P++AF F     + 
Sbjct: 353 LPTKVYAELRSVFK---EKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGGTVVE 409

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +DG  +  +  +     LA A +DDL A+ G+ QQ     VYD+    + F    C
Sbjct: 410 LDGSGIS-LPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|449440931|ref|XP_004138237.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 103/357 (28%), Positives = 145/357 (40%), Gaps = 41/357 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHP 44
            R+ +G P +    + DTGS +                 I  IFDP+ SSS+  ++CD  
Sbjct: 186 ARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSE 245

Query: 45  DCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
            C       C    C+Y ++Y D S T G  A ET S                GC +DN 
Sbjct: 246 QCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGHDNE 301

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    A               IS  SQL +     FSYCLV        +SS L F  D 
Sbjct: 302 GLFVGADGLIGL-----GGGAISLSSQLEAT---SFSYCLV---DLDSESSSTLDFNADQ 350

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ++   K    P  F Y+ +  +S+  + +     +F+I  SG GG I+DSG+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPT-FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-L 279
              SDVY  L + FV   +    A       P   CY L    N   P++AF     N L
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPGENSL 466

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++  +N  I       F LA  P    +++IG+ QQ+  R  YDL   L+ F  + C
Sbjct: 467 QLPAKNCLIQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|413943688|gb|AFW76337.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
 gi|413943689|gb|AFW76338.1| hypothetical protein ZEAMMB73_223549 [Zea mays]
          Length = 499

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 54/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP++   ++ DTGS   +                +FDP KS+++  I+C    
Sbjct: 162 VVPVRLGTPAERFTVVFDTGSDTTWVQCQPCVAYCYRQKEPLFDPTKSATYANISCSSSY 221

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S T GF A +T+++              FGC   N G
Sbjct: 222 CSDLYVSGCSGGHCLYGIQYGDGSYTIGFYAQDTLTLA-----YDTIKNFRFGCGEKNRG 276

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG+LGL R   S   Q        F+YC    LP     + +L    D+G
Sbjct: 277 L-----FGRAAGLLGLGRGKTSLPVQAYDKYGGVFAYC----LPATSAGTGFL----DLG 323

Query: 163 YRRPSTQA---TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
              P+  A      ++    FYY+ +  I +    +  P   F        G ++DSG+V
Sbjct: 324 PGAPAANARLTPMLVDRGPTFYYVGMTGIKVGGHVLPIPGSVFSTA-----GTLVDSGTV 378

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN---RFPSMAFYF 274
           +T      Y  L   F    +  Q    S  P    +  CY L          P+++  F
Sbjct: 379 ITRLPPSAYAPLRSAFS---KAMQGLGYSAAPAFSILDTCYDLTGHKGGSIALPAVSLVF 435

Query: 275 E-DANLRIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L +D   + ++ D        A    D  VA++G+ QQ+    +YD+   ++ F 
Sbjct: 436 QGGACLDVDASGILYVADVSQACLAFAPNADDTDVAIVGNTQQKTHGVLYDIGKKIVGFA 495

Query: 333 KENC 336
              C
Sbjct: 496 PGAC 499


>gi|255568540|ref|XP_002525244.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223535541|gb|EEF37210.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 460

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 89/366 (24%), Positives = 156/366 (42%), Gaps = 59/366 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           ++L +G+P K   +ILDTGS+L +                +F+P  S++++ + C   +C
Sbjct: 122 LKLGLGSPPKYYTMILDTGSSLSWLQCKPCVVYCHSQVDPLFEPSASNTYRPLYCSSSEC 181

Query: 47  TYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           +  K              CVYT  Y D S + G+ + + +++              +GC 
Sbjct: 182 SLLKAATLNDPLCTASGVCVYTASYGDASYSMGYLSRDLLTL----TPSQTLPSFTYGCG 237

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            DN G       G  AG++GL+R  +S ++QL       FSYCL     +G     +L  
Sbjct: 238 QDNEGLF-----GKAAGIVGLARDKLSMLAQLSPKYGYAFSYCLPTSTSSG---GGFL-- 287

Query: 158 GTDMGYRRPST-QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
              +G   PS+ + T  I +  N   Y+L L  I++    +      + +        II
Sbjct: 288 --SIGKISPSSYKFTPMIRNSQNPSLYFLRLAAITVAGRPVGVAAAGYQVPT------II 339

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
           DSG+V+T     +Y  L E FV    R    +    P    +  C+    ++ +  P + 
Sbjct: 340 DSGTVVTRLPISIYAALREAFVKIMSR----RYEQAPAYSILDTCFKGSLKSMSGAPEIR 395

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             F+  A+L +   N+ +I+ +     LA A  +  +A+IG+ QQ+     YD++   + 
Sbjct: 396 MIFQGGADLSLRAPNI-LIEADKGIACLAFASSNQ-IAIIGNHQQQTYNIAYDVSASKIG 453

Query: 331 FVKENC 336
           F    C
Sbjct: 454 FAPGGC 459


>gi|15238250|ref|NP_196637.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|8979710|emb|CAB96831.1| nucleoid DNA-binding protein cnd41-like protein [Arabidopsis
           thaliana]
 gi|18176136|gb|AAL59990.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|22136986|gb|AAM91722.1| putative nucleoid DNA-binding protein cnd41 [Arabidopsis thaliana]
 gi|110740988|dbj|BAE98588.1| nucleoid DNA-binding protein cnd41 - like protein [Arabidopsis
           thaliana]
 gi|332004210|gb|AED91593.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 464

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 98/356 (27%), Positives = 155/356 (43%), Gaps = 44/356 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + IGTP   + L+ DTGS L +                 F+P  SS++Q ++C  P 
Sbjct: 133 IVTIGIGTPKHDLSLVFDTGSDLTWTQCEPCLGSCYSQKEPKFNPSSSSTYQNVSCSSPM 192

Query: 46  CTYFK-CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-F 103
           C   + C    CVY++ Y D+S T+GF A E  ++        +     FGC  +N G F
Sbjct: 193 CEDAESCSASNCVYSIVYGDKSFTQGFLAKEKFTLTNSD----VLEDVYFGCGENNQGLF 248

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           D  A    L          +S  +Q  +     FSYCL     N   ++ +L FG+  G 
Sbjct: 249 DGVAGLLGLG------PGKLSLPAQTTTTYNNIFSYCLPSFTSN---STGHLTFGS-AGI 298

Query: 164 RRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
              S + T   + P+ F Y + +  IS+ ++ +   P++F        G IIDSG+V T 
Sbjct: 299 SE-SVKFTPISSFPSAFNYGIDIIGISVGDKELAITPNSFSTE-----GAIIDSGTVFTR 352

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDAN-LR 280
             + VY +L   F    E+    + +        CY F       +P++AF F  +  + 
Sbjct: 353 LPTKVYAELRSVFK---EKMSSYKSTSGYGLFDTCYDFTGLDTVTYPTIAFSFAGSTVVE 409

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +DG  +  +  +     LA A +DDL A+ G+ QQ     VYD+    + F    C
Sbjct: 410 LDGSGIS-LPIKISQVCLAFAGNDDLPAIFGNVQQTTLDVVYDVAGGRVGFAPNGC 464


>gi|67633548|gb|AAY78698.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 392

 Score = 97.8 bits (242), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 154/358 (43%), Gaps = 52/358 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++L +GTP   +   +DTGS LI+               IFDP  SS+F++  C+    
Sbjct: 62  LMKLQVGTPPFEIEAEIDTGSDLIWTQCMPCTNCYSQYAPIFDPSNSSTFKEKRCN---- 117

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
                    C Y + YAD + +KG  A ET+++        +      GC +++  F   
Sbjct: 118 ------GNSCHYKIIYADTTYSKGTLATETVTIHSTSGEPFVMPETTIGCGHNSSWFKP- 170

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
                 +G++GLS    S I+Q+G       SYC           +S + FGT+      
Sbjct: 171 ----TFSGMVGLSWGPSSLITQMGGEYPGLMSYCF------ASQGTSKINFGTNAIVAGD 220

Query: 167 STQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
              +T           YYL+L  +S+ +  +     TF    + EG  IIDSG+ LTYF 
Sbjct: 221 GVVSTTMFLTTAKPGLYYLNLDAVSVGDTHVETMGTTFH---ALEGNIIIDSGTTLTYFP 277

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DANLRIDG 283
              Y  L  + V ++      + +D      LCY+  +T + FP +  +F   A+L +D 
Sbjct: 278 VS-YCNLVREAVDHY--VTAVRTADPTGNDMLCYYT-DTIDIFPVITMHFSGGADLVLDK 333

Query: 284 ENVFIIDYENHFFLLAV----APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            N++I       F LA+     P D   A+ G++ Q +    YD +  L+ F   NCS
Sbjct: 334 YNMYIETITRGTFCLAIICNNPPQD---AIFGNRAQNNFLVGYDSSSLLVFFSPTNCS 388


>gi|449462551|ref|XP_004149004.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449515029|ref|XP_004164552.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 434

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 103/362 (28%), Positives = 164/362 (45%), Gaps = 43/362 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ +FIGTP   V+ I DTGS L +               IF+PR+SSS++K++C    C
Sbjct: 91  LMSIFIGTPPVNVIAIADTGSDLTWTQCLPCRECFNQSQPIFNPRRSSSYRKVSCASDTC 150

Query: 47  TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
              +  +     + C Y   Y D+S T G  A + I++     G       + GC + N 
Sbjct: 151 RSLESYHCGPDLQSCSYGYSYGDRSFTYGDLASDQITI-----GSFKLPKTVIGCGHQNG 205

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G       G  +G++GL   ++S +SQ+ +I  +K RFSYCL     N   T + + FG 
Sbjct: 206 G----TFGGVTSGIIGLGGGSLSLVSQMRTIAGVKPRFSYCLPTFFSNANITGT-ISFGR 260

Query: 160 DMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                     +T  +   P+ FY+L+L+ IS+  +R  F        ++  G  IIDSG+
Sbjct: 261 KAVVSGRQVVSTPLVPRSPDTFYFLTLEAISVGKKR--FKAANGISAMTNHGNIIIDSGT 318

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCYFLPETFN-RFPSMAFYFE- 275
            LT     +Y+ +     S   R   A+  D P  I +LCY   +  +   P +  +F  
Sbjct: 319 TLTLLPRSLYYGV----FSTLARVIKAKRVDDPSGILELCYSAGQVDDLNIPIITAHFAG 374

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A++++   N F    +N    L  AP    VA+ G+  Q +    YDL    LSF  + 
Sbjct: 375 GADVKLLPVNTFAPVADN-VTCLTFAPATQ-VAIFGNLAQINFEVGYDLGNKRLSFEPKL 432

Query: 336 CS 337
           C+
Sbjct: 433 CA 434


>gi|147811402|emb|CAN61225.1| hypothetical protein VITISV_006732 [Vitis vinifera]
          Length = 440

 Score = 97.8 bits (242), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 162/360 (45%), Gaps = 37/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++++ IGTP   V  I DTGS L++               +FDP KS+SF++++C+   C
Sbjct: 92  LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQC 151

Query: 47  TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                V+     + C ++  Y D S+ +G  A ET+++             +FGC ++N 
Sbjct: 152 RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPXSIXNIVFGCGHNNS 211

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFG 158
           G F+E+       G+ G     +S  SQ+ S +   ++FS CLV P       +S + FG
Sbjct: 212 GTFNENE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFG 265

Query: 159 TDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            +         +T  +   +  +Y+++L  IS+ ++   F   +    ++ +G   ID+G
Sbjct: 266 PEAEVSGSXVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAG 322

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           +  T    D Y +L +      E   +  + D     QLCY    T    P +  +F+ A
Sbjct: 323 TPPTLLPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGA 378

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++++   N FI   E   +  A+ P D    + G+  Q +    +DL+   +SF   +C+
Sbjct: 379 DVQLKPLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 437


>gi|357439021|ref|XP_003589787.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
 gi|355478835|gb|AES60038.1| Aspartic proteinase nepenthesin-2 [Medicago truncatula]
          Length = 456

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 142/359 (39%), Gaps = 57/359 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
           VR+ IG+P+    +++D+GS +++               IF+P  S+SF  + C    C 
Sbjct: 131 VRIGIGSPAIYQYMVIDSGSDIVWIQCEPCDQCYNQTDPIFNPATSASFIGVACSSNVCN 190

Query: 48  YF----KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                  C   +C Y + Y D S TKG  A ETI++     G+ +      GC + N G 
Sbjct: 191 QLDDDVACRKGRCGYQVAYGDGSYTKGTLALETITI-----GRTVIQDTAIGCGHWNEGM 245

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A               +SF+ QLG+     F YCLV         S  +  G     
Sbjct: 246 FVGAAGLLGL-----GGGPMSFVGQLGAQTGGAFGYCLV---------SRAMPVG----- 286

Query: 164 RRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                     I++P   +FYY+SL  +++   R+      F +T  G GG ++D+G+ +T
Sbjct: 287 ----AMWVPLIHNPFYPSFYYVSLSGLAVGGIRVPISEQIFQLTDIGTGGVVMDTGTAIT 342

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN 278
              +  Y    + F++     Q   L   P       CY L      R P+++FYF    
Sbjct: 343 RLPTVAYNAFRDAFIA-----QTTNLPRAPGVSIFDTCYDLNGFVTVRVPTVSFYFSGGQ 397

Query: 279 -LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            L     N  I   +   F  A AP    +++IG+ QQ   +   D     + F    C
Sbjct: 398 ILTFPARNFLIPADDVGTFCFAFAPSPSGLSIIGNIQQEGIQVSIDGTNGFVGFGPNVC 456


>gi|225436202|ref|XP_002271145.1| PREDICTED: probable aspartic protease At2g35615-like [Vitis
           vinifera]
          Length = 440

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 162/360 (45%), Gaps = 37/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++++ IGTP   V  I DTGS L++               +FDP KS+SF++++C+   C
Sbjct: 92  LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCESQQC 151

Query: 47  TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                V+     + C ++  Y D S+ +G  A ET+++             +FGC ++N 
Sbjct: 152 RLLDTVSCSQPQKLCDFSYGYGDGSLAQGVIATETLTLNSNSGQPTSILNIVFGCGHNNS 211

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFG 158
           G F+E+       G+ G     +S  SQ+ S +   ++FS CLV P       +S + FG
Sbjct: 212 GTFNENE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFG 265

Query: 159 TDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            +         +T  +   +  +Y+++L  IS+ ++   F   +    ++ +G   ID+G
Sbjct: 266 PEAEVSGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAG 322

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           +  T    D Y +L +      E   +  + D     QLCY    T    P +  +F+ A
Sbjct: 323 TPPTLLPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGA 378

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++++   N FI   E   +  A+ P D    + G+  Q +    +DL+   +SF   +C+
Sbjct: 379 DVQLKPLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 437


>gi|255585473|ref|XP_002533429.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223526717|gb|EEF28949.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 448

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 93/365 (25%), Positives = 155/365 (42%), Gaps = 50/365 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V   +G P+   L I+DTGS +++               + DP KSS++  + C +  C
Sbjct: 100 LVNFSMGQPATPQLAIMDTGSNILWVRCAPCKRCTQQNGPLLDPSKSSTYASLPCTNTMC 159

Query: 47  TYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            Y          QC Y + YA    + G  A E +      EG       +FGCS++N  
Sbjct: 160 HYAPSAYCNRLNQCGYNLSYATGLSSAGVLATEQLIFHSSDEGVNAVPSVVFGCSHENG- 218

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              D +D    GV GL +   SF++++GS    +FSYCL   + +  Y  + L FG    
Sbjct: 219 ---DYKDRRFTGVFGLGKGITSFVTRMGS----KFSYCLG-NIADPHYGYNQLVFGEKAN 270

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +   ST     +   N  YY++L+ IS+  +R++     F +    E   +IDSG+ LT+
Sbjct: 271 FEGYSTP----LKVVNGHYYVTLEGISVGEKRLDIDSTAFSMK-GNEKSALIDSGTALTW 325

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
                +  L  +     +   +            CY   + +    FP + F+F   A+L
Sbjct: 326 LAESAFRALDNEVRQLLDGVLMPFWRGSFA----CYKGTVSQDLIGFPVVTFHFSGGADL 381

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLV--------ALIGSQQQRDTRFVYDLNIDLLSF 331
            +D E++F   Y+    +L +A              ++IG   Q+     YDLN + L F
Sbjct: 382 DLDTESMF---YQATPDILCIAVRQASAYGNDFKSFSVIGLMAQQYYNMAYDLNSNKLFF 438

Query: 332 VKENC 336
            + +C
Sbjct: 439 QRIDC 443


>gi|147776733|emb|CAN74676.1| hypothetical protein VITISV_038368 [Vitis vinifera]
          Length = 389

 Score = 97.4 bits (241), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 82/280 (29%), Positives = 123/280 (43%), Gaps = 38/280 (13%)

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           C Y + Y D S T+G   HE +       G  +    +FGC  +N G       G ++G+
Sbjct: 76  CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 125

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQA-TKFI 174
           +GL R  +S ISQ   I    FSYCL  P    + + S +  G    YR  S  +  K I
Sbjct: 126 MGLGRSDLSLISQTSGIFGGVFSYCL--PSTERKGSGSLILGGNSSVYRNSSPISYAKMI 183

Query: 175 NHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
            +P   NFY+++L  ISI    +  P         G    ++DSG+V+T     +Y  L 
Sbjct: 184 ENPQLYNFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALK 236

Query: 233 EKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
            +F+  F  F        P P    +  C+ L        P++  +FE +A L +D   V
Sbjct: 237 AEFLKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGV 289

Query: 287 FII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           F     D       LA   + D VA++G+ QQ++ R +YD
Sbjct: 290 FYFVKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYD 329


>gi|218188634|gb|EEC71061.1| hypothetical protein OsI_02803 [Oryza sativa Indica Group]
          Length = 479

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 92/375 (24%), Positives = 149/375 (39%), Gaps = 70/375 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           ++ + +G+P+    +++DTGS + +                 A+FDP  SS++   NC  
Sbjct: 136 VISVGLGSPAMTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSA 195

Query: 44  PDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
             C       E        +C Y +KY D S T G  + + +++     G  +  G  FG
Sbjct: 196 AACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL----SGSDVVRGFQFG 251

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSY 154
           CS+   G   D +     G++GL     S +SQ  +   K FSYCL   P  +G  T   
Sbjct: 252 CSHAELGAGMDDK---TDGLIGLGGDAQSLVSQTAARYGKSFSYCLPATPASSGFLTLGA 308

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
              G   G  R +T           +Y+ +L+DI++  +++   P  F        G ++
Sbjct: 309 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVF------AAGSLV 362

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RF 267
           DSG+V+T      Y  L   F +   R+  A      EP+ +   L   FN         
Sbjct: 363 DSGTVITRLPPAAYAALSSAFRAGMTRYARA------EPLGI---LDTCFNFTGLDKVSI 413

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFL----LAVAP--HDDLVALIGSQQQRDTRFV 321
           P++A  F             ++D + H  +    LA AP   D     IG+ QQR    +
Sbjct: 414 PTVALVFAGGA---------VVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 464

Query: 322 YDLNIDLLSFVKENC 336
           YD+   +  F    C
Sbjct: 465 YDVGGGVFGFRAGAC 479


>gi|449515033|ref|XP_004164554.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 430

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 101/367 (27%), Positives = 164/367 (44%), Gaps = 59/367 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + IGTP    + + DTGS L++A              IFDP KS+SF  + C+  +C
Sbjct: 93  LMSVSIGTPPVDYIGMADTGSDLMWAQCLPCLKCYKQSRPIFDPLKSTSFSHVPCNSQNC 152

Query: 47  TYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
              K +++        C Y+  Y DQ+ TKG    E I+ IG    K++      GC   
Sbjct: 153 ---KAIDDSHCGAQGVCDYSYTYGDQTYTKGDLGFEKIT-IGSSSVKSV-----IGC--- 200

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             G +     G  +GV+GL    +S +SQ+   S I +RFSYCL   L    + +  + F
Sbjct: 201 --GHESGGGFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINF 255

Query: 158 GTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G +     P   +T  I+ +P  +YY++L+ ISI NER         +  + +G  IIDS
Sbjct: 256 GQNAVVSGPGVVSTPLISKNPVTYYYVTLEAISIGNER--------HMASAKQGNVIIDS 307

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFNRFPSMAFY 273
           G+ L++   ++Y  +     S  +  +  ++ D      LC+       T +  P +   
Sbjct: 308 GTTLSFLPKELYDGV---VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQ 364

Query: 274 FE-DANLRIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           F   AN+ +   N F  + +  N   L   +P D+   +IG+    +    YDL    LS
Sbjct: 365 FSGGANVNLLPVNTFQKVANNVNCLTLTPASPTDEF-GIIGNLALANFLIGYDLEAKRLS 423

Query: 331 FVKENCS 337
           F    C+
Sbjct: 424 FKPTVCT 430


>gi|357153697|ref|XP_003576537.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 474

 Score = 97.1 bits (240), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/373 (25%), Positives = 154/373 (41%), Gaps = 67/373 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           +G  +    +++DT S L +               +FDP  S S+  + C+   C   + 
Sbjct: 124 VGLGAAEATVVVDTASELTWVQCQPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALRV 183

Query: 52  V-----------NEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                       NEQ   C Y + Y D S ++G  A + + + G+        G +FGC 
Sbjct: 184 AMAAGTSPCADDNEQQPACSYALSYRDGSYSRGVLARDKLRLAGQD-----IEGFVFGCG 238

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLK 156
             N G    A  G  +G++GL R  +S +SQ        FSYC    LP  E  SS  L 
Sbjct: 239 TSNQG----APFGGTSGLMGLGRSHVSLVSQTMDQFGGVFSYC----LPMRESGSSGSLV 290

Query: 157 FGTDMGYRRPSTQA--TKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            G D    R ST    T  ++        FY+L+L  I++  + +  P  +        G
Sbjct: 291 LGDDSSAYRNSTPIVYTAMVSDSGPLQGPFYFLNLTGITVGGQEVESPWFS-------AG 343

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RF 267
             IIDSG+++T     VY  +  +F+S     QLA+    P    +  C+ L      + 
Sbjct: 344 RVIIDSGTIITTLVPSVYNAVRAEFLS-----QLAEYPQAPAFSILDTCFNLTGLKEVQV 398

Query: 268 PSMAFYFEDA-NLRIDGENVFII---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           PS+ F FE +  + +D + V      D       LA    +   ++IG+ QQ++ R ++D
Sbjct: 399 PSLKFVFEGSVEVEVDSKGVLYFVSSDASQVCLALASLKSEYDTSIIGNYQQKNLRVIFD 458

Query: 324 LNIDLLSFVKENC 336
                + F +E C
Sbjct: 459 TLGSQIGFAQETC 471


>gi|356558304|ref|XP_003547447.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 336

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 99/353 (28%), Positives = 159/353 (45%), Gaps = 52/353 (14%)

Query: 4   LFIGTPSKGVLLILDTGSALIYA-------IFDPRKSSSFQKINCDHPDCTYFKCVNEQC 56
           L IG P    L+I+DT S +++        +FDP KSS+F  + C  P C +  C  +  
Sbjct: 13  LSIGQPPIPQLVIMDTSSDILWIMCNHVGLLFDPSKSSTFSPL-CKTP-CGFKGCKCDPI 70

Query: 57  VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
            + + Y D+S T G    +T+      EG +     L  C + N GF+ D       G+ 
Sbjct: 71  PFNISYVDKSSTSGTFGSDTVVFETTDEGHSQIFDVLVRCGH-NIGFNTDP---GYNGIR 126

Query: 117 GLSRVTISFISQLGSIIKKRFSYCLV-IPLPNGEYTSSYLKFGTDM-GYRRPSTQATKFI 174
           GL+    S  +++G    ++FSYC+  +  P   Y    L  G D+ GY  P      F 
Sbjct: 127 GLNNGPNSLATKIG----QKFSYCVGNLADPYYNYNQLILCEGADLEGYSTP------FE 176

Query: 175 NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
            H + FYY++LK I +  +R++  P TF+I  +  GG I DSG+ +TY    V+  L+ +
Sbjct: 177 VH-HGFYYVTLKGIIVGEKRLDIAPITFEIKGNNTGGVIRDSGTTITYLVDSVHKLLYNE 235

Query: 235 ---FVSYFERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFYFED-ANLRIDGENVFI 288
               +S+  R             QLC++  +      FP + F+F D A+L +D  + F 
Sbjct: 236 VRNLLSWSFR-------------QLCHYGIISRDLVGFPVVTFHFADGADLALDTGSFF- 281

Query: 289 IDYENHFFLLAVAPHDDLVALIGSQ-----QQRDTRFVYDLNIDLLSFVKENC 336
            +  N    + V+P   L   I         Q+     YDL  + + F + +C
Sbjct: 282 -NQLNSILCMTVSPASILNTTISPSVIELLAQQSYNVGYDLLTNFVYFQRIDC 333


>gi|125555058|gb|EAZ00664.1| hypothetical protein OsI_22685 [Oryza sativa Indica Group]
          Length = 465

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 100/371 (26%), Positives = 149/371 (40%), Gaps = 59/371 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V L IGTP+   ++++DTGS L +                 +FDP  SSS+  + CD  
Sbjct: 119 VVTLGIGTPAVQQIVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 178

Query: 45  DCTYFK-------CVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
            C           C +     C Y ++Y +++ T G  + ET++ +  G   A F    F
Sbjct: 179 ACRKLAAGAYGHGCTSGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG---F 234

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC +  HG  E        G+LGL     S +SQ  S     FSYC    LP     + +
Sbjct: 235 GCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGGAGF 285

Query: 155 LKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           L  G        ST A  F+  P         FY ++L  IS+    +  PP  F     
Sbjct: 286 LALGAPNSSSS-STAAAGFLFTPMRRIPSVPTFYVVTLTGISVGGAPLAVPPSAF----- 339

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
              G +IDSG+V+T   +  Y  L   F S    ++L   S+    +  CY F   T   
Sbjct: 340 -SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-AVLDTCYDFTGHTNVT 397

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            P++A  F   A + +      ++D        A A  DD + +IG+  QR    +YD  
Sbjct: 398 VPTIALTFSGGATIDLATPAGVLVD---GCLAFAGAGTDDTIGIIGNVNQRTFEVLYDSG 454

Query: 326 IDLLSFVKENC 336
              + F    C
Sbjct: 455 KGTVGFRAGAC 465


>gi|359476204|ref|XP_002262813.2| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Vitis vinifera]
          Length = 460

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 98/361 (27%), Positives = 156/361 (43%), Gaps = 54/361 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V +  GTP   + LILDTGS++ +                FD   SS++          
Sbjct: 129 LVDVAFGTPXTEIXLILDTGSSITWTQCKACVNCLQDSNRYFDSSASSTY---------- 178

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           ++  C+    +  Y M Y D S + G    +T+++    E   +F    FGC  +N G  
Sbjct: 179 SFGSCIPSTVENNYNMTYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNKGDF 234

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
               DG    +LGL +  +S +SQ  S   K FSYCL    P  +   S L FG     +
Sbjct: 235 GSGVDG----MLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQ 285

Query: 165 RPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
             S + T  +N P     + +Y+++L DIS+ NER+N P   F        G IIDS +V
Sbjct: 286 SSSLKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTV 340

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-ED 276
           +T      Y  L   F     ++ L+       + +  CY L    +   P +  +F   
Sbjct: 341 ITRLPQRAYSALKAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGG 400

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A++R++G N+ +   +     LA A   +L  +IG++QQ     +YD+    + F    C
Sbjct: 401 ADVRLNGTNI-VWGSDASRLCLAFAGTSELT-IIGNRQQLSLTVLYDIQGRRIGFGGNGC 458

Query: 337 S 337
           S
Sbjct: 459 S 459


>gi|242046218|ref|XP_002460980.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
 gi|241924357|gb|EER97501.1| hypothetical protein SORBIDRAFT_02g038700 [Sorghum bicolor]
          Length = 517

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 108/380 (28%), Positives = 167/380 (43%), Gaps = 57/380 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSAL--------------IYAIFDPRKSSSFQKINCDHPDC 46
           ++ +++GTP +   +I+DTGS L              +  +FDP  SSS++ + C    C
Sbjct: 152 LMDVYVGTPPRRFRMIMDTGSDLNWLQCAPCLDCFDQVGPVFDPAASSSYRNVTCGDQRC 211

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGALF 94
                        +   + C Y   Y DQS T G  A E  T+++   G  + +    +F
Sbjct: 212 GLVAPPEPPRACRRPGEDSCPYYYWYGDQSNTTGDLALESFTVNLTAPGASRRV-DDVVF 270

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G          AG+LGL R  +SF SQL ++    FSYCLV    +G   +S 
Sbjct: 271 GCGHWNRGLFH-----GAAGLLGLGRGPLSFASQLRAVYGHTFSYCLVD---HGSDVASK 322

Query: 155 LKFG----TDMGYRRPSTQATKFI---NHPNNFYYLSLKDISIDNERMNFPPDTF--DIT 205
           + FG      +    P    T F    +  + FYY+ LK + +  E +N   DT+     
Sbjct: 323 VVFGEDDALALAAAHPQLNYTAFAPASSPADTFYYVKLKGVLVGGELLNISSDTWGVGEG 382

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL----- 260
             G GG IIDSG+ L+YF    Y  + + F+    R     + D P  +  CY +     
Sbjct: 383 EGGSGGTIIDSGTTLSYFVEPAYQVIRQAFIDRMGR-SYPLIPDFPV-LSPCYNVSGVDR 440

Query: 261 PETFNRFPSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
           PE     P ++  F D A      EN FI +D +    L  +      +++IG+ QQ++ 
Sbjct: 441 PEV----PELSLLFADGAVWDFPAENYFIRLDPDGIMCLAVLGTPRTGMSIIGNFQQQNF 496

Query: 319 RFVYDLNIDLLSFVKENCSD 338
             VYDL  + L F    C++
Sbjct: 497 HVVYDLKNNRLGFAPRRCAE 516


>gi|218194599|gb|EEC77026.1| hypothetical protein OsI_15382 [Oryza sativa Indica Group]
          Length = 409

 Score = 96.7 bits (239), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 167/383 (43%), Gaps = 73/383 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
             + IGTP+K   + +DTGS +++                    ++DP+ SS+  K++CD
Sbjct: 6   TEIGIGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCD 65

Query: 43  H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
                       P CT     +  C Y++ Y D S T G+   + +    V G G+ +  
Sbjct: 66  QGFCAATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPA 121

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
                FGC +   G D  + + AL G++G  +   S +SQL +   +KK F++CL     
Sbjct: 122 NSTVTFGCGSQQGG-DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTING 180

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
            G +    +         +P  + T  + N P+  Y ++LK I +    +  P   FD  
Sbjct: 181 GGIFAIGNV--------VQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD-- 228

Query: 206 VSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
            +GE  G IIDSG+ LTY    VY    E  ++ F + +     +  E   LC+ ++   
Sbjct: 229 -TGEKKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQE--FLCFQYVGRV 282

Query: 264 FNRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
            + FP + F+FE D  L +        +G+N++ + ++N       +     + L+G   
Sbjct: 283 DDDFPKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQ---SKDGKGMVLLGDLV 339

Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
             +   VYDL   ++ + + NCS
Sbjct: 340 LSNKLVVYDLENQVIGWTEYNCS 362


>gi|357154085|ref|XP_003576664.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 509

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/378 (25%), Positives = 156/378 (41%), Gaps = 64/378 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP++ + ++ DTGS L +                 +F P  SS+F  + C   
Sbjct: 155 VVSVGLGTPARDLTVVFDTGSDLSWVQCGPCSSGGCYKQQDPLFAPSDSSTFSAVRCGAR 214

Query: 45  DCTYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVI------GKGEGKAIFHGAL 93
           +C   +       +++C Y + Y D+S T+G   ++T+++          E      G +
Sbjct: 215 ECRARQSCGGSPGDDRCPYEVVYGDKSRTQGHLGNDTLTLGTMAPANASAENDNKLPGFV 274

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC  +N G    A      G+ GL R  +S  SQ      + FSYCL     +      
Sbjct: 275 FGCGENNTGLFGQAD-----GLFGLGRGKVSLSSQAAGKFGEGFSYCLPS---SSSSAPG 326

Query: 154 YLKFGTDMGYRRPS-TQATKFINHPN--NFYYLSLKDISIDNE--RMNFPPDTFDITVSG 208
           YL  GT +    P+  Q T  +N     +FYY+ L  I +     R++ P     +    
Sbjct: 327 YLSLGTPV--PAPAHAQFTPMLNRTTTPSFYYVKLVGIRVAGRAIRVSSPRVALPL---- 380

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERF---QLAQLSDCPEPIQLCYFLPETFN 265
               I+DSG+V+T      Y  L   F+S   ++   +  +LS     +  CY      N
Sbjct: 381 ----IVDSGTVITRLAPRAYRALRAAFLSAMGKYGYKRAPRLSI----LDTCYDFTAHAN 432

Query: 266 ---RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTR 319
                P++A  F   A + +D   V  +        LA AP+ D     ++G+ QQR   
Sbjct: 433 ATVSIPAVALVFAGGATISVDFSGVLYVAKVAQ-ACLAFAPNGDGRSAGILGNTQQRTLA 491

Query: 320 FVYDLNIDLLSFVKENCS 337
            VYD+    + F  + CS
Sbjct: 492 VVYDVARQKIGFAAKGCS 509


>gi|449441618|ref|XP_004138579.1| PREDICTED: uncharacterized protein LOC101220661 [Cucumis sativus]
          Length = 2819

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/377 (25%), Positives = 161/377 (42%), Gaps = 63/377 (16%)

Query: 2    VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
            V L +G+P + V ++LDTGS L +          ++F+P  SSS+  I C  P C     
Sbjct: 1002 VSLTVGSPPQQVTMVLDTGSELSWLHCKKSPNLTSVFNPLSSSSYSPIPCSSPICRTRTR 1061

Query: 49   -----FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                   C  ++ C   + YAD S  +G  A +   +     G +   G LFGC +   G
Sbjct: 1062 DLPNPVTCDPKKLCHAIVSYADASSLEGNLASDNFRI-----GSSALPGTLFGCMDS--G 1114

Query: 103  FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--- 158
            F  ++  D    G++G++R ++SF++QLG     +FSYC+     +G  +S  L FG   
Sbjct: 1115 FSSNSEEDAKTTGLMGMNRGSLSFVTQLG---LPKFSYCI-----SGRDSSGVLLFGDLH 1166

Query: 159  ----TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
                 ++ Y      +T         Y + L  I + N+ +  P   F    +G G  ++
Sbjct: 1167 LSWLGNLTYTPLVQISTPLPYFDRVAYTVQLDGIRVGNKILPLPKSIFAPDHTGAGQTMV 1226

Query: 215  DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE--TFNRF 267
            DSG+  T+    VY  L  +F+    +  LA L D P       + LCY +         
Sbjct: 1227 DSGTQFTFLLGPVYTALRNEFLEQ-TKGVLAPLGD-PNFVFQGAMDLCYSVAAGGKLPTL 1284

Query: 268  PSMAFYFEDANLRIDGENVFIIDYE----NHFFLLAVAPHDDLVAL----IGSQQQRDTR 319
            PS++  F  A + + GE +     E    N +       + DL+ +    IG   Q++  
Sbjct: 1285 PSVSLMFRGAEMVVGGEVLLYRVPEMMKGNEWVYCLTFGNSDLLGIEAFVIGHHHQQNVW 1344

Query: 320  FVYDLNIDLLSFVKENC 336
                +  DL++F  + C
Sbjct: 1345 ----MEFDLVAFAADLC 1357


>gi|225216995|gb|ACN85284.1| aspartic proteinase nepenthesin-1 precursor [Oryza australiensis]
          Length = 519

 Score = 96.7 bits (239), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 88/359 (24%), Positives = 143/359 (39%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  I+C  P 
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANISCAAPA 240

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 241 CSDLDTRGCSGGNCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 347

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               +   T  +  N P  FYY+ +  I +  + ++ P   F        G I+DSG+V+
Sbjct: 348 AAAGARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFTTA-----GTIVDSGTVI 401

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DAN 278
           T      Y  L   F S        + +     +  CY F   +    P+++  F+  A 
Sbjct: 402 TRLPPAAYSSLRSAFASAMAARGYKK-APAVSLLDTCYDFTGMSQVAIPTVSLLFQGGAR 460

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +D   +      +   L   A  D   V ++G+ Q +     YD+   ++ F    C
Sbjct: 461 LDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFSPGAC 519


>gi|359482097|ref|XP_002271077.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 458

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 89/379 (23%), Positives = 170/379 (44%), Gaps = 62/379 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCT---- 47
           V L +GTP + V ++LDTGS L +            FDP +SSS+  + C    CT    
Sbjct: 87  VSLTVGTPPQNVSMVLDTGSELSWLRCNKTQTFQTTFDPNRSSSYSPVPCSSLTCTDRTR 146

Query: 48  ----YFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--SNDN 100
                  C  N+ C   + YAD S ++G  A +T  +     G +   G +FGC  S+ +
Sbjct: 147 DFPIPASCDSNQLCHAILSYADASSSEGNLASDTFYI-----GNSDMPGTIFGCMDSSFS 201

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
              +ED+++    G++G++R ++SF+SQ+      +FSYC+     + +++   L    +
Sbjct: 202 TNTEEDSKN---TGLMGMNRGSLSFVSQMD---FPKFSYCIS----DSDFSGVLLLGDAN 251

Query: 161 MGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
             +  P        I+ P  +     Y + L+ I + ++ +  P   F    +G G  ++
Sbjct: 252 FSWLMPLNYTPLIQISTPLPYFDRVAYTVQLEGIKVSSKLLPLPKSVFVPDHTGAGQTMV 311

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETFNR 266
           DSG+  T+    VY  L  +F++  +  Q+ ++ + P       + LCY +P    +   
Sbjct: 312 DSGTQFTFLLGPVYSALRNEFLN--QTSQILRVLEDPNYVFQGGMDLCYRVPLSQTSLPW 369

Query: 267 FPSMAFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDLVA----LIGSQQQRD 317
            P+++  F  A +++ G+ +       +   +  +      + DL+A    +IG   Q++
Sbjct: 370 LPTVSLMFRGAEMKVSGDRLLYRVPGEVRGSDSVYCFTFG-NSDLLAVEAYVIGHHHQQN 428

Query: 318 TRFVYDLNIDLLSFVKENC 336
               +DL    + F +  C
Sbjct: 429 VWMEFDLEKSRIGFAQVQC 447


>gi|357120129|ref|XP_003561782.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 452

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/393 (24%), Positives = 151/393 (38%), Gaps = 73/393 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------AIFDPRKSSSFQKINCDHPDCTYFK-- 50
           V + +GTP + V ++LDTGS L +         A FD   SSS+  + C  P CT+    
Sbjct: 65  VPVAVGTPPQNVTMVLDTGSELSWLLCNGSRHDAPFDASASSSYAPVPCSSPACTWLGRD 124

Query: 51  ------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
                 C +  C  ++ YAD S   G  A +T  ++G     A     LFGC   ++   
Sbjct: 125 LPVRPFCDSSACRVSLSYADASSADGLLAADTF-LLGSSPMPA-----LFGCIT-SYSSS 177

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TDMG 162
            D  +    G+LG++R  +SF++Q  +   +RF+YC+      G+     L  G  T+  
Sbjct: 178 TDPSETPPTGLLGMNRGGLSFVTQTAT---RRFAYCIAA----GQGPGILLLGGNDTETP 230

Query: 163 YRRPSTQATKF-----INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
              P  Q   +     I+ P  +     Y + L+ I + +  +  P        +G G  
Sbjct: 231 LTSPPQQQLNYTPLVEISQPLPYFDRAAYTVQLEGIRVGSALLAIPKHLLTPDHTGAGQT 290

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPETFN 265
           ++DSG+  T+   D Y  L  +F +   R     L+   EP          C+   E   
Sbjct: 291 MVDSGTRFTFLLPDAYAALKAEFANQLTRSLDGGLAPLGEPGFVFQGAFDACFRGTEARV 350

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD----------------DLVA- 308
              +      +  L + G  V +   E    LL   P +                D+   
Sbjct: 351 SAAAAGGLLPEVGLVLRGAEVVVAGAEK---LLYRVPGERRGEGEGVWCLTFGSSDMAGV 407

Query: 309 ---LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
              +IG   Q+D    YDL    L F    C+D
Sbjct: 408 SAYVIGHHHQQDVWVEYDLRNARLGFAAARCAD 440


>gi|359476195|ref|XP_002268758.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
 gi|296082174|emb|CBI21179.3| unnamed protein product [Vitis vinifera]
          Length = 460

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 100/361 (27%), Positives = 146/361 (40%), Gaps = 66/361 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQ----- 55
           +V +  GTP +   LI+DTGS   +             I C+   C+   C N++     
Sbjct: 130 LVNVGFGTPQQKFNLIIDTGSDTTW-------------IQCN--SCSLGNCHNKKTFNPS 174

Query: 56  ---------CV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                    C+      YTMKY D S +KG    + +++        +F    FGC +  
Sbjct: 175 LSSSYSNRSCIPSTDTNYTMKYEDNSYSKGVFVCDEVTL-----KPDVFPKFQFGCGDSG 229

Query: 101 HGFDEDARDGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            G       G  +GVLGL++    S ISQ  S  KK+FSYC     P  E+T   L FG 
Sbjct: 230 GG-----EFGTASGVLGLAKGEQYSLISQTASKFKKKFSYC----FPPKEHTLGSLLFGE 280

Query: 160 DMGYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
                 PS + T+ +N P+   Y++ L  IS+  +R+N     F        G IIDSG+
Sbjct: 281 KAISASPSLKFTQLLNPPSGLGYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGT 335

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYFLPETFNR---FPSMAFY 273
           V+T   +  Y  L   F    E      +S  P+   +  CY L     R    P +  +
Sbjct: 336 VITRLPTAAYEALRTAFQQ--EMLHCPSISPPPQEKLLDTCYNLKGCGGRNIKLPEIVLH 393

Query: 274 F---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           F    D +L   G      D        A   +   V +IG++QQ   + VYD+    L 
Sbjct: 394 FVGEVDVSLHPSGILWANGDLTQACLAFARKSNPSHVTIIGNRQQVSLKVVYDIEGGRLG 453

Query: 331 F 331
           F
Sbjct: 454 F 454


>gi|38344991|emb|CAE01597.2| OSJNBa0008A08.5 [Oryza sativa Japonica Group]
 gi|116309515|emb|CAH66581.1| OSIGBa0137O04.7 [Oryza sativa Indica Group]
 gi|222628622|gb|EEE60754.1| hypothetical protein OsJ_14310 [Oryza sativa Japonica Group]
          Length = 494

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/379 (25%), Positives = 166/379 (43%), Gaps = 73/379 (19%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
           IGTP+K   + +DTGS +++                    ++DP+ SS+  K++CD    
Sbjct: 95  IGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFC 154

Query: 44  --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
                   P CT     +  C Y++ Y D S T G+   + +    V G G+ +      
Sbjct: 155 AATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTV 210

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC +   G D  + + AL G++G  +   S +SQL +   +KK F++CL      G +
Sbjct: 211 TFGCGS-QQGGDLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIF 269

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +         +P  + T  + N P+  Y ++LK I +    +  P   FD   +GE
Sbjct: 270 AIGNV--------VQPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD---TGE 316

Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRF 267
             G IIDSG+ LTY    VY    E  ++ F + +     +  E   LC+ ++    + F
Sbjct: 317 KKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQE--FLCFQYVGRVDDDF 371

Query: 268 PSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
           P + F+FE D  L +        +G+N++ + ++N       +     + L+G     + 
Sbjct: 372 PKITFHFENDLPLNVYPHDYFFENGDNLYCVGFQNGGLQ---SKDGKGMVLLGDLVLSNK 428

Query: 319 RFVYDLNIDLLSFVKENCS 337
             VYDL   ++ + + NCS
Sbjct: 429 LVVYDLENQVIGWTEYNCS 447


>gi|15234606|ref|NP_194732.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|4938479|emb|CAB43838.1| putative protein [Arabidopsis thaliana]
 gi|7269903|emb|CAB80996.1| putative protein [Arabidopsis thaliana]
 gi|67633774|gb|AAY78811.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332660310|gb|AEE85710.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 424

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 96/358 (26%), Positives = 156/358 (43%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +  + IG P    LL++DTGS L +               F P +SS+++  +C      
Sbjct: 79  LANISIGNPPVPQLLLIDTGSDLTWIHCLPCKCYPQTIPFFHPSRSSTYRNASCVSAPHA 138

Query: 48  YFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +   ++    C Y ++Y D S T+G  A E ++     +G       +FGC  DN GF
Sbjct: 139 MPQIFRDEKTGNCQYHLRYRDFSNTRGILAEEKLTFETSDDGLISKQNIVFGCGQDNSGF 198

Query: 104 DEDARDGALAGVLGLSRVTISFISQ-LGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            +       +GVLGL   T S +++  GS    +FSYC    L N  Y  + L  G    
Sbjct: 199 TK------YSGVLGLGPGTFSIVTRNFGS----KFSYCFG-SLTNPTYPHNILILGNGAK 247

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                T    F     + YYL L+ IS   + ++  P TF    S +GG +ID+G   T 
Sbjct: 248 IEGDPTPLQIF----QDRYYLDLQAISFGEKLLDIEPGTFQRYRS-QGGTVIDTGCSPTI 302

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
              + Y  L E+ + +     L ++ D  +    CY   L      FP + F+F   A L
Sbjct: 303 LAREAYETLSEE-IDFLLGEVLRRVKDWDQYTTPCYEGNLKLDLYGFPVVTFHFAGGAEL 361

Query: 280 RIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +D E++F+       F LA+  +  D +++IG+  Q++    Y+L    + F + +C
Sbjct: 362 ALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 419


>gi|449527151|ref|XP_004170576.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Cucumis
           sativus]
          Length = 523

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 102/357 (28%), Positives = 144/357 (40%), Gaps = 41/357 (11%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL-----------------IYAIFDPRKSSSFQKINCDHP 44
            R+ +G P +    + DTGS +                 I  IFDP+ SSS+  ++CD  
Sbjct: 186 ARIGVGQPVQSYFFVPDTGSDVSWLQCQPCDGENGCYKQIGPIFDPKSSSSYSPLSCDSE 245

Query: 45  DCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
            C       C    C+Y ++Y D S T G  A ET S                GC +DN 
Sbjct: 246 QCHLLDEAACDANSCIYEVEYGDGSFTVGELATETFSFRHSNS----IPNLPIGCGHDNE 301

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    A               IS  SQL +     FSYCLV        +SS L F  D 
Sbjct: 302 GLFVGAAGLIGL-----GGGAISLSSQLEAT---SFSYCLV---DLDSESSSTLDFNADQ 350

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
                ++   K    P  F Y+ +  +S+  + +     +F+I  SG GG I+DSG+ +T
Sbjct: 351 PSDSLTSPLVKNDRFPT-FRYVKVIGMSVGGKPLPISSSSFEIDESGSGGIIVDSGTTIT 409

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDAN-L 279
              SDVY  L + FV   +    A       P   CY L    N   P++AF     N L
Sbjct: 410 EIPSDVYDVLRDAFVGLTKNLPPAPGV---SPFDTCYDLSSQSNVEVPTIAFILPGENSL 466

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++  +N          F LA  P    +++IG+ QQ+  R  YDL   L+ F  + C
Sbjct: 467 QLPAKNCLFQVDSAGTFCLAFLPSTFPLSIIGNVQQQGIRVSYDLANSLVGFSTDKC 523


>gi|297798978|ref|XP_002867373.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297313209|gb|EFH43632.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 434

 Score = 96.3 bits (238), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 95/358 (26%), Positives = 158/358 (44%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +  + IG P    LL++DTGS L +               F P +SS+++  +C+     
Sbjct: 89  LANISIGDPPVPQLLLIDTGSDLTWIQCLPCKCYPQTIPFFHPSRSSTYRNASCESAPHA 148

Query: 48  YFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             +   ++    C Y ++Y D S T+G  A E ++     EG       +FGC  DN GF
Sbjct: 149 MPQIFRDEKTGNCRYHLRYRDFSNTRGILAKEKLTFQTSDEGLISKPNIVFGCGQDNSGF 208

Query: 104 DEDARDGALAGVLGLSRVTISFISQ-LGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            +       +GVLGL   T S +++  GS    +FSYC    L +  Y  ++L  G    
Sbjct: 209 TQ------YSGVLGLGPGTFSIVTRNFGS----KFSYCFG-SLIDPTYPHNFLILGNGAR 257

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                T    F     + YYL L+ IS+  + ++  P  F    S +GG +ID+G   T 
Sbjct: 258 IEGDPTPLQIF----QDRYYLDLQAISLGEKLLDIEPGIFQRYRS-KGGTVIDTGCSPTI 312

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRFPSMAFYFE-DANL 279
              + Y  L E+ + +     L ++ D  +    CY   L      FP + F+F   A L
Sbjct: 313 LAREAYETLSEE-IDFLLGEVLRRVKDWEQYTNHCYEGNLKLDLYGFPVVTFHFAGGAEL 371

Query: 280 RIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +D E++F+       F LA+  +  D +++IG+  Q++    Y+L    + F + +C
Sbjct: 372 ALDVESLFVSSESGDSFCLAMTMNTFDDMSVIGAMAQQNYNVGYNLRTMKVYFQRTDC 429


>gi|242093566|ref|XP_002437273.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
 gi|241915496|gb|EER88640.1| hypothetical protein SORBIDRAFT_10g023970 [Sorghum bicolor]
          Length = 503

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 82/358 (22%), Positives = 142/358 (39%), Gaps = 42/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +F P KS+++  I+C    
Sbjct: 166 VVPIRLGTPAARFTVVFDTGSDTTWVQCQPCVAYCYQQKEPLFTPTKSATYANISCTSSY 225

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S T GF A +T+++     G        FGC   N G
Sbjct: 226 CSDLDTRGCSGGHCLYAVQYGDGSYTVGFYAQDTLTL-----GYDTVKDFRFGCGEKNRG 280

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG++GL R   S   Q        F+YC    +P     + +L FG    
Sbjct: 281 L-----FGKAAGLMGLGRGKTSVPVQAYDKYSGVFAYC----IPATSSGTGFLDFGPGAP 331

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
               +      +++   FYY+ +  I +    ++ P   F      + G ++DSG+V+T 
Sbjct: 332 AAANARLTPMLVDNGPTFYYVGMTGIKVGGHLLSIPATVFS-----DAGALVDSGTVITR 386

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFE-DANL 279
                Y  L   F    E     + +     +  CY L   +     P+++  F+  A L
Sbjct: 387 LPPSAYEPLRSAFAKGMEGLGY-KTAPAFSILDTCYDLTGYQGSIALPAVSLVFQGGACL 445

Query: 280 RIDGENV-FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +D   + ++ D        A    D  + ++G+ QQ+    +YDL   ++ F    C
Sbjct: 446 DVDASGILYVADVSQACLAFAANDDDTDMTIVGNTQQKTYSVLYDLGKKVVGFAPGAC 503


>gi|242086412|ref|XP_002443631.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
 gi|241944324|gb|EES17469.1| hypothetical protein SORBIDRAFT_08g022620 [Sorghum bicolor]
          Length = 507

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 100/376 (26%), Positives = 153/376 (40%), Gaps = 50/376 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           + ++ +GTP+   LL LDT S L +               +FDPR S+S+ ++N D PDC
Sbjct: 142 IAKIAVGTPAVEALLALDTASDLTWLQCQPCRRCYPQSGPVFDPRHSTSYGEMNYDAPDC 201

Query: 47  TYFK------CVNEQCVYTMKYAD------QSVTKGFAAHETISVIGKGEGKAIFHGALF 94
                          C+YT+ Y D       S + G    ET++  G G  +A       
Sbjct: 202 QALGRSGGGDAKRGTCIYTVLYGDGDGHGSTSTSVGDLVEETLTFAG-GVRQAYLS---I 257

Query: 95  GCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFSYCLVIPLPNGEYTS 152
           GC +DN G F   A     AG+LGLSR  IS   Q+  +     FSYCLV  +      S
Sbjct: 258 GCGHDNKGLFGAPA-----AGILGLSRGQISIPHQIAFLGYNASFSYCLVDFISGPGSPS 312

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---S 207
           S L FG       P    T  + + N   FYY+ L  +S+   R+    +  D+ +   +
Sbjct: 313 STLTFGAGAVDTSPPASFTPTVLNQNMPTFYYVRLIGVSVGGVRVPGVTER-DLQLDPYT 371

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
           G GG I+DSG+ +T      Y    + F +                   CY +       
Sbjct: 372 GHGGVILDSGTTVTRLARPAYTAFRDAFRAAATGLGQVSTGGPSGLFDTCYTVGGRAGLR 431

Query: 266 ---RFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
              + P+++ +F     L +  +N  I +D             D  V++IG+  Q+  R 
Sbjct: 432 HCVKVPAVSMHFAGGVELSLQPKNYLITVDSRGTVCFAFAGTGDRSVSVIGNILQQGFRV 491

Query: 321 VYDLNIDLLSFVKENC 336
           VYD+    + F   +C
Sbjct: 492 VYDIGGQRVGFAPNSC 507


>gi|449457263|ref|XP_004146368.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 469

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 168/373 (45%), Gaps = 50/373 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IG+P    L+++DTGS+L++              + FDP KS SF+ + C  P  
Sbjct: 105 LVNLSIGSPPVTQLVVVDTGSSLLWVQCLPCINCFQQSTSWFDPLKSVSFKTLGCGFPGY 164

Query: 47  TY---FKCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL--------- 93
            Y   +KC    Q  Y ++Y     ++G  A E++      EG+   + A+         
Sbjct: 165 NYINGYKCNRFNQAEYKLRYLGGDSSQGILAKESLLFETLDEGRVFQYNAISTQISKIKK 224

Query: 94  ----FGCSNDNHGFDEDARDGALAGVLGLSRVT-ISFISQLGSIIKKRFSYCLVIPLPNG 148
               FGC + N    +   D A  GV GL     I+  +QLG+    +FSYC +  + N 
Sbjct: 225 SNITFGCGHMN---IKTNNDDAYNGVFGLGAYPHITMATQLGN----KFSYC-IGDINNP 276

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            YT ++L  G        ST       H    YY++L+ IS+ ++ +   P+ F I+  G
Sbjct: 277 LYTHNHLVLGQGSYIEGDSTPLQIHFGH----YYVTLQSISVGSKTLKIDPNAFKISSDG 332

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNR 266
            GG +IDSG   T   +  +  L+++ V   +   L ++    +   LC+   +      
Sbjct: 333 SGGVLIDSGMTYTKLANGGFELLYDEIVDLMKGL-LERIPTQRKFEGLCFKGVVSRDLVG 391

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL--IGSQQQRDTRFVYD 323
           FP++ F+F   A+L ++  ++F     + F L  +  + +L+ L  IG   Q++    +D
Sbjct: 392 FPAVTFHFAGGADLVLESGSLFRQHGGDRFCLAILPSNSELLNLSVIGILAQQNYNVGFD 451

Query: 324 LNIDLLSFVKENC 336
           L    + F + +C
Sbjct: 452 LEQMKVFFRRIDC 464


>gi|302143530|emb|CBI22091.3| unnamed protein product [Vitis vinifera]
          Length = 360

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 91/302 (30%), Positives = 128/302 (42%), Gaps = 23/302 (7%)

Query: 50  KCVNEQCVYTMKYADQSVTKGFAAHETISV-----IGKGEGKAIFHGALFGCSNDNHGFD 104
           K  N+ C Y   Y D S T G  A ET +V      GK E + +    +FGC + N G  
Sbjct: 68  KAENQTCPYYYWYGDSSNTTGDFALETFTVNLTMSSGKPELRRV-ENVMFGCGHWNRGLF 126

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY- 163
             A            R  +SF SQL S+    FSYCLV    +    SS L FG D    
Sbjct: 127 HGAAGLLGL-----GRGPLSFSSQLQSLYGHSFSYCLVDRNSDAN-VSSKLIFGEDKDLL 180

Query: 164 RRPSTQATKFI----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
             P    T  +    N  + FYY+ +K I +  E +N P + + I   G GG IIDSG+ 
Sbjct: 181 SHPELNFTTLVAGKENPVDTFYYVQIKSIVVGGEVVNIPEEKWQIATDGSGGTIIDSGTT 240

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFED-A 277
           L+YF    Y  + E F++  + + + +     EP   CY +        P     F D A
Sbjct: 241 LSYFAEPAYQVIKEAFMAKVKGYPVVKDFPVLEP---CYNVTGVEQPDLPDFGIVFSDGA 297

Query: 278 NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                 EN FI I+      L  +      +++IG+ QQ++   +YD     L F    C
Sbjct: 298 VWNFPVENYFIEIEPREVVCLAILGTPPSALSIIGNYQQQNFHILYDTKKSRLGFAPTKC 357

Query: 337 SD 338
           +D
Sbjct: 358 AD 359


>gi|242092900|ref|XP_002436940.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
 gi|241915163|gb|EER88307.1| hypothetical protein SORBIDRAFT_10g011750 [Sorghum bicolor]
          Length = 465

 Score = 95.9 bits (237), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 101/371 (27%), Positives = 151/371 (40%), Gaps = 66/371 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           MV L  GTPS   +L++DTGS + +                 +FDP KSS++  I C   
Sbjct: 126 MVTLGFGTPSVPQVLLMDTGSDVSWVQCAPCNSTECYPQKDPLFDPSKSSTYAPIACGAD 185

Query: 45  DCTYFK------CVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
            C          C +   QC Y ++Y D S T+G  ++ETI+    G     FH   FGC
Sbjct: 186 ACNKLGDHYRNGCTSGGTQCGYRVEYGDGSSTRGVYSNETIT-FAPGITVKDFH---FGC 241

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            +D  G   D  DG    +LGL     S + Q  S+    FSYCL  P  N E  + +L 
Sbjct: 242 GHDQRG-PSDKFDG----LLGLGGAPESLVVQTASVYGGAFSYCL--PALNSE--AGFLA 292

Query: 157 FGTDMGYRRPS--TQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
            G      RPS  T  + F+  P          Y +++  IS+  + ++ P   F     
Sbjct: 293 LGV-----RPSAATNTSAFVFTPMWHLPMDATSYMVNMTGISVGGKPLDIPRSAF----- 342

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
             GG +IDSG+++T      Y  L+      F  + +    D       CY F   +   
Sbjct: 343 -RGGMLIDSGTIVTELPETAYNALNAALRKAFAAYPMVASED----FDTCYNFTGYSNVT 397

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
            P +A  F   A + +D  N  ++  ++        P D  + +IG+  QR    +YD  
Sbjct: 398 VPRVALTFSGGATIDLDVPNGILV--KDCLAFRESGP-DVGLGIIGNVNQRTLEVLYDAG 454

Query: 326 IDLLSFVKENC 336
              + F    C
Sbjct: 455 HGKVGFRAGAC 465


>gi|147809812|emb|CAN71447.1| hypothetical protein VITISV_040904 [Vitis vinifera]
          Length = 988

 Score = 95.9 bits (237), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 99/368 (26%), Positives = 159/368 (43%), Gaps = 58/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDC 46
           +V +  GTP +   LILDTGS++ +                FD   SS++          
Sbjct: 128 LVDVAFGTPPQKFKLILDTGSSITWTQCKACVHCLKDSHRHFDSLASSTY---------- 177

Query: 47  TYFKCVNEQC--VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           ++  C+       Y M Y D+S + G    +T+++    E   +F    FGC  +N G  
Sbjct: 178 SFGSCIPSTVGNTYNMTYGDKSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNEG-- 231

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            D   GA  G+LGL +  +S +SQ  S  KK FSYC    LP      S L FG     +
Sbjct: 232 -DFGSGA-DGMLGLGQGQLSTVSQTASKFKKVFSYC----LPEENSIGSLL-FGEKATSQ 284

Query: 165 RPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
             S + T  +N P       + +Y++ L DIS+ N+R+N P   F        G IIDSG
Sbjct: 285 SSSLKFTSLVNGPGTSGLEESGYYFVKLLDISVGNKRLNIPSSVF-----ASPGTIIDSG 339

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYFE 275
           +V+T      Y  L   F     ++ L+       + +  CY L    +   P    +F 
Sbjct: 340 TVITRLPQRAYSALKAAFKKAMAKYPLSNGRRKENDMLDTCYNLSGRKDVLLPEXVLHFG 399

Query: 276 D-ANLRIDGENVFIIDYENHFFLL----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           D A++R++G+ V   +  +   L     + +  +  + +IG++QQ     +YD+    + 
Sbjct: 400 DGADVRLNGKRVVWGNDASRLCLAFAGNSKSTMNPELTIIGNRQQVSLTVLYDIRGRRIG 459

Query: 331 FVKENCSD 338
           F    CS+
Sbjct: 460 FGGNGCSN 467


>gi|297825301|ref|XP_002880533.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326372|gb|EFH56792.1| hypothetical protein ARALYDRAFT_481251 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 430

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 98/363 (26%), Positives = 153/363 (42%), Gaps = 64/363 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL----------------IYAIFDPRKSSSFQKINCDHPD 45
           V   +G P      I+DTGS+L                I+ +F+P  SS+F + +CD   
Sbjct: 70  VNFSVGQPPVPQFTIMDTGSSLLWIQCHPCKHCSSNHMIHPVFNPALSSTFVECSCDDRF 129

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C Y     C + +CVY   Y   + +KG  A E ++         +     FGC ++N  
Sbjct: 130 CRYAPNGHCSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGHEN-- 187

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-- 160
              +  +    G+LGL     S   QLGS    +FSYC +  L N  Y  + L  G D  
Sbjct: 188 --GEQLESEFTGILGLGAKPTSLAVQLGS----KFSYC-IGDLANKNYGYNQLVLGEDAD 240

Query: 161 -MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            +G   P    T+     N  YY++L+ IS+ ++++N  P  F    S   G I+D+G++
Sbjct: 241 ILGDPTPIEFETE-----NGIYYMNLEGISVGDKQLNIEPVVFKRRGS-RTGVILDTGTL 294

Query: 220 LTYFHSDVYWKLHEKFVSY----FERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMAFY 273
            T+     Y +L+ +  S      ERF             LCY   + E    FP + F+
Sbjct: 295 YTWLADIAYRELYNEIKSILDPKLERFWFRDF--------LCYHGRVNEELIGFPVVTFH 346

Query: 274 FE-DANLRIDGENVFI----IDYENHFFLLAVAP-------HDDLVALIGSQQQRDTRFV 321
           F   A L ++  ++F      D  ++ F ++V P       + D  A IG   Q+     
Sbjct: 347 FAGGAELAMEATSMFYPMTESDTYHNVFCMSVRPTTEHGGEYKDFTA-IGLMAQQYYNIA 405

Query: 322 YDL 324
           YDL
Sbjct: 406 YDL 408


>gi|118487542|gb|ABK95598.1| unknown [Populus trichocarpa]
          Length = 471

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/368 (23%), Positives = 159/368 (43%), Gaps = 57/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V+L +GTP K   +ILDTGS+L +                ++DP  S +++K++C   +C
Sbjct: 127 VKLGLGTPPKYYAMILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVEC 186

Query: 47  TYFKCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           +  K            +  C+YT  Y D S + G+ + + +++              +GC
Sbjct: 187 SRLKAATLNDPLCETDSNACLYTASYGDTSFSIGYLSQDLLTL----TSSQTLPQFTYGC 242

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S ++QL +     FSYC    LP     SS   
Sbjct: 243 GQDNQGLF-----GRAAGIIGLARDKLSMLAQLSTKYGHAFSYC----LPTANSGSSGGG 293

Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           F +       S + T  +    N   Y+L L  I++    ++     + +        +I
Sbjct: 294 FLSIGSISPTSYKFTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LI 347

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
           DSG+V+T     +Y  L + FV    +    + +  P    +  C+    ++ +  P + 
Sbjct: 348 DSGTVITRLPMSMYAALRQAFV----KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIK 403

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQRDTRFVYDLNIDL 328
             F+  A+L +   ++ +I+ +     LA A     + +A+IG++QQ+     YD++   
Sbjct: 404 MIFQGGADLTLRAPSI-LIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSR 462

Query: 329 LSFVKENC 336
           + F   +C
Sbjct: 463 IGFAPGSC 470


>gi|224136884|ref|XP_002326969.1| predicted protein [Populus trichocarpa]
 gi|222835284|gb|EEE73719.1| predicted protein [Populus trichocarpa]
          Length = 626

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 97/379 (25%), Positives = 147/379 (38%), Gaps = 78/379 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RLFIGTP +   LI+DTGS + Y                F P  SS+++ + C+ P C 
Sbjct: 79  TRLFIGTPPQEFALIVDTGSTVTYVPCSSCEQCGKHQDPRFQPDLSSTYRPVKCN-PSCN 137

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E  QC Y  +YA+ S + G  A + +S   + E K     A+FGC N   G   
Sbjct: 138 ---CDDEGKQCTYERRYAEMSSSSGVIAEDVVSFGNESELKP--QRAVFGCENVETG--- 189

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL--------------VIPLPNGE 149
           D       G++GL R  +S + QL    +I   FS C               + P PN  
Sbjct: 190 DLYSQRADGIMGLGRGRLSVVDQLVDKGVIGDSFSLCYGGMDVGGGAMVLGQISPPPN-- 247

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                + F     YR P             +Y + LK++ +  + +   P  FD     +
Sbjct: 248 -----MVFSHSNPYRSP-------------YYNIELKELHVAGKPLKLKPKVFD----EK 285

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G ++DSG+   YF    +  L +  +   E   L Q+   P+P                
Sbjct: 286 HGTVLDSGTTYAYFPEAAFHALKDAIMK--EIRHLKQIPG-PDPNYHDICFSGAGREVSH 342

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTR 319
           ++  F + N+         +  EN+ F          L      +DL  L+G    R+T 
Sbjct: 343 LSKVFPEVNMVFGSGQKLSLSPENYLFRHTKVSGAYCLGIFQNGNDLTTLLGGIVVRNTL 402

Query: 320 FVYDLNIDLLSFVKENCSD 338
             YD   D + F K NCS+
Sbjct: 403 VTYDRENDKIGFWKTNCSE 421


>gi|212722026|ref|NP_001131674.1| uncharacterized protein LOC100193034 precursor [Zea mays]
 gi|194692214|gb|ACF80191.1| unknown [Zea mays]
 gi|413946454|gb|AFW79103.1| hypothetical protein ZEAMMB73_752316 [Zea mays]
          Length = 441

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 88/364 (24%), Positives = 148/364 (40%), Gaps = 44/364 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTY-- 48
           V++ +GTP++   L+ DTGS L +            +F P  S S+  + C    C    
Sbjct: 93  VKVLVGTPAQEFTLVADTGSELTWVKCAGGASPPGLVFRPEASKSWAPVPCSSDTCKLDV 152

Query: 49  -FKCVN-----EQCVYTMKYADQSVTK-GFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
            F   N       C Y  +Y + S    G    ++ ++   G   A     + GCS+ + 
Sbjct: 153 PFSLANCSSSASPCSYDYRYKEGSAGALGVVGTDSATIALPGGKVAQLQDVVLGCSSTHD 212

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G    + DG    VL L    ISF S+  +     FSYCLV  L     T  YL FG   
Sbjct: 213 GQSFKSVDG----VLSLGNAKISFASRAAARFGGSFSYCLVDHLAPRNATG-YLAFGPGQ 267

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             R P+TQ   F++    FY + +  + +  + ++ P + +D      GG I+DSG+ LT
Sbjct: 268 VPRTPATQTKLFLDPAMPFYGVKVDAVHVAGQALDIPAEVWDPK---SGGVILDSGTTLT 324

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE----PIQLCY--FLPET-FNRFPSMAFYF 274
              +  Y    +  V+   +     L+  P+    P + CY    P       P +A  F
Sbjct: 325 VLATPAY----KAVVAALTKL----LAGVPKVDFPPFEHCYNWTAPRPGAPEIPKLAVQF 376

Query: 275 EDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
                       ++ID +     + +   +   V++IG+  Q++  + +DL    + F+ 
Sbjct: 377 TGCARLEPPAKSYVIDVKPGVKCIGLQEGEWPGVSVIGNIMQQEHLWEFDLKNMEVRFMP 436

Query: 334 ENCS 337
             C+
Sbjct: 437 STCT 440


>gi|357123876|ref|XP_003563633.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 503

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 91/359 (25%), Positives = 145/359 (40%), Gaps = 42/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP     ++ DTGS   +                +FDP KSS++  ++C  P 
Sbjct: 164 VVPIGLGTPPSRFTVVFDTGSDTTWVQCRPCVVSCYKQKDRLFDPAKSSTYANVSCADPA 223

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C       C    C+Y ++Y D S T GF A +T++V      +    G  FGC   N G
Sbjct: 224 CADLDASGCNAGHCLYGIQYGDGSYTVGFFAKDTLAVA-----QDAIKGFKFGCGEKNRG 278

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                  G  AG+LGL R   S   Q        FSYC    LP     + YL+FG    
Sbjct: 279 L-----FGQTAGLLGLGRGPTSITVQAYEKYGGSFSYC----LPASSAATGYLEFGPLSP 329

Query: 163 YRRPSTQATK--FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               S   T     +    FYY+ L  I +  +++   P+    +V    G ++DSG+V+
Sbjct: 330 SSSGSNAKTTPMLTDKGPTFYYVGLTGIRVGGKQLGAIPE----SVFSNSGTLVDSGTVI 385

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-DAN 278
           T    D  +       +        + +     +  CY F   +    P+++  F+  A 
Sbjct: 386 TRL-PDTAYAALSSAFAAAMAASGYKKAAAYSILDTCYDFTGLSQVSLPTVSLVFQGGAC 444

Query: 279 LRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +D    V+ I         A    D+ V ++G+ QQR    +YD++  ++ F    C
Sbjct: 445 LDLDASGIVYAISQSQVCLGFASNGDDESVGIVGNTQQRTYGVLYDVSKKVVGFAPGAC 503


>gi|224072901|ref|XP_002303933.1| predicted protein [Populus trichocarpa]
 gi|222841365|gb|EEE78912.1| predicted protein [Populus trichocarpa]
          Length = 370

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/357 (24%), Positives = 148/357 (41%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTG--SALI---------YAIFDPRKSSSFQKINCDHPDCTYF 49
           +V+  +GTP + +L+ LD    +A I           +F+  KS++F+ + C  P C   
Sbjct: 36  IVKAKVGTPPQTLLMALDNSYDAAWIPCKGCVGCSSTVFNTVKSTTFKTLGCGAPQCKQV 95

Query: 50  K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C +   Y   ++     ++ T   I        ++   FGC     G    
Sbjct: 96  PNPICGGSTCTWNTTYGSSTI----LSNLTRDTIALSMDPVPYYA--FGCIQKATGSSVP 149

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +     G+LG  R  +SF+SQ  ++ K  FSYCL  P       S  L+ G  +G + P
Sbjct: 150 PQ-----GLLGFGRGPLSFLSQTQNLYKSTFSYCL--PSFRTLNFSGSLRLG-PVG-QPP 200

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY+ L  I +  + ++ P        +   G I DSG+V T   
Sbjct: 201 RIKTTPLLKNPRRSSLYYVKLNGIRVGRKIVDIPRSALAFNPTTGAGTIFDSGTVFTRLV 260

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
           +  Y  +  +F        ++ L         CY +P      P++ F F   N+ +  E
Sbjct: 261 APAYIAVRNEFRKRVGNATVSSLGG----FDTCYSVPIV---PPTITFMFSGMNVTMPPE 313

Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           N+ I         LA+A   D    ++ +I S QQ++ R ++D+    L   +E CS
Sbjct: 314 NLLIHSTAGVTSCLAMAAAPDNVNSVLNVIASMQQQNHRILFDVPNSRLGVAREQCS 370


>gi|223950123|gb|ACN29145.1| unknown [Zea mays]
 gi|413923785|gb|AFW63717.1| hypothetical protein ZEAMMB73_445506 [Zea mays]
          Length = 385

 Score = 95.5 bits (236), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 152/360 (42%), Gaps = 51/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+    +++DTGS + +               +FDP  SS++   +C   DC
Sbjct: 53  LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 112

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       + QC Y + Y D S T G  + +T+++     G +      FGCSN  
Sbjct: 113 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 167

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF++        G++GL     S +SQ    + + FSYCL  P P+   +S +L  G  
Sbjct: 168 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 218

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            G        T  +       FY + L+ I +   +++ P   F        G ++DSG+
Sbjct: 219 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 272

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
           V+T      Y  L   F +  +++  AQ S     +  C+ F  ++    PS+A  F   
Sbjct: 273 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 329

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A + +D   + +    ++    A    D  + +IG+ QQR    +YD+   ++ F    C
Sbjct: 330 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 385


>gi|294461757|gb|ADE76437.1| unknown [Picea sitchensis]
          Length = 325

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 84/348 (24%), Positives = 149/348 (42%), Gaps = 46/348 (13%)

Query: 13  VLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFK-----CVN 53
           + L++DTGS + +              ++F P  S++++ + C+   C   +     C+N
Sbjct: 1   MFLLIDTGSDITWIQCDPCPQCYKQQDSLFQPAGSATYKPLPCNSTMCQQLQSFSHSCLN 60

Query: 54  EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALA 113
             C Y + Y D+S T+G  A ET+++              FGC + N G    A     A
Sbjct: 61  SSCNYMVSYGDKSTTRGDFALETLTLRSDDTILVSVPNFAFGCGHANKGLFNGA-----A 115

Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD--MGYRRPSTQAT 171
           G++GL + +I F +Q      K FSYCL  P  +    S  L FG    + Y    T   
Sbjct: 116 GLMGLGKSSIGFPAQTSVAFGKVFSYCL--PSVSSTIPSGILHFGEAAMLDYDVRFTPLV 173

Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
              + P+  Y++S+  I++ +E +                 ++DSG+V++ F    Y +L
Sbjct: 174 DSSSGPSQ-YFVSMTGINVGDELLPI-----------SATVMVDSGTVISRFEQSAYERL 221

Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENVFII 289
            + F       Q A       P   C+ +    +   P +  +F +DA LR+   ++ + 
Sbjct: 222 RDAFTQILPGLQTAVSV---APFDTCFRVSTVDDINIPLITLHFRDDAELRLSPVHI-LY 277

Query: 290 DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ++     A AP     +++G+ QQ++ RFVYD+    L      C+
Sbjct: 278 PVDDGVMCFAFAPSSSGRSVLGNFQQQNLRFVYDIPKSRLGISAFECN 325


>gi|326513755|dbj|BAJ87896.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 442

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 96/365 (26%), Positives = 152/365 (41%), Gaps = 47/365 (12%)

Query: 1   MVRLFIGTP-SKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPD 45
           ++ L IG P S+ V+L LDTGS +++                FD   S++ + + C  P 
Sbjct: 93  LIHLSIGAPRSQPVVLTLDTGSDVVWTQCEPCAECFTQPLPRFDTAASNTVRSVACSDPL 152

Query: 46  C---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVI-GKGEGKAIFHGALFGCSNDNH 101
           C   +   C    C Y   Y D S++ G    ++ +   GKG GK       FGC   N 
Sbjct: 153 CNAHSEHGCFLHGCTYVSGYGDGSLSFGHFLRDSFTFDDGKGGGKVTVPDIGFGCGMYNA 212

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G           G+ G  R  +S  SQL     ++FSYC        E  SS +  G   
Sbjct: 213 GRFLQTE----TGIAGFGRGPLSLPSQLKV---RQFSYCFTTRF---EAKSSPVFLGGAG 262

Query: 162 GYRRPSTQ---ATKFINH-----PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             +  +T    +T F+        N+ Y LS K +++   R+  P    +I   G G   
Sbjct: 263 DLKAHATGPILSTPFVRSLPPGTDNSHYVLSFKGVTVGKTRLPVP----EIKADGSGATF 318

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAF 272
           IDSG+ +T F   V+ +L   F++         ++   +   +C+ +  +     P + F
Sbjct: 319 IDSGTDITTFPDAVFRQLKSAFIAQ----AALPVNKTADEDDICFSWDGKKTAAMPKLVF 374

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
           + E A+  +  EN    D E+    +AV+    +   LIG+ QQ++T  VYDL    L  
Sbjct: 375 HLEGADWDLPRENYVTEDRESGQVCVAVSTSGQMDRTLIGNFQQQNTHIVYDLAAGKLLL 434

Query: 332 VKENC 336
           V   C
Sbjct: 435 VPAQC 439


>gi|194707866|gb|ACF88017.1| unknown [Zea mays]
          Length = 461

 Score = 95.1 bits (235), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 86/360 (23%), Positives = 152/360 (42%), Gaps = 51/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+    +++DTGS + +               +FDP  SS++   +C   DC
Sbjct: 129 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 188

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       + QC Y + Y D S T G  + +T+++     G +      FGCSN  
Sbjct: 189 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 243

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF++        G++GL     S +SQ    + + FSYCL  P P+   +S +L  G  
Sbjct: 244 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 294

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            G        T  +       FY + L+ I +   +++ P   F        G ++DSG+
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 348

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
           V+T      Y  L   F +  +++  AQ S     +  C+ F  ++    PS+A  F   
Sbjct: 349 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 405

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A + +D   + +    ++    A    D  + +IG+ QQR    +YD+   ++ F    C
Sbjct: 406 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|326533786|dbj|BAK05424.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 412

 Score = 95.1 bits (235), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 141/323 (43%), Gaps = 27/323 (8%)

Query: 27  IFDPRKSSSFQKINCDHPDCT--YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKG 83
           +F P +S +F+ +  D P C   Y +  +   C +    A      G+ A +T   +   
Sbjct: 109 LFSPAESPTFRGVRRDDPVCVPPYHRLHSTNGCSFAFPSA-----IGYLARDTFH-LRHS 162

Query: 84  EGKAI--FHGALFGCSNDNHGF-DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
           E   +    G  FGC++   GF +ED     L GVL LS   +SF++Q GS    RFSYC
Sbjct: 163 ERSVVKSISGVAFGCAHTTTGFYNEDI----LGGVLSLSPSPLSFLTQFGSRAGGRFSYC 218

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPD 200
           L  P       S +++FG ++    P    T  +    + Y+LSL  IS+ N+R++    
Sbjct: 219 LPDPT-TSHNPSGFIQFGIEV-PSLPRHAHTTTLTVSASGYHLSLIGISLGNKRLD---- 272

Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYF 259
             D  +    GC I+    +T      Y  +  + ++        Q+   P  P+     
Sbjct: 273 -IDRHILTSHGCSINPAETITKIAEPAYIIVARELMAQMNELGSKQVKGPPSSPLVFNKI 331

Query: 260 LPETFNRFPSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
                 R P+M F+F D  ++      +F +      FL  V  H     +IG+ QQ + 
Sbjct: 332 SRRVRARLPNMVFHFADGGDMWFTAGKLFQVIGTTARFL--VEGHGSHRTVIGAAQQVNA 389

Query: 319 RFVYDLNIDLLSFVKENCSDDSA 341
           RF++++    L+F +E CS ++A
Sbjct: 390 RFIFNVAAGRLTFAEELCSREAA 412


>gi|115434870|ref|NP_001042193.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|55296112|dbj|BAD67831.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|55296252|dbj|BAD67993.1| putative CDR1 [Oryza sativa Japonica Group]
 gi|113531724|dbj|BAF04107.1| Os01g0178600 [Oryza sativa Japonica Group]
 gi|125569253|gb|EAZ10768.1| hypothetical protein OsJ_00604 [Oryza sativa Japonica Group]
          Length = 454

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 152/374 (40%), Gaps = 55/374 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           ++ + +G+P + +L I DTGS L++                   FDP +SS++ +++C  
Sbjct: 102 LMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQT 161

Query: 44  PDCTYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFH----GALFG 95
             C             C Y   Y D S T G  + ET +    G G++       G  FG
Sbjct: 162 DACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGSGRSPRQVRVGGVKFG 221

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
           CS    G            ++GL    +S ++QLG    + +RFSYCLV   P+    SS
Sbjct: 222 CSTATAGSFPADG------LVGLGGGAVSLVTQLGGATSLGRRFSYCLV---PHSVNASS 272

Query: 154 YLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            L FG       P   +T  +    + +Y + L  + + N+          +  +     
Sbjct: 273 ALNFGALADVTEPGAASTPLVAGDVDTYYTVVLDSVKVGNKT---------VASAASSRI 323

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP----ETFNRFP 268
           I+DSG+ LT+    +   + ++      R  L  +      +QLCY +     E     P
Sbjct: 324 IVDSGTTLTFLDPSLLGPIVDEL---SRRITLPPVQSPDGLLQLCYNVAGREVEAGESIP 380

Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNI 326
            +   F   A + +  EN F+   E    L  VA  +   V+++G+  Q++    YDL+ 
Sbjct: 381 DLTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLDA 440

Query: 327 DLLSFVKENCSDDS 340
             ++F   +C+  S
Sbjct: 441 GTVTFAGADCAGSS 454


>gi|359492937|ref|XP_002283889.2| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 439

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 86/359 (23%), Positives = 149/359 (41%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ +GTP + + ++LDT     +             F P  SS++  + C  P CT  
Sbjct: 100 VVRVKLGTPGQLMFMVLDTSRDAAWVPCADCAGCSSPTFSPNTSSTYASLQCSVPQCTQV 159

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   Y   S      + +++     G          FGC N   G 
Sbjct: 160 RGLSCPTTGTAACFFNQTYGGDSSFSAMLSQDSL-----GLAVDTLPSYSFGCVNAVSGS 214

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +     G+LGL R  +S +SQ GS+    FSYC   P     Y S  L+ G  +G 
Sbjct: 215 TLPPQ-----GLLGLGRGPMSLLSQSGSLYSGVFSYCF--PSFKSYYFSGSLRLG-PLGQ 266

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + + + T  + +P+    YY++L  +S+    +   P+      +   G IIDSG+V+T
Sbjct: 267 PK-NIRTTPLLRNPHRPTLYYVNLTGVSVGRVLVPVAPELLAFDPNTGAGTIIDSGTVIT 325

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY  + ++F     R Q+            C F     +  P + F+F   +L++
Sbjct: 326 RFVEPVYAAIRDEF-----RKQVKGPFATIGAFDTC-FAATNEDIAPPVTFHFTGMDLKL 379

Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         LA+A      + ++ +I + QQ++ R ++D+    L   +E C
Sbjct: 380 PLENTLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRIMFDVTNSRLGIARELC 438


>gi|302784853|ref|XP_002974198.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
 gi|300157796|gb|EFJ24420.1| hypothetical protein SELMODRAFT_54030 [Selaginella moellendorffii]
          Length = 359

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 101/365 (27%), Positives = 162/365 (44%), Gaps = 45/365 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           M+ L IGTP + +  ++DTGS L++                 IF    SSS++K+ C+  
Sbjct: 6   MMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNST 65

Query: 45  DCTYFKCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAIFHGALFG 95
            C+            E C Y  +Y D S T G    + IS    G G   ++ F G LFG
Sbjct: 66  HCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C+    G D +   G    ++GL + + S I QLG  +  +FSYCLV    +     S+L
Sbjct: 126 CARKLKG-DWNFTQG----LIGLGQKSHSLIQQLGDKLGYKFSYCLV-SYDSPPSAKSFL 179

Query: 156 KFGTDMGYRRPSTQATKFINHPN---NFYYLSLKDISIDN-ERMNFPPDTFDITVSG--- 208
             G+    R     +T  ++  +     YY+ L+ I+I     + +  ++   T  G   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITIGGVPVVVYDKESGHNTSVGPFL 239

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRF 267
               +IDSG+  T     VY  + +   S  E+  L  L +    + LC+    +T   F
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRK---SIEEQVILPTLGN-SAGLDLCFNSSGDTSYGF 295

Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
           PS+ FYF +   L +  EN+F +   +   L   +   DL ++IG+ QQ++   +YDL  
Sbjct: 296 PSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDL-SIIGNMQQQNFHILYDLVA 354

Query: 327 DLLSF 331
             +SF
Sbjct: 355 SQISF 359


>gi|356555248|ref|XP_003545946.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 453

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 85/349 (24%), Positives = 149/349 (42%), Gaps = 31/349 (8%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDC----------TYFKCVNEQ 55
           +G+P +   L++DTGS   +       S SF+ + C    C          +     ++ 
Sbjct: 119 VGSPGQRFWLVVDTGSEFTWL----NCSKSFEAVTCASRKCKVDLSELFSLSVCPKPSDP 174

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN---DNHGFDEDARDGAL 112
           C+Y + YAD S  KGF   ++I+V      +   +    GC+    +   F+E+      
Sbjct: 175 CLYDISYADGSSAKGFFGTDSITVGLTNGKQGKLNNLTIGCTKSMLNGVNFNEET----- 229

Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
            G+LGL     SFI +  +    +FSYCLV  L +   +S+    G          + T+
Sbjct: 230 GGILGLGFAKDSFIDKAANKYGAKFSYCLVDHLSHRSVSSNLTIGGHHNAKLLGEIRRTE 289

Query: 173 FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLH 232
            I  P  FY +++  ISI  + +  PP  +D   + EGG +IDSG+ LT      Y  + 
Sbjct: 290 LILFP-PFYGVNVVGISIGGQMLKIPPQVWDF--NAEGGTLIDSGTTLTSLLLPAYEAVF 346

Query: 233 EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIID 290
           E       + +     D  + ++ C F  E F+    P + F+F            +IID
Sbjct: 347 EALTKSLTKVKRVTGEDF-DALEFC-FDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIID 404

Query: 291 YENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                  + + P D +   ++IG+  Q++  + +DL+ + + F    C+
Sbjct: 405 VAPLVKCIGIVPIDGIGGASVIGNIMQQNHLWEFDLSTNTVGFAPSTCT 453


>gi|108707839|gb|ABF95634.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
          Length = 330

 Score = 94.7 bits (234), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 134/281 (47%), Gaps = 28/281 (9%)

Query: 53  NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
           N+ CVYT  Y D+SVT G    +  +    G G ++  G  FGC   N+G  +       
Sbjct: 59  NQTCVYTYYYNDKSVTTGLIEVDKFTF---GAGASV-PGVAFGCGLFNNGVFKSNE---- 110

Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNG-EYTSSYLKFGTDMGYR--RPSTQ 169
            G+ G  R  +S  SQL       FS+C      NG + ++  L    D+ Y+  R + Q
Sbjct: 111 TGIAGFGRGPLSLPSQLKV---GNFSHCFTAV--NGLKQSTVLLDLPADL-YKNGRGAVQ 164

Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
           +T  I +  N  FYYLSLK I++ + R+  P   F +T +G GG IIDSG+ +T     V
Sbjct: 165 STPLIQNSANPTFYYLSLKGITVGSTRLPVPESAFALT-NGTGGTIIDSGTSITSLPPQV 223

Query: 228 YWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFEDANLRIDGEN- 285
           Y  + ++F +  +   +   +  P     C+  P +     P +  +FE A + +  EN 
Sbjct: 224 YQVVRDEFAAQIKLPVVPGNATGP---YTCFSAPSQAKPDVPKLVLHFEGATMDLPRENY 280

Query: 286 VFII--DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
           VF +  D  N    LA+   D+   +IG+ QQ++   +YDL
Sbjct: 281 VFEVPDDAGNSIICLAINKGDE-TTIIGNFQQQNMHVLYDL 320


>gi|9759559|dbj|BAB11161.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|21553652|gb|AAM62745.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|109134179|gb|ABG25087.1| At5g07030 [Arabidopsis thaliana]
          Length = 439

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 147/358 (41%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V+  IGTP++ +LL +DT S + +              F P KS+SF+ ++C  P C  
Sbjct: 100 IVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 159

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C + + Y   S+    +  +TI +              FGC N   G   
Sbjct: 160 VPNPTCGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 213

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                 L G+    R  +S +SQ  SI K  FSYCL  P       S  L+ G     +R
Sbjct: 214 IPPPQGLLGL---GRGPLSLMSQAQSIYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 268

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T+ + +P  ++ YY++L  I +  + ++ PP       S   G I DSG+V T  
Sbjct: 269 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 326

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
              VY  +  +F    +       S        CY       + P++ F F+  N+ +  
Sbjct: 327 AKPVYEAVRNEFRKRVKPTTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 381

Query: 284 ENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ +           +A AP   + +V +I S QQ++ R + D+    L   +E CS
Sbjct: 382 DNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|79507883|ref|NP_196320.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|332003717|gb|AED91100.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 455

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 147/358 (41%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V+  IGTP++ +LL +DT S + +              F P KS+SF+ ++C  P C  
Sbjct: 116 IVKALIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 175

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C + + Y   S+    +  +TI +              FGC N   G   
Sbjct: 176 VPNPTCGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 229

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                 L G+    R  +S +SQ  SI K  FSYCL  P       S  L+ G     +R
Sbjct: 230 IPPPQGLLGL---GRGPLSLMSQAQSIYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 284

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T+ + +P  ++ YY++L  I +  + ++ PP       S   G I DSG+V T  
Sbjct: 285 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 342

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
              VY  +  +F    +       S        CY       + P++ F F+  N+ +  
Sbjct: 343 AKPVYEAVRNEFRKRVKPTTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 397

Query: 284 ENVFI--IDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ +           +A AP   + +V +I S QQ++ R + D+    L   +E CS
Sbjct: 398 DNLMLHSTAGSTSCLAMAAAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 455


>gi|125555051|gb|EAZ00657.1| hypothetical protein OsI_22678 [Oryza sativa Indica Group]
          Length = 435

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)

Query: 20  GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
           G A     F+P +SSSF  I C  P+C   +C    C +T+++ + +V  G    +T+++
Sbjct: 121 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 179

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
                  A F G  FGC     G D D  DGA+ G++ LSR + S  S++     +    
Sbjct: 180 ----PPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 232

Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
            FSYC    LP+   TSS  +L  G         D+ Y   S+      NHPN+ Y++ L
Sbjct: 233 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVEL 283

Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
             IS+  E +  PP  F        G ++++ +  T+     Y  L + F     R  +A
Sbjct: 284 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RRDMA 333

Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLAV 300
                P    +  CY L    +   P++A  F     L +D  + ++  D  + F  +A 
Sbjct: 334 PYPAAPPFRVLDTCYNLTGLASLAVPTVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 393

Query: 301 ------APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                       V++IG+  QR T  VYDL    + F+   C
Sbjct: 394 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|308044575|ref|NP_001183392.1| uncharacterized protein LOC100501808 [Zea mays]
 gi|238011188|gb|ACR36629.1| unknown [Zea mays]
          Length = 342

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 97/357 (27%), Positives = 147/357 (41%), Gaps = 50/357 (14%)

Query: 15  LILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKCVN-----EQ 55
           ++LDTGS +++               +FDPR+SSS+  + C    C              
Sbjct: 1   MVLDTGSDVVWVQCAPCRRCYEQSGPVFDPRRSSSYGAVGCGAALCRRLDSGGCDLRRGA 60

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           C+Y + Y D SVT G    ET++  G   G  +   AL GC +DN G    A        
Sbjct: 61  CMYQVAYGDGSVTAGDFVTETLTFAG---GARVARVAL-GCGHDNEGLFVAAAGLLGL-- 114

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLV------IPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
               R  +SF +Q+     + FSYCLV           G + SS + FG        S  
Sbjct: 115 ---GRGGLSFPTQISRRYGRSFSYCLVDRTSSGAGAAPGSHRSSTVSFGAGS-VGASSAS 170

Query: 170 ATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGSVLTYFH 224
            T  + +P    FYY+ L  IS+   R+    ++ D+ +   +G GG I+DSG+ +T   
Sbjct: 171 FTPMVRNPRMETFYYVQLVGISVGGARVPGVAES-DLRLDPSTGRGGVIVDSGTSVTRLA 229

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSMAFYFE-DANL 279
              Y  L + F +         L   P    L   CY L      + P+++ +F   A  
Sbjct: 230 RASYSALRDAFRAA----AAGGLRLSPGGFSLFDTCYDLGGRRVVKVPTVSMHFAGGAEA 285

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +  EN  I       F  A A  D  V++IG+ QQ+  R V+D +   + F  + C
Sbjct: 286 ALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQRVGFAPKGC 342


>gi|225216988|gb|ACN85278.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 518

 Score = 94.7 bits (234), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPARSSTYANVSCAAPA 239

Query: 46  C---TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C       C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 240 CFDLDTRGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 295

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSSGTGYLDFGPGSP 346

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               +   T  +  N P  FYY+ +  I +  + ++ P   F        G I+DSG+V+
Sbjct: 347 AAAGARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 400

Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L   FVS      +++     L D       CY F   +    P+++  F
Sbjct: 401 TRLPPPAYSSLRSAFVSAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 454

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L +D   +      +   L   A  D   V ++G+ Q +     YD+   ++ F 
Sbjct: 455 QGGAILDVDASGIMYAASVSQVCLGFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFS 514

Query: 333 KENC 336
              C
Sbjct: 515 PGAC 518


>gi|449463971|ref|XP_004149703.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 158/368 (42%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                FDP  SS+++ I C+  DC 
Sbjct: 85  TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNI-DCI 143

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C ++  QCVY  +YA+ S + G    + IS   + E   I   A+FGC N   G   
Sbjct: 144 ---CDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSE--LIPQRAVFGCENMETG--- 195

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM- 161
           D       G++GL    +S + QL     I   FS C   + +  G      +   +DM 
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255

Query: 162 -GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             Y  P       +  P  +Y + LK+I +  +++      FD    G  G ++DSG+  
Sbjct: 256 FTYSDP-------VRSP--YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTY 302

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
            Y  ++ +    +  +   E   L ++ D P+P    +C+        E  N+FP++   
Sbjct: 303 AYLPAEAFSAFKDAIMD--EIHSLKKI-DGPDPNFKDICFSGAGSDAAELSNKFPTVDMV 359

Query: 274 FEDAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           FE+   L +  EN F    + H  + L      +D   L+G    R+T  +YD     + 
Sbjct: 360 FENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIG 419

Query: 331 FVKENCSD 338
           F K NCS+
Sbjct: 420 FWKTNCSE 427


>gi|296082170|emb|CBI21175.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 140/362 (38%), Gaps = 92/362 (25%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +G+P + +  I DTGS L +                IFDP  S S+  ++CD P 
Sbjct: 90  VVTVGLGSPKRDLTFIFDTGSDLTWTQCEPCVGYCYQQREHIFDPSTSLSYSNVSCDSPS 149

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C   +        C +  C+Y ++Y D S + GF A E +S+        +F+   FGC 
Sbjct: 150 CEKLESATGNSPGCSSSTCLYGIRYGDGSYSIGFFAREKLSLTST----DVFNNFQFGCG 205

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G       G  AG+LGL+R  +S +SQ      K FSYC    LP+   ++ YL F
Sbjct: 206 QNNRGLF-----GGTAGLLGLARNPLSLVSQTAQKYGKVFSYC----LPSSSSSTGYLSF 256

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G+  G     ++A KF                        PP  +               
Sbjct: 257 GSGDG----DSKAVKFTPR--------------------LPPTVYSSV------------ 280

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE- 275
                       K+  + +S + R +   + D       CY L +    + P +  YF  
Sbjct: 281 -----------QKVFRELMSDYPRVKGVSILD------TCYDLSKYKTVKVPKIILYFSG 323

Query: 276 DANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
            A + +  E  ++++         A    DD VA+IG+ QQ+    VYD     + F   
Sbjct: 324 GAEMDLAPEGIIYVLKVSQVCLAFAGNSDDDEVAIIGNVQQKTIHVVYDDAEGRVGFAPS 383

Query: 335 NC 336
            C
Sbjct: 384 GC 385


>gi|357113696|ref|XP_003558637.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 432

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 91/372 (24%), Positives = 157/372 (42%), Gaps = 54/372 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDCT 47
           +VR  +G+P++ +LL LDT +   +A             +F P  S+S+  + C    CT
Sbjct: 78  VVRAGLGSPAQPILLALDTSADATWAHCSPCGTCPSSGSLFAPANSTSYAPLPCSSTMCT 137

Query: 48  YFK---CVNE----------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
             +   C  +           C +T  +AD S     A+           GK       F
Sbjct: 138 VLQGQPCPAQDPYDSSAPLPMCAFTKPFADASFQASLASDWL------HLGKDAIPNYAF 191

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC +   G   +       G+LGL R  ++ +SQ+G++    FSYCL  P     Y S  
Sbjct: 192 GCVSAVSGPTANLPK---QGLLGLGRGPMALLSQVGNMYNGVFSYCL--PSYKSYYFSGS 246

Query: 155 LKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           L+ G     R    + T  + +PN  + YY+++  +S+    +  P  +F    +   G 
Sbjct: 247 LRLGAAGQPR--GVRYTPMLKNPNRSSLYYVNVTGLSVGRAPVKVPAGSFAFDPATGAGT 304

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNRF-PS 269
           ++DSG+V+T +   VY  L E+F     R  +A  S          C+   E      P+
Sbjct: 305 VVDSGTVITRWTPPVYAALREEF-----RRHVAAPSGYTSLGAFDTCFNTDEVAAGVAPA 359

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV--APH--DDLVALIGSQQQRDTRFVYDL 324
           +  + +   +L +  EN  I         LA+  AP   + +V ++ + QQ++ R V+D+
Sbjct: 360 VTVHMDGGLDLALPMENTLIHSSATPLACLAMAEAPQNVNAVVNVLANLQQQNLRVVFDV 419

Query: 325 NIDLLSFVKENC 336
               + F +E+C
Sbjct: 420 ANSRVGFARESC 431


>gi|115465373|ref|NP_001056286.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|113579837|dbj|BAF18200.1| Os05g0557100 [Oryza sativa Japonica Group]
 gi|125553268|gb|EAY98977.1| hypothetical protein OsI_20935 [Oryza sativa Indica Group]
          Length = 494

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 103/394 (26%), Positives = 154/394 (39%), Gaps = 70/394 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----------------------------IFDPRK 32
           VR  +GTP++  +LI DTGS L +                              +F P  
Sbjct: 112 VRFRVGTPAQPFVLIADTGSDLTWVKCRGAASPSHATATASPAAAPSPAVAPPRVFRPGD 171

Query: 33  SSSFQKINCDHPDCTY---FKCVN-----EQCVYTMKYADQSVTKGFAAHETISVI---- 80
           S ++  I C    C     F   N       C Y  +Y D S  +G    ++ +V     
Sbjct: 172 SKTWSPIPCSSETCKSTIPFSLANCSSSTAACSYDYRYNDNSAARGVVGTDSATVALSGG 231

Query: 81  ----GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR 136
               G G+ KA   G + GC+  + G   +A D    GVL L    ISF S+  S    R
Sbjct: 232 RGGGGGGDRKAKLQGVVLGCTTAHAGQGFEASD----GVLSLGYSNISFASRAASRFGGR 287

Query: 137 FSYCLVIPLPNGEYTSSYLKFG-----TDMGYRRPSTQATKFIN-HPNNFYYLSLKDISI 190
           FSYCLV  L     T SYL FG            P ++    ++     FY +++  +S+
Sbjct: 288 FSYCLVDHLAPRNAT-SYLTFGAGPDAASSSAPAPGSRTPLLLDARVRPFYAVAVDSVSV 346

Query: 191 DNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC 250
           D   ++ P + +D  V   GG IIDSG+ LT   +  Y     K V      QLA L   
Sbjct: 347 DGVALDIPAEVWD--VGSNGGTIIDSGTSLTVLATPAY-----KAVVAALSEQLAGLPRV 399

Query: 251 P-EPIQLCYFLPETFN-----RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD 304
             +P   CY      +       P +A  F  +         ++ID       + V    
Sbjct: 400 AMDPFDYCYNWTARGDGGGDLAVPKLAVQFAGSARLEPPAKSYVIDAAPGVKCIGVQEGA 459

Query: 305 -DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
              V++IG+  Q++  + +DLN   L F + +C+
Sbjct: 460 WPGVSVIGNILQQEHLWEFDLNNRWLRFRQTSCT 493


>gi|449508297|ref|XP_004163275.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 641

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 100/368 (27%), Positives = 158/368 (42%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                FDP  SS+++ I C+  DC 
Sbjct: 85  TRLWIGTPPQQFALIVDTGSTVTYVPCSTCEQCGRHQDPKFDPESSSTYKPIKCNI-DCI 143

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C ++  QCVY  +YA+ S + G    + IS   + E   I   A+FGC N   G   
Sbjct: 144 ---CDSDGVQCVYERQYAEMSTSSGVLGEDVISFGNQSE--LIPQRAVFGCENMETG--- 195

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDM- 161
           D       G++GL    +S + QL     I   FS C   + +  G      +   +DM 
Sbjct: 196 DLFSQRADGIMGLGTGDLSLVDQLVEKGAINDSFSLCYGGMDIGGGAMVLGGISPPSDMI 255

Query: 162 -GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
             Y  P       +  P  +Y + LK+I +  +++      FD    G  G ++DSG+  
Sbjct: 256 FTYSDP-------VRSP--YYNVDLKEIHVAGKKLPLSSGIFD----GRYGAVLDSGTTY 302

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
            Y  ++ +    +  +   E   L ++ D P+P    +C+        E  N+FP++   
Sbjct: 303 AYLPAEAFSAFKDAIMD--EIHSLKKI-DGPDPNFKDICFSGAGSDAAELSNKFPTVDMV 359

Query: 274 FEDAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           FE+   L +  EN F    + H  + L      +D   L+G    R+T  +YD     + 
Sbjct: 360 FENGQKLSLTPENYFFRHSKVHGAYCLGIFENGNDQTTLLGGIVVRNTLVMYDRANSKIG 419

Query: 331 FVKENCSD 338
           F K NCS+
Sbjct: 420 FWKTNCSE 427


>gi|356528623|ref|XP_003532899.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 507

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 86/330 (26%), Positives = 141/330 (42%), Gaps = 29/330 (8%)

Query: 26  AIFDPRKSSSFQKINCDHPDC----------TYFKCVNEQCVYTMKYADQSVTKGFAAHE 75
            +F P +S SFQ + C    C          +     ++ C+Y + YAD S  KGF   +
Sbjct: 189 GVFCPHRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTD 248

Query: 76  TISVIGKGEGKAIFHGALFGCSN---DNHGFDEDARDGALAGVLGLSRVTISFISQLGSI 132
           TI+V  K   +   +    GC+    +   F+ED       G+LGL     SFI +    
Sbjct: 249 TITVDLKNGKEGKLNNLTIGCTKSMENGVNFNEDT-----GGILGLGFAKDSFIDKAAYE 303

Query: 133 IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR-RPSTQATKFINHPNNFYYLSLKDISID 191
              +FSYCLV  L +    SSYL  G     +     + T+ I  P  FY +++  ISI 
Sbjct: 304 YGAKFSYCLVDHLSH-RNVSSYLTIGGHHNAKLLGEIKRTELILFP-PFYGVNVVGISIG 361

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
            + +  PP  +D   + +GG +IDSG+ LT      Y  + E  +    + +     D  
Sbjct: 362 GQMLKIPPQVWDF--NSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDF- 418

Query: 252 EPIQLCYFLPETFNR--FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL--V 307
             +  C F  E F+    P + F+F            +IID       + + P D +   
Sbjct: 419 GALDFC-FDAEGFDDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGA 477

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++IG+  Q++  + +DL+ + + F    C+
Sbjct: 478 SVIGNIMQQNHLWEFDLSTNTIGFAPSICT 507


>gi|255581545|ref|XP_002531578.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223528808|gb|EEF30814.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 94.4 bits (233), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 82/320 (25%), Positives = 141/320 (44%), Gaps = 50/320 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY--- 48
           V L +G+P + V ++LDTGS L +          ++F+P  S ++ K+ C  P C     
Sbjct: 71  VSLTVGSPPQNVTMVLDTGSELSWLHCKKTQFLNSVFNPLSSKTYSKVPCLSPTCKTRTR 130

Query: 49  -----FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C   + C   + YAD +  +G  A ET  +     G       +FGC +   G
Sbjct: 131 DLTIPVSCDATKLCHVIVSYADATSIEGNLAFETFRL-----GSLTKPATIFGCMDS--G 183

Query: 103 FDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTSSY 154
           F  ++  D    G++G++R ++SF++Q+G     +FSYC+       V+ L N  +   +
Sbjct: 184 FSSNSEEDSKTTGLIGMNRGSLSFVNQMG---YPKFSYCISGFDSAGVLLLGNASF--PW 238

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           LK    + Y      +T         Y + L+ I + N+ ++ P   F    +G G  ++
Sbjct: 239 LK---PLSYTPLVQISTPLPYFDRVAYTVQLEGIKVKNKVLSLPKSVFVPDHTGAGQTMV 295

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD----CPEPIQLCYFLPET---FNRF 267
           DSG+  T+    VY  L  +F+S   R  L  L+D        + LCY L  +       
Sbjct: 296 DSGTQFTFLLGPVYTALKNEFLSQ-TRGILKVLNDDNFVFQGAMDLCYLLDSSRPNLQNL 354

Query: 268 PSMAFYFEDANLRIDGENVF 287
           P ++  F+ A + + GE + 
Sbjct: 355 PVVSLMFQGAEMSVSGERLL 374


>gi|297810815|ref|XP_002873291.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319128|gb|EFH49550.1| hypothetical protein ARALYDRAFT_487523 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 439

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 89/358 (24%), Positives = 148/358 (41%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V++ IGTP++ +LL +DT S + +              F P KS+SF+ ++C  P C  
Sbjct: 100 IVKVLIGTPAQPLLLAMDTSSDVAWIPCSGCVGCPSNTAFSPAKSTSFKNVSCSAPQCKQ 159

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C + + Y   S+    +  +TI +              FGC N   G   
Sbjct: 160 VPNPACGARACSFNLTYGSSSIAANLS-QDTIRLAADP-----IKAFTFGCVNKVAGGGT 213

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                 L G+    R  +S +SQ  S+ K  FSYCL  P       S  L+ G     +R
Sbjct: 214 IPPPQGLLGL---GRGPLSLMSQAQSVYKSTFSYCL--PSFRSLTFSGSLRLGPTSQPQR 268

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T+ + +P  ++ YY++L  I +  + ++ PP       S   G I DSG+V T  
Sbjct: 269 --VKYTQLLRNPRRSSLYYVNLVAIRVGRKVVDLPPAAIAFNPSTGAGTIFDSGTVYTRL 326

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
              VY  +  +F    +       S        CY       + P++ F F+  N+ +  
Sbjct: 327 AKPVYEAVRNEFRKRVKPPTAVVTSL--GGFDTCY---SGQVKVPTITFMFKGVNMTMPA 381

Query: 284 ENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ +         LA+A      + +V +I S QQ++ R + D+    L   +E CS
Sbjct: 382 DNLMLHSTAGSTSCLAMASAPENVNSVVNVIASMQQQNHRVLIDVPNGRLGLARERCS 439


>gi|297827577|ref|XP_002881671.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297327510|gb|EFH57930.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 438

 Score = 94.4 bits (233), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 141/328 (42%), Gaps = 64/328 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDC----- 46
           V L +G+P + + ++LDTGS L +          ++F+P  SS++  + C  P C     
Sbjct: 63  VTLAVGSPPQNISMVLDTGSELSWLHCKKSPNLGSVFNPVSSSTYSPVPCSSPICRTRTR 122

Query: 47  -----------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
                      T+F      C   + YAD +  +G  AH+T  +     G     G LFG
Sbjct: 123 DLPIPASCDPKTHF------CHVAISYADATSIEGNLAHDTFVI-----GSVTRPGTLFG 171

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C +     D +  D    G++G++R ++SF++QLG     +FSYC+     +G  +S  L
Sbjct: 172 CMDSGLSSDSE-EDAKSTGLMGMNRGSLSFVNQLG---FSKFSYCI-----SGSDSSGIL 222

Query: 156 KFG-------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
             G         + Y     Q T         Y + L+ I + ++ ++ P   F    +G
Sbjct: 223 LLGDASYSWLGPIQYTPLVLQTTPLPYFDRVAYTVQLEGIRVGSKILSLPKSVFVPDHTG 282

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET 263
            G  ++DSG+  T+    VY  L  +F++  +   + ++ D P       + LCY +  +
Sbjct: 283 AGQTMVDSGTQFTFLMGPVYTALKNEFIA--QTKSVLRIVDDPNFVFQGTMDLCYRVGSS 340

Query: 264 ----FNRFPSMAFYFEDANLRIDGENVF 287
               F   P ++  F  A + + G+ + 
Sbjct: 341 TRPNFTGLPVISLMFRGAEMSVSGQKLL 368


>gi|413923784|gb|AFW63716.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 531

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 85/360 (23%), Positives = 150/360 (41%), Gaps = 51/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+    +++DTGS + +               +FDP  SS++   +C   DC
Sbjct: 199 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSADC 258

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       + QC Y + Y D S T G  + +T+++     G +      FGCSN  
Sbjct: 259 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVRSFQFGCSNVE 313

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF++        G++GL     S +SQ    + + FSYC    LP    +S +L  G  
Sbjct: 314 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYC----LPPTPSSSGFLTLGAA 364

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            G        T  +       FY + L+ I +   +++ P   F        G ++DSG+
Sbjct: 365 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 418

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
           V+T      Y  L   F +  +++  AQ S     +  C+ F  ++    PS+A  F   
Sbjct: 419 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 475

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A + +D   + +    ++    A    D  + +IG+ QQR    +YD+   ++ F    C
Sbjct: 476 AVVSLDASGIIL----SNCLAFAGNSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 531


>gi|343172998|gb|AEL99202.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 94.0 bits (232), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 146/360 (40%), Gaps = 48/360 (13%)

Query: 5   FIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTYFK 50
           +IGTP +   LI+DTGS + Y                F P  S ++  + C+ PDCT   
Sbjct: 1   WIGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-PDCT-CD 58

Query: 51  CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDEDAR 108
             N+QC Y  +YA+ S + G    + +S     E K     A+FGC N   G  F + A 
Sbjct: 59  TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGCENAETGDLFSQHAD 116

Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYRR 165
                G++GL R  +S + QL    +I   FS C   + +  G      +   +DM +  
Sbjct: 117 -----GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
                +        +Y + L+ + +  ++++  P  FD    G+ G I+DSG+   Y   
Sbjct: 172 SDPDRSP-------YYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPE 220

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA-NL 279
             +    +   S     +  +  D P    +C+      +PE +  FPS+   F++    
Sbjct: 221 AAFLPFIQAITSELHGLKQIRGPD-PNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279

Query: 280 RIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +  EN      + H  + L       D   L+G    R+T   YD     + F K NCS
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|242044724|ref|XP_002460233.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
 gi|241923610|gb|EER96754.1| hypothetical protein SORBIDRAFT_02g025060 [Sorghum bicolor]
          Length = 512

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 157/373 (42%), Gaps = 62/373 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFK- 50
           +G       +I+DT S L +               +FDP  S S+  + C+   C   + 
Sbjct: 157 VGLGGGEATVIVDTASELTWVQCAPCESCHDQQDPLFDPSSSPSYAAVPCNSSSCDALQL 216

Query: 51  -----------CVNE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
                      C  +      C YT+ Y D S ++G  AH+ +S+ G+     +  G +F
Sbjct: 217 ATGGTSGGAAACQGQDQSAAACSYTLSYRDGSYSRGVLAHDRLSLAGE-----VIDGFVF 271

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC   N G       G  +G++GL R  +S +SQ        FSYCL  PL   + +S  
Sbjct: 272 GCGTSNQG----PPFGGTSGLMGLGRSQLSLVSQTMDQFGGVFSYCL--PLKESD-SSGS 324

Query: 155 LKFGTDMGYRRPSTQA--TKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
           L  G D    R ST       ++ P    FY+++L  I++  + +     +         
Sbjct: 325 LVIGDDSSVYRNSTPIVYASMVSDPLQGPFYFVNLTGITVGGQEVESSGFSSGGGGG--- 381

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFN-RF 267
             IIDSG+V+T     +Y  +  +F+S F  +  A     P    +  C+ +      + 
Sbjct: 382 KAIIDSGTVITSLVPSIYNAVKAEFLSQFAEYPQA-----PGFSILDTCFNMTGLREVQV 436

Query: 268 PSMAFYFEDA-NLRID-GENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYD 323
           PS+   F+    + +D G  ++ +  ++    LA+AP   +    +IG+ QQ++ R ++D
Sbjct: 437 PSLKLVFDGGVEVEVDSGGVLYFVSSDSSQVCLAMAPLKSEYETNIIGNYQQKNLRVIFD 496

Query: 324 LNIDLLSFVKENC 336
            +   + F +E C
Sbjct: 497 TSGSQVGFAQETC 509


>gi|343172996|gb|AEL99201.1| aspartyl protease family protein, partial [Silene latifolia]
          Length = 584

 Score = 94.0 bits (232), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 90/360 (25%), Positives = 146/360 (40%), Gaps = 48/360 (13%)

Query: 5   FIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTYFK 50
           +IGTP +   LI+DTGS + Y                F P  S ++  + C+ PDCT   
Sbjct: 1   WIGTPPQEFALIVDTGSTVTYVPCNSCDQCGNHQDPKFQPDLSDTYHPVKCN-PDCT-CD 58

Query: 51  CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDEDAR 108
             N+QC Y  +YA+ S + G    + +S     E K     A+FGC N   G  F + A 
Sbjct: 59  TENDQCTYERQYAEMSSSSGILGEDLVSFGNMSELKP--QRAVFGCENAETGDLFSQHAD 116

Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYRR 165
                G++GL R  +S + QL    +I   FS C   + +  G      +   +DM +  
Sbjct: 117 -----GIMGLGRGDLSIVDQLVEKGVINDSFSLCYGGMEVGGGAMVLGQISPPSDMVFSH 171

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
                +        +Y + L+ + +  ++++  P  FD    G+ G I+DSG+   Y   
Sbjct: 172 SDPDRSP-------YYNIELRGLHVAGKKLDINPQVFD----GKHGTILDSGTTYAYLPE 220

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA-NL 279
             +    +   S     +  +  D P    +C+      +PE +  FPS+   F++    
Sbjct: 221 AAFLPFIQAITSELHGLKQIRGPD-PNYNDVCFSGAGSEIPELYKTFPSVDMVFDNGEKY 279

Query: 280 RIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +  EN      + H  + L       D   L+G    R+T   YD     + F K NCS
Sbjct: 280 SLSPENYLFKHSKVHGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDREHSKVGFWKTNCS 339


>gi|414587000|tpg|DAA37571.1| TPA: hypothetical protein ZEAMMB73_036171 [Zea mays]
          Length = 459

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 91/384 (23%), Positives = 164/384 (42%), Gaps = 69/384 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V+L  GTP       +DT S L++               +F+P+ SSS+  + C    C
Sbjct: 93  LVKLGTGTPQHFFSAAIDTASDLVWMQCQPCVSCYRQLDPVFNPKLSSSYAVVPCTSDTC 152

Query: 47  TYF---KCVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 +C  +    C YT KY+   VTKG  A + +++     G  +FH  +FGCS+ +
Sbjct: 153 AQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI-----GGDVFHAVVFGCSDSS 207

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G          +G++GL R  +S +SQL      RF YCL  P+     TS  L  G  
Sbjct: 208 VG----GPAAQASGLVGLGRGPLSLVSQLS---VHRFMYCLPPPM---SRTSGKLVLGAG 257

Query: 161 M-GYRRPSTQATKFINHPN---NFYYLSLKDISIDNE------RMNFPPDTFDITVSGEG 210
               R  S + T  ++      ++YYL+L  +++ ++          PP        G G
Sbjct: 258 ADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPGTTRNATSPPSGGAGGGGGGG 317

Query: 211 -------------GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQL 256
                        G I+D  S +++  + +Y +L +      E  +L + +      + L
Sbjct: 318 GGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE---EEIRLPRATPSLRLGLDL 374

Query: 257 CYFLPETFNR----FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
           C+ LPE         P+++  F+   L +D + +F+ D      ++        V+++G+
Sbjct: 375 CFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFVTDGRMMCLMIG---RTSGVSILGN 431

Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
            Q ++ R +++L    ++F K +C
Sbjct: 432 FQLQNMRVLFNLRRGKITFAKASC 455


>gi|125596976|gb|EAZ36756.1| hypothetical protein OsJ_21092 [Oryza sativa Japonica Group]
          Length = 435

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)

Query: 20  GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
           G A     F+P +SSSF  I C  P+C   +C    C +T+++ + +V  G    +T+++
Sbjct: 121 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 179

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
                  A F G  FGC     G D D  DGA+ G++ LSR + S  S++     +    
Sbjct: 180 ----PPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 232

Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
            FSYC    LP+   TSS  +L  G         D+ Y   S+      NHPN+ Y++ L
Sbjct: 233 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDL 283

Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
             IS+  E +  PP  F        G ++++ +  T+     Y  L + F     R  +A
Sbjct: 284 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RKDMA 333

Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLAV 300
                P    +  CY L    +   P++A  F     L +D  + ++  D  + F  +A 
Sbjct: 334 PYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 393

Query: 301 ------APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                       V++IG+  QR T  VYDL    + F+   C
Sbjct: 394 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 435


>gi|242066140|ref|XP_002454359.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
 gi|241934190|gb|EES07335.1| hypothetical protein SORBIDRAFT_04g029390 [Sorghum bicolor]
          Length = 460

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 86/366 (23%), Positives = 150/366 (40%), Gaps = 67/366 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+K   +++D+GS + +               +FDP  SS++   +C    C
Sbjct: 132 LITVRLGSPAKTQTVLIDSGSDVSWVQCKPCLQCHSQVDPLFDPSLSSTYSPFSCSSAAC 191

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       + QC Y ++YAD S T G  + +T+++     G        FGCS+  
Sbjct: 192 AQLGQDGNGCSSSSQCQYIVRYADGSSTTGTYSSDTLAL-----GSNTISNFQFGCSHVE 246

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF+ D  D    G++GL     S  SQ        FSYCL  P P+   +S +L  G  
Sbjct: 247 SGFN-DLTD----GLMGLGGGAPSLASQTAGTFGTAFSYCLP-PTPS---SSGFLTLGAG 297

Query: 161 MGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
                     + F+  P         FY + L+ I +   +++ P   F        G +
Sbjct: 298 T---------SGFVKTPMLRSSPVPTFYGVRLEAIRVGGTQLSIPTSVF------SAGMV 342

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-LCY-FLPETFNRFPSMA 271
           +DSG+++T      Y  L   F +  ++++ A     P  I   C+ F  ++  R PS+A
Sbjct: 343 MDSGTIITRLPRTAYSALSSAFKAGMKQYRPAP----PRSIMDTCFDFSGQSSVRLPSVA 398

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             F   A + +D   + +     +    A    D    ++G+ QQR    +YD+    + 
Sbjct: 399 LVFSGGAVVNLDANGIIL----GNCLAFAANSDDSSPGIVGNVQQRTFEVLYDVGGGAVG 454

Query: 331 FVKENC 336
           F    C
Sbjct: 455 FKAGAC 460


>gi|302786560|ref|XP_002975051.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
 gi|300157210|gb|EFJ23836.1| hypothetical protein SELMODRAFT_54028 [Selaginella moellendorffii]
          Length = 359

 Score = 94.0 bits (232), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 100/365 (27%), Positives = 161/365 (44%), Gaps = 45/365 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           M+ L IGTP + +  ++DTGS L++                 IF    SSS++K+ C+  
Sbjct: 6   MMELSIGTPPQLIPAMIDTGSDLVWLKCDNCDHCDLDHHGETIFFSDASSSYKKLPCNST 65

Query: 45  DCTYFKCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEG---KAIFHGALFG 95
            C+            E C Y  +Y D S T G    + IS    G G   ++ F G LFG
Sbjct: 66  HCSGMSSAGIGPRCEETCKYKYEYGDGSRTSGDVGSDRISFRSHGAGEDHRSFFDGFLFG 125

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C     G D +   G    ++GL + + S I QLG  +  +FSYCLV    +     S+L
Sbjct: 126 CGRKLKG-DWNFTQG----LIGLGQKSHSLIQQLGDKLGYKFSYCLV-SYDSPPSAKSFL 179

Query: 156 KFGTDMGYRRPSTQATKFINHPN---NFYYLSLKDISIDN-ERMNFPPDTFDITVSG--- 208
             G+    R     +T  ++  +     YY+ L+ I++     + +  ++   T  G   
Sbjct: 180 FLGSSAALRGHDVVSTPILHGDHLDQTLYYVDLQSITVGGVPVVVYDKESGHNTSVGPFL 239

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRF 267
               +IDSG+  T     VY  + +   S  E+  L  L +    + LC+    +T   F
Sbjct: 240 ANKTVIDSGTTYTLLTPPVYEAMRK---SIEEQVILPTLGN-SAGLDLCFNSSGDTSYGF 295

Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNI 326
           PS+ FYF +   L +  EN+F +   +   L   +   DL ++IG+ QQ++   +YDL  
Sbjct: 296 PSVTFYFANQVQLVLPFENIFQVTSRDVVCLSMDSSGGDL-SIIGNMQQQNFHILYDLVA 354

Query: 327 DLLSF 331
             +SF
Sbjct: 355 SQISF 359


>gi|222637181|gb|EEE67313.1| hypothetical protein OsJ_24553 [Oryza sativa Japonica Group]
          Length = 414

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 93/340 (27%), Positives = 142/340 (41%), Gaps = 51/340 (15%)

Query: 28  FDPRKSSSFQKINCDHPDCT-----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
           F P  SS+F K+ C    C      Y  C    CVY   Y     T G+ A ET+ V   
Sbjct: 96  FQPASSSTFSKLPCASSLCQFLTSPYLTCNATGCVYYYPYG-MGFTAGYLATETLHV--- 151

Query: 83  GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
             G A F G  FGCS +N G    +     +G++GL R  +S +SQ+G     RFSYCL 
Sbjct: 152 --GGASFPGVAFGCSTEN-GVGNSS-----SGIVGLGRSPLSLVSQVG---VGRFSYCLR 200

Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFP 198
                G+   S + FG+ +        +   + +P    +++YY++L  I++    +   
Sbjct: 201 SDADAGD---SPILFGS-LAKVTGGKSSPAILENPEMPSSSYYYVNLTGITVGATDLPVT 256

Query: 199 PDTFDITVSGE----GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-P 253
             TF  T        GG I+DSG+ LTY   + Y  +   F+S      L    +     
Sbjct: 257 STTFGFTRGAGAGLVGGTIVDSGTTLTYLVKEGYAMVKRAFLSQMATANLTTTVNGTRFG 316

Query: 254 IQLCY-----------FLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLA 299
             LC+            +P    RF   A Y   A  R     V  +D +       LL 
Sbjct: 317 FDLCFDANAAGGGSGVPVPTLVLRFAGGAEY---AVRRRSYVGVVEVDSQGRAAVECLLV 373

Query: 300 VAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           +   + L +++IG+  Q D   +YDL+  + SF   +C++
Sbjct: 374 LPASEKLSISIIGNVMQMDLHVLYDLDGGMFSFAPADCAN 413


>gi|449433371|ref|XP_004134471.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
 gi|449495479|ref|XP_004159853.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 424

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 104/368 (28%), Positives = 167/368 (45%), Gaps = 50/368 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCD 42
           ++   IG PS  V+  LDT + LI+                    F   KS +++   C 
Sbjct: 76  LMSFNIGNPSSQVMGFLDTSNGLIWVQCSNCNSQCEPEKRGLTTKFLSSKSFTYEMEPCG 135

Query: 43  HPDC---TYFKCVN---EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FG 95
              C   T F+  N   + C Y + Y D   T G  + ++       +G  +  G L FG
Sbjct: 136 SNFCNSLTGFQTCNSSDKWCKYRLVYGDNKATSGILSSDSFG-FDTSDGMLVDVGFLNFG 194

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           CS      DE +      G +GL++  +S ISQLG    K+FSYCLV P  N   TS  +
Sbjct: 195 CSEAPLTGDEQS----YTGNVGLNQTPLSLISQLG---IKKFSYCLV-PFNNLGSTSK-M 245

Query: 156 KFGTDMGYRRPSTQATKF-INHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            FG+      P T   +  + +PN + YY+ +  ISI N+  +F    FD+      G I
Sbjct: 246 YFGS-----LPVTSGGQTPLLYPNSDAYYVKVLGISIGNDEPHFD-GVFDV-YEVRDGWI 298

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPET--FNRFPSM 270
           ID+G   +   +D +  L  KF++  +     Q  D P E  +LC+ L        FP +
Sbjct: 299 IDTGITYSSLETDAFDSLLAKFLTLKD---FPQRKDDPKERFELCFELQNANDLESFPDV 355

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             +F+ A+L ++ E+ F+   ++  F LA+      V+++G+ Q ++    YDL   ++S
Sbjct: 356 TVHFDGADLILNVESTFVKIEDDGIFCLALLRSGSPVSILGNFQLQNYHVGYDLEAQVIS 415

Query: 331 FVKENCSD 338
           F   +C+D
Sbjct: 416 FAPVDCAD 423


>gi|226530755|ref|NP_001150335.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195638492|gb|ACG38714.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 461

 Score = 93.6 bits (231), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 85/360 (23%), Positives = 151/360 (41%), Gaps = 51/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+    +++DTGS + +               +FDP  SS++   +C    C
Sbjct: 129 LITVGLGSPATSQTMLIDTGSDVSWVQCKPCSQCHSQADPLFDPSSSSTYSPFSCGSAAC 188

Query: 47  TYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                       + QC Y + Y D S T G  + +T+++     G +      FGCSN  
Sbjct: 189 AQLGQEGNGCSSSSQCQYIVTYGDGSSTTGTYSSDTLAL-----GSSAVKSFQFGCSNVE 243

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            GF++        G++GL     S +SQ    + + FSYCL  P P+   +S +L  G  
Sbjct: 244 SGFNDQTD-----GLMGLGGGAQSLVSQTAGTLGRAFSYCLP-PTPS---SSGFLTLGAA 294

Query: 161 MGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            G        T  +       FY + L+ I +   +++ P   F        G ++DSG+
Sbjct: 295 GGSGTSGFVKTPMLRSSQVPTFYGVRLQAIRVGGRQLSIPASVF------SAGTVMDSGT 348

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE-D 276
           V+T      Y  L   F +  +++  AQ S     +  C+ F  ++    PS+A  F   
Sbjct: 349 VITRLPPTAYSALSSAFKAGMKQYPPAQPSGI---LDTCFDFSGQSSVSIPSVALVFSGG 405

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A + +D   + +    ++    A    D  + +IG+ QQR    +YD+   ++ F    C
Sbjct: 406 AVVSLDASGIIL----SNCLAFAANSDDSSLGIIGNVQQRTFEVLYDVGRGVVGFRAGAC 461


>gi|54290724|dbj|BAD62394.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 523

 Score = 93.2 bits (230), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 93/342 (27%), Positives = 146/342 (42%), Gaps = 52/342 (15%)

Query: 20  GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
           G A     F+P +SSSF  I C  P+C   +C    C +T+++ + +V  G    +T+++
Sbjct: 209 GGAPCDPAFEPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 267

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL----GSIIKK 135
                  A F G  FGC     G D D  DGA+ G++ LSR + S  S++     +    
Sbjct: 268 ----PPSATFAGFTFGCI--EVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTSAA 320

Query: 136 RFSYCLVIPLPNGEYTSS--YLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSL 185
            FSYC    LP+   TSS  +L  G         D+ Y   S+      NHPN+ Y++ L
Sbjct: 321 AFSYC----LPSSSATSSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDL 371

Query: 186 KDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
             IS+  E +  PP  F        G ++++ +  T+     Y  L + F     R  +A
Sbjct: 372 VGISVGGEDLPVPPAVF-----AAHGTLLEAATEFTFLAPAAYAALRDAF-----RKDMA 421

Query: 246 QLSDCP--EPIQLCYFLPETFN-RFPSMAFYFEDAN-LRID-GENVFIIDYENHFFLLA- 299
                P    +  CY L    +   P++A  F     L +D  + ++  D  + F  +A 
Sbjct: 422 PYPAAPPFRVLDTCYNLTGLASLAVPAVALRFAGGTELELDVRQMMYFADPSSVFSSVAC 481

Query: 300 -----VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                       V++IG+  QR T  VYDL    + F+   C
Sbjct: 482 LAFAAAPLPAFPVSVIGTLAQRSTEVVYDLRGGRVGFIPGRC 523


>gi|115475621|ref|NP_001061407.1| Os08g0267300 [Oryza sativa Japonica Group]
 gi|37806402|dbj|BAC99940.1| putative 41 kD chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113623376|dbj|BAF23321.1| Os08g0267300 [Oryza sativa Japonica Group]
          Length = 524

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 152/372 (40%), Gaps = 72/372 (19%)

Query: 13  VLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF--------- 49
           + +I+DTGS L +               +FDP  S+S+  + C+   C            
Sbjct: 176 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 235

Query: 50  KCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C           +E+C Y++ Y D S ++G  A +T+++     G A   G +FGC   
Sbjct: 236 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLS 290

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G  AG++GL R  +S +SQ        FSYCL  P       +  L  G 
Sbjct: 291 NRGLF-----GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCL--PAATSGDAAGSLSLGG 343

Query: 160 DMGYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           D    R +T    T+ I  P    FY++++   S+    +            G    ++D
Sbjct: 344 DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLD 396

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQL---CYFLPETFN-RFPS 269
           SG+V+T     VY  +  +F   F  ER+  A       P  L   CY L      + P 
Sbjct: 397 SGTVITRLAPSVYRAVRAEFARQFGAERYPAA------PPFSLLDACYNLTGHDEVKVPL 450

Query: 270 MAFYFED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLN 325
           +    E  A++ +D   + F+   +     LA+A    +D   +IG+ QQ++ R VYD  
Sbjct: 451 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTV 510

Query: 326 IDLLSFVKENCS 337
              L F  E+CS
Sbjct: 511 GSRLGFADEDCS 522


>gi|449462553|ref|XP_004149005.1| PREDICTED: probable aspartic protease At2g35615-like [Cucumis
           sativus]
          Length = 418

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 100/360 (27%), Positives = 156/360 (43%), Gaps = 53/360 (14%)

Query: 5   FIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF- 49
            IGTP    L I DTGS L +A              IF+P KS+SF  + C+   C    
Sbjct: 85  IIGTPPVDYLGIADTGSDLTWAQCLPCLKCYQQLRPIFNPLKSTSFSHVPCNTQTCHAVD 144

Query: 50  --KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              C V   C Y+  Y D++ +KG    E I+ IG    K++      GC + + G    
Sbjct: 145 DGHCGVQGVCDYSYTYGDRTYSKGDLGFEKIT-IGSSSVKSV-----IGCGHASSG---- 194

Query: 107 ARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
              G  +GV+GL    +S +SQ+   S I +RFSYCL   L    + +  + FG +    
Sbjct: 195 -GFGFASGVIGLGGGQLSLVSQMSQTSGISRRFSYCLPTLL---SHANGKINFGQNAVVS 250

Query: 165 RPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            P   +T  I+     +YY++L+ ISI NER         +  + +G  IIDSG+ L++ 
Sbjct: 251 GPGVVSTPLISKNTVTYYYITLEAISIGNER--------HMAFAKQGNVIIDSGTTLSFL 302

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY---FLPETFNRFPSMAFYFE-DANL 279
             ++Y  +     S  +  +  ++ D      LC+       T +  P +   F   AN+
Sbjct: 303 PKELYDGV---VSSLLKVVKAKRVKDPGNFWDLCFDDGINVATSSGIPIITAQFSGGANV 359

Query: 280 RIDGENVF--IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +   N F  + +  N   L   +P D+   +IG+    +    YDL    LSF    C+
Sbjct: 360 NLLPVNTFQKVANNVNCLTLTPASPTDEF-GIIGNLALANFLIGYDLEAKRLSFKPTVCT 418


>gi|125560845|gb|EAZ06293.1| hypothetical protein OsI_28528 [Oryza sativa Indica Group]
          Length = 525

 Score = 93.2 bits (230), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 152/372 (40%), Gaps = 72/372 (19%)

Query: 13  VLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF--------- 49
           + +I+DTGS L +               +FDP  S+S+  + C+   C            
Sbjct: 177 LTVIVDTGSDLTWVQCKPCSVCYAQRDPLFDPSGSASYAAVPCNASACEASLKAATGVPG 236

Query: 50  KCV----------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C           +E+C Y++ Y D S ++G  A +T+++     G A   G +FGC   
Sbjct: 237 SCATVGGGGGGGKSERCYYSLAYGDGSFSRGVLATDTVAL-----GGASVDGFVFGCGLS 291

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G  AG++GL R  +S +SQ        FSYCL  P       +  L  G 
Sbjct: 292 NRGLF-----GGTAGLMGLGRTELSLVSQTAPRFGGVFSYCL--PAATSGDAAGSLSLGG 344

Query: 160 DMGYRRPSTQA--TKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           D    R +T    T+ I  P    FY++++   S+    +            G    ++D
Sbjct: 345 DTSSYRNATPVSYTRMIADPAQPPFYFMNVTGASVGGAAVA-------AAGLGAANVLLD 397

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQL---CYFLPETFN-RFPS 269
           SG+V+T     VY  +  +F   F  ER+  A       P  L   CY L      + P 
Sbjct: 398 SGTVITRLAPSVYRAVRAEFARQFGAERYPAA------PPFSLLDACYNLTGHDEVKVPL 451

Query: 270 MAFYFED-ANLRIDGENV-FIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLN 325
           +    E  A++ +D   + F+   +     LA+A    +D   +IG+ QQ++ R VYD  
Sbjct: 452 LTLRLEGGADMTVDAAGMLFMARKDGSQVCLAMASLSFEDQTPIIGNYQQKNKRVVYDTV 511

Query: 326 IDLLSFVKENCS 337
              L F  E+CS
Sbjct: 512 GSRLGFADEDCS 523


>gi|357487593|ref|XP_003614084.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355515419|gb|AES97042.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 412

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 92/349 (26%), Positives = 149/349 (42%), Gaps = 51/349 (14%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           IGTP   +  ++DTG+  I+               +F P KSS+++ I C  P C     
Sbjct: 96  IGTPPFQLYSLIDTGNDNIWFQCKPCKPCLNQTSPMFHPSKSSTYKTIPCTSPIC----- 150

Query: 52  VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
                    K AD      +   +T+++         F   + GC + N G      +G 
Sbjct: 151 ---------KNADGH----YLGVDTLTLNSNNGTPISFKNIVIGCGHRNQG----PLEGY 193

Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
           ++G +GL+R  +SFISQL S I  +FSYCLV PL + E  SS L FG         T +T
Sbjct: 194 VSGNIGLARGPLSFISQLNSSIGGKFSYCLV-PLFSKENVSSKLHFGDKSTVSGLGTVST 252

Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
                  N Y++SL+  S+ +  +              G  IIDSG+ +T    DVY +L
Sbjct: 253 PI--KEENGYFVSLEAFSVGDHIIKLE------NSDNRGNSIIDSGTTMTILPKDVYSRL 304

Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPET--FNRFPSMAFYFEDANLRIDGENVFI- 288
               +   +  +L ++ D  +   LCY    T    +   +  +F  + + ++  N F  
Sbjct: 305 ESVVL---DMVKLKRVKDPSQQFNLCYQTTSTTLLTKVLIITAHFSGSEVHLNALNTFYP 361

Query: 289 IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           I  E   F      +   +A+ G+  Q++    +DLN   +SF   +C+
Sbjct: 362 ITDEVICFAFVSGGNFSSLAIFGNVVQQNFLVGFDLNKKTISFKPTDCT 410


>gi|225216949|gb|ACN85242.1| aspartic proteinase nepenthesin-1 precursor [Oryza minuta]
          Length = 519

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 85/363 (23%), Positives = 141/363 (38%), Gaps = 51/363 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 240

Query: 46  CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 241 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT-DM 161
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG   +
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSL 347

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
              R          +   FYY+ +  I +  + ++ P   F        G I+DSG+V+T
Sbjct: 348 AAARARLTTPMLTENGPTFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVIT 402

Query: 222 YFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE 275
                 Y  L            +++     L D       CY F   +    P+++  F+
Sbjct: 403 RLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLFQ 456

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFVK 333
             A L +D   +      +   L   A  D   V ++G+ Q +     YD+   ++ F  
Sbjct: 457 GGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFYP 516

Query: 334 ENC 336
             C
Sbjct: 517 GAC 519


>gi|242051593|ref|XP_002454942.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
 gi|241926917|gb|EES00062.1| hypothetical protein SORBIDRAFT_03g001790 [Sorghum bicolor]
          Length = 431

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 153/369 (41%), Gaps = 63/369 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           V + IGTP +   LI DT S L +               +FDP KSSSF  + C    CT
Sbjct: 93  VTIGIGTPPQLHTLIADTASDLTWTQCNLFNDTAKQVEPLFDPAKSSSFAFVTCSSKLCT 152

Query: 48  -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  +C N+ C Y   Y       G  A+E+ ++    +   +  G  FGC     G
Sbjct: 153 EDNPGTKRCSNKTCRYVYPYVSVEAA-GVLAYESFTLSDNNQHICMSFG--FGC-----G 204

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG--TD 160
              D      +G+LG+S   +S +SQL      +FSYCL    P  +  SS L FG   D
Sbjct: 205 ALTDGNLLGASGILGMSPAILSMVSQLA---IPKFSYCLT---PYTDRKSSPLFFGAWAD 258

Query: 161 MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           +G  + +    K +     +YY+ L  +S+   R++ P  TF +    +GG ++D G  +
Sbjct: 259 LGRYKTTGPIQKSLTF---YYYVPLVGLSLGTRRLDVPAATFALK---QGGTVVDLGCTV 312

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN----RFPSMAFYFED 276
                  +  L E   +      L   +   +  ++C+ LP        + P +  YF  
Sbjct: 313 GQLAEPAFTALKE---AVLHTLNLPLTNRTVKDYKVCFALPSGVAMGAVQTPPLVLYF-- 367

Query: 277 ANLRIDGENVFIIDYENHF-------FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
                DG    ++  +N+F         LA+ P   + ++IG+ QQ++   ++D++    
Sbjct: 368 -----DGGADMVLPRDNYFQEPTAGLMCLALVPGGGM-SIIGNVQQQNFHLLFDVHDSKF 421

Query: 330 SFVKENCSD 338
            F    C D
Sbjct: 422 LFAPTICDD 430


>gi|115448353|ref|NP_001047956.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|45735844|dbj|BAD12879.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735970|dbj|BAD12999.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|113537487|dbj|BAF09870.1| Os02g0720900 [Oryza sativa Japonica Group]
 gi|125540930|gb|EAY87325.1| hypothetical protein OsI_08729 [Oryza sativa Indica Group]
 gi|215692622|dbj|BAG88042.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 458

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 81/362 (22%), Positives = 152/362 (41%), Gaps = 51/362 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+   ++++DTGS+L +                +F+P+ SS++  + C    
Sbjct: 123 VTRMGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQ 182

Query: 46  CTYF--------KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+           C +   C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 183 CSDLPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSLPNFYYGC 237

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   F+YC    LP+   +     
Sbjct: 238 GQDNEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFTYC----LPSSSSSGYLSL 288

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +   +  ++  ++ Y++ L  +++        P +   +       IIDS
Sbjct: 289 GSYNPGQYSYTPMVSSSLD--DSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDS 341

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           G+V+T   + VY  L +   +  +    A        +  C+    +    P++   F  
Sbjct: 342 GTVITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSAPAVTMSFAG 398

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L++  +N  ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    
Sbjct: 399 GAALKLSAQN-LLVDVDDSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSSRIGFAAGG 456

Query: 336 CS 337
           CS
Sbjct: 457 CS 458


>gi|242084332|ref|XP_002442591.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
 gi|241943284|gb|EES16429.1| hypothetical protein SORBIDRAFT_08g022580 [Sorghum bicolor]
          Length = 493

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 103/372 (27%), Positives = 161/372 (43%), Gaps = 49/372 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           M ++ +GTP+   LL +DTGS + +               +FDPR S+S++++  D PDC
Sbjct: 135 MAKIAVGTPAVEALLAMDTGSDITWLQCQPCRRCYPQSGPVFDPRHSTSYREMGYDAPDC 194

Query: 47  TYFK------CVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                          CVY + Y D  S T G    ET++  G   G  + H ++ GC +D
Sbjct: 195 QALGRSGGGDAKRMTCVYAVGYGDDGSTTVGDFIEETLTFAG---GVQVPHMSI-GCGHD 250

Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLV---IPLPNGEYTSS 153
           N G F   A     AG+LGL R  IS  SQ+ ++      FSYCL    +  P G   SS
Sbjct: 251 NKGLFAAPA-----AGILGLGRGQISCPSQIAALGYNVTSFSYCLADFFLSSP-GRSVSS 304

Query: 154 YLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITV---SG 208
            L  G       P    T  + + N   FYY+ L  +S+   R+    +  D+ +   +G
Sbjct: 305 TLTIGDGAAAGSPPPSFTPTVQNLNMATFYYVRLVGVSVGGVRVPGVTED-DLKLDPYTG 363

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS-DCPEP-IQLCYFLPETFNR 266
            GG I+DSG+ +T      Y  +  +         L Q+S   P      CY +     +
Sbjct: 364 RGGVILDSGTAVTRLARRAY--IAFRDAFRAAAVDLGQVSIGGPSGFFDTCYTMGGRAMK 421

Query: 267 FPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
            P+++ +F     L +  +N  I +D             D  V++IG+ QQ+  R VY++
Sbjct: 422 VPTVSMHFAGGVELTLPPKNYLIPVDSMGTVCFAFAGTGDRSVSIIGNIQQQGFRVVYNI 481

Query: 325 NIDLLSFVKENC 336
               + F   +C
Sbjct: 482 GGGRVGFAPNSC 493


>gi|242045118|ref|XP_002460430.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
 gi|241923807|gb|EER96951.1| hypothetical protein SORBIDRAFT_02g027990 [Sorghum bicolor]
          Length = 488

 Score = 92.8 bits (229), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 147/366 (40%), Gaps = 46/366 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +  L +GTP+  +++ LDTGS   +               +FDP  SS++  + C   +C
Sbjct: 140 VASLRLGTPATELVVELDTGSDQSWVQCKPCADCYEQRDPVFDPTASSTYSAVPCGAREC 199

Query: 47  TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKA--IFHGALF 94
                 +          + C Y + Y D S T G  A +T+++             G +F
Sbjct: 200 QELASSSSSRNCSSDNNKNCPYEVSYDDDSHTVGDLARDTLTLSPSPSPSPADTVPGFVF 259

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC + N G       G + G+LGL     S  SQ+ +     FSYC    LP+    + Y
Sbjct: 260 GCGHSNAG-----TFGEVDGLLGLGLGKASLPSQVAARYGAAFSYC----LPSSPSAAGY 310

Query: 155 LKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
           L FG      R + Q T+ +   +   YYL+L  I +    +  P   F        G I
Sbjct: 311 LSFGGAAA--RANAQFTEMVTGQDPTSYYLNLTGIVVAGRAIKVPASAFATAA----GTI 364

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAF 272
           IDSG+  +      Y  L   F S   R++  +    P     CY F      R P++  
Sbjct: 365 IDSGTAFSRLPPSAYAALRSSFRSAMGRYRYKRAPSSPI-FDTCYDFTGHETVRIPAVEL 423

Query: 273 YFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
            F D A + +    V     +     LA  P+ DL  ++G+ QQR    +YD+    + F
Sbjct: 424 VFADGATVHLHPSGVLYTWNDVAQTCLAFVPNHDL-GILGNTQQRTLAVIYDVGSQRIGF 482

Query: 332 VKENCS 337
            ++ C+
Sbjct: 483 GRKGCA 488


>gi|115452187|ref|NP_001049694.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|29893618|gb|AAP06872.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|108707424|gb|ABF95219.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|108707425|gb|ABF95220.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113548165|dbj|BAF11608.1| Os03g0271900 [Oryza sativa Japonica Group]
 gi|215715205|dbj|BAG94956.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737033|dbj|BAG95962.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215740994|dbj|BAG97489.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 447

 Score = 92.4 bits (228), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 157/395 (39%), Gaps = 75/395 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYF 49
           V + +GTP + V ++LDTGS L + +            F+   SSS+  + C    C + 
Sbjct: 57  VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPSTACEWR 116

Query: 50  K--------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-- 96
                    C    +  C  ++ YAD S   G  A +T  + G     A+  GA FGC  
Sbjct: 117 GRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCIT 174

Query: 97  ------SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
                 + +++G   D  + A  G+LG++R T+SF++Q G+   +RF+YC+      GE 
Sbjct: 175 SYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE- 225

Query: 151 TSSYLKFGTDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
               L  G D G   P        I+ P  +     Y + L+ I +    +  P      
Sbjct: 226 GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTP 285

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLC 257
             +G G  ++DSG+  T+  +D Y  L  +F S   R  LA L    EP          C
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDAC 341

Query: 258 YFLPE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDD 305
           +  PE          P +      A + + GE  ++++  E      A A       + D
Sbjct: 342 FRGPEARVAAASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401

Query: 306 LVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +    IG   Q++    YDL    + F    C
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|242091327|ref|XP_002441496.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
 gi|241946781|gb|EES19926.1| hypothetical protein SORBIDRAFT_09g028060 [Sorghum bicolor]
          Length = 466

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 146/376 (38%), Gaps = 53/376 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDH 43
           VR  +GTP++  +L+ DTGS L +                   +F    S S+  I C  
Sbjct: 103 VRFRVGTPAQPFVLVADTGSDLTWVKCRGAGAAAGTGAGSPARVFRTAASKSWAPIACSS 162

Query: 44  PDCTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI-----------GKGE 84
             CT +       C +    C Y  +Y D S  +G    ++ ++              G 
Sbjct: 163 DTCTSYVPFSLANCSSPASPCAYDYRYRDGSAARGVVGTDSATIALSSGSGRGGGDSSGG 222

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
            +A   G + GC+    G    + DG    VL L    ISF S+  +    RFSYCLV  
Sbjct: 223 RRAKLQGVVLGCAATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDH 278

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFD 203
           L     T SYL FG   G   P+ Q    ++     FY +++  + +  E ++ P D +D
Sbjct: 279 LAPRNAT-SYLTFGP--GATAPAAQTPLLLDRRMTPFYAVTVDAVYVAGEALDIPADVWD 335

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
             V   GG I+DSG+ LT   +  Y  +      +        +    +P + CY   + 
Sbjct: 336 --VDRNGGAILDSGTSLTILATPAYRAVVTALSKHLAGLPRVTM----DPFEYCYNWTDA 389

Query: 264 FN-RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFV 321
                P M  +F  +         ++ID       + V       V++IG+  Q++  + 
Sbjct: 390 GALEIPKMEVHFAGSARLEPPAKSYVIDAAPGVKCIGVQEGSWPGVSVIGNILQQEHLWE 449

Query: 322 YDLNIDLLSFVKENCS 337
           +DL    L F    C+
Sbjct: 450 FDLRDRWLRFKHTRCA 465


>gi|115468912|ref|NP_001058055.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|51091964|dbj|BAD35493.1| putative nucleoid DNA-binding protein cnd41, chloroplast [Oryza
           sativa Japonica Group]
 gi|113596095|dbj|BAF19969.1| Os06g0610800 [Oryza sativa Japonica Group]
 gi|218198535|gb|EEC80962.1| hypothetical protein OsI_23679 [Oryza sativa Indica Group]
          Length = 519

 Score = 92.4 bits (228), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP  SS++  ++C  P 
Sbjct: 184 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 243

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 244 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 299

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 300 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPARSTGTGYLDFGAG-- 348

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P+T  T  +  N P  FYY+ +  I +    +   P  F        G I+DSG+V+
Sbjct: 349 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVFAAA-----GTIVDSGTVI 401

Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L   F +      + +     L D       CY F   +    P+++  F
Sbjct: 402 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 455

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
           +  A L +D   +      +    LA A ++D   V ++G+ Q +     YD+   ++ F
Sbjct: 456 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 514

Query: 332 VKENC 336
               C
Sbjct: 515 SPGAC 519


>gi|222640709|gb|EEE68841.1| hypothetical protein OsJ_27628 [Oryza sativa Japonica Group]
          Length = 375

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 91/338 (26%), Positives = 141/338 (41%), Gaps = 36/338 (10%)

Query: 15  LILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQC-VYTMKYADQSVTKGFAA 73
           LI+DTGS LI+      K SS       H      +    +   +T      +   G  A
Sbjct: 55  LIVDTGSDLIWTQC---KLSSSTAAAARHGSPPLSRTAPARTGAFTRTCTASAAAVGVLA 111

Query: 74  HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
            ET +    G  +A+     FGC   + G    A      G+LGLS  ++S I+QL    
Sbjct: 112 SETFTF---GARRAVSLRLGFGCGALSAGSLIGA-----TGILGLSPESLSLITQLK--- 160

Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QATKFINHP--NNFYYLSLKD 187
            +RFSYCL    P  +  +S L FG      R  T    Q T  +++P    +YY+ L  
Sbjct: 161 IQRFSYCLT---PFADKKTSPLLFGAMADLSRHKTTRPIQTTAIVSNPVETVYYYVPLVG 217

Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
           IS+ ++R+  P  +  +   G GG I+DSGS + Y     +  + E   +  +  +L   
Sbjct: 218 ISLGHKRLAVPAASLAMRPDGGGGTIVDSGSTVAYLVEAAFEAVKE---AVMDVVRLPVA 274

Query: 248 SDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAV 300
           +   E  +LC+ LP         A       L  DG    ++  +N+F         LAV
Sbjct: 275 NRTVEDYELCFVLPRRTAAAAMEAVQVPPLVLHFDGGAAMVLPRDNYFQEPRAGLMCLAV 334

Query: 301 APHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
               D   V++IG+ QQ++   ++D+     SF    C
Sbjct: 335 GKTTDGSGVSIIGNVQQQNMHVLFDVQHHKFSFAPTQC 372


>gi|242079765|ref|XP_002444651.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
 gi|241941001|gb|EES14146.1| hypothetical protein SORBIDRAFT_07g025440 [Sorghum bicolor]
          Length = 493

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 98/375 (26%), Positives = 161/375 (42%), Gaps = 65/375 (17%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           +GTP K   + +DTGS +++                    ++DP+ SS+   + CD   C
Sbjct: 94  LGTPPKRFYVQVDTGSDILWVNCITCDQCPHKSGLGLDLTLYDPKASSTGSTVMCDQGFC 153

Query: 47  T------YFKC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGC 96
                    KC  N  C Y++ Y D S T G   ++ +    V G G+ +      +FGC
Sbjct: 154 ADTFGGRLPKCSANVPCEYSVTYGDGSSTVGSFVNDALQFDQVTGDGQTQPANASVIFGC 213

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
                G D  +   AL G+LG      S +SQL +   +KK F++CL      G +    
Sbjct: 214 GA-QQGGDLGSSSQALDGILGFGEANTSMLSQLATAGKVKKIFAHCLDTIKGGGIFAIGD 272

Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GC 212
           +         +P  + T  + + P+  Y ++LK I +    +  P D F     GE  G 
Sbjct: 273 V--------VQPKVKTTPLVADKPH--YNVNLKTIDVGGTTLELPADIFK---PGEKRGT 319

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMA 271
           IIDSG+ LTY    V+ K+    ++ F + Q     D  +   LC+ +     + FP++ 
Sbjct: 320 IIDSGTTLTYLPELVFKKV---MLAVFNKHQDITFHDVQD--FLCFEYSGSVDDGFPTLT 374

Query: 272 FYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
           F+FE D  L +        +G +V+ + ++N    L      D+V L+G     +   VY
Sbjct: 375 FHFEDDLALHVYPHEYFFPNGNDVYCVGFQNG--ALQSKDGKDIV-LMGDLVLSNKLVVY 431

Query: 323 DLNIDLLSFVKENCS 337
           DL   ++ +   NCS
Sbjct: 432 DLENRVIGWTDYNCS 446


>gi|194696934|gb|ACF82551.1| unknown [Zea mays]
 gi|413936470|gb|AFW71021.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
 gi|413953801|gb|AFW86450.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 445

 Score = 92.0 bits (227), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           +VR+  GTP+   ++++DTGS + +                 ++DP  SS++  + C   
Sbjct: 114 VVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASD 173

Query: 45  DCTYFK-------CVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
            C           C + +QC + + YAD + T G  + + +++       AI     FGC
Sbjct: 174 VCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGC 229

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            +  H     A  G   GVLGL R+  S  ++ G +    FSYCL    P+      +L 
Sbjct: 230 GHGKH-----AVRGLFDGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLA 276

Query: 157 FGTDMGYRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            G     + PS    T        P  F  ++L  I++  ++++  P  F       GG 
Sbjct: 277 LGAG---KNPSGFVFTPMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGM 326

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
           I+DSG+V+T   S  Y  L   F    E ++L    D    +  CY L    N   P +A
Sbjct: 327 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 382

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             F   A + +D  N  ++   N     A +  D    ++G+  QR    ++D +     
Sbjct: 383 LTFTGGATINLDVPNGILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFG 439

Query: 331 FVKENC 336
           F  + C
Sbjct: 440 FRAKAC 445


>gi|326531454|dbj|BAJ97731.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 144/369 (39%), Gaps = 69/369 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP     + +DTGS + +                 +FDP KSS++  + C   
Sbjct: 144 VVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGAD 203

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C+  +     C   QC Y + Y D S T G    +T++ +  G     F   LFGC + 
Sbjct: 204 ACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA-LAPGNTVGTF---LFGCGHA 259

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G         + G+L L R ++S  SQ        FSYC    LP+ +  + YL  G 
Sbjct: 260 QAGMFA-----GIDGLLALGRQSMSLKSQAAGAYGGVFSYC----LPSKQSAAGYLTLGG 310

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                  +T           FY + L  IS+  +++  P   F       GG ++D+G+V
Sbjct: 311 PTSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTV 364

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
           +T      Y  L   F     R  +A       P    +  CY     F+R+     P++
Sbjct: 365 ITRLPPTAYAALRSAF-----RGAIAPYGYPSAPANGILDTCY----DFSRYGVVTLPTV 415

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNID 327
           A  F   A L ++   +           LA AP+  D   A++G+ QQR   F    +  
Sbjct: 416 ALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGS 467

Query: 328 LLSFVKENC 336
            + F+   C
Sbjct: 468 TVGFMPGAC 476


>gi|297805182|ref|XP_002870475.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297316311|gb|EFH46734.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 480

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 90/378 (23%), Positives = 161/378 (42%), Gaps = 67/378 (17%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ +G+P K   + +DTGS +++                   +++D + SS+ + + C+ 
Sbjct: 80  KIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKASSTSKNVGCED 139

Query: 44  PDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
             C++      C   + C Y + Y D S + G    + I+   V G      +    +FG
Sbjct: 140 AFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFVKDNITLDQVTGNLRTAPLAQEVVFG 199

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSS 153
           C  +  G      + A+ G++G  +   S ISQL  G  +K+ FS+CL      G +   
Sbjct: 200 CGKNQSG-QLGQTESAVDGIMGFGQSNTSVISQLAAGGSVKRIFSHCLDNMNGGGIFAIG 258

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            ++         P  + T  +  PN  +Y + LK + +D E ++ PP     + +G+GG 
Sbjct: 259 EVE--------SPVVKTTPLV--PNQVHYNVILKGMDVDGEPIDLPPSL--ASTNGDGGT 306

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           IIDSG+ L Y   ++Y  L EK  +     Q  +L    E      F   T   FP +  
Sbjct: 307 IIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVNL 362

Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTR 319
           +FED+            +LR   E+++   +++      +   D   V L+G     +  
Sbjct: 363 HFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNKL 415

Query: 320 FVYDLNIDLLSFVKENCS 337
            VYDL  +++ +   NCS
Sbjct: 416 VVYDLENEVIGWADHNCS 433


>gi|242069057|ref|XP_002449805.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
 gi|241935648|gb|EES08793.1| hypothetical protein SORBIDRAFT_05g023600 [Sorghum bicolor]
          Length = 430

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 99/372 (26%), Positives = 158/372 (42%), Gaps = 62/372 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP    + + DTGS L +               I+D   SSSF  + C    C
Sbjct: 84  LMELAIGTPPVPFIALADTGSDLTWTQCKPCKLCFGQDTPIYDTTTSSSFSPLPCSSATC 143

Query: 47  TYF---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                 +C   +  C Y   Y D + +   A    ISV           G  FGC  DN 
Sbjct: 144 LPIWSSRCSTPSATCRYRYAYDDGAYSPECAG---ISV----------GGIAFGCGVDNG 190

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G   ++      G +GL R ++S ++QLG     +FSYCL          SS + FG+  
Sbjct: 191 GLSYNS-----TGTVGLGRGSLSLVAQLG---VGKFSYCLTDFF--NTSLSSPVFFGSLA 240

Query: 162 GYRRPS-------TQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVS-GEGG 211
                S        Q+T  +  P N   YY+SL+ IS+ + R+  P  TFD+    G GG
Sbjct: 241 ELAASSASADAAVVQSTPLVQSPYNPSRYYVSLEGISLGDARLPIPNGTFDLNDDDGSGG 300

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS---DC-PEPIQLCYFLPETFNRF 267
            I+DSG++ T      +  + +       +  +   S    C P P      LP+     
Sbjct: 301 MIVDSGTIFTILVETGFRVVVDHVAGVLGQPVVNASSLDRPCFPAPAAGVQELPD----M 356

Query: 268 PSMAFYFED-ANLRIDGENVFIIDYENHFFLLAVAPHDDLV-ALIGSQQQRDTRFVYDLN 325
           P M  +F   A++R+  +N    + E   F L +   +    +++G+ QQ++ + ++D+ 
Sbjct: 357 PDMVLHFAGGADMRLHRDNYMSFNEEESSFCLNIVGTESASGSVLGNFQQQNIQMLFDIT 416

Query: 326 IDLLSFVKENCS 337
           +  LSF+  +CS
Sbjct: 417 VGQLSFMPTDCS 428


>gi|219362525|ref|NP_001136612.1| uncharacterized protein LOC100216735 [Zea mays]
 gi|194696366|gb|ACF82267.1| unknown [Zea mays]
 gi|413953802|gb|AFW86451.1| hypothetical protein ZEAMMB73_488726 [Zea mays]
          Length = 411

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 90/366 (24%), Positives = 149/366 (40%), Gaps = 64/366 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           +VR+  GTP+   ++++DTGS + +                 ++DP  SS++  + C   
Sbjct: 80  VVRVSFGTPAVPQVVVIDTGSDVSWLQCKPCSSGQCFPQKDPLYDPSHSSTYSAVPCASD 139

Query: 45  DCTYFK-------CVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
            C           C + +QC + + YAD + T G  + + +++       AI     FGC
Sbjct: 140 VCKKLAADAYGSGCTSGKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGC 195

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            +  H     A  G   GVLGL R+  S  ++ G +    FSYCL    P+      +L 
Sbjct: 196 GHGKH-----AVRGLFDGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLA 242

Query: 157 FGTDMGYRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            G     + PS    T        P  F  ++L  I++  ++++  P  F       GG 
Sbjct: 243 LGAG---KNPSGFVFTPMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGM 292

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
           I+DSG+V+T   S  Y  L   F    E ++L    D    +  CY L    N   P +A
Sbjct: 293 IVDSGTVITGLQSTAYRALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIA 348

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             F   A + +D  N  ++   N     A +  D    ++G+  QR    ++D +     
Sbjct: 349 LTFTGGATINLDVPNGILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFG 405

Query: 331 FVKENC 336
           F  + C
Sbjct: 406 FRAKAC 411


>gi|326495920|dbj|BAJ90582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 144/369 (39%), Gaps = 69/369 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP     + +DTGS + +                 +FDP KSS++  + C   
Sbjct: 144 VVTVSLGTPGVSQTVEVDTGSDVSWVQCKPCSAPACNSQRDQLFDPAKSSTYSAVPCGAD 203

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C+  +     C   QC Y + Y D S T G    +T++ +  G     F   LFGC + 
Sbjct: 204 ACSELRIYEAGCSGSQCGYVVSYGDGSNTTGVYGSDTLA-LAPGNTVGTF---LFGCGHA 259

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G         + G+L L R ++S  SQ        FSYC    LP+ +  + YL  G 
Sbjct: 260 QAGMFA-----GIDGLLALGRQSMSLKSQAAGAYGGVFSYC----LPSKQSAAGYLTLGG 310

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                  +T           FY + L  IS+  +++  P   F       GG ++D+G+V
Sbjct: 311 PSSASGFATTGLLTAWAAPTFYMVMLTGISVGGQQVAVPASAF------AGGTVVDTGTV 364

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
           +T      Y  L   F     R  +A       P    +  CY     F+R+     P++
Sbjct: 365 ITRLPPTAYAALRSAF-----RGAIAPCGYPSAPANGILDTCY----DFSRYGVVTLPTV 415

Query: 271 AFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNID 327
           A  F   A L ++   +           LA AP+  D   A++G+ QQR   F    +  
Sbjct: 416 ALTFSGGATLALEAPGIL------SSGCLAFAPNGGDGDAAILGNVQQRS--FAVRFDGS 467

Query: 328 LLSFVKENC 336
            + F+   C
Sbjct: 468 TVGFMPGAC 476


>gi|225458774|ref|XP_002283258.1| PREDICTED: aspartic proteinase-like protein 2 [Vitis vinifera]
 gi|302142232|emb|CBI19435.3| unnamed protein product [Vitis vinifera]
          Length = 659

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 89/363 (24%), Positives = 149/363 (41%), Gaps = 47/363 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P +SS++  + C+  DC 
Sbjct: 90  TRLWIGTPPQEFALIVDTGSTVTYVPCSDCEHCGKHQDPRFQPDESSTYHPVKCNM-DC- 147

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C ++   CVY  +YA+ S + G    + IS   + E   +   A+FGC N   G   
Sbjct: 148 --NCDHDGVNCVYERRYAEMSSSSGVLGEDIISFGNQSE--VVPQRAVFGCENVETG--- 200

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
           D       G++GL R  +S + QL   ++I   FS C   + +  G      +    DM 
Sbjct: 201 DLYSQRADGIMGLGRGQLSIVDQLVDKNVINDSFSLCYGGMHVGGGAMVLGGIPPPPDMV 260

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           + R     +        +Y + LK+I +  + +   P TFD     + G ++DSG+   Y
Sbjct: 261 FSRSDPYRSP-------YYNIELKEIHVAGKPLKLSPSTFD----RKHGTVLDSGTTYAY 309

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
              + +    +  +      +     D P    +C+      + +    FP +   F + 
Sbjct: 310 LPEEAFVAFRDAIIKKSHNLKQIHGPD-PNYNDICFSGAGRDVSQLSKAFPEVDMVFSNG 368

Query: 278 N-LRIDGENVFIIDYENH-FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
             L +  EN      + H  + L +  + D   L+G    R+T   YD   + + F K N
Sbjct: 369 QKLSLTPENYLFQHTKVHGAYCLGIFRNGDSTTLLGGIIVRNTLVTYDRENEKIGFWKTN 428

Query: 336 CSD 338
           CS+
Sbjct: 429 CSE 431


>gi|388518257|gb|AFK47190.1| unknown [Lotus japonicus]
          Length = 478

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/377 (24%), Positives = 168/377 (44%), Gaps = 60/377 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ +G+PSK   + +DTGS +++                    ++DP++S + + ++C+H
Sbjct: 72  KIGLGSPSKDYYVQVDTGSDILWVNCVECTRCPRKSDIGIGLTLYDPKRSKTSEFVSCEH 131

Query: 44  PDC--TY----FKCVNEQ-CVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
             C  TY      C  E  C Y++ Y D S T G+   + ++   V G           +
Sbjct: 132 NFCSSTYEGRILGCKAENPCPYSISYGDGSATTGYYVQDYLTFNRVNGNPHTATQNSSII 191

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G    + + AL G++G  +   S +SQL +   +KK FS+CL   +  G ++
Sbjct: 192 FGCGAAQSGTFASSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCLDTNVGGGIFS 251

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
              +          P  + T  +  PN  +Y + LK+I +D + +  P DTFD + +G+ 
Sbjct: 252 IGEV--------VEPKVKTTPLV--PNMAHYNVILKNIEVDGDILQLPSDTFD-SENGK- 299

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
           G +IDSG+ L Y    VY +L  K ++   R ++  + +     Q   +     + FP +
Sbjct: 300 GTVIDSGTTLAYLPRIVYDQLMSKVLAKQPRLKVYLVEEQYSCFQ---YTGNVDSGFPIV 356

Query: 271 AFYFEDA-NLRI---------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
             +FED+ +L +          G++ + I ++      +   +   + L+G     +   
Sbjct: 357 KLHFEDSLSLTVYPHDYLFNYKGDSYWCIGWQKS---ASETKNGKDMTLLGDFVLSNKLV 413

Query: 321 VYDLNIDLLSFVKENCS 337
           VYDL    + +   NCS
Sbjct: 414 VYDLENMTIGWTDYNCS 430


>gi|255566008|ref|XP_002523992.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536719|gb|EEF38360.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 442

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 157/360 (43%), Gaps = 41/360 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           ++++ IG P   +L+ + TGS L++                 FDP +SS+++ + CD   
Sbjct: 99  LMKISIGIPPTELLVNVATGSDLVWIPCLSFKPCTHNCDLRFFDPMESSTYKNVPCDSYR 158

Query: 46  CTYFK---CVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           C       C    C Y+     Q S   G  A +T+++        +     F C N   
Sbjct: 159 CQITNAATCQFSDCFYSCDPRHQDSCPDGDLAMDTLTLNSTTGKSFMLPNTGFICGNRIG 218

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G      D    G+LGL   ++S ++++  +I  +FS+C+V   P     +S L FG   
Sbjct: 219 G------DYPGVGILGLGHGSLSLLNRISHLIDGKFSHCIV---PYSSNQTSKLSFGDKA 269

Query: 162 GYRRPSTQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
                +  +T+       + Y LS   IS+ N+ ++      D  ++G G   +DSG++ 
Sbjct: 270 VVSGSAMFSTRLDMTGGPYSYTLSFYGISVGNKSISAGGIGSDYYMNGLG---MDSGTMF 326

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP---IQLCYFLPETFNRFPSMAFYFEDA 277
           TYF    Y +L      Y  R+ + Q    P+P   ++LCY     F+  P++  +FE  
Sbjct: 327 TYFPEYFYSQLE-----YDVRYAIQQEPLYPDPTRRLRLCYRYSPDFSP-PTITMHFEGG 380

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++ +   N FI   E+   L       +  A+ G  QQ +    YDL+   LSF+K +C+
Sbjct: 381 SVELSSSNSFIRMTEDIVCLAFATSSSEQDAVFGYWQQTNLLIGYDLDAGFLSFLKTDCT 440


>gi|125543284|gb|EAY89423.1| hypothetical protein OsI_10930 [Oryza sativa Indica Group]
          Length = 447

 Score = 92.0 bits (227), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 98/395 (24%), Positives = 157/395 (39%), Gaps = 75/395 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYF 49
           V + +GTP + V ++LDTGS L + +            F+   SSS+  + C    C + 
Sbjct: 57  VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTPAFNASGSSSYGAVPCPSTACEWR 116

Query: 50  K--------C---VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC-- 96
                    C    +  C  ++ YAD S   G  A +T  + G     A+  GA FGC  
Sbjct: 117 GRDLPVPPFCDTPPSNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCIT 174

Query: 97  ------SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
                 + +++G   D  + A  G+LG++R T+SF++Q G+   +RF+YC+      GE 
Sbjct: 175 SYSSTTATNSNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE- 225

Query: 151 TSSYLKFGTDMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
               L  G D G   P        I+ P  +     Y + L+ I +    +  P      
Sbjct: 226 GPGVLLLGDDGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTP 285

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLC 257
             +G G  ++DSG+  T+  +D Y  L  +F S   R  LA L    EP          C
Sbjct: 286 DHTGAGQTMVDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDAC 341

Query: 258 YFLPE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDD 305
           +  PE          P +      A + + GE  ++++  E      A A       + D
Sbjct: 342 FRGPEARVAAASGLLPVVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSD 401

Query: 306 LVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +    IG   Q++    YDL    + F    C
Sbjct: 402 MAGMSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 436


>gi|242057809|ref|XP_002458050.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
 gi|241930025|gb|EES03170.1| hypothetical protein SORBIDRAFT_03g026140 [Sorghum bicolor]
          Length = 489

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 161/379 (42%), Gaps = 73/379 (19%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
           +GTP K   + +DTGS +++                     +DP+ SSS   ++CD    
Sbjct: 90  LGTPPKRYYVQVDTGSDILWVNCISCEKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFC 149

Query: 44  --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
                   P CT     N  C Y++ Y D S T GF   + +    V G G+ +      
Sbjct: 150 AATYGGKLPGCT----ANVPCEYSVMYGDGSSTTGFFVTDALQFDQVTGDGQTQPGNATV 205

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC     G D  + + AL G+LG  +   S +SQL +   +KK F++CL      G +
Sbjct: 206 TFGCGA-QQGGDLGSSNQALDGILGFGQANTSMLSQLAAAGKVKKIFAHCLDTIKGGGIF 264

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +         +P  + T  + + P+  Y ++LK I +    +  P   F+   +GE
Sbjct: 265 AIGNV--------VQPKVKTTPLVADMPH--YNVNLKSIDVGGTTLQLPAHVFE---TGE 311

Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRF 267
             G IIDSG+ LTY    V+ ++     +  +      + D      +C+  P +  + F
Sbjct: 312 RKGTIIDSGTTLTYLPELVFKEVMAAIFNKHQDIVFHNVQDF-----MCFQYPGSVDDGF 366

Query: 268 PSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
           P++ F+FE D  L +        +G +++ + ++N    L      D+V L+G     + 
Sbjct: 367 PTITFHFEDDLALHVYPHEYFFPNGNDMYCVGFQNG--ALQSKDGKDIV-LMGDLVLSNK 423

Query: 319 RFVYDLNIDLLSFVKENCS 337
             +YDL   ++ +   NCS
Sbjct: 424 LVIYDLENQVIGWTDYNCS 442


>gi|225216879|gb|ACN85177.1| aspartic proteinase nepenthesin-1 precursor [Oryza nivara]
 gi|225216896|gb|ACN85193.1| aspartic proteinase nepenthesin-1 precursor [Oryza rufipogon]
          Length = 515

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP  SS++  ++C  P 
Sbjct: 180 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 239

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 240 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 295

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 296 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPARSTGTGYLDFGAG-- 344

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P+T  T  +  N P  FYY+ +  I +    +   P  F        G I+DSG+V+
Sbjct: 345 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVI 397

Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L   F +      + +     L D       CY F   +    P+++  F
Sbjct: 398 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 451

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
           +  A L +D   +      +    LA A ++D   V ++G+ Q +     YD+   ++ F
Sbjct: 452 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 510

Query: 332 VKENC 336
               C
Sbjct: 511 SPGAC 515


>gi|225216914|gb|ACN85210.1| aspartic proteinase nepenthesin-1 precursor [Oryza glaberrima]
          Length = 516

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 146/365 (40%), Gaps = 58/365 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP  SS++  ++C  P 
Sbjct: 181 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVACYEQREKLFDPASSSTYANVSCAAPA 240

Query: 46  CTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 241 CSDLDVSGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNDG 296

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYGKYGGVFAHC----LPPRSTGTGYLDFGAG-- 345

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P+T  T  +  N P  FYY+ +  I +    +   P  F        G I+DSG+V+
Sbjct: 346 -SPPATTTTPMLTGNGP-TFYYVGMTGIRVGGRLLPIAPSVF-----AAAGTIVDSGTVI 398

Query: 221 TYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L   F +      + +     L D       CY F   +    P+++  F
Sbjct: 399 TRLPPAAYSSLRSAFAAAMAARGYRKAAAVSLLD------TCYDFTGMSQVAIPTVSLLF 452

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSF 331
           +  A L +D   +      +    LA A ++D   V ++G+ Q +     YD+   ++ F
Sbjct: 453 QGGAALDVDASGIMYTVSASQ-VCLAFAGNEDGGDVGIVGNTQLKTFGVAYDIGKKVVGF 511

Query: 332 VKENC 336
               C
Sbjct: 512 SPGAC 516


>gi|359478045|ref|XP_002267046.2| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 502

 Score = 91.7 bits (226), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 99/393 (25%), Positives = 166/393 (42%), Gaps = 94/393 (23%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP++   + +DTGS +++                    ++D ++S + + ++CD
Sbjct: 100 AKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCD 159

Query: 43  HPDCTYFK------CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              C          C+ N  C YT  YAD S + G+   + +    V G  E  +     
Sbjct: 160 QDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSV 219

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGCS    G  + + + AL G+LG  +   S ISQL S   ++K F++CL      G +
Sbjct: 220 IFGCSATQSG--DLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
              ++         +P    T  +  PN  +Y +++K + +    +N P D FD  V  +
Sbjct: 278 AIGHI--------VQPKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDK 325

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRF 267
            G IIDSG+ L Y    VY +L  K  S+    ++  + D     Q  C+   E+  + F
Sbjct: 326 KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGF 380

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-----DLVALIGSQ----QQRDT 318
           P++ F+FE++                    L V PH+     D +  IG Q    Q RD 
Sbjct: 381 PAVTFHFENS------------------LYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDR 422

Query: 319 R--------------FVYDLNIDLLSFVKENCS 337
           R               +YDL   ++ + + NCS
Sbjct: 423 RNITLLGDLALSNKLVLYDLENQVIGWTEYNCS 455


>gi|357482719|ref|XP_003611646.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355512981|gb|AES94604.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 640

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 149/364 (40%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  S ++Q + C  PDC 
Sbjct: 91  TRLWIGTPPQRFALIVDTGSTVTYVPCSTCEHCGRHQDPKFQPDLSETYQPVKCT-PDCN 149

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
                N QC+Y  +YA+ S + G    + +S     E       A+FGC ND  G   D 
Sbjct: 150 CDGDTN-QCMYDRQYAEMSSSSGVLGEDVVSFGNLSELAP--QRAVFGCENDETG---DL 203

Query: 108 RDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMGYR 164
                 G++GL R  +S + QL    +I   FS C   + +  G      +    DM + 
Sbjct: 204 YSQRADGIMGLGRGDLSIMDQLVDKKVISDSFSLCYGGMDVGGGAMILGGISPPEDMVFT 263

Query: 165 RPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
                 +        +Y ++LK++ +  +++   P  FD    G+ G ++DSG+   Y  
Sbjct: 264 HSDPDRSP-------YYNINLKEMHVAGKKLQLNPKVFD----GKHGTVLDSGTTYAYLP 312

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFYFEDA 277
              +       +   ER  L Q+ + P+P    +C+      + +    FP +   FE+ 
Sbjct: 313 ETAFLAFKRAIMK--ERNSLKQI-NGPDPNYKDICFTGAGIDVSQLAKSFPVVDMVFENG 369

Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
           + L +  EN           + L   +   D   L+G    R+T  +YD     + F K 
Sbjct: 370 HKLSLSPENYLFRHSKVRGAYCLGVFSNGRDPTTLLGGIFVRNTLVMYDRENSKIGFWKT 429

Query: 335 NCSD 338
           NCS+
Sbjct: 430 NCSE 433


>gi|326531368|dbj|BAK05035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 412

 Score = 91.7 bits (226), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 88/318 (27%), Positives = 141/318 (44%), Gaps = 27/318 (8%)

Query: 27  IFDPRKSSSFQKINCDHPDCT--YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
           +F P  S +F  ++ + P CT  Y K  N  C +       S   G+ + +T   +  G 
Sbjct: 113 LFSPGASPTFHGVHSNDPVCTVPYRKTAN-GCSFHF-----SSITGYLSRDTFH-LRTGR 165

Query: 85  GKAIFHG---ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
             A+       +FGC++ + GF  D     L GVL LS + +S ++QLG+    RFSYCL
Sbjct: 166 AGAVRESIPRVVFGCAHSSTGFHND---NTLGGVLSLSHLPLSLLTQLGAHASGRFSYCL 222

Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPD 200
             P   G      L  G D+    P +  T  + HP  + Y+L+L  I+   +R+     
Sbjct: 223 --PKSTGHNPHGSLFLGADVPSPPPHSHTTNLVIHPGVSGYHLNLIGITRGYKRLK---- 276

Query: 201 TFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYF 259
             D  V     C I+    +T+    +Y  + +  V+  +     ++   P  P+     
Sbjct: 277 -IDKRVLVSHSCSINPAETITHIAEPIYLVVEKALVARMKELGSDRVKGPPGGPLWFDRM 335

Query: 260 LPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDT 318
                 + P+MAF+FE  A L    + +F +   N  F++A   +   V  IG+ QQ +T
Sbjct: 336 YQSVKEQLPNMAFHFEGGAELWFTSDRLFEVHGMNARFMVAGRGYRRTV--IGAAQQVNT 393

Query: 319 RFVYDLNIDLLSFVKENC 336
           RF +D+    LSFV E C
Sbjct: 394 RFTFDVARGKLSFVSEVC 411


>gi|359497446|ref|XP_003635520.1| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 354

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 85/366 (23%), Positives = 154/366 (42%), Gaps = 58/366 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V++ +G+P++   +I+DTGS+L +                +FDP  S +++ ++C    C
Sbjct: 15  VKVGLGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQC 74

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           +            +  +  CVYT  Y D S + G+ + + +++           G ++GC
Sbjct: 75  SSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL----APSQTLPGFVYGC 130

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             D+ G       G  AG+LGL R  +S + Q+ S     FSYCL  P   G     +L 
Sbjct: 131 GQDSEGLF-----GRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGG---GGFLS 180

Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G        + + T     P N   Y+L L  I++    +      + +        II
Sbjct: 181 IG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------II 233

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMA 271
           DSG+V+T     VY    + FV    +   ++ +  P    +  C+    +     P + 
Sbjct: 234 DSGTVITRLPMSVYTPFQQAFV----KIMSSKYARAPGFSILDTCFKGNLKDMQSVPEVR 289

Query: 272 FYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             F+  A+L +   NV ++  +     LA A ++  VA+IG+ QQ+  +  +D++   + 
Sbjct: 290 LIFQGGADLNLRPVNV-LLQVDEGLTCLAFAGNNG-VAIIGNHQQQTFKVAHDISTARIG 347

Query: 331 FVKENC 336
           F    C
Sbjct: 348 FATGGC 353


>gi|356523171|ref|XP_003530215.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 442

 Score = 91.3 bits (225), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 144/323 (44%), Gaps = 53/323 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL-------------IYAIFDPRKSSSFQKINCDHPDCTY 48
           + + +GTP + + +++DTGS L              Y  F+P  SSS+  I+C  P CT 
Sbjct: 68  ISITVGTPPQNMSMVIDTGSELSWLHCNTNTTATIPYPFFNPNISSSYTPISCSSPTCTT 127

Query: 49  --------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                     C  N  C  T+ YAD S ++G  A +T      G G +   G +FGC N 
Sbjct: 128 RTRDFPIPASCDSNNLCHATLSYADASSSEGNLASDTF-----GFGSSFNPGIVFGCMNS 182

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-------VIPLPNGEYTS 152
           ++  + ++ D    G++G++  ++S +SQL      +FSYC+       ++ L  GE   
Sbjct: 183 SYSTNSES-DSNTTGLMGMNLGSLSLVSQLK---IPKFSYCISGSDFSGILLL--GE--- 233

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           S   +G  + Y      +T       + Y + L+ I I ++ +N   + F    +G G  
Sbjct: 234 SNFSWGGSLNYTPLVQISTPLPYFDRSAYTVRLEGIKISDKLLNISGNLFVPDHTGAGQT 293

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---ETF 264
           + D G+  +Y    VY  L ++F++      L  L D P       + LCY +P      
Sbjct: 294 MFDLGTQFSYLLGPVYNALRDEFLNQ-TNGTLRALDD-PNFVFQIAMDLCYRVPVNQSEL 351

Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
              PS++  FE A +R+ G+ + 
Sbjct: 352 PELPSVSLVFEGAEMRVFGDQLL 374


>gi|357131735|ref|XP_003567490.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 458

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 86/370 (23%), Positives = 151/370 (40%), Gaps = 47/370 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
           + R  +GTP + +L+ +D  +   +                 FDP +SS+++ + C  P 
Sbjct: 101 VARARLGTPPQTLLVAIDPSNDAAWVPCSACLGCAPGASSPSFDPTQSSTYRPVRCGAPQ 160

Query: 46  CTYFKCVNEQC--------VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA-LFGC 96
           C         C         + + YA  ++       + +S +    G A+      FGC
Sbjct: 161 CAQVPPATPSCPAGPGASCAFNLSYASSTL-HAVLGQDALS-LSDSNGAAVPDDHYTFGC 218

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
                G           G++G  R  +SF+SQ  +     FSYCL  P       S  L+
Sbjct: 219 LRVVTGSGGSVPP---QGLVGFGRGPLSFLSQTKATYGSIFSYCL--PSYKSSNFSGTLR 273

Query: 157 FGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCI 213
            G     RR   + T  +++P+  + YY+++  + ++ + +  P     +   +G GG I
Sbjct: 274 LGPAGQPRR--IKTTPLLSNPHRPSLYYVAMVGVRVNGKAVPIPASALALDAATGRGGTI 331

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
           +D+G++ T      Y  L       F R   A  +        CY++  T    P++AF 
Sbjct: 332 VDAGTMFTRLSPPAYAALRNA----FRRGVSAPAAPALGGFDTCYYVNGT-KSVPAVAFV 386

Query: 274 FE-DANLRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNID 327
           F   A + +  ENV I         LA+A  P D + A   ++ S QQ++ R V+D+   
Sbjct: 387 FAGGARVTLPEENVVISSTSGGVACLAMAAGPSDGVNAGLNVLASMQQQNHRVVFDVGNG 446

Query: 328 LLSFVKENCS 337
            + F +E C+
Sbjct: 447 RVGFSRELCT 456


>gi|255581508|ref|XP_002531560.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223528821|gb|EEF30826.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 407

 Score = 91.3 bits (225), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 94/382 (24%), Positives = 159/382 (41%), Gaps = 65/382 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCTY 48
           V L +GTP + V +++DTGS L +               F+  +S S++ I C    CT 
Sbjct: 33  VSLTVGTPPQNVSMVIDTGSELSWLYCNKTTTTTSYPTTFNQTRSISYRPIPCSSSTCTN 92

Query: 49  --------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                     C  N  C  T+ YAD S ++G  A +T  +     G +   G +FGC + 
Sbjct: 93  QTRDFSIPASCDSNSLCHATLSYADASSSEGNLASDTFHM-----GASDIPGMVFGCMDS 147

Query: 100 --NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             +   DED+++    G++G++R ++SF+SQ+G     +FSYC+     +G   S  L  
Sbjct: 148 VFSSNSDEDSKN---TGLMGMNRGSLSFVSQMG---FPKFSYCI-----SGTDFSGMLLL 196

Query: 158 G-------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
           G         + Y      +T         Y + L+ I + +  +  P   F+   +G G
Sbjct: 197 GESNFTWAVPLNYTPLVQISTPLPYFDRIAYTVQLEGIKVSDRLLPIPKSVFEPDHTGAG 256

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP---E 262
             ++DSG+  T+     Y  L  +F++    F L  L D P+      + LCY +P    
Sbjct: 257 QTMVDSGTQFTFLLGPAYTALRSEFLNQTTGF-LRVLED-PDFVFQGAMDLCYRVPISQR 314

Query: 263 TFNRFPSMAFYFEDANLRIDGENVFI-----IDYENHFFLLAVAPHDDL---VALIGSQQ 314
              R P+++  F  A + +  E V       I   +    L+    D L     +IG   
Sbjct: 315 VLPRLPTVSLVFNGAEMTVADERVLYRVPGEIRGNDSVHCLSFGNSDLLGVEAYVIGHHH 374

Query: 315 QRDTRFVYDLNIDLLSFVKENC 336
           Q++    +DL    +   +  C
Sbjct: 375 QQNVWMEFDLERSRIGLAQVRC 396


>gi|357476337|ref|XP_003608454.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355509509|gb|AES90651.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 683

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 158/366 (43%), Gaps = 52/366 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  SS++Q + C   DC 
Sbjct: 83  TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPDLSSTYQPVKCTL-DC- 140

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C N+  QCVY  +YA+ S + G    + +S   + E       A+FGC N   G   
Sbjct: 141 --NCDNDRMQCVYERQYAEMSTSSGVLGEDVVSFGNQSELAP--QRAVFGCENVETG--- 193

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
           D       G++GL R  +S + QL   +++   FS C   + +  G      +   +DM 
Sbjct: 194 DLYSQHADGIMGLGRGDLSIMDQLVDKNVVSDSFSLCYGGMDVGGGAMVLGGISPPSDMV 253

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           + +     +  +  P  +Y + LK+I +  +R+   P  FD    G+ G ++DSG+   Y
Sbjct: 254 FAQ-----SDPVRSP--YYNIDLKEIHVAGKRLPLNPSVFD----GKHGSVLDSGTTYAY 302

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFYFE 275
              + +    E  V   + F  +Q+S  P+P    LC+      + +    FP +   F 
Sbjct: 303 LPEEAFLAFKEAIVKELQSF--SQISG-PDPNYNDLCFSGAGIDVSQLSKTFPVVDMIFG 359

Query: 276 DAN-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           + +   +  EN +F        + L +  +  D   L+G    R+T  +YD     + F 
Sbjct: 360 NGHKYSLSPENYMFRHSKVRGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDREQTKIGFW 419

Query: 333 KENCSD 338
           K NC++
Sbjct: 420 KTNCAE 425


>gi|242066168|ref|XP_002454373.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
 gi|241934204|gb|EES07349.1| hypothetical protein SORBIDRAFT_04g029630 [Sorghum bicolor]
          Length = 458

 Score = 91.3 bits (225), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 86/362 (23%), Positives = 154/362 (42%), Gaps = 50/362 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +F+PR SSS+  ++C  P 
Sbjct: 122 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPRSSSSYASVSCSAPQ 181

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C               +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 182 CDALTTATLNPSTCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 236

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL     +   +     
Sbjct: 237 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCLPT---SSSSSGYLSI 288

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +  A   ++  ++ Y++ +  I++  + ++     +    +     IIDS
Sbjct: 289 GSYNPGQYSYTPMAKSSLD--DSLYFIKMTGITVAGKPLSVSASAYSSLPT-----IIDS 341

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE- 275
           G+V+T   +DVY  L +      +    A        +  C+    +  R P ++  F  
Sbjct: 342 GTVITRLPTDVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQASRLRVPQVSMAFAG 398

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A L++   N  ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    
Sbjct: 399 GAALKLKATN-LLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGG 456

Query: 336 CS 337
           CS
Sbjct: 457 CS 458


>gi|125536523|gb|EAY83011.1| hypothetical protein OsI_38231 [Oryza sativa Indica Group]
          Length = 469

 Score = 90.9 bits (224), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 82/328 (25%), Positives = 143/328 (43%), Gaps = 43/328 (13%)

Query: 28  FDPRKSSSFQKINCDHPDCTYFKCVNEQCV----------------YTMKYADQSV-TKG 70
           F P  S++F  + C    C     + E C                 Y++ Y   +  T G
Sbjct: 134 FRPNGSATFSPLPCSSDMC--LPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191

Query: 71  FAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG 130
           + A +T +      G     G +FGCS+ ++G    A     +GV+G+ R  +S ISQL 
Sbjct: 192 YLATDTFTF-----GATAVPGVVFGCSDASYGDFAGA-----SGVIGIGRGNLSLISQL- 240

Query: 131 SIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKD 187
                +FSY L+ P    + ++ S ++FG D   +    Q+T  ++     +FYY++L  
Sbjct: 241 --QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGQSTPLLSSTLYPDFYYVNLTG 298

Query: 188 ISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
           + +D  R++  P  TFD+  +G GG I+ S + +TY     Y  +     S   R  L  
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS---RIGLPA 355

Query: 247 LSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH 303
           ++      + LCY        + P +   F+  A++ +   N F ID +     L + P 
Sbjct: 356 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 415

Query: 304 DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
               +++G+  Q  T  +YD++   L+F
Sbjct: 416 QG-GSVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|359476191|ref|XP_003631801.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 439

 Score = 90.9 bits (224), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 93/349 (26%), Positives = 149/349 (42%), Gaps = 51/349 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
           +V +  GTP +   LILDTGS++ +                     T  K    +  Y M
Sbjct: 129 LVDVAFGTPPQNFTLILDTGSSITW---------------------TQCKACTVENNYNM 167

Query: 61  KYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSR 120
            Y D S + G    +T+++    E   +F    FG   +N G   D   G + G+LGL +
Sbjct: 168 TYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGRGRNNKG---DFGSG-VDGMLGLGQ 219

Query: 121 VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--- 177
             +S +SQ  S   K FSYCL    P  +   S L FG     +  S + T  +N P   
Sbjct: 220 GQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQSSSLKFTSLVNGPGTL 274

Query: 178 --NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF 235
             + +Y+++L DIS+ NER+N P   F        G IIDS +V+T      Y  L   F
Sbjct: 275 QESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITRLPQRAYSALKAAF 329

Query: 236 VSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYE 292
                ++ L+       + +  CY L    +   P +  +F   A++R++G N+     E
Sbjct: 330 KKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGGGADVRLNGTNIVWGSDE 389

Query: 293 NHFFLL----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +   L     + +  +  + +IG++QQ     +YD+    + F    CS
Sbjct: 390 SRLCLAFAGNSKSTMNPELTIIGNRQQLSLTVLYDIQGGRIGFRSNGCS 438


>gi|115441003|ref|NP_001044781.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|19571042|dbj|BAB86469.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|20160609|dbj|BAB89555.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
 gi|113534312|dbj|BAF06695.1| Os01g0844500 [Oryza sativa Japonica Group]
 gi|125572614|gb|EAZ14129.1| hypothetical protein OsJ_04051 [Oryza sativa Japonica Group]
          Length = 442

 Score = 90.5 bits (223), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 85/323 (26%), Positives = 139/323 (43%), Gaps = 53/323 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
           V L +GTP + V ++LDTGS L + +                F PR S +F  + CD   
Sbjct: 68  VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCDSAQ 127

Query: 46  C------TYFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C      +   C   ++QC  ++ YAD S + G  A E  +V   G+G  +   A FGC 
Sbjct: 128 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR--AAFGCM 182

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
                FD      A AG+LG++R  +SF+SQ  +   +RFSYC+     +    +  L  
Sbjct: 183 AT--AFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCI-----SDRDDAGVLLL 232

Query: 158 G-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G +D+ +      P  Q    + + +   Y + L  I +  + +  P        +G G 
Sbjct: 233 GHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQ 292

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC----PEPIQLCYFLPE---TF 264
            ++DSG+  T+   D Y  L  +F S   +  L  L+D      E    C+ +P+     
Sbjct: 293 TMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPP 351

Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
            R P++   F  A + + G+ + 
Sbjct: 352 ARLPAVTLLFNGAQMTVAGDRLL 374


>gi|326501496|dbj|BAK02537.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 476

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 96/392 (24%), Positives = 151/392 (38%), Gaps = 68/392 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDH 43
           VR  +GTP++  LL+ DTGS L +                    F P  S ++  I+C  
Sbjct: 96  VRFRVGTPAQPFLLVADTGSDLTWVKCRRPAANSSESGSGSGRAFRPEDSRTWAPISCAS 155

Query: 44  PDCT------YFKCVN--EQCVYTMKYADQSVTKGFAAHE--TISVIGKG--EGKAIFHG 91
             CT         C      C Y  +Y D S  +G    E  TI++ G+G  E KA   G
Sbjct: 156 DTCTKSLPFSLATCPTPGSPCAYDYRYKDGSAARGTVGTESATIALSGRGREERKAKLKG 215

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
            + GC++   G   +  DG    VL L    +SF S   S    RFSYCLV  L +    
Sbjct: 216 LVLGCTSSYTGPSFEVSDG----VLSLGYSDVSFASHAASRFAGRFSYCLVDHL-SPRNA 270

Query: 152 SSYLKFGTD---------------------MGYRRPSTQATKFINHP-NNFYYLSLKDIS 189
           +SYL FG +                        R  + Q    ++     FY +++K +S
Sbjct: 271 TSYLTFGPNPAVASSSSPSSPAPASCTAAAPRPRPRARQTPLLLDRRMRPFYDVAVKAVS 330

Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
           +  + +  P   +D+     GG I+DSG+ LT      Y  +               +  
Sbjct: 331 VAGQFLKIPRAVWDVDAG--GGVILDSGTSLTVLAKPAYRAVVAALSEGLAGLPRVTM-- 386

Query: 250 CPEPIQLCYFL--PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDD 305
             +P + CY    P      P MA +F  A         ++ID       + +   P   
Sbjct: 387 --DPFEYCYNWTSPSGDVTLPKMAVHFAGAARLEPPGKSYVIDAAPGVKCIGLQEGPWPG 444

Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +++IG+  Q++  + +D+    L F +  C+
Sbjct: 445 -ISVIGNILQQEHLWEFDIKNRRLKFQRSRCT 475


>gi|9759039|dbj|BAB09366.1| aspartyl protease-like [Arabidopsis thaliana]
          Length = 478

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 160/379 (42%), Gaps = 67/379 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G+P K   + +DTGS +++                   +++D + SS+ + + C+
Sbjct: 76  TKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCE 135

Query: 43  HPDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
              C++      C   + C Y + Y D S + G    + I+   V G      +    +F
Sbjct: 136 DDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVF 195

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS 152
           GC  +  G      D A+ G++G  +   S ISQL  G   K+ FS+CL      G +  
Sbjct: 196 GCGKNQSG-QLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAV 254

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
             ++         P  + T  +  PN  +Y + LK + +D + ++ PP     + +G+GG
Sbjct: 255 GEVE--------SPVVKTTPIV--PNQVHYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 302

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
            IIDSG+ L Y   ++Y  L EK  +     Q  +L    E      F   T   FP + 
Sbjct: 303 TIIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVN 358

Query: 272 FYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDT 318
            +FED+            +LR   E+++   +++      +   D   V L+G     + 
Sbjct: 359 LHFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNK 411

Query: 319 RFVYDLNIDLLSFVKENCS 337
             VYDL  +++ +   NCS
Sbjct: 412 LVVYDLENEVIGWADHNCS 430


>gi|115472519|ref|NP_001059858.1| Os07g0533800 [Oryza sativa Japonica Group]
 gi|113611394|dbj|BAF21772.1| Os07g0533800 [Oryza sativa Japonica Group]
          Length = 458

 Score = 90.5 bits (223), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ L IGTP +    I DTGS L++                +++P  S +F+ + C    
Sbjct: 98  IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 155

Query: 46  CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                C  E             C Y   Y     T G    ET +       +    G  
Sbjct: 156 -ALNLCAAEARLAGATPPPGCACRYNQTYGT-GWTSGLQGSETFTFGSSPADQVRVPGIA 213

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGCSN +     D  +G+ AG++GL R  +S +SQL + +   FSYCL  P  + +  S+
Sbjct: 214 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 264

Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
            L               ++T F+  P+      +YYL+L  IS+    +  PP  F +  
Sbjct: 265 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRA 324

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
            G GG IIDSG+ +T    D  +K     V    +  +   S+    + LC+ LP +   
Sbjct: 325 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 382

Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
               PSM  +F   A++ +  EN  I+D    + L   +  D  ++ +G+ QQ++   +Y
Sbjct: 383 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 441

Query: 323 DLNIDLLSFVKENCS 337
           D+  + LSF    CS
Sbjct: 442 DVQKETLSFAPAKCS 456


>gi|30692930|ref|NP_198475.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|66792626|gb|AAY56415.1| At5g36260 [Arabidopsis thaliana]
 gi|332006680|gb|AED94063.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 482

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 90/379 (23%), Positives = 160/379 (42%), Gaps = 67/379 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G+P K   + +DTGS +++                   +++D + SS+ + + C+
Sbjct: 80  TKIKLGSPPKEYYVQVDTGSDILWVNCAPCPKCPVKTDLGIPLSLYDSKTSSTSKNVGCE 139

Query: 43  HPDCTYF----KC-VNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
              C++      C   + C Y + Y D S + G    + I+   V G      +    +F
Sbjct: 140 DDFCSFIMQSETCGAKKPCSYHVVYGDGSTSDGDFIKDNITLEQVTGNLRTAPLAQEVVF 199

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS 152
           GC  +  G      D A+ G++G  +   S ISQL  G   K+ FS+CL      G +  
Sbjct: 200 GCGKNQSG-QLGQTDSAVDGIMGFGQSNTSIISQLAAGGSTKRIFSHCLDNMNGGGIFAV 258

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
             ++         P  + T  +  PN  +Y + LK + +D + ++ PP     + +G+GG
Sbjct: 259 GEVE--------SPVVKTTPIV--PNQVHYNVILKGMDVDGDPIDLPPSL--ASTNGDGG 306

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
            IIDSG+ L Y   ++Y  L EK  +     Q  +L    E      F   T   FP + 
Sbjct: 307 TIIDSGTTLAYLPQNLYNSLIEKITAK----QQVKLHMVQETFACFSFTSNTDKAFPVVN 362

Query: 272 FYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDT 318
            +FED+            +LR   E+++   +++      +   D   V L+G     + 
Sbjct: 363 LHFEDSLKLSVYPHDYLFSLR---EDMYCFGWQSG----GMTTQDGADVILLGDLVLSNK 415

Query: 319 RFVYDLNIDLLSFVKENCS 337
             VYDL  +++ +   NCS
Sbjct: 416 LVVYDLENEVIGWADHNCS 434


>gi|22831049|dbj|BAC15912.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|50508281|dbj|BAD32130.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 453

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ L IGTP +    I DTGS L++                +++P  S +F+ + C    
Sbjct: 93  IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 150

Query: 46  CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                C  E             C Y   Y     T G    ET +       +    G  
Sbjct: 151 -ALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGIA 208

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGCSN +     D  +G+ AG++GL R  +S +SQL + +   FSYCL  P  + +  S+
Sbjct: 209 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 259

Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
            L               ++T F+  P+      +YYL+L  IS+    +  PP  F +  
Sbjct: 260 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGPAALPIPPGAFALRA 319

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
            G GG IIDSG+ +T    D  +K     V    +  +   S+    + LC+ LP +   
Sbjct: 320 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 377

Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
               PSM  +F   A++ +  EN  I+D    + L   +  D  ++ +G+ QQ++   +Y
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 436

Query: 323 DLNIDLLSFVKENCS 337
           D+  + LSF    CS
Sbjct: 437 DVQKETLSFAPAKCS 451


>gi|125558632|gb|EAZ04168.1| hypothetical protein OsI_26310 [Oryza sativa Indica Group]
 gi|125600539|gb|EAZ40115.1| hypothetical protein OsJ_24558 [Oryza sativa Japonica Group]
          Length = 453

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 99/375 (26%), Positives = 158/375 (42%), Gaps = 54/375 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           ++ L IGTP +    I DTGS L++                +++P  S +F+ + C    
Sbjct: 93  IMTLAIGTPPQSYPAIADTGSDLVWTQCAPCGERCFKQPSPLYNPSSSPTFRVLPCSS-- 150

Query: 46  CTYFKCVNEQ------------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                C  E             C Y   Y     T G    ET +       +    G  
Sbjct: 151 -ALNLCAAEARLAGATPPPGCACRYNQTYG-TGWTSGLQGSETFTFGSSPADQVRVPGIA 208

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGCSN +     D  +G+ AG++GL R  +S +SQL + +   FSYCL  P  + +  S+
Sbjct: 209 FGCSNAS----SDDWNGS-AGLVGLGRGGLSLVSQLAAGM---FSYCLT-PFQDTKSKST 259

Query: 154 YL--KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITV 206
            L               ++T F+  P+      +YYL+L  IS+    +  PP  F +  
Sbjct: 260 LLLGPAAAAAALNGTGVRSTPFVPSPSKPPMSTYYYLNLTGISVGAAALPIPPGAFALRA 319

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-- 264
            G GG IIDSG+ +T    D  +K     V    +  +   S+    + LC+ LP +   
Sbjct: 320 DGTGGLIIDSGTTITSL-VDAAYKRVRAAVRSLVKLPVTDGSNA-TGLDLCFALPSSSAP 377

Query: 265 -NRFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
               PSM  +F   A++ +  EN  I+D    + L   +  D  ++ +G+ QQ++   +Y
Sbjct: 378 PATLPSMTLHFGGGADMVLPVENYMILD-GGMWCLAMRSQTDGELSTLGNYQQQNLHILY 436

Query: 323 DLNIDLLSFVKENCS 337
           D+  + LSF    CS
Sbjct: 437 DVQKETLSFAPAKCS 451


>gi|195627138|gb|ACG35399.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 431

 Score = 90.1 bits (222), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP + +LL LDT +   ++             F P  SSS+  + C    C  
Sbjct: 80  VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139

Query: 49  FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           F+   C   Q        C ++  +AD S      + +T+ +     GK    G  FGC 
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
               G   +       G+LGL R  +S +SQ GS     FSYCL  P     Y S  L+ 
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSTYNGVFSYCL--PSYRSYYFSGSLRL 248

Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G     R  + + T  + +P+  + YY+++  +S+    +  P  +F    +   G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+V+T + + VY  L E+F     R Q+A  S       L  F    FN     A    
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
              L +DG     +  EN     +  P   L      Q            QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417

Query: 324 LNIDLLSFVKENC 336
           +    + F +E C
Sbjct: 418 VAGSRVGFAREPC 430


>gi|359476197|ref|XP_003631803.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 414

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/296 (29%), Positives = 135/296 (45%), Gaps = 35/296 (11%)

Query: 57  VYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
            Y M Y D+S + G    +T+++    E   +F    FGC  +N G   D   GA  G+L
Sbjct: 138 TYNMTYGDKSTSVGNYGCDTMTL----EPSDVFPKFQFGCGRNNEG---DFGSGA-DGML 189

Query: 117 GLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINH 176
           GL +  +S +SQ  S  KK FSYC    LP  +   S L FG +    + S + T  +N 
Sbjct: 190 GLGQGQLSTVSQTASKFKKVFSYC----LPEEDSIGSLL-FG-EKATSQSSLKFTSLVNG 243

Query: 177 P-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
           P       + +Y++ L DIS+ N+R+N P   F        G IIDSG+V+T      Y 
Sbjct: 244 PGTSGLEESGYYFVKLLDISVGNKRLNVPSSVF-----ASPGTIIDSGTVITCLPQRAYS 298

Query: 230 KLHEKFVSYFERFQLAQ-LSDCPEPIQLCYFLPETFN-RFPSMAFYF-EDANLRIDGENV 286
            L   F     ++ L+       + +  CY L    +   P +  +F E A++R++G+ V
Sbjct: 299 ALTAAFKKAMAKYPLSNGRRKKGDILDTCYNLSGRKDVLLPEIVLHFGEGADVRLNGKRV 358

Query: 287 FIIDYENHFFLLAVAPH-----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            I   +     LA A +     +  + +IG++QQ     +YD+    + F    CS
Sbjct: 359 -IWGNDASRLCLAFAGNSKSTMNSELTIIGNRQQVSLTVLYDIQGGRIGFGGNGCS 413


>gi|357116104|ref|XP_003559824.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 489

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/379 (24%), Positives = 148/379 (39%), Gaps = 70/379 (18%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF---KCVNEQCVYTMKYADQSVTKGFAAHETISV-IGK 82
           IFDP+ S  ++ +  D P C      +    +C + +++  +++  G+   +  +   G 
Sbjct: 117 IFDPKTSHRYKNVGHDDPLCKAPFTPRPTEHRCGFNIRFRAEAMATGYLGKDEFAFGAGS 176

Query: 83  GEGKAIFHGALFGCSNDNHGFDED---------------------------ARDG----- 110
           G       G +FGC++  +G++                             A DG     
Sbjct: 177 GSRTTNVDGLVFGCAHRINGWNNKDVLAGIPSLNRRPTSFVRQLSTHGGGGAVDGLVFGC 236

Query: 111 -----------ALAGVLGLSRVTISFISQL---GSIIKKRFSYCLV--IPLPNGEYTSSY 154
                       LAG+L L+R   SF+ QL   G     RFSYCLV     PN      +
Sbjct: 237 AHAINGWKNQDVLAGILSLNRRPTSFVRQLSVHGGGTTPRFSYCLVDHKKYPN---KHGF 293

Query: 155 LKFGTDMGYRRPSTQATKFINHPNN---FYYLSLKDISIDNERM-NFPPDTFDI-TVSGE 209
           L+FG D+     +         P+     YY+ L  +S+   ++    P  F     S  
Sbjct: 294 LRFGADVPDHSHAQSTALLYGEPDGGFGMYYVRLVGVSVAGRKLTGITPKMFQRDRRSRL 353

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCY--FLPETFNR 266
           GGC +D G+  T F    Y  L     ++     L +    P P  +LC     PE   +
Sbjct: 354 GGCYVDVGNPTTRFAEAPYDILEAGVAAHMASHGLHR---TPVPGHRLCVRGTSPEVMPK 410

Query: 267 FPSMAFYF---EDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
            PS+  +F   E A L I    +F  + +    ++  +     +  +IG  QQ DTRF +
Sbjct: 411 LPSITLHFAEDEAAGLEIKSRLLFATVKHAGADYVCFIVQRAPVTTVIGGHQQVDTRFTF 470

Query: 323 DLNIDLLSFVKENCSDDSA 341
           DL  + L F  E+C  D++
Sbjct: 471 DLEENRLFFAPEDCHGDAS 489


>gi|20160862|dbj|BAB89801.1| putative aspartic proteinase nepenthesin I [Oryza sativa Japonica
           Group]
          Length = 488

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/348 (27%), Positives = 142/348 (40%), Gaps = 44/348 (12%)

Query: 6   IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQKINCDHPDCTYFK-----CVNE 54
           IGTP + V   LD  S L++      A F+P +S++   + C    C  F          
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQTCGAGAS 165

Query: 55  QCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDNHG-FDEDARDG 110
           +C YT  Y       G  A  T  ++G      G     G +FGC   N G F       
Sbjct: 166 ECAYTYMY-------GGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLKNVGDFS------ 212

Query: 111 ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRRPSTQ 169
            ++GV+GL R  +S +SQL      RFSY      P+    T S++ FG D   +   T 
Sbjct: 213 GVSGVIGLGRGNLSLVSQL---QVDRFSYHFA---PDDSVDTQSFILFGDDATPQTSHTL 266

Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTYFHSD 226
           +T+ +    N   YY+ L  I +D + +  P  TFD+    G GG  +    ++T     
Sbjct: 267 STRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTVLEEA 326

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGE 284
            Y  L +   S   +  L  ++     + LCY        + PSMA  F   A + ++  
Sbjct: 327 AYKPLRQAVAS---KIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVMELELG 383

Query: 285 NVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
           N F +D       L + P      +++GS  Q  T  +YD+N   L F
Sbjct: 384 NYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 431


>gi|18400416|ref|NP_565559.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|20197296|gb|AAM15014.1| predicted protein [Arabidopsis thaliana]
 gi|330252412|gb|AEC07506.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 458

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/338 (27%), Positives = 144/338 (42%), Gaps = 56/338 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSAL----------------IYAIFDPRKSSSFQKINCDHP 44
           +V   +G P    L I+DTGS+L                I+ +F+P  SS+F + +CD  
Sbjct: 97  LVNFSVGQPPVPQLTIMDTGSSLLWIQCQPCKHCSSDHMIHPVFNPALSSTFVECSCDDR 156

Query: 45  DCTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
            C Y     C  + +CVY   Y   + +KG  A E ++         +     FGC  +N
Sbjct: 157 FCRYAPNGHCGSSNKCVYEQVYISGTGSKGVLAKERLTFTTPNGNTVVTQPIAFGCGYEN 216

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
                +  +    G+LGL     S   QLGS    +FSYC +  L N  Y  + L  G D
Sbjct: 217 G----EQLESHFTGILGLGAKPTSLAVQLGS----KFSYC-IGDLANKNYGYNQLVLGED 267

Query: 161 ---MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
              +G   P    T+     N+ YY++L+ IS+ + ++N  P  F        G I+DSG
Sbjct: 268 ADILGDPTPIEFETE-----NSIYYMNLEGISVGDTQLNIEPVVFK-RRGPRTGVILDSG 321

Query: 218 SVLTYFHSDVYWKLHEKFVSY----FERFQLAQLSDCPEPIQLCYF--LPETFNRFPSMA 271
           ++ T+     Y +L+ +  S      ERF             LCY   + E    FP + 
Sbjct: 322 TLYTWLADIAYRELYNEIKSILDPKLERFWFRDF--------LCYHGRVSEELIGFPVVT 373

Query: 272 FYFE-DANLRIDGENVFI-IDYENHF--FLLAVAPHDD 305
           F+F   A L ++  ++F  +   N F  F ++V P  +
Sbjct: 374 FHFAGGAELAMEATSMFYPLSEPNTFNVFCMSVKPTKE 411


>gi|296089645|emb|CBI39464.3| unnamed protein product [Vitis vinifera]
          Length = 477

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 98/392 (25%), Positives = 165/392 (42%), Gaps = 94/392 (23%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
            ++ IGTP++   + +DTGS +++                    ++D ++S + + ++CD
Sbjct: 100 AKIGIGTPARDYYVQVDTGSDIMWVNCIQCNECPKKSSLGMELTLYDIKESLTGKLVSCD 159

Query: 43  HPDCTYFK------CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              C          C+ N  C YT  YAD S + G+   + +    V G  E  +     
Sbjct: 160 QDFCYAINGGPPSYCIANMSCSYTEIYADGSSSFGYFVRDIVQYDQVSGDLETTSANGSV 219

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGCS    G  + + + AL G+LG  +   S ISQL S   ++K F++CL      G +
Sbjct: 220 IFGCSATQSG--DLSSEEALDGILGFGKSNTSMISQLASSGKVRKMFAHCLDGLNGGGIF 277

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
              ++         +P    T  +  PN  +Y +++K + +    +N P D FD  V  +
Sbjct: 278 AIGHI--------VQPKVNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDK 325

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRF 267
            G IIDSG+ L Y    VY +L  K  S+    ++  + D     Q  C+   E+  + F
Sbjct: 326 KGTIIDSGTTLAYLPEVVYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGF 380

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-----DLVALIGSQ----QQRDT 318
           P++ F+FE++                    L V PH+     D +  IG Q    Q RD 
Sbjct: 381 PAVTFHFENS------------------LYLKVHPHEYLFSYDGLWCIGWQNSGMQSRDR 422

Query: 319 R--------------FVYDLNIDLLSFVKENC 336
           R               +YDL   ++ + + NC
Sbjct: 423 RNITLLGDLALSNKLVLYDLENQVIGWTEYNC 454


>gi|190896584|gb|ACE96805.1| aspartyl protease [Populus tremula]
 gi|190896586|gb|ACE96806.1| aspartyl protease [Populus tremula]
 gi|190896588|gb|ACE96807.1| aspartyl protease [Populus tremula]
 gi|190896590|gb|ACE96808.1| aspartyl protease [Populus tremula]
 gi|190896592|gb|ACE96809.1| aspartyl protease [Populus tremula]
 gi|190896594|gb|ACE96810.1| aspartyl protease [Populus tremula]
 gi|190896596|gb|ACE96811.1| aspartyl protease [Populus tremula]
 gi|190896598|gb|ACE96812.1| aspartyl protease [Populus tremula]
 gi|190896600|gb|ACE96813.1| aspartyl protease [Populus tremula]
 gi|190896602|gb|ACE96814.1| aspartyl protease [Populus tremula]
 gi|190896604|gb|ACE96815.1| aspartyl protease [Populus tremula]
 gi|190896606|gb|ACE96816.1| aspartyl protease [Populus tremula]
 gi|190896610|gb|ACE96818.1| aspartyl protease [Populus tremula]
 gi|190896612|gb|ACE96819.1| aspartyl protease [Populus tremula]
 gi|190896614|gb|ACE96820.1| aspartyl protease [Populus tremula]
 gi|190896616|gb|ACE96821.1| aspartyl protease [Populus tremula]
 gi|190896618|gb|ACE96822.1| aspartyl protease [Populus tremula]
 gi|190896620|gb|ACE96823.1| aspartyl protease [Populus tremula]
 gi|190896622|gb|ACE96824.1| aspartyl protease [Populus tremula]
 gi|190896624|gb|ACE96825.1| aspartyl protease [Populus tremula]
 gi|190896626|gb|ACE96826.1| aspartyl protease [Populus tremula]
 gi|190896628|gb|ACE96827.1| aspartyl protease [Populus tremula]
 gi|190896630|gb|ACE96828.1| aspartyl protease [Populus tremula]
 gi|190896632|gb|ACE96829.1| aspartyl protease [Populus tremula]
 gi|190896634|gb|ACE96830.1| aspartyl protease [Populus tremula]
 gi|190896636|gb|ACE96831.1| aspartyl protease [Populus tremula]
 gi|190896638|gb|ACE96832.1| aspartyl protease [Populus tremula]
 gi|190896640|gb|ACE96833.1| aspartyl protease [Populus tremula]
 gi|190896642|gb|ACE96834.1| aspartyl protease [Populus tremula]
 gi|190896644|gb|ACE96835.1| aspartyl protease [Populus tremula]
 gi|190896646|gb|ACE96836.1| aspartyl protease [Populus tremula]
 gi|190896648|gb|ACE96837.1| aspartyl protease [Populus tremula]
 gi|190896650|gb|ACE96838.1| aspartyl protease [Populus tremula]
 gi|190896652|gb|ACE96839.1| aspartyl protease [Populus tremula]
 gi|190896654|gb|ACE96840.1| aspartyl protease [Populus tremula]
 gi|190896656|gb|ACE96841.1| aspartyl protease [Populus tremula]
 gi|190896658|gb|ACE96842.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/307 (26%), Positives = 131/307 (42%), Gaps = 39/307 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
           +VR+ +GTP + + ++LDT +   +             F P  S++   ++C    C+  
Sbjct: 46  VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQV 105

Query: 49  --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             F C    +  C++   Y   S        + I++        +  G  FGC N   G 
Sbjct: 106 RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITL-----ANDVIPGFTFGCINAVSGG 160

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +     G+LGL R  IS ISQ G++    FSYCL  P     Y S  LK G  +G 
Sbjct: 161 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 212

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P+  + YY++L  +S+   ++  P +      +   G IIDSG+V+T
Sbjct: 213 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY+ + ++F     R Q+            C F        P++  +FE  NL +
Sbjct: 272 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAATNEAEAPAVTLHFEGLNLVL 325

Query: 282 DGENVFI 288
             EN  I
Sbjct: 326 PMENSLI 332


>gi|7715602|gb|AAF68120.1|AC010793_15 F20B17.14 [Arabidopsis thaliana]
 gi|12324588|gb|AAG52249.1|AC011717_17 putative aspartyl protease; 105611-106921 [Arabidopsis thaliana]
          Length = 436

 Score = 89.7 bits (221), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
           K + LI+DTGS L +               ++DP  SSS++ + C+   C          
Sbjct: 96  KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 155

Query: 50  -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                    V   C Y + Y D S T+G  A E+I +     G       +FGC  +N G
Sbjct: 156 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 210

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               +            R ++S +SQ        FSYCL   L +G   S  L FG D  
Sbjct: 211 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 262

Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               ST    T  + +P   +FY L+L   SI    +         + S   G +IDSG+
Sbjct: 263 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 314

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
           V+T     +Y  +  +F+  F  F  A        +  C+ L    +   P +   F+ +
Sbjct: 315 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 371

Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           A L +D   VF  +  +     LA+A   +++ V +IG+ QQ++ R +YD   + L  V 
Sbjct: 372 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVG 431

Query: 334 ENC 336
           ENC
Sbjct: 432 ENC 434


>gi|414878073|tpg|DAA55204.1| TPA: hypothetical protein ZEAMMB73_344109 [Zea mays]
          Length = 440

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/375 (25%), Positives = 154/375 (41%), Gaps = 54/375 (14%)

Query: 5   FIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDC-- 46
            IG P +    I+DTGS LI+                + +DP +S + + + C+   C  
Sbjct: 76  LIGDPPQQAEAIIDTGSNLIWTQCSTCQPAGCFSQNLSFYDPSRSRTARPVACNDTACAL 135

Query: 47  -TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
            +  +C   N+ C     Y    V  G    E  +   + E  ++     FGC       
Sbjct: 136 GSETRCARDNKACAVLTAYG-AGVIGGVLGTEAFTFQPQSENVSL----AFGCIAATR-L 189

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              + DGA +G++GL R  +S +SQLG     +FSYCL  P  +    +S L  G   G 
Sbjct: 190 TPGSLDGA-SGIIGLGRGNLSLVSQLG---DNKFSYCLT-PYFSQSTNTSRLFVGASAGL 244

Query: 164 RRPSTQATK--FINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITVSGEG---GCI 213
                 AT   F+ +P+      FYYL L  I++ + ++  P   FD+     G   G +
Sbjct: 245 SSGGAPATSVPFLKNPDVDPFSTFYYLPLTGITVGDAKLAVPEAAFDLRQVATGLWAGTL 304

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMA 271
           IDSGS  T      Y  L ++ V       +   +   E + LC  +   +     P + 
Sbjct: 305 IDSGSPFTSLVDVAYQALRDELVQQLGASIVPPPAGA-EGLDLCAAVAHGDVGKLVPPLV 363

Query: 272 FYFED--ANLRIDGENVF-IIDYENHFFLL--AVAPHDDL----VALIGSQQQRDTRFVY 322
            +F     ++ +  EN +  +D      ++  +  P+  L      +IG+  Q+D   +Y
Sbjct: 364 LHFGSGGGDVAVPPENYWGPVDDSTACMVVFSSGGPNSTLPMNETTIIGNYMQQDMHLLY 423

Query: 323 DLNIDLLSFVKENCS 337
           DL   +LSF   +CS
Sbjct: 424 DLEKGMLSFQPADCS 438


>gi|242117573|dbj|BAH80056.1| hypothetical protein [Oryza sativa Indica Group]
          Length = 469

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 81/328 (24%), Positives = 143/328 (43%), Gaps = 43/328 (13%)

Query: 28  FDPRKSSSFQKINCDHPDCTYFKCVNEQCV----------------YTMKYADQSV-TKG 70
           F P  S++F  + C    C     + E C                 Y++ Y   +  T G
Sbjct: 134 FRPNGSATFSPLPCSSDMC--LPVLRETCGRAGAAANATAGARCDSYSLTYGGSAANTSG 191

Query: 71  FAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG 130
           + A +T +      G     G +FGCS+ ++G    A     +GV+G+ R  +S ISQL 
Sbjct: 192 YLATDTFTF-----GATAVPGVVFGCSDASYGDFAGA-----SGVIGIGRGNLSLISQL- 240

Query: 131 SIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKD 187
                +FSY L+ P    + ++ S ++FG D   +    ++T  ++     +FYY++L  
Sbjct: 241 --QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPKTKRGRSTPLLSSTLYPDFYYVNLTG 298

Query: 188 ISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
           + +D  R++  P  TFD+  +G GG I+ S + +TY     Y  +     S   R  L  
Sbjct: 299 VRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPVTYLEQAAYDVVRAAVAS---RIGLPA 355

Query: 247 LSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH 303
           ++      + LCY        + P +   F+  A++ +   N F ID +     L + P 
Sbjct: 356 VNGSAALELDLCYNASSMAKVKVPKLTLVFDGGADMDLSAANYFYIDNDTGLECLTMLPS 415

Query: 304 DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
               +++G+  Q  T  +YD++   L+F
Sbjct: 416 QG-GSVLGTLLQTGTNMIYDVDAGRLTF 442


>gi|357148754|ref|XP_003574882.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 164/374 (43%), Gaps = 63/374 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
           IGTP K   + +DTGS +++                    ++DP+ SSS   ++CD   C
Sbjct: 89  IGTPPKQYHVQVDTGSDILWVNCISCNKCPRKSDLGIDLRLYDPKGSSSGSTVSCDQKFC 148

Query: 47  --TYFK----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFGC 96
             TY      C  N  C Y++ Y D S T G+   +++    V G G+ +      +FGC
Sbjct: 149 AATYGGKLPGCAKNIPCEYSVMYGDGSSTTGYFVSDSLQYNQVSGDGQTRHANASVIFGC 208

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
                G D  + + AL G++G  +   S +SQL +   +KK FS+CL      G +    
Sbjct: 209 GA-QQGGDLGSTNQALDGIIGFGQSNTSMLSQLAAAGEVKKIFSHCLDTIKGGGIFAIGD 267

Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GC 212
           +         +P  ++T  + + P+  Y ++L+ I++    +  P   F+   +GE  G 
Sbjct: 268 V--------VQPKVKSTPLVPDMPH--YNVNLESINVGGTTLQLPSHMFE---TGEKKGT 314

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           IIDSG+ LTY    VY    +   + F +          + + + YF     + FP + F
Sbjct: 315 IIDSGTTLTYLPELVY---KDVLAAVFAKHPDTTFHSVQDFLCIQYF-QSVDDGFPKITF 370

Query: 273 YFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           +FE D  L +        +G+N++   ++N    L      D+V L+G     +   VYD
Sbjct: 371 HFEDDLGLNVYPHDYFFQNGDNLYCFGFQNGG--LQSKDGKDMV-LLGDLVLSNKVVVYD 427

Query: 324 LNIDLLSFVKENCS 337
           L   ++ +   NCS
Sbjct: 428 LENQVVGWTDYNCS 441


>gi|125528511|gb|EAY76625.1| hypothetical protein OsI_04577 [Oryza sativa Indica Group]
          Length = 492

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 94/352 (26%), Positives = 143/352 (40%), Gaps = 48/352 (13%)

Query: 6   IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQKINCDHPDCTYFK--------- 50
           IGTP + V   LD  S L++      A F+P +S++   + C    C  F          
Sbjct: 106 IGTPPQQVSGALDISSDLVWTACGATAPFNPVRSTTVADVPCTDDACQQFAPQTCGAGAG 165

Query: 51  CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE---GKAIFHGALFGCSNDNHG-FDED 106
             + +C YT  Y       G  A  T  ++G      G     G +FGC   N G F   
Sbjct: 166 AGSSECAYTYMY-------GGGAANTTGLLGTEAFTFGDTRIDGVVFGCGLQNVGDFS-- 216

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFGTDMGYRR 165
                ++GV+GL R  +S +SQL      RFSY      P+    T S++ FG D   + 
Sbjct: 217 ----GVSGVIGLGRGNLSLVSQL---QVDRFSYHFA---PDDSVDTQSFILFGDDATPQT 266

Query: 166 PSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDI-TVSGEGGCIIDSGSVLTY 222
             T +T+ +    N   YY+ L  I +D + +  P  TFD+    G GG  +    ++T 
Sbjct: 267 SHTLSTRLLASDANPSLYYVELAGIQVDGKDLAIPSGTFDLRNKDGSGGVFLSITDLVTV 326

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLR 280
                Y  L +   S   +  L  ++     + LCY        + PSMA  F   A + 
Sbjct: 327 LEEAAYKPLRQAVAS---KIGLPAVNGSALGLDLCYTGESLAKAKVPSMALVFAGGAVME 383

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSF 331
           ++  N F +D       L + P      +++GS  Q  T  +YD+N   L F
Sbjct: 384 LELGNYFYMDSTTGLACLTILPSSAGDGSVLGSLIQVGTHMMYDINGSKLVF 435


>gi|190896608|gb|ACE96817.1| aspartyl protease [Populus tremula]
          Length = 339

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/307 (26%), Positives = 131/307 (42%), Gaps = 39/307 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTY- 48
           +VR+ +GTP + + ++LDT +   +             F P  S++   ++C    C+  
Sbjct: 46  VVRVKLGTPGQQMFMVLDTSNDAAWVPCSGCTGCSSTTFLPNASTTLGSLDCSEAQCSQV 105

Query: 49  --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             F C    +  C++   Y   S        + I++        +  G  FGC N   G 
Sbjct: 106 RGFSCPATGSSACLFNQSYGGDSSLAATLVQDAITL-----ANDVIPGFTFGCINAVSGG 160

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +     G+LGL R  IS ISQ G++    FSYCL  P     Y S  LK G  +G 
Sbjct: 161 SIPPQ-----GLLGLGRGPISLISQAGAMYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 212

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P+  + YY++L  +S+   ++  P +      +   G IIDSG+V+T
Sbjct: 213 PK-SIRTTPLLRNPHRPSLYYVNLTGVSVGRIKVPIPSEQLVFDPNTGAGTIIDSGTVIT 271

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY+ + ++F     R Q+            C F        P++  +FE  NL +
Sbjct: 272 RFVQPVYFAIRDEF-----RKQVNGPISSLGAFDTC-FAETNEAEAPAVTLHFEGLNLVL 325

Query: 282 DGENVFI 288
             EN  I
Sbjct: 326 PMENSLI 332


>gi|54290731|dbj|BAD62401.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
          Length = 521

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/374 (25%), Positives = 145/374 (38%), Gaps = 62/374 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           +V L IGTP+    +++DTGS L +                 +FDP  SSS+  + CD  
Sbjct: 172 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 231

Query: 45  DCTYFK-------CVNEQ------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
            C           C          C Y ++Y +++ T G  + ET++ +  G   A F  
Sbjct: 232 ACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG- 289

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGC +  HG  E        G+LGL     S +SQ  S     FSYC    LP     
Sbjct: 290 --FGCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGG 338

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDI 204
           + +L  G        ST A+     P         FY ++L  IS+    +  PP  F  
Sbjct: 339 AGFLTLGAPPNSSS-STAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 395

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
                 G +IDSG+V+T   +  Y  L   F S    ++L   S+    +  CY F    
Sbjct: 396 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHA 450

Query: 264 FNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
               P+++  F   A + +      ++D        A A  D+ + +IG+  QR    +Y
Sbjct: 451 NVTVPTISLTFSGGATIDLAAPAGVLVD---GCLAFAGAGTDNAIGIIGNVNQRTFEVLY 507

Query: 323 DLNIDLLSFVKENC 336
           D     + F    C
Sbjct: 508 DSGKGTVGFRAGAC 521


>gi|222635451|gb|EEE65583.1| hypothetical protein OsJ_21095 [Oryza sativa Japonica Group]
          Length = 441

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 96/374 (25%), Positives = 145/374 (38%), Gaps = 62/374 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V L IGTP+    +++DTGS L +                 +FDP  SSS+  + CD  
Sbjct: 92  VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCGAGECYAQKDPLFDPSSSSSYASVPCDSD 151

Query: 45  DCTYFK-------CVNEQ------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
            C           C          C Y ++Y +++ T G  + ET++ +  G   A F  
Sbjct: 152 ACRKLAAGAYGHGCTGVSGGAAALCEYGIEYGNRATTTGVYSTETLT-LKPGVVVADFG- 209

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
             FGC +  HG  E        G+LGL     S +SQ  S     FSYC    LP     
Sbjct: 210 --FGCGDHQHGPYEK-----FDGLLGLGGAPESLVSQTSSQFGGPFSYC----LPPTSGG 258

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDI 204
           + +L  G        ST A+     P         FY ++L  IS+    +  PP  F  
Sbjct: 259 AGFLTLGAPPNSSS-STAASGLSFTPMRRLPSVPTFYIVTLTGISVGGAPLAIPPSAF-- 315

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPET 263
                 G +IDSG+V+T   +  Y  L   F S    ++L   S+    +  CY F    
Sbjct: 316 ----SSGMVIDSGTVITGLPATAYAALRSAFRSAMSEYRLLPPSNG-GVLDTCYDFTGHA 370

Query: 264 FNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
               P+++  F   A + +      ++D        A A  D+ + +IG+  QR    +Y
Sbjct: 371 NVTVPTISLTFSGGATIDLAAPAGVLVD---GCLAFAGAGTDNAIGIIGNVNQRTFEVLY 427

Query: 323 DLNIDLLSFVKENC 336
           D     + F    C
Sbjct: 428 DSGKGTVGFRAGAC 441


>gi|413951979|gb|AFW84628.1| putative aspartic protease family protein [Zea mays]
          Length = 435

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 92/379 (24%), Positives = 156/379 (41%), Gaps = 61/379 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------FDPRKSSSFQKINC------- 41
           V L +GTP + V ++LDTGS L + +             F PR S++F  + C       
Sbjct: 63  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRAAAAAADSFRPRASATFAAVPCGSARCSS 122

Query: 42  -DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
            D P        + +C  ++ YAD S + G  A +  +V     G A    + FGC +  
Sbjct: 123 RDLPAPPSCDAASRRCRVSLSYADGSASDGALATDVFAV-----GDAPPLRSAFGCMSAA 177

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-T 159
           +    DA   A AG+LG++R  +SF++Q  +   +RFSYC+     +    +  L  G +
Sbjct: 178 YDSSPDAV--ATAGLLGMNRGALSFVTQAST---RRFSYCI-----SDRDDAGVLLLGHS 227

Query: 160 DMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           D+ +      P  Q T  + + +   Y + L  I +  + +  PP       +G G  ++
Sbjct: 228 DLPFLPLNYTPLYQPTPPLPYFDRVAYSVQLLGIRVGGKPLPIPPSVLAPDHTGAGQTMV 287

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP----EPIQLCYFLPE----TFNR 266
           DSG+  T+   D Y  +  +F+   +   L  L D      E    C+ +P+       R
Sbjct: 288 DSGTQFTFLLGDAYSAVKAEFLKQTKPL-LPALEDPSFAFQEAFDTCFRVPKGRPPPSAR 346

Query: 267 FPSMAFYFEDANLRIDGEN-VFIIDYENH----FFLLAVAPHDDLVAL----IGSQQQRD 317
            P +   F  A + + G+  ++ +  E       + L    + D+V L    IG   Q +
Sbjct: 347 LPPVTLLFNGAQMSVAGDRLLYKVPGERRGADGVWCLTFG-NADMVPLTAYVIGHHHQMN 405

Query: 318 TRFVYDLNIDLLSFVKENC 336
               YDL    +      C
Sbjct: 406 LWVEYDLERGRVGLAPVKC 424


>gi|255538124|ref|XP_002510127.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223550828|gb|EEF52314.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 641

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 91/368 (24%), Positives = 148/368 (40%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RLFIGTP +   LI+DTGS + Y                F P  SS+++ + C+ P C 
Sbjct: 90  TRLFIGTPPQEFALIVDTGSTVTYVPCSTCEQCGKHQDPRFQPESSSTYKPMQCN-PSC- 147

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--F 103
              C +E  QC Y  +YA+ S + G  A + +S     E +     A+FGC     G  F
Sbjct: 148 --NCDDEGKQCTYERRYAEMSSSSGLLAEDVLSF--GNESELTPQRAIFGCETVETGELF 203

Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTD 160
            + A      G++GL R  +S + QL    ++   FS C   + +  G      +    D
Sbjct: 204 SQRAD-----GIMGLGRGPLSVVDQLVIKEVVGNSFSLCYGGMDVVGGAMVLGNIPPPPD 258

Query: 161 M--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
           M   +  P   A         +Y + LK++ +  +R+   P  FD    G+ G ++DSG+
Sbjct: 259 MVFAHSDPYRSA---------YYNIELKELHVAGKRLKLNPRVFD----GKHGTVLDSGT 305

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFY 273
              Y   + +    +  +   +  +     D P    +C+      + +    FP +   
Sbjct: 306 TYAYLPEEAFVAFKDAIIKEIKFLKQIHGPD-PSYNDICFSGAGRDVSQLSKIFPEVNMV 364

Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           F +   L +  EN           + L       D   L+G    R+T   YD + D + 
Sbjct: 365 FGNGQKLSLSPENYLFRHTKVSGAYCLGIFQNGKDPTTLLGGIVVRNTLVTYDRDNDKIG 424

Query: 331 FVKENCSD 338
           F K NCS+
Sbjct: 425 FWKTNCSE 432


>gi|226491934|ref|NP_001140743.1| uncharacterized protein LOC100272818 [Zea mays]
 gi|194700872|gb|ACF84520.1| unknown [Zea mays]
          Length = 351

 Score = 89.4 bits (220), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 80/325 (24%), Positives = 130/325 (40%), Gaps = 49/325 (15%)

Query: 28  FDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
           +DP +S +    +C  P CT        C N QC Y ++Y D S T G    + +++   
Sbjct: 60  YDPSRSPTSAAFSCSSPTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTL--- 116

Query: 83  GEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
             G A+  G  FGCS+   G FD  A     AG++ L     S +SQ  S     FSYC 
Sbjct: 117 DAGNAV-SGFKFGCSHAEQGSFDARA-----AGIMALGGGPESLLSQTASRYGNAFSYC- 169

Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNER 194
              +P     S +   G       P   +++++  P         FY + L+ I++  +R
Sbjct: 170 ---IPATASDSGFFTLGV------PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQR 220

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP- 253
           +   P  F        G ++DS + +T      Y  L   F S    ++ A     P+  
Sbjct: 221 LGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRAAFRSSMTMYRSAP----PKGY 270

Query: 254 IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
           +  CY      N R P ++  F+ +A L +D   +      N          D +  ++G
Sbjct: 271 LDTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLG 326

Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
           S QQ+    +YD+    + F +  C
Sbjct: 327 SVQQQTIEVLYDVGGGAVGFRQGAC 351


>gi|356546378|ref|XP_003541603.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 439

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/360 (26%), Positives = 167/360 (46%), Gaps = 37/360 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++R  +G+P   VL I+DTGS +++               IFDP KS +++ + C    C
Sbjct: 92  LMRYSVGSPPFQVLGIVDTGSDILWLQCEPCEDCYKQTTPIFDPSKSKTYKTLPCSSNTC 151

Query: 47  TYFK---CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNH 101
              +   C ++  C Y++ Y D S + G  + ET++ +G  +G ++ F   + GC ++N 
Sbjct: 152 ESLRNTACSSDNVCEYSIDYGDGSHSDGDLSVETLT-LGSTDGSSVHFPKTVIGCGHNNG 210

Query: 102 G-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G F E+       G   +S ++    S  G     +FSYCL  P+ +   +SS L FG  
Sbjct: 211 GTFQEEGSGIVGLGGGPVSLISQLSSSIGG-----KFSYCLA-PIFSESNSSSKLNFGDA 264

Query: 161 MGYRRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
                  T +T     P N   FY+L+L+  S+ + R+ F   +   + SG+G  IIDSG
Sbjct: 265 AVVSGRGTVSTPL--DPLNGQVFYFLTLEAFSVGDNRIEFSGSSSSGSGSGDGNIIIDSG 322

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           + LT    + Y  L E  VS  +  +L +  D  + + LCY         P +  +F+ A
Sbjct: 323 TTLTLLPQEDYLNL-ESAVS--DVIKLERARDPSKLLSLCYKTTSDELDLPVITAHFKGA 379

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++ ++  + F +  E      A      + A+ G+  Q++    YDL    +SF   +C+
Sbjct: 380 DVELNPISTF-VPVEKGVVCFAFI-SSKIGAIFGNLAQQNLLVGYDLVKKTVSFKPTDCT 437


>gi|238010910|gb|ACR36490.1| unknown [Zea mays]
 gi|413942664|gb|AFW75313.1| hypothetical protein ZEAMMB73_520329 [Zea mays]
          Length = 481

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 128/324 (39%), Gaps = 47/324 (14%)

Query: 28  FDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
           +DP +S S    +C  P CT        C N QC Y ++Y D S T G    + +++   
Sbjct: 190 YDPSRSPSSAPFSCSSPTCTALGPYANGCANNQCQYLVRYPDGSSTSGAYIADLLTL--- 246

Query: 83  GEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
             G A+  G  FGCS+   G FD  A     AG++ L     S +SQ  S     FSYC 
Sbjct: 247 DAGNAV-SGFKFGCSHAEQGSFDARA-----AGIMALGGGPESLLSQTASRYGNAFSYC- 299

Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNER 194
              +P     S +   G       P   +++++  P         FY + L+ I++  +R
Sbjct: 300 ---IPATASDSGFFTLGV------PRRASSRYVVTPMVRFRQAATFYGVLLRTITVGGQR 350

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
           +   P  F        G ++DS + +T      Y  L   F S    ++ A        +
Sbjct: 351 LGVAPAVF------AAGSVLDSRTAITRLPPTAYQALRSAFRSSMTMYRSAPPKGY---L 401

Query: 255 QLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
             CY      N R P ++  F+ +A L +D   +      N          D +  ++GS
Sbjct: 402 DTCYDFTGVVNIRLPKISLVFDRNAVLPLDPSGILF----NDCLAFTSNADDRMPGVLGS 457

Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
            QQ+    +YD+    + F +  C
Sbjct: 458 VQQQTIEVLYDVGGGAVGFRQGAC 481


>gi|413946455|gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays]
          Length = 480

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/378 (25%), Positives = 152/378 (40%), Gaps = 54/378 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDC 46
           VR  +GTP++  +L+ DTGS L +                +F    S S+  I C    C
Sbjct: 114 VRFRVGTPAQPFVLVADTGSDLTWVKCSGAGDGTGDAPRRVFRAAASRSWAPIACSSDTC 173

Query: 47  TYF------KCVN--EQCVYTMKYADQSVTKGFAAHE--TISVIGK-----GEGKAIFHG 91
           T +       C +    C Y  +Y D S  +G    +  TI++ G      G  +A   G
Sbjct: 174 TSYVPFSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATIALSGSESRDGGGRRAKLQG 233

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
            + GC+    G    + DG    VL L    ISF S+  +    RFSYCLV  L     T
Sbjct: 234 VVLGCTASYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAPRNAT 289

Query: 152 SSYLKF---GTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDT 201
            SYL F   G + G    S+ ++     P       + FY +++  + +  E ++ P D 
Sbjct: 290 -SYLTFGPPGPEGGAAASSSSSSAAARTPLLLDRRMSPFYAVAVDAVHVAGEALDIPADV 348

Query: 202 FDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQ-LAQLSDCPEPIQLCYFL 260
           +D+     GG I+DSG+ LT   +  Y        +  ER   L ++S   +P + CY  
Sbjct: 349 WDVARG--GGAILDSGTSLTVLATPAY---RAVVAALSERLAGLPRVSM--DPFEYCYNW 401

Query: 261 PETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTR 319
                  P +   F  +         +++D       + V       V++IG+  Q+D  
Sbjct: 402 TAAALEIPGLEVRFAGSARLQPPAKSYVVDAAPGVKCIGVQEGAWPGVSVIGNILQQDHL 461

Query: 320 FVYDLNIDLLSFVKENCS 337
           + +DL    L F    C+
Sbjct: 462 WEFDLRDRWLRFKHTRCA 479


>gi|195607464|gb|ACG25562.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 478

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 141/367 (38%), Gaps = 60/367 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
           +V   +GTP     + +DTGS L +                  +FDP +SSS+  + C  
Sbjct: 141 VVTASLGTPGVAQTMEVDTGSDLSWVQCKPCSAAPSCYSQKDPLFDPAQSSSYAAVPCGG 200

Query: 44  PDCTYFKCVNEQCV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           P C                 Y + Y D S T G  + +T+++       +   G  FGC 
Sbjct: 201 PVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----SASSAVQGFFFGCG 256

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
           +   G         + G+LGL R   S + Q        FSYC    LP    T+ YL  
Sbjct: 257 HAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGVFSYC----LPTKPSTAGYLTL 307

Query: 158 G-TDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           G        P    T+ +  PN   +Y + L  IS+  ++++ P   F       GG ++
Sbjct: 308 GLGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAF------AGGTVV 361

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
           D+G+V+T      Y  L   F S    +     +     +  CY F        P++A  
Sbjct: 362 DTGTVITRLPPTAYAALRSAFRSGMASYGY-PTAPSNGILDTCYNFAGYGTVTLPNVALT 420

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLS- 330
           F      + G +  +      F  LA AP   D  +A++G+ QQR     +++ ID  S 
Sbjct: 421 FGSGATVMLGADGIL-----SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRIDGTSV 471

Query: 331 -FVKENC 336
            F   +C
Sbjct: 472 GFKPSSC 478


>gi|18412482|ref|NP_565219.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|19699359|gb|AAL91289.1| At1g79720/F19K16_30 [Arabidopsis thaliana]
 gi|26450464|dbj|BAC42346.1| unknown protein [Arabidopsis thaliana]
 gi|115646741|gb|ABJ17101.1| At1g79720 [Arabidopsis thaliana]
 gi|332198170|gb|AEE36291.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 484

 Score = 89.4 bits (220), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
           K + LI+DTGS L +               ++DP  SSS++ + C+   C          
Sbjct: 144 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 203

Query: 50  -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                    V   C Y + Y D S T+G  A E+I +     G       +FGC  +N G
Sbjct: 204 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 258

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               +            R ++S +SQ        FSYCL   L +G   S  L FG D  
Sbjct: 259 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 310

Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               ST    T  + +P   +FY L+L   SI    +         + S   G +IDSG+
Sbjct: 311 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 362

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
           V+T     +Y  +  +F+  F  F  A        +  C+ L    +   P +   F+ +
Sbjct: 363 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 419

Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           A L +D   VF  +  +     LA+A   +++ V +IG+ QQ++ R +YD   + L  V 
Sbjct: 420 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDTTQERLGIVG 479

Query: 334 ENC 336
           ENC
Sbjct: 480 ENC 482


>gi|356565521|ref|XP_003550988.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 634

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/373 (25%), Positives = 150/373 (40%), Gaps = 66/373 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  SS++Q + C   DC 
Sbjct: 86  TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTI-DC- 143

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C ++  QCVY  +YA+ S + G    + IS   + E       A+FGC N   G   
Sbjct: 144 --NCDSDRMQCVYERQYAEMSTSSGVLGEDLISFGNQSELAP--QRAVFGCENVETG--- 196

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
           D       G++GL R  +S + QL   ++I   FS C         Y    +  G  +  
Sbjct: 197 DLYSQHADGIMGLGRGDLSIMDQLVDKNVISDSFSLC---------YGGMDVGGGAMVLG 247

Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           G   PS  A  + +   + YY + LK+I +  +R+    + FD    G+ G ++DSG+  
Sbjct: 248 GISPPSDMAFAYSDPVRSPYYNIDLKEIHVAGKRLPLNANVFD----GKHGTVLDSGTTY 303

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
            Y     +    +  V   +  +     D P    +C+      + +    FP +   FE
Sbjct: 304 AYLPEAAFLAFKDAIVKELQSLKKISGPD-PNYNDICFSGAGIDVSQLSKSFPVVDMVFE 362

Query: 276 DANLRIDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
           +          + +  EN+ F          L      +D   L+G    R+T  VYD  
Sbjct: 363 NG-------QKYTLSPENYMFRHSKVRGAYCLGVFQNGNDQTTLLGGIIVRNTLVVYDRE 415

Query: 326 IDLLSFVKENCSD 338
              + F K NC++
Sbjct: 416 QTKIGFWKTNCAE 428


>gi|356539555|ref|XP_003538263.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 438

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 83/358 (23%), Positives = 147/358 (41%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP + +LL +DT +   +            +F P KS++F+ ++C  P+C   
Sbjct: 98  IVRAKIGTPPQTLLLAIDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPECNKV 157

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C + + Y   S+       +T+++           G  FGC     G    
Sbjct: 158 PSPSCGTSACTFNLTYGSSSIAANVV-QDTVTL-----ATDPIPGYTFGCVAKTTGPSTP 211

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +S +SQ  ++ +  FSYCL  P       S  L+ G      R 
Sbjct: 212 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPIR- 263

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L  I +  + ++ PP       +   G + DSG+V T   
Sbjct: 264 -IKYTPLLKNPRRSSLYYVNLFAIRVGRKIVDIPPAALAFNAATGAGTVFDSGTVFTRLV 322

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           + VY  + ++F         A L+         CY +P      P++ F F   N+ +  
Sbjct: 323 APVYTAVRDEFRRRVAMAAKANLTVTSLGGFDTCYTVPIV---APTITFMFSGMNVTLPQ 379

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+    L   +E C+
Sbjct: 380 DNILIHSTAGSTSCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 437


>gi|449511696|ref|XP_004164029.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 639

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 150/368 (40%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IG+P +   LI+DTGS + Y                F P  SS++Q + C+  DC 
Sbjct: 91  TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADC- 148

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C     QC Y  +YA+ S + G  A + +S  GK E + +   A+FGC     G   
Sbjct: 149 --NCDENGVQCTYERRYAEMSTSSGVLAEDVMS-FGK-ESELVPQRAVFGCETMESG--- 201

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
           D       G++GL R T+S + QL    ++   FS C         Y    +  G  +  
Sbjct: 202 DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLC---------YGGMDVGGGAMVLG 252

Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           G   P        +   + YY + LK+I +  + +   P TFD    G+ G I+DSG+  
Sbjct: 253 GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTY 308

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
            YF    Y+   +  +       L Q+S  P+P    +C+      + E    FP +   
Sbjct: 309 AYFPEKAYYAFKDAIMKKISF--LKQISG-PDPNFKDICFSGAGRDVTELPKVFPEVDMV 365

Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           F +   + +  EN           + L      +D   L+G    R+T   Y+     + 
Sbjct: 366 FANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIG 425

Query: 331 FVKENCSD 338
           F K NCS+
Sbjct: 426 FWKTNCSE 433


>gi|194698750|gb|ACF83459.1| unknown [Zea mays]
 gi|194703964|gb|ACF86066.1| unknown [Zea mays]
 gi|219886221|gb|ACL53485.1| unknown [Zea mays]
 gi|219886359|gb|ACL53554.1| unknown [Zea mays]
 gi|223950085|gb|ACN29126.1| unknown [Zea mays]
 gi|414865218|tpg|DAA43775.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 431

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP + +LL LDT +   ++             F P  SSS+  + C    C  
Sbjct: 80  VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139

Query: 49  FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           F+   C   Q        C ++  +AD S      + +T+ +     GK    G  FGC 
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
               G   +       G+LGL R  +S +SQ GS     FSYCL  P     Y S  L+ 
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSRYNGVFSYCL--PSYRSYYFSGSLRL 248

Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G     R  + + T  + +P+  + YY+++  +S+    +  P  +F    +   G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+V+T + + VY  L E+F     R Q+A  S       L  F    FN     A    
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
              L +DG     +  EN     +  P   L      Q            QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417

Query: 324 LNIDLLSFVKENC 336
           +    + F +E C
Sbjct: 418 VAGSRVGFAREPC 430


>gi|212722554|ref|NP_001131154.1| uncharacterized protein LOC100192462 precursor [Zea mays]
 gi|194690728|gb|ACF79448.1| unknown [Zea mays]
          Length = 431

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 97/373 (26%), Positives = 148/373 (39%), Gaps = 59/373 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA------------IFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP + +LL LDT +   ++             F P  SSS+  + C    C  
Sbjct: 80  VVRAGLGTPVQQLLLALDTSADATWSHCAPCDTCPAGSRFIPASSSSYASLPCASDWCPL 139

Query: 49  FK---CVNEQ--------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           F+   C   Q        C ++  +AD S      + +T+ +     GK    G  FGC 
Sbjct: 140 FEGQPCPANQDASAPLPACAFSKPFADTSFQASLGS-DTLRL-----GKDAIAGYAFGCV 193

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
               G   +       G+LGL R  +S +SQ GS     FSYCL  P     Y S  L+ 
Sbjct: 194 GAVAGPTTNLPK---QGLLGLGRGPMSLLSQTGSRYNGVFSYCL--PSYRSYYFSGSLRL 248

Query: 158 GTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G     R  + + T  + +P+  + YY+++  +S+    +  P  +F    +   G +ID
Sbjct: 249 GAAGQPR--NVRYTPLLTNPHRPSLYYVNVTGLSVGRTWVKVPAGSFAFDPATGAGTVID 306

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+V+T + + VY  L E+F     R Q+A  S       L  F    FN     A    
Sbjct: 307 SGTVITRWTAPVYAALREEF-----RRQVAAPSGY---TSLGAFD-TCFNTDEVAAGGAP 357

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ------------QQRDTRFVYD 323
              L +DG     +  EN     +  P   L      Q            QQ++ R V D
Sbjct: 358 PVTLHMDGGVDLTLPMENTLIHSSATPLACLAMAEAPQNVNAVVNVVANLQQQNVRVVVD 417

Query: 324 LNIDLLSFVKENC 336
           +    + F +E C
Sbjct: 418 VAGSRVGFAREPC 430


>gi|45735840|dbj|BAD12875.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735966|dbj|BAD12995.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|125583491|gb|EAZ24422.1| hypothetical protein OsJ_08175 [Oryza sativa Japonica Group]
          Length = 475

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 88/368 (23%), Positives = 143/368 (38%), Gaps = 67/368 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP+    L +DTGS + +                 +FDP +SSS+  + C   
Sbjct: 143 VVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAA 202

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C+        C   QC Y + Y D S T G  + +T+++ G         G LFGC + 
Sbjct: 203 SCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHA 258

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G         + G+LGL R   S +SQ  S     FSYC    LP  + +  Y+  G 
Sbjct: 259 QQGLFA-----GVDGLLGLGRQGQSLVSQASSTYGGVFSYC----LPPTQNSVGYISLGG 309

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                  ST      ++   +Y + L  IS+  + ++     F        G ++D+G+V
Sbjct: 310 PSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTV 363

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
           +T      Y  L   F     R  +A       P    +  CY     F R+     P++
Sbjct: 364 VTRLPPTAYSALRSAF-----RAAMAPYGYPSAPATGILDTCY----DFTRYGTVTLPTI 414

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
           +  F        G +  +         LA AP   D   +++G+ QQR     +D +   
Sbjct: 415 SIAFGGGAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFDGST-- 467

Query: 329 LSFVKENC 336
           + F+  +C
Sbjct: 468 VGFMPASC 475


>gi|45735845|dbj|BAD12880.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
 gi|45735971|dbj|BAD13000.1| putative chloroplast nucleoid DNA-binding protein cnd41 [Oryza
           sativa Japonica Group]
          Length = 333

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/359 (22%), Positives = 150/359 (41%), Gaps = 51/359 (14%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
           + +GTP+   ++++DTGS+L +                +F+P+ SS++  + C    C+ 
Sbjct: 1   MGLGTPATQYVMVVDTGSSLTWLQCSPCLVSCHRQSGPVFNPKSSSTYASVGCSAQQCSD 60

Query: 49  F--------KCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                     C +   C+Y   Y D S + G+ + +T+S      G        +GC  D
Sbjct: 61  LPSATLNPSACSSSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSLPNFYYGCGQD 115

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           N G       G  AG++GL+R  +S + QL   +   F+YC    LP+   +        
Sbjct: 116 NEGLF-----GRSAGLIGLARNKLSLLYQLAPSLGYSFTYC----LPSSSSSGYLSLGSY 166

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           + G    +   +  ++  ++ Y++ L  +++        P +   +       IIDSG+V
Sbjct: 167 NPGQYSYTPMVSSSLD--DSLYFIKLSGMTVAGN-----PLSVSSSAYSSLPTIIDSGTV 219

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-DAN 278
           +T   + VY  L +   +  +    A        +  C+    +    P++   F   A 
Sbjct: 220 ITRLPTSVYSALSKAVAAAMKGTSRASAYSI---LDTCFKGQASRVSAPAVTMSFAGGAA 276

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           L++  +N  ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    CS
Sbjct: 277 LKLSAQN-LLVDVDDSTTCLAFAPARS-AAIIGNTQQQTFSVVYDVKSSRIGFAAGGCS 333


>gi|302790323|ref|XP_002976929.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
 gi|300155407|gb|EFJ22039.1| hypothetical protein SELMODRAFT_105896 [Selaginella moellendorffii]
          Length = 373

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 101/380 (26%), Positives = 154/380 (40%), Gaps = 57/380 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYFKC 51
           IGTP + VLL++DT S L +                F+P  SSSF    C    C     
Sbjct: 5   IGTPPREVLLLVDTASELTWVQGTSCTNCSPTKVPPFNPGLSSSFISEPCTSSVCLGRSK 64

Query: 52  VNEQ---------CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           +  Q         C + + Y D S   G  A E  S+       +     +FGC++ +  
Sbjct: 65  LGFQSACNRSTGSCSFQVAYLDGSEAYGVIAREIFSLQSWDGAASTLGDVIFGCASKDLQ 124

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKK----RFSYCLVIPLPN-GEY--TSSYL 155
              D      +G LGL+R + SF +Q+GS  K     RFSYC     PN  E+  +S  +
Sbjct: 125 RPVDFS----SGTLGLNRGSFSFPAQIGSRSKSGLSDRFSYCF----PNRAEHLNSSGVI 176

Query: 156 KFGTDMGYRRPSTQATKFINHPN-----NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            FG D G      Q       P      +FYY+ L+ IS+  E ++ P   F I   G G
Sbjct: 177 IFG-DSGIPAHHFQYLSLEQEPPIASIVDFYYVGLQGISVGGELLHIPRSAFKIDRLGNG 235

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS- 269
           G   DSG+ +++     +  L E F            SD  +  +LCY +     R P+ 
Sbjct: 236 GTYFDSGTTVSFLVEPAHTALVEAFGRRVLHLNRTSGSDFTK--ELCYDVAAGDARLPTA 293

Query: 270 --MAFYFE-DANLRIDGENVFIIDYENH-------FFLLAVAPHDDLVALIGSQQQRDTR 319
             +  +F+ + ++ +   +V++              F+ A A     V +IG+ QQ+D  
Sbjct: 294 PLVTLHFKNNVDMELREASVWVPLARTPQVVTICLAFVNAGAVAQGGVNVIGNYQQQDYL 353

Query: 320 FVYDLNIDLLSFVKENCSDD 339
             +DL    + F   NC  D
Sbjct: 354 IEHDLERSRIGFAPANCVMD 373


>gi|356495496|ref|XP_003516613.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 645

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 151/366 (41%), Gaps = 52/366 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  S ++Q + C    C 
Sbjct: 95  ARLWIGTPPQRFALIVDTGSTVTYVPCSTCRHCGSHQDPKFRPEDSETYQPVKCTW-QC- 152

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C N+  QC Y  +YA+ S + G    + +S   + E       A+FGC ND  G   
Sbjct: 153 --NCDNDRKQCTYERRYAEMSTSSGALGEDVVSFGNQTELSP--QRAIFGCENDETG--- 205

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMG 162
           D  +    G++GL R  +S + QL    +I   FS C       G       +    DM 
Sbjct: 206 DIYNQRADGIMGLGRGDLSIMDQLVEKKVISDSFSLCYGGMGVGGGAMVLGGISPPADMV 265

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           + R     +  +  P  +Y + LK+I +  +R++  P  FD    G+ G ++DSG+   Y
Sbjct: 266 FTR-----SDPVRSP--YYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTTYAY 314

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI--QLCYF-----LPETFNRFPSMAFYFE 275
                +  L  K     E   L ++S  P+P    +C+      + +    FP +   F 
Sbjct: 315 LPESAF--LAFKHAIMKETHSLKRISG-PDPRYNDICFSGAEIDVSQISKSFPVVEMVFG 371

Query: 276 DAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           + + L +  EN           + L   +  +D   L+G    R+T  +YD     + F 
Sbjct: 372 NGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHTKIGFW 431

Query: 333 KENCSD 338
           K NCS+
Sbjct: 432 KTNCSE 437


>gi|356541713|ref|XP_003539318.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 640

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 148/369 (40%), Gaps = 58/369 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  S ++Q + C      
Sbjct: 95  TRLWIGTPPQRFALIVDTGSTVTYVPCSTCKHCGSHQDPKFRPEASETYQPVKCT----- 149

Query: 48  YFKCV----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
            ++C      +QC Y  +YA+ S + G    + +S   + E       A+FGC ND  G 
Sbjct: 150 -WQCNCDDDRKQCTYERRYAEMSTSSGVLGEDVVSFGNQSELSP--QRAIFGCENDETG- 205

Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
             D  +    G++GL R  +S + QL    +I   FS C                    +
Sbjct: 206 --DIYNQRADGIMGLGRGDLSIMDQLVEKKVISDAFSLCYG--------GMGVGGGAMVL 255

Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           G   P        + P  + +Y + LK+I +  +R++  P  FD    G+ G ++DSG+ 
Sbjct: 256 GGISPPADMVFTHSDPVRSPYYNIDLKEIHVAGKRLHLNPKVFD----GKHGTVLDSGTT 311

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI--QLCYF-----LPETFNRFPSMAF 272
             Y     +  L  K     E   L ++S  P+P    +C+      + +    FP +  
Sbjct: 312 YAYLPESAF--LAFKHAIMKETHSLKRISG-PDPHYNDICFSGAEINVSQLSKSFPVVEM 368

Query: 273 YFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            F + + L +  EN           + L   +  +D   L+G    R+T  +YD     +
Sbjct: 369 VFGNGHKLSLSPENYLFRHSKVRGAYCLGVFSNGNDPTTLLGGIVVRNTLVMYDREHSKI 428

Query: 330 SFVKENCSD 338
            F K NCS+
Sbjct: 429 GFWKTNCSE 437


>gi|242086034|ref|XP_002443442.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
 gi|241944135|gb|EES17280.1| hypothetical protein SORBIDRAFT_08g019550 [Sorghum bicolor]
          Length = 443

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 91/369 (24%), Positives = 149/369 (40%), Gaps = 46/369 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +    +G P +    ++DTGS+LI+                  F+   S SF  + C   
Sbjct: 87  IAEYMVGDPPQRAEALIDTGSSLIWTQCTACLRKVCVRQDLPYFNASSSGSFAPVPCQDK 146

Query: 45  DCT----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
            C     +F  ++  C + + Y    +  GF   +  +    G   A      FGC +  
Sbjct: 147 ACAGNYLHFCALDGTCTFRVTYGAGGII-GFLGTDAFTFQSGGATLA------FGCVSFT 199

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT- 159
                D   GA +G++GL R  +S  SQ G+   KRFSYCL  P  +    SS+L  G  
Sbjct: 200 RFAAPDVLHGA-SGLIGLGRGRLSLASQTGA---KRFSYCLT-PYFHNNGASSHLFVGAA 254

Query: 160 -DMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVS----GE 209
             +     +  +  F+  P +     FYYL L  I++   ++  P   FD+        E
Sbjct: 255 ASLSGGGGAVMSMAFVESPKDYPYSTFYYLPLVGITVGETKLAIPSTAFDLQEVEEGFWE 314

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
           GG IIDSGS  T    D Y  L  +         +    +    + LC    +     P+
Sbjct: 315 GGVIIDSGSPFTSLVEDAYEPLMGELARQLNGSLVPPPGEDDGGMALCVARGDLDRVVPT 374

Query: 270 MAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +  +F   A++ +  EN +    E     +A+     L ++IG+ QQ++   ++D+    
Sbjct: 375 LVLHFSGGADMALPPEN-YWAPLEKSTACMAIV-RGYLQSIIGNFQQQNMHILFDVGGGR 432

Query: 329 LSFVKENCS 337
           LSF   +CS
Sbjct: 433 LSFQNADCS 441


>gi|238011160|gb|ACR36615.1| unknown [Zea mays]
          Length = 461

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 95/414 (22%), Positives = 149/414 (35%), Gaps = 90/414 (21%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------------------------- 26
           VR  +GTP++  LL+ DTGS L +                                    
Sbjct: 57  VRFRVGTPARPFLLVADTGSDLTWVKCRRHAAPAPAPAPAPGYNYGYGAPASNDSSSVSA 116

Query: 27  -------IFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGF 71
                  +F P +S ++  I C    CT                C Y  +Y D S  +G 
Sbjct: 117 AASSPARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYEYRYKDGSAARGT 176

Query: 72  AAHETISVI------GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISF 125
              ++ ++       GK + +A   G + GC+    G    A DG    VL L    +SF
Sbjct: 177 VGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDG----VLSLGYSNVSF 232

Query: 126 ISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK------------- 172
            S+  +    RFSYCLV  L     T SYL FG +      S   T              
Sbjct: 233 ASRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSASASRTACAGSAAAPGARQT 291

Query: 173 --FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
              ++H    FY +++  +S+D E +  P   +D  V   GG I+DSG+ LT   S  Y 
Sbjct: 292 PLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWD--VQKGGGAILDSGTSLTVLVSPAY- 348

Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN------RFPSMAFYFEDANLRIDG 283
                 V+   +  +       +P   CY               P++A +F  +      
Sbjct: 349 ---RAVVAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVHFAGSARLQPP 405

Query: 284 ENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              ++ID       + +   D   V++IG+  Q++  + +DL    L F +  C
Sbjct: 406 PKSYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSRC 459


>gi|115473845|ref|NP_001060521.1| Os07g0658600 [Oryza sativa Japonica Group]
 gi|22775625|dbj|BAC15479.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|50510141|dbj|BAD31106.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113612057|dbj|BAF22435.1| Os07g0658600 [Oryza sativa Japonica Group]
          Length = 449

 Score = 89.0 bits (219), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 86/358 (24%), Positives = 152/358 (42%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP++ +LL +DT +   +            + F+P  S+S++ + C  P C  
Sbjct: 108 VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQCVL 167

Query: 49  F---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C    + C +++ YAD S+    +  +T++V G      +     FGC     G 
Sbjct: 168 APNPSCSPNAKSCGFSLSYADSSLQAALS-QDTLAVAGD-----VVKAYTFGCLQRATGT 221

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +      +LGL R  +SF+SQ   +    FSYCL  P       S  L+ G +   
Sbjct: 222 AAPPQG-----LLGLGRGPLSFLSQTKDMYGATFSYCL--PSFKSLNFSGTLRLGRNGQP 274

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           RR  T       H ++ YY+++  I +  + ++ P        +   G ++DSG++ T  
Sbjct: 275 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 334

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
            + VY  L ++          A  S        CY    T   +P +   F+   + +  
Sbjct: 335 VAPVYLALRDEVRRRVGAGAAAVSSL--GGFDTCY---NTTVAWPPVTLLFDGMQVTLPE 389

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ENV I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E+C+
Sbjct: 390 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 447


>gi|356532386|ref|XP_003534754.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 463

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 88/372 (23%), Positives = 154/372 (41%), Gaps = 61/372 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINC----- 41
           V++ +GTP+K   +I+DTGS+L +                IF P  S +++ + C     
Sbjct: 115 VKIGLGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSTSKTYKALPCSSSQC 174

Query: 42  --------DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                   + P C+        CVY   Y D S + G+ + + +++      +A   G +
Sbjct: 175 SSLKSSTLNAPGCSN---ATGACVYKASYGDTSFSIGYLSQDVLTLT---PSEAPSSGFV 228

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           +GC  DN G       G  +G++GL+   IS + QL       FSYCL  P       SS
Sbjct: 229 YGCGQDNQGLF-----GRSSGIIGLANDKISMLGQLSKKYGNAFSYCL--PSSFSAPNSS 281

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
            L     +G    ++   KF     N      Y+L L  I++  + +     ++++    
Sbjct: 282 SLSGFLSIGASSLTSSPYKFTPLVKNQKIPSLYFLDLTTITVAGKPLGVSASSYNVPT-- 339

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFN 265
               IIDSG+V+T     VY  L + FV    +    + +  P    +  C+    +  +
Sbjct: 340 ----IIDSGTVITRLPVAVYNALKKSFVLIMSK----KYAQAPGFSILDTCFKGSVKEMS 391

Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
             P +   F   A L +   N  +++ E     LA+A   + +++IG+ QQ+  +  YD+
Sbjct: 392 TVPEIQIIFRGGAGLELKAHNS-LVEIEKGTTCLAIAASSNPISIIGNYQQQTFKVAYDV 450

Query: 325 NIDLLSFVKENC 336
               + F    C
Sbjct: 451 ANFKIGFAPGGC 462


>gi|302802500|ref|XP_002983004.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
 gi|300149157|gb|EFJ15813.1| hypothetical protein SELMODRAFT_13348 [Selaginella moellendorffii]
          Length = 332

 Score = 88.6 bits (218), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 85/345 (24%), Positives = 159/345 (46%), Gaps = 35/345 (10%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRK---SSSFQKINCDHPDCTY--FKCVNEQCVYTM 60
           +G+P K   L++DTGS L +   DP     SS+F ++  +    TY    C ++   Y+ 
Sbjct: 9   LGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKALTCADD---YSY 61

Query: 61  KYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNHGFDEDARDGALAGVLGLS 119
            Y D S T+G  + +T+ + G    +   F G +FGC +   G           G+L LS
Sbjct: 62  GYGDGSFTQGDLSVDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGE-----VGILALS 116

Query: 120 RVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG-TDMGYRRPST---QATKF-- 173
             ++SF SQ+G     +FSYCL+          S + FG   +  + P +   Q  ++  
Sbjct: 117 PGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKLQELQYTP 176

Query: 174 INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIIDSGSVLTYFHSDVYWKLH 232
           I   + +Y + L  IS+ N+R++  P  F   ++G+    I DSG+ LT     V   + 
Sbjct: 177 IGESSIYYTVRLDGISVGNQRLDLSPSAF---LNGQDKPTIFDSGTTLTMLPPGVCDSIK 233

Query: 233 EKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGENVFIIDY 291
           +   S     +   +    + +  C+ +P +  +  P + F+F      +   + ++ID 
Sbjct: 234 QSLASMVSGAEFVAI----KGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVIDL 289

Query: 292 ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +   L+ V  ++  V++ G+ QQ+D   ++D++   + F + +C
Sbjct: 290 GSLQCLIFVPTNE--VSIFGNLQQQDFFVLHDMDNRRIGFKETDC 332


>gi|449447285|ref|XP_004141399.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 609

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 150/368 (40%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IG+P +   LI+DTGS + Y                F P  SS++Q + C+  DC 
Sbjct: 91  TRLWIGSPPQEFALIVDTGSTVTYVPCSNCVQCGNHQDPRFQPELSSTYQPVKCN-ADC- 148

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C     QC Y  +YA+ S + G  A + +S  GK E + +   A+FGC     G   
Sbjct: 149 --NCDENGVQCTYERRYAEMSTSSGVLAEDVMS-FGK-ESELVPQRAVFGCETMESG--- 201

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
           D       G++GL R T+S + QL    ++   FS C         Y    +  G  +  
Sbjct: 202 DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLC---------YGGMDVGGGAMVLG 252

Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           G   P        +   + YY + LK+I +  + +   P TFD    G+ G I+DSG+  
Sbjct: 253 GISSPPGMVFSHSDPSRSPYYNIELKEIHVAGKPLKLNPRTFD----GKYGAILDSGTTY 308

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLCYF-----LPETFNRFPSMAFY 273
            YF    Y+   +  +       L Q+S  P+P    +C+      + E    FP +   
Sbjct: 309 AYFPEKAYYAFKDAIMKKISF--LKQISG-PDPNFKDICFSGAGRDVTELPKVFPEVDMV 365

Query: 274 FEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
           F +   + +  EN           + L      +D   L+G    R+T   Y+     + 
Sbjct: 366 FANGQKISLSPENYLFRHTKVSGAYCLGIFKNGNDQTTLLGGIIVRNTLVTYNRENSTIG 425

Query: 331 FVKENCSD 338
           F K NCS+
Sbjct: 426 FWKTNCSE 433


>gi|125553822|gb|EAY99427.1| hypothetical protein OsI_21398 [Oryza sativa Indica Group]
          Length = 469

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 36/321 (11%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF-----KCV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
           ++DP KSSS    +C+ P CT        C  N QC Y ++Y D + T G    + +++ 
Sbjct: 174 LYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT 233

Query: 81  GKGEGKAIFHGALFGCSNDNHG---FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
                ++      FGCS+   G   F   A     AG++ L     S +SQ  +   + F
Sbjct: 234 PATAVRSF----QFGCSHGVQGSFSFGSSA-----AGIMALGGGPESLVSQTAATYGRVF 284

Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNF 197
           S+C   P   G +T   L       +R   T   K    P  FY + L+ I++  +R+  
Sbjct: 285 SHCFPPPTRRGFFT---LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAV 341

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
           PP  F        G  +DS + +T      Y  L + F    +R  + Q +    P+  C
Sbjct: 342 PPTVF------AAGAALDSRTAITRLPPTAYQALRQAF---RDRMAMYQPAPPKGPLDTC 392

Query: 258 YFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
           Y +    +   P +   F ++A + +D   V               P+D +  +IG+ Q 
Sbjct: 393 YDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAGPNDQVPGIIGNIQL 448

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           +    +Y++   L+ F    C
Sbjct: 449 QTLEVLYNIPAALVGFRHAAC 469


>gi|115466060|ref|NP_001056629.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|55296436|dbj|BAD68559.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594669|dbj|BAF18543.1| Os06g0118700 [Oryza sativa Japonica Group]
 gi|215767921|dbj|BAH00150.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 494

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 77/321 (23%), Positives = 129/321 (40%), Gaps = 36/321 (11%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF-----KCV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
           ++DP KSSS    +C+ P CT        C  N QC Y ++Y D + T G    + +++ 
Sbjct: 199 LYDPTKSSSSGVFSCNSPTCTQLGPYANGCTNNNQCQYRVRYPDGTSTAGTYISDLLTIT 258

Query: 81  GKGEGKAIFHGALFGCSNDNHG---FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
                ++      FGCS+   G   F   A     AG++ L     S +SQ  +   + F
Sbjct: 259 PATAVRSF----QFGCSHGVQGSFSFGSSA-----AGIMALGGGPESLVSQTAATYGRVF 309

Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNF 197
           S+C   P   G +T   L       +R   T   K    P  FY + L+ I++  +R+  
Sbjct: 310 SHCFPPPTRRGFFT---LGVPRVAAWRYVLTPMLKNPAIPPTFYMVRLEAIAVAGQRIAV 366

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
           PP  F        G  +DS + +T      Y  L + F    +R  + Q +    P+  C
Sbjct: 367 PPTVF------AAGAALDSRTAITRLPPTAYQALRQAF---RDRMAMYQPAPPKGPLDTC 417

Query: 258 YFLPETFN-RFPSMAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
           Y +    +   P +   F ++A + +D   V               P+D +  +IG+ Q 
Sbjct: 418 YDMAGVRSFALPRITLVFDKNAAVELDPSGVLF----QGCLAFTAGPNDQVPGIIGNIQL 473

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           +    +Y++   L+ F    C
Sbjct: 474 QTLEVLYNIPAALVGFRHAAC 494


>gi|356555807|ref|XP_003546221.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 457

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 154/372 (41%), Gaps = 61/372 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKI------- 39
           V++ +GTP+K   +I+DTGS+L +                IF P  S +++ +       
Sbjct: 109 VKIGVGTPAKYFSMIVDTGSSLSWLQCQPCVIYCHVQVDPIFTPSVSKTYKALSCSSSQC 168

Query: 40  ------NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                   + P C+        CVY   Y D S + G+ + + +++       A   G +
Sbjct: 169 SSLKSSTLNAPGCSN---ATGACVYKASYGDTSFSIGYLSQDVLTLT---PSAAPSSGFV 222

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL---PNGEY 150
           +GC  DN G       G  AG++GL+   +S + QL +     FSYCL       PN   
Sbjct: 223 YGCGQDNQGLF-----GRSAGIIGLANDKLSMLGQLSNKYGNAFSYCLPSSFSAQPNSS- 276

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            S +L  G       P  + T  + +P   + Y+L L  I++  + +     ++++    
Sbjct: 277 VSGFLSIGASSLSSSP-YKFTPLVKNPKIPSLYFLGLTTITVAGKPLGVSASSYNVPT-- 333

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFN 265
               IIDSG+V+T     +Y  L + FV    +    + +  P    +  C+    +  +
Sbjct: 334 ----IIDSGTVITRLPVAIYNALKKSFVMIMSK----KYAQAPGFSILDTCFKGSVKEMS 385

Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
             P +   F   A L +   N  +++ E     LA+A   + +++IG+ QQ+     YD+
Sbjct: 386 TVPEIRIIFRGGAGLELKVHNS-LVEIEKGTTCLAIAASSNPISIIGNYQQQTFTVAYDV 444

Query: 325 NIDLLSFVKENC 336
               + F    C
Sbjct: 445 ANSKIGFAPGGC 456


>gi|357142295|ref|XP_003572524.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 494

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 98/382 (25%), Positives = 154/382 (40%), Gaps = 69/382 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSKG  + +DTGS +++                    ++DP  S+S + + C 
Sbjct: 91  TQIGIGTPSKGYYVQVDTGSDILWVNCISCDSCPRKSGLGIDLTLYDPTASASSKTVTCG 150

Query: 43  HPDCTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
              C              N  C Y++ Y D S T GF   + +    V G G+       
Sbjct: 151 QEFCATATNGGVPPSCAANSPCQYSITYGDGSSTTGFFVADFLQYDQVSGDGQTNLANAS 210

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGE 149
             FGC     G    + + AL G+LG  +   S +SQL S   + K FS+CL      G 
Sbjct: 211 VTFGCGAKIGGA-LGSSNVALDGILGFGQANSSMLSQLTSAGKVTKIFSHCLDTVNGGGI 269

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
           +    +         +P  + T  +   P+  Y + LK I +    +  P + FDI   G
Sbjct: 270 FAIGNV--------VQPKVKTTPLVPGMPH--YNVVLKTIDVGGSTLQLPTNIFDIG-GG 318

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRF 267
             G IIDSG+ L Y    VY  +     S      L  + D      LC+ +     N F
Sbjct: 319 SRGTIIDSGTTLAYLPEVVYKAVLSAVFSNHPDVTLKNVQDF-----LCFQYSGSVDNGF 373

Query: 268 PSMAFYFEDANLRI----------DGENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQ 315
           P + F+F D +L +          + E+V+ + +++      V   D  D+V L+G    
Sbjct: 374 PEVTFHF-DGDLPLVVYPHDYLFQNTEDVYCVGFQSG----GVQSKDGKDMV-LLGDLAL 427

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
            +   VYDL   ++ +   NCS
Sbjct: 428 SNKLVVYDLENQVIGWTNYNCS 449


>gi|21595063|gb|AAM66069.1| putative aspartyl protease [Arabidopsis thaliana]
          Length = 484

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 96/363 (26%), Positives = 148/363 (40%), Gaps = 61/363 (16%)

Query: 11  KGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF------- 49
           K + LI+DTGS L +               ++DP  SSS++ + C+   C          
Sbjct: 144 KNMSLIVDTGSDLTWVQCQPCRSCYNQQGPLYDPSVSSSYKTVFCNSSTCQDLVAATSNS 203

Query: 50  -------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                    V   C Y + Y D S T+G  A E+I +     G       +FGC  +N G
Sbjct: 204 GPCGGNNGVVKTPCEYVVSYGDGSYTRGDLASESILL-----GDTKLENFVFGCGRNNKG 258

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
               +            R ++S +SQ        FSYCL   L +G   S  L FG D  
Sbjct: 259 LFGGSSGLMGL-----GRSSVSLVSQTLKTFNGVFSYCLP-SLEDG--ASGSLSFGNDSS 310

Query: 163 YRRPSTQA--TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
               ST    T  + +P   +FY L+L   SI    +         + S   G +IDSG+
Sbjct: 311 VYTNSTSVSYTPLVQNPQLRSFYILNLTGASIGGVELK--------SSSFGRGILIDSGT 362

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
           V+T     +Y  +  +F+  F  F  A        +  C+ L    +   P +   F+ +
Sbjct: 363 VITRLPPSIYKAVKIEFLKQFSGFPTAPGYSI---LDTCFNLTSYEDISIPIIKMIFQGN 419

Query: 277 ANLRIDGENVF-IIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
           A L +D   VF  +  +     LA+A   +++ V +IG+ QQ++ R +YD   + L  V 
Sbjct: 420 AELEVDVTGVFYFVKPDASLVCLALASLSYENEVGIIGNYQQKNQRVIYDSTQERLGIVG 479

Query: 334 ENC 336
           ENC
Sbjct: 480 ENC 482


>gi|297720195|ref|NP_001172459.1| Os01g0608366 [Oryza sativa Japonica Group]
 gi|53792202|dbj|BAD52835.1| nucleoid DNA-binding protein cnd41-like [Oryza sativa Japonica
           Group]
 gi|255673454|dbj|BAH91189.1| Os01g0608366 [Oryza sativa Japonica Group]
          Length = 452

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 89/362 (24%), Positives = 144/362 (39%), Gaps = 70/362 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           ++ + +G+P+    +++DTGS + +                 A+FDP  SS++   NC  
Sbjct: 109 VISVGLGSPAVTQRVVIDTGSDVSWVQCEPCPAPSPCHAHAGALFDPAASSTYAAFNCSA 168

Query: 44  PDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
             C       E        +C Y +KY D S T G  + + +++     G  +  G  FG
Sbjct: 169 AACAQLGDSGEANGCDAKSRCQYIVKYGDGSNTTGTYSSDVLTL----SGSDVVRGFQFG 224

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL-VIPLPNGEYTSSY 154
           CS+   G   D +     G++GL     S +SQ  +   K F YCL   P  +G  T   
Sbjct: 225 CSHAELGAGMDDK---TDGLIGLGGDAQSPVSQTAARYGKSFFYCLPATPASSGFLTLGA 281

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
              G   G  R +T           +Y+ +L+DI++  +++   P  F        G ++
Sbjct: 282 PASGGGGGASRFATTPMLRSKKVPTYYFAALEDIAVGGKKLGLSPSVF------AAGSLV 335

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RF 267
           DSG+V+T      Y  L   F +   R+  A      EP+ +   L   FN         
Sbjct: 336 DSGTVITRLPPAAYAALSSAFRAGMTRYARA------EPLGI---LDTCFNFTGLDKVSI 386

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHFFL----LAVAP--HDDLVALIGSQQQRDTRFV 321
           P++A  F             ++D + H  +    LA AP   D     IG+ QQR    +
Sbjct: 387 PTVALVFAGGA---------VVDLDAHGIVSGGCLAFAPTRDDKAFGTIGNVQQRTFEVL 437

Query: 322 YD 323
           YD
Sbjct: 438 YD 439


>gi|293333354|ref|NP_001169607.1| uncharacterized protein LOC100383488 [Zea mays]
 gi|224030351|gb|ACN34251.1| unknown [Zea mays]
          Length = 342

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 82/344 (23%), Positives = 150/344 (43%), Gaps = 55/344 (15%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF---KCVNE---QCVYTMKYADQSVTKGFAAHETISVI 80
           +F+P+ SSS+  + C    C      +C  +    C YT KY+   VTKG  A + +++ 
Sbjct: 16  VFNPKLSSSYAVVPCTSDTCAQLDGHRCHEDDDGACQYTYKYSGHGVTKGTLAIDKLAI- 74

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
               G  +FH  +FGCS+ + G          +G++GL R  +S +SQL      RF YC
Sbjct: 75  ----GGDVFHAVVFGCSDSSVG----GPAAQASGLVGLGRGPLSLVSQLS---VHRFMYC 123

Query: 141 LVIPLPNGEYTSSYLKFGTDM-GYRRPSTQATKFINHPN---NFYYLSLKDISIDNE--- 193
           L  P+     TS  L  G      R  S + T  ++      ++YYL+L  +++ ++   
Sbjct: 124 LPPPM---SRTSGKLVLGAGADAVRNMSDRVTVTMSSSTRYPSYYYLNLDGLAVGDQTPG 180

Query: 194 ---RMNFPPDTFDITVSGEG-------------GCIIDSGSVLTYFHSDVYWKLHEKFVS 237
                  PP        G G             G I+D  S +++  + +Y +L +    
Sbjct: 181 TTRNATSPPSGGAGGGGGGGGGGIVGAGGANAYGMIVDVASTISFLETSLYDELADDLE- 239

Query: 238 YFERFQLAQLSDCPE-PIQLCYFLPETFNR----FPSMAFYFEDANLRIDGENVFIIDYE 292
             E  +L + +      + LC+ LPE         P+++  F+   L +D + +F+ D  
Sbjct: 240 --EEIRLPRATPSLRLGLDLCFILPEGVGMDRVYVPTVSLSFDGRWLELDRDRLFVTDGR 297

Query: 293 NHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
               ++        V+++G+ Q ++ R +++L    ++F K +C
Sbjct: 298 MMCLMIG---RTSGVSILGNFQLQNMRVLFNLRRGKITFAKASC 338


>gi|302781476|ref|XP_002972512.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
 gi|300159979|gb|EFJ26598.1| hypothetical protein SELMODRAFT_441822 [Selaginella moellendorffii]
          Length = 496

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 92/382 (24%), Positives = 162/382 (42%), Gaps = 59/382 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------IFDPRKSSSFQKINCDHPDCTYFK--- 50
           ++L IG+  K +  I+DTGS  +          +FDP  S S++++ C    C   +   
Sbjct: 102 MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQVPCISQLCLAVQQQT 161

Query: 51  -------CVNEQ--CVYTMKYADQSVTKGFAAHETISVIG-KGEGKAI-FHGALFGCSND 99
                  CVN    C Y++ Y D   + G  + + I +      G+A+ F    FGC++ 
Sbjct: 162 SNGSSQPCVNSSATCTYSLSYGDSRNSTGDFSQDVIFLNSTNSSGQAVQFRDVAFGCAHS 221

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLKFG 158
             GF  D   G+L G++G +R  +S  SQL   +   +FSYC   P    +  ++ + F 
Sbjct: 222 PQGFLVDL--GSL-GIVGFNRGNLSLPSQLKDRLGGSKFSYCF--PSQPWQPRATGVIFL 276

Query: 159 TDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
            D G  +     T  +++P     +  YY+ L  IS+D + +  P   F +  S G+GG 
Sbjct: 277 GDSGLSKSKVGYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 336

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
           ++DSG+  T    D Y      F +   R  L +          CY +    +    P +
Sbjct: 337 VLDSGTTFTRVVDDAYTAFRNAFAAS-NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 395

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL---------------IGSQQQ 315
               ++ N+R++      + +E+ F  ++ A ++  V L               +G+ QQ
Sbjct: 396 RLSLQN-NVRLE------LRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 448

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
            +    YD     + F + +CS
Sbjct: 449 SNYLVEYDNERSRVGFERADCS 470


>gi|357122155|ref|XP_003562781.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 629

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 145/364 (39%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SS++  + C   DCT
Sbjct: 87  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCSA-DCT 145

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                  QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F +
Sbjct: 146 -CDSDKSQCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 202

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +    DM 
Sbjct: 203 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPDMV 257

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           + R     +  +  P  +Y + LK+I +  + +   P  FD     + G ++DSG+   Y
Sbjct: 258 FSR-----SDPVRSP--YYNIELKEIHVAGKALRLDPRIFD----SKHGTVLDSGTTYAY 306

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
                +    +   S     +  +  D P    +C+      + +    FP +   F D 
Sbjct: 307 LPEQAFVAFKDAVTSKVRPLKKIRGPD-PNYKDICFAGAGRNVSQLSQAFPDVDMVFGDG 365

Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             L +  EN        E  + L       D   L+G    R+T   YD + + + F K 
Sbjct: 366 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 425

Query: 335 NCSD 338
           NCS+
Sbjct: 426 NCSE 429


>gi|297840891|ref|XP_002888327.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297334168|gb|EFH64586.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 473

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 156/372 (41%), Gaps = 58/372 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ +G+P K   + +DTGS +++                   ++FD   SS+ +K+ CD 
Sbjct: 77  KIKLGSPPKEYHVQVDTGSDILWVNCKPCPECPSKTNLNFHLSLFDVNASSTSKKVGCDD 136

Query: 44  PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
             C++    +       C Y + YAD+S ++G    + ++   V G  +   +    +FG
Sbjct: 137 DFCSFISQSDSCQPAVGCSYHIVYADESTSEGNFIRDKLTLEQVTGDLQTGPLGQEVVFG 196

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
           C +D  G      D A+ GV+G  +   S +SQL +    K+ FS+CL      G +   
Sbjct: 197 CGSDQSG-QLGKSDSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAVG 255

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            +          P  + T  +  PN  +Y + L  + +D   ++ PP     ++   GG 
Sbjct: 256 VVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTALDLPP-----SIMRNGGT 300

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMA 271
           I+DSG+ L YF   +Y  L E  ++     Q  +L    +  Q C+   E  +  FP ++
Sbjct: 301 IVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEDTFQ-CFSFSENVDVAFPPVS 355

Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
           F FED+       + ++   E   +                V L+G     +   VYDL 
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEKELYCFGWQAGGLTTGERTEVILLGDLVLSNKLVVYDLE 415

Query: 326 IDLLSFVKENCS 337
            +++ +   NCS
Sbjct: 416 NEVIGWADHNCS 427


>gi|125540927|gb|EAY87322.1| hypothetical protein OsI_08726 [Oryza sativa Indica Group]
          Length = 464

 Score = 88.6 bits (218), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 88/368 (23%), Positives = 143/368 (38%), Gaps = 67/368 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V + +GTP+    L +DTGS + +                 +FDP +SSS+  + C   
Sbjct: 132 VVTVSLGTPAVAQTLEVDTGSDVSWVQCKPCPSPPCYSQRDPLFDPTRSSSYSAVPCAAA 191

Query: 45  DCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C+        C   QC Y + Y D S T G  + +T+++ G         G LFGC + 
Sbjct: 192 SCSQLALYSNGCSGGQCGYVVSYGDGSTTTGVYSSDTLTLTGSNA----LKGFLFGCGHA 247

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G         + G+LGL R   S +SQ  S     FSYC    LP  + +  Y+  G 
Sbjct: 248 QQGLFA-----GVDGLLGLGRQGQSLVSQASSTYGGVFSYC----LPPTQNSVGYISLGG 298

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                  ST      ++   +Y + L  IS+  + ++     F        G ++D+G+V
Sbjct: 299 PSSTAGFSTTPLLTASNDPTYYIVMLAGISVGGQPLSIDASVF------ASGAVVDTGTV 352

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPETFNRF-----PSM 270
           +T      Y  L   F     R  +A       P    +  CY     F R+     P++
Sbjct: 353 VTRLPPTAYSALRSAF-----RAAMAPYGYPSAPATGILDTCY----DFTRYGTVTLPTI 403

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNIDL 328
           +  F        G +  +         LA AP   D   +++G+ QQR     +D +   
Sbjct: 404 SIAFGGGAAMDLGTSGILTS-----GCLAFAPTGGDSQASILGNVQQRSFEVRFDGST-- 456

Query: 329 LSFVKENC 336
           + F+  +C
Sbjct: 457 VGFMPASC 464


>gi|222637611|gb|EEE67743.1| hypothetical protein OsJ_25435 [Oryza sativa Japonica Group]
          Length = 396

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/358 (22%), Positives = 148/358 (41%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP++ +LL +DT +   +            + F+P  S+S++ + C  P C  
Sbjct: 55  VVRARLGTPAQQLLLAVDTSNDAAWIPCSGCAGCPTSSPFNPAASASYRPVPCGSPQCVL 114

Query: 49  FKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
               +     + C +++ YAD S+    +  +T++V G      +     FGC     G 
Sbjct: 115 APNPSCSPNAKSCGFSLSYADSSLQAALS-QDTLAVAGD-----VVKAYTFGCLQRATGT 168

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +           R  +SF+SQ   +    FSYCL  P       S  L+ G +   
Sbjct: 169 AAPPQGLLGL-----GRGPLSFLSQTKDMYGATFSYCL--PSFKSLNFSGTLRLGRNGQP 221

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           RR  T       H ++ YY+++  I +  + ++ P        +   G ++DSG++ T  
Sbjct: 222 RRIKTTPLLANPHRSSLYYVNMTGIRVGKKVVSIPASALAFDPATGAGTVLDSGTMFTRL 281

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
            + VY  L ++          A  S        CY    T   +P +   F+   + +  
Sbjct: 282 VAPVYLALRDEVRRRVGAGAAAVSSL--GGFDTCY---NTTVAWPPVTLLFDGMQVTLPE 336

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ENV I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E+C+
Sbjct: 337 ENVVIHTTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARESCT 394


>gi|125528357|gb|EAY76471.1| hypothetical protein OsI_04407 [Oryza sativa Indica Group]
          Length = 441

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 84/323 (26%), Positives = 137/323 (42%), Gaps = 53/323 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINC---- 41
           V L +GTP + V ++LDTGS L + +                F PR S +F  + C    
Sbjct: 67  VSLAVGTPPQNVTMVLDTGSELSWLLCAPGGGGGGGGRSALSFRPRASLTFASVPCGSAQ 126

Query: 42  ----DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
               D P        ++QC  ++ YAD S + G  A E  +V   G+G  +   A FGC 
Sbjct: 127 CRSRDLPSPPACDGASKQCRVSLSYADGSSSDGALATEVFTV---GQGPPLR--AAFGCM 181

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
                FD      A AG+LG++R  +SF+SQ  +   +RFSYC+     +    +  L  
Sbjct: 182 AT--AFDTSPDGVATAGLLGMNRGALSFVSQAST---RRFSYCI-----SDRDDAGVLLL 231

Query: 158 G-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
           G +D+ +      P  Q    + + +   Y + L  I +  + +  P        +G G 
Sbjct: 232 GHSDLPFLPLNYTPLYQPAMPLPYFDRVAYSVQLLGIRVGGKPLPIPASVLAPDHTGAGQ 291

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC----PEPIQLCYFLPE---TF 264
            ++DSG+  T+   D Y  L  +F S   +  L  L+D      E    C+ +P+     
Sbjct: 292 TMVDSGTQFTFLLGDAYSALKAEF-SRQTKPWLPALNDPNFAFQEAFDTCFRVPQGRAPP 350

Query: 265 NRFPSMAFYFEDANLRIDGENVF 287
            R P++   F  A + + G+ + 
Sbjct: 351 ARLPAVTLLFNGAQMTVAGDRLL 373


>gi|357456413|ref|XP_003598487.1| Peptidase A1, putative [Medicago truncatula]
 gi|355487535|gb|AES68738.1| Peptidase A1, putative [Medicago truncatula]
          Length = 414

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/358 (22%), Positives = 149/358 (41%), Gaps = 43/358 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP + +LL +DT +   +            +F P KS++F+ ++C  P+C   
Sbjct: 79  IVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAPECKQV 138

Query: 50  K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C + + Y   S+       +TI++              FGC +   G    
Sbjct: 139 PNPGCGVSSCNFNLTYGSSSIAANLV-QDTITL-----ATDPVPSYTFGCVSKTTGTSAP 192

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +S +SQ  ++ +  FSYCL  P       S  L+ G     +R 
Sbjct: 193 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPKR- 244

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L+ I +  + ++ PP       +   G I DSG+V T   
Sbjct: 245 -IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV 303

Query: 225 SDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           + VY  + ++F      +  +  L         CY +P      P++ F F   N+ +  
Sbjct: 304 APVYVAVRDEFRRRVGPKLTVTSLGG----FDTCYNVPIV---VPTITFIFTGMNVTLPQ 356

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+    +   +E C+
Sbjct: 357 DNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDVPNSRVGVARELCT 414


>gi|118482048|gb|ABK92955.1| unknown [Populus trichocarpa]
          Length = 425

 Score = 88.2 bits (217), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 82/357 (22%), Positives = 143/357 (40%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +V+  +GTP++  L+ LDT +   +            +F+   S++F+ + CD P C   
Sbjct: 91  IVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQV 150

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C +   Y   ++       +TI++        I  G  FGC     G    
Sbjct: 151 PNPTCGGSTCTWNTTYGGSTILSNLT-RDTIAL-----STDIVPGYTFGCIQKTTGSSVP 204

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +SF+SQ   + K  FSYCL  P       S  L+ G      R 
Sbjct: 205 PQGLLGL-----GRGPLSFLSQTQDLYKSTFSYCL--PSFRTLNFSGTLRLGPAGQPLR- 256

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L  I +  + ++ P        +   G I DSG+V T   
Sbjct: 257 -IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLV 315

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
           + VY  + ++F        ++ L         CY  P      P+M F F   N+ +  +
Sbjct: 316 APVYTAVRDEFRKRVGNAIVSSLGG----FDTCYTGPIV---APTMTFMFSGMNVTLPTD 368

Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           N+ I         LA+A   D    ++ +I + QQ++ R ++D+    +   +E CS
Sbjct: 369 NLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|242092892|ref|XP_002436936.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
 gi|241915159|gb|EER88303.1| hypothetical protein SORBIDRAFT_10g011730 [Sorghum bicolor]
          Length = 469

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 96/368 (26%), Positives = 143/368 (38%), Gaps = 53/368 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
           +V L  GTP+   +L++DTGS L +                 +FDP  SS++  + C   
Sbjct: 123 VVTLGFGTPAVPQVLLIDTGSDLSWVQCQPCNSSTCYPQKDPVFDPSASSTYAPVPCGSE 182

Query: 44  ------PDCTYFKCVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
                 PD     C N       C Y ++Y +   T G  + ET+++    E   + +  
Sbjct: 183 ACRDLDPDSYANGCTNSSSGASLCQYGIQYGNGDTTVGVYSTETLTL--SPEAATVVNNF 240

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
            FGC     G  +        G+LGL     S +SQ        FSYC    LP G  T+
Sbjct: 241 SFGC-----GLVQKGVFDLFDGLLGLGGAPESLVSQTTGTYGGAFSYC----LPAGNSTA 291

Query: 153 SYLKFGTDM--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
            +L  G     G      Q T        FY + L  IS+  ++++  P  F       G
Sbjct: 292 GFLALGAPATGGNNTAGFQFTPLQVVETTFYLVKLTGISVGGKQLDIEPTVF------AG 345

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
           G IIDSG+++T      Y  L   F S    + L   +D  E +  CY F   T    P+
Sbjct: 346 GMIIDSGTIVTGLPETAYSALRTAFRSAMSAYPLLPPND-DEDLDTCYDFTGNTNVTVPT 404

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDL 328
           +A  FE    + +D  +  ++D    F   A    D    +IG+  QR    +YD     
Sbjct: 405 VALTFEGGVTIDLDVPSGVLLDGCLAFVAGA---SDGDTGIIGNVNQRTFEVLYDSARGH 461

Query: 329 LSFVKENC 336
           + F    C
Sbjct: 462 VGFRAGAC 469


>gi|224057272|ref|XP_002299201.1| predicted protein [Populus trichocarpa]
 gi|118483775|gb|ABK93780.1| unknown [Populus trichocarpa]
 gi|222846459|gb|EEE84006.1| predicted protein [Populus trichocarpa]
          Length = 425

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 82/357 (22%), Positives = 143/357 (40%), Gaps = 42/357 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +V+  +GTP++  L+ LDT +   +            +F+   S++F+ + CD P C   
Sbjct: 91  IVKANVGTPAQTFLMALDTSNDAAWIPCNGCVGCSSTVFNSVTSTTFKTLGCDAPQCKQV 150

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C +   Y   ++       +TI++        I  G  FGC     G    
Sbjct: 151 PNPTCGGSTCTWNTTYGGSTILSNLT-RDTIAL-----STDIVPGYTFGCIQKTTGSSVP 204

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +SF+SQ   + K  FSYCL  P       S  L+ G      R 
Sbjct: 205 PQGLLGL-----GRGPLSFLSQTQDLYKSTFSYCL--PSFRTLNFSGTLRLGPAGQPLR- 256

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L  I +  + ++ P        +   G I DSG+V T   
Sbjct: 257 -IKTTPLLKNPRRSSLYYVNLIGIRVGRKIVDIPASALAFNPTTGAGTIFDSGTVFTRLV 315

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGE 284
           + VY  + ++F        ++ L         CY  P      P+M F F   N+ +  +
Sbjct: 316 APVYTAVRDEFRKRVGNAIVSSLGG----FDTCYTGPIV---APTMTFMFSGMNVTLPPD 368

Query: 285 NVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           N+ I         LA+A   D    ++ +I + QQ++ R ++D+    +   +E CS
Sbjct: 369 NLLIRSTAGSTSCLAMAAAPDNVNSVLNVIANMQQQNHRILFDVPNSRIGVAREPCS 425


>gi|296087361|emb|CBI33735.3| unnamed protein product [Vitis vinifera]
          Length = 633

 Score = 87.8 bits (216), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 52/366 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            R++IGTP +   LI+DTGS L Y                F P  SS++Q + C   +CT
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT 152

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E   CVY  +YA+ S + G    + +S   + E K      +FGC N   G   
Sbjct: 153 ---CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP--QRTVFGCENVETG--- 204

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
           D       G++GL R  +S + QL    +I   FS C         Y    +  G  +  
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLC---------YGGMDVGGGAMVLG 255

Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           G   P+       +   + YY + LK+I I  +++   P  FD    G+ G I+DSG+  
Sbjct: 256 GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTY 311

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
            Y     +    +  +      +L Q  D      +C+      + +    FP++   F 
Sbjct: 312 AYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370

Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           + N L +  EN      + H  + L      +D   L+G    R+T  +YD     + F 
Sbjct: 371 NGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFW 430

Query: 333 KENCSD 338
           K NCS+
Sbjct: 431 KTNCSE 436


>gi|225438908|ref|XP_002279194.1| PREDICTED: aspartic proteinase-like protein 2-like [Vitis vinifera]
          Length = 634

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 95/366 (25%), Positives = 149/366 (40%), Gaps = 52/366 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            R++IGTP +   LI+DTGS L Y                F P  SS++Q + C   +CT
Sbjct: 94  TRIWIGTPPQTFALIVDTGSTLTYVPCSTCEQCGKHQDPNFQPDWSSTYQPLKCSM-ECT 152

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +E   CVY  +YA+ S + G    + +S   + E K      +FGC N   G   
Sbjct: 153 ---CDSEMMHCVYDRQYAEMSSSSGVLGEDIVSFGKQSELKP--QRTVFGCENVETG--- 204

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-- 161
           D       G++GL R  +S + QL    +I   FS C         Y    +  G  +  
Sbjct: 205 DIYSQRADGIMGLGRGDLSIVDQLVEKGVIGNSFSLC---------YGGMDVGGGAMVLG 255

Query: 162 GYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
           G   P+       +   + YY + LK+I I  +++   P  FD    G+ G I+DSG+  
Sbjct: 256 GISPPAGMVFTHSDPARSAYYNIDLKEIHIAGKQLPINPMVFD----GKYGTILDSGTTY 311

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
            Y     +    +  +      +L Q  D      +C+      + +    FP++   F 
Sbjct: 312 AYLPEPAFKAFKDAIMKELNSLKLIQGPD-RNYNDICFSGVGSDVSQLSKTFPAVDLVFS 370

Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           + N L +  EN      + H  + L      +D   L+G    R+T  +YD     + F 
Sbjct: 371 NGNRLSLSPENYLFQHSKAHGAYCLGIFQNENDQTTLLGGIIVRNTLVMYDREHLKIGFW 430

Query: 333 KENCSD 338
           K NCS+
Sbjct: 431 KTNCSE 436


>gi|326515366|dbj|BAK03596.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 452

 Score = 87.8 bits (216), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 88/336 (26%), Positives = 145/336 (43%), Gaps = 27/336 (8%)

Query: 26  AIFDPRKSSSFQKINCDHPDCT--YFKCVNEQCVYTMKYA-DQSVTKGFAAHETISVIGK 82
           ++F+   S  +  I    P C   Y +    +C + +K+    S  +G    +     G 
Sbjct: 121 SVFNTAASPHYHHIASTDPRCMAPYTRAGQGRCTFDVKFQYGDSRARGVLGSDDFVFDGS 180

Query: 83  GEGKAI--FHGALFGCSNDNHGF-DEDARDGALAGVLGLSRVTISFISQLGS--IIKKRF 137
           G G  I   +G +FGC+++ H F + D      AGV+ L+R   SFI QL +  +   RF
Sbjct: 181 GPGSPISSVNGLVFGCAHNTHDFYNHDL----WAGVMSLNRHPTSFIRQLSARGLAAPRF 236

Query: 138 SYCLVIPLPNGEYTSSYLKFGTDM---GYRRPSTQATKFINHPNNFYYLSLKDISIDNER 194
           SYCL            +L+FG D+    + R +      +      YY+ +  +S+   R
Sbjct: 237 SYCLASR--QHRDRRGFLRFGADIPDQSHARSTPLLHGDLAQGGGMYYVGVVGVSLGGRR 294

Query: 195 MN-FPPDTFDITV-SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE 252
           +    P  F++   S  GGCIID G+ LT   +  Y  L  + +++     +      P 
Sbjct: 295 LTAITPVMFELNRRSLRGGCIIDVGTSLTLMATAPYHVLVAELIAHMRSRGVQHAIFSPG 354

Query: 253 PIQLCYFLPETFNR-FPSMAFYF----EDANLRIDGENVFI--IDYENHFFLLAVAPHDD 305
                    E+ +R  PS+  +F    E   L I  E +F+        +  LA+ P+ +
Sbjct: 355 QKHCFRGKWESIHRHLPSVTLHFQFHPESVALFIRPELLFVAMTGERTDYVCLAIVPYAE 414

Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDSA 341
              +IG+ Q  DTRF +DL  + L F  E C  D++
Sbjct: 415 RT-IIGAGQMLDTRFTFDLQQNRLFFAPEQCHLDTS 449


>gi|225216973|gb|ACN85264.1| aspartic proteinase nepenthesin-1 precursor [Oryza alta]
          Length = 517

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 87/364 (23%), Positives = 143/364 (39%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP+    ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 179 VVTVGLGTPASRYTVVFDTGSDTTWVQCQPCVVVCYEQQEKLFDPVRSSTYANVSCAAPA 238

Query: 46  CT---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 239 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 294

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 295 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSP 345

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               +   T  +  N P  FYY+ +  I +  + ++ P   F        G I+DSG+V+
Sbjct: 346 AAASARLTTPMLTDNGP-TFYYIGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 399

Query: 221 TYFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L            +++     L D       CY F   +    P+++  F
Sbjct: 400 TRLPPPAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 453

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L +D   +      +   L   A  D   V ++G+ Q +     YD+   ++ F 
Sbjct: 454 QGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFY 513

Query: 333 KENC 336
              C
Sbjct: 514 PGVC 517


>gi|125547728|gb|EAY93550.1| hypothetical protein OsI_15341 [Oryza sativa Indica Group]
          Length = 418

 Score = 87.4 bits (215), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 99/369 (26%), Positives = 149/369 (40%), Gaps = 75/369 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +V L  GTP + V L LDTGS + +                 +FDP  SSSF  + C  P
Sbjct: 89  LVHLAAGTPPQEVQLTLDTGSDITWTQCKRCPASACFNQTLPLFDPSASSSFASLPCSSP 148

Query: 45  DC-TYFKC------VNEQCVYTMKYADQSVTKGFAAHETIS-VIGKGEG-KAIFHGALFG 95
            C T   C       +  C Y++ Y D SV++G    E  +   G GEG  A   G +FG
Sbjct: 149 ACETTPPCGGGNDATSRPCNYSISYGDGSVSRGEIGREVFTFASGTGEGSSAAVPGLVFG 208

Query: 96  CSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           C + N G F  +       G+ G  R ++S  SQL       FS+C       G  TS+ 
Sbjct: 209 CGHANRGVFTSNE-----TGIAGFGRGSLSLPSQL---KVGNFSHCFTT--ITGSKTSAV 258

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           L  G   G   PS  A+       ++   S    S                         
Sbjct: 259 L-LGLP-GVAPPS--ASPLGRRRGSYRCRSTPRSS------------------------- 289

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAF 272
           +SG+ +T      Y  + E+F +  +   +    +  +P   C+  P    +   P+MA 
Sbjct: 290 NSGTSITSLPPRTYRAVREEFAAQVKLPVVP--GNATDPF-TCFSAPLRGPKPDVPTMAL 346

Query: 273 YFEDANLRIDGEN-VFII----DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
           +FE A +R+  EN VF +    D  N   ++ +A  +    ++G+ QQ++   +YDL   
Sbjct: 347 HFEGATMRLPQENYVFEVVDDDDAGNSSRIICLAVIEGGEIILGNIQQQNMHVLYDLQNS 406

Query: 328 LLSFVKENC 336
            LSFV   C
Sbjct: 407 KLSFVPAQC 415


>gi|326503602|dbj|BAJ86307.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 461

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 82/318 (25%), Positives = 128/318 (40%), Gaps = 43/318 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINC------ 41
           V L +GTP + V ++LDTGS L + +              F PR SS+F  + C      
Sbjct: 87  VSLAVGTPPQNVTMVLDTGSELSWLLCAPAGARNKFSAMSFRPRASSTFAAVPCASAQCR 146

Query: 42  --DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
             D P        + +C  ++ YAD S + G  A +  +V   G G  +   A FGC + 
Sbjct: 147 SRDLPSPPACDGASSRCSVSLSYADGSSSDGALATDVFAV---GSGPPLR--AAFGCMSS 201

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
              FD      A AG+LG++R  +SF+SQ  +   +RFSYC+      G     +    T
Sbjct: 202 --AFDSSPDGVASAGLLGMNRGALSFVSQAST---RRFSYCISDRDDAGVLLLGHSDLPT 256

Query: 160 --DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
              + Y      A          Y + L  I +  + +  P        +G G  ++DSG
Sbjct: 257 FLPLNYTPMYQPALPLPYFDRVAYSVQLLGIRVGGKHLPIPASVLAPDHTGAGQTMVDSG 316

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP----EPIQLCYFLPETFN----RFPS 269
           +  T+   D Y  L  +F     R  L  L D      E    C+ +P+  +    R P 
Sbjct: 317 TQFTFLLGDAYSALKAEFTRQ-ARPLLPALDDPSFAFQEAFDTCFRVPQGRSPPTARLPG 375

Query: 270 MAFYFEDANLRIDGENVF 287
           +   F  A + + G+ + 
Sbjct: 376 VTLLFNGAEMAVAGDRLL 393


>gi|224065128|ref|XP_002301682.1| predicted protein [Populus trichocarpa]
 gi|222843408|gb|EEE80955.1| predicted protein [Populus trichocarpa]
          Length = 441

 Score = 87.4 bits (215), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 77/271 (28%), Positives = 120/271 (44%), Gaps = 55/271 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IGTP +   +ILDTGS L +              ++FDP  SSSF  + C+HP C
Sbjct: 83  LVSLPIGTPPQTQQMILDTGSQLSWIQCHKKVPRKPPPSSVFDPSLSSSFSVLPCNHPLC 142

Query: 47  TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                       C  N  C Y+  YAD ++ +G    E I+         +    + GC+
Sbjct: 143 KPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKITFSRSQSTPPL----ILGCA 198

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            ++     DA+     G+LG++   +SF SQ       +FSYC+    P  +    +   
Sbjct: 199 EES----SDAK-----GILGMNLGRLSFASQAK---LTKFSYCV----PTRQVRPGFTPT 242

Query: 158 GTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDIT 205
           G+      P++   ++IN          PN     Y ++++ I I N+++N P   F   
Sbjct: 243 GSFYLGENPNSGGFRYINLLTFSQSQRMPNLDPLAYTVAMQGIRIGNQKLNIPISAFRPD 302

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
            SG G  +IDSGS  TY   + Y K+ E+ V
Sbjct: 303 PSGAGQTMIDSGSEFTYLVDEAYNKVREEVV 333


>gi|225216960|gb|ACN85252.1| aspartic proteinase nepenthesin-1 precursor [Oryza officinalis]
          Length = 519

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 87/364 (23%), Positives = 142/364 (39%), Gaps = 53/364 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP     ++ DTGS   +                +FDP +SS++  ++C  P 
Sbjct: 181 VVTVGLGTPVSRYTVVFDTGSDTTWVQCQPCVVVCYEQREKLFDPARSSTYANVSCAAPA 240

Query: 46  CTYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
           C+      C    C+Y ++Y D S + GF A +T+++           G  FGC   N G
Sbjct: 241 CSDLNIHGCSGGHCLYGVQYGDGSYSIGFFAMDTLTL----SSYDAVKGFRFGCGERNEG 296

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
              +A     AG+LGL R   S   Q        F++C    LP     + YL FG    
Sbjct: 297 LFGEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----LPARSTGTGYLDFGAGSL 347

Query: 163 YRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               +   T  +  N P  FYY+ +  I +  + ++ P   F        G I+DSG+V+
Sbjct: 348 AAASARLTTPMLTDNGP-TFYYVGMTGIRVGGQLLSIPQSVFATA-----GTIVDSGTVI 401

Query: 221 TYFHSDVYWKLH-----EKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYF 274
           T      Y  L            +++     L D       CY F   +    P+++  F
Sbjct: 402 TRLPPAAYSSLRYAFAAAMAARGYKKAPAVSLLD------TCYDFTGMSQVAIPTVSLLF 455

Query: 275 E-DANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLNIDLLSFV 332
           +  A L +D   +      +   L   A  D   V ++G+ Q +     YD+   ++ F 
Sbjct: 456 QGGARLDVDASGIMYAASASQVCLAFAANEDGGDVGIVGNTQLKTFGVAYDIGKKVVGFY 515

Query: 333 KENC 336
              C
Sbjct: 516 PGAC 519


>gi|242059211|ref|XP_002458751.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
 gi|241930726|gb|EES03871.1| hypothetical protein SORBIDRAFT_03g039590 [Sorghum bicolor]
          Length = 444

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 93/387 (24%), Positives = 156/387 (40%), Gaps = 70/387 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------------FDPRKSSSFQKINC 41
           V L +GTP + V ++LDTGS L + +                    F PR S++F  + C
Sbjct: 65  VSLAVGTPPQNVTMVLDTGSELSWLLCATGRQGSAAAGAAAAMGESFRPRASATFAAVPC 124

Query: 42  DHPDCTYF------KC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
               C+         C   + QC  ++ YAD S + G  A +  +V     G+A    + 
Sbjct: 125 GSTQCSSRDLPAPPSCDGASRQCHVSLSYADGSASDGALATDVFAV-----GEAPPLRSA 179

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC +    +D      A AG+LG++R T+SF++Q  +   +RFSYC+     +    + 
Sbjct: 180 FGCMST--AYDSSPDGVATAGLLGMNRGTLSFVTQAST---RRFSYCI-----SDRDDAG 229

Query: 154 YLKFG-TDMGYR----RPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
            L  G +D+ +      P  Q T  + + +   Y + L  I +  + +  P        +
Sbjct: 230 VLLLGHSDLPFLPLNYTPLYQPTLPLPYFDRVAYSVQLLGIRVGGKALPIPASVLAPDHT 289

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-----EPIQLCYFL-- 260
           G G  ++DSG+  T+   D Y  L  +F+   +   L +  D P     E +  C+ +  
Sbjct: 290 GAGQTMVDSGTQFTFLLGDAYSALKAEFLKQTK--PLLRALDDPSFAFQEALDTCFRVPA 347

Query: 261 --PETFNRFPSMAFYFEDANLRIDGENVFIIDYENH-----FFLLAVAPHDDLVAL---- 309
             P    R P +   F  A + + G+ +       H      + L    + D+V L    
Sbjct: 348 GRPPPSARLPPVTLLFNGAEMSVAGDRLLYKVPGEHRGADGVWCLTFG-NADMVPLTAYV 406

Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           IG   Q +    YDL    +      C
Sbjct: 407 IGHHHQMNLWVEYDLERGRVGLAPVKC 433


>gi|358345197|ref|XP_003636668.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355502603|gb|AES83806.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 294

 Score = 87.0 bits (214), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/220 (30%), Positives = 100/220 (45%), Gaps = 29/220 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP   +    DTGS LI+               +FD + SS+F  I C    C
Sbjct: 60  LMELSIGTPPVKIYAQADTGSDLIWLQCIPCTNCYKQLNPMFDSQSSSTFSNIACGSESC 119

Query: 47  TYF---KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           +      C  +Q  C Y   Y D S T+G  A ET+++         F G +FGC ++N+
Sbjct: 120 SKLYSTSCSPDQINCKYNYSYVDGSETQGVLAQETLTLTSTTGEPVAFKGVIFGCGHNNN 179

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTD 160
           G   D       G++GL R  +S +SQ+GS +    FS CLV P       SS + FG  
Sbjct: 180 GAFNDKE----MGIIGLGRGPLSLVSQIGSSLGGNMFSQCLV-PFNTNPSISSPMSFGKG 234

Query: 161 MGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFP 198
                    +T  ++     +FY+++L  IS+  E +N P
Sbjct: 235 SEVLGNGVVSTPLVSKTTYQSFYFVTLLGISV--EDINLP 272


>gi|326521034|dbj|BAJ92880.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 448

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 79/358 (22%), Positives = 144/358 (40%), Gaps = 41/358 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  +GTP + +LL +DT +   +              F+P  S S++ + C  P C+ 
Sbjct: 109 VVRARLGTPPQQLLLAVDTSNDAAWIPCSGCAGCPTTTPFNPAASKSYRAVPCGSPACSR 168

Query: 49  F---KCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                C    + C +++ YAD S+    +  ++++V        +     FGC     G 
Sbjct: 169 APNPSCSLNTKSCGFSLTYADSSLEAALS-QDSLAVAND-----VVKSYTFGCLQKATGT 222

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +           R  +SF+SQ   + +  FSYCL  P       S  L+ G     
Sbjct: 223 ATPPQGLLGL-----GRGPLSFLSQTKDMYEGTFSYCL--PSFKSLNFSGTLRLGRKGQP 275

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            R  T       H ++ YY+S+  I +  + +  PP       +   G ++DSG++ T  
Sbjct: 276 LRIKTTPLLVNPHRSSLYYVSMTGIRVGKKVVPIPPAALAFDPATGAGTVLDSGTMFTRL 335

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
            +  Y  + ++         L+ L         CY    T  ++P + F F    + +  
Sbjct: 336 VAPAYVAVRDEVRRRIRGAPLSSLGG----FDTCY---NTTVKWPPVTFMFTGMQVTLPA 388

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E C+
Sbjct: 389 DNLVIHSTYGTTSCLAMAAAPDGVNTVLNVIASMQQQNHRILFDVPNGRVGFAREQCT 446


>gi|293332531|ref|NP_001169558.1| uncharacterized protein LOC100383437 precursor [Zea mays]
 gi|224030089|gb|ACN34120.1| unknown [Zea mays]
 gi|413925069|gb|AFW65001.1| hypothetical protein ZEAMMB73_160528 [Zea mays]
          Length = 491

 Score = 86.7 bits (213), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 161/377 (42%), Gaps = 69/377 (18%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           +GTP K   + +DTGS +++                    ++DP+ SS+   + CD   C
Sbjct: 92  LGTPPKHYYVQVDTGSDILWVNCITCEQCPHKSGLGLDLTLYDPKASSTGSMVMCDQAFC 151

Query: 47  TYF------KC-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGC 96
                    KC  N  C Y++ Y D S T G    + +    V   G+ +      +FGC
Sbjct: 152 AATFGGKLPKCGANVPCEYSVTYGDGSSTIGSFVTDALQFDQVTRDGQTQPANASVIFGC 211

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSY 154
                G D  + + AL G+LG      S +SQL +   +KK F++CL      G ++   
Sbjct: 212 GA-QQGGDLGSSNQALDGILGFGEANTSMLSQLTTAGKVKKIFAHCLDTIKGGGIFSIGD 270

Query: 155 LKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGC 212
           +         +P  + T  + + P+  Y ++LK I +    +  P   F+    GE  G 
Sbjct: 271 V--------VQPKVKTTPLVADKPH--YNVNLKTIDVGGTTLQLPAHIFE---PGEKKGT 317

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ--LCYFLPETF-NRFPS 269
           IIDSG+ LTY    V+    E  ++ F + Q     D    +Q  LC+  P +  + FP+
Sbjct: 318 IIDSGTTLTYLPELVF---KEVMLAVFNKHQDITFHD----VQGFLCFQYPGSVDDGFPT 370

Query: 270 MAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
           + F+FE D  L +        +G +V+ + ++N     + +     + L+G     +   
Sbjct: 371 ITFHFEDDLALHVYPHEYFFANGNDVYCVGFQNG---ASQSKDGKDIVLMGDLVLSNKLV 427

Query: 321 VYDLNIDLLSFVKENCS 337
           +YDL   ++ +   NCS
Sbjct: 428 IYDLENRVIGWTDYNCS 444


>gi|222624645|gb|EEE58777.1| hypothetical protein OsJ_10300 [Oryza sativa Japonica Group]
          Length = 431

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 153/386 (39%), Gaps = 73/386 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKIN--------------CDHPDCT 47
           V + +GTP + V ++LDTGS L + + +   +    + +              CD P   
Sbjct: 57  VPVAVGTPPQNVTMVLDTGSELSWLLCNGSYAPPLTRRSTRRWRGRDLPVPPFCDTPP-- 114

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC--------SND 99
                +  C  ++ YAD S   G  A +T  + G     A+  GA FGC        + +
Sbjct: 115 -----SNACRVSLSYADASSADGVLATDTFLLTGGAPPVAV--GAYFGCITSYSSTTATN 167

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           ++G   D  + A  G+LG++R T+SF++Q G+   +RF+YC+      GE     L  G 
Sbjct: 168 SNGTGTDVSEAA-TGLLGMNRGTLSFVTQTGT---RRFAYCIA----PGE-GPGVLLLGD 218

Query: 160 DMGYRRPSTQATKF-INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
           D G   P        I+ P  +     Y + L+ I +    +  P        +G G  +
Sbjct: 219 DGGVAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGCALLPIPKSVLTPDHTGAGQTM 278

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPE---- 262
           +DSG+  T+  +D Y  L  +F S   R  LA L    EP          C+  PE    
Sbjct: 279 VDSGTQFTFLLADAYAALKAEFTSQ-ARLLLAPLG---EPGFVFQGAFDACFRGPEARVA 334

Query: 263 -TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVAL----I 310
                 P +      A + + GE  ++++  E      A A       + D+  +    I
Sbjct: 335 AASGLLPEVGLVLRGAEVAVSGEKLLYMVPGERRGEGGAEAVWCLTFGNSDMAGMSAYVI 394

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
           G   Q++    YDL    + F    C
Sbjct: 395 GHHHQQNVWVEYDLQNGRVGFAPARC 420


>gi|242089069|ref|XP_002440367.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
 gi|241945652|gb|EES18797.1| hypothetical protein SORBIDRAFT_09g030430 [Sorghum bicolor]
          Length = 462

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 141/358 (39%), Gaps = 70/358 (19%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP    LL+LDTGS +++               +FDPR+S S+  + C  P C     
Sbjct: 148 VGTPPTPALLVLDTGSDVVWLQCAPCRQCYAQSGRVFDPRRSRSYAAVRCGAPPCRGLDA 207

Query: 52  VNE--------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
                       C+Y + Y D SVT G  A ET+       G  +   A+ GC +DN G 
Sbjct: 208 GGGGGCDRRRGTCLYQVAYGDGSVTAGDLATETLWF---ARGARVPRVAV-GCGHDNEGL 263

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A            R  +S  +Q      +RFSYC                 G+D+ +
Sbjct: 264 FVAAAGLLGL-----GRGRLSLPTQTARRYGRRFSYCFQ---------------GSDLDH 303

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           R         I    + +    +   +    +   P T      G GG I+DSG+ +T  
Sbjct: 304 R--------TIIRTVHQHVGGARVRGVGERSLRLDPST------GRGGVILDSGTSVTRL 349

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFL-PETFNRFPSMAFYFED-AN 278
              VY  + E F +     +LA     P    L   CY L      + P+++ +    A 
Sbjct: 350 ARPVYVAVREAFRAAAGGLRLA-----PGGFSLFDTCYDLRGRRVVKVPTVSVHLAGGAE 404

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           + +  EN  I       F LA+A  D  V+++G+ QQ+  R V+D +   ++ V ++C
Sbjct: 405 VALPPENYLIPVDTRGTFCLALAGTDGGVSIVGNIQQQGFRVVFDGDRQRVALVPKSC 462


>gi|222632750|gb|EEE64882.1| hypothetical protein OsJ_19741 [Oryza sativa Japonica Group]
          Length = 456

 Score = 86.7 bits (213), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 102/359 (28%), Positives = 155/359 (43%), Gaps = 58/359 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA---IFDP-----RKSSSF-------QKINCDHPDCTYFK 50
           +GTP+   L++LDTGS +++A      P     R+ SS         + NC  P C    
Sbjct: 128 VGTPATTALMVLDTGSDVVWAPVRALPPLLRAVRQGSSTGAAPAPTPRWNCVAPICRRLD 187

Query: 51  CVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                     C+Y + Y D SVT G  A ET++   +G   A       GC +DN G   
Sbjct: 188 SAGCDRRRNSCLYQVAYGDGSVTAGDFASETLT-FARG---ARVQRVAIGCGHDNEGLFI 243

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
                A +G+LGL R  +SF SQ+     + FSYCLV    +     S    GT     R
Sbjct: 244 -----AASGLLGLGRGRLSFPSQIARSFGRSFSYCLVDRTSSRRARPSRRWGGTP----R 294

Query: 166 PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITV---SGEGGCIIDSGSVLTY 222
            +T           FYY+ L   S+   R+     + D+ +   +G GG I+DSG+ +T 
Sbjct: 295 MAT-----------FYYVHLLGFSVGGARVKGVSQS-DLRLNPTTGRGGVILDSGTSVTR 342

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL---CYFLP-ETFNRFPSMAFYFE-DA 277
               VY  + + F     R     L   P    L   CY L      + P+++ +    A
Sbjct: 343 LARPVYEAVRDAF-----RAAAVGLRVSPGGFSLFDTCYNLSGRRVVKVPTVSMHLAGGA 397

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++ +  EN  I    +  F  A+A  D  V++IG+ QQ+  R V+D +   + FV ++C
Sbjct: 398 SVALPPENYLIPVDTSGTFCFAMAGTDGGVSIIGNIQQQGFRVVFDGDAQRVGFVPKSC 456


>gi|255565759|ref|XP_002523869.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223536957|gb|EEF38595.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 447

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 92/385 (23%), Positives = 155/385 (40%), Gaps = 67/385 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
           + L  GTP + +  ++DTGS+ ++                  + F P+ SSS + I C +
Sbjct: 79  ISLSFGTPPQTLSFVMDTGSSFVWFPCTLRYLCNNCSFTSRISPFLPKHSSSSKIIGCKN 138

Query: 44  PDCTYFKCVNEQCV---------------YTMKYADQSVTKGFAAHETISVIGKGEGKAI 88
           P C++    + +C                Y + Y     T G A  ET+ + G      I
Sbjct: 139 PKCSWIHQTDLRCTDCDNNSRNCSQICPPYLILYGS-GTTGGVALSETLHLHG-----LI 192

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--P 146
               L GCS     F         AG+ G  R   S  SQLG     +FSYCL+      
Sbjct: 193 VPNFLVGCSV----FSSRQP----AGIAGFGRGPSSLPSQLG---LTKFSYCLLSHKFDD 241

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHP--------NNFYYLSLKDISIDNERMNFP 198
             E +S  L   +D   +  +   T  + +P        + +YY+SL+ ISI    +  P
Sbjct: 242 TQESSSLVLDSQSDSDKKTAALMYTPLVKNPKVQDKPAFSVYYYVSLRRISIGGRSVKIP 301

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
                    G GG IIDSG+  TY  ++ +  L  +F+S  + ++ A + +    ++ C+
Sbjct: 302 YKYLSPDKDGNGGTIIDSGTTFTYMSTEAFEILSNEFISQVKNYERALMVEALSGLKPCF 361

Query: 259 FLPETFN-RFPSMAFYFE---DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIG 311
            +        P +  +F+   D  L ++    F+   E   F +     +       ++G
Sbjct: 362 NVSGAKELELPQLRLHFKGGADVELPLENYFAFLGSREVACFTVVTDGAEKASGPGMILG 421

Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
           + Q ++    YDL  + L F KE+C
Sbjct: 422 NFQMQNFYVEYDLQNERLGFKKESC 446


>gi|388508518|gb|AFK42325.1| unknown [Lotus japonicus]
          Length = 204

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 104/207 (50%), Gaps = 12/207 (5%)

Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISID 191
           + +FSYCL       +  +S L  G+ +        +T  + +P+  +FYYLSL+ I + 
Sbjct: 3   EAKFSYCLT---SMDDSKASVLLLGS-LAKATKDAISTPLLTNPSQPSFYYLSLEGIPVG 58

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
             +++     FD++  G GG IIDSG+ +TY    V+  L ++F+S     QL + S   
Sbjct: 59  GTQLSIEQSIFDVSDDGSGGVIIDSGTTITYLEKSVFDTLKKEFISQ-SNLQLDKSSST- 116

Query: 252 EPIQLCYFLPE--TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL 309
             + +C+ LP   T    P + F+F+  +L +  E+  I D +     LA+   + + ++
Sbjct: 117 -GLDVCFSLPSETTQVEVPKLVFHFKGGDLELPAESYMIADSKLGVACLAMGASNGM-SI 174

Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
            G+ QQ++    +DL  + +SFV   C
Sbjct: 175 FGNVQQQNILVNHDLEKETISFVPTQC 201


>gi|218190722|gb|EEC73149.1| hypothetical protein OsI_07179 [Oryza sativa Indica Group]
          Length = 494

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/381 (24%), Positives = 155/381 (40%), Gaps = 69/381 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IGTP+K   + +DTGS +++                    ++DPR S S + + CD
Sbjct: 92  TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151

Query: 43  H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
                       P CT        C Y++ Y D S T GF   + +    V G G+    
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
                FGC     G D  + + AL G+LG  +   S +SQL +   ++K F++CL     
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITV 206
            G +    +         +P  + T  ++   + Y + LK I +    +  P + FD   
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVSDMPH-YNVILKGIDVGGTALGLPTNIFDSGN 317

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
           S   G IIDSG+ L Y    VY  L        +   +  L D       C+ +     +
Sbjct: 318 S--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVDD 370

Query: 266 RFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
            FP + F+FE D +L +        +G+N++ + ++N    +      D+V L+G     
Sbjct: 371 GFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGG--VQTKDGKDMV-LLGDLVLS 427

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +   +YDL    + +   NCS
Sbjct: 428 NKLVLYDLENQAIGWADYNCS 448


>gi|224124882|ref|XP_002329972.1| predicted protein [Populus trichocarpa]
 gi|222871994|gb|EEF09125.1| predicted protein [Populus trichocarpa]
          Length = 332

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/355 (22%), Positives = 151/355 (42%), Gaps = 57/355 (16%)

Query: 15  LILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTYFKCV------- 52
           +ILDTGS+L +                ++DP  S +++K++C   +C+  K         
Sbjct: 1   MILDTGSSLSWLQCQPCAVYCHAQADPLYDPSVSKTYKKLSCASVECSRLKAATLNDPLC 60

Query: 53  ---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
              +  C+YT  Y D S + G+ + + +++              +GC  DN G       
Sbjct: 61  ETDSNACLYTASYGDTSFSIGYLSQDLLTL----TSSQTLPQFTYGCGQDNQGLF----- 111

Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQ 169
           G  AG++GL+R  +S ++QL +     FSYC    LP     SS   F +       S +
Sbjct: 112 GRAAGIIGLARDKLSMLAQLSTKYGHAFSYC----LPTANSGSSGGGFLSIGSISPTSYK 167

Query: 170 ATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDV 227
            T  +    N   Y+L L  I++    ++     + +        +IDSG+V+T     +
Sbjct: 168 FTPMLTDSKNPSLYFLRLTAITVSGRPLDLAAAMYRVPT------LIDSGTVITRLPMSM 221

Query: 228 YWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLP-ETFNRFPSMAFYFE-DANLRIDG 283
           Y  L + FV    +    + +  P    +  C+    ++ +  P +   F+  A+L +  
Sbjct: 222 YAALRQAFV----KIMSTKYAKAPAYSILDTCFKGSLKSISAVPEIKMIFQGGADLTLRA 277

Query: 284 ENVFIIDYENHFFLLAVAPHD--DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++ +I+ +     LA A     + +A+IG++QQ+     YD++   + F   +C
Sbjct: 278 PSI-LIEADKGITCLAFAGSSGTNQIAIIGNRQQQTYNIAYDVSTSRIGFAPGSC 331


>gi|115473125|ref|NP_001060161.1| Os07g0592200 [Oryza sativa Japonica Group]
 gi|29027762|dbj|BAC65898.1| putative CND41, chloroplast nucleoid DNA binding protein [Oryza
           sativa Japonica Group]
 gi|113611697|dbj|BAF22075.1| Os07g0592200 [Oryza sativa Japonica Group]
          Length = 631

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 94/369 (25%), Positives = 150/369 (40%), Gaps = 58/369 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTPS+   LI+D+GS + Y                F P  SS++  + C+  DCT
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 151

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--F 103
              C NE  QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F
Sbjct: 152 ---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFGCENTETGDLF 206

Query: 104 DEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
            + A      G++GL R  +S + QL    +I   FS C         Y    +  GT +
Sbjct: 207 SQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YGGMDVGGGTMV 252

Query: 162 GYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
               P+     F +H N     +Y + LK+I +  + +   P  F+     + G ++DSG
Sbjct: 253 LGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN----SKHGTVLDSG 307

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAF 272
           +   Y     +    +   +     +  +  D P    +C+      + +    FP +  
Sbjct: 308 TTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQLSEVFPDVDM 366

Query: 273 YFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
            F +   L +  EN        E  + L       D   L+G    R+T   YD + + +
Sbjct: 367 VFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKI 426

Query: 330 SFVKENCSD 338
            F K NCS+
Sbjct: 427 GFWKTNCSE 435


>gi|115446115|ref|NP_001046837.1| Os02g0473200 [Oryza sativa Japonica Group]
 gi|47497549|dbj|BAD19621.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|47847591|dbj|BAD21978.1| putative aspartic proteinase nepenthesin [Oryza sativa Japonica
           Group]
 gi|113536368|dbj|BAF08751.1| Os02g0473200 [Oryza sativa Japonica Group]
          Length = 494

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 95/382 (24%), Positives = 156/382 (40%), Gaps = 71/382 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IGTP+K   + +DTGS +++                    ++DPR S S + + CD
Sbjct: 92  TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151

Query: 43  H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
                       P CT        C Y++ Y D S T GF   + +    V G G+    
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
                FGC     G D  + + AL G+LG  +   S +SQL +   ++K F++CL     
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
            G +    +         +P  + T  + + P+  Y + LK I +    +  P + FD  
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVPDMPH--YNVILKGIDVGGTALGLPTNIFDSG 316

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
            S   G IIDSG+ L Y    VY  L        +   +  L D       C+ +     
Sbjct: 317 NS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVD 369

Query: 265 NRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQ 315
           + FP + F+FE D +L +        +G+N++ + ++N    +      D+V L+G    
Sbjct: 370 DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQNGG--VQTKDGKDMV-LLGDLVL 426

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
            +   +YDL    + +   NCS
Sbjct: 427 SNKLVLYDLENQAIGWADYNCS 448


>gi|326513976|dbj|BAJ92138.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 342

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 73/267 (27%), Positives = 119/267 (44%), Gaps = 43/267 (16%)

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC   + G    A     +G++GLS  T+S ISQL      RFSYCL    P  E  +S
Sbjct: 96  FGCGALSAGSLVGA-----SGLMGLSPGTMSLISQLSV---PRFSYCLT---PFAERKTS 144

Query: 154 YLKFGTDMGYRRPST----QATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITV 206
            + FG     R+ +T    Q T  + +P     +YY+ L  +S+  +R+  P  +  I  
Sbjct: 145 PMLFGAMADLRKYNTTGPIQTTAILRNPAMDTFYYYVPLVGLSLGTKRLRVPAASLAINP 204

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
            G GG I+DSGS + +     +  + +   +  E  +L   +   E  +LC+ +P     
Sbjct: 205 DGTGGTIVDSGSTMAHLAGKAFDAVKK---AVLEAVKLPVFNGTVEDYELCFAVPSGVAM 261

Query: 266 ---RFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVA--PHD--DLVALIG 311
              + P +  +F       DG     +  +N+F         LAVA  P D    +++IG
Sbjct: 262 AAVKTPPLVLHF-------DGGAAMALPRDNYFQEPRAGLMCLAVARSPEDLGAPISIIG 314

Query: 312 SQQQRDTRFVYDLNIDLLSFVKENCSD 338
           + QQ++   ++D++    SF    C D
Sbjct: 315 NVQQQNMHVLFDVHNQKFSFAPTKCHD 341


>gi|255548660|ref|XP_002515386.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
 gi|223545330|gb|EEF46835.1| Aspartic proteinase nepenthesin-2 precursor, putative [Ricinus
           communis]
          Length = 387

 Score = 86.3 bits (212), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 103/368 (27%), Positives = 158/368 (42%), Gaps = 57/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V++ +GTP   + L LDTGS + +                 FDPRKSSS++ ++C    
Sbjct: 46  LVKMALGTPKLSLSLALDTGSDITWTQCEPCVGSCYRQAQTKFDPRKSSSYKNVSCSSSS 105

Query: 46  CTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
           C           CV+  C+Y ++Y D S + GF A E +++        +    LFGC  
Sbjct: 106 CRIITDSGGARGCVSSTCIYKVQYGDGSYSVGFFATEKLTI----SPSDVISNFLFGCGQ 161

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N G     R G +AG+LGL R  +S   Q        F+YCL        ++SS     
Sbjct: 162 QNAG-----RFGRIAGLLGLGRGKLSLALQTSEKYNNLFTYCLP------SFSSSSTGHL 210

Query: 159 TDMGYRRPSTQAT----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           T  G    S + T     F N P  FY + +K +S+    +       D +V    G II
Sbjct: 211 TLGGQVPKSVKFTPLSPAFKNTP--FYGIDIKGLSVGGHVL-----PIDASVFSNAGAII 263

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFY 273
           DSG+V+T     VY  L  KF    + +     +D    +  CY F        P ++F+
Sbjct: 264 DSGTVITRLQPTVYSALSSKFQQLMKDY---PKTDGFSILDTCYDFSGNESISVPRISFF 320

Query: 274 FEDANLRIDGENVFIIDYENHF--FLLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLL 329
           F+   + +D +   I+   N +    LA AP+DD     + G+ QQ+    V+DL    +
Sbjct: 321 FK-GGVEVDIKFFGILTVINAWDKVCLAFAPNDDDGDFVVFGNSQQQTYDVVHDLAKGRI 379

Query: 330 SFVKENCS 337
            F    C+
Sbjct: 380 GFAPSGCN 387


>gi|414881575|tpg|DAA58706.1| TPA: hypothetical protein ZEAMMB73_168363 [Zea mays]
          Length = 506

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 98/385 (25%), Positives = 154/385 (40%), Gaps = 69/385 (17%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
           +GTP K   + +DTGS +++                     +DP+ SSS   ++CD    
Sbjct: 93  LGTPPKRYYVQVDTGSDILWVNCISCSKCPRKSGLGLDLTFYDPKASSSGSTVSCDQGFC 152

Query: 44  --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
                   P CT     N  C Y++ Y D S T GF   + +    V G G+ +      
Sbjct: 153 AATYGGKLPGCT----ANVPCEYSVMYGDGSSTTGFFITDALQFDQVTGDGQTQPGNATI 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCL-------VI 143
            FGC     G D    + AL G+LG  +   S +SQL +    KK F++CL       + 
Sbjct: 209 TFGCGAQQGG-DLGNSNQALDGILGFGQANTSMLSQLAAAGKAKKIFAHCLDTIKGGGIF 267

Query: 144 PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
            + N      Y  F    G           I      Y ++LK I +    +  P   F+
Sbjct: 268 AIGNVVQPKCYFVFFFAHGLLNIPLFLLVMILLSRPHYNVNLKSIDVGGTTLQLPAHVFE 327

Query: 204 ITVSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLP 261
              +GE  G IIDSG+ LTY    V+ ++ +   S         L D      LC+ +  
Sbjct: 328 ---TGEKKGTIIDSGTTLTYLPELVFKQVMDVVFSKHRDIAFHNLQDF-----LCFQYSG 379

Query: 262 ETFNRFPSMAFYFE-DANLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGS 312
              + FP++ F+FE D  L +        +G +++ + ++N    L      D+V L+G 
Sbjct: 380 SVDDGFPTITFHFEDDLALHVYPHEYFFPNGNDIYCVGFQNG--ALQSKDGKDIV-LMGD 436

Query: 313 QQQRDTRFVYDLNIDLLSFVKENCS 337
               +   VYDL   ++ +   NCS
Sbjct: 437 LVLSNKLVVYDLENQVIGWTDYNCS 461


>gi|255567949|ref|XP_002524952.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223535787|gb|EEF37449.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 394

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 82/311 (26%), Positives = 132/311 (42%), Gaps = 50/311 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
            R++IGTP +   LI+DTGS + Y                F+P  SS++Q ++C+  DCT
Sbjct: 92  TRIWIGTPPQTFALIVDTGSTVTYVPCSTCEQCGRHQDPKFEPELSSTYQPVSCNI-DCT 150

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C NE  QCVY  +YA+ S + G    + IS   + E   +   A+FGC N   G   
Sbjct: 151 ---CDNERKQCVYERQYAEMSSSSGVLGEDIISFGNQSE--LVPQRAIFGCENQETG--- 202

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-MG 162
           D       G++GL R  +S + QL    +I   FS C         Y    +  G   +G
Sbjct: 203 DLYSQRADGIMGLGRGDLSIVDQLVEKGVISDSFSLC---------YGGMDIGGGAMILG 253

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P +      + P  + +Y + LK I +  ++++  P  FD    G+ G ++DSG+  
Sbjct: 254 GISPPSGMVFAESDPVRSQYYNIDLKAIHVAGKQLHLDPSIFD----GKHGTVLDSGTTY 309

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-----FNRFPSMAFYFE 275
            Y     +    +  +      +     D P    +C+   E+      N FP++   F 
Sbjct: 310 AYLPEAAFTAFKDAMMKELTSLKQIHGPD-PNYNDICFSGAESDVSQLSNTFPAVEMVFS 368

Query: 276 DAN-LRIDGEN 285
           +   L +  EN
Sbjct: 369 NGQKLSLSPEN 379


>gi|15217887|ref|NP_176703.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
 gi|118572746|sp|Q9S9K4.2|ASPL2_ARATH RecName: Full=Aspartic proteinase-like protein 2; Flags: Precursor
 gi|332196226|gb|AEE34347.1| aspartic proteinase-like protein 2 [Arabidopsis thaliana]
          Length = 475

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/372 (23%), Positives = 153/372 (41%), Gaps = 56/372 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G+P K   + +DTGS +++                   ++FD   SS+ +K+ CD
Sbjct: 76  TKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCD 135

Query: 43  HPDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
              C++    +       C Y + YAD+S + G    + ++   V G  +   +    +F
Sbjct: 136 DDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVF 195

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTS 152
           GC +D  G   +  D A+ GV+G  +   S +SQL +    K+ FS+CL      G +  
Sbjct: 196 GCGSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAV 254

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
             +          P  + T  +  PN  +Y + L  + +D   ++ P      ++   GG
Sbjct: 255 GVVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTSLDLPR-----SIVRNGG 299

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
            I+DSG+ L YF   +Y  L E  ++     Q  +L    E  Q   F       FP ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEETFQCFSFSTNVDEAFPPVS 355

Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
           F FED+       + ++   E   +                V L+G     +   VYDL+
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415

Query: 326 IDLLSFVKENCS 337
            +++ +   NCS
Sbjct: 416 NEVIGWADHNCS 427


>gi|357502759|ref|XP_003621668.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355496683|gb|AES77886.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 481

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 159/378 (42%), Gaps = 60/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   L +DTG+ +++                    +++ ++SSS + + CD
Sbjct: 75  AKIGIGTPSKDYYLQVDTGTDMMWVNCIQCKECPTRSNLGMDLTLYNIKESSSGKLVPCD 134

Query: 43  HPDCTYFK------CV---NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              C          C    N+ C Y   Y D S T G+   + +    V G  +  +   
Sbjct: 135 QELCKEINGGLLTGCTSKTNDSCPYLEIYGDGSSTAGYFVKDVVLFDQVSGDLKTASANG 194

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
             +FGC     G    + + AL G+LG  +   S ISQL S   +KK F++CL     NG
Sbjct: 195 SVIFGCGARQSGDLSYSNEEALDGILGFGKANYSMISQLSSSGKVKKMFAHCL-----NG 249

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
                    G  +   +P+   T  + + P+  Y +++  I + +  +N   D  +   S
Sbjct: 250 VNGGGIFAIGHVV---QPTVNTTPLLPDQPH--YSVNMTAIQVGHTFLNLSTDASEQRDS 304

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF 267
              G IIDSG+ L Y    +Y  L  K +S     ++  L D     Q   +     + F
Sbjct: 305 --KGTIIDSGTTLAYLPDGIYQPLVYKILSQQPNLKVQTLHDEYTCFQ---YSGSVDDGF 359

Query: 268 PSMAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
           P++ FYFE+  +L++         EN++ I ++N     A +     + L+G     +  
Sbjct: 360 PNVTFYFENGLSLKVYPHDYLFLSENLWCIGWQNSG---AQSRDSKNMTLLGDLVLSNKL 416

Query: 320 FVYDLNIDLLSFVKENCS 337
             YDL   ++ + + NCS
Sbjct: 417 VFYDLENQVIGWTEYNCS 434


>gi|302141829|emb|CBI19032.3| unnamed protein product [Vitis vinifera]
          Length = 382

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 76/252 (30%), Positives = 117/252 (46%), Gaps = 25/252 (9%)

Query: 94  FGCSNDNH--GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
           FGC  +N   G D+ A    L       R  +S +SQLG+   ++FSYCL       E  
Sbjct: 144 FGCGVNNRATGMDQTAGLLGLG------RGVLSLVSQLGT---QKFSYCLT---SIHENK 191

Query: 152 SSYLKFGTDM--GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +S L FG+     +       T  I +P   ++YYL+LK I++    +  P   F +   
Sbjct: 192 TSSLLFGSLAYSNFNPGKIPRTPLIQNPFLPSYYYLALKGITVGYTLLPIPEFAFQLGKD 251

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP---ETF 264
           G GG I+DSG+ +TY   D +  L   F+S  E  Q+A  S     + LC+ LP      
Sbjct: 252 GSGGMILDSGTTITYLQEDAFDVLKNAFISQTE-LQVANSSTT--GLDLCFHLPVKNAAE 308

Query: 265 NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
            + P + F+F+  +L +  EN  + D E     LA+     L ++ G+ QQ++   ++DL
Sbjct: 309 VKVPKLIFHFKGLDLALPVENYMVSDPEMGLICLAIDATGSL-SIFGNIQQQNMLVLHDL 367

Query: 325 NIDLLSFVKENC 336
               LS V   C
Sbjct: 368 KKSTLSLVPTQC 379


>gi|224079535|ref|XP_002305886.1| predicted protein [Populus trichocarpa]
 gi|222848850|gb|EEE86397.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 85.9 bits (211), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/271 (29%), Positives = 119/271 (43%), Gaps = 55/271 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IGTP +   +ILDTGS L +               +FDP  SSSF  + C+HP C
Sbjct: 78  LVSLPIGTPPQSQQMILDTGSQLSWIQCHKKVPRKPPPSTVFDPSLSSSFSVLPCNHPLC 137

Query: 47  TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                       C +N  C Y+  YAD ++ +G    E I+         +    + GC+
Sbjct: 138 KPRIPDFTLPTSCDLNRLCHYSYFYADGTLAEGNLVREKITFSTSQSTPPL----ILGCA 193

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
                  EDA D    G+LG++   +SF SQ   I K  FSYC+    P  +    +   
Sbjct: 194 -------EDASDDK--GILGMNLGRLSFASQ-AKITK--FSYCV----PTRQVRPGFTPT 237

Query: 158 GTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDIT 205
           G+      P++   ++I+          PN     + ++L+ I I N+++N P   F   
Sbjct: 238 GSFYLGENPNSAGFQYISLLTFSQSQRMPNLDPLAHTVALQGIRIGNKKLNIPVSAFRAD 297

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
            SG G  +IDSGS  TY     Y K+ E+ V
Sbjct: 298 PSGAGQSMIDSGSEFTYLVDVAYNKVREEVV 328


>gi|159464048|ref|XP_001690254.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
 gi|158284242|gb|EDP09992.1| pepsin-type aspartyl protease [Chlamydomonas reinhardtii]
          Length = 485

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 90/367 (24%), Positives = 148/367 (40%), Gaps = 54/367 (14%)

Query: 4   LFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKI-------NCD 42
           L +GTP +   +I+DTGS + Y                FDP KS++ +K+       NC 
Sbjct: 17  LKLGTPERTFSVIIDTGSTITYIPCKDCSHCGKHTAEWFDPDKSTTAKKLACGDPLCNCG 76

Query: 43  HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            P CT   C N++C Y+  YA++S ++G+   +T           +    +FGC N   G
Sbjct: 77  TPSCT---CNNDRCYYSRTYAERSSSEGWMIEDTFGFPDSDSPVRL----VFGCENGETG 129

Query: 103 FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
             E  R  A  G++G+     +F SQL    +I+  FS C   P          L  G  
Sbjct: 130 --EIYRQMA-DGIMGMGNNHNAFQSQLVQRKVIEDVFSLCFGYP------KDGILLLGDV 180

Query: 161 MGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 +T  T  + H +  YY + +  I+++ + + F    FD       G ++DSG+ 
Sbjct: 181 TLPEGANTVYTPLLTHLHLHYYNVKMDGITVNGQTLAFDASVFDRGY----GTVLDSGTT 236

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDC-PEPIQLCYFLPETFNRFPSMAFYFEDAN 278
            TY  +D +  + +    Y E+  L       P+   +C+      ++F  +  YF  A 
Sbjct: 237 FTYLPTDAFKAMAKAVGDYVEKKGLQSTPGADPQYNDICW--KGAPDQFKDLDKYFPPAE 294

Query: 279 LRIDGENVFIIDYENHFFL-------LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
               G     +    + FL       L +  + +  AL+G    RD    YD     + F
Sbjct: 295 FVFGGGAKLTLPPLRYLFLSKPAEYCLGIFDNGNSGALVGGVSVRDVVVTYDRRNSKVGF 354

Query: 332 VKENCSD 338
               C+D
Sbjct: 355 TTMACAD 361


>gi|242053991|ref|XP_002456141.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
 gi|241928116|gb|EES01261.1| hypothetical protein SORBIDRAFT_03g031170 [Sorghum bicolor]
          Length = 519

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 105/422 (24%), Positives = 158/422 (37%), Gaps = 98/422 (23%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------------------------- 26
           VR  +GTP++  LL+ DTGS L +                                    
Sbjct: 109 VRFRVGTPARPFLLVADTGSDLTWVKCHRHDHDAPAPGYGYAAPASNDSSTSSLSAAAAS 168

Query: 27  ------IFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCVYTMKYADQSVTKGFA 72
                 +F P +S ++  I C    CT                C Y  +Y D S  +G  
Sbjct: 169 SSSHARVFRPDRSRTWAPIPCSSDTCTASLPFSLAACPTPGSPCAYDYRYKDGSAARGTV 228

Query: 73  AHE--TISVIGKGEGK----AIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFI 126
             +  TI++ G+G  K    A   G + GC+    G    A DG    VL L    ISF 
Sbjct: 229 GTDSATIALSGRGAKKKQRQAKLRGVVLGCTTSYTGDSFLASDG----VLSLGYSNISFA 284

Query: 127 SQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD--MGYRRPST---------------- 168
           S+  +    RFSYCLV  L     T SYL FG +  +    PS                 
Sbjct: 285 SRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSSPPSKTACAGGGSPAAAPPGP 343

Query: 169 ----QATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
               Q    ++H    FY +++  IS+D E +  P   +D  V+  GG I+DSG+ LT  
Sbjct: 344 GGARQTPLLLDHRMRPFYAVTVNGISVDGELLRIPRLVWD--VAKGGGAILDSGTSLTVL 401

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFL--PETFN----RFPSMAFYFED 276
            S  Y       V+   + +LA L     +P   CY    P T        P +A +F  
Sbjct: 402 VSPAY----RAVVAALNK-KLAGLPRVTMDPFDYCYNWTSPSTGEDLTVAMPELAVHFAG 456

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           +         ++ID       + +   +   V++IG+  Q++  + +DL    L F +  
Sbjct: 457 SARLQPPAKSYVIDAAPGVKCIGLQEGEWPGVSVIGNILQQEHLWEFDLKNRRLRFKRSR 516

Query: 336 CS 337
           C+
Sbjct: 517 CT 518


>gi|302821814|ref|XP_002992568.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
 gi|300139637|gb|EFJ06374.1| hypothetical protein SELMODRAFT_46291 [Selaginella moellendorffii]
          Length = 368

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 157/381 (41%), Gaps = 59/381 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------IFDPRKSSSFQKINCDHPDCTYFK--- 50
           ++L IG+  K +  I+DTGS  +          +FDP  S S++++ C    C   +   
Sbjct: 1   MQLGIGSLQKNLSAIIDTGSEAVLVQCGSRSRPVFDPAASQSYRQVPCISQLCLAVQQQT 60

Query: 51  -------CVNEQ--CVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGALFGCSND 99
                  CVN    C Y++ Y D   + G  + + I  +          F    FGC++ 
Sbjct: 61  SNGSSQPCVNSSAACTYSLSYGDSRNSTGDFSQDVIFLNSTNSSSQAVQFRDVAFGCAHS 120

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLKFG 158
             GF  D   G+L G++G +R  +S  SQL   +   +FSYC   P    +  ++ + F 
Sbjct: 121 PQGFLVDL--GSL-GIVGFNRGNLSLPSQLKDRLGGSKFSYCF--PSQPWQPRATGVIFL 175

Query: 159 TDMGYRRPSTQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS-GEGGC 212
            D G  +     T  +++P     +  YY+ L  IS+D + +  P   F +  S G+GG 
Sbjct: 176 GDSGLSKSKVSYTPLLDNPVTPARSQLYYVGLTSISVDGKTLAIPESAFKLDPSTGDGGT 235

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSM 270
           ++DSG+  T    D Y      F +   R  L +          CY +    +    P +
Sbjct: 236 VLDSGTTFTRVVDDAYTAFRNAFAAS-NRSGLRKKVGAAAGFDDCYNISAGSSLPGVPEV 294

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL---------------IGSQQQ 315
               ++ N+R++      + +E+ F  ++ A ++  V L               +G+ QQ
Sbjct: 295 RLSLQN-NVRLE------LRFEHLFVPVSAAGNEVTVCLAILSSQKSGFGKINVLGNYQQ 347

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
            +    YD     + F + +C
Sbjct: 348 SNYLVEYDNERSRVGFERADC 368


>gi|359476756|ref|XP_002277082.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 2 [Vitis
           vinifera]
          Length = 560

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/376 (24%), Positives = 162/376 (43%), Gaps = 60/376 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                    ++D + S++   + CD
Sbjct: 157 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 216

Query: 43  HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
              C+ +      C    QC+Y++ Y D S T G+   + +    + G  +        +
Sbjct: 217 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 276

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC N   G +  +   AL G+LG  +   S +SQL S   +KK FS+CL      G + 
Sbjct: 277 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 335

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
              +          P    T  + +  + Y + +K+I +  + ++ P D F+   SG+  
Sbjct: 336 IGEVV--------EPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 383

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
           G IIDSG+ L YF  +VY  L EK +S     +L  +    E    C+ +     + FP+
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 439

Query: 270 MAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           +  +F+ + +L +         E  + I ++N     A       + L+G     +   V
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQHEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLVV 496

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +V+ NCS
Sbjct: 497 YDLEKQGIGWVEYNCS 512


>gi|297828736|ref|XP_002882250.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328090|gb|EFH58509.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 488

 Score = 85.9 bits (211), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 86/377 (22%), Positives = 157/377 (41%), Gaps = 64/377 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----IFDPRKS-------------SSFQKINCDH 43
            ++ +GTPS+   + +DTGS +++      I  PRKS             S+ + ++C  
Sbjct: 87  AKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDADASSTAKSVSCSD 146

Query: 44  PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
             C+Y    +E      C Y + Y D S T G+   + +    V G  +  +     +FG
Sbjct: 147 NFCSYVNQRSECHSGSTCQYVILYGDGSSTNGYLVRDVVHLDLVTGNRQTGSTNGTIIFG 206

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSS 153
           C +   G   +++  A+ G++G  +   SFISQL S   +K+ F++CL     +      
Sbjct: 207 CGSKQSGQLGESQ-AAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL-----DNNNGGG 260

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-GGC 212
               G  +    P  + T  ++   + Y ++L  I + N  +    D FD   SG+  G 
Sbjct: 261 IFAIGEVV---SPKVKTTPMLSKSAH-YSVNLNAIEVGNSVLQLSSDAFD---SGDDKGV 313

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           IIDSG+ L Y    VY  L  + ++  +   L  + D       C+   +  +RFP++ F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLMNQILASHQELNLHTVQDSFT----CFHYIDRLDRFPTVTF 369

Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
            F+ +             +R   E+ +   ++N             + ++G     +   
Sbjct: 370 QFDKSVSLAVYPQEYLFQVR---EDTWCFGWQNGGLQTKGGAS---LTILGDMALSNKLV 423

Query: 321 VYDLNIDLLSFVKENCS 337
           VYD+   ++ +   NCS
Sbjct: 424 VYDIENQVIGWTNHNCS 440


>gi|77555282|gb|ABA98078.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
          Length = 409

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 77/294 (26%), Positives = 135/294 (45%), Gaps = 28/294 (9%)

Query: 46  CTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +YF  V  QC         + T G+ A +T +      G     G +FGCS+ ++G   
Sbjct: 109 TSYF--VWAQCAPLTYGGSAANTSGYLATDTFTF-----GATAVPGVVFGCSDASYGDFA 161

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS-SYLKFGTDMGYR 164
            A     +GV+G+ R  +S ISQL      +FSY L+ P    + ++ S ++FG D   +
Sbjct: 162 GA-----SGVIGIGRGNLSLISQL---QFGKFSYQLLAPEATDDGSADSVIRFGDDAVPK 213

Query: 165 RPSTQATKFIN---HPNNFYYLSLKDISIDNERMN-FPPDTFDITVSGEGGCIIDSGSVL 220
               ++T  ++   +P+ FYY++L  + +D  R++  P  TFD+  +G GG I+ S + +
Sbjct: 214 TKRGRSTPLLSSTLYPD-FYYVNLTGVRVDGNRLDAIPAGTFDLRANGTGGVILSSTTPV 272

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFN-RFPSMAFYFE-DA 277
           TY     Y  +     S   R  L  ++      + LCY        + P +   F+  A
Sbjct: 273 TYLEQAAYDVVRAAVAS---RIGLPAVNGSAALELDLCYNASSMAKVKVPKLTLVFDGGA 329

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           ++ +   N F ID +     L + P     +++G+  Q  T  +YD++   L+F
Sbjct: 330 DMDLSAANYFYIDNDTGLECLTMLPSQG-GSVLGTLLQTGTNMIYDVDAGRLTF 382


>gi|242094478|ref|XP_002437729.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
 gi|241915952|gb|EER89096.1| hypothetical protein SORBIDRAFT_10g001440 [Sorghum bicolor]
          Length = 486

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 90/358 (25%), Positives = 143/358 (39%), Gaps = 56/358 (15%)

Query: 7   GTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYFK----- 50
           G+ S  V ++LDT   + +           A +DP +SS++    C+   C         
Sbjct: 157 GSSSPPVTVVLDTAGDVPWMRCVPCTFAQCADYDPTRSSTYSAFPCNSSACKQLGRYANG 216

Query: 51  C-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
           C  N QC Y +  A  S T        +  I  G+      G  FGCS +  G  E+  D
Sbjct: 217 CDANGQCQYMVVTAGDSFTTSGTYSSDVLTINSGD---RVEGFRFGCSQNEQGSFENQAD 273

Query: 110 GALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG--YRRPS 167
           G    ++ L R   S ++Q  S     FSYC    LP  E T  + + G  +G  YR  +
Sbjct: 274 G----IMALGRGVQSLMAQTSSTYGDAFSYC----LPPTETTKGFFQIGVPIGASYRFVT 325

Query: 168 TQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           T   K     +      Y   L  I++D + +N P + F        G ++DS +++T  
Sbjct: 326 TPMLKERGGASAAAATLYRALLLAITVDGKELNVPAEVF------AAGTVMDSRTIITRL 379

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYFEDANLRID 282
               Y  L   F +   R+++A      E +  CY L    + R P +A  F       D
Sbjct: 380 PVTAYGALRAAFRNRM-RYRVAPPQ---EELDTCYDLTGVRYPRLPRIALVF-------D 428

Query: 283 GENVFIIDYENHFF--LLAVAPHDD--LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           G  V  +D         LA A +DD    +++G+ QQ+  + ++D+    + F    C
Sbjct: 429 GNAVVEMDRSGILLNGCLAFASNDDDSSPSILGNVQQQTIQVLHDVGGGRIGFRSAAC 486


>gi|297848856|ref|XP_002892309.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297338151|gb|EFH68568.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 484

 Score = 85.5 bits (210), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 93/377 (24%), Positives = 159/377 (42%), Gaps = 60/377 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
            ++ IGTP+K   + +DTGS +++                    +++  +S S + ++CD
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 43  HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              C          C  N  C Y   Y D S T G+   + +   SV G  + +      
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G  + + + AL G+LG  +   S ISQL S   +KK F++CL     +G  
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + N P+  Y +++  + +  E +N P D F       
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLNIPADLFQ--PGDR 309

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L Y    +Y  L +K  S     ++  +    +  Q    + E    FP+
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEG---FPN 366

Query: 270 MAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRF 320
           + F+FE++  LR+         E ++ I ++N     A+   D   + L+G     +   
Sbjct: 367 VTFHFENSVFLRVYPHDYLFPYEGMWCIGWQNS----AMQSRDRRNMTLLGDLVLSNKLV 422

Query: 321 VYDLNIDLLSFVKENCS 337
           +YDL   L+ + + NCS
Sbjct: 423 LYDLENQLIGWTEYNCS 439


>gi|359476206|ref|XP_002262837.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 462

 Score = 85.5 bits (210), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 95/353 (26%), Positives = 143/353 (40%), Gaps = 48/353 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFK---------- 50
           +V +  G P + + LI+DTGS   +      + +S    NC +     F           
Sbjct: 130 LVNVGFGKPQQNLNLIIDTGSDTTWI-----RCNSCSLGNCHNKKIPTFNPSLSSSYSNR 184

Query: 51  -CV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDAR 108
            C+ + +  YTM Y D S +KG    + +++             +F       G      
Sbjct: 185 SCIPSTKTNYTMNYEDNSYSKGVFVCDEVTL----------KPDVFPKFQFGCGDSGGGD 234

Query: 109 DGALAGVLGLSR-VTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
            G+ +GVLGL++    S ISQ  S  KK+FSYC     P+ E T   L FG       PS
Sbjct: 235 FGSASGVLGLAQGEQYSLISQTASKFKKKFSYC----FPHNENTRGSLLFGEKAISASPS 290

Query: 168 TQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
            + T+ +N    + Y++ L  IS+  +R+N     F        G IIDSG+V+T+  + 
Sbjct: 291 LKFTRLLNPSSGSVYFVELIGISVAKKRLNVSSSLF-----ASPGTIIDSGTVITHLPTA 345

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNR---FPSMAFYF---EDAN 278
            Y  L   F    E      +S  P+  P+  CY L     R    P +  +F    D +
Sbjct: 346 AYEALRTAFQQ--EMLHCPSVSPPPQEKPLDTCYNLKGCGGRNIKLPEIVLHFVGEVDVS 403

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           L   G      D        A   H   V +IG++QQ   + VYD+    L F
Sbjct: 404 LHPSGILWANGDLTQACLAFARKSHPSHVTIIGNRQQVSLKVVYDIEGGRLGF 456


>gi|297735249|emb|CBI17611.3| unnamed protein product [Vitis vinifera]
          Length = 480

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 93/377 (24%), Positives = 162/377 (42%), Gaps = 61/377 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                    ++D + S++   + CD
Sbjct: 76  AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 135

Query: 43  HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
              C+ +      C    QC+Y++ Y D S T G+   + +    + G  +        +
Sbjct: 136 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 195

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC N   G +  +   AL G+LG  +   S +SQL S   +KK FS+CL      G + 
Sbjct: 196 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 254

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
              +          P    T  + +  + Y + +K+I +  + ++ P D F+   SG+  
Sbjct: 255 IGEV--------VEPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 302

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
           G IIDSG+ L YF  +VY  L EK +S     +L  +    E    C+ +     + FP+
Sbjct: 303 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 358

Query: 270 MAFYFEDA-NLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
           +  +F+ + +L +          E  + I ++N     A       + L+G     +   
Sbjct: 359 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLV 415

Query: 321 VYDLNIDLLSFVKENCS 337
           VYDL    + +V+ NCS
Sbjct: 416 VYDLEKQGIGWVEYNCS 432


>gi|449440014|ref|XP_004137780.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
 gi|449483406|ref|XP_004156582.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 449

 Score = 85.1 bits (209), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 84/332 (25%), Positives = 142/332 (42%), Gaps = 71/332 (21%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT- 47
           V L +GTP + V +++DTGS L +             + F+P  SSS+  I C    CT 
Sbjct: 75  VSLTVGTPPQNVTMVIDTGSELSWLHCNTSQNSSSSSSTFNPVWSSSYSPIPCSSSTCTD 134

Query: 48  -------YFKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                     C + Q C  T+ YAD S ++G  A +T  +     G +     +FGC + 
Sbjct: 135 QTRDFPIRPSCDSNQFCHATLSYADASSSEGNLATDTFYI-----GSSGIPNVVFGCMDS 189

Query: 100 --NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             +   +ED+++    G++G++R ++SF+SQ+G     +FSYC+       EY  S L  
Sbjct: 190 IFSSNSEEDSKN---TGLMGMNRGSLSFVSQMG---FPKFSYCI------SEYDFSGLLL 237

Query: 158 GTD--------MGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
             D        + Y      +T         Y + L+ I + ++ +  P   F+   +G 
Sbjct: 238 LGDANFSWLAPLNYTPLIEMSTPLPYFDRVAYTVQLEGIKVAHKLLPIPESVFEPDHTGA 297

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVS-------YFER----FQLAQLSDCPEPIQLCY 258
           G  ++DSG+  T+     Y  L + F++        +E     FQ A        + LCY
Sbjct: 298 GQTMVDSGTQFTFLLGPAYTALRDHFLNKTAGSLRVYEDSNFVFQGA--------MDLCY 349

Query: 259 FLPETFNR---FPSMAFYFEDANLRIDGENVF 287
            +P    R    PS+   F  A + + G+ + 
Sbjct: 350 RVPTNQTRLPPLPSVTLVFRGAEMTVTGDRIL 381


>gi|356508308|ref|XP_003522900.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 80/358 (22%), Positives = 145/358 (40%), Gaps = 39/358 (10%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IG+P + +LL +DT +   +            +F P KS++F+ ++C  P C   
Sbjct: 99  IVRAKIGSPPQTLLLAMDTSNDAAWIPCTACDGCTSTLFAPEKSTTFKNVSCGSPQCNQV 158

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C + + Y   S+       +T+++              FGC     G    
Sbjct: 159 PNPSCGTSACTFNLTYGSSSIAANVV-QDTVTL-----ATDPIPDYTFGCVAKTTGASAP 212

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +S +SQ  ++ +  FSYCL  P       S  L+ G      R 
Sbjct: 213 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPIR- 264

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L  I +  + ++ PP+      +   G + DSG+V T   
Sbjct: 265 -IKYTPLLKNPRRSSLYYVNLVAIRVGRKVVDIPPEALAFNAATGAGTVFDSGTVFTRLV 323

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           +  Y  + ++F         A L+         CY +P      P++ F F   N+ +  
Sbjct: 324 APAYTAVRDEFQRRVAIAAKANLTVTSLGGFDTCYTVPIV---APTITFMFSGMNVTLPE 380

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+    L   +E C+
Sbjct: 381 DNILIHSTAGSTTCLAMASAPDNVNSVLNVIANMQQQNHRVLYDVPNSRLGVARELCT 438


>gi|326499093|dbj|BAK06037.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 471

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 94/382 (24%), Positives = 154/382 (40%), Gaps = 68/382 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------------------AIFDPRKSSSF 36
           ++ + IGTP   ++ I DTGS LI+                          FDP KS++F
Sbjct: 101 LMAVNIGTPPTRMVAIADTGSDLIWLNCSYGGDGPGLAAARDADAQPPGVQFDPSKSTTF 160

Query: 37  QKINCDHPDCTYF---KC-VNEQCVYTMKYADQSVTKGFAAHETISVI----GKGEGKAI 88
           + ++CD   C+      C  + +C Y+  Y D S T G  + ET +       +G+G   
Sbjct: 161 RLVDCDSVACSELPEASCGADSKCRYSYSYGDGSHTSGVLSTETFTFADAPGARGDGTTT 220

Query: 89  FHGAL-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPL 145
               + FGCS    G           G++GL    +S +SQLG  + + +RFSYCLV   
Sbjct: 221 RVANVNFGCSTTFVG------SSVGDGLVGLGGGDLSLVSQLGADTSLGRRFSYCLV--- 271

Query: 146 PNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
           P     SS L FG       P    T  I +    +Y + L+ + + N+    P      
Sbjct: 272 PYSVKASSALNFGPRAAVTDPGAVTTPLIPSQVKAYYIVELRSVKVGNKTFEAP------ 325

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF 264
                   I+DSG+ LT+    +   L ++      R +L         + LC+ +  + 
Sbjct: 326 ---DRSPLIVDSGTTLTFLPEALVDPLVKELTG---RIKLPPAQSPERLLPLCFDV--SG 377

Query: 265 NRFPSMAFYFEDANLRIDGENVFIIDYENHF-------FLLAVAPHDDL--VALIGSQQQ 315
            R   +A    D  + + G     +  EN F         LAV+   +    ++IG+  Q
Sbjct: 378 VREGQVAAMIPDVTVGLGGGAAVTLKAENTFVEVQEGTLCLAVSAMSEQFPASIIGNIAQ 437

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
           ++    YDL+   ++F    C+
Sbjct: 438 QNMHVGYDLDKGTVTFAPAACA 459


>gi|357128280|ref|XP_003565802.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 530

 Score = 84.7 bits (208), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 85/341 (24%), Positives = 138/341 (40%), Gaps = 40/341 (11%)

Query: 28  FDPRKSSSFQKINCDHPD-CTYF---KCV----NEQCVYTMKYADQSVTKGFAAHET--- 76
           + P  SSS+++  C   D C  F    C     NE C Y   Y D +VT+G    ET   
Sbjct: 175 YRPSLSSSWRRYRCSQKDACGSFPHNTCRSPNHNESCSYEQMYEDGTVTRGIYGRETATV 234

Query: 77  -ISVIGKGEGKA--IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
            +SV G GEG+   +  G + GCS    G   DA D    GVL L    +SF +   +  
Sbjct: 235 PVSVSGAGEGQTAVLLPGLVLGCSTFEAGATVDAHD----GVLTLGNHAVSFGTVAAARF 290

Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISID 191
             RFS+CL+  + +G  T SYL FG +      + + T  +  P+    +   +  + +D
Sbjct: 291 GGRFSFCLLHTM-SGRDTFSYLTFGPNPALNGGAMEETNLVYSPDGEPAFGAGVTGVFVD 349

Query: 192 NERM-NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDC 250
            ER+   PP+ +D  V G G   +D+G+ LT      +  +           Q   ++  
Sbjct: 350 GERLAGIPPEVWDPAVLG-GALNLDTGTSLTGLVEPAFEAVRAAVDRRLGHLQKEDVAG- 407

Query: 251 PEPIQLCYFL------------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFL 297
                +CY              P      P +AF FE  A L      + + +       
Sbjct: 408 ---FDICYKWAFGAGAGDEGVDPAHNVTVPKVAFEFEGGARLEPVARGIVLPEVVPGVAC 464

Query: 298 LAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           L     +   +++G+   ++  + +D     L F K+ C++
Sbjct: 465 LGFRRREVGPSVLGNVHMQEHVWEFDHMAGKLRFRKDKCTN 505


>gi|356542355|ref|XP_003539632.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 444

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 95/360 (26%), Positives = 165/360 (45%), Gaps = 41/360 (11%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +GTP   +L I+DTGS +I+               IFDP +S +++ + C    C   + 
Sbjct: 100 VGTPPFQILGIVDTGSDIIWLQCQPCEDCYNQTTPIFDPSQSKTYKTLPCSSNICQSVQS 159

Query: 52  V------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNHG-F 103
                  N++C YT+ Y D S ++G  + ET++ +G  +G ++ F   + GC ++N G F
Sbjct: 160 AASCSSNNDECEYTITYGDNSHSQGDLSVETLT-LGSTDGSSVQFPKTVIGCGHNNKGTF 218

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
             +       G   +S ++    S  G     +FSYCL  PL +   +SS L FG +   
Sbjct: 219 QREGSGIVGLGGGPVSLISQLSSSIGG-----KFSYCLA-PLFSQSNSSSKLNFGDEAVV 272

Query: 164 RRPSTQATKFINHPNN---FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
               T +T  +  P N   FY+L+L+  S+ + R+ F   +   +  GEG  IIDSG+ L
Sbjct: 273 SGRGTVSTPIV--PKNGLGFYFLTLEAFSVGDNRIEF-GSSSFESSGGEGNIIIDSGTTL 329

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANL 279
           T    D Y  L        E   L ++ D  + ++LCY    +     P +  +F+ A++
Sbjct: 330 TILPEDDYLNLESAVADAIE---LERVEDPSKFLRLCYRTTSSDELNVPVITAHFKGADV 386

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
            ++  + F I+ +      A      +  + G+  Q++    YDL    +SF   +C+ +
Sbjct: 387 ELNPISTF-IEVDEGVVCFAFR-SSKIGPIFGNLAQQNLLVGYDLVKQTVSFKPTDCTQE 444


>gi|15229663|ref|NP_190574.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|6522926|emb|CAB62113.1| putative protein [Arabidopsis thaliana]
 gi|53828539|gb|AAU94379.1| At3g50050 [Arabidopsis thaliana]
 gi|55733749|gb|AAV59271.1| At3g50050 [Arabidopsis thaliana]
 gi|332645100|gb|AEE78621.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 632

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 94/367 (25%), Positives = 151/367 (41%), Gaps = 53/367 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SS++Q + C+  DC 
Sbjct: 95  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPEMSSTYQPVKCNM-DC- 152

Query: 48  YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +  EQCVY  +YA+ S +KG    + IS     E +     A+FGC     G   
Sbjct: 153 --NCDDDREQCVYEREYAEHSSSKGVLGEDLISF--GNESQLTPQRAVFGCETVETG--- 205

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
           D       G++GL +  +S + QL    +I   F  C   + +  G        + +DM 
Sbjct: 206 DLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMV 265

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +       +        +Y + L  I +  ++++     FD    GE G ++DSG+   Y
Sbjct: 266 FTDSDPDRSP-------YYNIDLTGIRVAGKQLSLHSRVFD----GEHGAVLDSGTTYAY 314

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--------IQLCYFLPETFNRFPSMAFYF 274
                +    E  +   E   L Q+ D P+P        +    ++ E    FPS+   F
Sbjct: 315 LPDAAFAAFEEAVMR--EVSTLKQI-DGPDPNFKDTCFQVAASNYVSELSKIFPSVEMVF 371

Query: 275 EDA-NLRIDGENVFIIDYENH-FFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           +   +  +  EN      + H  + L V P+  D   L+G    R+T  VYD     + F
Sbjct: 372 KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGF 431

Query: 332 VKENCSD 338
            + NCS+
Sbjct: 432 WRTNCSE 438


>gi|388495448|gb|AFK35790.1| unknown [Medicago truncatula]
          Length = 434

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 143/359 (39%), Gaps = 44/359 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V+   GTP + +LL LDT S   +              F P KS+SF+ ++C  P C  
Sbjct: 98  IVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQ 157

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C +   Y   S+       +T+++           G  FGC N   G   
Sbjct: 158 VPNPTCGGSACAFNFTYGSSSIAAS-VVQDTLTLAADP-----IPGYTFGCVNKTTGSSA 211

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
             +           R  +S +SQ  ++ K  FSYCL  P       S  L+ G     +R
Sbjct: 212 PQQGLLGL-----GRGPLSLLSQSQNLYKSTFSYCL--PSFKSINFSGSLRLGPVYQPKR 264

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T  + +P  ++ YY++L  I +  + ++ PP       +   G I DSG+V T  
Sbjct: 265 --IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRL 322

Query: 224 HSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
              VY  +  +F      +  +  L         CY +P      P++ F F   N+ + 
Sbjct: 323 AEPVYTAVRNEFRRRVGPKLPVTTLGG----FDTCYNVPIV---VPTITFLFSGMNVALP 375

Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +N+ I         LA+A   D    ++ +I + QQ++ R ++D+    +   +E C+
Sbjct: 376 PDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434


>gi|356514298|ref|XP_003525843.1| PREDICTED: aspartic proteinase-like protein 1-like [Glycine max]
          Length = 663

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 92/368 (25%), Positives = 148/368 (40%), Gaps = 56/368 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS + Y                F P  SS++Q + C   DC 
Sbjct: 114 TRLWIGTPPQMFALIVDTGSTVTYVPCSTCEQCGRHQDPKFQPESSSTYQPVKCTI-DC- 171

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C  +  QCVY  +YA+ S + G    + IS   + E       A+FGC N   G   
Sbjct: 172 --NCDGDRMQCVYERQYAEMSTSSGVLGEDVISFGNQSELAP--QRAVFGCENVETG--- 224

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD-MG 162
           D       G++GL R  +S + QL    +I   FS C         Y    +  G   +G
Sbjct: 225 DLYSQHADGIMGLGRGDLSIMDQLVDKKVISDSFSLC---------YGGMDVGGGAMVLG 275

Query: 163 YRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P +  T   + P+   +Y + LK++ +  +R+    + FD    G+ G ++DSG+  
Sbjct: 276 GISPPSDMTFAYSDPDRSPYYNIDLKEMHVAGKRLPLNANVFD----GKHGTVLDSGTTY 331

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
            Y     +    +  V   +   L Q+S  P+P           N    ++  F   ++ 
Sbjct: 332 AYLPEAAFLAFKDAIVKELQ--SLKQISG-PDPNYNDICFSGAGNDVSQLSKSFPVVDMV 388

Query: 281 IDGENVFIIDYENHFF----------LLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
               + + +  EN+ F          L      +D   L+G    R+T  +YD     + 
Sbjct: 389 FGNGHKYSLSPENYMFRHSKVRGAYCLGIFQNGNDQTTLLGGIIVRNTLVMYDREQTKIG 448

Query: 331 FVKENCSD 338
           F K NC++
Sbjct: 449 FWKTNCAE 456


>gi|242084336|ref|XP_002442593.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
 gi|241943286|gb|EES16431.1| hypothetical protein SORBIDRAFT_08g022613 [Sorghum bicolor]
          Length = 482

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 88/325 (27%), Positives = 135/325 (41%), Gaps = 43/325 (13%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYFK----CVN--EQCVYTMKYADQSVTKGFAAHETISVI 80
           +++  KSSS   + C  P C        CV    +C Y ++Y D S + G    ET++  
Sbjct: 171 VYNRLKSSSASDVGCYAPACRALGSSGGCVQFLNECQYKVEYGDGSSSAGDFGVETLTF- 229

Query: 81  GKGEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
                     G   GC +DN G F   A     AG+LGL R ++SF SQ+     + FSY
Sbjct: 230 ---PPGVRVPGVAIGCGSDNQGLFPAPA-----AGILGLGRGSLSFPSQIAGRYGRSFSY 281

Query: 140 CLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNER 194
           CL      G   SS L FG+       +T    F     N     FYY+ L  IS+   R
Sbjct: 282 CLAGQGTGGR--SSTLTFGSGASATTTTTTPPSFTPMLTNSRMYTFYYVGLVGISVGGVR 339

Query: 195 MNFPPDTFDITV---SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
           +    ++ D+ +   +G GG I+DSG+ +T      Y    + F     R    +    P
Sbjct: 340 VRGVTES-DLRLDPSTGHGGVIVDSGTAVTRLSGPAYAAFRDAF-----RVAAVKELGWP 393

Query: 252 EP------IQLCY--FLPETFNRFPSMAFYFEDA-NLRIDGENVFI-IDYENHFFLLAVA 301
            P         CY         + P+++ +F     +++  +N  I +D        A A
Sbjct: 394 SPGGPFAFFDTCYSSVRGRVMKKVPAVSMHFAGGVEVKLPPQNYLIPVDSNKGTMCFAFA 453

Query: 302 PHDDL-VALIGSQQQRDTRFVYDLN 325
              D  V++IG+ Q +  R VYD++
Sbjct: 454 GSGDRGVSIIGNIQLQGFRVVYDVD 478


>gi|224142007|ref|XP_002324352.1| predicted protein [Populus trichocarpa]
 gi|222865786|gb|EEF02917.1| predicted protein [Populus trichocarpa]
          Length = 460

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
           +V + +GTP K   LI DTGS + +                  +P  S+S++ I+C    
Sbjct: 120 VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 179

Query: 46  CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C +  C+Y ++Y D S + GF A ET+++        +F   LFGC 
Sbjct: 180 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 235

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N+G    A            R  ++  SQ     KK FSYC    LP    +  YL  
Sbjct: 236 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 286

Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G  +      T  +  F + P  FY L +  +S+   +++     F        G +IDS
Sbjct: 287 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRKLSIDESAF------SAGTVIDS 338

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
           G+V+T      Y +L   F +         ++D P          CY F      R P +
Sbjct: 339 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 390

Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
              F+    + ID   +           LA A +DD    ++ G+ QQR  + VYD    
Sbjct: 391 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 450

Query: 328 LLSFVKENCS 337
            + F    CS
Sbjct: 451 RVGFAPGGCS 460


>gi|224142005|ref|XP_002324351.1| predicted protein [Populus trichocarpa]
 gi|222865785|gb|EEF02916.1| predicted protein [Populus trichocarpa]
          Length = 472

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
           +V + +GTP K   LI DTGS + +                  +P  S+S++ I+C    
Sbjct: 132 VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 191

Query: 46  CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C +  C+Y ++Y D S + GF A ET+++        +F   LFGC 
Sbjct: 192 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 247

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N+G    A            R  ++  SQ     KK FSYC    LP    +  YL  
Sbjct: 248 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 298

Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G  +      T  +  F + P  FY L +  +S+   +++     F        G +IDS
Sbjct: 299 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRKLSIDESAF------SAGTVIDS 350

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
           G+V+T      Y +L   F +         ++D P          CY F      R P +
Sbjct: 351 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 402

Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
              F+    + ID   +           LA A +DD    ++ G+ QQR  + VYD    
Sbjct: 403 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 462

Query: 328 LLSFVKENCS 337
            + F    CS
Sbjct: 463 RVGFAPGGCS 472


>gi|356546376|ref|XP_003541602.1| PREDICTED: probable aspartic protease At2g35615-like [Glycine max]
          Length = 450

 Score = 84.7 bits (208), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/360 (25%), Positives = 166/360 (46%), Gaps = 38/360 (10%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYF-- 49
           +GTP   +L ++DTGS + +               IFDP KS +++ + C    C     
Sbjct: 103 VGTPPFEILGVVDTGSGITWMQCQRCEDCYEQTTPIFDPSKSKTYKTLPCSSNMCQSVIS 162

Query: 50  --KCVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGCSNDNHGFD 104
              C +++  C YT+KY D S ++G  + ET++ +G   G ++ F   + GC ++N G  
Sbjct: 163 TPSCSSDKIGCKYTIKYGDGSHSQGDLSVETLT-LGSTNGSSVQFPNTVIGCGHNNKGTF 221

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
           +    G +    G   +     S +G     +FSYCL  P+ +   +SS L FG      
Sbjct: 222 QGEGSGVVGLGGGPVSLISQLSSSIGG----KFSYCLA-PMFSQSNSSSKLNFGDAAVVS 276

Query: 165 RPSTQATKFINHPNN--FYYLSLKDISIDNERMNF-PPDTFDITVSGEGGCIIDSGSVLT 221
                +T  ++   +  FYYL+L+  S+ ++R+ F    +   + +GEG  IIDSG+ LT
Sbjct: 277 GLGAVSTPLVSKTGSEVFYYLTLEAFSVGDKRIEFVGGSSSSGSSNGEGNIIIDSGTTLT 336

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFEDANLR 280
               + Y  L        +  Q  ++SD    + LCY   P      P +  +F+ A++ 
Sbjct: 337 LLPQEDYSNLESAVA---DAIQANRVSDPSNFLSLCYQTTPSGQLDVPVITAHFKGADVE 393

Query: 281 IDGENVFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
           ++  + F+   E    ++  A H  ++V++ G+  Q +    YDL    +SF   +C+ +
Sbjct: 394 LNPISTFVQVAEG---VVCFAFHSSEVVSIFGNLAQLNLLVGYDLMEQTVSFKPTDCTQE 450


>gi|359476754|ref|XP_002277058.2| PREDICTED: aspartic proteinase-like protein 2-like isoform 1 [Vitis
           vinifera]
          Length = 561

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/377 (24%), Positives = 162/377 (42%), Gaps = 61/377 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                    ++D + S++   + CD
Sbjct: 157 AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 216

Query: 43  HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
              C+ +      C    QC+Y++ Y D S T G+   + +    + G  +        +
Sbjct: 217 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 276

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC N   G +  +   AL G+LG  +   S +SQL S   +KK FS+CL      G + 
Sbjct: 277 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 335

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE-G 210
              +          P    T  + +  + Y + +K+I +  + ++ P D F+   SG+  
Sbjct: 336 IGEVV--------EPKVNITPLVQNQAH-YNVVMKEIEVGGDPLDVPSDAFE---SGDRK 383

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPS 269
           G IIDSG+ L YF  +VY  L EK +S     +L  +    E    C+ +     + FP+
Sbjct: 384 GTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGNVDDGFPT 439

Query: 270 MAFYFEDA-NLRI--------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
           +  +F+ + +L +          E  + I ++N     A       + L+G     +   
Sbjct: 440 VTLHFDKSISLTVYPHEYLFQVKEFEWCIGWQNSG---AQTKDGKDLTLLGDLVLSNKLV 496

Query: 321 VYDLNIDLLSFVKENCS 337
           VYDL    + +V+ NCS
Sbjct: 497 VYDLEKQGIGWVEYNCS 513


>gi|224142009|ref|XP_002324353.1| predicted protein [Populus trichocarpa]
 gi|222865787|gb|EEF02918.1| predicted protein [Populus trichocarpa]
          Length = 412

 Score = 84.3 bits (207), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/370 (25%), Positives = 142/370 (38%), Gaps = 62/370 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI---------------FDPRKSSSFQKINCDHPD 45
           +V + +GTP K   LI DTGS + +                  +P  S+S++ I+C    
Sbjct: 72  VVTVGLGTPKKEFTLIFDTGSDITWTQCEPCVKTCYKQKEPRLNPSTSTSYKNISCSSAL 131

Query: 46  CTYF--------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C +  C+Y ++Y D S + GF A ET+++        +F   LFGC 
Sbjct: 132 CKLVASGKKFSQSCSSSTCLYQVQYGDGSYSIGFFATETLTL----SSSNVFKNFLFGCG 187

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             N+G    A            R  ++  SQ     KK FSYC    LP    +  YL  
Sbjct: 188 QQNNGLFGGAAGLLGL-----GRTKLALPSQTAKTYKKLFSYC----LPASSSSKGYLSL 238

Query: 158 GTDMGYRRPSTQ-ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G  +      T  +  F + P  FY L +  +S+   +++     F        G +IDS
Sbjct: 239 GGQVSKSVKFTPLSADFDSTP--FYGLDITGLSVGGRQLSIDESAF------SAGTVIDS 290

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-----IQLCY-FLPETFNRFPSM 270
           G+V+T      Y +L   F +         ++D P          CY F      R P +
Sbjct: 291 GTVITRLSPTAYSELSSAFQNL--------MTDYPSTSGYSIFDTCYDFSKYDTVRIPKV 342

Query: 271 AFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDD--LVALIGSQQQRDTRFVYDLNID 327
              F+    + ID   +           LA A +DD    ++ G+ QQR  + VYD    
Sbjct: 343 GVTFKGGVEMDIDVSGILYPVNGLKKVCLAFAGNDDDSDTSIFGNVQQRTYQVVYDGAKG 402

Query: 328 LLSFVKENCS 337
            + F    CS
Sbjct: 403 RVGFAPGGCS 412


>gi|357507803|ref|XP_003624190.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499205|gb|AES80408.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 476

 Score = 84.3 bits (207), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 93/378 (24%), Positives = 156/378 (41%), Gaps = 62/378 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G+P+K   + +DTGS +++                    ++DP  S +   + C 
Sbjct: 74  TKVGLGSPAKEFYVQVDTGSDILWVNCAGCTACPKKSGLGMDLTLYDPNGSKTSNAVPCG 133

Query: 43  HPDCT------YFKCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              CT         C  +  C Y++ Y D S T G   +++++   V G    K      
Sbjct: 134 DGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGSFVNDSLTFDEVSGNLHTKPDNSSV 193

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G      D AL G++G  +   S +SQL +   +K+ FS+CL     +  +
Sbjct: 194 IFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQLAASGKVKRIFSHCL-----DSHH 248

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
                  G  M  +  +T     + H    Y + LKD+ +D E +  P   FD   SG G
Sbjct: 249 GGGIFSIGQVMEPKFNTTPLVPRMAH----YNVILKDMDVDGEPILLPLYLFD---SGSG 301

Query: 211 -GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETFNR-F 267
            G IIDSG+ L Y    +Y +L  K +      +L  + D     Q  C+   +  +  F
Sbjct: 302 RGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLMIVED-----QFTCFHYSDKLDEGF 356

Query: 268 PSMAFYFEDANLRID--------GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
           P + F+FE  +L +          E+++ I ++            DL+ LIG     +  
Sbjct: 357 PVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSS--TQTKEGRDLI-LIGDLVLSNKL 413

Query: 320 FVYDLNIDLLSFVKENCS 337
            VYDL   ++ +   NCS
Sbjct: 414 VVYDLENMVIGWTNFNCS 431


>gi|302764208|ref|XP_002965525.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
 gi|300166339|gb|EFJ32945.1| hypothetical protein SELMODRAFT_406966 [Selaginella moellendorffii]
          Length = 464

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 83/350 (23%), Positives = 156/350 (44%), Gaps = 33/350 (9%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRK---SSSFQKINCDHPDCTY--FKCVNE-QCVYT 59
           +G+P K   L++DTGS L +   DP     SS+F ++  +    TY    C ++ +    
Sbjct: 130 LGSPPKDFSLVMDTGSDLTWVRCDPCSPDCSSTFDRLASN----TYKALTCADDLRLPVL 185

Query: 60  MKYADQSVTKGFAAHETISVIGKGEGK-AIFHGALFGCSNDNHGFDEDARDGALAGVLGL 118
           ++   +    G +  +T+ + G    +   F G +FGC +   G           G+L L
Sbjct: 186 LRLWRRLFHSGRSLRDTLKMAGAASDELEEFPGFVFGCGSLLKGLISGE-----VGILAL 240

Query: 119 SRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG------TDMGYRRPSTQATK 172
           S  ++SF SQ+G     +FSYCL+          S + FG       + G  +P      
Sbjct: 241 SPGSLSFPSQIGEKYGNKFSYCLLRQTAQNSLKKSPMVFGEAAVELKEPGSGKPQELQYT 300

Query: 173 FINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG-CIIDSGSVLTYFHSDVYWKL 231
            I   + +Y + L  IS+ N+R++  P TF   ++G+    I DSG+ LT   S V   +
Sbjct: 301 PIGESSIYYTVRLDGISVGNQRLDLSPSTF---LNGQDKPTIFDSGTTLTMLPSGVCDSI 357

Query: 232 HEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSMAFYFEDANLRIDGENVFIID 290
            +   S     +   +    + +  C+ +P +  +  P + F+F      +   + ++ID
Sbjct: 358 KQSLASMVSGAEFVAI----KGLDACFRVPPSSGQGLPDITFHFNGGADFVTRPSNYVID 413

Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
             +   L+ V  ++  V++ G+ QQ+D   ++D++   + F + +C   S
Sbjct: 414 LGSLQCLIFVPTNE--VSIFGNLQQQDFFVLHDMDNRRIGFKETDCGAHS 461


>gi|297819684|ref|XP_002877725.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323563|gb|EFH53984.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score = 84.3 bits (207), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 93/367 (25%), Positives = 149/367 (40%), Gaps = 53/367 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SS++Q + C+  DC 
Sbjct: 96  TRLWIGTPPQMFALIVDSGSTVTYVPCSDCEQCGKHQDPKFQPELSSTYQPVKCNM-DCN 154

Query: 48  YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              C +  EQCVY  +YA+ S +KG    + IS     E +     A+FGC     G   
Sbjct: 155 ---CDDDKEQCVYEREYAEHSSSKGVLGEDLISF--GNESQLTPQRAVFGCETVETG--- 206

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
           D       G++GL +  +S + QL    +I   F  C   + +  G        + +DM 
Sbjct: 207 DLYSQRADGIIGLGQGDLSLVDQLVDKGLISNSFGLCYGGMDVGGGSMILGGFDYPSDMI 266

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +       +        +Y + L  I +  ++++     FD    GE G ++DSG+   Y
Sbjct: 267 FTDSDPDRSP-------YYNIDLTGIRVAGKKLSLNSRVFD----GEHGAVLDSGTTYAY 315

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-IQLCYFLPETFNRFPSMAFYFEDANLRI 281
                +    E  +   E   L Q+ D P+P  +   FL    N    ++  F    +  
Sbjct: 316 LPDAAFAAFEEAVMR--EVSPLKQI-DGPDPNFKDTCFLVAASNDVSELSKIFPSVEMIF 372

Query: 282 DGENVFIIDYENHFF---------LLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
                +++  EN+ F          L V P+  D   L+G    R+T  VYD     + F
Sbjct: 373 KSGQSWLLSPENYMFRHSKVHGAYCLGVFPNGKDHTTLLGGIVVRNTLVVYDRENSKVGF 432

Query: 332 VKENCSD 338
            + NCS+
Sbjct: 433 WRTNCSE 439


>gi|388505490|gb|AFK40811.1| unknown [Medicago truncatula]
          Length = 193

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 94/181 (51%), Gaps = 13/181 (7%)

Query: 166 PSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
           P+  ATK +  P        +FYY+SL+ IS+ + +++    TF+++  G GG IIDSG+
Sbjct: 15  PNVNATKQVTTPLITNPLQPSFYYISLEVISVGDTKLSIEQSTFEVSDDGSGGVIIDSGT 74

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFNRFPSMAFYFED 276
            +TY   + +  L ++F S   + +L         + +C+ LP  +T    P + F+F+ 
Sbjct: 75  TITYIEENAFDSLKKEFTS---QTKLPVDKSGSTGLDVCFSLPSGKTEVEIPKLVFHFKG 131

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +L + GEN  I D       LA+   + + ++ G+ QQ++    +DL  + ++F+   C
Sbjct: 132 GDLELPGENYMIADSSLGVACLAMGASNGM-SIFGNIQQQNILVNHDLQKETITFIPTQC 190

Query: 337 S 337
           +
Sbjct: 191 N 191


>gi|357515189|ref|XP_003627883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355521905|gb|AET02359.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 434

 Score = 84.0 bits (206), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 143/359 (39%), Gaps = 44/359 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V+   GTP + +LL LDT S   +              F P KS+SF+ ++C  P C  
Sbjct: 98  IVKAKFGTPPQTLLLALDTSSDAAWIPCSGCVGCSTSKPFAPIKSTSFRNVSCGSPHCKQ 157

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C +   Y   S+       +T+++           G  FGC N   G   
Sbjct: 158 VPNPTCGGSACAFNFTYGSSSIAAS-VVQDTLTL-----ATDPIPGYTFGCVNKTTGSSA 211

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
             +           R  +S +SQ  ++ K  FSYCL  P       S  L+ G     +R
Sbjct: 212 PQQGLLGL-----GRGPLSLLSQSQNLYKSTFSYCL--PSFKSINFSGSLRLGPVYQPKR 264

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T  + +P  ++ YY++L  I +  + ++ PP       +   G I DSG+V T  
Sbjct: 265 --IKYTPLLRNPRRSSLYYVNLVAIKVGRKIVDIPPAALAFNPTTGAGTIFDSGTVFTRL 322

Query: 224 HSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
              VY  +  +F      +  +  L         CY +P      P++ F F   N+ + 
Sbjct: 323 AEPVYTAVRNEFRRRVGPKLPVTTLGG----FDTCYNVPIV---VPTITFLFSGMNVTLP 375

Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +N+ I         LA+A   D    ++ +I + QQ++ R ++D+    +   +E C+
Sbjct: 376 PDNIVIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLFDVPNSRIGIARELCT 434


>gi|224081804|ref|XP_002306494.1| predicted protein [Populus trichocarpa]
 gi|222855943|gb|EEE93490.1| predicted protein [Populus trichocarpa]
          Length = 564

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 91/366 (24%), Positives = 147/366 (40%), Gaps = 52/366 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+DTGS++ Y                F P  SS++Q + C+  DC 
Sbjct: 15  TRLWIGTPPQRFALIVDTGSSVTYVPCSSCEQCGRHQDPKFQPDLSSTYQSVKCNI-DC- 72

Query: 48  YFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIF-HGALFGCSNDNHGFD 104
              C +E  QCVY  +YA+ S + G    + IS    G   A+    A+FGC N   G  
Sbjct: 73  --NCDDEKQQCVYERQYAEMSTSSGVLGEDIISF---GNLSALAPQRAVFGCENMETG-- 125

Query: 105 EDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            D       G++G+ R  +S +  L    +I   FS C                    +G
Sbjct: 126 -DLYSQHADGIMGMGRGDLSIVDHLVDKGVINDSFSLCYG--------GMGIGGGAMVLG 176

Query: 163 YRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
              P +      + P  + +Y + LK+I +  + +   P  FD    G+ G I+DSG+  
Sbjct: 177 GISPPSNMVFSQSDPVRSPYYNIDLKEIHVAGKPLPLNPTVFD----GKHGTILDSGTTY 232

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFE 275
            Y     +    +  +      +  +  D P    +C+      + +  + FP++   F 
Sbjct: 233 AYLPEAAFVSFKDAIMKELHSLKPIRGPD-PNYNDICFSGAGSDISQLSSSFPAVEMVFG 291

Query: 276 DAN-LRIDGENVFIIDYENH--FFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           +   L +  EN      + H  + L       D   L+G    R+T  +YD     + F 
Sbjct: 292 NGQKLLLSPENYLFRHSKVHGAYCLGIFQNGKDPTTLLGGIVVRNTLVLYDRENSKIGFW 351

Query: 333 KENCSD 338
           K NCS+
Sbjct: 352 KTNCSE 357


>gi|449442281|ref|XP_004138910.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449506266|ref|XP_004162699.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 482

 Score = 84.0 bits (206), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 92/374 (24%), Positives = 156/374 (41%), Gaps = 54/374 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFD-----PRKS--------------SSFQKINCD 42
            ++ +GTP +   + +DTGS +++         P+KS              S+  ++ C+
Sbjct: 76  AKIGLGTPVQDYYVQVDTGSDILWVNCAGCTNCPKKSDLGIELSLYSPSSSSTSNRVTCN 135

Query: 43  HPDCTYF------KCVNEQ-CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              CT         C  E  C Y + Y D S T G+   + +    V G  +  +     
Sbjct: 136 QDFCTSTYDGPIPGCTPELLCEYRVAYGDGSSTAGYFVRDHVVLDRVTGNFQTTSTNGSI 195

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G    A   AL G+LG  +   S ISQL S   +K+ F++CL      G +
Sbjct: 196 VFGCGAQQSG-QLGATSAALDGILGFGQANSSMISQLASSGKVKRVFAHCLDNINGGGIF 254

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
               +         +P  + T  +  P   +Y + +K I +DNE +N P D FD  +   
Sbjct: 255 AIGEVV--------QPKVRTTPLV--PQQAHYNVFMKAIEVDNEVLNLPTDVFDTDL--R 302

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L YF   +Y  L  K    F R    +L    E      +     + FP+
Sbjct: 303 KGTIIDSGTTLAYFPDVIYEPLISKI---FARQSTLKLHTVEEQFTCFEYDGNVDDGFPT 359

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYD 323
           + F+FED+       + ++ D +++ + +      A +     + L+G    ++   +YD
Sbjct: 360 VTFHFEDSLSLTVYPHEYLFDIDSNKWCVGWQNSGAQSRDGKDMILLGDLVLQNRLVMYD 419

Query: 324 LNIDLLSFVKENCS 337
           L    + + + NCS
Sbjct: 420 LENQTIGWTEYNCS 433


>gi|115457772|ref|NP_001052486.1| Os04g0334700 [Oryza sativa Japonica Group]
 gi|113564057|dbj|BAF14400.1| Os04g0334700 [Oryza sativa Japonica Group]
          Length = 482

 Score = 84.0 bits (206), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/364 (22%), Positives = 151/364 (41%), Gaps = 49/364 (13%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
           IGTP+    + LDTGS   +                     +DPR S S +++ CD   C
Sbjct: 89  IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
           T     N   +C Y   YAD  +T G    + +    + G G+ +       FGC     
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G   ++   A+ G++G      + +SQL +    KK FS+CL      G +    +    
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 P  + T  + +   ++ ++LK I++    +  P + F  T +   G  IDSGS 
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           L Y    +Y +L     +      +  + +     Q  +FL    ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372

Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
            +D     ++++YE + +        +  + D++ ++G     +   VYD+    + + +
Sbjct: 373 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 431

Query: 334 ENCS 337
            NCS
Sbjct: 432 HNCS 435


>gi|293332561|ref|NP_001170100.1| uncharacterized protein LOC100384018 precursor [Zea mays]
 gi|224033441|gb|ACN35796.1| unknown [Zea mays]
 gi|413944035|gb|AFW76684.1| hypothetical protein ZEAMMB73_746438 [Zea mays]
          Length = 456

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 81/295 (27%), Positives = 117/295 (39%), Gaps = 59/295 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V L +GTP + V L LDTGS L++               + DP  SS++  + C  P C
Sbjct: 87  LVHLAVGTPPRPVALTLDTGSDLVWTQCAPCRDCFDQGIPLLDPAASSTYAALPCGAPRC 146

Query: 47  ---TYFKCVNEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGA---LFGCSN 98
               +  C    CVY   Y D+SVT G  A +  T    G+  G           FGC +
Sbjct: 147 RALPFTSCGGRSCVYVYHYGDKSVTVGKIATDRFTFGDNGRRNGDGSLPATRRLTFGCGH 206

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N G  +        G+ G  R   S  SQL +     FSYC        +  SS +  G
Sbjct: 207 FNKGVFQSNET----GIAGFGRGRWSLPSQLNAT---SFSYCFTSMF---DSKSSIVTLG 256

Query: 159 TDMG-----YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
                          + T    +P+  + Y+LSLK IS+   R+  P   F  T      
Sbjct: 257 GAPAALYSHAHSGEVRTTPLFKNPSQPSLYFLSLKGISVGKTRLPVPETKFRST------ 310

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLP 261
            IIDSG+ +T    +VY  +  +F         AQ+   P  ++     +C+ LP
Sbjct: 311 -IIDSGASITTLPEEVYEAVKAEFA--------AQVGLPPSGVEGSALDVCFALP 356


>gi|194707632|gb|ACF87900.1| unknown [Zea mays]
          Length = 423

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G P+K   + +DTGS +++                     F+P  SS+  +I C 
Sbjct: 7   TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 66

Query: 43  HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
              CT          +  N Q   C YT  Y D S T G+   +T+   +V+G  +    
Sbjct: 67  DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 126

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
               +FGCSN   G D    D A+ G+ G  +  +S ISQL S  +  K FS+CL     
Sbjct: 127 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 181

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
             +     L  G  +    P    T  + + P+  Y L+L+ I+++ +++  P D+   T
Sbjct: 182 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 234

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            S   G I+DSG+ L Y     Y    + FVS         +         C+    + +
Sbjct: 235 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 290

Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
             FP++  YF     + +  EN  +    +D    + +         + ++G    +D  
Sbjct: 291 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 350

Query: 320 FVYDLNIDLLSFVKENCS 337
           FVYDL    + +   +CS
Sbjct: 351 FVYDLANMRMGWADYDCS 368


>gi|356528675|ref|XP_003532925.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 342

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 83/343 (24%), Positives = 140/343 (40%), Gaps = 87/343 (25%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTM 60
           ++RL+IGTP    L+I DTGS  I+    P                    C N QCVY  
Sbjct: 79  LMRLYIGTPPVERLVIADTGSDFIWVQCSP--------------------CQNCQCVYLN 118

Query: 61  KYADQSVTKGFAAHETISVIGKGEGKAI-FHGALFGC-SNDNHGFDEDARDGALAGVLGL 118
            YA++S T      ET+S    G  + + F  ++FGC +N+N  F    +     G++GL
Sbjct: 119 IYANKSFTIEVVGTETLSFDSTGGAQTVSFPNSIFGCGANNNLTFRSSDKA---TGLVGL 175

Query: 119 SRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN 178
               +S +SQLG+ I  +F               SYLKFG++         +T  I  P+
Sbjct: 176 VAGQLSLVSQLGAQIGYKF---------------SYLKFGSEAIITTNGVVSTPLIIKPS 220

Query: 179 -NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
              Y+L+L+ ++I  + +  P +T  +                                 
Sbjct: 221 LPLYFLNLEVVTIGQKVV--PTETLGV--------------------------------- 245

Query: 238 YFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFL 297
                    + D P P + C+   +     P++AF F  A++ +  +N+ I   + +   
Sbjct: 246 -------ESVQDLPFPFKFCFPYRDNMT-VPAIAFQFTGASVALRPKNLLIKLQDRNMLX 297

Query: 298 LAVAPHD---DLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           LAV P      ++++ G   Q D + +YDL+   +S    +C+
Sbjct: 298 LAVVPSASSLSVISIFGIIAQFDFQVLYDLDGKKVSVAPTDCT 340


>gi|255566006|ref|XP_002523991.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223536718|gb|EEF38359.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 455

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 98/383 (25%), Positives = 161/383 (42%), Gaps = 74/383 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +++L IGTP   +   +DTGS +I+              +IF+P  SS++Q   CD   C
Sbjct: 99  LMKLLIGTPPTEIHAAIDTGSNVIWIPCINCKDCFNQSSSIFNPLASSTYQDAPCDSYQC 158

Query: 47  --TYFKCVNEQ-CVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
             T   C ++  C+Y+     Q +   G  A +T+++            + F C N  + 
Sbjct: 159 ETTSSSCQSDNVCLYSCDEKHQLNCPNGRIAVDTMTLTSSDGRPFPLPYSDFVCGNSIY- 217

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS---SYLKFG- 158
                +  A  GV+GL R  +S  S+L  +   +FSYCL       +Y S   S + FG 
Sbjct: 218 -----KTFAGVGVIGLGRGALSLTSKLYHLSDGKFSYCL------ADYYSKQPSKINFGL 266

Query: 159 -------------TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMN--FPPDTFD 203
                        T +G+ R          H  N YY++L+ IS+  +R +  +  D F 
Sbjct: 267 QSFISDDDLEVVSTTLGHHR----------HSGN-YYVTLEGISVGEKRQDLYYVDDPFA 315

Query: 204 ITVSGEGGCIIDSGSVLT--------YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ 255
             V   G  +IDSG++ T        Y  S V + + E   ++    +     D    + 
Sbjct: 316 PPV---GNMLIDSGTMFTLLPKDFYDYLWSTVSYAIPENPQNHPHNSRFPFSMDNTLKLS 372

Query: 256 LC-YFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
            C ++ PE   +FP +  +F DA++ +  +N FI   E+       A       + GS Q
Sbjct: 373 PCFWYYPEL--KFPKITIHFTDADVELSDDNSFIRVAEDVVCFAFAATQPGQSTVYGSWQ 430

Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
           Q +    YDL    +SF + +CS
Sbjct: 431 QMNFILGYDLKRGTVSFKRTDCS 453


>gi|356540369|ref|XP_003538662.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Glycine max]
          Length = 364

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/341 (27%), Positives = 153/341 (44%), Gaps = 26/341 (7%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHP--DCTYF---KCVNEQ 55
           +++L +GTP   V  ++DT S L++A   P +    QK     P  +C  F    C  E+
Sbjct: 32  LMKLTLGTPPVDVYGLVDTDSDLVWAQCTPCQGCYKQKNPMFDPLKECNSFFDHSCSPEK 91

Query: 56  -CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-FDEDARDGALA 113
            C Y   YAD S TKG  A E I+     +GK I    +FGC ++N G F+E+       
Sbjct: 92  ACDYVYAYADDSATKGMLAKE-IATFSSTDGKPIVESIIFGCGHNNTGVFNENDMGLIGL 150

Query: 114 GVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
                    +S +SQ+G++   KRFS CLV P     +TS  +  G            T 
Sbjct: 151 -----GGGPLSLVSQMGNLYGSKRFSQCLV-PFHADPHTSGTISLGEASDVSGEGVVTTP 204

Query: 173 FINHPNNFYYL-SLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
            ++      YL +L+ IS+ +  + F        +  +G  +IDSG+  TY   + Y +L
Sbjct: 205 LVSEEGQTPYLVTLEGISVGDTFVPF----NSSEMLSKGNIMIDSGTPETYLPQEFYDRL 260

Query: 232 HEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIID 290
            E+      +  L  +   P+   QLCY   ET    P +  +FE A++++     FI  
Sbjct: 261 VEELKV---QINLPPIHVDPDLGTQLCY-KSETNLEGPILTAHFEGADVKLLPLQTFIPP 316

Query: 291 YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
            ++  F  A+    D + + G+  Q +    +DL+  ++ F
Sbjct: 317 -KDGVFCFAMTGTTDGLYIFGNFAQSNVLIGFDLDKRIVFF 356


>gi|148907752|gb|ABR17002.1| unknown [Picea sitchensis]
          Length = 454

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 97/390 (24%), Positives = 154/390 (39%), Gaps = 82/390 (21%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
            R+ +GTP +   + +DTGS +++                     FDPR SS+   ++C 
Sbjct: 43  TRIELGTPPRPFYVQIDTGSDILWVNCKPCNACPLTSGLGVALNFFDPRGSSTASPLSCI 102

Query: 43  HPDCTYFKCVNEQ-------CVYTMKYADQSVTKGFAAHETIS--------VIGKGEGKA 87
              C     ++E        C Y+ +Y D S T G+   +           V      K 
Sbjct: 103 DSKCVSSNQISESVCTTDRYCGYSFEYGDGSGTLGYYVSDEFDYNQYVNQYVTNNASAKI 162

Query: 88  IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPL 145
                 FGCS +  G D    D A+ G+ G  +  +S +SQL S  +  K FS+CL    
Sbjct: 163 T-----FGCSYNQSG-DLTKPDRAVDGIFGFGQNDLSVVSQLNSQGLAPKIFSHCLEGAD 216

Query: 146 PNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
           P G      L  G       P    T  + + P+  Y L+L+ I+++ ++++  P  F  
Sbjct: 217 PGG----GILVLGE---ITEPGMVYTPIVPSQPH--YNLNLQGIAVNGQQLSIDPQVFAT 267

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL----CYFL 260
           T     G IID G+ L Y   + Y    E FV+      +A +S   +P  L    C+  
Sbjct: 268 T--NTRGTIIDCGTTLAYLAEEAY----EPFVNTI----IAAVSQSTQPFMLKGNPCFLT 317

Query: 261 PETFNR-FPSMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLV 307
             + +  FPS+  YFE A             L  D   V+ I ++        A     +
Sbjct: 318 VHSIDEIFPSVTLYFEGAPMDLKPKDYLIQQLSPDSSPVWCIGWQKSG---QQATDSSKM 374

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            ++G    +D  FVYDL    + +   +CS
Sbjct: 375 TILGDLVLKDKVFVYDLENQRIGWTSFDCS 404


>gi|168063189|ref|XP_001783556.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664943|gb|EDQ51645.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 414

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 94/377 (24%), Positives = 142/377 (37%), Gaps = 61/377 (16%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
           + IG P+K   L +DTGS L +                ++DP+++   + ++C  P C  
Sbjct: 35  MRIGNPAKLYYLDMDTGSDLTWLQCDAPCRSCAVGPHGLYDPKRA---RVVDCRRPTCAQ 91

Query: 49  ------FKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 F C  +  QC Y + Y D S T G    +TI+++    G      A+ GC  D 
Sbjct: 92  VQRGGQFTCSGDVRQCDYEVDYVDGSSTMGILVEDTITLV-LTNGTRFQTRAVIGCGYDQ 150

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            G    A      GV+GLS   IS  SQL +  I      +CL      G     YL FG
Sbjct: 151 QGTLAKA-PAVTDGVIGLSSSKISLPSQLAAKGIANNVIGHCLA----GGSNGGGYLFFG 205

Query: 159 TDMGYRRPSTQATKFINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            D          T  I  P    Y   L+ I    E +     T D+     GG + DSG
Sbjct: 206 -DTLVPALGMTWTPMIGRPLVEGYQARLRSIKYGGEVLELEGTTDDV-----GGAMFDSG 259

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQL-SDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           +  TY   + Y  +    V   +R  L ++ +D   P   C+  P  F     ++ YF+ 
Sbjct: 260 TSFTYLVPNAYTAVLSAVVRQAQRSGLERIKTDTTLP--FCWRGPSPFESVADVSAYFKT 317

Query: 277 ANLRIDG--------------ENVFIIDYENHF---FLLAVAPHDDLVALIGSQQQRDTR 319
             L   G              E   I+  + +     L A     ++  ++G    R   
Sbjct: 318 VTLDFGGSTWWSSGKLLELSPEGYLIVSTQGNVCLGVLDASVASLEVTNILGDISMRGYL 377

Query: 320 FVYDLNIDLLSFVKENC 336
            VYD   + + +V+ NC
Sbjct: 378 VVYDNMREQIGWVRRNC 394


>gi|357116170|ref|XP_003559856.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 460

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 84/374 (22%), Positives = 151/374 (40%), Gaps = 49/374 (13%)

Query: 1   MVRLFIGTPSKGVLLILDT-------------GSALIYAIFDPRKSSSFQKINCD----- 42
           +VR  +GTP + +LL +DT             G       F+P  S++F+ + C      
Sbjct: 95  LVRASLGTPPQRLLLAVDTSNDAAWVPCAGCHGCPTTAPSFNPASSATFRPVPCGAPPCS 154

Query: 43  ---HPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
              +P CT        C +++ Y D S+    +  + ++V   G    +  G  FGC   
Sbjct: 155 QAPNPSCTSLAKSKNSCGFSLSYGDSSLDATLS-QDNLAVTANG---GVIKGYTFGCLTK 210

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           ++G    A+      +LGL R  + F++Q   I +  FSYCL     +    S  L  G 
Sbjct: 211 SNGSAAPAQG-----LLGLGRGPLGFVAQTKGIYEGTFSYCLPSYYRSAANFSGSLTLGR 265

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
                    + T  +  P+  + YY+++  + I  + +  PP       +   G ++DSG
Sbjct: 266 KGQPAPEKMKTTPLLASPHRPSLYYVAMTGVRIGKKSVPIPPSALAFDAATGAGTVLDSG 325

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL--------CYFLPETFNRFPS 269
           ++        Y  + ++ V       L +       + +        CY +      +P+
Sbjct: 326 TMFARLAQPAYAAVRDE-VRRRVAGSLRRRGGGGASVSVSSLGGFDTCYNVSTV--AWPA 382

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYD 323
           +   F     +R+  ENV I         LA+A  P D + A   +IGS QQ++ R ++D
Sbjct: 383 VTLVFGGGMEVRLPEENVVIRSTYGSTSCLAMAASPADGVNAALNVIGSLQQQNHRVLFD 442

Query: 324 LNIDLLSFVKENCS 337
           +    + F +E C+
Sbjct: 443 VPNARVGFARERCT 456


>gi|18390579|ref|NP_563751.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|332189782|gb|AEE27903.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 485

 Score = 83.6 bits (205), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 92/377 (24%), Positives = 158/377 (41%), Gaps = 60/377 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP+K   + +DTGS +++                    +++  +S S + ++CD
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 43  HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              C          C  N  C Y   Y D S T G+   + +   SV G  + +      
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G  + + + AL G+LG  +   S ISQL S   +KK F++CL     +G  
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + N P+  Y +++  + +  E +  P D F       
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQ--PGDR 309

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L Y    +Y  L +K  S     ++  +    +  Q    + E    FP+
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKKITSQEPALKVHIVDKDYKCFQYSGRVDEG---FPN 366

Query: 270 MAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRF 320
           + F+FE++  LR+         E ++ I ++N     A+   D   + L+G     +   
Sbjct: 367 VTFHFENSVFLRVYPHDYLFPHEGMWCIGWQNS----AMQSRDRRNMTLLGDLVLSNKLV 422

Query: 321 VYDLNIDLLSFVKENCS 337
           +YDL   L+ + + NCS
Sbjct: 423 LYDLENQLIGWTEYNCS 439


>gi|218195474|gb|EEC77901.1| hypothetical protein OsI_17222 [Oryza sativa Indica Group]
          Length = 467

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 154/381 (40%), Gaps = 64/381 (16%)

Query: 1   MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
           +V+L IGTP+  +    ++ DTGS L                Y   DP KS +F++++C 
Sbjct: 103 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 162

Query: 43  HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
            P C     V      +  C++  +Y D     G    +       G G G  +     F
Sbjct: 163 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 222

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           GC+   H  D  A  G   G+L L     SF++QLG     RFSYC    +P  E T   
Sbjct: 223 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 272

Query: 152 --------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
                   +S+L+FG+   + R + +   F     + Y + LK +   +           
Sbjct: 273 DDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQPVP 328

Query: 204 ITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
           + V+GE        ++DSG+ L +    V++ L  +     E   L +  D   P   CY
Sbjct: 329 VYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLYCY 385

Query: 259 FLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQ 315
               T     S+   F   A+L + G ++F  D      +  LAVA  +   A++G   Q
Sbjct: 386 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVYPQ 443

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           R+    YDL+   ++F ++ C
Sbjct: 444 RNINVGYDLSTMEIAFDRDQC 464


>gi|359476199|ref|XP_003631804.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 421

 Score = 83.2 bits (204), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 74/246 (30%), Positives = 109/246 (44%), Gaps = 43/246 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQ----- 55
           +V +  GTP +  +LILDTGS++ +       +     +NC      YF           
Sbjct: 129 LVDVAFGTPPQNFMLILDTGSSITW-------TQCKACVNCLQDSHRYFNWSASSTYSSG 181

Query: 56  -CV-------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
            C+       Y M Y D S + G    +T+++    E   +F    FGC  +N G   D 
Sbjct: 182 SCIPGTVENNYNMTYGDDSTSVGNYGCDTMTL----EPSDVFQKFQFGCGRNNKG---DF 234

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
             G + G+LGL +  +S +SQ  S   K FSYCL    P  +   S L FG     +  S
Sbjct: 235 GSG-VDGMLGLGQGQLSTVSQTASKFNKVFSYCL----PEEDSIGSLL-FGEKATSQSSS 288

Query: 168 TQATKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
            + T  +N P     + +Y+++L DIS+ NER+N P   F        G IIDS +V+T 
Sbjct: 289 LKFTSLVNGPGTLQESGYYFVNLSDISVGNERLNIPSSVF-----ASPGTIIDSRTVITR 343

Query: 223 FHSDVY 228
                Y
Sbjct: 344 LPQRAY 349


>gi|116311058|emb|CAH67989.1| OSIGBa0142I02-OSIGBa0101B20.32 [Oryza sativa Indica Group]
          Length = 488

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 96/381 (25%), Positives = 154/381 (40%), Gaps = 64/381 (16%)

Query: 1   MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
           +V+L IGTP+  +    ++ DTGS L                Y   DP KS +F++++C 
Sbjct: 124 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 183

Query: 43  HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
            P C     V      +  C++  +Y D     G    +       G G G  +     F
Sbjct: 184 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 243

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           GC+   H  D  A  G   G+L L     SF++QLG     RFSYC    +P  E T   
Sbjct: 244 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLG---VDRFSYC----IPASEITDDD 293

Query: 152 --------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFD 203
                   +S+L+FG+   + R + +   F     + Y + LK +   +           
Sbjct: 294 DDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQPVP 349

Query: 204 ITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
           + V+GE        ++DSG+ L +    V++ L  +     E   L +  D   P   CY
Sbjct: 350 VYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLYCY 406

Query: 259 FLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQ 315
               T     S+   F   A+L + G ++F  D      +  LAVA  +   A++G   Q
Sbjct: 407 LGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVYPQ 464

Query: 316 RDTRFVYDLNIDLLSFVKENC 336
           R+    YDL+   ++F ++ C
Sbjct: 465 RNINVGYDLSTMEIAFDRDQC 485


>gi|18409620|ref|NP_566966.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|13430562|gb|AAK25903.1|AF360193_1 unknown protein [Arabidopsis thaliana]
 gi|4886277|emb|CAB43423.1| putative protein [Arabidopsis thaliana]
 gi|14532764|gb|AAK64083.1| unknown protein [Arabidopsis thaliana]
 gi|15450892|gb|AAK96717.1| Unknown protein [Arabidopsis thaliana]
 gi|30387567|gb|AAP31949.1| At3g52500 [Arabidopsis thaliana]
 gi|332645431|gb|AEE78952.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 469

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 147/389 (37%), Gaps = 65/389 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           V L  GTPS+ +  + DTGS+L++                        F P+ SSS + I
Sbjct: 92  VSLSFGTPSQTIPFVFDTGSSLVWLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKII 151

Query: 40  NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF----- 94
            C  P C +    N QC        ++ T G   +     +G   G  I     F     
Sbjct: 152 GCQSPKCQFLYGPNVQC-RGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTV 210

Query: 95  -----GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
                GCS               AG+ G  R  +S  SQ+     KRFS+CLV    +  
Sbjct: 211 PDFVVGCS--------IISTRQPAGIAGFGRGPVSLPSQMN---LKRFSHCLVSRRFDDT 259

Query: 150 YTSSYLKF----GTDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFP 198
             ++ L      G + G + P    T F  +PN        +YYL+L+ I +  + +  P
Sbjct: 260 NVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIP 319

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
                   +G+GG I+DSGS  T+    V+  + E+F S    +   +  +    +  C+
Sbjct: 320 YKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCF 379

Query: 259 FLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--------VA 308
            +    +   P + F F+  A L +   N F          L V     +          
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAI 439

Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++GS QQ++    YDL  D   F K+ CS
Sbjct: 440 ILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|413923782|gb|AFW63714.1| hypothetical protein ZEAMMB73_300584 [Zea mays]
          Length = 458

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 86/365 (23%), Positives = 152/365 (41%), Gaps = 58/365 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + IG+P+    + +DTGS + +              ++FDP  SS++   +C    C
Sbjct: 123 VITVGIGSPAVTQTMSMDTGSDVSWVQCKPCSQCHSEVDSLFDPSSSSTYSPFSCSSAPC 182

Query: 47  TYFK-------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                      C++ QC Y + Y D S T G  + +T+++     G +      FGCS  
Sbjct: 183 AQLSQSQEGNGCMSSQCQYIVNYGDSSSTTGTYSSDTLTL-----GSSAMTDFQFGCSQS 237

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G   D  D    G++GL     S  SQ        FSYC    LP    +S +L  GT
Sbjct: 238 ESGGFNDQTD----GLMGLGGGAQSLASQTAGTFGTAFSYC----LPPTSGSSGFLTLGT 289

Query: 160 DMG--YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
                 + P  ++T+       +Y + L+ I + ++++N P   F        G ++DSG
Sbjct: 290 GSSGFVKTPMLRSTQI----PTYYVVLLESIKVGSQQLNLPTSVF------SAGSLMDSG 339

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
           +++T      Y  L   F +  +++  A  S     +  C+ F  ++    P++   F  
Sbjct: 340 TIITRLPPTAYSALSSAFKAGMQQYPPATPSGI---LDTCFDFSGQSSISIPTVTLVFSG 396

Query: 277 A---NLRIDGENVFIIDYENHFFLLAVAPH--DDLVALIGSQQQRDTRFVYDLNIDLLSF 331
               +L  DG    +++  +    LA  P+  D  + +IG+ QQR    +YD+    + F
Sbjct: 397 GAAVDLAFDG---IMLEISSSIRCLAFTPNGDDSSLGIIGNVQQRTFEVLYDVGGGAVGF 453

Query: 332 VKENC 336
               C
Sbjct: 454 KAGAC 458


>gi|226508052|ref|NP_001150337.1| LOC100283967 precursor [Zea mays]
 gi|195638522|gb|ACG38729.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 507

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G P+K   + +DTGS +++                     F+P  SS+  +I C 
Sbjct: 91  TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 150

Query: 43  HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
              CT          +  N Q   C YT  Y D S T G+   +T+   +V+G  +    
Sbjct: 151 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 210

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
               +FGCSN   G D    D A+ G+ G  +  +S ISQL S  +  K FS+CL     
Sbjct: 211 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 265

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
             +     L  G  +    P    T  + + P+  Y L+L+ I+++ +++  P D+   T
Sbjct: 266 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 318

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            S   G I+DSG+ L Y     Y    + FVS         +         C+    + +
Sbjct: 319 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 374

Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
             FP++  YF     + +  EN  +    +D    + +         + ++G    +D  
Sbjct: 375 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 434

Query: 320 FVYDLNIDLLSFVKENCS 337
           FVYDL    + +   +CS
Sbjct: 435 FVYDLANMRMGWADYDCS 452


>gi|413952263|gb|AFW84912.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 509

 Score = 83.2 bits (204), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 153/378 (40%), Gaps = 58/378 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G P+K   + +DTGS +++                     F+P  SS+  +I C 
Sbjct: 93  TRVKLGNPAKEFFVQIDTGSDILWVTCSPCTGCPTSSGLNIQLESFNPDSSSTASRITCS 152

Query: 43  HPDCT--------YFKCVNEQ---CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
              CT          +  N Q   C YT  Y D S T G+   +T+   +V+G  +    
Sbjct: 153 DDRCTAGFQTGEAICQTSNSQSSPCGYTFTYGDGSGTSGYYVSDTMFFETVMGNEQTANS 212

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
               +FGCSN   G D    D A+ G+ G  +  +S ISQL S  +  K FS+CL     
Sbjct: 213 SASIVFGCSNSQSG-DLTKADRAVDGIFGFGQHQLSVISQLNSLGVSPKVFSHCL----K 267

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
             +     L  G  +    P    T  + + P+  Y L+L+ I+++ +++  P D+   T
Sbjct: 268 GSDNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIAVNGQKL--PIDSSLFT 320

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            S   G I+DSG+ L Y     Y    + FVS         +         C+    + +
Sbjct: 321 TSNTQGTIVDSGTTLAYLADGAY----DPFVSAIAAAVSPSVRSLVSKGSQCFITSSSVD 376

Query: 266 -RFPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
             FP++  YF     + +  EN  +    +D    + +         + ++G    +D  
Sbjct: 377 SSFPTVTLYFMGGVAMSVKPENYLLQQASVDNSVLWCIGWQRNQGQEITILGDLVLKDKI 436

Query: 320 FVYDLNIDLLSFVKENCS 337
           FVYDL    + +   +CS
Sbjct: 437 FVYDLANMRMGWADYDCS 454


>gi|326490656|dbj|BAJ89995.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 456

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 147/359 (40%), Gaps = 53/359 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           ++ + +G+P+    +++DTGS + +              ++FDP  SS++   +C    C
Sbjct: 128 LITVGMGSPAVAQTMLIDTGSDVSWVQCKPCSQCHSQADSLFDPSSSSTYSAFSCTSAAC 187

Query: 47  TYFK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
              +   C + QC YT+KY D S   G  + +T+++     G +      FGCS    G 
Sbjct: 188 AQLRQRGCSSSQCQYTVKYGDGSTGSGTYSSDTLAL-----GSSTVENFQFGCSQSESGN 242

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
             +D   G +     L     S  +Q      K FSYCL  P P    +S +L  G    
Sbjct: 243 LLQDQTAGLMG----LGGGAESLATQTAGTFGKAFSYCLP-PTPG---SSGFLTLGASTS 294

Query: 163 ---YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               + P  ++T+      ++Y + L+ I +   ++N P   F        G I+DSG++
Sbjct: 295 GFVVKTPMLRSTQV----PSYYGVLLQAIRVGGRQLNIPASAF------SAGSIMDSGTI 344

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QLCY-FLPETFNRFPSMAFYFEDA 277
           +T      Y  L   F +  +++  AQ    P  I   C+ F  ++    P++A  F   
Sbjct: 345 ITRLPRTAYSALSSAFKAGMKQYPPAQ----PMGIFDTCFDFSGQSSVSIPTVALVFSGG 400

Query: 278 NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +     +  I+         A    D  + +IG+ QQR    +YD+    + F    C
Sbjct: 401 AVVDLASDGIIL---GSCLAFAANSDDTSLGIIGNVQQRTFEVLYDVGGGAVGFKAGAC 456


>gi|326501422|dbj|BAK02500.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 145/364 (39%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL IGTP +   LI+D+GS + Y                F P  SS++  + C+  DCT
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 148

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                N QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F +
Sbjct: 149 CDSDKN-QCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 205

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +     M 
Sbjct: 206 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMI 260

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           Y       +  +  P  +Y + LK++ +  + +   P  FD    G+ G ++DSG+   Y
Sbjct: 261 YTH-----SNAVRSP--YYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAY 309

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
                +    +   S     +  +  D P    +C+      + +    FP +   F + 
Sbjct: 310 LPEQAFVAFKDAVSSQVHPLKKIRGPD-PNYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368

Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             L +  EN        E  + L       D   L+G    R+T   YD + + + F K 
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428

Query: 335 NCSD 338
           NCS+
Sbjct: 429 NCSE 432


>gi|356535252|ref|XP_003536162.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 475

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 157/374 (41%), Gaps = 55/374 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           +L +G+P K   + +DTGS +++                    ++DP+ S + + I+CD 
Sbjct: 73  KLGLGSPPKDYYVQVDTGSDILWVNCVKCSRCPRKSDLGIDLTLYDPKGSETSELISCDQ 132

Query: 44  PDCTYF------KCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
             C+         C +E  C Y++ Y D S T G+   + ++   V             +
Sbjct: 133 EFCSATYDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNHVNDNLRTAPQNSSII 192

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G    + + AL G++G  +   S +SQL +   +KK FS+CL     +    
Sbjct: 193 FGCGAVQSGTLSSSSEEALDGIIGFGQSNSSVLSQLAASGKVKKIFSHCL-----DNIRG 247

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG- 210
                 G  +  +  +T     + H    Y + LK I +D + +  P D FD   SG G 
Sbjct: 248 GGIFAIGEVVEPKVSTTPLVPRMAH----YNVVLKSIEVDTDILQLPSDIFD---SGNGK 300

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
           G IIDSG+ L Y  + VY +L  K ++   R +L  +    E    C+      +R FP 
Sbjct: 301 GTIIDSGTTLAYLPAIVYDELIPKVMARQPRLKLYLV----EQQFSCFQYTGNVDRGFPV 356

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYD 323
           +  +FED+       + ++  +++  + +      A   +   + L+G     +   +YD
Sbjct: 357 VKLHFEDSLSLTVYPHDYLFQFKDGIWCIGWQKSVAQTKNGKDMTLLGDLVLSNKLVIYD 416

Query: 324 LNIDLLSFVKENCS 337
           L    + +   NCS
Sbjct: 417 LENMAIGWTDYNCS 430


>gi|218189149|gb|EEC71576.1| hypothetical protein OsI_03949 [Oryza sativa Indica Group]
          Length = 504

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 92/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G+P K   + +DTGS +++                     F+P  SS+  KI C 
Sbjct: 93  TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152

Query: 43  HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              CT     +E          C YT  Y D S T G+   +T+   SV+G  +      
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDSVMGNEQTANSSA 212

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCSN   G D    D A+ G+ G  +  +S +SQL S  +  K FS+CL       
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     L  G  +    P    T  + + P+  Y L+L+ I ++ +++  P D+   T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
              G I+DSG+ L Y     Y    + FV+         +         C+    + +  
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 376

Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           FP+++ YF     + +  EN  +    ID    + +         + ++G    +D  FV
Sbjct: 377 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 436

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 437 YDLANMRMGWTDYDCS 452


>gi|326515172|dbj|BAK03499.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 494

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/334 (23%), Positives = 129/334 (38%), Gaps = 62/334 (18%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYFKC--------VNEQCVYTMKYADQSVTKGFAAHETIS 78
           ++DP KSS+F  I C  P C               ++C Y + Y D   T G    +T++
Sbjct: 199 LYDPAKSSTFAPIPCGSPACKELGSSYGNGCSPTTDECKYIVNYGDGKATTGTYVTDTLT 258

Query: 79  VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
           +        +     FGCS+   G   +      AG+L L     S + Q        FS
Sbjct: 259 M----SPTIVVKDFRFGCSHAVRGSFSNQN----AGILALGGGRGSLLEQTADAYGNAFS 310

Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFI-------NHPNNFYYLSLKDISID 191
           YC  IP P+   ++ +L  G       P   + KF         H   FY + L+ I + 
Sbjct: 311 YC--IPKPS---SAGFLSLG------GPVEASLKFSYTPLIKNKHAPTFYIVHLEAIIVA 359

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
            +++  PP  F        G ++DSG+V+T     VY  L   F     R  +A      
Sbjct: 360 GKQLAVPPTAFAT------GAVMDSGAVVTQLPPQVYAALRAAF-----RSAMAAYGPLA 408

Query: 252 EPIQ---LCYFLPETFNRFPSMA------FYFEDANLRIDGENVFIIDYENHFFLLAVAP 302
            P++    CY     F RFP +        +   A L ++  ++ +    +     A  P
Sbjct: 409 APVRNLDTCY----DFTRFPDVKVPKVSLVFAGGATLDLEPASIIL----DGCLAFAATP 460

Query: 303 HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++ V  IG+ QQ+    +YD+    + F +  C
Sbjct: 461 GEESVGFIGNVQQQTYEVLYDVGGGKVGFRRGAC 494


>gi|242085924|ref|XP_002443387.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
 gi|241944080|gb|EES17225.1| hypothetical protein SORBIDRAFT_08g018620 [Sorghum bicolor]
          Length = 460

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 155/382 (40%), Gaps = 53/382 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHP 44
           +    IG P +    I+DTGS LI+                  +DP +S + + + C+  
Sbjct: 85  IAEYLIGDPPQQAAAIIDTGSNLIWTQCSTCRANGCFGQDLTFYDPSRSRTAKPVACNDT 144

Query: 45  DC---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
            C   +  +C    + C     Y   ++  GF   E  +  G G+         FGC   
Sbjct: 145 ACLLGSETRCARDGKACAVLTAYGAGAI-GGFLGTEVFT-FGHGQSSENNVSLAFGCITA 202

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS-YLKFG 158
           +      + DGA +G++GL R  +S  SQLG     +FSYCL     +   TS+ ++   
Sbjct: 203 SR-LTPGSLDGA-SGIIGLGRGKLSLPSQLG---DNKFSYCLTPYFSDAANTSTLFVGAS 257

Query: 159 TDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERMNFPPDTFD---ITVSGEG 210
             +        +  F+ +P++     FYYL L  I++   +++ P   FD   +  +  G
Sbjct: 258 AGLSGGGAPATSVPFLKNPDDDPFDSFYYLPLTGITVGTAKLDVPAAAFDLREVAPAKWG 317

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY--FLPETFNRF- 267
           G +IDSGS  T      Y  L ++ V       +   +   E + LC     P    +  
Sbjct: 318 GTLIDSGSPFTSLIDVAYQALRDELVRQLGASVVPPPAGA-EGLDLCVGGVAPGDAGKLV 376

Query: 268 PSMAFYF-----EDANLRIDGENVF-IIDYENHFFLL--AVAPHDDL----VALIGSQQQ 315
           P +  +F        ++ +  EN +  +D      ++  +  P+  L      +IG+  Q
Sbjct: 377 PPLVLHFGSGGGGGGDVVVPPENYWGPVDDSTACMVVFSSGGPNSTLPLNETTIIGNYMQ 436

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
           +D   +YDL   +LSF   +CS
Sbjct: 437 QDMHLLYDLGQGVLSFQPADCS 458


>gi|242065058|ref|XP_002453818.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
 gi|241933649|gb|EES06794.1| hypothetical protein SORBIDRAFT_04g018520 [Sorghum bicolor]
          Length = 490

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 99/385 (25%), Positives = 160/385 (41%), Gaps = 77/385 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IG+PSKG  + +DTGS +++                     +DP  S +   + CD
Sbjct: 87  TQIEIGSPSKGYYVQVDTGSDILWVNCIRCDGCPTTSGLGIELTQYDPAGSGT--TVGCD 144

Query: 43  HPDCTYFK---------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              C               +  C + + Y D S T GF   +++    V G G+      
Sbjct: 145 QEFCVANSPNGLPPACPSTSSPCQFRIAYGDGSSTTGFYVSDSVQYNQVSGNGQTTPSNA 204

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
              FGC     G D  +   AL G+LG  +   S +SQL +   ++K F++CL     + 
Sbjct: 205 SITFGCGA-QLGGDLGSSSQALDGILGFGQADSSMLSQLAAARKVRKIFAHCL-----DT 258

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
            +       G  +  +  +T   + + H    Y ++L+ IS+    +  P  TFD   SG
Sbjct: 259 VHGGGIFAIGNVVQPKVKTTPLVQNVTH----YNVNLQGISVGGATLQLPSSTFD---SG 311

Query: 209 EG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNR 266
           +  G IIDSG+ L Y   +VY  L     + F+++Q   L +  + +  C+ F     + 
Sbjct: 312 DSKGTIIDSGTTLAYLPREVYRTL---LTAVFDKYQDLALHNYQDFV--CFQFSGSIDDG 366

Query: 267 FPSMAFYFEDANLRIDGE---NVFIIDY----ENHFFLL-----AVAPHD--DLVALIGS 312
           FP + F FE       GE   NV+  DY    EN  + +      V   D  D+V L+G 
Sbjct: 367 FPVVTFSFE-------GEITLNVYPHDYLFQNENDLYCMGFLDGGVQTKDGKDMV-LLGD 418

Query: 313 QQQRDTRFVYDLNIDLLSFVKENCS 337
               +   VYDL   ++ +   NCS
Sbjct: 419 LVLSNKLVVYDLEKQVIGWADYNCS 443


>gi|307103543|gb|EFN51802.1| hypothetical protein CHLNCDRAFT_59135 [Chlorella variabilis]
          Length = 746

 Score = 82.8 bits (203), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 136/324 (41%), Gaps = 51/324 (15%)

Query: 4   LFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCT 47
           L++GTP+K   +I+DTGS + Y                A FDP  SS+  +I+C  P C+
Sbjct: 82  LYLGTPAKKFAVIVDTGSTMTYVPCSSCGSGCGPNHQDAAFDPEASSTASRISCTSPKCS 141

Query: 48  ----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG- 102
                  C  +QC YT  YA+QS + G    + +++     G  I    +FGC     G 
Sbjct: 142 CGSPRCGCSTQQCTYTRSYAEQSSSSGILLEDVLALHDGLPGAPI----IFGCETRETGE 197

Query: 103 -FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
            F + A      G+ GL     S ++QL    +I   FS C  +   +G      L  G 
Sbjct: 198 IFRQRAD-----GLFGLGNSDASVVNQLVKAGVIDDVFSLCFGMVEGDGA-----LLLGD 247

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-GCIIDSGS 218
                  S Q T  +    + +Y ++K +S+  E    P      ++  +G G ++DSG+
Sbjct: 248 AEVPGSISLQYTPLLTSTTHPFYYNVKMLSLAVEGQLLP---VSQSLFDQGYGTVLDSGT 304

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQ-LSDCPEP----IQLCYFLPETFNRFPSMAFY 273
             TY  S V+    + F    E++ L+  L   P P      +C+    + +   +++  
Sbjct: 305 TFTYMPSPVF----KAFAGAVEKYALSHGLKRVPGPDPQFDDICFGQAPSHDDLEALSSV 360

Query: 274 FEDANLRIDGENVFIIDYENHFFL 297
           F    ++ D     ++   N+ F+
Sbjct: 361 FPSMEVQFDQGTSLVLGPLNYLFV 384


>gi|32489096|emb|CAE03928.1| OSJNba0093F12.2 [Oryza sativa Japonica Group]
 gi|58532027|emb|CAD41565.3| OSJNBa0006A01.20 [Oryza sativa Japonica Group]
          Length = 489

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)

Query: 1   MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
           +V+L IGTP+  +    ++ DTGS L                Y   DP KS +F++++C 
Sbjct: 123 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 182

Query: 43  HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
            P C     V      +  C++  +Y D     G    +       G G G  +     F
Sbjct: 183 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 242

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           GC+   H  D  A  G   G+L L     SF++QLG     RFSYC    +P  E T   
Sbjct: 243 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLG---VDRFSYC----IPASEITDDD 292

Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
                     +S+L+FG+   + R + +   F     + Y + LK +   +         
Sbjct: 293 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 348

Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
             + V+GE        ++DSG+ L +    V++ L  +     E   L +  D   P   
Sbjct: 349 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 405

Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
           CY    T     S+   F   A+L + G ++F  D      +  LAVA  +   A++G  
Sbjct: 406 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 463

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
            QR+    YDL+   ++F ++ C
Sbjct: 464 PQRNINVGYDLSTMEIAFDRDQC 486


>gi|224142001|ref|XP_002324349.1| predicted protein [Populus trichocarpa]
 gi|222865783|gb|EEF02914.1| predicted protein [Populus trichocarpa]
          Length = 490

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 141/364 (38%), Gaps = 50/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP K + LI DTGS + +                IFDP +S+S+  I+C    
Sbjct: 150 IVTVGLGTPKKDLSLIFDTGSDITWTQCQPCARSCYKQKEQIFDPSQSTSYTNISCSSSI 209

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C            C +  CVY ++Y D S + GF   E +++         F+   FGC 
Sbjct: 210 CNSLTSATGNTPGCASSACVYGIQYGDSSFSVGFFGTEKLTLTSTDA----FNNIYFGCG 265

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
            +N G    +            R  +S +SQ      K FSYC    LP+   ++ +L F
Sbjct: 266 QNNQGLFGGSAGLLGL-----GRDKLSVVSQTAQKYNKIFSYC----LPSSSSSTGFLTF 316

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G         T  +     P +FY L    IS+  +++      F        G IIDSG
Sbjct: 317 GGSASKNAKFTPLSTISAGP-SFYGLDFTGISVGGKKLAISASVFST-----AGAIIDSG 370

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFED 276
           +V+T      Y  L   F +   ++ + +       +  CY F   T    P + F F  
Sbjct: 371 TVITRLPPAAYSALRASFRNLMSKYPMTKALSI---LDTCYDFSSYTTISVPKIGFSFSS 427

Query: 277 A-NLRIDGENVFIIDYENHFFLLAVAPHDDL--VALIGSQQQRDTRFVYDLNIDLLSFVK 333
              + ID   +      +    LA A + D   V + G+ QQ+     YD +   + F  
Sbjct: 428 GIEVDIDATGILYASSLSQ-VCLAFAGNSDATDVFIFGNVQQKTLEVFYDGSAGKVGFAP 486

Query: 334 ENCS 337
             CS
Sbjct: 487 GGCS 490


>gi|222629462|gb|EEE61594.1| hypothetical protein OsJ_16002 [Oryza sativa Japonica Group]
          Length = 468

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)

Query: 1   MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
           +V+L IGTP+  +    ++ DTGS L                Y   DP KS +F++++C 
Sbjct: 102 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 161

Query: 43  HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
            P C     V      +  C++  +Y D     G    +       G G G  +     F
Sbjct: 162 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 221

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           GC+   H  D  A  G   G+L L     SF++QLG     RFSYC    +P  E T   
Sbjct: 222 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 271

Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
                     +S+L+FG+   + R + +   F     + Y + LK +   +         
Sbjct: 272 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 327

Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
             + V+GE        ++DSG+ L +    V++ L  +     E   L +  D   P   
Sbjct: 328 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 384

Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
           CY    T     S+   F   A+L + G ++F  D      +  LAVA  +   A++G  
Sbjct: 385 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 442

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
            QR+    YDL+   ++F ++ C
Sbjct: 443 PQRNINVGYDLSTMEIAFDRDQC 465


>gi|115460260|ref|NP_001053730.1| Os04g0595000 [Oryza sativa Japonica Group]
 gi|113565301|dbj|BAF15644.1| Os04g0595000, partial [Oryza sativa Japonica Group]
          Length = 471

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 96/383 (25%), Positives = 154/383 (40%), Gaps = 66/383 (17%)

Query: 1   MVRLFIGTPSKGV---LLILDTGSALI---------------YAIFDPRKSSSFQKINCD 42
           +V+L IGTP+  +    ++ DTGS L                Y   DP KS +F++++C 
Sbjct: 105 LVQLRIGTPTDRISPRYVLFDTGSDLSWTQCEPCTNCSSFTPYPPHDPSKSRTFRRLSCF 164

Query: 43  HPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHET--ISVIGKGEGKAIFHGALF 94
            P C     V      +  C++  +Y D     G    +       G G G  +     F
Sbjct: 165 DPMCELCTAVVDGGGGSAGCLFRRRYGDGGAVSGELVSDVFHFGAAGDGGGYQLERDVAF 224

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           GC+   H  D  A  G   G+L L     SF++QLG     RFSYC    +P  E T   
Sbjct: 225 GCA---HVEDSKAVRGYSTGILALGIGKPSFVTQLGV---DRFSYC----IPASEITDDD 274

Query: 152 ----------SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDT 201
                     +S+L+FG+   + R + +   F     + Y + LK +   +         
Sbjct: 275 DDDDDDEERSASFLRFGS---HARMTGKRAPF-KQDGSGYAVRLKSVVYQHGGRLNQQQP 330

Query: 202 FDITVSGEGGC-----IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
             + V+GE        ++DSG+ L +    V++ L  +     E   L +  D   P   
Sbjct: 331 VPVYVAGEEAAAAMPMLVDSGTTLLWLPGSVFYPLQRRIE---EDISLTRRYDLTHPSLY 387

Query: 257 CYFLPETFNRFPSMAFYF-EDANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQ 313
           CY    T     S+   F   A+L + G ++F  D      +  LAVA  +   A++G  
Sbjct: 388 CYLGNMTDVEAVSVTLGFGGGADLELFGTSLFFTDENLTEDWVCLAVAAGNR--AILGVY 445

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
            QR+    YDL+   ++F ++ C
Sbjct: 446 PQRNINVGYDLSTMEIAFDRDQC 468


>gi|222637379|gb|EEE67511.1| hypothetical protein OsJ_24961 [Oryza sativa Japonica Group]
          Length = 641

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 150/379 (39%), Gaps = 68/379 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI------------------------FDPRKSSSFQ 37
            RL+IGTPS+   LI+D+GS + Y                          F P  SS++ 
Sbjct: 93  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 152

Query: 38  KINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            + C+  DCT   C NE  QC Y  +YA+ S + G    + +S   + E K     A+FG
Sbjct: 153 PVKCNV-DCT---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFG 206

Query: 96  CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYT 151
           C N   G  F + A      G++GL R  +S + QL    +I   FS C         Y 
Sbjct: 207 CENTETGDLFSQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YG 252

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS 207
              +  GT +    P+     F +H N     +Y + LK+I +  + +   P  F+    
Sbjct: 253 GMDVGGGTMVLGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN---- 307

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPE 262
            + G ++DSG+   Y     +    +   +     +  +  D P    +C+      + +
Sbjct: 308 SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQ 366

Query: 263 TFNRFPSMAFYFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
               FP +   F +   L +  EN        E  + L       D   L+G    R+T 
Sbjct: 367 LSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTL 426

Query: 320 FVYDLNIDLLSFVKENCSD 338
             YD + + + F K NCS+
Sbjct: 427 VTYDRHNEKIGFWKTNCSE 445


>gi|218199944|gb|EEC82371.1| hypothetical protein OsI_26705 [Oryza sativa Indica Group]
          Length = 642

 Score = 82.4 bits (202), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 150/379 (39%), Gaps = 68/379 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI------------------------FDPRKSSSFQ 37
            RL+IGTPS+   LI+D+GS + Y                          F P  SS++ 
Sbjct: 94  TRLYIGTPSQEFALIVDSGSTVTYVPCATCEQCGNHQSESPNIIEAHDPRFQPDLSSTYS 153

Query: 38  KINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            + C+  DCT   C NE  QC Y  +YA+ S + G    + +S   + E K     A+FG
Sbjct: 154 PVKCNV-DCT---CDNERSQCTYERQYAEMSSSSGVLGEDIMSFGKESELKP--QRAVFG 207

Query: 96  CSNDNHG--FDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYT 151
           C N   G  F + A      G++GL R  +S + QL    +I   FS C         Y 
Sbjct: 208 CENTETGDLFSQHAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLC---------YG 253

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS 207
              +  GT +    P+     F +H N     +Y + LK+I +  + +   P  F+    
Sbjct: 254 GMDVGGGTMVLGGMPAPPDMVF-SHSNPVRSPYYNIELKEIHVAGKALRLDPKIFN---- 308

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPE 262
            + G ++DSG+   Y     +    +   +     +  +  D P    +C+      + +
Sbjct: 309 SKHGTVLDSGTTYAYLPEQAFVAFKDAVTNKVNSLKKIRGPD-PNYKDICFAGAGRNVSQ 367

Query: 263 TFNRFPSMAFYFEDAN-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
               FP +   F +   L +  EN        E  + L       D   L+G    R+T 
Sbjct: 368 LSEVFPDVDMVFGNGQKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTL 427

Query: 320 FVYDLNIDLLSFVKENCSD 338
             YD + + + F K NCS+
Sbjct: 428 VTYDRHNEKIGFWKTNCSE 446


>gi|168030587|ref|XP_001767804.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680886|gb|EDQ67318.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 97/388 (25%), Positives = 142/388 (36%), Gaps = 84/388 (21%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYAI-------------------------FDPRKSSSFQ 37
           R+FIGTP     LI+DTGS + Y                           F P  SSS+Q
Sbjct: 43  RVFIGTPPNEFALIVDTGSTVTYVPCSSCTHCGHHQASFSTHRLFCRDPRFKPENSSSYQ 102

Query: 38  KINCDHPDCTYFKC--VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-F 94
           KI C   DC    C   + QC Y   YA+ S +KG    + +     G    +    L F
Sbjct: 103 KIGCRSSDCITGLCDSNSHQCKYERMYAEMSTSKGVLGKDLLDF---GPASRLQSQLLSF 159

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL----------- 141
           GC     G   D       G++GL R  +S + QL     I+  FS C            
Sbjct: 160 GCETAESG---DLYLQVADGIMGLGRGPLSIVDQLVGNGAIEDSFSLCYGGMDEGGGSMV 216

Query: 142 --VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
              IP P+G      + F      R             +N+Y L L +I +    +    
Sbjct: 217 LGAIPAPSG------MVFAKSDPRR-------------SNYYNLELTEIQVQGASLKLDS 257

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP--IQLC 257
           + F+    G+ G I+DSG+   Y     +    +  V+     Q     D P+P    +C
Sbjct: 258 NVFN----GKFGTILDSGTTYAYLPDRAFEAFTDAVVAQLGSLQAV---DGPDPNYPDIC 310

Query: 258 YF-----LPETFNRFPSMAFYF-EDANLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALI 310
           Y        E    FP + F F E+  + +  EN +F        + L    + D   L+
Sbjct: 311 YAGAGTDTKELGKHFPLVDFVFAENQKVSLAPENYLFKHTKVPGAYCLGFFKNQDATTLL 370

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENCSD 338
           G    R+    YD     + F+K NC++
Sbjct: 371 GGIIVRNMLVTYDRYNHQIGFLKTNCTE 398


>gi|53791672|dbj|BAD53242.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 504

 Score = 82.0 bits (201), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G+P K   + +DTGS +++                     F+P  SS+  KI C 
Sbjct: 93  TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152

Query: 43  HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              CT     +E          C YT  Y D S T G+   +T+   +V+G  +      
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 212

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCSN   G D    D A+ G+ G  +  +S +SQL S  +  K FS+CL       
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     L  G  +    P    T  + + P+  Y L+L+ I ++ +++  P D+   T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
              G I+DSG+ L Y     Y    + FV+         +         C+    + +  
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 376

Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           FP+++ YF     + +  EN  +    ID    + +         + ++G    +D  FV
Sbjct: 377 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 436

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 437 YDLANMRMGWTDYDCS 452


>gi|356568907|ref|XP_003552649.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 82.0 bits (201), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 97/392 (24%), Positives = 153/392 (39%), Gaps = 90/392 (22%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP K   L +DTGS +++                    ++D ++SSS + + CD
Sbjct: 87  AKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSNLGMDLTLYDIKESSSGKFVPCD 146

Query: 43  HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              C             N  C Y   Y D S T G+   + +    V G  +  +     
Sbjct: 147 QEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSI 206

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G    + + AL G+LG  +   S ISQL S   +KK F++CL     NG  
Sbjct: 207 VFGCGARQSGDLSSSNEEALGGILGFGKANSSMISQLASSGKVKKMFAHCL-----NGVN 261

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + + P+  Y +++  + + +  ++   DT   T    
Sbjct: 262 GGGIFAIGHVV---QPKVNMTPLLPDQPH--YSVNMTAVQVGHAFLSLSTDTS--TQGDR 314

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
            G IIDSG+ L Y    +Y  L  K +S     ++  L D       C+   E+  + FP
Sbjct: 315 KGTIIDSGTTLAYLPEGIYEPLVYKIISQHPDLKVRTLHD----EYTCFQYSESVDDGFP 370

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQ----QQRDTR 319
           ++ FYFE+                     L V PHD L        IG Q    Q RD++
Sbjct: 371 AVTFYFENG------------------LSLKVYPHDYLFPSGDFWCIGWQNSGTQSRDSK 412

Query: 320 FV--------------YDLNIDLLSFVKENCS 337
            +              YDL   ++ + + NCS
Sbjct: 413 NMTLLGDLVLSNKLVFYDLENQVIGWTEYNCS 444


>gi|326529233|dbj|BAK01010.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 441

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 90/374 (24%), Positives = 151/374 (40%), Gaps = 64/374 (17%)

Query: 5   FIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCD----- 42
            IG P +    ++DTGS LI+                   ++  +SS+F  + C      
Sbjct: 89  LIGDPPQRAAALIDTGSNLIWTQCGTTCGLKACAKQDLPYYNLSRSSTFAAVPCADSAKL 148

Query: 43  -HPDCTYFKCVNEQCVYTMKYADQSV-----TKGFAAHETISVIGKGEGKAIFHGALFGC 96
              +  +   ++  C +   Y   SV     T+ F      + +G            FGC
Sbjct: 149 CAANGVHLCGLDGSCTFAASYGAGSVFGSLGTEAFTFQSGAAKLG------------FGC 196

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
            +      + A +GA +G++GL R  +S +SQ G+    +FSYCL   L N    SS+L 
Sbjct: 197 VSLTR-ITKGALNGA-SGLIGLGRGRLSLVSQTGA---TKFSYCLTPYLRN-HGASSHLF 250

Query: 157 FGTDMGYRRPSTQATK--FINHP-----NNFYYLSLKDISIDNERMNFPPDTFDI--TVS 207
            G            T   F+  P     + FYYL L  IS+   ++  P   F++    +
Sbjct: 251 VGASASLSGGGGAVTSIPFVKSPEDYPYSTFYYLPLVGISVGETKLPIPSAAFELRRVAA 310

Query: 208 G--EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
           G   GG IID+GS +T      Y  L ++      R  +   +D    + LC    +   
Sbjct: 311 GYWSGGVIIDTGSPVTSLAEAAYSALSDEVARQLNRSLVQPPAD--TGLDLCVARQDVDK 368

Query: 266 RFPSMAFYFED-ANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
             P + F+F   A++ +   + +  +D      L+    ++ +   IG+ QQ+D   +YD
Sbjct: 369 VVPVLVFHFGGGADMAVSAGSYWGPVDKSTACMLIEEGGYETV---IGNFQQQDVHLLYD 425

Query: 324 LNIDLLSFVKENCS 337
           +    LSF   +CS
Sbjct: 426 IGKGELSFQTADCS 439


>gi|413951280|gb|AFW83929.1| hypothetical protein ZEAMMB73_279135 [Zea mays]
          Length = 451

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/363 (21%), Positives = 149/363 (41%), Gaps = 46/363 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY 48
           + R  +GTP++ +L+ +D  +   +              FDP +SS+++ + C  P C+ 
Sbjct: 108 VARARLGTPAQALLVAIDPSNDAAWVPCAACAGCARAPSFDPTRSSTYRPVRCGAPQCSQ 167

Query: 49  FKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                    +   C + + YA  S  +     + +++    +  A +    FGC +   G
Sbjct: 168 APAPSCPGGLGSSCAFNLSYA-ASTFQALLGQDALALHDDVDAVAAYT---FGCLHVVTG 223

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                +     G++G  R  +SF SQ   +    FSYCL  P       S  L+ G    
Sbjct: 224 GSVPPQ-----GLVGFGRGPLSFPSQTKDVYGSVFSYCL--PSYKSSNFSGTLRLGPAGQ 276

Query: 163 YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVL 220
            +R   + T  +++P+  + YY+++  I +    +  P        +   G I+D+G++ 
Sbjct: 277 PKR--IKTTPLLSNPHRPSLYYVNMVGIRVGGRPVPVPASALAFDPTSGRGTIVDAGTMF 334

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANL 279
           T   + VY  + + F S         L         CY         P++ F F+   ++
Sbjct: 335 TRLSAPVYAAVRDVFRSRVRAPVAGPLGG----FDTCY---NVTISVPTVTFSFDGRVSV 387

Query: 280 RIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVKE 334
            +  ENV I         LA+A  P D + A   ++ S QQ++ R ++D+    + F +E
Sbjct: 388 TLPEENVVIRSSSGGIACLAMAAGPPDGVDAALNVLASMQQQNHRVLFDVANGRVGFSRE 447

Query: 335 NCS 337
            C+
Sbjct: 448 LCT 450


>gi|125529158|gb|EAY77272.1| hypothetical protein OsI_05246 [Oryza sativa Indica Group]
          Length = 426

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 84/364 (23%), Positives = 143/364 (39%), Gaps = 49/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILD-------------TGSALIYAIFDPRKSSSFQKINCDHPDCT 47
           + R  +GTP++ +L+ +D              G A     F P +SS+++ + C  P C 
Sbjct: 84  IARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCA 143

Query: 48  YFKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                     V   C + + YA  S  +     +++++    E   +     FGC     
Sbjct: 144 QVPSPSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLAL----ENNVVVS-YTFGCLRVVS 197

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGT 159
           G     +     G++G  R  +SF+SQ        FSYCL    PN   +  S  LK G 
Sbjct: 198 GNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCL----PNYRSSNFSGTLKLGP 248

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               +R  T    +  H  + YY+++  I + ++ +  P            G IID+G++
Sbjct: 249 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 308

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-N 278
            T   + VY  + + F           L         CY         P++ F F  A  
Sbjct: 309 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG----FDTCY---NVTVSVPTVTFMFAGAVA 361

Query: 279 LRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVK 333
           + +  ENV I         LA+A  P D + A   ++ S QQ++ R ++D+    + F +
Sbjct: 362 VTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSR 421

Query: 334 ENCS 337
           E C+
Sbjct: 422 ELCT 425


>gi|357127505|ref|XP_003565420.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 466

 Score = 81.6 bits (200), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 85/337 (25%), Positives = 139/337 (41%), Gaps = 38/337 (11%)

Query: 22  ALIYAIFDPRKSSSFQKINCDHPDC----TYFKCVNEQ--CVYTMKYADQSVTKGFAAHE 75
           A++Y  F+P  SSS+ ++ CD P C    T   C  +   C +   Y D +   G  A +
Sbjct: 139 AVVY--FNPFDSSSYSRVGCDGPSCLALATNASCNGDSHACDFRYSYRDGASATGLLAAD 196

Query: 76  TISVIGKGEGKAIFHGAL-FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK 134
           T +  G          ++ FGC+    G     R+    G++GL    +S  SQLG    
Sbjct: 197 TFTFGGNINNDTTSTASIDFGCATGTAG-----REFQADGMVGLGAGPLSLASQLG---- 247

Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDN 192
           ++FS+CL     + +  SS L FG       P    T  I   +N   YY     ISID+
Sbjct: 248 RKFSFCLTA--YDIDDASSILNFGARAVVSDPGAATTPLIASSSNAAAYYA----ISIDS 301

Query: 193 ERMNFPPDTFDITVSGEGGCIIDSGSVLTYF-HSDVYWKLHEKFVSYFERFQLAQLSDCP 251
            ++   P     +VS     I+D+G+VLT+   + +   L E      +   L +     
Sbjct: 302 LKVAGQPVPGTTSVS---KVIVDTGTVLTFLDRAALLAPLTESLARVMDGAGLPRAPPPD 358

Query: 252 EPIQLCY---FLPETFNRFPSMAFYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDD 305
           E ++LCY    + +     P +           +R+ GE  F++  E    L  V    +
Sbjct: 359 ETLELCYDVSRVKDVDGVIPDVTLVLGGGGGGEVRLTGEGTFVLVKEGVLCLAVVTTSPE 418

Query: 306 L--VALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
           L  ++++G+   +D     DL+    +F   NC   S
Sbjct: 419 LQPLSVLGNVALQDLHVGIDLDARTATFATANCDSSS 455


>gi|222619345|gb|EEE55477.1| hypothetical protein OsJ_03658 [Oryza sativa Japonica Group]
          Length = 530

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/376 (24%), Positives = 152/376 (40%), Gaps = 56/376 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G+P K   + +DTGS +++                     F+P  SS+  KI C 
Sbjct: 119 TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 178

Query: 43  HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              CT     +E          C YT  Y D S T G+   +T+   +V+G  +      
Sbjct: 179 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 238

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCSN   G D    D A+ G+ G  +  +S +SQL S  +  K FS+CL       
Sbjct: 239 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 293

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     L  G  +    P    T  + + P+  Y L+L+ I ++ +++  P D+   T S
Sbjct: 294 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 346

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-R 266
              G I+DSG+ L Y     Y    + FV+         +         C+    + +  
Sbjct: 347 NTQGTIVDSGTTLAYLADGAY----DPFVNAITAAVSPSVRSLVSKGNQCFVTSSSVDSS 402

Query: 267 FPSMAFYFEDA-NLRIDGENVFI----IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
           FP+++ YF     + +  EN  +    ID    + +         + ++G    +D  FV
Sbjct: 403 FPTVSLYFMGGVAMTVKPENYLLQQASIDNNVLWCIGWQRNQGQQITILGDLVLKDKIFV 462

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 463 YDLANMRMGWTDYDCS 478


>gi|16209647|gb|AAL14384.1| AT3g52500/F22O6_120 [Arabidopsis thaliana]
          Length = 469

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 95/389 (24%), Positives = 146/389 (37%), Gaps = 65/389 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALI----------------------YAIFDPRKSSSFQKI 39
           V L  GTPS+ +  + DTGS+L+                         F P+ SSS + I
Sbjct: 92  VSLSFGTPSQTIPFVFDTGSSLVCLPCTSRYLCSGCDFSGLDPTLIPRFIPKNSSSSKII 151

Query: 40  NCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF----- 94
            C  P C +    N QC        ++ T G   +     +G   G  I     F     
Sbjct: 152 GCQSPKCQFLYGPNVQC-RGCDPNTRNCTVGCPPYILQYGLGSTAGVLITEKLDFPDLTV 210

Query: 95  -----GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGE 149
                GCS               AG+ G  R  +S  SQ+     KRFS+CLV    +  
Sbjct: 211 PDFVVGCS--------IISTRQPAGIAGFGRGPVSLPSQMN---LKRFSHCLVSRRFDDT 259

Query: 150 YTSSYLKF----GTDMGYRRPSTQATKFINHPN-------NFYYLSLKDISIDNERMNFP 198
             ++ L      G + G + P    T F  +PN        +YYL+L+ I +  + +  P
Sbjct: 260 NVTTDLDLDTGSGHNSGSKTPGLTYTPFRKNPNVSNKAFLEYYYLNLRRIYVGRKHVKIP 319

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
                   +G+GG I+DSGS  T+    V+  + E+F S    +   +  +    +  C+
Sbjct: 320 YKYLAPGTNGDGGSIVDSGSTFTFMERPVFELVAEEFASQMSNYTREKDLEKETGLGPCF 379

Query: 259 FLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL--------VA 308
            +    +   P + F F+  A L +   N F          L V     +          
Sbjct: 380 NISGKGDVTVPELIFEFKGGAKLELPLSNYFTFVGNTDTVCLTVVSDKTVNPSGGTGPAI 439

Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           ++GS QQ++    YDL  D   F K+ CS
Sbjct: 440 ILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|56202144|dbj|BAD73477.1| chloroplast nucleoid DNA binding protein-like [Oryza sativa
           Japonica Group]
 gi|125571574|gb|EAZ13089.1| hypothetical protein OsJ_03009 [Oryza sativa Japonica Group]
          Length = 316

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 85/316 (26%), Positives = 126/316 (39%), Gaps = 48/316 (15%)

Query: 56  CVYTMKYADQSVTKGFAA--HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALA 113
           C    +Y D S  +G       TI++ G+   KA   G + GC+   +G    A DG   
Sbjct: 12  CSAARRYKDGSAARGTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDG--- 68

Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY--RRPS---- 167
            VL L    ISF S+  S    RFSYCLV  L     T SYL FG +  +  RRPS    
Sbjct: 69  -VLSLGYSNISFASRAASRFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAFSSRRPSEGTA 126

Query: 168 ------------------TQATKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSG 208
                              Q    ++H    FY +++K +S+  E +  P   +D  V  
Sbjct: 127 SCKPAPAPTPAPAGAPGARQTPLVLDHRTRPFYAVTVKGVSVAGELLKIPRAVWD--VEQ 184

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP-EPIQLCYFL-----PE 262
            GG I+DSG+ LT      Y       V+   + +LA L     +P   CY        +
Sbjct: 185 GGGAILDSGTSLTMLAKPAY----RAVVAALSK-RLAGLPRVTMDPFDYCYNWTSPSGSD 239

Query: 263 TFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRF 320
                P +A +F  +         ++ID       + +   P   L ++IG+  Q++  +
Sbjct: 240 VAAPLPMLAVHFAGSARLEPPAKSYVIDAAPGVKCIGLQEGPWPGL-SVIGNILQQEHLW 298

Query: 321 VYDLNIDLLSFVKENC 336
            YDL    L F +  C
Sbjct: 299 EYDLKNRRLRFKRSRC 314


>gi|4646203|gb|AAD26876.1|AC007230_10 Belongs to PF|00026 Eukaryotic aspartyl protease family
           [Arabidopsis thaliana]
          Length = 449

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/370 (22%), Positives = 151/370 (40%), Gaps = 56/370 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G+P K   + +DTGS +++                   ++FD   SS+ +K+ CD
Sbjct: 76  TKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVGCD 135

Query: 43  HPDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALF 94
              C++    +       C Y + YAD+S + G    + ++   V G  +   +    +F
Sbjct: 136 DDFCSFISQSDSCQPALGCSYHIVYADESTSDGKFIRDMLTLEQVTGDLKTGPLGQEVVF 195

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTS 152
           GC +D  G   +  D A+ GV+G  +   S +SQL +    K+ FS+CL      G +  
Sbjct: 196 GCGSDQSGQLGNG-DSAVDGVMGFGQSNTSVLSQLAATGDAKRVFSHCLDNVKGGGIFAV 254

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGG 211
             +          P  + T  +  PN  +Y + L  + +D   ++ P      ++   GG
Sbjct: 255 GVVD--------SPKVKTTPMV--PNQMHYNVMLMGMDVDGTSLDLPR-----SIVRNGG 299

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
            I+DSG+ L YF   +Y  L E  ++     Q  +L    E  Q   F       FP ++
Sbjct: 300 TIVDSGTTLAYFPKVLYDSLIETILAR----QPVKLHIVEETFQCFSFSTNVDEAFPPVS 355

Query: 272 FYFEDANLRIDGENVFIIDYENHFFLLA------VAPHDDLVALIGSQQQRDTRFVYDLN 325
           F FED+       + ++   E   +                V L+G     +   VYDL+
Sbjct: 356 FEFEDSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYDLD 415

Query: 326 IDLLSFVKEN 335
            +++ +   N
Sbjct: 416 NEVIGWADHN 425


>gi|115442309|ref|NP_001045434.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|20161865|dbj|BAB90778.1| nucleoid DNA-binding-like protein [Oryza sativa Japonica Group]
 gi|113534965|dbj|BAF07348.1| Os01g0954900 [Oryza sativa Japonica Group]
 gi|215766867|dbj|BAG99095.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 445

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 84/364 (23%), Positives = 143/364 (39%), Gaps = 49/364 (13%)

Query: 1   MVRLFIGTPSKGVLLILD-------------TGSALIYAIFDPRKSSSFQKINCDHPDCT 47
           + R  +GTP++ +L+ +D              G A     F P +SS+++ + C  P C 
Sbjct: 103 IARAGLGTPAQTLLVAIDPSNDAAWVPCSACAGCAASSPSFSPTQSSTYRTVPCGSPQCA 162

Query: 48  YFKC------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                     V   C + + YA  S  +     +++++    E   +     FGC     
Sbjct: 163 QVPSPSCPAGVGSSCGFNLTYA-ASTFQAVLGQDSLAL----ENNVVVS-YTFGCLRVVS 216

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--SSYLKFGT 159
           G     +     G++G  R  +SF+SQ        FSYCL    PN   +  S  LK G 
Sbjct: 217 GNSVPPQ-----GLIGFGRGPLSFLSQTKDTYGSVFSYCL----PNYRSSNFSGTLKLGP 267

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
               +R  T    +  H  + YY+++  I + ++ +  P            G IID+G++
Sbjct: 268 IGQPKRIKTTPLLYNPHRPSLYYVNMIGIRVGSKVVQVPQSALAFNPVTGSGTIIDAGTM 327

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA-N 278
            T   + VY  + + F           L         CY         P++ F F  A  
Sbjct: 328 FTRLAAPVYAAVRDAFRGRVRTPVAPPLGG----FDTCY---NVTVSVPTVTFMFAGAVA 380

Query: 279 LRIDGENVFIIDYENHFFLLAVA--PHDDLVA---LIGSQQQRDTRFVYDLNIDLLSFVK 333
           + +  ENV I         LA+A  P D + A   ++ S QQ++ R ++D+    + F +
Sbjct: 381 VTLPEENVMIHSSSGGVACLAMAAGPSDGVNAALNVLASMQQQNQRVLFDVANGRVGFSR 440

Query: 334 ENCS 337
           E C+
Sbjct: 441 ELCT 444


>gi|413953788|gb|AFW86437.1| hypothetical protein ZEAMMB73_618532 [Zea mays]
          Length = 469

 Score = 81.6 bits (200), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 143/376 (38%), Gaps = 86/376 (22%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
           +GTP+    LILDTGS+L +                 +FDP  SSS+  + CD  +C   
Sbjct: 135 LGTPAVPQTLILDTGSSLTWVQCKPCNSSQCYPQRLPLFDPNTSSSYSPVPCDSQECRAL 194

Query: 50  K-------CVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
                   C ++    C Y + Y   +   G  + + ++ +G G     FH   FGC + 
Sbjct: 195 AAGIDGDGCTSDGDWGCAYEIHYGSGATPAGEYSTDALT-LGPGAIVKRFH---FGCGHH 250

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR----FSYCLVIPLPNGEYTSSYL 155
                 D  D    GVLGL R+  S   Q  +   +R    FS+C    LP    ++ +L
Sbjct: 251 QQRGKFDMAD----GVLGLGRLPQSLAWQASA---RRGGGVFSHC----LPPTGVSTGFL 299

Query: 156 KFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
             G            + F+  P         FY L    IS+  + ++ PP  F      
Sbjct: 300 ALGAPH-------DTSAFVFTPLLTMDDQPWFYQLMPTAISVAGQLLDIPPAVF------ 346

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN--- 265
             G I DSG+VL+      Y  L   F S    + LA       P+     L   FN   
Sbjct: 347 REGVITDSGTVLSALQETAYTALRTAFRSAMAEYPLA------PPVG---HLDTCFNFTG 397

Query: 266 ----RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
                 P+++  F   A + +D  +  ++D    F+    +  D+   LIGS  QR    
Sbjct: 398 YDNVTVPTVSLTFRGGATVHLDASSGVLMDGCLAFW----SSGDEYTGLIGSVSQRTIEV 453

Query: 321 VYDLNIDLLSFVKENC 336
           +YD+    + F    C
Sbjct: 454 LYDMPGRKVGFRTGAC 469


>gi|357507805|ref|XP_003624191.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355499206|gb|AES80409.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 406

 Score = 81.3 bits (199), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 87/342 (25%), Positives = 142/342 (41%), Gaps = 43/342 (12%)

Query: 19  TGSALIYAIFDPRKSSSFQKINCDHPDCT------YFKCVNEQ-CVYTMKYADQSVTKGF 71
           +G  +   ++DP  S +   + C    CT         C  +  C Y++ Y D S T G 
Sbjct: 40  SGLGMDLTLYDPNGSKTSNAVPCGDGFCTDTYSGPISGCKQDMSCPYSITYGDGSTTSGS 99

Query: 72  AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
             +++++   V G    K      +FGC     G      D AL G++G  +   S +SQ
Sbjct: 100 FVNDSLTFDEVSGNLHTKPDNSSVIFGCGAKQSGSLSSNSDEALDGIIGFGQANSSVLSQ 159

Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
           L +   +K+ FS+CL     +  +       G  M  +  +T     + H    Y + LK
Sbjct: 160 LAASGKVKRIFSHCL-----DSHHGGGIFSIGQVMEPKFNTTPLVPRMAH----YNVILK 210

Query: 187 DISIDNERMNFPPDTFDITVSGEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLA 245
           D+ +D E +  P   FD   SG G G IIDSG+ L Y    +Y +L  K +      +L 
Sbjct: 211 DMDVDGEPILLPLYLFD---SGSGRGTIIDSGTTLAYLPLSIYNQLLPKVLGRQPGLKLM 267

Query: 246 QLSDCPEPIQL-CYFLPETFNR-FPSMAFYFEDANLRID--------GENVFIIDYENHF 295
            + D     Q  C+   +  +  FP + F+FE  +L +          E+++ I ++   
Sbjct: 268 IVED-----QFTCFHYSDKLDEGFPVVKFHFEGLSLTVHPHDYLFLYKEDIYCIGWQKSS 322

Query: 296 FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                    DL+ LIG     +   VYDL   ++ +   NCS
Sbjct: 323 --TQTKEGRDLI-LIGDLVLSNKLVVYDLENMVIGWTNFNCS 361


>gi|359806832|ref|NP_001241567.1| uncharacterized protein LOC100819698 precursor [Glycine max]
 gi|255638149|gb|ACU19388.1| unknown [Glycine max]
          Length = 437

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 79/361 (21%), Positives = 144/361 (39%), Gaps = 46/361 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +V+  IGTP++ +LL +DT +   +              F P KS++F+K+ C    C  
Sbjct: 99  IVKAKIGTPAQTLLLAMDTSNDASWVPCTACVGCSTTTPFAPAKSTTFKKVGCGASQCKQ 158

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C    C +   Y   SV       +T+++              FGC     G   
Sbjct: 159 VRNPTCDGSACAFNFTYGTSSVAASLV-QDTVTL-----ATDPVPAYAFGCIQKVTGSSV 212

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
             +           R  +S ++Q   + +  FSYCL  P       S  L+ G     +R
Sbjct: 213 PPQGLLGL-----GRGPLSLLAQTQKLYQSTFSYCL--PSFKTLNFSGSLRLGPVAQPKR 265

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T  + +P  ++ YY++L  I +    ++ PP+      +   G + DSG+V T  
Sbjct: 266 --IKFTPLLKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNANTGAGTVFDSGTVFTRL 323

Query: 224 HSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLR 280
               Y  +  +F   ++  ++  +  L         CY  P      P++ F F   N+ 
Sbjct: 324 VEPAYNAVRNEFRRRIAVHKKLTVTSLGG----FDTCYTAPIV---APTITFMFSGMNVT 376

Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  +N+ I         LA+AP  D    ++ +I + QQ++ R ++D+    L   +E C
Sbjct: 377 LPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVARELC 436

Query: 337 S 337
           +
Sbjct: 437 T 437


>gi|449451076|ref|XP_004143288.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 488

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 97/371 (26%), Positives = 156/371 (42%), Gaps = 54/371 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G P++   + +DTGS +++                    +FD  KSSS + + C  
Sbjct: 87  KVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLPCTD 146

Query: 44  PDCTYFKCVNEQCV-------YTMKYADQSVTKGFAAHETISV-IGKGEGKAIFHGA--L 93
           P C       +QC+       Y+  Y D+S T GF   +++   I  GE       A  +
Sbjct: 147 PICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSATIV 206

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYT 151
           FGCS   +G D      AL G+ G  +   S ISQL S  I  K FS+C    L  GE  
Sbjct: 207 FGCSIYQYG-DLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC----LKGGENG 261

Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGE 209
              L  G  +    PS   +  I + P+  Y L L+ I++  +   FP P  F I+ +GE
Sbjct: 262 GGILVLGEIL---EPSIVYSPLIPSQPH--YTLKLQSIALSGQL--FPNPTMFPISNAGE 314

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
              IIDSG+ L Y   +VY  +     S   +     +S   +  ++   + +    FP 
Sbjct: 315 --TIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI---FPV 369

Query: 270 MAFYFED-ANLRIDGENVFIID---YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLN 325
           + F FE  A++ +  E     D    E   + +     +D + ++G    +D   VYDL 
Sbjct: 370 LRFNFEGIASMVVTPEEYLQFDSIVREPALWCIGFQKAEDGLNILGDLVLKDKIIVYDLA 429

Query: 326 IDLLSFVKENC 336
              + +   +C
Sbjct: 430 RQRIGWANYDC 440


>gi|15232960|ref|NP_186923.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|6728988|gb|AAF26986.1|AC018363_31 putative aspartyl protease [Arabidopsis thaliana]
 gi|21593593|gb|AAM65560.1| putative aspartyl protease [Arabidopsis thaliana]
 gi|332640332|gb|AEE73853.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 488

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 84/377 (22%), Positives = 156/377 (41%), Gaps = 64/377 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-----IFDPRKS-------------SSFQKINCDH 43
            ++ +GTPS+   + +DTGS +++      I  PRKS             S+ + ++C  
Sbjct: 87  AKIGLGTPSRDFHVQVDTGSDILWVNCAGCIRCPRKSDLVELTPYDVDASSTAKSVSCSD 146

Query: 44  PDCTYFKCVNE-----QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGALFG 95
             C+Y    +E      C Y + Y D S T G+   + +    V G  +  +     +FG
Sbjct: 147 NFCSYVNQRSECHSGSTCQYVIMYGDGSSTNGYLVKDVVHLDLVTGNRQTGSTNGTIIFG 206

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSS 153
           C +   G   +++  A+ G++G  +   SFISQL S   +K+ F++CL     +      
Sbjct: 207 CGSKQSGQLGESQ-AAVDGIMGFGQSNSSFISQLASQGKVKRSFAHCL-----DNNNGGG 260

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG-EGGC 212
               G  +    P  + T  ++   + Y ++L  I + N  +    + FD   SG + G 
Sbjct: 261 IFAIGEVV---SPKVKTTPMLSKSAH-YSVNLNAIEVGNSVLELSSNAFD---SGDDKGV 313

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           IIDSG+ L Y    VY  L  + ++      L  + +       C+   +  +RFP++ F
Sbjct: 314 IIDSGTTLVYLPDAVYNPLLNEILASHPELTLHTVQESFT----CFHYTDKLDRFPTVTF 369

Query: 273 YFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRF 320
            F+ +             +R   E+ +   ++N             + ++G     +   
Sbjct: 370 QFDKSVSLAVYPREYLFQVR---EDTWCFGWQNGGLQTKGGAS---LTILGDMALSNKLV 423

Query: 321 VYDLNIDLLSFVKENCS 337
           VYD+   ++ +   NCS
Sbjct: 424 VYDIENQVIGWTNHNCS 440


>gi|449493359|ref|XP_004159266.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 511

 Score = 81.3 bits (199), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 95/398 (23%), Positives = 153/398 (38%), Gaps = 82/398 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           V L  GTP + +  I DTGS+L++                      + F P+ SSS + +
Sbjct: 134 VSLAFGTPPQNLSFIFDTGSSLVWFPCTAGYRCSRCSFPYVDPATISKFVPKLSSSVKVV 193

Query: 40  NCDHPDCTYF-----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
            C +P C +                  KC +    Y ++Y     T G    ET+ +   
Sbjct: 194 GCRNPKCAWIFGPNLKSRCRNCNSKSRKCSDSCPGYGLQYGS-GATAGILLSETLDL--- 249

Query: 83  GEGKAIFHGALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
            E K +    L GCS    H           AG+ G  R   S  SQ+     KRFS+CL
Sbjct: 250 -ENKRV-PDFLVGCSVMSVH---------QPAGIAGFGRGPESLPSQMR---LKRFSHCL 295

Query: 142 VIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP------------NNFYYLSLKDIS 189
           V    +    SS L    D G     ++   FI  P              +YYLSL+ I 
Sbjct: 296 VSRGFDDSPVSSPLVL--DSGSESDESKTKSFIYAPFRENPSVSNAAFREYYYLSLRRIL 353

Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
           I  + + FP        +G GG IIDSGS  T+    ++  + ++      ++  A+  +
Sbjct: 354 IGGKPVKFPYKYLVPDSTGNGGAIIDSGSTFTFLDKPIFEAIADELEKQLVKYPRAKDVE 413

Query: 250 CPEPIQLCYFLP--ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL 306
               ++ C+ +P  E    FP +   F+    L +  EN   +  +     L +   + +
Sbjct: 414 AQSGLRPCFNIPKEEESAEFPDVVLKFKGGGKLSLAAENYLAMVTDEGVVCLTMMTDEAV 473

Query: 307 -------VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                    ++G+ QQ++    YDL    + F K+ C+
Sbjct: 474 VGGGGGPAIILGAFQQQNVLVEYDLAKQRIGFRKQKCT 511


>gi|225440729|ref|XP_002275391.1| PREDICTED: aspartic proteinase nepenthesin-1 [Vitis vinifera]
 gi|147789748|emb|CAN67404.1| hypothetical protein VITISV_025615 [Vitis vinifera]
          Length = 450

 Score = 81.3 bits (199), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 94/386 (24%), Positives = 156/386 (40%), Gaps = 66/386 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKI 39
           + L  GTP + +  ++DTGS +++A                      IFDP+ SSS + +
Sbjct: 80  ISLSFGTPPQKLSFLVDTGSDVVWAPCTTDYTCTNCSFSAADPKKVPIFDPKLSSSSKIL 139

Query: 40  NCDHPDC--TYFKCVN----------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKA 87
           +C +P C  TYF  V+          + C Y   Y+ Q  T   + +  +  + K   K 
Sbjct: 140 DCRNPKCVSTYFPYVHLGCPRCNGNSKHCSYACPYSTQYGTGASSGYFLLENL-KFPRKT 198

Query: 88  IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
           I    L GC+         AR+ +   + G  R   S   Q+G    K+F+YCL     +
Sbjct: 199 I-RNFLLGCTT------SAARELSSDALAGFGRSMFSLPIQMGV---KKFAYCLN----S 244

Query: 148 GEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNN---FYYLSLKDISIDNERMNFPPDT 201
            +Y  +       + YR   T+    T F+  P     +Y+L +KDI I N+ +  P   
Sbjct: 245 HDYDDTRNSGKLILDYRDGKTKGLSYTPFLKSPPASAFYYHLGVKDIKIGNKLLRIPSKY 304

Query: 202 FDITVSGEGGCIIDSG-SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-F 259
                 G  G IIDSG     Y    V+  +  +      +++ +  ++    +  CY F
Sbjct: 305 LAPGSDGRSGVIIDSGYGGAGYMTGPVFKIVTNELKKQMSKYRRSLEAETQTGLTPCYNF 364

Query: 260 LPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHF--FLL------AVAPHDDLVALI 310
                 + P + + F   AN+ + G+N F I  +     FL+      A+    D   ++
Sbjct: 365 TGHKSIKIPPLIYQFRGGANMVVPGKNYFGISPQESLACFLMDTNGTNALEITPDPSIIL 424

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
           G+ Q  D    YDL  D   F ++ C
Sbjct: 425 GNSQHVDYYVEYDLKNDRFGFRRQTC 450


>gi|413925432|gb|AFW65364.1| hypothetical protein ZEAMMB73_378208 [Zea mays]
          Length = 418

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 96/364 (26%), Positives = 144/364 (39%), Gaps = 68/364 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           +GTP + +  + DTGS LI+A               + P KSSSF K+ C    C     
Sbjct: 87  MGTPPQTLSALADTGSDLIWAKCGACKRCAPRGSASYYPTKSSSFSKLPCSSALCRTLES 146

Query: 50  ---------KCVNEQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
                    +     C Y   Y   S     T+G+   ET ++     G     G  FGC
Sbjct: 147 QSLATCGGTRARGAVCSYRYSYGLSSNPHHYTQGYMGSETFTL-----GSDAVQGIGFGC 201

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           +  + G                 R  +S + QL       FSYCL     +   TSS L 
Sbjct: 202 TTMSEGGYGSGSGLVGL-----GRGKLSLVRQL---KVGAFSYCLT----SDPSTSSPLL 249

Query: 157 FGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           FG       P  Q+T  +N   + FY ++L  ISI   +    P T      G  G I D
Sbjct: 250 FGAG-ALTGPGVQSTPLVNLKTSTFYTVNLDSISIGAAKT---PGT------GRHGIIFD 299

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP--EPIQLCYFLPETFNRFPSMAFY 273
           SG+ LT+     Y       +S     Q   L+  P  +  ++C F       FPSM  +
Sbjct: 300 SGTTLTFLAEPAYTLAEAGLLS-----QTTNLTRVPGTDGYEVC-FQTSGGAVFPSMVLH 353

Query: 274 FEDANLRIDGENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F+  ++ +  EN F  ++     +L+  +P +  ++++G+  Q D    YDL+  +LSF 
Sbjct: 354 FDGGDMALKTENYFGAVNDSVSCWLVQKSPSE--MSIVGNIMQMDYHIRYDLDKSVLSFQ 411

Query: 333 KENC 336
             NC
Sbjct: 412 PTNC 415


>gi|224128838|ref|XP_002320434.1| predicted protein [Populus trichocarpa]
 gi|222861207|gb|EEE98749.1| predicted protein [Populus trichocarpa]
          Length = 485

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 91/381 (23%), Positives = 158/381 (41%), Gaps = 70/381 (18%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ IGTP+K   + +DTGS +++                    +++  +S + + + CD 
Sbjct: 81  KIGIGTPTKDYYVQVDTGSDIMWVNCIQCRECPKTSSLGIDLTLYNINESDTGKLVPCDQ 140

Query: 44  -----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIF 89
                      P CT     N  C Y   Y D S T G+   + +    V G  +  A  
Sbjct: 141 EFCYEINGGQLPGCT----ANMSCPYLEIYGDGSSTAGYFVKDVVQYARVSGDLKTTAAN 196

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPN 147
              +FGC     G    + + AL G+LG  +   S ISQL     +KK F++CL     +
Sbjct: 197 GSVIFGCGARQSGDLGSSNEEALDGILGFGKSNSSMISQLAVTGKVKKIFAHCL-----D 251

Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITV 206
           G         G  +   +P    T  I N P+  Y +++  + + +E ++ P D F+   
Sbjct: 252 GTNGGGIFVIGHVV---QPKVNMTPLIPNQPH--YNVNMTAVQVGHEFLSLPTDVFE--A 304

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-N 265
               G IIDSG+ L Y    VY  L  K +S     ++  + D       C+   ++  +
Sbjct: 305 GDRKGAIIDSGTTLAYLPEMVYKPLVSKIISQQPDLKVHTVRD----EYTCFQYSDSLDD 360

Query: 266 RFPSMAFYFEDAN-LRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQR 316
            FP++ F+FE++  L++         E ++ I ++N      V   D   + L+G     
Sbjct: 361 GFPNVTFHFENSVILKVYPHEYLFPFEGLWCIGWQNS----GVQSRDRRNMTLLGDLVLS 416

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +   +YDL    + + + NCS
Sbjct: 417 NKLVLYDLENQAIGWTEYNCS 437


>gi|356505293|ref|XP_003521426.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 499

 Score = 80.9 bits (198), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 87/374 (23%), Positives = 155/374 (41%), Gaps = 55/374 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G+P+K   + +DTGS +++                     FD   SS+   ++C  
Sbjct: 86  KVKLGSPAKDFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCAD 145

Query: 44  PDCTYF------KCVNE--QCVYTMKYADQSVTKGFAAHETI----SVIGKGEGKAIFHG 91
           P C+Y        C ++  QC YT +Y D S T G+   +T+     ++G+         
Sbjct: 146 PICSYAVQTATSGCSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSMVANSSST 205

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
            +FGCS    G D    D A+ G+ G     +S ISQL S  +  K FS+C    L  GE
Sbjct: 206 IVFGCSTYQSG-DLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC----LKGGE 260

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
                L  G  +    PS   +  + + P+  Y L+L+ I+++ + +  P D+     + 
Sbjct: 261 NGGGVLVLGEIL---EPSIVYSPLVPSLPH--YNLNLQSIAVNGQLL--PIDSNVFATTN 313

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRF 267
             G I+DSG+ L Y   + Y    +   +   +F    +S   +    CY +  +  + F
Sbjct: 314 NQGTIVDSGTTLAYLVQEAYNPFVDAITAAVSQFSKPIISKGNQ----CYLVSNSVGDIF 369

Query: 268 PSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           P ++  F      +     +++ Y        + +     +    ++G    +D  FVYD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDSAAMWCIGFQKVERGFTILGDLVLKDKIFVYD 429

Query: 324 LNIDLLSFVKENCS 337
           L    + +   NCS
Sbjct: 430 LANQRIGWADYNCS 443


>gi|115457778|ref|NP_001052489.1| Os04g0337000 [Oryza sativa Japonica Group]
 gi|113564060|dbj|BAF14403.1| Os04g0337000, partial [Oryza sativa Japonica Group]
          Length = 321

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 79/303 (26%), Positives = 130/303 (42%), Gaps = 58/303 (19%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH--- 43
           IGTP+K   + +DTGS +++                    ++DP+ SS+  K++CD    
Sbjct: 39  IGTPTKRYYVQVDTGSDILWVNCISCDRCPRKSGLGLELTLYDPKDSSTGSKVSCDQGFC 98

Query: 44  --------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
                   P CT     +  C Y++ Y D S T G+   + +    V G G+ +      
Sbjct: 99  AATYGGLLPGCT----TSLPCEYSVTYGDGSSTTGYFVSDLLQFDQVSGDGQTRPANSTV 154

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC +   G D  + + AL G++G  +   S +SQL +   +KK F++CL      G +
Sbjct: 155 TFGCGSQQGG-DLGSSNQALDGIIGFGQSNTSMLSQLSAAGKVKKIFAHCLDTINGGGIF 213

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +         +P  + T  + N P+  Y ++LK I +    +  P   FD   +GE
Sbjct: 214 AIGNVV--------QPKVKTTPLVPNMPH--YNVNLKSIDVGGTALKLPSHMFD---TGE 260

Query: 210 -GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFP 268
             G IIDSG+ LTY    VY    E  ++ F + +     +  E +   Y    T    P
Sbjct: 261 KKGTIIDSGTTLTYLPEIVY---KEIMLAVFAKHKDITFHNVQEFLCFQYVGRYTLQHTP 317

Query: 269 SMA 271
           S++
Sbjct: 318 SVS 320


>gi|218197468|gb|EEC79895.1| hypothetical protein OsI_21423 [Oryza sativa Indica Group]
          Length = 471

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 124/325 (38%), Gaps = 42/325 (12%)

Query: 26  AIFDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
           A+FDPR+S +   + C    C         C N QC Y + Y D   T G    + +++ 
Sbjct: 175 ALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL- 233

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                  +     FGCS+   G    +  G ++  LG  R   S +SQ  +     FSYC
Sbjct: 234 ---NPSTVVMNFRFGCSHAVRGNFSASTSGTMS--LGGGRQ--SLLSQTAATFGNAFSYC 286

Query: 141 LVIPLPNG-------EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
           +  P  +G              +F      R PS   T         Y + L+ I +   
Sbjct: 287 VPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPT--------LYLVRLRGIEVGGR 338

Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
           R+N PP  F       GG ++DS  ++T      Y  L   F S    +   +++     
Sbjct: 339 RLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYP--RVAGGRAG 390

Query: 254 IQLCY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
           +  CY F+  T    P+++  F+  A +R+D   V +             P D  +  IG
Sbjct: 391 LDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----EGCLAFVPTPGDFALGFIG 446

Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
           + QQ+    +YD+    + F +  C
Sbjct: 447 NVQQQTHEVLYDVGGGSVGFRRGAC 471


>gi|242046812|ref|XP_002461152.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
 gi|241924529|gb|EER97673.1| hypothetical protein SORBIDRAFT_02g041760 [Sorghum bicolor]
          Length = 452

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 78/362 (21%), Positives = 147/362 (40%), Gaps = 47/362 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +VR  +GTP + +LL +DT +   +                FDP  S+S++ + C  P C
Sbjct: 111 VVRARLGTPPQQLLLAVDTSNDAAWIPCAGCAGCPTSSAPPFDPAASTSYRSVPCGSPLC 170

Query: 47  TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                       + C +++ YAD S+    +  ++++V G            FGC     
Sbjct: 171 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGDA-----VKTYTFGCLQKAT 224

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G     +           R  +SF+SQ   + +  FSYCL  P       S  L+ G + 
Sbjct: 225 GTAAPPQGLLGL-----GRGPLSFLSQTRDMYQGTFSYCL--PSFKSLNFSGTLRLGRN- 276

Query: 162 GYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
             + P  + T  + +P  ++ YY+++  I +  + +  PP       +   G ++DSG++
Sbjct: 277 -GQPPRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIPPPALAFDPATGAGTVLDSGTM 335

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
            T   +  Y  + ++      R ++            C+    T   +P +   F+   +
Sbjct: 336 FTRLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPVTLLFDGMQV 388

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            +  ENV I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E 
Sbjct: 389 TLPEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARER 448

Query: 336 CS 337
           C+
Sbjct: 449 CT 450


>gi|449442641|ref|XP_004139089.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 478

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 159/381 (41%), Gaps = 70/381 (18%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           R+ IG+P     + +DTGS +++                    +++P+ SS+   I CD 
Sbjct: 76  RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQ 135

Query: 44  PDC--TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISV---IGKGEGKAIFHGAL 93
           P C  TY   +     +  C Y + Y D S T G+  ++ I +   +G  +        +
Sbjct: 136 PFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV 195

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G +  +   AL G+LG  +   S ISQL +   +KK F++CL      G + 
Sbjct: 196 FGCGAKQSG-ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFA 254

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
              +          P  + T  +  PN  +Y + L  + + +  ++ P   F+   S + 
Sbjct: 255 IGEV--------VEPKLKTTPVV--PNQAHYNVVLNGVKVGDTALDLPLGLFE--TSYKR 302

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFP 268
           G IIDSG+ L Y    +Y  L EK +      +L  + D     Q   F+      + FP
Sbjct: 303 GAIIDSGTTLAYLPDSIYLPLMEKILGAQPDLKLRTVDD-----QFTCFVFDKNVDDGFP 357

Query: 269 SMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
           ++ F FE++             +R   ++V+ + ++N     A +   + V L+G    +
Sbjct: 358 TVTFKFEESLILTIYPHEYLFQIR---DDVWCVGWQNSG---AQSKDGNEVTLLGDLVLQ 411

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +    Y+L    + + + NCS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432


>gi|225464832|ref|XP_002272243.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 467

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 92/401 (22%), Positives = 152/401 (37%), Gaps = 92/401 (22%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------------AIFDPRKSSSFQKIN 40
           + L  GTP + + LI+DTGS L++                      IF P+ SSS + + 
Sbjct: 92  IPLSFGTPPQTLPLIMDTGSDLVWFPCTHRYVCRNCSFSTSNPSSNIFIPKSSSSSKVLG 151

Query: 41  CDHPDCTYF-------KC---------VNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
           C +P C +        +C           + C   + +    +T G    ET+ + GKG 
Sbjct: 152 CVNPKCGWIHGSKVQSRCRDCEPTSPNCTQICPPYLVFYGSGITGGIMLSETLDLPGKGV 211

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
              I   ++   S               AG+ G  R   S  SQLG    K+FSYCL+  
Sbjct: 212 PNFIVGCSVLSTSQP-------------AGISGFGRGPPSLPSQLG---LKKFSYCLLSR 255

Query: 145 L--PNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--------NNFYYLSLKDISIDNER 194
                 E +S  L   +D G +      T F+ +P        + +YYL L+ I++  + 
Sbjct: 256 RYDDTTESSSLVLDGESDSGEKTAGLSYTPFVQNPKVAGKHAFSVYYYLGLRHITVGGKH 315

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
           +  P         G+GG IIDSG+  TY   +++  +  +F    +  +  +       +
Sbjct: 316 VKIPYKYLIPGADGDGGTIIDSGTTFTYMKGEIFELVAAEFEKQVQSKRATE-------V 368

Query: 255 QLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL----- 309
           +    L   FN        F +  L+  G     +   N+   L     DD+V L     
Sbjct: 369 EGITGLRPCFNISGLNTPSFPELTLKFRGGAEMELPLANYVAFLG---GDDVVCLTIVTD 425

Query: 310 --------------IGSQQQRDTRFVYDLNIDLLSFVKENC 336
                         +G+ QQ++    YDL  + L F +++C
Sbjct: 426 GAAGKEFSGGPAIILGNFQQQNFYVEYDLRNERLGFRQQSC 466


>gi|356570798|ref|XP_003553571.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 500

 Score = 80.9 bits (198), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 86/374 (22%), Positives = 155/374 (41%), Gaps = 55/374 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G+P+K   + +DTGS +++                     FD   SS+   ++C  
Sbjct: 86  KVKLGSPAKEFYVQIDTGSDILWINCITCSNCPHSSGLGIELDFFDTAGSSTAALVSCGD 145

Query: 44  PDCTYF------KCVNE--QCVYTMKYADQSVTKGFAAHETI----SVIGKGEGKAIFHG 91
           P C+Y       +C ++  QC YT +Y D S T G+   +T+     ++G+         
Sbjct: 146 PICSYAVQTATSECSSQANQCSYTFQYGDGSGTTGYYVSDTMYFDTVLLGQSVVANSSST 205

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
            +FGCS    G D    D A+ G+ G     +S ISQL S  +  K FS+C    L  GE
Sbjct: 206 IIFGCSTYQSG-DLTKTDKAVDGIFGFGPGALSVISQLSSRGVTPKVFSHC----LKGGE 260

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
                L  G  +    PS   +  + + P+  Y L+L+ I+++ + +  P D+     + 
Sbjct: 261 NGGGVLVLGEIL---EPSIVYSPLVPSQPH--YNLNLQSIAVNGQLL--PIDSNVFATTN 313

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
             G I+DSG+ L Y   + Y    +   +   +F    +S   +    CY +  +    F
Sbjct: 314 NQGTIVDSGTTLAYLVQEAYNPFVKAITAAVSQFSKPIISKGNQ----CYLVSNSVGDIF 369

Query: 268 PSMAFYFEDANLRIDGENVFIIDY----ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYD 323
           P ++  F      +     +++ Y        + +     +    ++G    +D  FVYD
Sbjct: 370 PQVSLNFMGGASMVLNPEHYLMHYGFLDGAAMWCIGFQKVEQGFTILGDLVLKDKIFVYD 429

Query: 324 LNIDLLSFVKENCS 337
           L    + +   +CS
Sbjct: 430 LANQRIGWADYDCS 443


>gi|222622847|gb|EEE56979.1| hypothetical protein OsJ_06707 [Oryza sativa Japonica Group]
          Length = 494

 Score = 80.9 bits (198), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 85/338 (25%), Positives = 139/338 (41%), Gaps = 68/338 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IGTP+K   + +DTGS +++                    ++DPR S S + + CD
Sbjct: 92  TRIGIGTPAKRYYVQVDTGSDILWVNCVSCDGCPRKSNLGIELTMYDPRGSQSGELVTCD 151

Query: 43  H-----------PDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
                       P CT        C Y++ Y D S T GF   + +    V G G+    
Sbjct: 152 QQFCVANYGGVLPSCTS----TSPCEYSISYGDGSSTAGFFVTDFLQYNQVSGDGQTTPA 207

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLP 146
                FGC     G D  + + AL G+LG  +   S +SQL +   ++K F++CL     
Sbjct: 208 NASVSFGCGA-KLGGDLGSSNLALDGILGFGQSNSSMLSQLAAAGKVRKMFAHCLDTVNG 266

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
            G +    +         +P  + T  + + P+  Y + LK I +    +  P + FD  
Sbjct: 267 GGIFAIGNV--------VQPKVKTTPLVPDMPH--YNVILKGIDVGGTALGLPTNIFDSG 316

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETF 264
            S   G IIDSG+ L Y    VY  L        +   +  L D       C+ +     
Sbjct: 317 NS--KGTIIDSGTTLAYVPEGVYKALFAMVFDKHQDISVQTLQDFS-----CFQYSGSVD 369

Query: 265 NRFPSMAFYFE-DANLRI--------DGENVFIIDYEN 293
           + FP + F+FE D +L +        +G+N++ + ++N
Sbjct: 370 DGFPEVTFHFEGDVSLIVSPHDYLFQNGKNLYCMGFQN 407


>gi|297605079|ref|NP_001056639.2| Os06g0121800 [Oryza sativa Japonica Group]
 gi|255676668|dbj|BAF18553.2| Os06g0121800 [Oryza sativa Japonica Group]
          Length = 487

 Score = 80.5 bits (197), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 77/325 (23%), Positives = 124/325 (38%), Gaps = 42/325 (12%)

Query: 26  AIFDPRKSSSFQKINCDHPDCTYFK-----CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
           A+FDPR+S +   + C    C         C N QC Y + Y D   T G    + +++ 
Sbjct: 191 ALFDPRRSRTSAAVPCGSAACGELGRYGAGCSNNQCQYFVDYGDGRATSGTYMVDALTL- 249

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                  +     FGCS+   G    +  G ++  LG  R   S +SQ  +     FSYC
Sbjct: 250 ---NPSTVVMNFRFGCSHAVRGNFSASTSGTMS--LGGGRQ--SLLSQTAATFGNAFSYC 302

Query: 141 LVIPLPNG-------EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
           +  P  +G              +F      R PS   T         Y + L+ I +   
Sbjct: 303 VPDPSSSGFLSLGGPADGGGAGRFARTPLVRNPSIIPT--------LYLVRLRGIEVGGR 354

Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
           R+N PP  F       GG ++DS  ++T      Y  L   F S    +   +++     
Sbjct: 355 RLNVPPVVF------AGGAVMDSSVIITQLPPTAYRALRLAFRSAMAAYP--RVAGGRAG 406

Query: 254 IQLCY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIG 311
           +  CY F+  T    P+++  F+  A +R+D   V +             P D  +  IG
Sbjct: 407 LDTCYDFVRFTSVTVPAVSLVFDGGAVVRLDAMGVMV----EGCLAFVPTPGDFALGFIG 462

Query: 312 SQQQRDTRFVYDLNIDLLSFVKENC 336
           + QQ+    +YD+    + F +  C
Sbjct: 463 NVQQQTHEVLYDVGGGSVGFRRGAC 487


>gi|125553832|gb|EAY99437.1| hypothetical protein OsI_21406 [Oryza sativa Indica Group]
          Length = 409

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/336 (24%), Positives = 136/336 (40%), Gaps = 63/336 (18%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETISVI 80
           +FDP  S+++  + C    C         C+ N QC + + YA+ +   G  + + ++ +
Sbjct: 111 LFDPATSTTYAAVPCSSAACARLGPYRRGCLANSQCQFGITYANGATATGTYSSDDLT-L 169

Query: 81  GKGEGKAIFHGALFGCSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
           G  +   +  G LFGC++ + G  F  D     +AG L L   + SF+ Q  S   + FS
Sbjct: 170 GPYD---VVRGFLFGCAHADQGSTFSYD-----VAGTLALGGGSQSFVQQTASQYSRVFS 221

Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT---KFINHP--------NNFYYLSLKD 187
           YC    +P    +  ++ FG       P  +A     F++ P          FY + L+ 
Sbjct: 222 YC----VPPSTSSFGFIMFGV------PPQRAALVPTFVSTPLLSSSTMSPTFYRVLLRS 271

Query: 188 ISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
           I +    +  PP  F  +       +IDS +V++      Y  L   F S    ++ A  
Sbjct: 272 IIVAGRPLPVPPTVFSAS------SVIDSATVISRIPPTAYQALRAAFRSAMTMYRPA-- 323

Query: 248 SDCPEPIQL---CY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAP 302
                P+ +   CY F        PS+A  F+  A + +D   + +         LA AP
Sbjct: 324 ----PPVSILDTCYDFSGVRSITLPSIALVFDGGATVNLDAAGILLQG------CLAFAP 373

Query: 303 --HDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              D +   IG+ QQR    VYD+    + F    C
Sbjct: 374 TASDRMPGFIGNVQQRTLEVVYDVPGKAIRFRSAAC 409


>gi|363808270|ref|NP_001242239.1| uncharacterized protein LOC100801883 [Glycine max]
 gi|255641727|gb|ACU21134.1| unknown [Glycine max]
          Length = 475

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 83/373 (22%), Positives = 159/373 (42%), Gaps = 53/373 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           +L +G+P +   + +DTGS +++                    ++DP+ S +   ++CD 
Sbjct: 73  KLGLGSPPRDYYVQVDTGSDILWVNCVECSRCPRKSDLGIDLTLYDPKGSETSDVVSCDQ 132

Query: 44  PDCTYF------KCVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
             C+         C +E  C Y++ Y D S T G+   + ++   + G           +
Sbjct: 133 DFCSATFDGPIPGCKSEIPCPYSITYGDGSATTGYYVQDYLTYNRINGNLRTSPQNSSII 192

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G    + + AL G++G  +   S +SQL +   +KK FS+CL     +    
Sbjct: 193 FGCGAVQSGTLGSSSEEALDGIIGFGQANSSVLSQLAASGKVKKIFSHCL-----DNVRG 247

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
                 G  +  +  +T     + H    Y + LK I +D + +  P D FD +V+G+ G
Sbjct: 248 GGIFAIGEVVEPKVSTTPLVPRMAH----YNVVLKSIEVDTDILQLPSDIFD-SVNGK-G 301

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPSM 270
            +IDSG+ L Y    VY +L +K ++     +L  +    E    C+      +R FP +
Sbjct: 302 TVIDSGTTLAYLPDIVYDELIQKVLARQPGLKLYLV----EQQFRCFLYTGNVDRGFPVV 357

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLL------AVAPHDDLVALIGSQQQRDTRFVYDL 324
             +F+D+       + ++  +++  + +      A   +   + L+G     +   +YDL
Sbjct: 358 KLHFKDSLSLTVYPHDYLFQFKDGIWCIGWQRSVAQTKNGKDMTLLGDLVLSNKLVIYDL 417

Query: 325 NIDLLSFVKENCS 337
              ++ +   NCS
Sbjct: 418 ENMVIGWTDYNCS 430


>gi|388502484|gb|AFK39308.1| unknown [Medicago truncatula]
          Length = 425

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 79/345 (22%), Positives = 143/345 (41%), Gaps = 43/345 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP + +LL +DT +   +            +F P KS++F+ ++C  P+C   
Sbjct: 94  IVRAKIGTPPQTLLLAMDTSNDAAWIPCTACDGCASTLFAPEKSTTFKNVSCAAPECKQV 153

Query: 50  K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C      + + Y   S+       +TI++              FGC +   G    
Sbjct: 154 PNPGCGVSSRNFNLTYGSSSIAANLV-QDTITL-----ATDPVPSYTFGCVSKTTGTSAP 207

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
            +           R  +S +SQ  ++ +  FSYCL  P       S  L+ G     +R 
Sbjct: 208 PQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVAQPKR- 259

Query: 167 STQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P  ++ YY++L+ I +  + ++ PP       +   G I DSG+V T   
Sbjct: 260 -IKYTPLLKNPRRSSLYYVNLEAIRVGRKVVDIPPAALAFNPTTGAGTIFDSGTVFTRLV 318

Query: 225 SDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           + VY  + ++F      +  +  L         CY +P      P++ F F   N+ +  
Sbjct: 319 APVYVAVRDEFRRRVGPKLTVTSLGG----FDTCYNVPIV---VPTITFIFTGMNVTLPQ 371

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDL 324
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+
Sbjct: 372 DNILIHSTAGSTTCLAMAGAPDNVNSVLNVIANMQQQNHRVLYDV 416


>gi|356531884|ref|XP_003534506.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 482

 Score = 80.5 bits (197), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/342 (23%), Positives = 144/342 (42%), Gaps = 40/342 (11%)

Query: 19  TGSALIYAIFDPRKSSSFQKINCDHPDCTYF------KCVNE-QCVYTMKYADQSVTKGF 71
           +G  +   ++DP  S + + + CD   CT         C  +  C Y++ Y D S T G 
Sbjct: 113 SGLGMELTLYDPNSSKTSKVVPCDDEFCTSTYDGPISGCKKDMSCPYSITYGDGSTTSGS 172

Query: 72  AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
              + ++   V+G           +FGC +   G      D +L G++G  +   S +SQ
Sbjct: 173 YIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 232

Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
           L +   +K+ FS+CL      G +    +         +P  + T  +    + Y + LK
Sbjct: 233 LAAAGKVKRVFSHCLDTVNGGGIFAIGEV--------VQPKVKTTPLVPRMAH-YNVVLK 283

Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
           DI +  + +  P D FD T SG  G IIDSG+ L Y    +Y +L EK ++     +L  
Sbjct: 284 DIEVAGDPIQLPTDIFDST-SGR-GTIIDSGTTLAYLPVSIYDQLLEKTLAQRSGMELYL 341

Query: 247 LSDCPEPIQL-CYFLPETF---NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLA--- 299
           + D     Q  C+   +     + FP++ F FE+        + ++  ++   + +    
Sbjct: 342 VED-----QFTCFHYSDEKSLDDAFPTVKFTFEEGLTLTAYPHDYLFPFKEDMWCIGWQK 396

Query: 300 ----VAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                    DL+ L+G     +  F+YDL+   + +   NCS
Sbjct: 397 STAQTKDGKDLI-LLGDLVLTNKLFIYDLDNMSIGWTDYNCS 437


>gi|195638734|gb|ACG38835.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 465

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +F+P+ SSS+  ++C    
Sbjct: 128 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQ 187

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+              +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 188 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 242

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL  P  +   +     
Sbjct: 243 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 295

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +  A+  ++  ++ Y++ +  I +  +     P +   +       IIDS
Sbjct: 296 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 348

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+V+T   + VY  L +      +    A        +  C+       R P +   F  
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 405

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                      ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    C
Sbjct: 406 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAAGC 464

Query: 337 S 337
           S
Sbjct: 465 S 465


>gi|449446233|ref|XP_004140876.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 498

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 89/373 (23%), Positives = 150/373 (40%), Gaps = 52/373 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                     +D  +S++ + ++CD
Sbjct: 89  AKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCD 148

Query: 43  HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              C             N  C Y   Y D S T G+   + +    V G  E  A     
Sbjct: 149 EQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSI 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC     G    + + AL G+LG  +   S ISQL S   +KK F++CL     +G  
Sbjct: 209 KFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL-----DGTN 263

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + N P+  Y +++  + + +  +N   D F+      
Sbjct: 264 GGGIFAMGHVV---QPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFE--AGDR 316

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L Y    +Y  L  K +S     ++  +    +  Q   +     + FP 
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQ---YSERVDDGFPP 373

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFL----LAVAPHDDL-VALIGSQQQRDTRFVYDL 324
           + F+FE++ L     + ++  YEN + +      +   D   V L G     +   +YDL
Sbjct: 374 VIFHFENSLLLKVYPHEYLFQYENLWCIGWQNSGMQSRDRKNVTLFGDLVLSNKLVLYDL 433

Query: 325 NIDLLSFVKENCS 337
               + + + NCS
Sbjct: 434 ENQTIGWTEYNCS 446


>gi|32479948|emb|CAE01594.1| OSJNBa0008A08.2 [Oryza sativa Japonica Group]
 gi|38347627|emb|CAE05222.2| OSJNBa0011K22.4 [Oryza sativa Japonica Group]
 gi|38567678|emb|CAE75961.1| B1159F04.24 [Oryza sativa Japonica Group]
 gi|116309512|emb|CAH66578.1| OSIGBa0137O04.4 [Oryza sativa Indica Group]
          Length = 431

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 81/367 (22%), Positives = 152/367 (41%), Gaps = 49/367 (13%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           IGTP+    + LDTGS   +                     +DPR S S +++ CD   C
Sbjct: 65  IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 124

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
           T     N   +C Y   YAD  +T G    + +    + G G+ +       FGC     
Sbjct: 125 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 184

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G   ++   A+ G++G      + +SQL +    KK FS+CL      G +    +    
Sbjct: 185 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 239

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 P  + T  + +   ++ ++LK I++    +  P + F  T +   G  IDSGS 
Sbjct: 240 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 293

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           L Y    +Y +L     +      +  + +     Q  +FL    ++FP + F+FE+ +L
Sbjct: 294 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 348

Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
            +D     ++++YE + +        +  + D++ ++G     +   VYD+    + + +
Sbjct: 349 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 407

Query: 334 ENCSDDS 340
            N  +++
Sbjct: 408 HNSVEEA 414


>gi|115484725|ref|NP_001067506.1| Os11g0215400 [Oryza sativa Japonica Group]
 gi|77549255|gb|ABA92052.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644728|dbj|BAF27869.1| Os11g0215400 [Oryza sativa Japonica Group]
          Length = 428

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 151/360 (41%), Gaps = 53/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTP+K  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 83  VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 142

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 143 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 198

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 199 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 252

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 253 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----RKGVV 305

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      +   A+     E  + CY +        P+++ 
Sbjct: 306 FDSGSELSYIPDRALSVLSQRIRELLLKRGAAE----EESERNCYDMRSVDEGDMPAISL 361

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +F+D A   +    VF+     E   + LA AP +  V++IGS  Q     VYDL   L+
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420


>gi|294463081|gb|ADE77078.1| unknown [Picea sitchensis]
          Length = 370

 Score = 80.1 bits (196), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 96/384 (25%), Positives = 153/384 (39%), Gaps = 82/384 (21%)

Query: 17  LDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDCTYF-------- 49
           +DTGS L++                    +F PR SSS   + C   +C           
Sbjct: 1   MDTGSDLVWVPCTRNYSCINCPEDSASNGVFLPRMSSSLHLVTCADSNCKTLYGNNTELL 60

Query: 50  --------KCVNEQCV-YTMKYADQSVTKGFAAHETISV-IGKGEG-KAIFHGALFGCSN 98
                   K  +E C  Y ++Y   S T G    ET+++ +  GEG +AI H A+ GCS 
Sbjct: 61  CQSCAGSLKNCSETCPPYGIQYGRGS-TAGLLLTETLNLPLENGEGARAITHFAV-GCS- 117

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGS-IIKKRFSYCLVIPLPNGEYTSSYLKF 157
                         +G+ G  R  +S  SQLG  I K RF+YCL     + E   S +  
Sbjct: 118 -------IVSSQQPSGIAGFGRGALSMPSQLGEHIGKDRFAYCLQSHRFDEENKKSLMVL 170

Query: 158 GTDMGYRRPSTQATKFINH----PNN----FYYLSLKDISIDNERM-NFPPDTFDITVSG 208
           G            T F+ +    P++    +YY+ L+ +SI  +R+   P         G
Sbjct: 171 GDKALPNNIPLNYTPFLTNSRAPPSSQYGVYYYIGLRGVSIGGKRLKQLPSKLLRFDTKG 230

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RF 267
            GG IIDSG+  T F  +++  +   F S     +  ++ D    + LCY +    N   
Sbjct: 231 NGGTIIDSGTTFTVFSDEIFKHIAAGFASQIGYRRAGEVED-KTGMGLCYDVTGLENIVL 289

Query: 268 PSMAFYFEDANLRIDGENVFIIDYENHF--------FLLAVAPHDDLV-------ALIGS 312
           P  AF+F+       G +  ++   N+F          L +     L+        ++G+
Sbjct: 290 PEFAFHFK-------GGSDMVLPVANYFSYFSSFDSICLTMISSRGLLEVDSGPAVILGN 342

Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
            QQ+D   +YD   + L F ++ C
Sbjct: 343 DQQQDFYLLYDREKNRLGFTQQTC 366


>gi|300681439|emb|CBH32531.1| hypothetical protein TAA_ctg0091b.00060.1 [Triticum aestivum]
          Length = 426

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 136/322 (42%), Gaps = 33/322 (10%)

Query: 33  SSSFQKINCDHPDCTYF-----KCVNEQ---CVYTMKYADQSVTKGFAAHETISVIGKGE 84
           SS F ++ C    C         C N     C Y  +Y     T G+ + E ++ +G   
Sbjct: 107 SSDFTEVFCFSQTCQLALDEEDACGNSTSFTCPYAYQYGPGISTTGYISAEEVTAVGT-- 164

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
              I   ALFGCS  +        DG  +GVLG SR   S +SQL      RFSY ++  
Sbjct: 165 --HITGRALFGCSLAS----TVPLDGE-SGVLGFSRGPYSLLSQLK---ISRFSYFMLPD 214

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMN-FPPDT 201
             +   + S L  G D   +  S+++T  + +    + YY+ L  I +D++ ++  P  T
Sbjct: 215 DADKPDSESVLLLGDDAVPQTNSSRSTPLLRNEAYPDLYYVKLTGIKVDDKSLSGIPAGT 274

Query: 202 FDITVSG-EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL 260
           FD+  +G  GG ++ + S +TY     Y  L     S  +   +   +D    ++LCY +
Sbjct: 275 FDLAANGCSGGVVMSTLSPITYLQPAAYNALTRALASKIKSQPVRPKADDVADLRLCYNI 334

Query: 261 PETFN-RFPSMAFYF-----EDANLRIDGENVFIIDYENHFFLLAVAPH---DDLVALIG 311
               N  FP +   F       A + +   + FI +       L + P      + +++G
Sbjct: 335 QSVANLTFPKITLVFHGVDGRPAPMELTTAHYFIRENSTGLQCLTMLPTPAGSPVSSVLG 394

Query: 312 SQQQRDTRFVYDLNIDLLSFVK 333
           S  Q  T  +YDL    L+F K
Sbjct: 395 SLLQTGTHMIYDLRGGSLTFEK 416


>gi|219886223|gb|ACL53486.1| unknown [Zea mays]
 gi|238015146|gb|ACR38608.1| unknown [Zea mays]
 gi|413938611|gb|AFW73162.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938612|gb|AFW73163.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938613|gb|AFW73164.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
 gi|413938614|gb|AFW73165.1| hypothetical protein ZEAMMB73_440759 [Zea mays]
          Length = 467

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +F+P+ SSS+  ++C    
Sbjct: 130 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQ 189

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+              +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 190 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 244

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL  P  +   +     
Sbjct: 245 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 297

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +  A+  ++  ++ Y++ +  I +  +     P +   +       IIDS
Sbjct: 298 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 350

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+V+T   + VY  L +      +    A        +  C+       R P +   F  
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 407

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                      ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    C
Sbjct: 408 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466

Query: 337 S 337
           S
Sbjct: 467 S 467


>gi|326511104|dbj|BAK03251.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 631

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 144/364 (39%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL IGTP +   LI+D+GS + Y                F P  SS++  + C+  DCT
Sbjct: 90  TRLHIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSTYSPVKCNV-DCT 148

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                N QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F +
Sbjct: 149 CDSDKN-QCTYERQYAEMSSSSGVLGEDIVSFGTESELKP--QRAVFGCENSETGDLFSQ 205

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +     M 
Sbjct: 206 HAD-----GIMGLGRGQLSIMDQLVDKGVIGDSFSMCYGGMDIGGGAMVLGAMPAPPGMI 260

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           Y       +  +  P  +Y + LK++ +  + +   P  FD    G+ G ++DSG+   Y
Sbjct: 261 YTH-----SNAVRSP--YYNIELKEMHVAGKALRVDPRIFD----GKHGTVLDSGTTYAY 309

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
                +    +   S     +  +  D      +C+      + +    FP +   F + 
Sbjct: 310 LPEQAFVAFKDAVSSQVHPLKKIRGPDS-NYKDICFAGAGRNVSQLSEVFPKVDMVFGNG 368

Query: 278 N-LRIDGENVFI--IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             L +  EN        E  + L       D   L+G    R+T   YD + + + F K 
Sbjct: 369 QKLSLSPENYLFRHSKVEGAYCLGVFQNGKDPTTLLGGIVVRNTLVTYDRHNEKIGFWKT 428

Query: 335 NCSD 338
           NCS+
Sbjct: 429 NCSE 432


>gi|115465837|ref|NP_001056518.1| Os05g0596000 [Oryza sativa Japonica Group]
 gi|55733881|gb|AAV59388.1| unknown protein [Oryza sativa Japonica Group]
 gi|57900669|gb|AAW57794.1| unknown protein [Oryza sativa Japonica Group]
 gi|113580069|dbj|BAF18432.1| Os05g0596000 [Oryza sativa Japonica Group]
 gi|215697162|dbj|BAG91156.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215768162|dbj|BAH00391.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 535

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/336 (23%), Positives = 130/336 (38%), Gaps = 30/336 (8%)

Query: 21  SALIYAIFDPRKSSSFQKINCDHPDCT---YFKC----VNEQCVYTMKYADQSVTKGFAA 73
           + +I   + P KSSS+++  C    C    Y  C     N  C Y     D ++T G   
Sbjct: 180 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 239

Query: 74  HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
            E  +V           G + GCS   HG   ++ DG    +L L     SF        
Sbjct: 240 QEKATVAVSDGTMKKLPGLVIGCSTFEHGGAVNSHDG----ILSLGNSPSSFGIAAARRF 295

Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
             R S+CL+    +G   SSYL FG +   + P T  T  + + +  Y   +  I +  +
Sbjct: 296 GGRLSFCLLATT-SGRNASSYLTFGANPAVQAPGTMETPLL-YRDVAYGAHVTGILVGGQ 353

Query: 194 RMNFPPDTFDITVSG----EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
            ++ PP+ +D    G    E G I+D+G+ +TY  S VY  +     S+      A++  
Sbjct: 354 PLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIKG 413

Query: 250 CPEPIQLCYFL--------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV 300
                + CY          P      PS +     DA L  D +++ + +       L  
Sbjct: 414 ----FEYCYNWTFAGDGVDPAHNVTIPSFSIEMAGDARLAADAKSIVVPEVVPGVVCLGF 469

Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                  ++IG+   ++  +  D    +L F K+ C
Sbjct: 470 NRISQGPSIIGNVLMQEHIWEIDHMSTVLRFRKDKC 505


>gi|356523724|ref|XP_003530485.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 488

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 92/381 (24%), Positives = 155/381 (40%), Gaps = 68/381 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP K   L +DTGS +++                    ++D ++SSS + + CD
Sbjct: 85  AKIGIGTPPKNYYLQVDTGSDIMWVNCIQCKECPTRSSLGMDLTLYDIKESSSGKLVPCD 144

Query: 43  HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              C             N  C Y   Y D S T G+   + +    V G  +  +     
Sbjct: 145 QEFCKEINGGLLTGCTANISCPYLEIYGDGSSTAGYFVKDIVLYDQVSGDLKTDSANGSI 204

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G    + + AL G+LG  +   S ISQL S   +KK F++CL     NG  
Sbjct: 205 VFGCGARQSGDLSSSNEEALDGILGFGKANSSMISQLASSGKVKKMFAHCL-----NGVN 259

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + + P+  Y +++  + + +  ++   DT     S +
Sbjct: 260 GGGIFAIGHVV---QPKVNMTPLLPDQPH--YSVNMTAVQVGHTFLSLSTDT-----SAQ 309

Query: 210 G---GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-N 265
           G   G IIDSG+ L Y    +Y  L  K +S     ++  L D       C+   E+  +
Sbjct: 310 GDRKGTIIDSGTTLAYLPEGIYEPLVYKMISQHPDLKVQTLHD----EYTCFQYSESVDD 365

Query: 266 RFPSMAFYFEDA-NLRI-------DGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQR 316
            FP++ F+FE+  +L++          N + I ++N          D   + L+G     
Sbjct: 366 GFPAVTFFFENGLSLKVYPHDYLFPSVNFWCIGWQNS----GTQSRDSKNMTLLGDLVLS 421

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +    YDL    + + + NCS
Sbjct: 422 NKLVFYDLENQAIGWAEYNCS 442


>gi|212275300|ref|NP_001130675.1| uncharacterized protein LOC100191778 precursor [Zea mays]
 gi|194706308|gb|ACF87238.1| unknown [Zea mays]
          Length = 467

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +F+P+ SSS+  ++C    
Sbjct: 130 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYTSVSCSAQQ 189

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+              +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 190 CSDLTTATLSPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 244

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL  P  +   +     
Sbjct: 245 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 297

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +  A+  ++  ++ Y++ +  I +  +     P +   +       IIDS
Sbjct: 298 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 350

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+V+T   + VY  L +      +    A        +  C+       R P +   F  
Sbjct: 351 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 407

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                      ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    C
Sbjct: 408 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 466

Query: 337 S 337
           S
Sbjct: 467 S 467


>gi|449476186|ref|XP_004154665.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           2-like [Cucumis sativus]
          Length = 478

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 89/381 (23%), Positives = 158/381 (41%), Gaps = 70/381 (18%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           R+ IG+P     + +DTGS +++                    +++P+ SS+   I CD 
Sbjct: 76  RIGIGSPPNDFHVQVDTGSDILWVNCVGCSNCPKKSDIGVDLQLYNPKSSSTSTLITCDQ 135

Query: 44  PDC--TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISV---IGKGEGKAIFHGAL 93
           P C  TY   +     +  C Y + Y D S T G+  ++ I +   +G  +        +
Sbjct: 136 PFCSATYDAPIPGCKPDLLCQYKVIYGDGSATAGYFVNDYIQLQRAVGNHKTSETNGSIV 195

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G +  +   AL G+LG  +   S ISQL +   +KK F++CL      G + 
Sbjct: 196 FGCGAKQSG-ELGSSSEALDGILGFGQANSSMISQLAATGKVKKIFAHCLDSISGGGIFA 254

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEG 210
              +          P    T  +  PN  +Y + L  + + +  ++ P   F+   S + 
Sbjct: 255 IGEV--------VEPKLXNTPVV--PNQAHYNVVLNGVKVGDTALDLPLGLFE--TSYKR 302

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL--PETFNRFP 268
           G IIDSG+ L Y    +Y  L EK +      +L  + D     Q   F+      + FP
Sbjct: 303 GAIIDSGTTLAYLPESIYLPLMEKILGAQPDLKLRTVDD-----QFTCFVFDKNVDDGFP 357

Query: 269 SMAFYFEDA------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQR 316
           ++ F FE++             +R   ++V+ + ++N     A +   + V L+G    +
Sbjct: 358 TVTFKFEESLILTIYPHEYLFQIR---DDVWCVGWQNSG---AQSKDGNEVTLLGDLVLQ 411

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           +    Y+L    + + + NCS
Sbjct: 412 NKLVYYNLENQTIGWTEYNCS 432


>gi|255556768|ref|XP_002519417.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223541280|gb|EEF42831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 494

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 94/378 (24%), Positives = 156/378 (41%), Gaps = 64/378 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +GTP +   + +DTGS +++                     FD   SS+ + + C 
Sbjct: 83  TRVKLGTPPREFNVQIDTGSDVLWVTCSSCSNCPQTSGLGIQLNYFDTTSSSTARLVPCS 142

Query: 43  HPDC------TYFKC--VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
           HP C      T  +C   + QC Y  +Y D S T G+   +T    +V+G+         
Sbjct: 143 HPICTSQIQTTATQCPPQSNQCSYAFQYGDGSGTSGYYVSDTFYFDAVLGESLIANSSAA 202

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
            +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL      GE
Sbjct: 203 IVFGCSTYQSG-DLTKTDKAVDGIFGFGQGELSVISQLSSHGITPRVFSHCL-----KGE 256

Query: 150 YT-SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
            +    L  G  +    P    +  + + P+  Y L L+ I++  + +   P  F    S
Sbjct: 257 DSGGGILVLGEIL---EPGIVYSPLVPSQPH--YNLDLQSIAVSGQLLPIDPAAF--ATS 309

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI----QLCYFLPET 263
              G IID+G+ L Y   + Y    + FVS       A +S    P       CY +  +
Sbjct: 310 SNRGTIIDTGTTLAYLVEEAY----DPFVSAIT----AAVSQLATPTINKGNQCYLVSNS 361

Query: 264 FNR-FPSMAFYFE-DANLRIDGEN--VFIIDYEN-HFFLLAVAPHDDLVALIGSQQQRDT 318
            +  FP ++F F   A + +  E   +++ +Y     + +        + ++G    +D 
Sbjct: 362 VSEVFPPVSFNFAGGATMLLKPEEYLMYLTNYAGAALWCIGFQKIQGGITILGDLVLKDK 421

Query: 319 RFVYDLNIDLLSFVKENC 336
            FVYDL    + +   +C
Sbjct: 422 IFVYDLAHQRIGWANYDC 439


>gi|125558627|gb|EAZ04163.1| hypothetical protein OsI_26305 [Oryza sativa Indica Group]
          Length = 404

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 65/212 (30%), Positives = 96/212 (45%), Gaps = 41/212 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCT 47
           + L IGTP     ++ DTGS+LI+                F P  SS+F K+ C    C 
Sbjct: 92  MNLSIGTPPVTFSVLADTGSSLIWTQCAPCTECAARPAPPFQPASSSTFSKLPCASSLCQ 151

Query: 48  -----YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                Y  C    CVY   Y     T G+ A ET+ V     G A F G  FGCS +N  
Sbjct: 152 FLTSPYRTCNATGCVYYYPYG-MGFTAGYLATETLHV-----GGASFPGVTFGCSTENGV 205

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
            +        +G++GL R  +S +SQ+G     RFSYCL     N +   S + FG+   
Sbjct: 206 GNSS------SGIVGLGRSPLSLVSQVG---VARFSYCL---RSNADAGDSPILFGSLAK 253

Query: 163 YRRPSTQATKFINHP----NNFYYLSLKDISI 190
               + Q+T  + +P    +++YY++L  I++
Sbjct: 254 VTGGNVQSTPLLENPEMPSSSYYYVNLTGITV 285


>gi|223975883|gb|ACN32129.1| unknown [Zea mays]
 gi|223975971|gb|ACN32173.1| unknown [Zea mays]
 gi|224034191|gb|ACN36171.1| unknown [Zea mays]
 gi|413938623|gb|AFW73174.1| aspartic proteinase nepenthesin-1 isoform 1 [Zea mays]
 gi|413938624|gb|AFW73175.1| aspartic proteinase nepenthesin-1 isoform 2 [Zea mays]
          Length = 465

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/361 (22%), Positives = 146/361 (40%), Gaps = 47/361 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           + R+ +GTP+K  ++++DTGS+L +                +F+P+ SSS+  ++C    
Sbjct: 128 VTRMGLGTPAKSYVMVVDTGSSLTWLQCSPCVVSCHRQSGPVFNPKASSSYASVSCSAQQ 187

Query: 46  CTYFKCV---------NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C+              +  C+Y   Y D S + G+ + +T+S      G        +GC
Sbjct: 188 CSDLTTATLNPASCSTSNVCIYQASYGDSSFSVGYLSKDTVSF-----GSTSVPNFYYGC 242

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
             DN G       G  AG++GL+R  +S + QL   +   FSYCL  P  +   +     
Sbjct: 243 GQDNEGLF-----GQSAGLIGLARNKLSLLYQLAPSMGYSFSYCL--PTSSSSSSGYLSI 295

Query: 157 FGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
              + G    +  A+  ++  ++ Y++ +  I +  +     P +   +       IIDS
Sbjct: 296 GSYNPGQYSYTPMASSSLD--DSLYFIKMTGIKVAGK-----PLSVSSSAYSSLPTIIDS 348

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+V+T   + VY  L +      +    A        +  C+       R P +   F  
Sbjct: 349 GTVITRLPTGVYSALSKAVAGAMKGTPRASAFSI---LDTCFQGQAARLRVPEVTMAFAG 405

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                      ++D ++    LA AP     A+IG+ QQ+    VYD+    + F    C
Sbjct: 406 GAALKLAARNLLVDVDSATTCLAFAPARS-AAIIGNTQQQTFSVVYDVKNSKIGFAAGGC 464

Query: 337 S 337
           S
Sbjct: 465 S 465


>gi|56542455|gb|AAV92892.1| Avr9/Cf-9 rapidly elicited protein 36, partial [Nicotiana tabacum]
          Length = 191

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 52/168 (30%), Positives = 83/168 (49%), Gaps = 8/168 (4%)

Query: 175 NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEK 234
           NH   FYY+ +K + +  E +N P +T++++  G GG IIDSG+ L+YF    Y  + + 
Sbjct: 27  NHLETFYYVQIKSVIVGGEVLNIPEETWNLSTEGVGGTIIDSGTTLSYFAEPAYEIIKQA 86

Query: 235 FVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFEDANL-RIDGENVFIIDYE 292
           FV+  +R+ +  L D P  ++ CY +        PS    F D  +     EN FI    
Sbjct: 87  FVNKVKRYPI--LDDFP-ILKPCYNVSGVEKLELPSFGIVFGDGAIWTFPVENYFIKLEP 143

Query: 293 NHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
                LA+   PH  + ++IG+ QQ++   +YD     L F    C+D
Sbjct: 144 EDIVCLAILGTPHSAM-SIIGNYQQQNFHILYDTKRSRLGFAPRRCAD 190


>gi|255550723|ref|XP_002516410.1| pepsin A, putative [Ricinus communis]
 gi|223544445|gb|EEF45965.1| pepsin A, putative [Ricinus communis]
          Length = 416

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 102/417 (24%), Positives = 159/417 (38%), Gaps = 104/417 (24%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCD------------------ 42
           ++ L IGTP + + + +DTGS L +    P  + SF  ++CD                  
Sbjct: 13  LISLNIGTPPQVIQVYMDTGSDLTWV---PCGNLSFDCMDCDDYRNSKLMSAFSPSHSSS 69

Query: 43  -------HPDCT--------YFKCVNEQCV---------------YTMKYADQSVTKGFA 72
                   P CT        +  C    C                +   Y    V  G  
Sbjct: 70  SYRDSCASPYCTDIHSSDNSFDPCTVAGCSLSTLIKATCARPCPSFAYTYGAGGVVTGTL 129

Query: 73  AHETISVIGKGEGKAIFHGAL----FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
             +T+ V    EG A     +    FGC    +            G+ G  R T+SF SQ
Sbjct: 130 TRDTLRV---HEGPARVTKDIPKFCFGCVGSTYH--------EPIGIAGFVRGTLSFPSQ 178

Query: 129 LGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSL 185
           LG ++KK FS+C L     N    SS L  G      + + Q T  +  P   N+YY+ L
Sbjct: 179 LG-LLKKGFSHCFLAFKYANNPNISSPLVIGDTALSSKDNMQFTPMLKSPMYPNYYYIGL 237

Query: 186 KDISIDN-ERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQL 244
           + I++ N      P +  +    G GG +IDSG+  T+     Y +L   F +    +  
Sbjct: 238 EAITVGNVSATTVPLNLREFDSQGNGGMLIDSGTTYTHLPEPFYSQLLSIFKAIIT-YPR 296

Query: 245 AQLSDCPEPIQLCYFLPETFNR-------FPSMAFYFEDANLRIDGENV-FIIDYENHFF 296
           A   +      LCY +P   NR       FPS+ F+F +        NV F++   NHF+
Sbjct: 297 ATEVEMRAGFDLCYKVPCPNNRLTDDDNLFPSITFHFLN--------NVSFVLPQGNHFY 348

Query: 297 LLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            ++   +  +V                 + GS QQ++ + VYDL  + + F   +C+
Sbjct: 349 AMSAPSNSTVVKCLLFQSMADSDYGPAGVFGSFQQQNVQIVYDLEKERIGFQPMDCA 405


>gi|125553570|gb|EAY99279.1| hypothetical protein OsI_21243 [Oryza sativa Indica Group]
 gi|125605796|gb|EAZ44832.1| hypothetical protein OsJ_29469 [Oryza sativa Japonica Group]
          Length = 534

 Score = 79.7 bits (195), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/336 (23%), Positives = 130/336 (38%), Gaps = 30/336 (8%)

Query: 21  SALIYAIFDPRKSSSFQKINCDHPDCT---YFKC----VNEQCVYTMKYADQSVTKGFAA 73
           + +I   + P KSSS+++  C    C    Y  C     N  C Y     D ++T G   
Sbjct: 179 TKVIMNWYRPAKSSSWRRFRCSQRACMDLPYNTCESPDQNTSCTYYQVMKDSTITSGIYG 238

Query: 74  HETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII 133
            E  +V           G + GCS   HG   ++ DG    +L L     SF        
Sbjct: 239 QEKATVAVSDGTMKKLPGLVIGCSTFEHGGAVNSHDG----ILSLGNSPSSFGIAAARRF 294

Query: 134 KKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNE 193
             R S+CL+    +G   SSYL FG +   + P T  T  + + +  Y   +  I +  +
Sbjct: 295 GGRLSFCLLATT-SGRNASSYLTFGANPAVQAPGTMETPLL-YRDVAYGAHVTGILVGGQ 352

Query: 194 RMNFPPDTFDITVSG----EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
            ++ PP+ +D    G    E G I+D+G+ +TY  S VY  +     S+      A++  
Sbjct: 353 PLDIPPEVWDEGPLGNDNPEAGIILDTGTSITYLVSAVYDPVTAALDSHLAHLPKAEIKG 412

Query: 250 CPEPIQLCYFL--------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV 300
                + CY          P      PS +     DA L  D +++ + +       L  
Sbjct: 413 ----FEYCYNWTFAGDGVDPAHNVTIPSFSIEMAGDARLAADAKSIVVPEVVPGVVCLGF 468

Query: 301 APHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                  ++IG+   ++  +  D    +L F K+ C
Sbjct: 469 NRISQGPSIIGNVLMQEHIWEIDHMSTVLRFRKDKC 504


>gi|449527515|ref|XP_004170756.1| PREDICTED: aspartic proteinase nepenthesin-1-like, partial [Cucumis
           sativus]
          Length = 364

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 142/360 (39%), Gaps = 44/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +LL LDT +   +             +F   KSSSF+ + C  P C  
Sbjct: 27  VVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQ 86

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C + + Y   +V       + +++              FGC     G   
Sbjct: 87  VPNPSCSGSACGFNLTYGSSTVAADLV-QDNLTLATDS-----VPSYTFGCIRKATGSSV 140

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
             +           R  +S + Q  S+ +  FSYCL  P       S  L+ G      R
Sbjct: 141 PPQGLLGL-----GRGPLSLLGQSQSLYQSTFSYCL--PSFKSVNFSGSLRLGPVAQPIR 193

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T  + +P  ++ YY++L  I +  + ++ PP       +   G +IDSG+  T  
Sbjct: 194 --IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRL 251

Query: 224 HSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
            +  Y  + ++F     R   ++ L         CY +P      P++ F F   N+ + 
Sbjct: 252 VAPAYTAVRDEFRRRVGRNVTVSSLGG----FDTCYTVPII---SPTITFMFAGMNVTLP 304

Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
            +N  I         LA+A   D    ++ +I S QQ++ R ++D+    +   +E+CS 
Sbjct: 305 PDNFLIHSTSGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 364


>gi|326500408|dbj|BAK06293.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 475

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 79/333 (23%), Positives = 125/333 (37%), Gaps = 58/333 (17%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF---------KCVNEQCVYTMKYADQSVTKGFAAHETI 77
           +FDP  SS+   + C  P C            +  N +C Y ++Y+D   T G    +T+
Sbjct: 178 LFDPTTSSTAAAVRCRSPACRSLGPYGNGCSNRSANAECRYLIEYSDDRATAGTYMTDTL 237

Query: 78  SVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRF 137
           ++     G        FGCS+   G   D      AG + L     S ++Q    +   F
Sbjct: 238 TI----SGTTAVRNFRFGCSHAVRGRFSD----LTAGTMSLGGGAQSLLAQTARSLGNAF 289

Query: 138 SYCLVIPLPNGEYTSSYLKFGTDMGYRRPST--QATKFINHP-------NNFYYLSLKDI 188
           SYC    +P     S +L  G       P+T    T F   P        + Y + L+ I
Sbjct: 290 SYC----VPQAS-ASGFLSIGG------PATTNSTTVFATTPLVRSAINPSLYLVRLQGI 338

Query: 189 SIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLS 248
            +   R+  PP  F        G ++DS +V+T      Y  L   F +    +  +  +
Sbjct: 339 VVAGRRLGIPPVAF------SAGAVMDSSAVITQLPPTAYRALRRAFRNAMRAYPRSGAT 392

Query: 249 DCPEPIQLCY-FLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFF---LLAVAPHD 304
                +  CY FL  T  R P+++  F        G  V ++D         L   A   
Sbjct: 393 GT---LDTCYDFLGLTNVRVPAVSLVF-------GGGAVVVLDPPAVMIGGCLAFTATSS 442

Query: 305 DL-VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           DL +  IG+ QQ+    +YD+    + F +  C
Sbjct: 443 DLALGFIGNVQQQTHEVLYDVAAGGVGFRRGAC 475


>gi|115489316|ref|NP_001067145.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|77556903|gb|ABA99699.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113649652|dbj|BAF30164.1| Os12g0583300 [Oryza sativa Japonica Group]
 gi|125537189|gb|EAY83677.1| hypothetical protein OsI_38901 [Oryza sativa Indica Group]
          Length = 446

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 92/361 (25%), Positives = 144/361 (39%), Gaps = 39/361 (10%)

Query: 5   FIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTY 48
            IG P +    ++DTGS L++                  ++   SS+F  + C    C  
Sbjct: 95  LIGDPPQRAEALIDTGSDLVWTQCSTCLRKVCARQALPYYNSSASSTFAPVPCAARICA- 153

Query: 49  FKCVNEQCVYTMKYADQ-SVTKGFAAHETISVIGKGEGKAIFHGAL---FGCSNDNHGFD 104
               N+  ++    A   SV  G+ A      +G  E  A   G     FGC        
Sbjct: 154 ---ANDDIIHFCDLAAGCSVIAGYGAGVVAGTLGT-EAFAFQSGTAELAFGCVTFTR-IV 208

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
           + A  GA +G++GL R  +S +SQ G+    +FSYCL     N   T       +     
Sbjct: 209 QGALHGA-SGLIGLGRGRLSLVSQTGA---TKFSYCLTPYFHNNGATGHLFVGASASLGG 264

Query: 165 RPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG----EGGCIIDSGS 218
                 T+F+  P    FYYL L  +++   R+  P   FD+         GG IIDSGS
Sbjct: 265 HGDVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGS 324

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-A 277
             T    D Y  L  +  +      +A   D  +   LC    +     P++ F+F   A
Sbjct: 325 PFTSLVHDAYDALASELAARLNGSLVAPPPDADDG-ALCVARRDVGRVVPAVVFHFRGGA 383

Query: 278 NLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++ +  E+ +  +D       +A A      ++IG+ QQ++ R +YDL     SF   +C
Sbjct: 384 DMAVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADC 443

Query: 337 S 337
           S
Sbjct: 444 S 444


>gi|225440722|ref|XP_002275223.1| PREDICTED: aspartic proteinase nepenthesin-2 [Vitis vinifera]
 gi|147841923|emb|CAN65212.1| hypothetical protein VITISV_039022 [Vitis vinifera]
          Length = 458

 Score = 79.3 bits (194), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 153/389 (39%), Gaps = 73/389 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------------IFDPRKSSSFQKINC 41
           + L  GTP + +  ++DTGS +++A                    IF+P  SSS + + C
Sbjct: 89  IPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGC 148

Query: 42  DHPDCTYF-----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
             P C                    KC +    YT++Y   + + GF   E +   GK  
Sbjct: 149 RDPKCANTSSPDVHLGCPRCNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-- 205

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                H  L GC+      D +    ALAG     R   S   Q+G    K+F+YCL   
Sbjct: 206 ---TIHKFLVGCTTSA---DREPSSDALAG---FGRTMFSLPMQMGV---KKFAYCLN-- 251

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNNF---YYLSLKDISIDNERMNFP 198
             + +Y  +       + Y    TQ      F+ +P ++   YYL +KD+ I N+ +  P
Sbjct: 252 --SHDYDDTRNSGKLILDYSDGETQGLSYAPFLKNPPDYPFYYYLGVKDMKIGNKLLRIP 309

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
                      GG +IDSG    Y    V+  +  +      +++ +  ++    +  CY
Sbjct: 310 GKYLTPGSDSRGGVMIDSGFAYGYMTLPVFKIVTNELKKQMSKYRRSLEAETQSGLTPCY 369

Query: 259 -FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV---APHDDL------V 307
            F      + P + + F   AN+ + G N F++  E       V   +P ++L       
Sbjct: 370 NFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTNNLEFTPGPS 429

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++G+ QQ D    +DL  + L F ++ C
Sbjct: 430 IILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|302817726|ref|XP_002990538.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
 gi|300141706|gb|EFJ08415.1| hypothetical protein SELMODRAFT_131679 [Selaginella moellendorffii]
          Length = 434

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 78/304 (25%), Positives = 122/304 (40%), Gaps = 54/304 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
           +GTP +   L +DTGS L++                     +D + S+S  K+ C  P C
Sbjct: 42  LGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSC 101

Query: 47  TYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
           T    ++E       QC Y+ +Y D S T G+   + +  +       I     FGC   
Sbjct: 102 TLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI-----FGCGFK 156

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR--FSYCLVIPLPNGEYTSSYLKF 157
             G D    + AL G++G     +SF SQL    K    F++C    L  GE     L  
Sbjct: 157 QSG-DLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC----LDGGERGGGILVL 211

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
           G  +    P  Q T  + + ++ Y + L+ IS++N  +   P  F   V    G I DSG
Sbjct: 212 GNVI---EPDIQYTPLVPYMSH-YNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDSG 265

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDA 277
           + L Y   + Y    +        F L          +L  F+   +  FP++  YFE A
Sbjct: 266 TTLAYLPDEAYQAFTQAVSLVVAPFLLCD-------TRLSRFI---YKLFPNVVLYFEGA 315

Query: 278 NLRI 281
           ++ +
Sbjct: 316 SMTL 319


>gi|125548492|gb|EAY94314.1| hypothetical protein OsI_16081 [Oryza sativa Indica Group]
          Length = 417

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 80/310 (25%), Positives = 127/310 (40%), Gaps = 66/310 (21%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +V+L IGTP       +DT S LI+               +F+PR SS++  + C    C
Sbjct: 90  LVKLGIGTPPYKFTAAIDTASDLIWTQCQPCTGCYHQVDPMFNPRVSSTYAALPCSSDTC 149

Query: 47  TYF---KCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 +C    +E C YT  Y+  + T+G  A + + +     G+  F G  FGCS  +
Sbjct: 150 DELDVHRCGHDDDESCQYTYTYSGNATTEGTLAVDKLVI-----GEDAFRGVAFGCSTSS 204

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD 160
            G    A     +GV+GL R  +S +SQL     +RF+YCL    P        L  G D
Sbjct: 205 TG---GAPPPQASGVVGLGRGPLSLVSQLSV---RRFAYCLP---PPASRIPGKLVLGAD 255

Query: 161 MGYRRPSTQ--ATKFINHPN--NFYYLSLKDISIDNERMNF------------------- 197
               R +T   A      P   ++YYL+L  + I +  M+                    
Sbjct: 256 ADAARNATNRIAVPMRRDPRYPSYYYLNLDGLLIGDRTMSLPPTTTTTATATATATAPAP 315

Query: 198 --PPDTFDITVSGEG--GCIIDSGSVLTYFHSDVYWKLHEKFVSYFE-RFQLAQLSDCPE 252
              P+   + V      G IID  S +T+  + +Y    ++ V+  E   +L + +    
Sbjct: 316 TPSPNATAVAVGDANRYGMIIDIASTITFLEASLY----DELVNDLEVEIRLPRGTGSSL 371

Query: 253 PIQLCYFLPE 262
            + LC+ LP+
Sbjct: 372 GLDLCFILPD 381


>gi|125571060|gb|EAZ12575.1| hypothetical protein OsJ_02481 [Oryza sativa Japonica Group]
          Length = 501

 Score = 79.3 bits (194), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 104/377 (27%), Positives = 154/377 (40%), Gaps = 66/377 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCT 47
            ++ +GTP    L++LDTGS +++               +FDPR S S+  ++C  P C 
Sbjct: 149 TKIGVGTPVTPALMVLDTGSDVVWLQCAPCRRCYDQSGQMFDPRASHSYGAVDCAAPLCR 208

Query: 48  YFKC-----VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                      + C+Y + Y D SVT G  A ET++      G  +   AL GC +DN G
Sbjct: 209 RLDSGGCDLRRKACLYQVAYGDGSVTAGDFATETLTFA---SGARVPRVAL-GCGHDNEG 264

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV---IPLPNGEYTSSYLKFGT 159
                   A AG+LGL R ++SF SQ+     + FSYCLV       +    SS + FG+
Sbjct: 265 LFV-----AAAGLLGLGRGSLSFPSQISRRFGRSFSYCLVDRTSSSASATSRSSTVTFGS 319

Query: 160 DMGYRRPSTQATKFINHPNN-------------FYYLSLKDISIDNERMNFPPDTFDITV 206
             G R       + + HP+                +   +       R+  PPD      
Sbjct: 320 --GAR---GALGRRVLHPDGEEPQDGDVLLRAAHGHQRRRRARPGRGRVRPPPD----PS 370

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKF--VSYFERFQLAQLSDCPEPIQL---CYFLP 261
           +G GG I+DSG           W    +    +   R   A L   P    L   CY L 
Sbjct: 371 TGRGGVIVDSG------RPSPAWARAGRTPPCATRSRAAAAGLRLSPGGFSLFDTCYDLS 424

Query: 262 E-TFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
                + P+++ +F   A   +  EN  I       F  A A  D  V++IG+ QQ+  R
Sbjct: 425 GLKVVKVPTVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFR 484

Query: 320 FVYDLNIDLLSFVKENC 336
            V+D +   L FV + C
Sbjct: 485 VVFDGDGQRLGFVPKGC 501


>gi|195626958|gb|ACG35309.1| aspartic proteinase nepenthesin-1 precursor [Zea mays]
          Length = 450

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 83/360 (23%), Positives = 149/360 (41%), Gaps = 47/360 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +VR  +GTP + +LL +DT +   +              A FDP  S+S++ + C  P C
Sbjct: 113 VVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPAASASYRTVPCGSPLC 172

Query: 47  TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                       + C +++ YAD S+    +  ++++V G            FGC     
Sbjct: 173 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNA-----VKAYTFGCLQRAT 226

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G     +      +LGL R  +SF+SQ   + +  FSYCL  P       S  L+ G + 
Sbjct: 227 GTAAPPQG-----LLGLGRGPLSFLSQTKDMYEATFSYCL--PSFKSLNFSGTLRLGRNG 279

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             +R  T       H ++ YY+++  + +  + +  P   FD       G ++DSG++ T
Sbjct: 280 QPQRIKTTPLLANPHRSSLYYVNMTGVRVGRKVVPIP--AFDPATG--AGTVLDSGTMFT 335

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
              +  Y  + ++      R ++            C+    T   +P M   F+   + +
Sbjct: 336 RLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPMTLLFDGMQVTL 388

Query: 282 DGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ENV I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E C+
Sbjct: 389 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448


>gi|218187618|gb|EEC70045.1| hypothetical protein OsI_00635 [Oryza sativa Indica Group]
          Length = 570

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 84/358 (23%), Positives = 137/358 (38%), Gaps = 70/358 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           ++ + +G+P + +L I DTGS L++                   FDP +SS++ +++C  
Sbjct: 102 LMTVNLGSPPRSMLAIADTGSDLVWVKCKKGNNDTSSAAAPTTQFDPSRSSTYGRVSCQT 161

Query: 44  PDCTYFKCVN----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAI----FHGALFG 95
             C             C Y   Y D S T G  + ET +    G G++       G  FG
Sbjct: 162 DACEALGRATCDDGSNCAYLYAYGDGSNTTGVLSTETFTFDDGGAGRSPRQVRIGGVKFG 221

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSS 153
           CS    G            ++GL    +S ++QLG    + +RFSYCLV   P+    SS
Sbjct: 222 CSTATAGSFPADG------LVGLGGGAVSLVTQLGGATSLGRRFSYCLV---PHSVNASS 272

Query: 154 YLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
            L FG       P   +T  + +       S +                          I
Sbjct: 273 ALNFGALADVTEPGAASTPLVGNKTVASAASSR-------------------------II 307

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP----ETFNRFPS 269
           +DSG+ LT+    +   + ++      R  L  +      +QLCY +     E     P 
Sbjct: 308 VDSGTTLTFLDPSLLGPIVDEL---SRRITLPPVQSPDGLLQLCYNVAGREVEAGESIPD 364

Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFVYDLN 325
           +   F   A + +  EN F+   E    L  VA  +   V+++G+  Q++    YDL+
Sbjct: 365 LTLEFGGGAAVALKPENAFVAVQEGTLCLAIVATTEQQPVSILGNLAQQNIHVGYDLD 422


>gi|449449334|ref|XP_004142420.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 441

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 82/360 (22%), Positives = 142/360 (39%), Gaps = 44/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR  IGTP++ +LL LDT +   +             +F   KSSSF+ + C  P C  
Sbjct: 104 VVRAKIGTPAQTLLLALDTSNDAAWIPCSGCIGCPSTTVFSSDKSSSFRPLPCQSPQCNQ 163

Query: 49  F---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
                C    C + + Y   +V       + +++              FGC     G   
Sbjct: 164 VPNPSCSGSACGFNLTYGSSTVAADLV-QDNLTLATDS-----VPSYTFGCIRKATGSSV 217

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
             +     G        +S + Q  S+ +  FSYCL  P       S  L+ G      R
Sbjct: 218 PPQGLLGLGR-----GPLSLLGQSQSLYQSTFSYCL--PSFKSVNFSGSLRLGPVAQPIR 270

Query: 166 PSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
              + T  + +P  ++ YY++L  I +  + ++ PP       +   G +IDSG+  T  
Sbjct: 271 --IKYTPLLRNPRRSSLYYVNLISIRVGRKIVDIPPSALAFNSATGAGTVIDSGTTFTRL 328

Query: 224 HSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
            +  Y  + ++F     R   ++ L         CY +P      P++ F F   N+ + 
Sbjct: 329 VAPAYTAVRDEFRRRVGRNVTVSSLGG----FDTCYTVPII---SPTITFMFAGMNVTLP 381

Query: 283 GENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
            +N  I         LA+A   D    ++ +I S QQ++ R ++D+    +   +E+CS 
Sbjct: 382 PDNFLIHSTAGSTTCLAMAAAPDNVNSVLNVIASMQQQNHRILFDIPNSRVGVARESCSS 441


>gi|215766660|dbj|BAG98888.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 433

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/362 (22%), Positives = 149/362 (41%), Gaps = 49/362 (13%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           IGTP+    + LDTGS   +                     +DPR S S +++ CD   C
Sbjct: 89  IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
           T     N   +C Y   YAD  +T G    + +    + G G+ +       FGC     
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G   ++   A+ G++G      + +SQL +    KK FS+CL      G +    +    
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 P  + T  + +   ++ ++LK I++    +  P + F  T +   G  IDSGS 
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           L Y    +Y +L     +      +  + +     Q  +FL    ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372

Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
            +D     ++++YE + +        +  + D++ ++G     +   VYD+    + + +
Sbjct: 373 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 431

Query: 334 EN 335
            N
Sbjct: 432 HN 433


>gi|218194598|gb|EEC77025.1| hypothetical protein OsI_15381 [Oryza sativa Indica Group]
          Length = 422

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 81/362 (22%), Positives = 149/362 (41%), Gaps = 49/362 (13%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           IGTP+    + LDTGS   +                     +DPR S S +++ CD   C
Sbjct: 65  IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 124

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
           T     N   +C Y   YAD  +T G    + +    + G G+ +       FGC     
Sbjct: 125 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 184

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G   ++   A+ G++G      + +SQL +    KK FS+CL      G +    +    
Sbjct: 185 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 239

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 P  + T  + +   ++ ++LK I++    +  P + F  T +   G  IDSGS 
Sbjct: 240 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 293

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           L Y    +Y +L     +      +  + +     Q  +FL    ++FP + F+FE+ +L
Sbjct: 294 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 348

Query: 280 RIDGENV-FIIDYENHFFLL-----AVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVK 333
            +D     ++++YE + +        +  + D++ ++G     +   VYD+    + + +
Sbjct: 349 TLDVYPYDYLLEYEGNQYCFGFQDAGIHGYKDMI-ILGDMVISNKVVVYDMEKQAIGWTE 407

Query: 334 EN 335
            N
Sbjct: 408 HN 409


>gi|223942467|gb|ACN25317.1| unknown [Zea mays]
 gi|413936886|gb|AFW71437.1| pepsin A [Zea mays]
          Length = 491

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 97/382 (25%), Positives = 159/382 (41%), Gaps = 71/382 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IG+P KG  + +DTGS +++                     +DP  S +   + C+
Sbjct: 86  TRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 143

Query: 43  HPDCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              C               +  C + + Y D S T GF   + +    V G G+      
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
              FGC     G D  + + AL G+LG  +   S +SQL +   ++K F++CL      G
Sbjct: 204 SITFGCGA-QLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
            +    +         +P  + T  +  PN  +Y ++L+ IS+    +  P  TFD   S
Sbjct: 263 IFAIGNVV--------QPKVKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFD---S 309

Query: 208 GEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
           G+  G IIDSG+ L Y   +VY  L     + F+++Q   L +  + +  C+ F     +
Sbjct: 310 GDSKGTIIDSGTTLAYLPREVYRTL---LAAVFDKYQDLPLHNYQDFV--CFQFSGSIDD 364

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDY--ENHFFLLAVAPHDDLVA--------LIGSQQQ 315
            FP + F FE  +L +   NV+  DY  +N   L  +   D  V         L+G    
Sbjct: 365 GFPVITFSFE-GDLTL---NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVL 420

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
            +   VYDL  +++ +   NCS
Sbjct: 421 SNKLVVYDLEKEVIGWTDYNCS 442


>gi|302803839|ref|XP_002983672.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
 gi|300148509|gb|EFJ15168.1| hypothetical protein SELMODRAFT_118648 [Selaginella moellendorffii]
          Length = 388

 Score = 79.0 bits (193), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 79/305 (25%), Positives = 122/305 (40%), Gaps = 56/305 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDHPDC 46
           +GTP +   L +DTGS L++                     +D + S+S  K+ C  P C
Sbjct: 42  LGTPPRTYNLQVDTGSDLLWVNCHPCIGCPAFSDLKIPIVPYDVKASASSSKVPCSDPSC 101

Query: 47  TYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
           T    ++E       QC Y+ +Y D S T G+   + +  +       I     FGC   
Sbjct: 102 TLITQISESGCNDQNQCGYSFQYGDGSGTLGYLVEDVLHYMVNATATVI-----FGCGFK 156

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR--FSYCLVIPLPNGEYTSSYLKF 157
             G D    + AL G++G     +SF SQL    K    F++C    L  GE     L  
Sbjct: 157 QSG-DLSTSERALDGIIGFGASDLSFNSQLAKQGKTPNVFAHC----LDGGERGGGILVL 211

Query: 158 GTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
           G  +    P  Q T  +  P  ++Y + L+ IS++N  +   P  F   V    G I DS
Sbjct: 212 GNVI---EPDIQYTPLV--PYMYHYNVVLQSISVNNANLTIDPKLFSNDV--MQGTIFDS 264

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+ L Y   + Y    +        F L          +L  F+   +  FP++  YFE 
Sbjct: 265 GTTLAYLPDEAYQAFTQAVSLVVAPFLLCD-------TRLSRFI---YKLFPNVVLYFEG 314

Query: 277 ANLRI 281
           A++ +
Sbjct: 315 ASMTL 319


>gi|242095592|ref|XP_002438286.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
 gi|241916509|gb|EER89653.1| hypothetical protein SORBIDRAFT_10g011130 [Sorghum bicolor]
          Length = 495

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 85/334 (25%), Positives = 129/334 (38%), Gaps = 51/334 (15%)

Query: 28  FDPRKSSSFQKINCDHPDCTYFKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGK 86
           FDP  SSSF+ + C  PDC    C     C +T++ +      G    +T+++       
Sbjct: 185 FDPSMSSSFRSVLCGSPDCGGHSCSAGGSCTFTLQNSTFVFGNGTIVMDTLTL----SPS 240

Query: 87  AIFHGALFGCSN-DNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLV 142
           A F     GC   DN  F     DG   G + LS    S  +++          FSYC  
Sbjct: 241 ATFENFAVGCMQLDNDLF----TDGVAVGNIDLSLSRHSLATRVLNSSPPGMAAFSYC-- 294

Query: 143 IPLPNGEYTSSYLKFGTDMG--YRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
             LP    T  +L     +         +    + +P   NFYY+ L  I+I+ E +  P
Sbjct: 295 --LPADTDTHGFLTIAPALSDYSDHAGVKYVPLVTNPTGPNFYYVDLVAIAINGEDLPIP 352

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
           P  F        G +IDS S  TY +  +Y  L ++F     ++Q         P+    
Sbjct: 353 PALFT-----GNGTMIDSQSAFTYLNPPIYAALRDEFRKAMLQYQ---------PVPAFG 398

Query: 259 FLPETFNRFPSMAFYFEDANLRI-DGENVFIIDYENHFFL--------------LAVAPH 303
            L   +N   +   Y  D  LR  +GE + + D +  +F                A AP 
Sbjct: 399 GLDTCYNFTLAENIYLPDITLRFSNGETMDLDDRQFMYFFREHLTDGFPFGCLAFAAAPD 458

Query: 304 DDLV-ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +     +GSQ QR    VYD+   +++FV   C
Sbjct: 459 QNFPWNYLGSQVQRTKEIVYDVRGGMVAFVPSRC 492


>gi|413936472|gb|AFW71023.1| hypothetical protein ZEAMMB73_589717 [Zea mays]
          Length = 289

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 74/290 (25%), Positives = 118/290 (40%), Gaps = 40/290 (13%)

Query: 53  NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGAL 112
            +QC + + YAD + T G  + + +++       AI     FGC +  H     A  G  
Sbjct: 34  GKQCGFAISYADGTSTVGAYSQDKLTL----APGAIVQNFYFGCGHGKH-----AVRGLF 84

Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS----T 168
            GVLGL R+  S  ++ G +    FSYCL    P+      +L  G     + PS    T
Sbjct: 85  DGVLGLGRLRESLGARYGGV----FSYCL----PSVSSKPGFLALGAG---KNPSGFVFT 133

Query: 169 QATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
                   P  F  ++L  I++  ++++  P  F       GG I+DSG+V+T   S  Y
Sbjct: 134 PMGTVPGQPT-FSTVTLAGINVGGKKLDLRPSAF------SGGMIVDSGTVITGLQSTAY 186

Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENV 286
             L   F    E ++L    D    +  CY L    N   P +A  F   A + +D  N 
Sbjct: 187 RALRSAFRKAMEAYRLLPNGD----LDTCYNLTGYKNVVVPKIALTFTGGATINLDVPNG 242

Query: 287 FIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++   N     A +  D    ++G+  QR    ++D +     F  + C
Sbjct: 243 ILV---NGCLAFAESGPDGSAGVLGNVNQRAFEVLFDTSTSKFGFRAKAC 289


>gi|356515690|ref|XP_003526531.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 439

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 87/361 (24%), Positives = 151/361 (41%), Gaps = 45/361 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ +GTP + + ++LDT +   +             F P+ S+S+  ++C  P C   
Sbjct: 100 VVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQV 159

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   YA  S    F+A      +        ++   FGC N   G 
Sbjct: 160 RGLSCPATGTGACSFNQSYAGSS----FSATLVQDALRLATDVIPYYS--FGCVNAITGA 213

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A+           R  +S +SQ GS     FSYCL  P     Y S  LK G  +G 
Sbjct: 214 SVPAQGLLGL-----GRGPLSLLSQSGSNYSGIFSYCL--PSFKSYYFSGSLKLG-PVGQ 265

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  +  P+  + YY++   IS+    + FP +      +   G IIDSG+V+T
Sbjct: 266 PK-SIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVIT 324

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLR 280
            F   VY  + E+F           +         C+   +T+    P +  +FE  +L+
Sbjct: 325 RFVEPVYNAVREEFRKQVGGTTFTSIGA----FDTCFV--KTYETLAPPITLHFEGLDLK 378

Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  EN  I         LA+A   D    ++ +I + QQ++ R ++D+  + +   +E C
Sbjct: 379 LPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDIVNNKVGIAREVC 438

Query: 337 S 337
           +
Sbjct: 439 N 439


>gi|356553263|ref|XP_003544977.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 445

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 71/264 (26%), Positives = 110/264 (41%), Gaps = 44/264 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
           +V L IGTP +   ++LDTGS L +          A FDP  SSSF  + C HP C    
Sbjct: 89  VVTLPIGTPPQPQQMVLDTGSQLSWIQCHNKTPPTASFDPSLSSSFYVLPCTHPLCKPRV 148

Query: 49  ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C  N  C Y+  YAD +  +G    E ++         +    + GCS+++ 
Sbjct: 149 PDFTLPTTCDQNRLCHYSYFYADGTYAEGNLVREKLAFSPSQTTPPL----ILGCSSESR 204

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI--PLPNGEYTSSYLKFGT 159
               DAR     G+LG++   +SF  Q       +FSYC+    P  N  + +     G 
Sbjct: 205 ----DAR-----GILGMNLGRLSFPFQAKVT---KFSYCVPTRQPANNNNFPTGSFYLGN 252

Query: 160 DMGYRR-------PSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           +    R          Q+ +  N     Y + ++ I I   ++N PP  F     G G  
Sbjct: 253 NPNSARFRYVSMLTFPQSQRMPNLDPLAYTVPMQGIRIGGRKLNIPPSVFRPNAGGSGQT 312

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFV 236
           ++DSGS  T+     Y ++ E+ +
Sbjct: 313 MVDSGSEFTFLVDVAYDRVREEII 336


>gi|125532788|gb|EAY79353.1| hypothetical protein OsI_34482 [Oryza sativa Indica Group]
          Length = 394

 Score = 78.6 bits (192), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 82/361 (22%), Positives = 142/361 (39%), Gaps = 55/361 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP +    ++D    L++               +FDP  S++++   C  P C     
Sbjct: 57  IGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPS 116

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C Y     +   T G    +T +V   G  KA      FGC   +   D D
Sbjct: 117 DSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
              G  +G++GL R   S ++Q G      FSYCL  P   G+ ++ +L     +  G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGKNSALFLGSSAKLAGGGK 221

Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             ST       + N   N+Y + L+ +   +  +  PP    +        ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
           +     Y  + +          +A      EP  LC+         P + F F       
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330

Query: 282 DGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              + +++DY+N    LA+     L     ++L+GS QQ +  F++DL+ + LSF   +C
Sbjct: 331 VAASNYLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPADC 390

Query: 337 S 337
           +
Sbjct: 391 T 391


>gi|115483168|ref|NP_001065177.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|21717168|gb|AAM76361.1|AC074196_19 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433289|gb|AAP54827.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639786|dbj|BAF27091.1| Os10g0538200 [Oryza sativa Japonica Group]
 gi|215686408|dbj|BAG87693.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score = 78.6 bits (192), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 57/362 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP +    ++D    L++               +FDP  S++++   C  P C     
Sbjct: 57  IGTPPQPASAVIDLAGELVWTQCKQCSRCFEQDTPLFDPTASNTYRAEPCGTPLCESIPS 116

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C Y     +   T G    +T +V   G  KA      FGC   +   D D
Sbjct: 117 DSRNCSGNVCAY-QASTNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
              G  +G++GL R   S ++Q G      FSYCL  P   G  ++ +L     +  G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGRNSALFLGSSAKLAGGGK 221

Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             ST       + N   N+Y + L+ +   +  +  PP    +        ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANLR 280
           +     Y  + +   +      +A      EP  LC+         P + F F   A + 
Sbjct: 274 FLVDGAYQAVKKAVTAAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           +   N +++DY+N    LA+     L     ++L+GS QQ +  F++DL+ + LSF   +
Sbjct: 331 VPATN-YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389

Query: 336 CS 337
           C+
Sbjct: 390 CT 391


>gi|259490398|ref|NP_001159203.1| uncharacterized protein LOC100304289 [Zea mays]
 gi|223942623|gb|ACN25395.1| unknown [Zea mays]
          Length = 378

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 88/373 (23%), Positives = 140/373 (37%), Gaps = 48/373 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
           VR  +GTP++  +L+ DTGS L +                  F   +S S+  + C    
Sbjct: 16  VRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDT 75

Query: 46  CTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI----------GKGEGKA 87
           CT +       C +    C Y  +Y D S  +G    +  ++           G G  +A
Sbjct: 76  CTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRA 135

Query: 88  IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
              G + GC+    G    + DG    VL L    ISF S+  +    RFSYCLV  L  
Sbjct: 136 KLQGVVLGCTATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAP 191

Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
               SSYL FG            T  +     + FY +++  + +  E ++ P D +D  
Sbjct: 192 -RNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWD-- 248

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
           V   GG I+DSG+ LT   +  Y  +               +    +P + CY       
Sbjct: 249 VGRGGGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM----DPFEYCYNWTAGAP 304

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDL 324
             P +   F  +         ++ID       + V       V++IG+  Q++  + +DL
Sbjct: 305 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDL 364

Query: 325 NIDLLSFVKENCS 337
               L F    C+
Sbjct: 365 RDRWLRFKHTRCA 377


>gi|413944387|gb|AFW77036.1| hypothetical protein ZEAMMB73_461996 [Zea mays]
          Length = 472

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 144/372 (38%), Gaps = 63/372 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDH- 43
           +V L IGTP+    +++DTGS L +                 ++DP  SS++  + CD  
Sbjct: 128 VVTLGIGTPAVQQTVLIDTGSDLSWVQCKPCNSSSCYPQKDPLYDPTASSTYAPVPCDSK 187

Query: 44  ------PDCTYFKCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
                 PD     C N      C Y ++Y ++  T G  + ET+++  +   K       
Sbjct: 188 ACKDLVPDAYDHGCTNSSGTSLCQYGIEYGNRDTTVGVYSTETLTLSPQVSVKDFG---- 243

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC     G  +        G+LGL     S +SQ        FSYC    LP G  T+ 
Sbjct: 244 FGC-----GLVQQGTFDLFDGLLGLGGAPESLVSQTAETYGGAFSYC----LPPGNSTTG 294

Query: 154 YLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITV 206
           +L  G        +     F+  P         FY ++L  +S+  + ++ PP       
Sbjct: 295 FLALGAPTN----NNDTAGFLFTPLHSLPEQATFYLVNLTGVSVGGKPLDIPPTVL---- 346

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
              GG IIDSG+++T      Y  L   F +    + L   ++  + +  CY      N 
Sbjct: 347 --SGGMIIDSGTIITGLPDTAYSALRTAFRTAMSAYPLLPPNN-DDVLDTCYNFTGIANV 403

Query: 266 RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDL 324
             P++A  F+  A + +D  +  +I         A    D  V +IG+  QR    +YD 
Sbjct: 404 TVPTVALTFDGGATIDLDVPSGVLI---QDCLAFAGGASDGDVGIIGNVNQRTFEVLYDS 460

Query: 325 NIDLLSFVKENC 336
               + F    C
Sbjct: 461 GRGHVGFRPGAC 472


>gi|356556809|ref|XP_003546713.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-1-like [Glycine max]
          Length = 444

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 78/362 (21%), Positives = 140/362 (38%), Gaps = 49/362 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY------------AIFDPRKSSSFQKINCDHPDCTY 48
           +VR   GTP++ +LL +DT +   +              F P KS++F+K+ C    C  
Sbjct: 107 IVRAKFGTPAQTLLLAMDTSNDAAWVPCTACVGCSTTTPFAPPKSTTFKKVGCGASQCKQ 166

Query: 49  FK---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
            +   C    C +   Y   SV       +T+++              FGC     G   
Sbjct: 167 VRNPTCDGSACAFNFTYGTSSVAASLV-QDTVTL-----ATDPVPAYTFGCIQKATGSSL 220

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM-GYR 164
             +           R  +S ++Q   + +  FSYCL        + +       D+    
Sbjct: 221 PPQGLLGL-----GRGPLSLLAQTQKLYQSTFSYCL------PSFKTLNFSGHXDLXPVA 269

Query: 165 RPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +P  Q      +P  ++ YY++L  I +    ++ PP+          G + DSG+V T 
Sbjct: 270 QPRDQVYPSFKNPRRSSLYYVNLVAIRVGRRIVDIPPEALAFNPXTGAGTVFDSGTVFTR 329

Query: 223 FHSDVYWKLHEKF---VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
                Y  +  +F   VS  ++  +  L         CY +P      P++ F F   N+
Sbjct: 330 LVEPAYTAVRNEFRRRVSVHKKLTVTSLGG----FDTCYTVPIV---APTITFMFSGMNV 382

Query: 280 RIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            +  +N+ I         LA+AP  D    ++ +I + QQ++ R ++D+    L   +E 
Sbjct: 383 TLPPDNILIHSTAGSVTCLAMAPAPDNVNSVLNVIANMQQQNHRVLFDVPNSRLGVAREL 442

Query: 336 CS 337
           C+
Sbjct: 443 CT 444


>gi|125555046|gb|EAZ00652.1| hypothetical protein OsI_22673 [Oryza sativa Indica Group]
          Length = 340

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 73/256 (28%), Positives = 111/256 (43%), Gaps = 40/256 (15%)

Query: 20  GSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISV 79
           G A     FDP +SSSF  I C  P+C   +C    C +T+++ + +V  G    +T+++
Sbjct: 25  GGAPCDVAFDPSRSSSFAAIPCGSPECA-VECTGASCPFTIQFGNVTVANGTLVRDTLTL 83

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS-----IIK 134
                  A F G  FGC     G D D  DGA+ G++ LSR + S  S++ S        
Sbjct: 84  ----SPSATFAGFTFGC--IEVGADADTFDGAV-GLIDLSRSSHSLASRVISNGATTTTT 136

Query: 135 KRFSYCLVIPLPNGEYTSSYLKFGT--------DMGYRRPSTQATKFINHPNNFYYLSLK 186
             FSYCL  P  +   +  +L  G         D+ Y   S+      NHPN+ Y++ L 
Sbjct: 137 AAFSYCL--PSLSSTRSRGFLSIGASRPEYSGGDIKYAPMSSNP----NHPNS-YFVDLV 189

Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
            IS+  E +  PP      V    G ++++ +  T+     Y  L + F     R  +AQ
Sbjct: 190 GISVGGEDLPVPP-----AVLAAHGTLLEAATEFTFLAPAAYAALRDAF-----RNDMAQ 239

Query: 247 LSDCP--EPIQLCYFL 260
               P    +  CY L
Sbjct: 240 YPAAPPFRVLDTCYNL 255


>gi|414589629|tpg|DAA40200.1| TPA: hypothetical protein ZEAMMB73_727364, partial [Zea mays]
          Length = 201

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 47/178 (26%), Positives = 79/178 (44%), Gaps = 13/178 (7%)

Query: 169 QATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
           Q T  +  P N  FYY+    +++   R+  P   F +   G GG I+DSG+ LT   + 
Sbjct: 27  QTTPLLQSPQNPTFYYVHFTGLTVGARRLRIPESAFALRPDGSGGVIVDSGTALTLLPAA 86

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR--------FPSMAFYFEDAN 278
           V   L E   ++ ++ +L   +       +C+ +P  + R         P M  +F+ A+
Sbjct: 87  V---LAEVVRAFRQQLRLPFANGGNPEDGVCFLVPAAWRRSSSTSQMPVPRMVLHFQGAD 143

Query: 279 LRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           L +   N  + D+      L +A   D  + IG+  Q+D R +YDL  + LS     C
Sbjct: 144 LDLPRRNYVLDDHRRGRLCLLLADSGDDGSTIGNLVQQDMRVLYDLEAETLSIAPARC 201


>gi|226531872|ref|NP_001147022.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
 gi|195606574|gb|ACG25117.1| aspartic proteinase nepenthesin-2 precursor [Zea mays]
          Length = 491

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 91/402 (22%), Positives = 162/402 (40%), Gaps = 85/402 (21%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINCDHPD 45
           +GTP + + ++LDTGS L +                     +F P+ SSS + + C +P 
Sbjct: 105 LGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPS 164

Query: 46  CTYF--------KCVNEQC----------------VYTMKYADQSVTKGFAAHETISVIG 81
           C +         KC    C                 Y + Y   S T G    +T+    
Sbjct: 165 CQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTL---- 219

Query: 82  KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
           +  G+A+  G + GCS         +     +G+ G  R   S  +QLG     +FSYCL
Sbjct: 220 RAPGRAV-PGFVLGCS-------LVSVHQPPSGLAGFGRGAPSVPAQLG---LPKFSYCL 268

Query: 142 VI------PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNER 194
           +          +G         G  M Y  P  ++      P   +YYL+L+ +++  + 
Sbjct: 269 LSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKSAAGDKLPYGVYYYLALRGVTVGGKA 327

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEP 253
           +  P   F    +G GG I+DSG+  TY    V+  + +  V+    R++ ++ ++    
Sbjct: 328 VRLPARAFAGNAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDGLG 387

Query: 254 IQLCYFLPETFNR--FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
           +  C+ LP+       P ++F+FE  A +++  EN F++        + +A   D     
Sbjct: 388 LHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFGGGS 447

Query: 309 -----------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
                      ++GS QQ++    YDL  + L F +++C+  
Sbjct: 448 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 489


>gi|413948408|gb|AFW81057.1| hypothetical protein ZEAMMB73_038743 [Zea mays]
          Length = 469

 Score = 78.2 bits (191), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 87/373 (23%), Positives = 140/373 (37%), Gaps = 48/373 (12%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI----------------FDPRKSSSFQKINCDHPD 45
           VR  +GTP++  +L+ DTGS L +                  F   +S S+  + C    
Sbjct: 107 VRFRVGTPAQPFVLVADTGSDLTWVKCRGAAGPPASDPPAREFRASESRSWAPLACSSDT 166

Query: 46  CTYF------KCVN--EQCVYTMKYADQSVTKGFAAHETISVI----------GKGEGKA 87
           CT +       C +    C Y  +Y D S  +G    +  ++           G G  +A
Sbjct: 167 CTSYVPFSLANCSSPASPCAYDYRYKDGSAARGVVGTDAATIALSGSGSEDGSGGGGRRA 226

Query: 88  IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPN 147
              G + GC+    G    + DG    VL L    ISF S+  +    RFSYCLV  L  
Sbjct: 227 KLQGVVLGCTATYDGQSFQSSDG----VLSLGNSNISFASRAAARFGGRFSYCLVDHLAP 282

Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
               SSYL FG            T  +     + FY +++  + +  E ++ P D +D+ 
Sbjct: 283 -RNASSYLTFGPGPEGGGAPAARTPLVLDRRVSPFYAVAVDAVYVAGEALDIPADVWDVG 341

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
               GG I+DSG+ LT   +  Y  +               +    +P + CY       
Sbjct: 342 RG--GGAILDSGTSLTVLATPAYRAVVAALGGRLAALPRVAM----DPFEYCYNWTAGAP 395

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDL 324
             P +   F  +         ++ID       + V       V++IG+  Q++  + +DL
Sbjct: 396 EIPKLEVSFAGSARLEPPAKSYVIDAAPGVKCIGVQEGAWPGVSVIGNILQQEHLWEFDL 455

Query: 325 NIDLLSFVKENCS 337
               L F    C+
Sbjct: 456 RDRWLRFKHTRCA 468


>gi|125533812|gb|EAY80360.1| hypothetical protein OsI_35532 [Oryza sativa Indica Group]
          Length = 428

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/360 (25%), Positives = 152/360 (42%), Gaps = 53/360 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTP+K  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 83  VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 142

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++     +  +      FGC+ D+
Sbjct: 143 GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 198

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 199 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPRFDGFSYCL--PLQKSERGFFSKTTGYF 252

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 253 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 305

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      R   A+     E  + CY +        P+++ 
Sbjct: 306 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 361

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLL 329
           +F+D A   +    VF+     E   + LA AP +  V++IGS  Q     VYDL   L+
Sbjct: 362 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIGSLMQTSKEVVYDLKRQLI 420


>gi|380719867|gb|AFD63134.1| aspartyl protease [Vitis quinquangularis]
          Length = 458

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 91/389 (23%), Positives = 151/389 (38%), Gaps = 73/389 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA--------------------IFDPRKSSSFQKINC 41
           + L  GTP + +  ++DTGS +++A                    IF+P  SSS + + C
Sbjct: 89  IPLSFGTPPQKLSFLMDTGSHVVWAPCTTHYTCTNCSFSNPKKVPIFNPELSSSDKILGC 148

Query: 42  DHPDCT-----------------YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
             P C                    KC +    YT++Y   + + GF   E +   GK  
Sbjct: 149 RDPKCADTSSPBVHLGXPRCNGNSKKCSHACPQYTLQYGTGAAS-GFFLLENLDFPGK-- 205

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                H  L GC+      D +    ALAG     R   S   Q+G    K+F+YCL   
Sbjct: 206 ---TIHKFLVGCTTSA---DREPSSDALAG---FGRTMFSLPMQMGV---KKFAYCLN-- 251

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPNNF---YYLSLKDISIDNERMNFP 198
             + +Y  +       + Y    TQ      F  +P ++   YYL +KD+ I N+ +  P
Sbjct: 252 --SHDYDDTRNSGKLILDYSDGETQGLSYAPFXKNPPDYPIYYYLGVKDMKIGNKVLRIP 309

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY 258
                      GG +IDSG   +Y    V+  +  +      +++ +   +    +  CY
Sbjct: 310 GKYLTPGSDSRGGVVIDSGFAYSYMTLPVFKIVTNELKKQMSKYRRSLELEAQTGVTPCY 369

Query: 259 -FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV---APHDDL------V 307
            F      + P + + F   AN+ + G N F++  E       V   +P  +L       
Sbjct: 370 NFTGHKSIKIPDLIYQFTGGANMVVPGMNYFLLFSEASLGCFPVTTDSPTSNLEFTPGPS 429

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            ++G+ QQ D    +DL  + L F ++ C
Sbjct: 430 IILGNYQQVDHYVEFDLKNERLGFRQQTC 458


>gi|297806153|ref|XP_002870960.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316797|gb|EFH47219.1| hypothetical protein ARALYDRAFT_908082 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 453

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 89/374 (23%), Positives = 153/374 (40%), Gaps = 59/374 (15%)

Query: 9   PSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY-------- 48
           P + + +++DTGS L +              FDP +SSS+  I C  P C          
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141

Query: 49  FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C +++ C  T+ YAD S ++G  A E I   G     +     +FGC     G D + 
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAE-IFHFGNSTNDS---NLIFGCMGSVSGSDPE- 196

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV----IP--LPNGEYTSSYLKFGTDM 161
            D    G+LG++R ++SFISQ+G     +FSYC+      P  L  G+   ++L   T +
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMG---FPKFSYCISGTDDFPGFLLLGDSNFTWL---TPL 250

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            Y      +T         Y + L  I ++ + +  P        +G G  ++DSG+  T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLLPDHTGAGQTMVDSGTQFT 310

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPE------TFNRFPSM 270
           +    VY  L   F++  +   +  + + PE      + LCY +          +R P++
Sbjct: 311 FLLGPVYTALRSDFLN--QTNGILTVYEDPEFVFQGTMDLCYRISPFRIRTGILHRLPTV 368

Query: 271 AFYFEDANLRIDGENVFI----IDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
           +  FE A + + G+ +      +   N         + DL+ +    IG   Q++    +
Sbjct: 369 SLVFEGAEIAVSGQPLLYRVPHLTAGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428

Query: 323 DLNIDLLSFVKENC 336
           DL    +      C
Sbjct: 429 DLQRSRIGLAPVQC 442


>gi|242063796|ref|XP_002453187.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
 gi|241933018|gb|EES06163.1| hypothetical protein SORBIDRAFT_04g001340 [Sorghum bicolor]
          Length = 493

 Score = 78.2 bits (191), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 86/366 (23%), Positives = 140/366 (38%), Gaps = 60/366 (16%)

Query: 9   PSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTYF---- 49
           P K   +++DTGS + +                +FDP  SS++   +C    C       
Sbjct: 150 PGKSQTMLIDTGSDISWVRCKPCWQQCRPQVDPLFDPSLSSTYSPFSCSSAACAQLFQEG 209

Query: 50  ---KCVNE-QCVYTMKYADQSV-TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
               C +  QC Y   Y D SV T G  + +T++ +G      +     FGCS+   G  
Sbjct: 210 NANGCSSSGQCQYIAMYGDGSVGTTGTYSSDTLA-LGSNSNTVVVSKFRFGCSHAETGIT 268

Query: 105 EDARDGALAGVLGLSRVTISFISQL-GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
                              S +SQ  G+     FSYCL    P    +S +L  G     
Sbjct: 269 GLTAGLMGL-----GGGAQSLVSQTAGTFGTTAFSYCL----PPTPSSSGFLTLGAA--- 316

Query: 164 RRPSTQATKFINHP-------NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
               T +  F+  P         FY + L+ I +   +++ P   F        G I+DS
Sbjct: 317 ---GTSSAGFVKTPMLRSSQVPAFYGVRLEAIRVGGRQLSIPTTVF------SAGMIMDS 367

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNRFPSMAFYFE 275
           G+V+T      Y  L   F +  +++  A  S     +  C+ +  ++    P++A  F 
Sbjct: 368 GTVVTRLPPTAYSSLSSAFKAGMKQYPPAPSSAGGGFLDTCFDMSGQSSVSMPTVALVFS 427

Query: 276 DAN---LRIDGENVFIIDYENHFFLLA-VAPHDD-LVALIGSQQQRDTRFVYDLNIDLLS 330
            A    + +D   + +    +  F LA VA  DD    +IG+ QQR  + +YD+    + 
Sbjct: 428 GAGGAVVNLDASGILLQMETSSIFCLAFVATSDDGSTGIIGNVQQRTFQVLYDVAGGAVG 487

Query: 331 FVKENC 336
           F    C
Sbjct: 488 FKAGAC 493


>gi|255547548|ref|XP_002514831.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
 gi|223545882|gb|EEF47385.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis]
          Length = 488

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 151/378 (39%), Gaps = 64/378 (16%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ +G P K   + +DTGS +++                    ++DP+ S+S  +I CD 
Sbjct: 85  KIGLGNPPKDYYVQVDTGSDILWVNCANCDKCPTKSDLGVKLTLYDPQSSTSATRIYCDD 144

Query: 44  PDC--TYFK----CVNE-QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
             C  TY      C  +  C Y++ Y D S T GF   + +    V G  +  +     +
Sbjct: 145 DFCAATYNGVLQGCTKDLPCQYSVVYGDGSSTAGFFVKDNLQFDRVTGNLQTSSANGSVI 204

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G +      AL G+LG  +   S ISQL +   +K+ F++CL      G + 
Sbjct: 205 FGCGAKQSG-ELGTSSEALDGILGFGQANSSMISQLAAAGKVKRVFAHCLDNVKGGGIFA 263

Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
              +          P    T  + N P+  Y + +K+I +    +  P D FD       
Sbjct: 264 IGEVV--------SPKVNTTPMVPNQPH--YNVVMKEIEVGGNVLELPTDIFD--TGDRR 311

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
           G IIDSG+ L Y    VY  +  K VS     +L  +    E    C+      N  FP 
Sbjct: 312 GTIIDSGTTLAYLPEVVYESMMTKIVSEQPGLKLHTV----EEQFTCFQYTGNVNEGFPV 367

Query: 270 MAFYFEDA-NLRID--------GENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTR 319
           + F+F  + +L ++         E V+   ++N      +   D   + L+G     +  
Sbjct: 368 VKFHFNGSLSLTVNPHDYLFQIHEEVWCFGWQNS----GMQSKDGRDMTLLGDLVLSNKL 423

Query: 320 FVYDLNIDLLSFVKENCS 337
            +YDL    + +   NCS
Sbjct: 424 VLYDLENQAIGWTDYNCS 441


>gi|125536419|gb|EAY82907.1| hypothetical protein OsI_38120 [Oryza sativa Indica Group]
          Length = 448

 Score = 77.8 bits (190), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 79/321 (24%), Positives = 134/321 (41%), Gaps = 58/321 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           +++  IG P   +   +DTGS L++               ++DP +S S  K+ C    C
Sbjct: 88  IMQFSIGEPPLLIWAEVDTGSDLMWVKCSPCNGCNPPPSPLYDPARSRSSGKLPCSSQLC 147

Query: 47  TYF---KCVNEQCV---------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
                 + +++QC          Y   ++    T+G    ET +    G+G  + +   F
Sbjct: 148 QALGRGRIISDQCSDDPPLCGYHYAYGHSGDHSTQGVLGTETFTF---GDGY-VANNVSF 203

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT--- 151
           G S+   G    ++ G  AG++GL R  +S +SQLG+    RF+YCL    PN   T   
Sbjct: 204 GRSDTIDG----SQFGGTAGLVGLGRGHLSLVSQLGA---GRFAYCLAAD-PNVYSTILF 255

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHP----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
            S     T  G       +T  + +P    +  YY++L+ IS+   R+     TF I   
Sbjct: 256 GSLAALDTSAG----DVSSTPLVTNPKPDRDTHYYVNLQGISVGGSRLPIKDGTFAINSD 311

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP--ETFN 265
           G GG   DSG++ T      Y  + +   S  +R       D       C+     +   
Sbjct: 312 GSGGVFFDSGAIDTSLKDAAYQVVRQAITSEIQRLGYDAGDDT------CFVAANQQAVA 365

Query: 266 RFPSMAFYFED-ANLRIDGEN 285
           + P +  +F+D A++ ++G N
Sbjct: 366 QMPPLVLHFDDGADMSLNGRN 386


>gi|356539818|ref|XP_003538390.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 457

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 78/272 (28%), Positives = 116/272 (42%), Gaps = 57/272 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +V L IGTP +   ++LDTGS L +              A FDP  SS+F  + C HP C
Sbjct: 98  IVDLPIGTPPQVQPMVLDTGSQLSWIQCHKKAPAKPPPTASFDPSLSSTFSTLPCTHPVC 157

Query: 47  TY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-FGC 96
                       C  N  C Y+  YAD +  +G    E  +       +++F   L  GC
Sbjct: 158 KPRIPDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTF-----SRSLFTPPLILGC 212

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           + ++     D R     G+LG++R  +SF SQ  S I K FSYC+    P       Y  
Sbjct: 213 ATES----TDPR-----GILGMNRGRLSFASQ--SKITK-FSYCV----PTRVTRPGYTP 256

Query: 157 FGTDMGYRRPSTQATKFI---------NHPNN---FYYLSLKDISIDNERMNFPPDTFDI 204
            G+      P++   ++I           PN     Y ++L+ I I   ++N  P  F  
Sbjct: 257 TGSFYLGHNPNSNTFRYIEMLTFARSQRMPNLDPLAYTVALQGIRIGGRKLNISPAVFRA 316

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
              G G  ++DSGS  TY  ++ Y K+  + V
Sbjct: 317 DAGGSGQTMLDSGSEFTYLVNEAYDKVRAEVV 348


>gi|88174563|gb|ABD39356.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 323

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 84/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  +L +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
            G +E    G + G+LG+    +S + Q  S     FSYCL + +    +   T+ Y   
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173

Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G  +   R   + TK +    N   +++ L  IS+D ER+   P  F        G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
           SGS L+Y        L ++      R   A+     E  + CY +        P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284

Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323


>gi|357125326|ref|XP_003564345.1| PREDICTED: aspartic proteinase-like protein 2-like [Brachypodium
           distachyon]
          Length = 506

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 88/376 (23%), Positives = 149/376 (39%), Gaps = 56/376 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G P+K   + +DTGS +++                     F+P  SS+  +I C 
Sbjct: 91  TRVKLGNPAKEYFVQIDTGSDILWVACSPCTGCPTSSGLNIQLEFFNPDSSSTSSRIPCS 150

Query: 43  HPDCTYFKCVNEQ-----------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI 88
              CT      E            C YT  Y D S T GF   +T+   +V+G  +    
Sbjct: 151 DDRCTAALQTGEAVCQSSDSPSSPCGYTFTYGDGSGTSGFYVSDTMYFDTVMGNEQTANS 210

Query: 89  FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLP 146
               +FGCSN   G D    D A+ G+ G  +  +S +SQL S  +  K FS+C    L 
Sbjct: 211 SASVVFGCSNSQSG-DLMKTDRAVDGIFGFGQHQLSVVSQLYSLGVSPKTFSHC----LK 265

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDIT 205
             +     L  G  +    P    T  + + P+  Y L+L+ I++  +++  P D+    
Sbjct: 266 GSDNGGGILVLGEIV---EPGLVFTPLVPSQPH--YNLNLESIAVSGQKL--PIDSSLFA 318

Query: 206 VSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            S   G I+DSG+ L Y     Y    + F++         +         C+    + +
Sbjct: 319 TSNTQGTIVDSGTTLVYLVDGAY----DPFINAIAAAVSPSVRSVVSKGIQCFVTTSSVD 374

Query: 266 -RFPSMAFYFEDA-NLRIDGENVFIID--YENHFFLLAVAPHDDLVALIGSQQQRDTRFV 321
             FP+   YF+   ++ +  EN  +     +N+            + ++G    +D  FV
Sbjct: 375 SSFPTATLYFKGGVSMTVKPENYLLQQGSVDNNVLWCIGWQRSQGITILGDLVLKDKIFV 434

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 435 YDLANMRMGWADYDCS 450


>gi|224140036|ref|XP_002323393.1| predicted protein [Populus trichocarpa]
 gi|222868023|gb|EEF05154.1| predicted protein [Populus trichocarpa]
          Length = 459

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 97/391 (24%), Positives = 153/391 (39%), Gaps = 71/391 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           + L  GTP +    ++DTGS+L++                        F P+ SSS + I
Sbjct: 85  ISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSECNFPNIKKTGIPTFLPKLSSSSKLI 144

Query: 40  NCDHPDCTYF-------KC-----VNEQCV-----YTMKYADQSVTKGFAAHETISVIGK 82
            C +P C+         KC       + C      Y ++Y   S T G    ET+     
Sbjct: 145 GCKNPRCSMIFGPEIQSKCQECDSTAQNCTQTCPPYVIQYGSGS-TAGLLLSETLDF--- 200

Query: 83  GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
              K      L GCS  +    E        G+ G  R   S  SQLG    K+FSYCLV
Sbjct: 201 -PNKKTIPDFLVGCSIFSIKQPE--------GIAGFGRSPESLPSQLG---LKKFSYCLV 248

Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHPN----NFYYLSLKDISIDNERM 195
               +   TSS L   T  G     T     T F+ +P     ++YY+ L++I I +  +
Sbjct: 249 SHAFDDTPTSSDLVLDTGSGSGVTKTAGLSHTPFLKNPTTAFRDYYYVLLRNIVIGDTHV 308

Query: 196 NFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ 255
             P         G GG I+DSG+  T+  + VY  + ++F      + +A        ++
Sbjct: 309 KVPYKFLVPGTDGNGGTIVDSGTTFTFMENPVYELVAKEFEKQMAHYTVATEIQNLTGLR 368

Query: 256 LCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPH-------DDL 306
            CY +  E     P + F F+  A + +   N F I       L  V+ +          
Sbjct: 369 PCYNISGEKSLSVPDLIFQFKGGAKMALPLSNYFSIVDSGVICLTIVSDNVAGPGLGGGP 428

Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ++G+ QQR+    +DL  +   F +++C+
Sbjct: 429 AIILGNYQQRNFYVEFDLENEKFGFKQQSCA 459


>gi|212275143|ref|NP_001130306.1| uncharacterized protein LOC100191400 precursor [Zea mays]
 gi|194688798|gb|ACF78483.1| unknown [Zea mays]
 gi|194703430|gb|ACF85799.1| unknown [Zea mays]
 gi|194707192|gb|ACF87680.1| unknown [Zea mays]
 gi|223944599|gb|ACN26383.1| unknown [Zea mays]
 gi|223948667|gb|ACN28417.1| unknown [Zea mays]
 gi|414887962|tpg|DAA63976.1| TPA: aspartic proteinase nepenthesin-1 [Zea mays]
          Length = 450

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 80/360 (22%), Positives = 145/360 (40%), Gaps = 47/360 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC 46
           +VR  +GTP + +LL +DT +   +              A FDP  S+S++ + C  P C
Sbjct: 113 VVRASLGTPPQQLLLAVDTSNDASWIPCAGCAGCPTSSAAPFDPASSASYRTVPCGSPLC 172

Query: 47  TYFKCV-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                       + C +++ YAD S+    +  ++++V G            FGC     
Sbjct: 173 AQAPNAACPPGGKACGFSLTYADSSLQAALS-QDSLAVAGNA-----VKAYTFGCLQRAT 226

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
           G     +           R  +SF+SQ   + +  FSYCL  P       S  L+ G + 
Sbjct: 227 GTAAPPQGLLGL-----GRGPLSFLSQTKDMYEATFSYCL--PSFKSLNFSGTLRLGRNG 279

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             +R  T       H ++ YY+++  I +  + +  P   FD       G ++DSG++ T
Sbjct: 280 QPQRIKTTPLLANPHRSSLYYVNMTGIRVGRKVVPIP--AFDPATG--AGTVLDSGTMFT 335

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
              +  Y  + ++      R ++            C+    T   +P +   F+   + +
Sbjct: 336 RLVAPAYVAVRDEV-----RRRVGAPVSSLGGFDTCFN--TTAVAWPPVTLLFDGMQVTL 388

Query: 282 DGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ENV I         LA+A   D    ++ +I S QQ++ R ++D+    + F +E C+
Sbjct: 389 PEENVVIHSTYGTISCLAMAAAPDGVNTVLNVIASMQQQNHRVLFDVPNGRVGFARERCT 448


>gi|226492150|ref|NP_001146362.1| hypothetical protein precursor [Zea mays]
 gi|219886805|gb|ACL53777.1| unknown [Zea mays]
 gi|414878074|tpg|DAA55205.1| TPA: hypothetical protein ZEAMMB73_415404 [Zea mays]
          Length = 440

 Score = 77.8 bits (190), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 93/393 (23%), Positives = 145/393 (36%), Gaps = 88/393 (22%)

Query: 5   FIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTY- 48
            IG P +    I+DTGS LI+                 +DP +S + + + C+   C   
Sbjct: 76  LIGDPPQRAEAIIDTGSNLIWTQCSRCRPTCFRQNLPYYDPSRSRAARAVGCNDAACALG 135

Query: 49  --FKCV--NEQCVYTMKYADQSVTKGFAAH------ETISVIGKGEGKAIFHGALFGCSN 98
              +C+  N+ C     Y   ++    A        ET+S++             FGC  
Sbjct: 136 SETQCLSDNKTCAVVTGYGAGNIAGTLATENLTFQSETVSLV-------------FGCIV 182

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
                   + +GA +G++GL R  +S  SQLG     RFSYCL  P        S++  G
Sbjct: 183 VTK-LSPGSLNGA-SGIIGLGRGKLSLPSQLG---DTRFSYCLT-PYFEDTIEPSHMVVG 236

Query: 159 TDMGYRRPSTQATK-----FINHPNN-----FYYLSLKDISIDNERMNFPPDTFDITVSG 208
              G    S  +T      F+  P++     FYYL L  I+    ++  P   FD+    
Sbjct: 237 ASAGLINGSASSTPVTTVPFVRSPSDDPFSTFYYLPLTGITAGKVKLAVPSAAFDLRQVA 296

Query: 209 EG---GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
            G   G  IDSG+ LT      Y  L  +         +  L+       LC  L +   
Sbjct: 297 PGMWTGTFIDSGAPLTSLVDVAYQALRAELARQLGAALVQPLAGT-TGFDLCVALKDAER 355

Query: 266 RFPSMAFYFEDANLRIDGENV-FIIDYENHFFLLAVAPHDDLVA---------------- 308
             P +  +F   +    G     ++   N++     AP D   A                
Sbjct: 356 LVPPLVLHFGGGS----GTGTDLVVPPANYW-----APVDSATACMVVFSSVDRKSLPMN 406

Query: 309 ---LIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
              +IG+  Q++   +YDL   +LSF   +CS 
Sbjct: 407 ETTVIGNYMQQNMHVLYDLAGGVLSFQPADCSS 439


>gi|115483166|ref|NP_001065176.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|21717159|gb|AAM76352.1|AC074196_10 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433285|gb|AAP54823.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113639785|dbj|BAF27090.1| Os10g0537800 [Oryza sativa Japonica Group]
 gi|215692411|dbj|BAG87831.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 394

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 84/362 (23%), Positives = 145/362 (40%), Gaps = 57/362 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP +    ++D    L++               +FDP  S++++   C  P C     
Sbjct: 57  IGTPPQPASAVIDLAGELVWTQCKQCGRCFEQGTPLFDPTASNTYRAEPCGTPLCESIPS 116

Query: 50  ---KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C Y     +   T G    +T +V   G  KA      FGC   +   D D
Sbjct: 117 DVRNCSGNVCAYEAS-TNAGDTGGKVGTDTFAV---GTAKASLA---FGCVVAS---DID 166

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM--GYR 164
              G  +G++GL R   S ++Q G      FSYCL  P   G+ ++ +L     +  G +
Sbjct: 167 TMGGP-SGIVGLGRTPWSLVTQTG---VAAFSYCLA-PHDAGKNSALFLGSSAKLAGGGK 221

Query: 165 RPSTQATKFINHPN---NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
             ST       + N   N+Y + L+ +   +  +  PP    +        ++D+ S ++
Sbjct: 222 AASTPFVNISGNGNDLSNYYKVQLEGLKAGDAMIPLPPSGSTV--------LLDTFSPIS 273

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANLR 280
           +     Y  + +          +A      EP  LC+         P + F F   A + 
Sbjct: 274 FLVDGAYQAVKKAVTVAVGAPPMATPV---EPFDLCFPKSGASGAAPDLVFTFRGGAAMT 330

Query: 281 IDGENVFIIDYENHFFLLAVAPHDDL-----VALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
           +   N +++DY+N    LA+     L     ++L+GS QQ +  F++DL+ + LSF   +
Sbjct: 331 VPATN-YLLDYKNGTVCLAMLSSARLNSTTELSLLGSLQQENIHFLFDLDKETLSFEPAD 389

Query: 336 CS 337
           C+
Sbjct: 390 CT 391


>gi|383165471|gb|AFG65613.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 13/138 (9%)

Query: 27  IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
           I+DP +SS++ K++C    C     F+C +   C Y   Y D S+T G  ++ET+++  K
Sbjct: 6   IYDPARSSTYSKVSCKSLLCNALPDFECKSAAGCEYQYTYGDFSITVGILSYETLTLTSK 65

Query: 83  GEGKAIFHGALFGCS--NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
              + +     FGC   N+ +GFD+       AG++GL R  +S ISQL + + K+FSYC
Sbjct: 66  SGAEQLIPNFAFGCGQNNEGNGFDQG------AGIVGLGRGPLSLISQLSASMPKKFSYC 119

Query: 141 LVIPLPNGEYTSSYLKFG 158
           L+  + + +  +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136


>gi|302769978|ref|XP_002968408.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
 gi|300164052|gb|EFJ30662.1| hypothetical protein SELMODRAFT_89951 [Selaginella moellendorffii]
          Length = 492

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 93/366 (25%), Positives = 151/366 (41%), Gaps = 55/366 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCTY 48
           R+ IGTP     LI+DTGS + Y                F P  SSS++ + C   +C+ 
Sbjct: 38  RVKIGTPPHEFSLIVDTGSTVTYVPCSSCTHCGNHQDPRFSPALSSSYKPLECGS-ECST 96

Query: 49  FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGCSNDNHGFDED 106
             C   +  Y  +YA++S + G    + I      +  G+ +    +FGC     G   D
Sbjct: 97  GFCDGSR-KYQRQYAEKSTSSGVLGKDVIGFSNSSDLGGQRL----VFGCETAETG---D 148

Query: 107 ARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
             D    G++GL R  +S I QL   + ++  FS C        +     +  G   G++
Sbjct: 149 LYDQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLC----YGGMDEGGGAMILG---GFQ 201

Query: 165 RPSTQA-TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
            P     T    H + +Y L LK I +    +   P+ FD    G+ G ++DSG+   YF
Sbjct: 202 PPKDMVFTASDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYF 257

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPET-----FNRFPSMAFYF 274
               +    + F S  +  Q+  L + P P +    +CY    T        FPS+ F F
Sbjct: 258 PGAAF----QAFKSAVKE-QVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVF 312

Query: 275 EDA-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFV 332
            D  ++ +  EN +F     +  + L V  + D   L+G    R+    Y+     + F+
Sbjct: 313 GDGQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFL 372

Query: 333 KENCSD 338
           K  C+D
Sbjct: 373 KTKCND 378


>gi|225440720|ref|XP_002275202.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Vitis vinifera]
          Length = 479

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 92/388 (23%), Positives = 155/388 (39%), Gaps = 69/388 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKI 39
           + L  GTP + +  ++DTGS +++A                      IF+P+ SSS + +
Sbjct: 89  IPLSFGTPPQKLSFLVDTGSHVVWAPCTTHYTCTNCSFSDAEPKKVPIFNPKLSSSSKIL 148

Query: 40  NCDHPDCTYFK-----------------CVNEQCVYTMKYADQSVTKGFAAHETISVIGK 82
            C +P C                     C +    Y+++Y   + +  F   E ++  GK
Sbjct: 149 GCRNPKCVNTSSPDVHLGCPPCNGNSKNCSHACPPYSLQYGTGASSGDFLL-ENLNFPGK 207

Query: 83  GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
                  H  L GC+    G   +    ALAG     R   S   Q+G    K+F+YCL 
Sbjct: 208 -----TIHEFLVGCTTSAVG---EVTSAALAG---FGRSMFSLPMQMGV---KKFAYCLN 253

Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNF---YYLSLKDISIDNERMNFPP 199
               +    SS L      G  +  + A  F+ +P +F   YYL +KDI I N+ +  P 
Sbjct: 254 SHDYDDTRNSSKLILDYSDGETKGLSYA-PFLKNPPDFPIYYYLGVKDIKIGNKLLRIPS 312

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY- 258
                   G GG +IDSG    Y    V+ K+  +      +++ +  ++    +  CY 
Sbjct: 313 KYLAPGSDGRGGLMIDSGFAYGYMTGPVFKKVTNELKKRMSKYRRSLEAEAEIGVTPCYN 372

Query: 259 FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYE---NHFFLLAVAPHDDL------VA 308
           F  +   + P + + F   A + + G+N F++  E     F L   A  + L        
Sbjct: 373 FTGQKSIKIPDLIYQFRGGATMVVPGKNYFVLIPEISLACFPLTTDAGTNTLEFTPGPSI 432

Query: 309 LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           ++G+ Q  D    +DL  + L F ++ C
Sbjct: 433 ILGNSQHVDYYVEFDLKNERLGFRQQTC 460


>gi|361068027|gb|AEW08325.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165459|gb|AFG65601.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165460|gb|AFG65602.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165461|gb|AFG65603.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165462|gb|AFG65604.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165463|gb|AFG65605.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165465|gb|AFG65607.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165466|gb|AFG65608.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165467|gb|AFG65609.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165468|gb|AFG65610.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165469|gb|AFG65611.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165472|gb|AFG65614.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165473|gb|AFG65615.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165474|gb|AFG65616.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165475|gb|AFG65617.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165476|gb|AFG65618.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 77.8 bits (190), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 13/138 (9%)

Query: 27  IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
           I+DP +SS++ K++C    C     F+C +   C Y   Y D S+T G  ++ET+++  K
Sbjct: 6   IYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSK 65

Query: 83  GEGKAIFHGALFGCS--NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
              + +     FGC   N+ +GFD+       AG++GL R  +S ISQL + + K+FSYC
Sbjct: 66  SGAEQLIPNFAFGCGQNNEGNGFDQG------AGIVGLGRGPLSLISQLSASMPKKFSYC 119

Query: 141 LVIPLPNGEYTSSYLKFG 158
           L+  + + +  +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136


>gi|224091907|ref|XP_002309394.1| predicted protein [Populus trichocarpa]
 gi|222855370|gb|EEE92917.1| predicted protein [Populus trichocarpa]
          Length = 469

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 96/393 (24%), Positives = 152/393 (38%), Gaps = 77/393 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           + L  GTP +    ++DTGS+L++                        F P++SSS   I
Sbjct: 94  ISLNFGTPPQTTKFVMDTGSSLVWFPCTSRYLCSRCDFPNIEVTGIPTFIPKQSSSSNLI 153

Query: 40  NCDHPDCTYF-------KCVNEQCVYTMKYADQSV-----------TKGFAAHETISVIG 81
            C +  C++        KC  ++C  T +   QS            T G    ET+    
Sbjct: 154 GCKNHKCSWLFGPKVQSKC--QECDPTTQNCTQSCPPYVIQYGLGSTAGLLLSETLDF-- 209

Query: 82  KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
               K    G L GCS  +    E        G+ G  R   S  SQLG    K+FSYCL
Sbjct: 210 --PHKKTIPGFLVGCSLFSIRQPE--------GIAGFGRSPESLPSQLG---LKKFSYCL 256

Query: 142 VIPLPNGEYTSSYLKFGTDMG---YRRPSTQATKFINHPN----NFYYLSLKDISIDNER 194
           V    +    SS L   T  G    + P    T F  +P     ++YY+ L++I I +  
Sbjct: 257 VSHAFDDTPASSDLVLDTGSGSDDTKTPGLSYTPFQKNPTAAFRDYYYVLLRNIVIGDTH 316

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
           +  P         G GG I+DSG+  T+    VY  + ++F      + +A        +
Sbjct: 317 VKVPYKFLVPGSDGNGGTIVDSGTTFTFMEKPVYELVAKEFEKQVAHYTVATEVQNQTGL 376

Query: 255 QLCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---- 308
           + C+ +  E     P   F+F+  A + +   N F         L  V+  D++      
Sbjct: 377 RPCFNISGEKSVSVPEFIFHFKGGAKMALPLANYFSFVDSGVICLTIVS--DNMSGSGIG 434

Query: 309 -----LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                ++G+ QQR+    +DL  +   F ++NC
Sbjct: 435 GGPAIILGNYQQRNFHVEFDLKNERFGFKQQNC 467


>gi|449482385|ref|XP_004156266.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 491

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 97/374 (25%), Positives = 157/374 (41%), Gaps = 57/374 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G P++   + +DTGS +++                    +FD  KSSS + + C  
Sbjct: 87  KVKLGNPAREFNVQIDTGSDILWVTCSPCDGCPDSSGLGIELNLFDTTKSSSARVLPCTD 146

Query: 44  PDCTYFKCVNEQCV-------YTMKYADQSVTKGFAAHETISV-IGKGEGKAIFHGA--L 93
           P C       +QC+       Y+  Y D+S T GF   +++   I  GE       A  +
Sbjct: 147 PICAAVSTTTDQCLTQTDHCSYSFHYRDRSGTSGFYVTDSMHFDILLGESTIANSSATIV 206

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYT 151
           FGCS   +G D      AL G+ G  +   S ISQL S  I  K FS+C    L  GE  
Sbjct: 207 FGCSIYQYG-DLTRATKALDGIFGFGQGEFSVISQLSSRGITPKVFSHC----LKGGENG 261

Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFP-PDTFDITVSGE 209
              L  G  +    PS   +  I + P+  Y L L+ I++  +   FP P  F I+ +GE
Sbjct: 262 GGILVLGEIL---EPSIVYSPLIPSQPH--YTLKLQSIALSGQL--FPNPTMFPISNAGE 314

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
              IIDSG+ L Y   +VY  +     S   +     +S   +  ++   + +    FP 
Sbjct: 315 --TIIDSGTTLAYLVEEVYDWIVSVITSAVSQSATPTISRGSQCFRVSMSVADI---FPV 369

Query: 270 MAFYFED-ANLRIDGENVFIID-----YE-NHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
           + F FE  A++ +  E     D     Y+    + +     +D + ++G    +D   VY
Sbjct: 370 LRFNFEGIASMVVTPEEYLQFDSIVSCYKFASLWCIGFQKAEDGLNILGDLVLKDKIIVY 429

Query: 323 DLNIDLLSFVKENC 336
           DL    + +   +C
Sbjct: 430 DLAQQRIGWANYDC 443


>gi|293334661|ref|NP_001168795.1| uncharacterized protein LOC100382594 precursor [Zea mays]
 gi|223973065|gb|ACN30720.1| unknown [Zea mays]
          Length = 631

 Score = 77.4 bits (189), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 90/364 (24%), Positives = 149/364 (40%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SSS+  + C+  DCT
Sbjct: 90  TRLYIGTPPQEFALIVDSGSTVTYVPCSSCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 148

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                 +QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F +
Sbjct: 149 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP--QHAIFGCENSETGDLFSQ 205

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +    DM 
Sbjct: 206 HA-----DGIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGMLAPPDMI 260

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +       +  +  P  +Y + LK+I +  + +      F+     + G ++DSG+   Y
Sbjct: 261 FSN-----SDPLRSP--YYNIELKEIHVAGKALRVESRIFN----SKHGTVLDSGTTYAY 309

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
                +    E   S     +  +  D P    +C+      + +    FP +   F + 
Sbjct: 310 LPEQAFVAFKEAVTSKVHSLKKIRGPD-PSYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 368

Query: 278 N-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             L +  EN +F     +  + L V  +  D   L+G    R+T   YD + + + F K 
Sbjct: 369 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 428

Query: 335 NCSD 338
           NCS+
Sbjct: 429 NCSE 432


>gi|302756119|ref|XP_002961483.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
 gi|300170142|gb|EFJ36743.1| hypothetical protein SELMODRAFT_76765 [Selaginella moellendorffii]
          Length = 388

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 87/387 (22%), Positives = 158/387 (40%), Gaps = 74/387 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G P K  ++ +DTGS +++                    ++DPR+SS+   ++C 
Sbjct: 4   TQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCS 63

Query: 43  HPDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
            P C   +   E         C Y   Y D S ++G+   + +  +VI            
Sbjct: 64  DPLCVRGRRFAEAQCSQATNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQV 123

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           LFGCS    G D      A+ G++G  ++ +S  +QL +   I + FS+CL       E 
Sbjct: 124 LFGCSIRQTG-DLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-------EG 175

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
                      G   P    T  +  P++ +Y + L+ IS+++ R+  P D  D + + +
Sbjct: 176 EKRGGGILVIGGIAEPGMTYTPLV--PDSVHYNVVLRGISVNSNRL--PIDAEDFSSTND 231

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPETF- 264
            G I+DSG+ L YF S  Y        + F +      S  P  +Q     C+ +     
Sbjct: 232 TGVIMDSGTTLAYFPSGAY--------NVFVQAIREATSATPVRVQGMDTQCFLVSGRLS 283

Query: 265 NRFPSMAFYFEDANLRIDGEN--------------VFIIDYENHFFLLAVAPHD-DLVAL 309
           + FP++   FE   + +  +N              V+ I +++     +  P D   + +
Sbjct: 284 DLFPNVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSS--SAGPKDGSQLTI 341

Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +G    +D   VYDL+   + ++  NC
Sbjct: 342 LGDIVLKDKLVVYDLDNSRIGWMSYNC 368


>gi|356507997|ref|XP_003522749.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Glycine max]
          Length = 440

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/360 (23%), Positives = 149/360 (41%), Gaps = 45/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ +GTP + + ++LDT +   +             F P+ S+S+  ++C  P C   
Sbjct: 101 VVRVKLGTPGQLLFMVLDTSTDEAFVPCSGCTGCSDTTFSPKASTSYGPLDCSVPQCGQV 160

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   YA  S +      +++ +        +     FGC N   G 
Sbjct: 161 RGLSCPATGTGACSFNQSYAGSSFSATLV-QDSLRL-----ATDVIPNYSFGCVNAITGA 214

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A+           R  +S +SQ GS     FSYCL  P     Y S  LK G  +G 
Sbjct: 215 SVPAQGLLGL-----GRGPLSLLSQSGSNYSGIFSYCL--PSFKSYYFSGSLKLG-PVGQ 266

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  +  P+  + YY++   IS+    + FP +      +   G IIDSG+V+T
Sbjct: 267 PK-SIRTTPLLRSPHRPSLYYVNFTGISVGRVLVPFPSEYLGFNPNTGSGTIIDSGTVIT 325

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF-PSMAFYFEDANLR 280
            F   VY  + E+F           +         C+   +T+    P +  +FE  +L+
Sbjct: 326 RFVEPVYNAVREEFRKQVGGTTFTSIGA----FDTCFV--KTYETLAPPITLHFEGLDLK 379

Query: 281 IDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  EN  I         LA+A   D    ++ +I + QQ++ R ++D   + +   +E C
Sbjct: 380 LPLENSLIHSSAGSLACLAMAAAPDNVNSVLNVIANFQQQNLRILFDTVNNKVGIAREVC 439


>gi|383165464|gb|AFG65606.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
 gi|383165470|gb|AFG65612.1| Pinus taeda anonymous locus 2_6422_01 genomic sequence
          Length = 136

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 47/138 (34%), Positives = 78/138 (56%), Gaps = 13/138 (9%)

Query: 27  IFDPRKSSSFQKINCDHPDCTY---FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGK 82
           I+DP +SS++ K++C    C     F+C +   C Y   Y D S+T G  ++ET+++  K
Sbjct: 6   IYDPARSSTYSKVSCKSLLCNALPDFECKSTAGCEYQYTYGDFSITVGILSYETLTLTSK 65

Query: 83  GEGKAIFHGALFGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
              + +     FGC  +N+ +GFD+ A      G++GL R  +S ISQL + + K+FSYC
Sbjct: 66  SGAEQLIPKFAFGCGQNNEGNGFDQGA------GIVGLGRGPLSLISQLSASMPKKFSYC 119

Query: 141 LVIPLPNGEYTSSYLKFG 158
           L+  + + +  +S L FG
Sbjct: 120 LMT-IDDSQSKTSPLMFG 136


>gi|21450872|gb|AAK44106.2|AF370291_1 unknown protein [Arabidopsis thaliana]
          Length = 375

 Score = 77.4 bits (189), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 85/368 (23%), Positives = 151/368 (41%), Gaps = 56/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +VR  +GTP + + ++LDT +  ++               F+   SS++  ++C    CT
Sbjct: 31  VVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCT 90

Query: 48  YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
             +   C +       C +   Y   S        +T+++        +     FGC N 
Sbjct: 91  QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL-----APDVIPNFSFGCINS 145

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G     +     G++GL R  +S +SQ  S+    FSYCL  P     Y S  LK G 
Sbjct: 146 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 198

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            +G  + S + T  + +P   + YY++L  +S+ + ++   P       +   G IIDSG
Sbjct: 199 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSG 256

Query: 218 SVLTYFHSDVYWKLHEKF-----VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           +V+T F   VY  + ++F     VS F    L     C        F  +  N  P +  
Sbjct: 257 TVITRFAQPVYEAIRDEFRKQVNVSSFS--TLGAFDTC--------FSADNENVAPKITL 306

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDL 328
           +    +L++  EN  I         L++A      + ++ +I + QQ++ R ++D+    
Sbjct: 307 HMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSR 366

Query: 329 LSFVKENC 336
           +    E C
Sbjct: 367 IGIAPEPC 374


>gi|242050744|ref|XP_002463116.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
 gi|241926493|gb|EER99637.1| hypothetical protein SORBIDRAFT_02g038150 [Sorghum bicolor]
          Length = 632

 Score = 77.4 bits (189), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 150/364 (41%), Gaps = 48/364 (13%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SSS+  + C+  DCT
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 149

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                 +QC Y  +YA+ S + G    + +S   + E K     A+FGC N   G  F +
Sbjct: 150 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKP--QRAVFGCENSETGDLFSQ 206

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +   +DM 
Sbjct: 207 HAD-----GIMGLGRGQLSIMDQLVEKGVISDSFSLCYGGMDIGGGAMVLGGVPAPSDMV 261

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           +       +  +  P  +Y + LK+I +  + +      F+     + G ++DSG+   Y
Sbjct: 262 FSH-----SDPLRSP--YYNIELKEIHVAGKALRVDSRVFN----SKHGTVLDSGTTYAY 310

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF-----LPETFNRFPSMAFYFEDA 277
                +    +   S     +  +  D P    +C+      + +    FP +   F + 
Sbjct: 311 LPEQAFVAFKDAVTSKVHSLKKIRGPD-PNYKDICFAGAGRNVSKLHEVFPDVDMVFGNG 369

Query: 278 N-LRIDGEN-VFIIDYENHFFLLAVAPH-DDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             L +  EN +F     +  + L V  +  D   L+G    R+T   YD + + + F K 
Sbjct: 370 QKLSLTPENYLFRHSKVDGAYCLGVFQNGKDPTTLLGGIIVRNTLVTYDRHNEKIGFWKT 429

Query: 335 NCSD 338
           NCS+
Sbjct: 430 NCSE 433


>gi|226492633|ref|NP_001149953.1| LOC100283580 precursor [Zea mays]
 gi|195635701|gb|ACG37319.1| pepsin A [Zea mays]
          Length = 491

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 95/382 (24%), Positives = 159/382 (41%), Gaps = 71/382 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IG+P KG  + +DTGS +++                     +DP  S +   + C+
Sbjct: 86  TRIEIGSPPKGYYVQVDTGSDILWVNCIRCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 143

Query: 43  HPDCTYFKC---------VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              C               +  C + + Y D S T GF   + +    V G G+      
Sbjct: 144 QEFCVANSAGGVPPTCPSTSSPCQFRITYGDGSTTTGFYVTDFVQYNQVSGNGQTTTSNA 203

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNG 148
              FGC     G D  + + AL G+LG  +   S +SQL +   ++K F++CL      G
Sbjct: 204 SITFGCGA-QLGGDLGSSNQALDGILGFGQSDSSMLSQLAAARRVRKIFAHCLDTVRGGG 262

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVS 207
            +    +         +P  + T  +  PN  +Y ++L+ IS+    +  P  TFD   S
Sbjct: 263 IFAIGNVV--------QPKVKTTPLV--PNVTHYNVNLQGISVGGATLQLPTSTFD---S 309

Query: 208 GEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFN 265
           G+  G IIDSG+ L Y   +VY  L     + F+++Q   L +  + +  C+ F     +
Sbjct: 310 GDSKGTIIDSGTTLAYLPREVYRTL---LAAVFDKYQDLPLHNYQDFV--CFQFSGSIDD 364

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLL-----AVAPHDDL-VALIGSQQQ 315
            FP + F F+  +L +   NV+  DY     N  + +      V   D   + L+G    
Sbjct: 365 GFPVITFSFK-GDLTL---NVYPDDYLFQNRNDLYCMGFLDGGVQTKDGKDMLLLGDLVL 420

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
            +   VYDL  +++ +   NCS
Sbjct: 421 SNKLVVYDLEKEVIGWTDYNCS 442


>gi|302776054|ref|XP_002971323.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
 gi|300161305|gb|EFJ27921.1| hypothetical protein SELMODRAFT_63598 [Selaginella moellendorffii]
          Length = 395

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 86/383 (22%), Positives = 154/383 (40%), Gaps = 66/383 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ +G P K  ++ +DTGS +++                    ++DPR+SS+   ++C 
Sbjct: 31  TQVGLGNPVKHYIVQVDTGSDVLWVNCRPCSGCPRKSALNIPLTMYDPRESSTTSLVSCS 90

Query: 43  HPDCTYFKCVNE--------QCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
            P C   +   E         C Y   Y D S ++G+   + +  +VI            
Sbjct: 91  DPLCVRGRRFAEAQCSQTTNNCEYIFSYGDGSTSEGYYVRDAMQYNVISSNGLANTTSQV 150

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           LFGCS    G D      A+ G++G  ++ +S  +QL +   I + FS+CL       E 
Sbjct: 151 LFGCSIRQTG-DLSTSQQAVDGIIGFGQLELSVPNQLAAQQNIPRVFSHCL-------EG 202

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGE 209
                      G   P    T  +  P++ +Y + L+ IS+++ R+  P D  D + + +
Sbjct: 203 EKRGGGILVIGGIAEPGMTYTPLV--PDSVHYNVVLRGISVNSNRL--PIDAEDFSSTND 258

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
            G I+DSG+ L YF S  Y      FV        A           C+ +     + FP
Sbjct: 259 TGVIMDSGTTLAYFPSGAY----NVFVQAIREATSATPVRVQGMDTQCFLVSGRLSDLFP 314

Query: 269 SMAFYFEDANLRIDGEN--------------VFIIDYENHFFLLAVAPHD-DLVALIGSQ 313
           ++   FE   + +  +N              V+ I +++     +  P D   + ++G  
Sbjct: 315 NVTLNFEGGAMELQPDNYLMWGGTAPTGTTDVWCIGWQSSSS--SAGPKDGSQLTILGDI 372

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
             +D   VYDL+   + ++  NC
Sbjct: 373 VLKDKLVVYDLDNSRIGWMSYNC 395


>gi|224102847|ref|XP_002312826.1| predicted protein [Populus trichocarpa]
 gi|222849234|gb|EEE86781.1| predicted protein [Populus trichocarpa]
          Length = 445

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 96/397 (24%), Positives = 157/397 (39%), Gaps = 82/397 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-----------------------AIFDPRKSSSFQK 38
           V L  GTP + +  I+DTGS +++                         F P++SSS + 
Sbjct: 69  VSLSFGTPPQTLSFIMDTGSDIVWFPCTSHYLCKHCSFSSSSPSSRIQPFIPKESSSSKL 128

Query: 39  INCDHPDCTYF--------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE 84
           + C +P C++                C+N+ C   M +     T G A  ET+ +     
Sbjct: 129 LGCKNPKCSWIHHSNINCDQDCSIKSCLNQTCPPYMIFYGSGTTGGVALSETLHL--HSL 186

Query: 85  GKAIFHGALFGCSN-DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI 143
            K  F   L GCS   +H           AG+ G  R   S  SQLG     +FSYCL+ 
Sbjct: 187 SKPNF---LVGCSVFSSH---------QPAGIAGFGRGLSSLPSQLG---LGKFSYCLLS 231

Query: 144 -PLPNGEYTSSYLKFGTDMGYRRPSTQA---TKFINHP--------NNFYYLSLKDISID 191
               +    SS L    +       T A   T F+ +P        + +YYL L+ I++ 
Sbjct: 232 HRFDDDTKKSSSLVLDMEQLDSDKKTNALVYTPFVKNPKVDNKSSFSVYYYLGLRRITVG 291

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
              +  P         G GG IIDSG+  T+   + +  L ++F+   + ++  +  +  
Sbjct: 292 GHHVKVPYKYLSPGEDGNGGVIIDSGTTFTFMAREAFEPLSDEFIRQIKDYRRVKEIEDA 351

Query: 252 EPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA- 308
             ++ C+ + +     FP +  YF+  A++ +  EN F         L  V    D VA 
Sbjct: 352 IGLRPCFNVSDAKTVSFPELRLYFKGGADVALPVENYFAFVGGEVACLTVVT---DGVAG 408

Query: 309 ---------LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
                    ++G+ Q ++    YDL  + L F +E C
Sbjct: 409 PERVGGPGMILGNFQMQNFYVEYDLRNERLGFKQEKC 445


>gi|147819672|emb|CAN76394.1| hypothetical protein VITISV_020864 [Vitis vinifera]
          Length = 507

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 79/315 (25%), Positives = 139/315 (44%), Gaps = 55/315 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                    ++D + S++   + CD
Sbjct: 80  AKIGIGTPSKDYYVQVDTGSDILWVNCAGCDRCPTKSDLGVDLTLYDMKASTTSDAVGCD 139

Query: 43  HPDCTYFK-----CV-NEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
              C+ +      C    QC+Y++ Y D S T G+   + +    + G  +        +
Sbjct: 140 DNFCSLYDGPLPGCKPGLQCLYSVLYGDGSSTTGYFVQDFVQYNRISGNFQTTPTNGTVV 199

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCL-------VIP 144
           FGC N   G +  +   AL G+LG  +   S +SQL S   +KK FS+CL       +  
Sbjct: 200 FGCGNKQSG-ELGSSSEALDGILGFGQANSSMLSQLASSGKVKKVFSHCLDNVDGGGIFA 258

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDI 204
           +  GE     ++F               F++  +  Y + +K+I +  + ++ P D F+ 
Sbjct: 259 I--GEVVEPKVRF----LLMNSVMIVVLFLSRAH--YNVVMKEIEVGGDPLDVPSDAFE- 309

Query: 205 TVSGE-GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPE 262
             SG+  G IIDSG+ L YF  +VY  L EK +S     +L  +    E    C+ +   
Sbjct: 310 --SGDRKGTIIDSGTTLAYFPQEVYVPLIEKILSQQPDLRLHTV----EQAFTCFDYTGN 363

Query: 263 TFNRFPSMAFYFEDA 277
             + FP++  +F+ +
Sbjct: 364 VDDGFPTVTLHFDKS 378


>gi|224029721|gb|ACN33936.1| unknown [Zea mays]
 gi|413946782|gb|AFW79431.1| pepsin A [Zea mays]
          Length = 534

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)

Query: 28  FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
           + P KSSS+++I C   +C    Y  C +    E C Y  K  D +VT G    E  +V 
Sbjct: 187 YRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVT 246

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                 A   G + GCS    G   DA D    GVL L    +SF         +RFS+C
Sbjct: 247 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGDMSFAVHAAKRFGQRFSFC 302

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
           L +   +    SSYL FG +     P T  T  + + +    Y   +  + +  ER++ P
Sbjct: 303 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAQVTGVLVGGERLDIP 361

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
            + +D      GG I+D+ + +T    + Y
Sbjct: 362 DEVWDAERFVGGGVILDTSTSVTSLVPEAY 391


>gi|18391062|ref|NP_563851.1| aspartyl protease-like protein [Arabidopsis thaliana]
 gi|2160166|gb|AAB60729.1| F21M12.13 gene product [Arabidopsis thaliana]
 gi|21593996|gb|AAM65914.1| nucleoid DNA-binding-like protein [Arabidopsis thaliana]
 gi|26983826|gb|AAN86165.1| unknown protein [Arabidopsis thaliana]
 gi|332190367|gb|AEE28488.1| aspartyl protease-like protein [Arabidopsis thaliana]
          Length = 449

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 85/368 (23%), Positives = 151/368 (41%), Gaps = 56/368 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +VR  +GTP + + ++LDT +  ++               F+   SS++  ++C    CT
Sbjct: 105 VVRAKLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTAQCT 164

Query: 48  YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
             +   C +       C +   Y   S        +T+++        +     FGC N 
Sbjct: 165 QARGLTCPSSSPQPSVCSFNQSYGGDSSFSASLVQDTLTL-----APDVIPNFSFGCINS 219

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G     +     G++GL R  +S +SQ  S+    FSYCL  P     Y S  LK G 
Sbjct: 220 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 272

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            +G  + S + T  + +P   + YY++L  +S+ + ++   P       +   G IIDSG
Sbjct: 273 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDANSGAGTIIDSG 330

Query: 218 SVLTYFHSDVYWKLHEKF-----VSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAF 272
           +V+T F   VY  + ++F     VS F    L     C        F  +  N  P +  
Sbjct: 331 TVITRFAQPVYEAIRDEFRKQVNVSSFS--TLGAFDTC--------FSADNENVAPKITL 380

Query: 273 YFEDANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDL 328
           +    +L++  EN  I         L++A      + ++ +I + QQ++ R ++D+    
Sbjct: 381 HMTSLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSR 440

Query: 329 LSFVKENC 336
           +    E C
Sbjct: 441 IGIAPEPC 448


>gi|226491620|ref|NP_001149154.1| pepsin A precursor [Zea mays]
 gi|195625132|gb|ACG34396.1| pepsin A [Zea mays]
          Length = 537

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 60/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)

Query: 28  FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
           + P KSSS+++I C   +C    Y  C +    E C Y  K  D +VT G    E  +V 
Sbjct: 190 YRPAKSSSWRRIRCSQKECAVLPYNTCQSPSKAESCSYFQKTQDGTVTIGIYGKEKATVT 249

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                 A   G + GCS    G   DA D    GVL L    +SF         +RFS+C
Sbjct: 250 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGDMSFAVHAAKRFGQRFSFC 305

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
           L +   +    SSYL FG +     P T  T  + + +    Y   +  + +  ER++ P
Sbjct: 306 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDILYNVDVKPAYGAKVTGVLVGGERLDIP 364

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
            + +D      GG I+D+ + +T    + Y
Sbjct: 365 DEVWDAERFVGGGVILDTSTSVTSLVPEAY 394


>gi|88174565|gb|ABD39357.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFTFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
            G +E    G + G+LG+    +S + Q  S     FSYCL + +    +   T+ Y   
Sbjct: 118 FGANE---FGNVDGLLGMGAGQMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173

Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G  +   R   + TK +    N   +++ L  IS+D ER+   P  F        G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
           SGS L+Y        L ++      R   A+     E  + CY +        P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284

Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 285 DDGARFDLGRHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323


>gi|255571584|ref|XP_002526738.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223533927|gb|EEF35652.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 457

 Score = 77.0 bits (188), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/370 (23%), Positives = 152/370 (41%), Gaps = 59/370 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           +++  IG+P+     I D+GS+L++                 +F+P KS ++ K  C+  
Sbjct: 102 VMKFSIGSPAVDTYAIPDSGSSLVWLQCGTPYCRNCYRQKIPLFNPSKSVTYMKRLCNTA 161

Query: 45  DC------TYFKCV--NEQCVYTMKYADQSVTKG------FAAHETISVIGKGEGKAIFH 90
           +C       Y++C   N+ C Y   Y D S T+G      F   E IS  G    + IF 
Sbjct: 162 ECRVALGDEYWRCKKPNQICKYHEDYLDDSYTEGVISTDIFTFPEHISGFGNYTLRIIF- 220

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY 150
               GC  +N     D +     G++GL+    S + Q+      +FSYC+ I       
Sbjct: 221 ----GCGYNN----SDPQHFYPPGLVGLTNNKASLVGQMDV---DQFSYCVSIDTEQNLK 269

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDN-ERMNFPPDTFDITVSGE 209
            S  ++FG        STQ     N    + + ++  I ++  E   +P   F  T  G+
Sbjct: 270 GSMEIRFGLAASISGHSTQLVP--NSDGWYIFKNVDGIYVNEFEVEGYPAWVFKYTEGGQ 327

Query: 210 GGCIIDSGSVLTYFHSDVY---WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN- 265
           GG  +D+G+  T  H+ V     KL E+ ++       +         +LCYF  +    
Sbjct: 328 GGLTMDTGTTYTELHNSVMDPLIKLLEEHITIVPEKDYSN-----SGFELCYFSDDFLGA 382

Query: 266 RFPSMAFYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVY 322
             P +   F   +D     +  N +  +  +   L     +   +++IG  Q RD +  Y
Sbjct: 383 TLPDIELRFTDNKDTYFSFNTRNAWTPNGRSQMCLAMFRTNG--MSIIGMHQLRDIKIGY 440

Query: 323 DLNIDLLSFV 332
           DL+ +++SF 
Sbjct: 441 DLHHNIVSFT 450


>gi|15241713|ref|NP_195839.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|75181297|sp|Q9LZL3.1|PCS1L_ARATH RecName: Full=Aspartic proteinase PCS1; AltName: Full=Aspartic
           protease 38; Short=AtASP38; AltName: Full=Protein EMBRYO
           DEFECTIVE 24; AltName: Full=Protein PROMOTION OF CELL
           SURVIVAL 1; Flags: Precursor
 gi|7340693|emb|CAB82992.1| putative protein [Arabidopsis thaliana]
 gi|50897174|gb|AAT85726.1| At5g02190 [Arabidopsis thaliana]
 gi|53828617|gb|AAU94418.1| At5g02190 [Arabidopsis thaliana]
 gi|110742159|dbj|BAE99007.1| hypothetical protein [Arabidopsis thaliana]
 gi|332003059|gb|AED90442.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 453

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 88/374 (23%), Positives = 152/374 (40%), Gaps = 59/374 (15%)

Query: 9   PSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTY-------- 48
           P + + +++DTGS L +              FDP +SSS+  I C  P C          
Sbjct: 82  PPQNISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIP 141

Query: 49  FKCVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C +++ C  T+ YAD S ++G  A E I   G     +     +FGC     G D + 
Sbjct: 142 ASCDSDKLCHATLSYADASSSEGNLAAE-IFHFGNSTNDS---NLIFGCMGSVSGSDPE- 196

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV----IP--LPNGEYTSSYLKFGTDM 161
            D    G+LG++R ++SFISQ+G     +FSYC+      P  L  G+   ++L   T +
Sbjct: 197 EDTKTTGLLGMNRGSLSFISQMG---FPKFSYCISGTDDFPGFLLLGDSNFTWL---TPL 250

Query: 162 GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            Y      +T         Y + L  I ++ + +  P        +G G  ++DSG+  T
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-----PIQLCYFLPET------FNRFPSM 270
           +    VY  L   F++      +  + + P+      + LCY +          +R P++
Sbjct: 311 FLLGPVYTALRSHFLNRTN--GILTVYEDPDFVFQGTMDLCYRISPVRIRSGILHRLPTV 368

Query: 271 AFYFEDANLRIDGENVFI----IDYENHFFLLAVAPHDDLVAL----IGSQQQRDTRFVY 322
           +  FE A + + G+ +      +   N         + DL+ +    IG   Q++    +
Sbjct: 369 SLVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTFGNSDLMGMEAYVIGHHHQQNMWIEF 428

Query: 323 DLNIDLLSFVKENC 336
           DL    +      C
Sbjct: 429 DLQRSRIGLAPVEC 442


>gi|357460823|ref|XP_003600693.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355489741|gb|AES70944.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 431

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 94/379 (24%), Positives = 152/379 (40%), Gaps = 69/379 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------AIFDPRKSSSFQKINCDHPDCTY-- 48
           ++ L IGTP +   ++LDTGS L +          A FDP  SS+F  + C HP C    
Sbjct: 76  IINLPIGTPPQTQPMVLDTGSQLSWIQCHKKQPPTASFDPSLSSTFSILPCTHPLCKPRI 135

Query: 49  ------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
                   C  N  C Y+  YAD +  +G    E  +         +    + GC+ ++ 
Sbjct: 136 PDFTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSRSVSTPPL----ILGCATES- 190

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDM 161
               D R     G+LG++   +SF  Q  S I K FSYC+    P  +    +   G+  
Sbjct: 191 ---TDPR-----GILGMNLGRLSFAKQ--SKITK-FSYCV----PPRQTRPGFTPTGSFY 235

Query: 162 GYRRPSTQATKFI-------NHPNNF----YYLSLKDISIDNERMNFPPDTFDITVSGEG 210
               PS++  K++           NF    Y + +  I I  +++N  P  F     G G
Sbjct: 236 LGNNPSSKGFKYVGMMTSSRQRMPNFDPLAYTIPMVGIRIAGKKLNISPAVFRADAGGSG 295

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFV-SYFERFQLAQLS--------DCPEPIQLCYFLP 261
             +IDSGS  TY  S+ Y K+  + V +   R +   +         D  + +++   + 
Sbjct: 296 QTMIDSGSEFTYLVSEAYDKVRAQVVRAVGPRLKKGYVYGGVADMCFDSVKAVEIGRLIG 355

Query: 262 ETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQQQRDT 318
           E       M F FE     +  +   + D       + +   D L A   +IG+  Q++ 
Sbjct: 356 E-------MVFEFERGVEVVIPKERVLADVGGGVHCVGIGSSDKLGAASNIIGNFHQQNL 408

Query: 319 RFVYDLNIDLLSFVKENCS 337
              +DL    + F K +CS
Sbjct: 409 WVEFDLVRRRVGFGKADCS 427


>gi|297819968|ref|XP_002877867.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297323705|gb|EFH54126.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 469

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 93/394 (23%), Positives = 147/394 (37%), Gaps = 75/394 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           V L  GTPS+ +  + DTGS+L++                        F P+ SSS + I
Sbjct: 92  VSLSFGTPSQTIPFVFDTGSSLVWFPCTSRYLCSDCNFSGLDPTQIPRFIPKNSSSSRVI 151

Query: 40  NCDHPDCTYFKCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
            C +P C +    N QC                Y ++Y   S T G    E +       
Sbjct: 152 GCQNPKCQFLFGANVQCRGCDPNTRNCTVPCPPYILQYGLGS-TAGILISEKLDFP---- 206

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                   + GCS               AG+ G  R   S  SQ+     K FS+CLV  
Sbjct: 207 -DLTVPDFVVGCS--------VISTRTPAGIAGFGRGPESLPSQMK---LKSFSHCLVSR 254

Query: 145 LPNGEYTSSYLKFGTDMGYRR----PSTQATKFINHPN-------NFYYLSLKDISIDNE 193
             +    ++ L   T  G++     P    T F  +PN        +YYL+L+ I + ++
Sbjct: 255 RFDDTNVTTDLGLDTGSGHKSGSKTPGLSYTPFRKNPNVSNTAFLEYYYLNLRRIYVGSK 314

Query: 194 RMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP 253
            +  P        +G GG I+DSGS  T+    V+  + E+F +    +   +  +    
Sbjct: 315 HVKIPYKFLAPGTNGNGGSIVDSGSTFTFMERPVFELVAEEFATQMSNYTREKDLEKVSG 374

Query: 254 IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDL----- 306
           I  C+ +    +   P + F F+  A + +   N F          L V   + +     
Sbjct: 375 IAPCFNISGKGDVTVPELIFEFKGGAKMELPLSNYFSFVGNADTVCLTVVSDNTVNPGGG 434

Query: 307 ---VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
                ++GS QQ++    YDL  D   F K+ CS
Sbjct: 435 TGPAIILGSFQQQNYLVEYDLENDRFGFAKKKCS 468


>gi|356568507|ref|XP_003552452.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 481

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 81/341 (23%), Positives = 143/341 (41%), Gaps = 38/341 (11%)

Query: 19  TGSALIYAIFDPRKSSSFQKINCDHPDCT------YFKCVNE-QCVYTMKYADQSVTKGF 71
           +G  +   ++DP  S + + + CD   CT         C     C Y++ Y D S T G 
Sbjct: 112 SGLGMDLTLYDPNLSKTSKAVPCDDEFCTSTYDGQISGCTKGMSCPYSITYGDGSTTSGS 171

Query: 72  AAHETIS---VIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
              + ++   V+G           +FGC +   G      D +L G++G  +   S +SQ
Sbjct: 172 YIKDDLTFDRVVGDLRTVPDNTSVIFGCGSKQSGTLSSTTDTSLDGIIGFGQANSSVLSQ 231

Query: 129 LGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK 186
           L +   +K+ FS+CL      G +    +         +P  + T  +    + Y + LK
Sbjct: 232 LAAAGKVKRIFSHCLDSISGGGIFAIGEV--------VQPKVKTTPLLQGMAH-YNVVLK 282

Query: 187 DISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQ 246
           DI +  + +  P D  D + SG  G IIDSG+ L Y    +Y +L EK ++     +L  
Sbjct: 283 DIEVAGDPIQLPSDILD-SSSGR-GTIIDSGTTLAYLPVSIYDQLLEKILAQRSGMKLYL 340

Query: 247 LSDCPEPIQLCYFLPETFNR-FPSMAFYFEDA---------NLRIDGENVFIIDYENHFF 296
           + D  +     Y   E+ +  FP++ F FE+           L +  E+++ + ++    
Sbjct: 341 VED--QFTCFHYSDEESVDDLFPTVKFTFEEGLTLTTYPRDYLFLFKEDMWCVGWQKS-- 396

Query: 297 LLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +A       + L+G     +   VYDL+   + +   NCS
Sbjct: 397 -MAQTKDGKELILLGDLVLANKLVVYDLDNMAIGWADYNCS 436


>gi|88174567|gb|ABD39358.1| chloroplast nucleoid DNA-binding protein [Oryza glumipatula]
          Length = 323

 Score = 76.6 bits (187), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 82/340 (24%), Positives = 144/340 (42%), Gaps = 47/340 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTP+K  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFTFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
            G +E    G + G+LG+    +S + Q  S     FSYCL + +    +   T+ Y   
Sbjct: 118 FGANEF---GNVDGLLGMGAGQMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173

Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G  +   R   + TK +    N   +++ L  IS+D ER+   P  F        G + D
Sbjct: 174 GGKIAATRTDVRYTKMVARRKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 228

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
           SGS L+Y        L ++      R   A+     E  + CY +        P+++ +F
Sbjct: 229 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 284

Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 285 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 323


>gi|356498711|ref|XP_003518193.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 466

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 94/398 (23%), Positives = 159/398 (39%), Gaps = 80/398 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
           + L  GTPS+    +LDTGS L++                    F P+ SSS + + C +
Sbjct: 88  IDLEFGTPSQTFPFVLDTGSTLVWLPCSSHYLCSKCNSFSNTPKFIPKNSSSSKFVGCTN 147

Query: 44  PDCTY-----------------FKCVNEQC-VYTMKYADQSVTKGFAAHETISVIGKGEG 85
           P C +                 F   ++ C  YT++Y   S T GF   E ++   K   
Sbjct: 148 PKCAWVFGPDVKSHCCRQDKAAFNNCSQTCPAYTVQYGLGS-TAGFLLSENLNFPTKK-- 204

Query: 86  KAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL 145
              +   L GCS               AG+ G  R   S  SQ+      RFSYCL+   
Sbjct: 205 ---YSDFLLGCS--------VVSVYQPAGIAGFGRGEESLPSQMN---LTRFSYCLLSHQ 250

Query: 146 PNGEYTSS---YLKFGTDMGYRRPSTQATKFINHPNN--------FYYLSLKDISIDNER 194
            +   T +    L+  +    +      T F+ +P          +YY++LK I +  +R
Sbjct: 251 FDDSATITSNLVLETASSRDGKTNGVSYTPFLKNPTTKKNPAFGAYYYITLKRIVVGEKR 310

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
           +  P    +  V G+GG I+DSGS  T+    ++  + ++F      +  A+ ++    +
Sbjct: 311 VRVPRRLLEPNVDGDGGFIVDSGSTFTFMERPIFDLVAQEFAKQVS-YTRAREAEKQFGL 369

Query: 255 QLCYFL---PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
             C+ L    ET + FP + F F   A +R+   N F +  +     L +   DD+    
Sbjct: 370 SPCFVLAGGAETAS-FPELRFEFRGGAKMRLPVANYFSLVGKGDVACLTIV-SDDVAGSG 427

Query: 309 -------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
                  ++G+ QQ++    YDL  +   F  ++C  +
Sbjct: 428 GTVGPAVILGNYQQQNFYVEYDLENERFGFRSQSCQTN 465


>gi|115466068|ref|NP_001056633.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|55296446|dbj|BAD68569.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|55296924|dbj|BAD68375.1| putative nucleoid DNA-binding protein cnd41 [Oryza sativa Japonica
           Group]
 gi|113594673|dbj|BAF18547.1| Os06g0119600 [Oryza sativa Japonica Group]
 gi|215694767|dbj|BAG89958.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215737752|dbj|BAG96882.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 495

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/329 (23%), Positives = 131/329 (39%), Gaps = 50/329 (15%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYFK------CVNEQCVYTMKYADQSVTKGFAAHETISVI 80
           +FDP  S+++  + C    C            N QC + + Y D S   G  + + ++ +
Sbjct: 198 LFDPAMSTTYAAVPCTSAACAQLGPYRRGCSANAQCQFGINYGDGSTATGTYSFDDLT-L 256

Query: 81  GKGEGKAIFHGALFGCSNDNHG--FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFS 138
           G  +   +  G  FGC++ + G  FD D     +AG L L   + S + Q  +   + FS
Sbjct: 257 GPYD---VIRGFRFGCAHADRGSAFDYD-----VAGSLALGGGSQSLVQQTATRYGRVFS 308

Query: 139 YCLVIPLPNGEYTSSYLKFGT--DMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNER 194
           YC    LP    +  +L  G   +     PS  +T  ++      FY + L+ I +    
Sbjct: 309 YC----LPPTASSLGFLVLGVPPERAQLIPSFVSTPLLSSSMAPTFYRVLLRAIIVAGRP 364

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI 254
           +  PP  F  +       +IDS ++++      Y  L   F S    ++ A       P+
Sbjct: 365 LAVPPAVFSAS------SVIDSSTIISRLPPTAYQALRAAFRSAMTMYRAA------PPV 412

Query: 255 QL---CY-FLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAP--HDDLV 307
            +   CY F        PS+A  F+  A + +D   + +         LA AP   D + 
Sbjct: 413 SILDTCYDFTGVRSITLPSIALVFDGGATVNLDAAGILLGS------CLAFAPTASDRMP 466

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             IG+ QQ+    VYD+    + F    C
Sbjct: 467 GFIGNVQQKTLEVVYDVPAKAMRFRTAAC 495


>gi|225217039|gb|ACN85323.1| aspartic proteinase nepenthesin-1 precursor [Oryza brachyantha]
          Length = 287

 Score = 76.6 bits (187), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 75/305 (24%), Positives = 116/305 (38%), Gaps = 40/305 (13%)

Query: 45  DCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
           D T   C    C+Y ++Y D S T GF A +T+++           G  FGC   N G  
Sbjct: 10  DXTTRGCSGGHCLYGVQYGDGSYTIGFFAMDTLTL----SSHDAIKGFRFGCGERNEGLF 65

Query: 105 EDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYR 164
            +A     AG+LGL R   S   Q        F++C     P     + YL+FG      
Sbjct: 66  GEA-----AGLLGLGRGKTSLPVQTYDKYGGVFAHC----FPARSSGTGYLEFGPG---S 113

Query: 165 RPSTQAT-----KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
            P+  A        I+    FYY+ +  I +  + +  P   F        G I+DSG+V
Sbjct: 114 SPAVSAKLSTTPMLIDTGPTFYYVGMTGIRVGGKLLPIPQSVFAAA-----GTIVDSGTV 168

Query: 220 LTYFHSDVYWKLHEKFVSY-----FERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFY 273
           +T      Y  L   F +      ++R     L D       CY L        P+++  
Sbjct: 169 ITRLPPAAYSSLRSAFAASMAARGYKRAPALSLLD------TCYDLTGASEVAIPTVSLL 222

Query: 274 FEDA-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
           F+   +L +D    ++           A     D VA++G+ Q +    VYD+   ++ F
Sbjct: 223 FQGGVSLDVDASGIIYAASVSQACLGFAGNEAADDVAIVGNTQLKTFGVVYDIASKVVGF 282

Query: 332 VKENC 336
               C
Sbjct: 283 CPGAC 287


>gi|297723027|ref|NP_001173877.1| Os04g0336942 [Oryza sativa Japonica Group]
 gi|255675342|dbj|BAH92605.1| Os04g0336942 [Oryza sativa Japonica Group]
          Length = 388

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 128/312 (41%), Gaps = 45/312 (14%)

Query: 6   IGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDHPDC 46
           IGTP+    + LDTGS   +                     +DPR S S +++ CD   C
Sbjct: 89  IGTPAVKYYVQLDTGSKAFWVNGISCKQCPHESDILRKLTFYDPRSSVSSKEVKCDDTIC 148

Query: 47  TYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGALFGCSNDNH 101
           T     N   +C Y   YAD  +T G    + +    + G G+ +       FGC     
Sbjct: 149 TSRPPCNMTLRCPYITGYADGGLTMGILFTDLLHYHQLYGNGQTQPTSTSVTFGCGLQQS 208

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G   ++   A+ G++G      + +SQL +    KK FS+CL      G +    +    
Sbjct: 209 GSLNNSAV-AIDGIIGFGNSNQTALSQLAAAGKTKKIFSHCLDSTNGGGIFAIGEV---- 263

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                 P  + T  + +   ++ ++LK I++    +  P + F  T +   G  IDSGS 
Sbjct: 264 ----VEPKVKTTPIVKNNEVYHLVNLKSINVAGTTLQLPANIFGTTKT--KGTFIDSGST 317

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANL 279
           L Y    +Y +L     +      +  + +     Q  +FL    ++FP + F+FE+ +L
Sbjct: 318 LVYLPEIIYSELILAVFAKHPDITMGAMYN----FQCFHFLGSVDDKFPKITFHFEN-DL 372

Query: 280 RIDGENVFIIDY 291
            +D   V+  DY
Sbjct: 373 TLD---VYPYDY 381


>gi|302783200|ref|XP_002973373.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
 gi|300159126|gb|EFJ25747.1| hypothetical protein SELMODRAFT_98841 [Selaginella moellendorffii]
          Length = 389

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 80/337 (23%), Positives = 139/337 (41%), Gaps = 43/337 (12%)

Query: 26  AIFDPRKSSSFQKINCDHPDCTYFKCV------NEQCVYTMKYADQSVTKGFAAHETISV 79
           ++F P  S+S  K+ C  P C+ F  V      +  C Y   Y     + G    + I+ 
Sbjct: 39  SLFQPGLSTSHTKLPCGSPSCSAFSAVSTSCGPSSSCSYNTSYGTNFSSAGDLVSD-IAT 97

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI-IKKRFS 138
           +     + +      GC  D+ G  E       +G +G  +  +SF+ QL ++  + +F 
Sbjct: 98  MDSVRNRKVAANLSLGCGRDSGGLLELLDT---SGFVGFDKGNVSFMGQLSALGYRSKFI 154

Query: 139 YCLVI-----PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISID 191
           YCL        L  G Y        + M Y       T  I +P     Y+++L  ISID
Sbjct: 155 YCLPSDTFRGKLVIGNYKLRNASISSSMAY-------TPMITNPQAAELYFINLSTISID 207

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
             +   P   F    +G GG +ID+ + L+Y  SD Y +L +   +Y     L ++S   
Sbjct: 208 KNKFQVPIQGF--LSNGTGGTVIDTTTFLSYLTSDFYTQLVQAIKNYTTN--LVEVSSSV 263

Query: 252 E---PIQLCYFLPETFNRFP---SMAFYFEDANLRIDGENVFIIDYE---NHFFLLAVAP 302
                ++LCY +    + FP   ++ ++F      ++    F++D     N+   +A+  
Sbjct: 264 ADALGVELCYNISAN-SDFPPPATLTYHFL-GGAGVEVSTWFLLDDSDSVNNTICMAIGR 321

Query: 303 HDDL---VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            + +   + +IG+ QQ D    YDL      F  + C
Sbjct: 322 SESVGPNLNVIGTYQQLDLTVEYDLEQMRYGFGAQGC 358


>gi|357118398|ref|XP_003560942.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 478

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 100/384 (26%), Positives = 154/384 (40%), Gaps = 57/384 (14%)

Query: 1   MVRLFIGTP-SKGVLLILDTGSALIYA-------------IFDPRKSSSFQKINCDHPDC 46
           ++ L IGTP  + V L LDTGS L++               FD   S +   + C  P C
Sbjct: 101 LIHLSIGTPRPQRVALTLDTGSDLVWTQCACHVCFAQPFPTFDALASQTTLAVPCSDPIC 160

Query: 47  TYFK-----CV--NEQCVYTMKYADQSVTKGFAAHETISVIG-KGEGKAIFHGAL----- 93
           T  K     C   +  C Y   YAD+S+T G    +T +    +G   +  H  +     
Sbjct: 161 TSGKYPLSGCTFNDNTCFYLYDYADKSITSGRIVEDTFTFRSPQGNNGSKAHAGVAVPNV 220

Query: 94  -FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
            FGC   N G  +       +G+ G SR  +S  SQL      RFS+C    + +   + 
Sbjct: 221 RFGCGQYNKGIFKSNE----SGIAGFSRGPMSLPSQLKV---ARFSHCFTA-IADARTSP 272

Query: 153 SYL--KFGTDM--GYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTF--DITV 206
            +L    G D    +     Q+T F N   + YYL+LK I++   R+      F    T 
Sbjct: 273 VFLGGAPGPDNLGAHATGPVQSTPFANSNGSLYYLTLKGITVGKTRLPLNALAFAGKGTG 332

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET--- 263
           SG GG IIDSG+ +      +Y  L   FV+   +  +A  S       LC+    +   
Sbjct: 333 SGSGGTIIDSGTGIRTLPGPMYRSLRAAFVARV-KLPVANESAADAESTLCFEAARSASL 391

Query: 264 -----FNRFPSMAFYFEDANLRIDGENVFIIDYENH------FFLLAVAPHDDLVALIGS 312
                    P +  +   A+  +  E+  +   E+         L+  +  D  + +IG+
Sbjct: 392 PPEAPAPALPKVVLHVAGADWDLPRESYVLDLLEDEDGSGSGLCLVMNSAGDSDLTIIGN 451

Query: 313 QQQRDTRFVYDLNIDLLSFVKENC 336
            QQ++    YDL  + L FV   C
Sbjct: 452 FQQQNMHVAYDLEKNKLVFVPARC 475


>gi|326524762|dbj|BAK04317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 533

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 79/328 (24%), Positives = 136/328 (41%), Gaps = 29/328 (8%)

Query: 28  FDPRKSSSFQKINCDHPD-CTYFKCV-------NEQCVYTMKYADQSVTKGFAAHETISV 79
           + P +SSS+++  C   D C  F  V       NE C Y     D +VT+G    ET +V
Sbjct: 193 YRPARSSSWRRYRCSQRDTCGNFPYVACKTPDHNESCSYKQMLQDGTVTRGIFGRETATV 252

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
              G  +A   G + GCS    G   DA D    GVL L    +SF +  G   +  FS+
Sbjct: 253 SVSGGRQARLPGLVLGCSTYEAGGTVDAHD----GVLTLGNQHVSFGNIAGQSFQGLFSF 308

Query: 140 CLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLK--DISIDNERM-N 196
           CL +   +G   SSYL FG +             I +  N   + ++   + ++ +R+ N
Sbjct: 309 CL-LATHSGRDASSYLTFGPNPAIETGGVAGETDIIYVTNMPTMGVQVTGVLVNGQRLDN 367

Query: 197 FPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQL 256
            PP+ ++  V   GG  +D+G+ ++      Y  +      + +  +L ++SD  E  + 
Sbjct: 368 IPPEVWNYRV--HGGLNLDTGTSVSSLVEPAYGIVTRALARHLDP-KLEKVSDVIE-FEH 423

Query: 257 CYFL------PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVAL 309
           CY        PET    P +    +  A +      V + +       L     +   ++
Sbjct: 424 CYKWDGVKPAPETI--VPKLELVLQGGARMEPSLTGVLMPEVVPGVACLGFWRRELGPSV 481

Query: 310 IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +G+   ++  + +D     L F K+ C+
Sbjct: 482 LGNVHMQEHIWEFDSVKGKLRFKKDKCT 509


>gi|168042409|ref|XP_001773681.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675069|gb|EDQ61569.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 399

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/305 (23%), Positives = 129/305 (42%), Gaps = 48/305 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINC 41
            ++++GTP  G  + +DTGS + +                      +DP +SS+   ++C
Sbjct: 39  TKIYLGTPPVGYYVQVDTGSDVTWLNCAPCTSCVTETQLPSIKLTTYDPSRSSTDGALSC 98

Query: 42  DHPDCTYFKCVNE-------QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA-- 92
              +C      NE        C Y+  Y D S T+G+   + ++         +   A  
Sbjct: 99  RDSNCGAALGSNEVSCTSAGYCAYSTTYGDGSSTQGYFIQDVMTFQEIHNNTQVNGTASV 158

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC     G +      AL G++G  +  +S  SQL S+  +  RF++CL      G  
Sbjct: 159 YFGCGTTQSG-NLLMSSRALDGLIGFGQAAVSIPSQLASMGKVGNRFAHCLQGDNQGG-- 215

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
               +  G+      P+   T  ++   N Y + +++I++ N R    P +FD T +  G
Sbjct: 216 --GTIVIGS---VSEPNISYTPIVSR--NHYAVGMQNIAV-NGRNVTTPASFDTTSTSAG 267

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
           G I+DSG+ L Y     Y     +FV+    F+ +  S   + +QL +   +    FP++
Sbjct: 268 GVIMDSGTTLAYLVDPAY----TQFVNAVSTFESSMFSSHSQCLQLAWCSLQA--DFPTV 321

Query: 271 AFYFE 275
             +F+
Sbjct: 322 KLFFD 326


>gi|238007638|gb|ACR34854.1| unknown [Zea mays]
 gi|413948713|gb|AFW81362.1| pepsin A [Zea mays]
          Length = 538

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)

Query: 28  FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
           + P KSSS+++I C   +C    Y  C +    E C Y  +  D ++T G    E  +V 
Sbjct: 189 YRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATVT 248

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                 A   G + GCS    G   DA D    GVL L    +SF         +RFS+C
Sbjct: 249 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGEMSFAVHAAKRFGQRFSFC 304

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
           L +   +    SSYL FG +     P T  T  + + +    Y   +  I +  ER++ P
Sbjct: 305 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIP 363

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
            + +D      GG I+D+ + +T    + Y
Sbjct: 364 QEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393


>gi|359496797|ref|XP_002277380.2| PREDICTED: aspartic proteinase nepenthesin-2-like, partial [Vitis
           vinifera]
          Length = 358

 Score = 76.3 bits (186), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 64/259 (24%), Positives = 107/259 (41%), Gaps = 48/259 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V++  G+P++   +I+DTGS+L +                +FDP  S +++ ++C    C
Sbjct: 120 VKVGFGSPARYYSMIVDTGSSLSWLQCKPCVVYCHVQADPLFDPSASKTYKSLSCTSSQC 179

Query: 47  TYF----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           +            +  +  CVYT  Y D S + G+ + + +++           G ++GC
Sbjct: 180 SSLVDATLNNPLCETSSNVCVYTASYGDSSYSMGYLSQDLLTL----APSQTLPGFVYGC 235

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
                G D D   G  AG+LGL R  +S + Q+ S     FSYCL  P   G     +L 
Sbjct: 236 -----GQDSDGLFGRAAGILGLGRNKLSMLGQVSSKFGYAFSYCL--PTRGG---GGFLS 285

Query: 157 FGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G        + + T     P N   Y+L L  I++    +      + +        II
Sbjct: 286 IG-KASLAGSAYKFTPMTTDPGNPSLYFLRLTAITVGGRALGVAAAQYRVPT------II 338

Query: 215 DSGSVLTYFHSDVYWKLHE 233
           DSG+V+T     VY    +
Sbjct: 339 DSGTVITRLPMSVYTPFQQ 357


>gi|226492334|ref|NP_001147965.1| pepsin A precursor [Zea mays]
 gi|195614874|gb|ACG29267.1| pepsin A [Zea mays]
          Length = 538

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/210 (28%), Positives = 90/210 (42%), Gaps = 14/210 (6%)

Query: 28  FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
           + P KSSS+++I C   +C    Y  C +    E C Y  +  D ++T G    E  +V 
Sbjct: 189 YRPAKSSSWRRIRCSQKECALLPYNTCQSPSKAESCSYYQQMQDGTLTMGIYGKEKATVT 248

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                 A   G + GCS    G   DA D    GVL L    +SF         +RFS+C
Sbjct: 249 VSDGRMAKLPGLILGCSVLEAGGSVDAHD----GVLSLGNGEMSFAVHAAKRFGQRFSFC 304

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
           L +   +    SSYL FG +     P T  T  + + +    Y   +  I +  ER++ P
Sbjct: 305 L-LSANSSRDASSYLTFGPNPAVMGPGTMETDIVYNVDVKPAYGPLVTGIFVGGERLDIP 363

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
            + +D      GG I+D+ + +T    + Y
Sbjct: 364 QEIWDAEKVVGGGVILDTSTSVTSLVPEAY 393


>gi|219886219|gb|ACL53484.1| unknown [Zea mays]
 gi|219888509|gb|ACL54629.1| unknown [Zea mays]
 gi|414588374|tpg|DAA38945.1| TPA: nucellin-like aspartic protease [Zea mays]
          Length = 415

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 146/374 (39%), Gaps = 62/374 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
           V + IG P+K   L +DTGS L +   D P +S           ++ + + C +  CT  
Sbjct: 55  VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTAL 114

Query: 50  --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                   KC + +QC Y +KY D + ++G   +++ S+  +     I  G  FGC  D 
Sbjct: 115 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 172

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
                 A   A+ G+LGL R ++S +SQL    I K    +CL     NG     +L FG
Sbjct: 173 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFG 226

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            D+      T          N+Y      +  D   +   P             + DSGS
Sbjct: 227 DDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGS 276

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMA 271
             TYF +  Y  +         +  L Q+SD   P  LC+   + F       N F SM 
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMF 333

Query: 272 FYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYD 323
             F   ++A + I  EN  I+    +  L  +   D   A     +IG    +D   +YD
Sbjct: 334 LSFASAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYD 390

Query: 324 LNIDLLSFVKENCS 337
                L + +  C+
Sbjct: 391 NEKSQLGWARGACT 404


>gi|357440289|ref|XP_003590422.1| Aspartic proteinase-like protein [Medicago truncatula]
 gi|355479470|gb|AES60673.1| Aspartic proteinase-like protein [Medicago truncatula]
          Length = 498

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 161/384 (41%), Gaps = 74/384 (19%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +GTP +   + +DTGS +++                     FD   SS+   + C  
Sbjct: 87  KVKMGTPPREFTVQIDTGSDILWINCNTCSNCPKSSGLGIELNFFDTVGSSTAALVPCSD 146

Query: 44  PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
           P C         +C  +  QC YT +Y D S T G    + +    ++G+     +   A
Sbjct: 147 PMCASAIQGAAAQCSPQVNQCSYTFQYEDGSGTSGVYVSDAMYFDMILGQSTPANVASSA 206

Query: 93  --LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCS    G D    D A+ G+LG     +S +SQL S  I  K FS+CL      G
Sbjct: 207 TIVFGCSTYQSG-DLTKTDKAVDGILGFGPGELSVVSQLSSRGITPKVFSHCLKGDGNGG 265

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
                 L  G  +    PS   +  + + P+  Y L+L+ I+++ + ++  P  F    S
Sbjct: 266 ----GILVLGEIL---EPSIVYSPLVPSQPH--YNLNLQSIAVNGQVLSINPAVF--ATS 314

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NR 266
            + G IIDSG+ L+Y   + Y  L     +   +F  + +S   +    CY +  +  + 
Sbjct: 315 DKRGTIIDSGTTLSYLVQEAYDPLVNAVDTAVSQFATSFISKGSQ----CYLVLTSIDDS 370

Query: 267 FPSMAFYFE-DANLRI------------DGENVFIIDYENHFFLLAVAPHDDLVALIGSQ 313
           FP+++F FE  A++ +            DG  ++ I ++            + V ++G  
Sbjct: 371 FPTVSFNFEGGASMDLKPSQYLLNRGFQDGAKMWCIGFQK---------VQEGVTILGDL 421

Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
             +D   VYDL    + +   +CS
Sbjct: 422 VLKDKIVVYDLARQQIGWTNYDCS 445


>gi|195645150|gb|ACG42043.1| aspartic proteinase Asp1 precursor [Zea mays]
          Length = 415

 Score = 75.9 bits (185), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/374 (25%), Positives = 146/374 (39%), Gaps = 62/374 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
           V + IG P+K   L +DTGS L +   D P +S           ++ + + C +  CT  
Sbjct: 55  VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTAL 114

Query: 50  --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                   KC + +QC Y +KY D + ++G   +++ S+  +     I  G  FGC  D 
Sbjct: 115 HSGQGSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQ 172

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
                 A   A+ G+LGL R ++S +SQL    I K    +CL     NG     +L FG
Sbjct: 173 QVGKNGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFG 226

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            D+      T          N+Y      +  D   +   P             + DSGS
Sbjct: 227 DDVVPSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGS 276

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMA 271
             TYF +  Y  +         +  L Q+SD   P  LC+   + F       N F SM 
Sbjct: 277 TYTYFTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMF 333

Query: 272 FYF---EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYD 323
             F   ++A + I  EN  I+    +  L  +   D   A     +IG    +D   +YD
Sbjct: 334 LSFSSAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYD 390

Query: 324 LNIDLLSFVKENCS 337
                L + +  C+
Sbjct: 391 NEKSQLGWARGACT 404


>gi|168060150|ref|XP_001782061.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666472|gb|EDQ53125.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 423

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 91/378 (24%), Positives = 147/378 (38%), Gaps = 63/378 (16%)

Query: 4   LFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPDCTY 48
           L +G+P K   L +DTGS L +A               +++P+K+   + ++C  P C  
Sbjct: 44  LLLGSPPKLYFLDMDTGSDLTWAQCDAPCRNCAIGPHGLYNPKKA---KVVDCHLPVCAQ 100

Query: 49  ------FKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 ++C ++  QC Y ++YAD S T G    +T++V     G  I   A+ GC  D 
Sbjct: 101 IQQGGSYECNSDVKQCDYEVEYADGSSTMGVLVEDTLTVR-LTNGTLIQTKAIIGCGYDQ 159

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            G    +   +  GV+GLS   ++  +QL    IIK    +CL     +G     YL FG
Sbjct: 160 QGTLAKS-PASTDGVIGLSSSKVALPAQLAEKGIIKNVLGHCLA----DGSNGGGYLFFG 214

Query: 159 TDMGYRRPS--TQATKFINHPNNF-YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            ++    PS     T  +  P    Y   L+ I    + +    D  D+T S     + D
Sbjct: 215 DEL---VPSWGMTWTPMMGKPEMLGYQARLQSIRYGGDSLVLNNDE-DLTRS-TSSVMFD 269

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+  TY     Y  +           ++   +  P     C+  P  F     +  YF+
Sbjct: 270 SGTSFTYLVPQAYASVLSAVTKQSGLLRVKSDTTLP----YCWRGPSPFQSITDVHQYFK 325

Query: 276 DANLRIDGENVF----IIDYENHFFLL-------------AVAPHDDLVALIGSQQQRDT 318
              L   G N F     +D     +L+             A     ++  +IG    R  
Sbjct: 326 TLTLDFGGRNWFATDSTLDLSPQGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGY 385

Query: 319 RFVYDLNIDLLSFVKENC 336
             VYD   D + +++ NC
Sbjct: 386 LVVYDNVRDRIGWIRRNC 403


>gi|449518783|ref|XP_004166415.1| PREDICTED: aspartic proteinase-like protein 2-like, partial
           [Cucumis sativus]
          Length = 420

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 79/324 (24%), Positives = 132/324 (40%), Gaps = 47/324 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTPSK   + +DTGS +++                     +D  +S++ + ++CD
Sbjct: 89  AKIGIGTPSKDYYVQVDTGSDIVWVNCIQCRECPRTSSLGMELTPYDLEESTTGKLVSCD 148

Query: 43  HPDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              C             N  C Y   Y D S T G+   + +    V G  E  A     
Sbjct: 149 EQFCLEVNGGPLSGCTTNMSCPYLQIYGDGSSTAGYFVKDYVQYNRVSGDLETTAANGSI 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC     G    + + AL G+LG  +   S ISQL S   +KK F++CL     +G  
Sbjct: 209 KFGCGARQSGDLGSSGEEALDGILGFGKSNSSIISQLASTRKVKKMFAHCL-----DGTN 263

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + N P+  Y +++  + + +  +N   D F+      
Sbjct: 264 GGGIFAMGHVV---QPKVNMTPLVPNQPH--YNVNMTGVQVGHIILNISADVFE--AGDR 316

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L Y    +Y  L  K +S     ++  +    +  Q   +     + FP 
Sbjct: 317 KGTIIDSGTTLAYLPELIYEPLVAKILSQQHNLEVQTIHGEYKCFQ---YSERVDDGFPP 373

Query: 270 MAFYFEDANLRIDGENVFIIDYEN 293
           + F+FE++ L     + ++  YEN
Sbjct: 374 VIFHFENSLLLKVYPHEYLFQYEN 397


>gi|413922067|gb|AFW61999.1| hypothetical protein ZEAMMB73_694403, partial [Zea mays]
          Length = 328

 Score = 75.9 bits (185), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/222 (27%), Positives = 95/222 (42%), Gaps = 43/222 (19%)

Query: 7   GTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC------ 46
           G+P+  + +I+DTGS L +               +FDP  S+++  + C+   C      
Sbjct: 103 GSPAANLTVIVDTGSDLTWVQCKPCSACYAQRDPLFDPAGSATYAAVRCNASACADSLRA 162

Query: 47  ---TYFKCVN-----EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSN 98
              T   C +     E+C Y + Y D S ++G  A +T+++     G A   G +FGC  
Sbjct: 163 ATGTPGSCGSTGAGSEKCYYALAYGDGSFSRGVLATDTVAL-----GGASLGGFVFGCGL 217

Query: 99  DNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            N G       G  AG++GL R  +S +SQ  S     FSYCL          S  L  G
Sbjct: 218 SNRGLF-----GGTAGLMGLGRTELSLVSQTASRYGGVFSYCLPAATSGDASGSLSLGGG 272

Query: 159 TDMG--YRRPSTQA-TKFINHPNN--FYYLSLKDISIDNERM 195
            D    YR  +  A T+ I  P    FY+L++   ++    +
Sbjct: 273 DDAASSYRNTTPVAYTRMIADPAQPPFYFLNVTGAAVGGTAL 314


>gi|147771308|emb|CAN69536.1| hypothetical protein VITISV_043237 [Vitis vinifera]
          Length = 372

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 81/358 (22%), Positives = 147/358 (41%), Gaps = 43/358 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP++ +L+ +DT S + +            +F+   S++++ + C    C   
Sbjct: 37  IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 96

Query: 50  K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C + + Y   S+    +  +TI++           G  FGC     G    
Sbjct: 97  PKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGYSFGCIQKATGGSLP 150

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           A+           R  +S +SQ  ++ +  FSYCL  P       S  L+ G     +R 
Sbjct: 151 AQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVGQPKR- 202

Query: 167 STQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P   + Y+++L  + +    ++ PP +F    S   G I DSG+V T   
Sbjct: 203 -IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 261

Query: 225 SDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           +  Y  + + F +   R   +  L         CY +P      P++ F F   N+ +  
Sbjct: 262 TPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APTITFMFTGMNVTLPP 314

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+    L   +E C+
Sbjct: 315 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 372


>gi|242089103|ref|XP_002440384.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
 gi|241945669|gb|EES18814.1| hypothetical protein SORBIDRAFT_09g030880 [Sorghum bicolor]
          Length = 555

 Score = 75.5 bits (184), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 65/229 (28%), Positives = 99/229 (43%), Gaps = 19/229 (8%)

Query: 28  FDPRKSSSFQKINCDHPDCT---YFKCVN----EQCVYTMKYADQSVTKGFAAHETISVI 80
           + P KSSS+++I C    C    Y  C +    E C Y  K  D +VT G   +E  +V 
Sbjct: 208 YRPAKSSSWRRIRCSEQQCAHLPYNTCQSPSKLESCSYYQKTQDGTVTIGIYGNEKATVT 267

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
                 A   G + GCS    G   DA DG L+   G     I  + + G     RFS+C
Sbjct: 268 VSDGRMAKLPGLVLGCSVLEAGASVDAHDGVLSLGNGHMSFAIHAVLRFGG----RFSFC 323

Query: 141 LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFP 198
           L +   +    SSYL FG +     P T  T+ + + +    Y   +  + +  ER++ P
Sbjct: 324 L-LSANSSRDASSYLTFGPNPAVMGPGTMETEILYNVDVKAAYGPRVTAVLVGGERLDIP 382

Query: 199 PDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
            D ++I      G I+D+ + +T    + Y    E  V+  +R  LA L
Sbjct: 383 DDVWNIDKGLGSGVILDTSTSVTSLVPEAY----EPLVAALDR-HLAHL 426


>gi|414866064|tpg|DAA44621.1| TPA: putative aspartic protease family protein [Zea mays]
          Length = 454

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 89/392 (22%), Positives = 150/392 (38%), Gaps = 71/392 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
           V + +G P + V ++LDTGS L +                 A F+   SS++   +C  P
Sbjct: 64  VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 123

Query: 45  DCTYFK--------CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
           +C +          C    +  C  ++ YAD S   G  A +T  + G    +A     L
Sbjct: 124 ECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGILAADTFLLGGAPPVRA-----L 178

Query: 94  FGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
           FGC  S  +      +   A  G+LG++R ++SF++Q  ++   RF+YC+      G+  
Sbjct: 179 FGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIA----PGD-G 230

Query: 152 SSYLKFGTDMGYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
              L  G D     P    T  I  + P  +     Y + L+ I +    +  P      
Sbjct: 231 PGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAP 290

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFL 260
             +G G  ++DSG+  T+  +D Y  L  +F++      LA L +     Q     C+  
Sbjct: 291 DHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTSAL-LAPLGESDFVFQGAFDACFRA 349

Query: 261 PE-----TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVA 308
            E          P +      A + + GE  ++ +  E      A A       + D+  
Sbjct: 350 SEARVAAASQMLPEVGLVLRGAEVAVGGEKLLYRVPGERRGEGGAEAVWCLTFGNSDMAG 409

Query: 309 L----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +    IG   Q++    YDL    + F    C
Sbjct: 410 MSAYVIGHHHQQNVWVEYDLQNGRVGFAPARC 441


>gi|225465837|ref|XP_002264626.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 1 [Vitis
           vinifera]
          Length = 437

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 81/358 (22%), Positives = 147/358 (41%), Gaps = 43/358 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP++ +L+ +DT S + +            +F+   S++++ + C    C   
Sbjct: 102 IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 161

Query: 50  K---CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
               C    C + + Y   S+    +  +TI++           G  FGC     G    
Sbjct: 162 PKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGYSFGCIQKATGGSLP 215

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           A+           R  +S +SQ  ++ +  FSYCL  P       S  L+ G     +R 
Sbjct: 216 AQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFSGSLRLGPVGQPKR- 267

Query: 167 STQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
             + T  + +P   + Y+++L  + +    ++ PP +F    S   G I DSG+V T   
Sbjct: 268 -IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGAGTIFDSGTVFTRLV 326

Query: 225 SDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
           +  Y  + + F +   R   +  L         CY +P      P++ F F   N+ +  
Sbjct: 327 TPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APTITFMFTGMNVTLPP 379

Query: 284 ENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +N+ I         LA+A   D    ++ +I + QQ++ R +YD+    L   +E C+
Sbjct: 380 DNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVPNSRLGVARELCT 437


>gi|226530102|ref|NP_001152414.1| PCS1 precursor [Zea mays]
 gi|195656033|gb|ACG47484.1| PCS1 [Zea mays]
          Length = 452

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 68/272 (25%), Positives = 112/272 (41%), Gaps = 50/272 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDHP 44
           V + +G P + V ++LDTGS L +                 A F+   SS++   +C  P
Sbjct: 62  VPVAVGAPPQNVTMVLDTGSELSWLRCNGSRVPSTPPPQAPAAFNGSASSTYAAAHCSSP 121

Query: 45  DCTYFK--------CVNE---QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL 93
           +C +          C       C  ++ YAD S   G  A +T  +     G A    AL
Sbjct: 122 ECQWRGRDLPVPPFCAGPPSXSCRVSLSYADASSADGILAADTFLL-----GGAPPVXAL 176

Query: 94  FGC--SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYT 151
           FGC  S  +      +   A  G+LG++R ++SF++Q  ++   RF+YC+      G+  
Sbjct: 177 FGCVTSYSSATATNSSDSEAATGLLGMNRGSLSFVTQTATL---RFAYCIA----PGD-G 228

Query: 152 SSYLKFGTDMGYRRPSTQATKFI--NHPNNF-----YYLSLKDISIDNERMNFPPDTFDI 204
              L  G D     P    T  I  + P  +     Y + L+ I +    +  P      
Sbjct: 229 PGLLVLGGDGAALAPQLNYTPLIQISRPLPYFDRVAYSVQLEGIRVGAALLPIPKSVLAP 288

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
             +G G  ++DSG+  T+  +D Y  L  +F+
Sbjct: 289 DHTGAGQTMVDSGTQFTFLLADAYAPLKGEFL 320


>gi|308081797|ref|NP_001182920.1| uncharacterized protein LOC100501208 [Zea mays]
 gi|238008190|gb|ACR35130.1| unknown [Zea mays]
 gi|413922182|gb|AFW62114.1| hypothetical protein ZEAMMB73_927324 [Zea mays]
          Length = 269

 Score = 75.5 bits (184), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/256 (25%), Positives = 119/256 (46%), Gaps = 28/256 (10%)

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSS 153
           FGC    +G    A     +G++G+S   +S + QL SI K  FSYCL    P  ++ +S
Sbjct: 26  FGCGKLTNGTIAGA-----SGIMGVSPGPLSVLKQL-SITK--FSYCLT---PFTDHKTS 74

Query: 154 YLKFGT--DMGYRRPS--TQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
            + FG   D+G  + +   Q    + +P  + +YY+ +  ISI ++R++ P     +   
Sbjct: 75  PVMFGAMADLGKYKTTGKVQTIPLLKNPVEDIYYYVPMVGISIGSKRLDVPEAILALRPD 134

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-- 265
           G GG ++DS + L Y     + +L +   +  E  +L   +   +   +C+ LP   +  
Sbjct: 135 GTGGTVLDSATTLAYLVEPAFKELKK---AVMEGMKLPAANRSIDDYPVCFELPRGMSME 191

Query: 266 --RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRF 320
             + P +  +F  DA + +  ++ F  +       LAV  AP +    +IG+ QQ++   
Sbjct: 192 GVQVPPLVLHFAGDAEMSLPRDSYF-QEPSPGMMCLAVMQAPFEGAPNVIGNVQQQNMHV 250

Query: 321 VYDLNIDLLSFVKENC 336
           +YDL     S+    C
Sbjct: 251 LYDLGNRKFSYAPTKC 266


>gi|357159298|ref|XP_003578403.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 442

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 97/378 (25%), Positives = 158/378 (41%), Gaps = 73/378 (19%)

Query: 5   FIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFKCVNEQCVYTMKYAD 64
            IG+P +    ++DTGS LI+     + +++    +C      Y+          +  AD
Sbjct: 91  LIGSPPQRTEALIDTGSDLIWT----QCATTCLPKSCAKQGLPYYNLSQSSTFVPVPCAD 146

Query: 65  QSVTKGFAAHETISVIGK----------GEGKAIFHGAL---------------FGC--- 96
           ++   GF A   + + G           G G+ I  G+L               FGC   
Sbjct: 147 KA---GFCAANGVHLCGLDGSCTFIASYGAGRVI--GSLGTESFAFESGTTSLAFGCVSL 201

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           +    G   DA     +G++GL R  +S +SQ+G+    RFSYCL  P  +    SS+L 
Sbjct: 202 TRITSGALNDA-----SGLIGLGRGRLSLVSQIGAT---RFSYCLT-PYFHSSGASSHL- 251

Query: 157 FGTDMGYRRPSTQATKFINHPNN-----FYYLSLKDISIDNERM-NFPPDTFDITV---- 206
           F            +  F+  P +     FYYL L+ I++   R+      TF +      
Sbjct: 252 FVGASASLGGGGASMPFVKSPKDYPYSTFYYLPLEGITVGKTRLPAVNSTTFQLRQLFKG 311

Query: 207 SGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCYFLPE 262
              GG IID+GS LT   S  Y  L E+  +     QL   S  P P    ++LC    E
Sbjct: 312 YWAGGVIIDTGSPLTQLASHAYEALKEEVAA-----QLGNGSLVPAPEDSGLELC-VARE 365

Query: 263 TFNRF-PSMAFYFED-ANLRIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTR 319
            F +  P++ F+F   A++ +   + +  +D      ++    +D   ++IG+ QQ+D  
Sbjct: 366 GFQKVVPALVFHFGGGADMAVPAASYWAPVDKAAACMMILEGGYD---SIIGNFQQQDMH 422

Query: 320 FVYDLNIDLLSFVKENCS 337
            +YDL     SF   +C+
Sbjct: 423 LLYDLRRGRFSFQTADCT 440


>gi|224058947|ref|XP_002299658.1| predicted protein [Populus trichocarpa]
 gi|222846916|gb|EEE84463.1| predicted protein [Populus trichocarpa]
          Length = 451

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 144/373 (38%), Gaps = 55/373 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           RL +G+P +   + +DTGS +++                     FDP  S +   I+C  
Sbjct: 93  RLQLGSPPRDFYVQIDTGSDVLWVSCSSCNGCPVSSGLHIPLNFFDPGSSPTASLISCSD 152

Query: 44  PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
             C+             N QC YT +Y D S T G+   + +   +++G    K      
Sbjct: 153 QRCSLGLQSSDSVCAAQNNQCGYTFQYGDGSGTSGYYVSDLLHFDTILGGSVMKNSSAPI 212

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  + FS+C    L   + 
Sbjct: 213 VFGCSTLQTG-DLTKPDRAVDGIFGFGQQDMSVISQLASQGITPRVFSHC----LKGDDS 267

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               L  G  +    P+   T  + + P+  Y L+L+ I ++ + +   P  F    S  
Sbjct: 268 GGGILVLGEIV---EPNIVYTPLVPSQPH--YNLNLQSIYVNGQTLAIDPSVF--ATSSN 320

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
            G IIDSG+ L Y     Y    + F+S         +S        CY    + N  FP
Sbjct: 321 QGTIIDSGTTLAYLTEAAY----DPFISAITSTVSPSVSPYLSKGNQCYLTSSSINDVFP 376

Query: 269 SMAFYFEDANLRIDGENVFIIDYE--NHFFLLAVA---PHDDLVALIGSQQQRDTRFVYD 323
            ++  F      I     ++I     N   L  V         + ++G    +D  FVYD
Sbjct: 377 QVSLNFAGGTSMILIPQDYLIQQSSINGAALWCVGFQKIQGQEITILGDLVLKDKIFVYD 436

Query: 324 LNIDLLSFVKENC 336
           +    + +   +C
Sbjct: 437 IAGQRIGWANYDC 449


>gi|147801191|emb|CAN68822.1| hypothetical protein VITISV_007106 [Vitis vinifera]
          Length = 443

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 99/212 (46%), Gaps = 39/212 (18%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDCTYFKC 51
           +G PS  V  I DTGS LI+               IFDP +S +++ ++ D P C   + 
Sbjct: 63  LGVPSTLVYGIADTGSELIWLQCLPCTHCYNQTPPIFDPAESYTYETVSSDSPICNAVRR 122

Query: 52  V-----NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
           +     ++ C Y   Y D + TKG  + +  +               FGCS+D       
Sbjct: 123 ISCREGDKSCCYQHTYGDGTTTKGTLSTDVFAFEDPTRTIVEVGYLTFGCSHDT-----K 177

Query: 107 AR-DGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTD---MG 162
           AR  G  AGV+GL+R   S +SQL     K+FSYC+VIP  +G  + S + FG+    +G
Sbjct: 178 ARLKGHQAGVVGLNRHPNSLVSQLKV---KKFSYCMVIPDDHG--SGSRMYFGSRAVILG 232

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNER 194
            + P       +    + Y+++LK IS+  E+
Sbjct: 233 GKTP------LLKGDYSHYFVTLKGISVGEEK 258


>gi|357465299|ref|XP_003602931.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355491979|gb|AES73182.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 438

 Score = 75.1 bits (183), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 154/360 (42%), Gaps = 45/360 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGS--ALIYA---------IFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ IGTP + + ++LDT +  A I +          F P  S+S+  + C  P C+  
Sbjct: 99  IVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATTFSPNASTSYVPLECSVPQCSQV 158

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   YA  + +      +++ +        +     FG  N   G 
Sbjct: 159 RGLSCPATGSGACSFNKSYAGSTYSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A+           R  +S +SQ GS+    FSYCL  P     Y S  LK G  +G 
Sbjct: 213 SIPAQGLLGL-----GRGPLSLLSQTGSLYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P   + Y+++L  I++    + FP +     V+   G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVIT 323

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY  + ++F     R Q+            C F+       P++  +F D +L++
Sbjct: 324 RFVEPVYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377

Query: 282 DGENVFIIDYENHFFLLAVA--PHD---DLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         LA+A  P +    ++ +I + QQ++ R ++D   + +   +E C
Sbjct: 378 PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFDTVNNKVGIARELC 437


>gi|223973231|gb|ACN30803.1| unknown [Zea mays]
          Length = 459

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 91/402 (22%), Positives = 162/402 (40%), Gaps = 85/402 (21%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------------AIFDPRKSSSFQKINCDHPD 45
           +GTP + + ++LDTGS L +                     +F P+ SSS + + C +P 
Sbjct: 73  LGTPPQPLPVLLDTGSHLTWVPCTSSYECRNCSSPSASAVPVFHPKNSSSSRLVGCRNPS 132

Query: 46  CTYF--------KCVNEQC----------------VYTMKYADQSVTKGFAAHETISVIG 81
           C +         KC    C                 Y + Y   S T G    +T+    
Sbjct: 133 CQWVHSAANLATKCRRAPCSPGAANCPAAASNVCPPYAVVYGSGS-TAGLLIADTL---- 187

Query: 82  KGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCL 141
           +  G+A+  G + GCS         +     +G+ G  R   S  +QLG     +FSYCL
Sbjct: 188 RAPGRAV-PGFVLGCS-------LVSVHQPPSGLAGFGRGAPSVPAQLG---LPKFSYCL 236

Query: 142 VI------PLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNN-FYYLSLKDISIDNER 194
           +          +G         G  M Y  P  ++      P   +YYL+L+ +++  + 
Sbjct: 237 LSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKSAAGDKLPYGVYYYLALRGVTVGGKA 295

Query: 195 MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEP 253
           +  P   F    +G GG I+DSG+  TY    V+  + +  V+    R++ ++ ++    
Sbjct: 296 VRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQPVADAVVAAVGGRYKRSKDAEDELG 355

Query: 254 IQLCYFLPETFNR--FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-- 308
           +  C+ LP+       P ++F+FE  A +++  EN F++        + +A   D     
Sbjct: 356 LHPCFALPQGARSMALPELSFHFEGGAVMQLPVENYFVVAGRGAVEAICLAVVTDFSGGS 415

Query: 309 -----------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDD 339
                      ++GS QQ++    YDL  + L F +++C+  
Sbjct: 416 GAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFRRQSCTSS 457


>gi|242089623|ref|XP_002440644.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
 gi|241945929|gb|EES19074.1| hypothetical protein SORBIDRAFT_09g004500 [Sorghum bicolor]
          Length = 469

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 90/361 (24%), Positives = 145/361 (40%), Gaps = 63/361 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFD--------------PRKSSSFQKINCDHPDCTYFK- 50
           IGTP + +  + DTGS LI+   D              P  SS+F ++ C    C   + 
Sbjct: 106 IGTPPQKLTALADTGSDLIWTKCDAGGGAAWGGSSSYHPNASSTFTRLPCSDRLCAALRS 165

Query: 51  -----CV--NEQCVYTMKYA---DQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                C     +C Y   Y    D   T+GF   ET ++ G         G  FGC+   
Sbjct: 166 YSLARCAAGGAECDYKYAYGLGDDPDFTQGFLGSETFTLGGDA-----VPGVGFGCTTAL 220

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT- 159
            G   D  +GA  G++GL R  +S +SQL +     F YCL          +S L FG  
Sbjct: 221 EG---DYGEGA--GLVGLGRGPLSLVSQLDA---GTFMYCLTADASK----ASPLLFGAL 268

Query: 160 -DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
             M       Q+T  +     FY ++L+ I+I +           +          DSG+
Sbjct: 269 ATMTGAGAGVQSTGLLAS-TTFYAVNLRSITIGSATTAGVGGPGGVV--------FDSGT 319

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE--PIQLCYFLPETFNRFPSMAFYFE- 275
            LTY     Y +    F+S     Q   L+        + CY  P++    P+M  +F+ 
Sbjct: 320 TLTYLAEPAYTEAKAAFLS-----QTTSLTPVEGRYGFEACYEKPDSARLIPAMVLHFDG 374

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKEN 335
            A++ +   N ++++ ++      V     L ++IG+  Q +   ++D+   +LSF   N
Sbjct: 375 GADMALPVAN-YVVEVDDGVVCWVVQRSPSL-SIIGNIMQMNYLVLHDVRKSVLSFQPAN 432

Query: 336 C 336
           C
Sbjct: 433 C 433


>gi|224140237|ref|XP_002323490.1| predicted protein [Populus trichocarpa]
 gi|222868120|gb|EEF05251.1| predicted protein [Populus trichocarpa]
          Length = 478

 Score = 75.1 bits (183), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 90/382 (23%), Positives = 158/382 (41%), Gaps = 72/382 (18%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G+P +   + +DTGS +++                     FD   SS+   ++C  
Sbjct: 69  KVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGLVHCSD 128

Query: 44  PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
           P CT        +C  +  QC YT +Y D S T G+   +T+   +++  GE   +   A
Sbjct: 129 PICTSAVQTTVTQCSPQTNQCSYTFQYEDGSGTSGYYVSDTLYFDAIL--GESLVVNSSA 186

Query: 93  L--FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
           L  FGCS    G D    D A+ G+ G  +  +S ISQL +  I  + FS+CL      G
Sbjct: 187 LIVFGCSTFQSG-DLTMTDKAVDGIFGFGQGELSVISQLSTHGITPRVFSHCL-----KG 240

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           E     +    ++    P    +  + + P+  Y L+L+ I+++ + +   P  F    S
Sbjct: 241 EGIGGGILVLGEI--LEPGMVYSPLVPSQPH--YNLNLQSIAVNGKLLPIDPSVF--ATS 294

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR- 266
              G I+DSG+ L Y  ++ Y    + FVS         ++        CY +  + ++ 
Sbjct: 295 NSQGTIVDSGTTLAYLVAEAY----DPFVSAVNVIVSPSVTPIISKGNQCYLVSTSVSQM 350

Query: 267 FPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ-----------Q 315
           FP  +F F        G    ++  E++      +    ++  IG Q+            
Sbjct: 351 FPLASFNFA-------GGASMVLKPEDYLIPFGPSQGGSVMWCIGFQKVQGVTILGDLVL 403

Query: 316 RDTRFVYDLNIDLLSFVKENCS 337
           +D  FVYDL    + +   +CS
Sbjct: 404 KDKIFVYDLVRQRIGWANYDCS 425


>gi|225436397|ref|XP_002272121.1| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
          Length = 499

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 152/375 (40%), Gaps = 58/375 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           R+ +G+P K   + +DTGS +++                     FDP  SS+   I+C  
Sbjct: 86  RVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSD 145

Query: 44  PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
             C+         C ++  QC+YT +Y D S T G+   + +   +++G     +     
Sbjct: 146 QRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASI 204

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQ+ S  I  K FS+CL      G  
Sbjct: 205 VFGCSISQTG-DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGI 263

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
                    D+ Y      +    + P+  Y L+L+ IS++ + +   P+ F    S   
Sbjct: 264 LVLGEIVEEDIVY------SPLVPSQPH--YNLNLQSISVNGKSLAIDPEVF--ATSTNR 313

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
           G I+DSG+ L Y   + Y    + FVS         +         CY +  +    FP+
Sbjct: 314 GTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKGIFPT 369

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
           ++  F    ++ +  E+  +   +N     AV            + ++G    +D  FVY
Sbjct: 370 VSLNFAGGVSMNLKPEDYLL--QQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVY 427

Query: 323 DLNIDLLSFVKENCS 337
           DL    + +   +CS
Sbjct: 428 DLAGQRIGWANYDCS 442


>gi|357127503|ref|XP_003565419.1| PREDICTED: probable aspartic protease At2g35615-like [Brachypodium
           distachyon]
          Length = 486

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 96/390 (24%), Positives = 149/390 (38%), Gaps = 77/390 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------------AIFDPRKSSSFQKINCDH 43
           ++ + +GTP   VL I DTGS L++                   F P  SS++ ++ CD 
Sbjct: 111 LMAIEVGTPPVRVLAIADTGSDLVWVKCKGKDNDNNSTAPPSVYFVPSASSTYGRVGCDT 170

Query: 44  PDCTYFKCV-----NEQCVYTMKYADQSVTKGFAAHE--TISVIGKGEGKAIFHGAL--- 93
             C           +  C Y   Y D S   G  + E  T S I                
Sbjct: 171 KACRALSSAASCSPDGSCEYLYSYGDGSRASGQLSTETFTFSTIADSSKTNSHGNNNNNS 230

Query: 94  ------------FGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFS 138
                       FGCS    G F  D        ++GL    +S  SQLG+   + ++FS
Sbjct: 231 SSHGQVEIAKLDFGCSTTTTGTFRADG-------LVGLGGGPVSLASQLGATTSLGRKFS 283

Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN-HPNNFYYLSLKDISIDNERMNF 197
           YCL  P  N    SS L FG+      P   +T  I      +Y ++L  I++   +   
Sbjct: 284 YCLA-PYANTN-ASSALNFGSRAVVSEPGAASTPLITGEVETYYTIALDSINVAGTKRP- 340

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI-QL 256
                  T + +   I+DSG+ LTY  S +   L +       R +L + ++ PE I  L
Sbjct: 341 -------TTAAQAHIIVDSGTTLTYLDSALLTPLVKDLT---RRIKLPR-AESPEKILDL 389

Query: 257 CYFLPETFNRFPSMAFYFEDANLRIDG--------ENVFIIDYENHFFLLAVAPHD-DLV 307
           CY +          A    D  L + G        +N F++  E    L  VA  +   V
Sbjct: 390 CYDISGVRGE---DALGIPDVTLVLGGGGEVTLKPDNTFVVVQEGVLCLALVATSERQSV 446

Query: 308 ALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           +++G+  Q++    YDL    ++F   +C+
Sbjct: 447 SILGNIAQQNLHVGYDLEKGTVTFAAADCA 476


>gi|357155293|ref|XP_003577072.1| PREDICTED: LOW QUALITY PROTEIN: probable aspartic protease
           At2g35615-like [Brachypodium distachyon]
          Length = 429

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 90/373 (24%), Positives = 142/373 (38%), Gaps = 66/373 (17%)

Query: 6   IGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDHPDC- 46
           +GTP    L+ +DTGS L +                  ++FDP KS++++ + C   DC 
Sbjct: 81  LGTPPVANLVTVDTGSTLSWVVCQRCQISCHTTAPEAGSVFDPDKSTTYELVGCSSRDCA 140

Query: 47  -------TYFKCVNE--QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                    F C+ E   C+Y+++Y      +  A       +      +I  G +FGCS
Sbjct: 141 DVQRSLVAPFGCIEETDTCLYSLRYGSGPSGQYSAGRLGTDKLTLASSSSIIDGFIFGCS 200

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKR-FSYCLVIPLPNGEYTSSYLK 156
            D      D+  G  +GV+G      SF +Q+      R FSYC     P       +L 
Sbjct: 201 GD------DSFKGYESGVIGFGGANFSFFNQVARQTNYRAFSYC----FPGDHTAEGFLS 250

Query: 157 FGTDMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
            G    Y +     T  I H    + Y L   D+ +D  R+      +   +      ++
Sbjct: 251 IGA---YPKDELVYTNLIPHFGDRSVYSLQQIDMMVDGNRLQVDQSEYTKRM-----MVV 302

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-----FPS 269
           DSG+V T+    V+    +   S  +      LSD     + C F P   +       P+
Sbjct: 303 DSGTVDTFLLGPVFDAFSKAMASAMQAKGF--LSDTVG-TETC-FRPNGGDSVDSGDLPT 358

Query: 270 MAFYFEDANLRIDGENVFIIDYENH-FFLLAVAPHDDL-----VALIGSQQQRDTRFVYD 323
           +   F    L++  ENVF     +H    LA  P  D+     V ++G++     R VYD
Sbjct: 359 VEMRFIGTTLKLPPENVFHDLLPSHDKICLAFKP--DVAGVRNVQILGNKATXSFRVVYD 416

Query: 324 LNIDLLSFVKENC 336
           L      F    C
Sbjct: 417 LQAMYFGFQAGAC 429


>gi|21805926|gb|AAM76716.1| nucellin-like aspartic protease [Zea mays]
          Length = 357

 Score = 74.7 bits (182), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 94/370 (25%), Positives = 144/370 (38%), Gaps = 62/370 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF---- 49
           IG P+K   L +DTGS L +   D P +S           ++ + + C +  CT      
Sbjct: 1   IGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANRLVPCANALCTALHSGQ 60

Query: 50  ----KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFD 104
               KC + +QC Y +KY D + ++G   +++ S+  +     I  G  FGC  D     
Sbjct: 61  GSNNKCPSPKQCDYQIKYTDSASSQGVLINDSFSLPMRSSN--IRPGLTFGCGYDQQVGK 118

Query: 105 EDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
             A   A+ G+LGL R ++S +SQL    I K    +CL     NG     +L FG D+ 
Sbjct: 119 NGAVQAAIDGMLGLGRGSVSLVSQLKQQGITKNVVGHCLST---NG---GGFLFFGDDVV 172

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                T          N+Y      +  D   +   P             + DSGS  TY
Sbjct: 173 PSSRVTWVPMAQRTSGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDSGSTYTY 222

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPSMAFYF- 274
           F +  Y  +         +  L Q+SD   P  LC+   + F       N F SM   F 
Sbjct: 223 FTAQPYQAVVSALKGGLSK-SLKQVSDPTLP--LCWKGQKAFKSVFDVKNEFKSMFLSFA 279

Query: 275 --EDANLRIDGENVFIIDYENHFFLLAVAPHDDLVA-----LIGSQQQRDTRFVYDLNID 327
             ++A + I  EN  I+    +  L  +   D   A     +IG    +D   +YD    
Sbjct: 280 SAKNAAMEIPPENYLIVTKNGNVCLGIL---DGTAAKLSFNVIGDITMQDQMVIYDNEKS 336

Query: 328 LLSFVKENCS 337
            L + +  C+
Sbjct: 337 QLGWARGACT 346


>gi|357481195|ref|XP_003610883.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355512218|gb|AES93841.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 315

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 79/309 (25%), Positives = 134/309 (43%), Gaps = 26/309 (8%)

Query: 40  NCDHPDC----TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
           +CD P C    T      ++C YT  Y D S+TKG  A +T +              LFG
Sbjct: 20  SCDSPLCHKLDTGVCSPEKRCNYTYGYGDNSLTKGVLAQDTATFTSNTGKLVSLSRFLFG 79

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSII-KKRFSYCLVIPLPNGEYTSSY 154
           C ++N G   D       G++GL     S ISQ+G +   K+FS CLV P       SS 
Sbjct: 80  CGHNNTGGFNDHE----MGLIGLGGGPTSLISQIGPLFGGKKFSQCLV-PFLTDIKISSR 134

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
           + FG            T  +    +   Y+++L  IS+++  +       + T+  +G  
Sbjct: 135 MSFGKGSQVLGDGVVTTPLVQREQDMTSYFVTLLGISVEDTYL-----PMNSTIE-KGNM 188

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE-PIQLCYFLPETFNRFPSMA 271
           ++DSG+        +Y ++   +V       L  +++ P    QLCY   +T  + P++ 
Sbjct: 189 LVDSGTPPNILPQQLYDRV---YVEVKNNVPLELITNDPSLGPQLCYRT-QTNLKGPTLT 244

Query: 272 FYFEDANLRIDGENVFI--IDYENHFFLLAVAPHDDLVALI-GSQQQRDTRFVYDLNIDL 328
           ++FE ANL +     FI         F LA+  + +    + G+  Q +    +DL+  +
Sbjct: 245 YHFEGANLLLTPIQTFIPPTPETKGVFCLAINNYTNSNGGVYGNFAQSNYLIGFDLDRQV 304

Query: 329 LSFVKENCS 337
           +SF   +C+
Sbjct: 305 VSFKATDCT 313


>gi|359488213|ref|XP_002263620.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 434

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 100/375 (26%), Positives = 154/375 (41%), Gaps = 60/375 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDH------ 43
           +V L IGTP +   ++LDTGS L +             FDP  SSSF  + C+H      
Sbjct: 79  IVSLPIGTPPQTQQMVLDTGSQLSWIQCKVPPKTPPTAFDPLLSSSFSVLPCNHSLCKPR 138

Query: 44  -PDCTYFKCV--NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
            PD T       N  C Y+  YAD +  +G    E  +         +    + GC+ D+
Sbjct: 139 VPDYTLPTSCDQNRLCHYSYFYADGTYAEGNLVREKFTFSSSQTTPPL----ILGCATDS 194

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL-------PNGEY--- 150
                D +     G+LG++   +SF S L  I K  FSYC V P        P G +   
Sbjct: 195 ----SDTQ-----GILGMNLGRLSF-SSLAKISK--FSYC-VPPRRSQSGSSPTGSFYLG 241

Query: 151 ---TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
              +S+  K+   M YR    Q+ +  N     Y L +  I I+ +++N     F    S
Sbjct: 242 PNPSSAGFKYVNLMTYR----QSQRMPNLDPLAYTLPMLGIRINGKKLNISTSAFRADPS 297

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF--N 265
           G G  +IDSG+  T+   + Y K+ E+ V      +L +       + +C+         
Sbjct: 298 GAGQTLIDSGTWFTFLVDEAYSKVKEEIVK-LAGPKLKKGYVYGGSLDMCFDGDAMVIGR 356

Query: 266 RFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL-VA--LIGSQQQRDTRFVY 322
              +MAF FE+    +      + D       L +   D L VA  +IG+  Q+D    +
Sbjct: 357 MIGNMAFEFENGVEIVVEREKMLADVGGGVQCLGIGRSDLLGVASNIIGNFHQQDLWVEF 416

Query: 323 DLNIDLLSFVKENCS 337
           DL    + F + +CS
Sbjct: 417 DLVGRRVGFGRTDCS 431


>gi|297734873|emb|CBI17107.3| unnamed protein product [Vitis vinifera]
          Length = 484

 Score = 74.7 bits (182), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 89/375 (23%), Positives = 152/375 (40%), Gaps = 58/375 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           R+ +G+P K   + +DTGS +++                     FDP  SS+   I+C  
Sbjct: 71  RVLLGSPPKEFYVQIDTGSDVLWVSCGSCNGCPQSSGLHIPLNFFDPGSSSTASLISCSD 130

Query: 44  PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
             C+         C ++  QC+YT +Y D S T G+   + +   +++G     +     
Sbjct: 131 QRCSLGVQSSDAGCSSQGNQCIYTFQYGDGSGTSGYYVSDLLNFDAIVGSSVTNS-SASI 189

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQ+ S  I  K FS+CL      G  
Sbjct: 190 VFGCSISQTG-DLTKSDRAVDGIFGFGQQDMSVISQMSSQGITPKVFSHCLKGDGGGGGI 248

Query: 151 TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
                    D+ Y      +    + P+  Y L+L+ IS++ + +   P+ F    S   
Sbjct: 249 LVLGEIVEEDIVY------SPLVPSQPH--YNLNLQSISVNGKSLAIDPEVF--ATSTNR 298

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FPS 269
           G I+DSG+ L Y   + Y    + FVS         +         CY +  +    FP+
Sbjct: 299 GTIVDSGTTLAYLAEEAY----DPFVSAITEAVSQSVRPLLSKGTQCYLITSSVKGIFPT 354

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
           ++  F    ++ +  E+  +   +N     AV            + ++G    +D  FVY
Sbjct: 355 VSLNFAGGVSMNLKPEDYLL--QQNSIGDAAVWCIGFQKIQGQGITILGDLVLKDKIFVY 412

Query: 323 DLNIDLLSFVKENCS 337
           DL    + +   +CS
Sbjct: 413 DLAGQRIGWANYDCS 427


>gi|255583547|ref|XP_002532530.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223527742|gb|EEF29846.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 440

 Score = 74.3 bits (181), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 102/384 (26%), Positives = 154/384 (40%), Gaps = 74/384 (19%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPD 45
           +V L IGTP +   ++LDTGS L +                 FDP  SSSF  + C+HP 
Sbjct: 81  IVSLPIGTPPQTQQMVLDTGSQLSWIQCHKKSVPKKPPPTTSFDPSLSSSFSVLPCNHPL 140

Query: 46  CTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGC 96
           C            C  N  C Y+  YAD +  +G    E I+         +    + GC
Sbjct: 141 CKPRIPDFTLPTTCDQNRLCHYSYFYADGTYAEGSLVREKITFSSSQSTPPL----ILGC 196

Query: 97  SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLK 156
           +  +   DE        G+LG++    SF SQ  + I K FSYC+    P  +  +    
Sbjct: 197 AEAST--DEK-------GILGMNLGRRSFASQ--AKISK-FSYCV----PTRQARAGLSS 240

Query: 157 FGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFDI 204
            G+      P++   ++IN          PN     Y + ++ I + N R+N     F  
Sbjct: 241 TGSFYLGNNPNSGRFQYINLLTFTPSQRSPNLDPLAYTIPMQGIRMGNARLNISATLFRP 300

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPIQLCY- 258
             SG G  IIDSGS  TY   + Y K+ E+ V        + +    +SD      +C+ 
Sbjct: 301 DPSGAGQTIIDSGSEFTYLVDEAYNKVREEVVRLVGPKLKKGYVYGGVSD------MCFD 354

Query: 259 FLPETFNRF-PSMAFYFEDA-NLRIDGENVFIIDYENHFFLLAVAPHDDLVA---LIGSQ 313
             P    R   +M F FE    + ID   V + D       + +   + L A   +IG+ 
Sbjct: 355 GNPMEIGRLIGNMVFEFEKGVEIVIDKWRV-LADVGGGVHCIGIGRSEMLGAASNIIGNF 413

Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
            Q++    YDL    +   K +CS
Sbjct: 414 HQQNLWVEYDLANRRIGLGKADCS 437


>gi|30688682|ref|NP_197676.2| aspartyl protease family protein [Arabidopsis thaliana]
 gi|110736370|dbj|BAF00154.1| protease-like protein [Arabidopsis thaliana]
 gi|332005704|gb|AED93087.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 493

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 89/376 (23%), Positives = 155/376 (41%), Gaps = 59/376 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           +L +GTP +   + +DTGS +++                     FDP  S +   I+C  
Sbjct: 84  KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143

Query: 44  PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
             C++            N  C YT +Y D S T GF   + +    ++G           
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL      GE 
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +    ++    P+   T  + + P+  Y ++L  IS++ + +   P  F  T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
            G IID+G+ L Y     Y    E   +   +     +S   +    CY +  +  + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVITTSVGDIFP 367

Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
            ++  F   A++ ++ ++  I   +N+    AV         +  + ++G    +D  FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441


>gi|125579874|gb|EAZ21020.1| hypothetical protein OsJ_36669 [Oryza sativa Japonica Group]
          Length = 382

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 66/239 (27%), Positives = 103/239 (43%), Gaps = 12/239 (5%)

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           AR  A +G++GL R  +S +SQ G+    +FSYCL     N   T       +       
Sbjct: 146 ARSMAPSGLMGLGRGRLSLVSQTGA---TKFSYCLTPYFHNNGATGHLFVGASASLGGHG 202

Query: 167 STQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSG----EGGCIIDSGSVL 220
               T+F+  P    FYYL L  +++   R+  P   FD+         GG IIDSGS  
Sbjct: 203 DVMTTQFVKGPKGSPFYYLPLIGLTVGETRLPIPATVFDLREVAPGLFSGGVIIDSGSPF 262

Query: 221 TYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED-ANL 279
           T    D Y  L  +  +      +A   D  +   LC    +     P++ F+F   A++
Sbjct: 263 TSLVHDAYDALASELAARLNGSLVAPPPDADDG-ALCVARRDVGRVVPAVVFHFRGGADM 321

Query: 280 RIDGENVFI-IDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
            +  E+ +  +D       +A A      ++IG+ QQ++ R +YDL     SF   +CS
Sbjct: 322 AVPAESYWAPVDKAAACMAIASAGPYRRQSVIGNYQQQNMRVLYDLANGDFSFQPADCS 380


>gi|449485448|ref|XP_004157171.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 74.3 bits (181), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 102/392 (26%), Positives = 161/392 (41%), Gaps = 86/392 (21%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKIN------- 40
           +V L IGTP +   L+LDTGS L +              +  P+ +S    ++       
Sbjct: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKIKKRLPPLPKPKTTSFDPSLSSSFSLLP 126

Query: 41  CDHPDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
           C+HP C            C  N  C Y+  YAD ++ +G    E  +         +   
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPV--- 183

Query: 92  ALFGC---SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--- 145
            + GC   S +N G            +LG++R  +SFISQ  + I K FSYC+       
Sbjct: 184 -ILGCAQASTENRG------------ILGMNRGRLSFISQ--AKISK-FSYCVPSRTGSN 227

Query: 146 PNGEY------TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
           P G +       SS  K+ T + +  P +Q++  ++     Y L +K I I  +R+N PP
Sbjct: 228 PTGLFYLGDNPNSSKFKYVTMLTF--PESQSSPNLDP--LAYTLPMKAIKIAGKRLNVPP 283

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPI 254
             F     G G  +IDSGS LTY   + Y K+ E+ V        + +  A ++D     
Sbjct: 284 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----- 338

Query: 255 QLCY---FLPETFNRFPSMAFYFEDANLRI---DGENVFIIDYENHFFLLAVAPHDDL-- 306
            +C+      E   R   ++F F D  + I    GE V + + E     + +   + L  
Sbjct: 339 -MCFDAGVTAEVGRRIGGISFEF-DNGVEIFVGRGEGV-LTEVEKGVKCVGIGRSERLGI 395

Query: 307 -VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
              +IG+  Q++    YDL    + F    CS
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 427


>gi|302824729|ref|XP_002994005.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
 gi|300138167|gb|EFJ04945.1| hypothetical protein SELMODRAFT_431957 [Selaginella moellendorffii]
          Length = 462

 Score = 74.3 bits (181), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 161/375 (42%), Gaps = 59/375 (15%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC----- 46
           +G+P +  +LI+DTGS L +               I+D  +S+S++ + C++        
Sbjct: 106 LGSPGQEAILIVDTGSELTWLQCLPCKVCAPSVDTIYDAARSASYRPVTCNNSQLCSNSS 165

Query: 47  --TYFKCV-NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI-FHGALFGCSND 99
             TY  C    QC +   Y D S + G  + +T+   +V+G   GK +      FGC+  
Sbjct: 166 QGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGCA-- 220

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF-- 157
             G  E    GA +G+LGL+   ++   QLG     +FS+C   P  +    S+ + F  
Sbjct: 221 -QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSHLNSTGVVFFG 276

Query: 158 GTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
             ++ + +    +    N      FY+++LK +SI++  + F P    +        I+D
Sbjct: 277 NAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVFLPRGSVV--------ILD 328

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-----ETFNRFPSM 270
           SGS  + F    + +L E F+ +          D    +  C+ +      E     PS+
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388

Query: 271 AFYFEDA-NLRIDGENVF--IIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDL 324
           +  FED   + I    V   +  ++NH   +  A  D   + V +IG+ QQ++    YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARFQNH-VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447

Query: 325 NIDLLSFVKENCSDD 339
               + F + +C  D
Sbjct: 448 QRSRVGFARASCVID 462


>gi|10177232|dbj|BAB10606.1| protease-like protein [Arabidopsis thaliana]
          Length = 539

 Score = 73.9 bits (180), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 89/376 (23%), Positives = 155/376 (41%), Gaps = 59/376 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           +L +GTP +   + +DTGS +++                     FDP  S +   I+C  
Sbjct: 84  KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143

Query: 44  PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
             C++            N  C YT +Y D S T GF   + +    ++G           
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL      GE 
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +    ++    P+   T  + + P+  Y ++L  IS++ + +   P  F  T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
            G IID+G+ L Y     Y    E   +   +     +S   +    CY +  +  + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVITTSVGDIFP 367

Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
            ++  F   A++ ++ ++  I   +N+    AV         +  + ++G    +D  FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441


>gi|125575542|gb|EAZ16826.1| hypothetical protein OsJ_32298 [Oryza sativa Japonica Group]
          Length = 396

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 92/369 (24%), Positives = 141/369 (38%), Gaps = 61/369 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP +    I+D    L++               +F P  SS+F+   C    C     
Sbjct: 49  IGTPPQPASAIIDVAGELVWTQCSRCSRCFKQDLPLFIPNASSTFRPEPCGTDACKSTPT 108

Query: 50  -KCVNEQCVY---TMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
             C  + C Y   T    D+  T G    ET ++   G   A      FGC   +   D 
Sbjct: 109 SNCSGDVCTYESTTNIRLDRHTTLGIVGTETFAI---GTATASLA---FGCVVAS---DI 159

Query: 106 DARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRR 165
           D  DG  +G +GL R   S ++Q+      +FSYCL    P G   SS L  G+      
Sbjct: 160 DTMDGT-SGFIGLGRTPRSLVAQMK---LTKFSYCLS---PRGTGKSSRLFLGSSAKLAG 212

Query: 166 -PSTQATKFI-----NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
             ST    FI     +  +++Y LSL  I   N  +         T    G  ++ + S 
Sbjct: 213 GESTSTAPFIKTSPDDDSHHYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSP 264

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDA 277
            +      Y    +             ++  P+P  LC+     F+R   P + F F+ A
Sbjct: 265 FSLLVDSAYRAFKKAVTEAVGGAAEQPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGA 324

Query: 278 NLRIDGENVFIIDYENH-----FFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDL 328
                    ++ID           +L++A  +    + V+++GS QQ D  F+YDL  + 
Sbjct: 325 AALTVPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKET 384

Query: 329 LSFVKENCS 337
           LSF   +CS
Sbjct: 385 LSFEPADCS 393


>gi|56784779|dbj|BAD82000.1| putative aspartic proteinase nepenthesin II [Oryza sativa Japonica
           Group]
          Length = 486

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 92/359 (25%), Positives = 142/359 (39%), Gaps = 56/359 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTY--------------FKC 51
           +GTP + V  +LD  S  ++      + S+      D P  T                +C
Sbjct: 103 VGTPPQVVTGVLDITSDFVW-----MQCSACATCGADAPAATSAPPFYAFLSSTIREVRC 157

Query: 52  VNEQC---VYTMKYADQS------VTKGFAAHETISVIGKGE---GKAIFHGALFGCSND 99
            N  C   V     AD S      V  G AA+ T  ++             G +FGC+  
Sbjct: 158 ANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAV- 216

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFG 158
                  A +G + GV+GL R  +S +SQL      RFSY L    P+      S++ F 
Sbjct: 217 -------ATEGDIGGVIGLGRGELSPVSQLQ---IGRFSYYLA---PDDAVDVGSFILFL 263

Query: 159 TDMGYRRPSTQATKFI--NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            D   R     +T  +      + YY+ L  I +D E +  P  TFD+   G GG ++  
Sbjct: 264 DDAKPRTSRAVSTPLVASRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
              +T+  +  Y  + +   S  E  + A  S+    + LCY        + PSMA  F 
Sbjct: 324 TIPVTFLDAGAYKVVRQAMASKIE-LRAADGSEL--GLDLCYTSESLATAKVPSMALVFA 380

Query: 276 -DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
             A + ++  N F +D       L +  +P  D  +L+GS  Q  T  +YD++   L F
Sbjct: 381 GGAVMELEMGNYFYMDSTTGLECLTILPSPAGD-GSLLGSLIQVGTHMIYDISGSRLVF 438


>gi|357491933|ref|XP_003616254.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517589|gb|AES99212.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 442

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 74/274 (27%), Positives = 114/274 (41%), Gaps = 61/274 (22%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQK-----------------INCDH 43
           +V L IGTP +   ++LDTGS L +     +K+   ++                 + C+H
Sbjct: 83  VVTLPIGTPPQLQQMVLDTGSQLSWIQCHNKKTPQKKQPPTTSSFDPSLSSSFFVLPCNH 142

Query: 44  PDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALF 94
           P C            C  N  C Y+  YAD +  +G    E I+         I    + 
Sbjct: 143 PLCKPRVPDFSLPTDCDANSLCHYSYFYADGTYAEGNLVREKIAFSPSQTTPPI----IL 198

Query: 95  GCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
           GC+  +    +DAR     G+LG++   + F SQ   I K  FSYC  +P    +  S  
Sbjct: 199 GCATQS----DDAR-----GILGMNLGRLGFPSQ-AKITK--FSYC--VPTKQAQPASGS 244

Query: 155 LKFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTF 202
              G +     P++ + +++N          PN     Y L L+ ISI  +++N PP  F
Sbjct: 245 FYLGNN-----PASSSFRYVNLLTFGQSQRMPNLDPLAYTLPLQGISIGGKKLNIPPSVF 299

Query: 203 DITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
                G G  +IDSGS  TY   + Y  + E+ V
Sbjct: 300 KPNAGGSGQTMIDSGSEFTYLVDEAYNVIREELV 333


>gi|449495082|ref|XP_004159729.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase-like protein
           1-like [Cucumis sativus]
          Length = 524

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 141/370 (38%), Gaps = 69/370 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
           +GTP    L+ LDTGS L +                      I+ P  SS+ +++ C   
Sbjct: 113 VGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSS 172

Query: 45  DCTYF-KCVN--EQCVYTMKY-ADQSVTKGFAAHETISVIGKG-EGKAIFHGALFGCSND 99
            C++  +C +  + C Y + Y +D + + G+   + + +     + K +      GC  D
Sbjct: 173 LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARITLGCGKD 232

Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
             G F   A    L G LG+  V++  I     +I   FS C       G      ++FG
Sbjct: 233 QSGAFLSSAAPNGLFG-LGIENVSVPSILANAGLISNSFSLCF------GPARMGRIEFG 285

Query: 159 TDMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            D G   P    T F     HP   Y +S+  I +     +      D+ V      I D
Sbjct: 286 -DKG--SPGQNETPFNLGRRHPT--YNVSITQIGVGGHISDL-----DVAV------IFD 329

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+  TY +   Y    +KF S  E  Q    SD   P + CY L           F + 
Sbjct: 330 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDI--PFENCYELSPN-----QTTFTYP 382

Query: 276 DANLRIDGENVFIIDY--------ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
             NL + G   F+I++            F LA+A   D + +IG         V+D    
Sbjct: 383 LMNLTMKGGGHFVINHPIVLISTESKRLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKM 441

Query: 328 LLSFVKENCS 337
           +L + + NC+
Sbjct: 442 VLGWKESNCT 451


>gi|298204765|emb|CBI25263.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 73/291 (25%), Positives = 110/291 (37%), Gaps = 75/291 (25%)

Query: 56  CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           C Y + Y D S T+G   HE +       G  +    +FGC  +N G       G ++G+
Sbjct: 133 CNYAINYGDGSFTRGELGHEKLKF-----GTILVKDFIFGCGRNNKGLF-----GGVSGL 182

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFIN 175
           +GL R  +S ISQ                    E    Y                     
Sbjct: 183 MGLGRSDLSLISQ------------------TSENPQLY--------------------- 203

Query: 176 HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF 235
              NFY+++L  ISI    +  P         G    ++DSG+V+T     +Y  L  +F
Sbjct: 204 ---NFYFINLTGISIGGVALQAPS-------VGPSRILVDSGTVITRLPPTIYKALKAEF 253

Query: 236 VSYFERFQLAQLSDCPEP----IQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFII 289
           +  F  F        P P    +  C+ L        P++  +FE +A L +D   VF  
Sbjct: 254 LKQFTGFP-------PAPAFSILDTCFNLSAYQEVDIPTIKMHFEGNAELTVDVTGVFYF 306

Query: 290 ---DYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
              D       LA   + D VA++G+ QQ++ R +YD     + F  E CS
Sbjct: 307 VKSDASQVCLALASLEYQDEVAILGNYQQKNLRVIYDTKETKVGFALETCS 357


>gi|255587337|ref|XP_002534234.1| pepsin A, putative [Ricinus communis]
 gi|223525662|gb|EEF28148.1| pepsin A, putative [Ricinus communis]
          Length = 468

 Score = 73.9 bits (180), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 103/404 (25%), Positives = 154/404 (38%), Gaps = 89/404 (22%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           + L +GTPS+ V LI+DTGS+L++                        F PR SSS + I
Sbjct: 86  MSLSLGTPSQTVKLIMDTGSSLVWFPCTSRYVCASCNFPNTDITKIPKFMPRLSSSSKLI 145

Query: 40  NCDHPDCTYF-------KCVN-----EQCV-----YTMKYADQSVTKGFAAHETISVIGK 82
            C +P C +        KC N     + C      Y ++Y   S T G    ETI+   K
Sbjct: 146 GCKNPKCAWVFGSSVQSKCHNCNPQAQNCTQACPPYIIQYGLGS-TAGLLLSETINFPNK 204

Query: 83  GEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLV 142
                     L GCS  +    E        G+ G  R   S   QLG    K+FSYCLV
Sbjct: 205 -----TISDFLAGCSLLSTRQPE--------GIAGFGRSQESLPLQLG---LKKFSYCLV 248

Query: 143 IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------------NNFYYLSLKDIS 189
               +    SS L    DMG     ++ T     P               +YY+ L+ I 
Sbjct: 249 SRRFDDSPVSSDLIL--DMGPSTSDSKTTGLSYTPFQKNLASQSNPAFQEYYYVMLRKII 306

Query: 190 IDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSD 249
           +    +  P         G GG I+DSGS  T+    V+  L ++F      + +A    
Sbjct: 307 VGKTHVKVPYSFLVPGSDGNGGTIVDSGSTFTFVEGHVFELLAKEFEKQMANYTVATNVQ 366

Query: 250 CPEPIQLCYFLP-ETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDDLV 307
               ++ C+ +  E     P + F F+  A +++   N F         L  V+  D+  
Sbjct: 367 KLTGLRPCFDISGEKSVVIPDLTFQFKGGAKMQLPLSNYFAFVDMGVVCLTIVS--DNAA 424

Query: 308 AL--------------IGSQQQRDTRFVYDLNIDLLSFVKENCS 337
           AL              +G+ QQ++    YDL  D   F +++C+
Sbjct: 425 ALGGDGGVRSSGPAIILGNFQQQNFYIEYDLENDRFGFKEQSCA 468


>gi|357160409|ref|XP_003578755.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 373

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 78/370 (21%), Positives = 150/370 (40%), Gaps = 64/370 (17%)

Query: 6   IGTPSKGVLLILDTGSALIYA------------------IFDPRKSSSFQKINCDHPDCT 47
           +GTP+   L+ +DTGS + +                    F+   SS+++++ C    C 
Sbjct: 29  LGTPAVFNLVTIDTGSTISWVQCQYCIVHCYTQDQRAGPTFNTSSSSTYRRVGCSAQVCH 88

Query: 48  YFK--------CVNEQ--CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
                      CV E+  C+Y+++YA    + G+ + + +++      +      +FGC 
Sbjct: 89  DMHVSQNIPSGCVEEEDSCIYSLRYASGEYSAGYLSQDRLTLANSYSIQKF----IFGCG 144

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIK-KRFSYCLVIPLPNGEYTSSYLK 156
           +DN        +G  AG++G    + SF +Q+  +     FSYC     P+ +    +L 
Sbjct: 145 SDNR------YNGHSAGIIGFGNKSYSFFNQIAQLTNYSAFSYC----FPSNQENEGFLS 194

Query: 157 FGTDMGYRRPSTQ--ATKFINHPNNF--YYLSLKDISIDNERMNFPPDTFDITVSGEGGC 212
            G    Y R S +   T+  ++  +   Y L   D+ ++  R+   P  +   ++     
Sbjct: 195 IGP---YVRDSNKLILTQLFDYGAHLPVYALQQFDMMVNGMRLQVDPPVYTTRMT----- 246

Query: 213 IIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFL---PETFNRFPS 269
           ++DSG+V T+  S V+  L              + SD  E   +C+        +++ P 
Sbjct: 247 VVDSGTVETFVLSPVFRALDRALTKAMVAEGYVRGSDSKE---ICFHSNGDSVDWSKLPV 303

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDLNI 326
           +   F  + L++  ENVF  +  +        P D     V ++G++  R  R V+D+  
Sbjct: 304 VEIKFSRSILKLPAENVFYYETSDGSICSTFQPDDAGVPGVQILGNRATRSFRVVFDIQQ 363

Query: 327 DLLSFVKENC 336
               F    C
Sbjct: 364 RNFGFEAGAC 373


>gi|356543524|ref|XP_003540210.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 493

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 93/381 (24%), Positives = 145/381 (38%), Gaps = 69/381 (18%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +GTP     + +DTGS +++                     FDP  SS+   I C  
Sbjct: 81  KVQLGTPPVEFNVQIDTGSDVLWVSCNSCNGCPQTSGLQIQLNFFDPGSSSTSSMIACSD 140

Query: 44  PDCTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGAL-- 93
             C   K          N QC YT +Y D S T G+   + + +        IF G++  
Sbjct: 141 QRCNNGKQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHL------NTIFEGSMTT 194

Query: 94  -------FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIP 144
                  FGCSN   G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL   
Sbjct: 195 NSTAPVVFGCSNQQTG-DLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRIFSHCLKGD 253

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFD 203
              G      L  G  +    P+   T  +   P+  Y L+L+ IS++ + +      F 
Sbjct: 254 SSGG----GILVLGEIV---EPNIVYTSLVPAQPH--YNLNLQSISVNGQTLQIDSSVF- 303

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET 263
              S   G I+DSG+ L Y   + Y    + FVS         +         CY +  +
Sbjct: 304 -ATSNSRGTIVDSGTTLAYLAEEAY----DPFVSAITAAIPQSVRTVVSRGNQCYLITSS 358

Query: 264 F-NRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQR 316
             + FP ++  F      I     ++I  +N     AV            + ++G    +
Sbjct: 359 VTDVFPQVSLNFAGGASMILRPQDYLIQ-QNSIGGAAVWCIGFQKIQGQGITILGDLVLK 417

Query: 317 DTRFVYDLNIDLLSFVKENCS 337
           D   VYDL    + +   +CS
Sbjct: 418 DKIVVYDLAGQRIGWANYDCS 438


>gi|414887402|tpg|DAA63416.1| TPA: hypothetical protein ZEAMMB73_414910 [Zea mays]
          Length = 407

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 68/255 (26%), Positives = 109/255 (42%), Gaps = 39/255 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI--------------FDPRKSSSFQKINCDHPDCT 47
            RL+IGTP +   LI+D+GS + Y                F P  SSS+  + C+  DCT
Sbjct: 91  TRLYIGTPPQEFALIVDSGSTVTYVPCASCEQCGNHQDPRFQPDLSSSYSPVKCNV-DCT 149

Query: 48  YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG--FDE 105
                 +QC Y  +YA+ S + G    + +S   + E KA    A+FGC N   G  F +
Sbjct: 150 -CDSDKKQCTYERQYAEMSSSSGVLGEDIVSFGRESELKA--QRAVFGCENSETGDLFSQ 206

Query: 106 DARDGALAGVLGLSRVTISFISQL--GSIIKKRFSYCL-VIPLPNGEYTSSYLKFGTDMG 162
            A      G++GL R  +S + QL    +I   FS C   + +  G      +   +DM 
Sbjct: 207 HA-----DGIMGLGRGQLSIMDQLVEKGVINDSFSLCYGGMDIGGGAMVLGGVPTPSDMV 261

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
           + R     +  +  P  +Y + LK+I +  + +      FD     + G ++DSG+   Y
Sbjct: 262 FSR-----SDPLRSP--YYNIELKEIHVAGKALRVDSRIFD----SKHGTVLDSGTTYAY 310

Query: 223 FHSDVYWKLHEKFVS 237
                +    +   S
Sbjct: 311 LPEQAFMAFKDAVTS 325


>gi|297843774|ref|XP_002889768.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335610|gb|EFH66027.1| hypothetical protein ARALYDRAFT_471076 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 449

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 82/365 (22%), Positives = 148/365 (40%), Gaps = 51/365 (13%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKINCDHPDCT 47
           +VR  +GTP + + ++LDT +  ++               F+   SS++  ++C    CT
Sbjct: 106 VVRARLGTPPQLMFMVLDTSNDAVWLPCSGCSGCSNASTSFNTNSSSTYSTVSCSTTQCT 165

Query: 48  YFK---CVNEQ-----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSND 99
             +   C +       C +   Y   S        +T+++        +     FGC N 
Sbjct: 166 QARGLTCPSSTPQPSICSFNQSYGGDSSFSANLVQDTLTL-----SPDVIPNFSFGCINS 220

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
             G     +     G++GL R  +S +SQ  S+    FSYCL  P     Y S  LK G 
Sbjct: 221 ASGNSLPPQ-----GLMGLGRGPMSLVSQTTSLYSGVFSYCL--PSFRSFYFSGSLKLGL 273

Query: 160 DMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
            +G  + S + T  + +P   + YY++L  +S+ + ++   P       +   G IIDSG
Sbjct: 274 -LGQPK-SIRYTPLLRNPRRPSLYYVNLTGVSVGSVQVPVDPVYLTFDSNSGAGTIIDSG 331

Query: 218 SVLTYFHSDVYWKLHEKFVSYFER--FQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           +V+T F   VY  + ++F          L     C        F  +  N  P +  +  
Sbjct: 332 TVITRFAQPVYEAIRDEFRKQVNGSFSTLGAFDTC--------FSADNENVTPKITLHMT 383

Query: 276 DANLRIDGENVFIIDYENHFFLLAVAP----HDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
             +L++  EN  I         L++A      + ++ +I + QQ++ R ++D+    +  
Sbjct: 384 SLDLKLPMENTLIHSSAGTLTCLSMAGIRQNANAVLNVIANLQQQNLRILFDVPNSRIGI 443

Query: 332 VKENC 336
             E C
Sbjct: 444 APEPC 448


>gi|255549236|ref|XP_002515672.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223545215|gb|EEF46724.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 492

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 87/376 (23%), Positives = 153/376 (40%), Gaps = 60/376 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ IGTPSK   + +DTGS +++                    +++ + S S + + CD 
Sbjct: 89  KVGIGTPSKDYYVQVDTGSDIMWVNCIQCRECPRTSSLGMELTLYNIKDSVSGKLVPCDE 148

Query: 44  PDCTYFK-------CVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGAL 93
             C             N  C Y   Y D S T G+   + +    V G  +  +     +
Sbjct: 149 EFCYEVNGGPLSGCTANMSCPYLEIYGDGSSTAGYFVKDVVQYDRVSGDLQTTSSNGSVI 208

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYT 151
           FGC     G      + AL G+LG  +   S ISQL +   +KK F++CL     +G   
Sbjct: 209 FGCGARQSGDLGPTSEEALDGILGFGKSNSSMISQLAATRKVKKIFAHCL-----DGING 263

Query: 152 SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
                 G  +   +P    T  I N P+  Y +++  + +  + ++ P + F+       
Sbjct: 264 GGIFAIGHVV---QPKVNMTPLIPNQPH--YNVNMTAVQVGEDFLHLPTEEFE--AGDRK 316

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSM 270
           G IIDSG+ L Y    VY  L  K +S     ++  + D     Q   +     + FP++
Sbjct: 317 GAIIDSGTTLAYLPEIVYEPLVSKIISQQPDLKVHIVRDEYTCFQ---YSGSVDDGFPNV 373

Query: 271 AFYFEDAN-LRIDG-------ENVFIIDYENHFFLLAVAPHDDL-VALIGSQQQRDTRFV 321
            F+FE++  L++         E ++ I ++N      +   D   + L+G     +   +
Sbjct: 374 TFHFENSVFLKVHPHEYLFPFEGLWCIGWQNS----GMQSRDRRNMTLLGDLVLSNKLVL 429

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + + + NCS
Sbjct: 430 YDLENQAIGWTEYNCS 445


>gi|115484503|ref|NP_001065913.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|62954888|gb|AAY23257.1| nucellin-like aspartic protease [Oryza sativa Japonica Group]
 gi|77549017|gb|ABA91814.1| Eukaryotic aspartyl protease family protein, expressed [Oryza
           sativa Japonica Group]
 gi|113644617|dbj|BAF27758.1| Os11g0183900 [Oryza sativa Japonica Group]
 gi|222615638|gb|EEE51770.1| hypothetical protein OsJ_33210 [Oryza sativa Japonica Group]
          Length = 418

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 143/372 (38%), Gaps = 59/372 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + IG P+K   L +DTGS L +                ++ P K+   + + C +  C
Sbjct: 59  VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPCANSIC 115

Query: 47  TYF--------KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           T          KC   +QC Y +KY D++ + G    ++ S+  + +   +     FGC 
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVMDSFSLPLRNKSN-VRPSLSFGCG 174

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
            D       A      G+LGL R ++S +SQL    I K    +CL            +L
Sbjct: 175 YDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL------STSGGGFL 228

Query: 156 KFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            FG DM      T  +   +   N+Y      +  D   ++  P             + D
Sbjct: 229 FFGDDMVPTSRVTWVSMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----------VFD 278

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-------FP 268
           SGS  TYF +  Y            +  L Q+SD   P  LC+   + F         F 
Sbjct: 279 SGSTYTYFSAQPYQATISAIKGSLSK-SLKQVSDPSLP--LCWKGQKAFKSVSDVKKDFK 335

Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDLN 325
           S+ F F ++A + I  EN  II    +  L  L  +      ++IG    +D   +YD  
Sbjct: 336 SLQFIFGKNAVMDIPPENYLIITKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNE 395

Query: 326 IDLLSFVKENCS 337
              L +++ +CS
Sbjct: 396 KAQLGWIRGSCS 407


>gi|242070719|ref|XP_002450636.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
 gi|241936479|gb|EES09624.1| hypothetical protein SORBIDRAFT_05g008470 [Sorghum bicolor]
          Length = 410

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 88/355 (24%), Positives = 141/355 (39%), Gaps = 59/355 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP + +  + DTGS LI+A               + P KSSSF K+ C    C+    
Sbjct: 88  IGTPPQELSALADTGSDLIWAKCGACTRCVPQGSPSYYPNKSSSFSKLPCSGSLCSDLPS 147

Query: 50  -KCV--NEQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
            +C     +C Y   Y   S     T+G+   ET ++     G     G  FGC+  + G
Sbjct: 148 SQCSAGGAECDYKYSYGLASDPHHYTQGYLGSETFTL-----GSDAVPGIGFGCTTMSEG 202

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
                            R  +S +SQL       FSYCL     +    +S L FG+   
Sbjct: 203 GYGSGSGLVGL-----GRGPLSLVSQLN---VGAFSYCLT----SDAAKTSPLLFGSG-A 249

Query: 163 YRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                 Q+T  +     +Y ++L+ ISI                +G  G I DSG+ + +
Sbjct: 250 LTGAGVQSTPLLRTSTYYYTVNLESISIGAAT---------TAGTGSSGIIFDSGTTVAF 300

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
                Y    E  +S      +A   D  E   +C+    +   FPSM  +F+  ++ + 
Sbjct: 301 LAEPAYTLAKEAVLSQTTNLTMASGRDGYE---VCF--QTSGAVFPSMVLHFDGGDMDLP 355

Query: 283 GENVF-IIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            EN F  +D     +++  +P    ++++G+  Q +    YD+   +LSF   NC
Sbjct: 356 TENYFGAVDDSVSCWIVQKSPS---LSIVGNIMQMNYHIRYDVEKSMLSFQPANC 407


>gi|218189440|gb|EEC71867.1| hypothetical protein OsI_04576 [Oryza sativa Indica Group]
          Length = 508

 Score = 73.6 bits (179), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 90/359 (25%), Positives = 140/359 (38%), Gaps = 56/359 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTY--------------FKC 51
           +GTP + V  +LD  S  ++      + S+      D P  T                +C
Sbjct: 103 VGTPPQVVTGVLDITSDFVW-----MQCSACATCGADAPAATSAPPFYAFLSSTIREVRC 157

Query: 52  VNEQC---VYTMKYADQS------VTKGFAAHETISVIGKGE---GKAIFHGALFGCSND 99
            N  C   V     AD S      V  G AA+ T  ++             G +FGC+  
Sbjct: 158 ANRGCQRLVPQTCSADDSPCGYSYVYGGGAANTTAGLLAVDAFAFATVRADGVIFGCAV- 216

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-TSSYLKFG 158
                  A +G + GV+GL R  +S +SQL      RFSY L    P+      S++ F 
Sbjct: 217 -------ATEGDIGGVIGLGRGELSLVSQL---QIGRFSYYLA---PDDAVDVGSFILFL 263

Query: 159 TDMGYRRPSTQATKFINH--PNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            D   R     +T  + +    + YY+ L  I +D E +  P  TFD+   G GG ++  
Sbjct: 264 DDAKPRTSRAVSTPLVANRASRSLYYVELAGIRVDGEDLAIPRGTFDLQADGSGGVVLSI 323

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE-TFNRFPSMAFYFE 275
              +T+  +  Y  + +   S   +  L         + LCY        + PSMA  F 
Sbjct: 324 TIPVTFLDAGAYKVVRQAMAS---KIGLRAADGSELGLDLCYTSESLATAKVPSMALVFA 380

Query: 276 -DANLRIDGENVFIIDYENHFFLLAV--APHDDLVALIGSQQQRDTRFVYDLNIDLLSF 331
             A + ++  N F +D       L +  +P  D  +L+GS  Q  T  +YD++   L F
Sbjct: 381 GGAVMELEMGNYFYMDSTTGLECLTILPSPAGD-GSLLGSLIQVGTHMIYDISGSRLVF 438


>gi|356546036|ref|XP_003541438.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 486

 Score = 73.2 bits (178), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 93/384 (24%), Positives = 155/384 (40%), Gaps = 75/384 (19%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +GTP K   + +DTGS +++                     FD   SS+   I C  
Sbjct: 81  KVKMGTPPKEFNVQIDTGSDILWVNCNTCSNCPQSSQLGIELNFFDTVGSSTAALIPCSD 140

Query: 44  PDCT------YFKC---VNEQCVYTMKYADQSVTKGFAAHETI--SVIGKGEGKAIFHGA 92
           P CT        +C   VN QC YT +Y D S T G+   + +  S+I  G+  A+   A
Sbjct: 141 PICTSRVQGAAAECSPRVN-QCSYTFQYGDGSGTSGYYVSDAMYFSLI-MGQPPAVNSSA 198

Query: 93  --LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCS    G D    D A+ G+ G     +S +SQL S  I  K FS+CL       
Sbjct: 199 TIVFGCSISQSG-DLTKTDKAVDGIFGFGPGPLSVVSQLSSRGITPKVFSHCL------- 250

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +                PS   +  + + P+  Y L+L+ I+++ + +   P  F I+ +
Sbjct: 251 KGDGDGGGVLVLGEILEPSIVYSPLVPSQPH--YNLNLQSIAVNGQLLPINPAVFSIS-N 307

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NR 266
             GG I+D G+ L Y   + Y  L     +   +      S   +    CY +  +  + 
Sbjct: 308 NRGGTIVDCGTTLAYLIQEAYDPLVTAINTAVSQSARQTNSKGNQ----CYLVSTSIGDI 363

Query: 267 FPSMAFYFEDA-------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQ 313
           FPS++  FE               N  +DG  ++ I ++            +  +++G  
Sbjct: 364 FPSVSLNFEGGASMVLKPEQYLMHNGYLDGAEMWCIGFQK---------FQEGASILGDL 414

Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
             +D   VYD+    + +   +CS
Sbjct: 415 VLKDKIVVYDIAQQRIGWANYDCS 438


>gi|449456843|ref|XP_004146158.1| PREDICTED: aspartic proteinase-like protein 1-like [Cucumis
           sativus]
          Length = 547

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/370 (24%), Positives = 141/370 (38%), Gaps = 69/370 (18%)

Query: 6   IGTPSKGVLLILDTGSALIYA---------------------IFDPRKSSSFQKINCDHP 44
           +GTP    L+ LDTGS L +                      I+ P  SS+ +++ C   
Sbjct: 136 VGTPGVPYLVALDTGSDLFWLPCDCVNCITGLNTTQGPVNFNIYSPNNSSTSKEVQCSSS 195

Query: 45  DCTYF-KCVN--EQCVYTMKY-ADQSVTKGFAAHETISVIGKG-EGKAIFHGALFGCSND 99
            C++  +C +  + C Y + Y +D + + G+   + + +     + K +      GC  D
Sbjct: 196 LCSHLDQCSSPSDTCPYQVSYLSDNTSSTGYLVEDILHLTTNDVQSKPVNARITLGCGKD 255

Query: 100 NHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
             G F   A    L G LG+  V++  I     +I   FS C       G      ++FG
Sbjct: 256 QSGAFLSSAAPNGLFG-LGIENVSVPSILANAGLISNSFSLCF------GPARMGRIEFG 308

Query: 159 TDMGYRRPSTQATKF---INHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            D G   P    T F     HP   Y +S+  I +     +      D+ V      I D
Sbjct: 309 -DKG--SPGQNETPFNLGRRHPT--YNVSITQIGVGGHISDL-----DVAV------IFD 352

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SG+  TY +   Y    +KF S  E  Q    SD   P + CY L           F + 
Sbjct: 353 SGTSFTYLNDPAYSLFADKFASMVEEKQFTMNSDI--PFENCYELSPN-----QTTFTYP 405

Query: 276 DANLRIDGENVFIIDY--------ENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNID 327
             NL + G   F+I++            F LA+A   D + +IG         V+D    
Sbjct: 406 LMNLTMKGGGHFVINHPIVLISTESKRLFCLAIA-RSDSINIIGQNFMTGYHIVFDREKM 464

Query: 328 LLSFVKENCS 337
           +L + + NC+
Sbjct: 465 VLGWKESNCT 474


>gi|224068901|ref|XP_002326227.1| predicted protein [Populus trichocarpa]
 gi|222833420|gb|EEE71897.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/379 (22%), Positives = 154/379 (40%), Gaps = 67/379 (17%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G+P +   + +DTGS +++                     FD   SS+  ++ C  
Sbjct: 69  KVKLGSPPREFNVQIDTGSDVLWVCCNSCNNCPRTSGLGIQLNFFDSSSSSTAGQVRCSD 128

Query: 44  PDCT------YFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
           P CT        +C ++  QC YT +Y D S T G+   +T+   +++G+          
Sbjct: 129 PICTSAVQTTATQCSSQTDQCSYTFQYGDGSGTSGYYVSDTLYFDAILGQSLIDNSSALI 188

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL +  I  + FS+CL      G  
Sbjct: 189 VFGCSAYQSG-DLTKTDKAVDGIFGFGQGELSVISQLSTRGITPRVFSHCLKGDGSGG-- 245

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               L  G  +    P    +  + + P+  Y L+L  I+++ + +   P  F    S  
Sbjct: 246 --GILVLGEIL---EPGIVYSPLVPSQPH--YNLNLLSIAVNGQLLPIDPAAF--ATSNS 296

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
            G I+DSG+ L Y  ++ Y    + FVS         ++        CY +  + ++ FP
Sbjct: 297 QGTIVDSGTTLAYLVAEAY----DPFVSAVNAIVSPSVTPITSKGNQCYLVSTSVSQMFP 352

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL----------VALIGSQQQRDT 318
             +F F        G    ++  E++      +    +          V ++G    +D 
Sbjct: 353 LASFNFA-------GGASMVLKPEDYLIPFGSSGGSAMWCIGFQKVQGVTILGDLVLKDK 405

Query: 319 RFVYDLNIDLLSFVKENCS 337
            FVYDL    + +   +CS
Sbjct: 406 IFVYDLVRQRIGWANYDCS 424


>gi|84453222|dbj|BAE71208.1| hypothetical protein [Trifolium pratense]
 gi|84453226|dbj|BAE71210.1| hypothetical protein [Trifolium pratense]
          Length = 437

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 85/359 (23%), Positives = 151/359 (42%), Gaps = 44/359 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ IGTP + + ++LDT +   +             F P  S+SF  ++C  P C   
Sbjct: 99  VVRVKIGTPGQLLFMVLDTSTDEAFVPSSGCIGCSATTFYPNVSTSFVPLDCSVPQCGQV 158

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   YA  + +      +++ +        +     FG  N   G 
Sbjct: 159 RGLSCPATGSGACSFNQSYAGSTFSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A+           R  +S +SQ G+I    FSYCL  P     Y S  LK G  +G 
Sbjct: 213 SVPAQGLLGL-----GRGPLSLLSQSGAIYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  +++P+  + YY++L  IS+    +  P +      S   G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLHNPHRPSLYYVNLTAISVGRVYVPLPSELLAFNPSTGAGTIIDSGTVIT 323

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   +Y  + ++F     R Q+            C F+       P++  +F D +L++
Sbjct: 324 RFVEPIYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377

Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         LA+A      + ++ +I + QQ++ R ++D   + +   +E C
Sbjct: 378 PLENSLIHSSSGSLACLAMAAAPSNVNSVLNVIANFQQQNLRVLFDTVNNKVGIARELC 436


>gi|297720449|ref|NP_001172586.1| Os01g0776900 [Oryza sativa Japonica Group]
 gi|255673740|dbj|BAH91316.1| Os01g0776900 [Oryza sativa Japonica Group]
          Length = 381

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 71/261 (27%), Positives = 111/261 (42%), Gaps = 46/261 (17%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAI-------------------FDPRKSSSFQKINCD 42
            R+ +G+P K   + +DTGS +++                     F+P  SS+  KI C 
Sbjct: 93  TRVKLGSPPKEYFVQIDTGSDILWVACSPCTGCPSSSGLNIQLEFFNPDTSSTSSKIPCS 152

Query: 43  HPDCTYFKCVNEQ---------CVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFH 90
              CT     +E          C YT  Y D S T G+   +T+   +V+G  +      
Sbjct: 153 DDRCTAALQTSEAVCQTSDNSPCGYTFTYGDGSGTSGYYVSDTMYFDTVMGNEQTANSSA 212

Query: 91  GALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNG 148
             +FGCSN   G D    D A+ G+ G  +  +S +SQL S  +  K FS+CL       
Sbjct: 213 SIVFGCSNSQSG-DLTKTDRAVDGIFGFGQHQLSVVSQLNSLGVSPKVFSHCL----KGS 267

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     L  G  +    P    T  + + P+  Y L+L+ I ++ +++  P D+   T S
Sbjct: 268 DNGGGILVLGEIV---EPGLVYTPLVPSQPH--YNLNLESIVVNGQKL--PIDSSLFTTS 320

Query: 208 GEGGCIIDSGSVLTYFHSDVY 228
              G I+DSG+ L Y     Y
Sbjct: 321 NTQGTIVDSGTTLAYLADGAY 341


>gi|296082172|emb|CBI21177.3| unnamed protein product [Vitis vinifera]
          Length = 376

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 46/151 (30%), Positives = 72/151 (47%), Gaps = 32/151 (21%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA---------------IFDPRKSSSFQKINCDHPD 45
           +V + +GTP + +  I DTGS L +                IF+P KS+S+  I+C  P 
Sbjct: 139 VVTVGLGTPKRDLTFIFDTGSDLTWTQCEPCARYCYHQQEPIFNPSKSTSYTNISCSSPT 198

Query: 46  CTYFK--------CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           C   K        C    CVY ++Y DQS + GF A + +++        +F+  LFGC 
Sbjct: 199 CDELKSGTGNSPSCSASTCVYGIQYGDQSYSVGFFAQDKLALT----STDVFNNFLFGCG 254

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQ 128
            +N G         +AG++GL R  +S +S+
Sbjct: 255 QNNRGLFV-----GVAGLIGLGRNALSLMSK 280


>gi|413944596|gb|AFW77245.1| hypothetical protein ZEAMMB73_545774 [Zea mays]
 gi|414876929|tpg|DAA54060.1| TPA: hypothetical protein ZEAMMB73_875469 [Zea mays]
          Length = 459

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 146/366 (39%), Gaps = 70/366 (19%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
           +GTP + +  + DTGS LI+A                 + P  SS+F K+ C    C+  
Sbjct: 97  MGTPPQKLTALADTGSDLIWAKCGGACTTSCEPQGSPSYLPNASSTFAKLPCSDRLCSLL 156

Query: 50  K--------CVNEQCVYTMKYA----DQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           +            +C Y   Y     D   T+GF A ET ++     G        FGC+
Sbjct: 157 RSDSVAWCAAAGAECDYRYSYGLGDDDHHYTQGFLARETFTL-----GADAVPSVRFGCT 211

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             + G                 R  +S +SQL +     F YCL     +    +S L F
Sbjct: 212 TASEGGYGSGSGLVGL-----GRGPLSLVSQLNA---STFMYCLT----SDASKASPLLF 259

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG---GCII 214
           G+         Q+T  +     FY ++L+ ISI +            T  G G   G + 
Sbjct: 260 GSLASLTGAQVQSTGLLAS-TTFYAVNLRSISIGSA-----------TTPGVGEPEGVVF 307

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPE----TFNRFPSM 270
           DSG+ LTY     Y +    F+S   +  L Q+ D  +  + C+  P     +    P+M
Sbjct: 308 DSGTTLTYLAEPAYSEAKAAFLS---QTSLDQVEDT-DGFEACFQKPANGRLSNAAVPTM 363

Query: 271 AFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLS 330
             +F+ A++ +   N ++++ E+      V     L ++IG+  Q +   ++D++  +LS
Sbjct: 364 VLHFDGADMALPVAN-YVVEVEDGVVCWIVQRSPSL-SIIGNIMQVNYLVLHDVHRSVLS 421

Query: 331 FVKENC 336
           F   NC
Sbjct: 422 FQPANC 427


>gi|357118738|ref|XP_003561107.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Brachypodium
           distachyon]
          Length = 491

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 75/335 (22%), Positives = 125/335 (37%), Gaps = 54/335 (16%)

Query: 27  IFDPRKSSSFQKINCDHPDCTYF-------KCVNEQCVYTMKYADQSVTKGFAAHETISV 79
           ++DP KSSS     C  P C              +QC Y ++Y D S + G    + +++
Sbjct: 186 LYDPSKSSSSAAFPCSSPACRNLGPYANGCTPAGDQCQYRVQYPDGSASAGTYISDVLTL 245

Query: 80  IGKGEGKAIFHGALFGCSNDNHGFDEDAR-DGALAGVLGLSRVTISFISQLGSIIKKRFS 138
                  AI     FGCS   H   +        +G++ L R   S  +Q  +     FS
Sbjct: 246 NPAKPASAISE-FRFGCS---HALLQPGSFSNKTSGIMALGRGAQSLPTQTKATYGDVFS 301

Query: 139 YCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP-------NNFYYLSLKDISID 191
           YC    LP     S +   G       P   A+++   P          Y + L  I + 
Sbjct: 302 YC----LPPTPVHSGFFILGV------PRVAASRYAVTPMLRSKAAPMLYLVRLIAIEVA 351

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCP 251
            +R+  PP  F        G ++DS +++T      Y  L   FV+    ++ A      
Sbjct: 352 GKRLPVPPAVF------AAGAVMDSRTIVTRLPPTAYMALRAAFVAEMRAYRAAAPK--- 402

Query: 252 EPIQLCY------FLPETFNRFPSMAFYFEDAN--LRIDGENVFIIDYENHFFLLAVAPH 303
           E +  CY             + P +   F+  N  + +D   V +         LA AP+
Sbjct: 403 EHLDTCYDFSGAAPGGGGGVKLPKITLVFDGPNGAVELDPSGVLLDG------CLAFAPN 456

Query: 304 --DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             D +  +IG+ QQ+    +Y+++   + F +  C
Sbjct: 457 TDDQMTGIIGNVQQQALEVLYNVDGATVGFRRGAC 491


>gi|356503843|ref|XP_003520712.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Glycine max]
          Length = 474

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 97/396 (24%), Positives = 157/396 (39%), Gaps = 80/396 (20%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKI 39
           + L +GTP +    +LDTGS+L++                        F P+ SS+ + +
Sbjct: 94  IDLNLGTPPQTSPFVLDTGSSLVWFPCTSRYLCSHCNFPNIDTTKIPTFIPKNSSTAKLL 153

Query: 40  NCDHPDCTY-------FKCVNEQC------------VYTMKYADQSVTKGFAAHETISVI 80
            C +P C Y       F+C   QC             Y ++Y   S T GF   + ++  
Sbjct: 154 GCRNPKCGYIFGSDVQFRC--PQCKPESQNCSLTCPAYIIQYGLGS-TAGFLLLDNLNFP 210

Query: 81  GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYC 140
           GK          L GCS               +G+ G  R   S  SQ+     KRFSYC
Sbjct: 211 GK-----TVPQFLVGCS--------ILSIRQPSGIAGFGRGQESLPSQMN---LKRFSYC 254

Query: 141 LVIPLPNGEYTSS--YLKFGTDMGYRRPSTQATKFINHP--NN-----FYYLSLKDISID 191
           LV    +    SS   L+  +    +      T F ++P  NN     +YYL+L+ + + 
Sbjct: 255 LVSHRFDDTPQSSDLVLQISSTGDTKTNGLSYTPFRSNPSTNNPAFKEYYYLTLRKVIVG 314

Query: 192 NERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDC 250
            + +  P    +    G GG I+DSGS  T+    VY  + ++FV   E+ +  A+ ++ 
Sbjct: 315 GKDVKIPYTFLEPGSDGNGGTIVDSGSTFTFMERPVYNLVAQEFVKQLEKNYSRAEDAET 374

Query: 251 PEPIQLCYFLPETFN-RFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAV-------A 301
              +  C+ +       FP + F F+  A +    +N F +  +     L V        
Sbjct: 375 QSGLSPCFNISGVKTVTFPELTFKFKGGAKMTQPLQNYFSLVGDAEVVCLTVVSDGGAGP 434

Query: 302 PHDDLVALI-GSQQQRDTRFVYDLNIDLLSFVKENC 336
           P     A+I G+ QQ++    YDL  +   F   +C
Sbjct: 435 PKTTGPAIILGNYQQQNFYIEYDLENERFGFGPRSC 470


>gi|326512066|dbj|BAJ96014.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 485

 Score = 73.2 bits (178), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 91/374 (24%), Positives = 146/374 (39%), Gaps = 56/374 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            ++ IGTP+K   + +DTGS +++                    ++DP  SSS   + C 
Sbjct: 83  TQIGIGTPAKSYYVQVDTGSDILWVNCVFCDTCPRKSGLGIELTLYDPSGSSSGTGVTCG 142

Query: 43  HPDCTYF------KCVNEQ-CVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
              C          CV    C Y++ Y D S T GF   + +    V G  +        
Sbjct: 143 QDFCVATHGGVIPSCVPAAPCQYSISYGDGSSTTGFFVTDFLQYNQVSGNSQTTLANTSI 202

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
            FGC     G D  +   AL G+LG  +   S +SQL +   ++K F++CL      G +
Sbjct: 203 TFGCGAKIGG-DLGSSSQALDGILGFGQSNSSMLSQLAAAGKVRKVFAHCLDTINGGGIF 261

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +         +P    T  +   P+  Y ++L+ I +   ++  P + FDI  S  
Sbjct: 262 AIGDVV--------QPKVSTTPLVPGMPH--YNVNLEAIDVGGVKLQLPTNIFDIGES-- 309

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPS 269
            G IIDSG+ L Y    VY  +  K  + +    L    D     Q   +     + FP 
Sbjct: 310 KGTIIDSGTTLAYLPGVVYNAIMSKVFAQYGDMPLKNDQD----FQCFRYSGSVDDGFPI 365

Query: 270 MAFYFEDA-NLRIDGENVFIIDYENHFF-----LLAVAPHDDLVALIGSQQQRDTRFVYD 323
           + F+FE    L I   +    + E +        L      D+V L+G     +   +YD
Sbjct: 366 ITFHFEGGLPLNIHPHDYLFQNGELYCMGFQTGGLQTKDGKDMV-LLGDLAFSNRLVLYD 424

Query: 324 LNIDLLSFVKENCS 337
           L   ++ +   NCS
Sbjct: 425 LENQVIGWTDYNCS 438


>gi|255571588|ref|XP_002526740.1| aspartic-type endopeptidase, putative [Ricinus communis]
 gi|223533929|gb|EEF35654.1| aspartic-type endopeptidase, putative [Ricinus communis]
          Length = 471

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 94/376 (25%), Positives = 156/376 (41%), Gaps = 61/376 (16%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           +++  IG+P      I DTGS +++                 +F+P KSS++    C H 
Sbjct: 109 VMKFNIGSPPVETYAIPDTGSNIVWIQCGSPICTNCYKQKIPLFNPTKSSTYAIRLCGHR 168

Query: 45  DCT--------YFKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG--- 91
           +C         Y  C +  + C Y + Y D S ++G  + + I+     E  A F     
Sbjct: 169 ECKQALWGLGEYLGCKSSVQVCRYHISYEDHSFSEGTISTDIITF---PEHIAEFGNYSL 225

Query: 92  -ALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP---LP 146
              FGC  N++    +D       GV+GL     S + QL      +FSYC+  P    P
Sbjct: 226 RMFFGCGYNNSETPGQDPNSFTAPGVVGLGNEMASLVGQL---TLGQFSYCISTPDVQKP 282

Query: 147 NGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFY-YLSLKDISIDNERMN-FPPDTFDI 204
           NG      ++FG        S  +T   N+   +Y + ++  I +D+ ++  +P   F  
Sbjct: 283 NGTIE---IRFGLAASI---SGHSTALANNLEGWYIFQNVDGIYVDDTKVKGYPEWVFQF 336

Query: 205 TVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSD--CPEPIQLCYFLP 261
              G GG I+DSG+  T    ++Y+   +  +    E+ +LA  +         LCY   
Sbjct: 337 AEGGIGGLIMDSGTTYT----ELYFSALDALIGELKEQIELAPDTQDHSNSNYSLCYNAA 392

Query: 262 E-TFNRFPSMAFYFED---ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRD 317
                  P++   F D   A       N +I D  N  + LA+      +++IG  Q RD
Sbjct: 393 NFLLTYVPAIELKFTDNKEAYFPFTLRNAWI-DNGNDQYCLAMFGTSG-ISIIGIYQHRD 450

Query: 318 TRFVYDLNIDLLSFVK 333
            +  YDL  +L+SF +
Sbjct: 451 IKIGYDLKYNLVSFTE 466


>gi|226532674|ref|NP_001151415.1| pepsin A precursor [Zea mays]
 gi|195646632|gb|ACG42784.1| pepsin A [Zea mays]
          Length = 492

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 95/384 (24%), Positives = 152/384 (39%), Gaps = 73/384 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCD 42
            R+ IG+P KG  + +DTGS +++                     +DP  S +   + C+
Sbjct: 87  TRIEIGSPPKGYYVQVDTGSDILWVNGISCDGCPTRSGLGIELTQYDPAGSGT--TVGCE 144

Query: 43  HPDCTYFKCVN----------EQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIF 89
              C      +            C + + Y D S T GF   + +    V G G+     
Sbjct: 145 QEFCVANSAASGVPPACPSAASPCQFRITYGDGSSTTGFYVTDFVQYNQVSGNGQTTPSN 204

Query: 90  HGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPN 147
               FGC     G D  +   AL G+LG  +   S +SQL +   ++K F++CL      
Sbjct: 205 VSITFGCGA-QLGGDLGSSSQALDGILGFGQSDASMLSQLAAARKVRKIFAHCLDTVRGG 263

Query: 148 GEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITV 206
           G +    +        + P  + T  +  PN  +Y ++L+ IS+    +  P  TFD   
Sbjct: 264 GIFAIGNV-------VQPPIVKTTPLV--PNATHYNVNLQGISVGGATLQLPTSTFD--- 311

Query: 207 SGEG-GCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN 265
           SG+  G IIDSG+ L Y   +VY  L            +    D      +C+    + +
Sbjct: 312 SGDSKGTIIDSGTTLAYLPREVYRTLLTAVFDKHPDLAVRNYEDF-----ICFQFSGSLD 366

Query: 266 -RFPSMAFYFEDANLRIDGENVFIIDY----ENHFFLL-----AVAPHD--DLVALIGSQ 313
             FP + F FE  +L +   NV+  DY     N  + +      V   D  D+V L+G  
Sbjct: 367 EEFPVITFSFE-GDLTL---NVYPHDYLFQNGNDLYCMGFLDGGVQTKDGKDMV-LLGDL 421

Query: 314 QQRDTRFVYDLNIDLLSFVKENCS 337
              +   VYDL   ++ +   NCS
Sbjct: 422 VLSNKLVVYDLEKQVIGWTDYNCS 445


>gi|218185380|gb|EEC67807.1| hypothetical protein OsI_35373 [Oryza sativa Indica Group]
          Length = 418

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 59/372 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDC 46
           V + IG P+K   L +DTGS L +                ++ P K+   + + C +  C
Sbjct: 59  VTMNIGDPAKPYFLDVDTGSDLTWLQCDAPCQSCNKVPHPLYRPTKN---KLVPCANSIC 115

Query: 47  TYF--------KCV-NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           T          KC   +QC Y +KY D++ + G    ++ S+  + +   +     FGC 
Sbjct: 116 TALHSGSSPNKKCTTQQQCDYQIKYTDKASSLGVLVTDSFSLPLRNKSN-VRPSLSFGCG 174

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
            D       A      G+LGL R ++S +SQL    I K    +CL            +L
Sbjct: 175 YDQQVGKNGAAPATTDGLLGLGRGSVSLLSQLKQQGITKNVLGHCL------STSGGGFL 228

Query: 156 KFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
            FG DM      T      +   N+Y      +  D   ++  P             + D
Sbjct: 229 FFGDDMVPTSRVTWVPMVRSTSGNYYSPGSATLYFDRRSLSTKPMEV----------VFD 278

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-------FP 268
           SGS  TYF +  Y            +  L Q+SD   P  LC+   + F         F 
Sbjct: 279 SGSTYTYFSAQPYQATISAIKGSLSK-SLKQVSDPSLP--LCWKGQKAFKSVSDVKKDFK 335

Query: 269 SMAFYF-EDANLRIDGENVFIIDYENHFFL--LAVAPHDDLVALIGSQQQRDTRFVYDLN 325
           S+ F F ++A + I  EN  I+    +  L  L  +      ++IG    +D   +YD  
Sbjct: 336 SLQFIFGKNAVMEIPPENYLIVTKNGNVCLGILDGSAAKLSFSIIGDITMQDQMVIYDNE 395

Query: 326 IDLLSFVKENCS 337
              L +++ +CS
Sbjct: 396 KAQLGWIRGSCS 407


>gi|21717160|gb|AAM76353.1|AC074196_11 putative nucleoid DNA binding protein [Oryza sativa Japonica Group]
 gi|31433304|gb|AAP54833.1| Eukaryotic aspartyl protease family protein [Oryza sativa Japonica
           Group]
 gi|125575544|gb|EAZ16828.1| hypothetical protein OsJ_32300 [Oryza sativa Japonica Group]
          Length = 419

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 146/372 (39%), Gaps = 63/372 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
           IGTP + V  I+D    L++                 +FDP  S++++   C  P C   
Sbjct: 68  IGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSI 127

Query: 50  KCVNEQCVYTMKYADQSV---TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              N        Y   S+   T G A+ + I+ IG  EG+  F     GC   + G  + 
Sbjct: 128 PTRNCSGDGECGYEAPSMFGDTFGIASTDAIA-IGNAEGRLAF-----GCVVASDGSIDG 181

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           A DG  +G +GL R   S + Q        FSYCL +  P G+ ++ +L     +     
Sbjct: 182 AMDGP-SGFVGLGRTPWSLVGQSN---VTAFSYCLALHGP-GKKSALFLGASAKLAGAGK 236

Query: 167 STQATKFIN-HPNN--------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI---- 213
           S   T  +  H +N        +Y + L+ I           D      S  GG I    
Sbjct: 237 SNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAG--------DVAVAAASSGGGAITVLQ 288

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
           +++   L+Y     Y  L EK V+         +++ PEP  LC F     +  P + F 
Sbjct: 289 LETFRPLSYLPDAAYQAL-EKVVT--AALGSPSMANPPEPFDLC-FQNAAVSGVPDLVFT 344

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVA--------PHDDLVALIGSQQQRDTRFVYDLN 325
           F+         + +++   N    + ++          DD V+++GS  Q +  F++DL 
Sbjct: 345 FQGGATLTAQPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404

Query: 326 IDLLSFVKENCS 337
            + LSF   +CS
Sbjct: 405 KETLSFEPADCS 416


>gi|358346726|ref|XP_003637416.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
 gi|355503351|gb|AES84554.1| Aspartic proteinase nepenthesin-2, partial [Medicago truncatula]
          Length = 165

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/161 (31%), Positives = 81/161 (50%), Gaps = 5/161 (3%)

Query: 178 NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVS 237
           + +YY+ L  IS+  E +  P  +F++  +G GG I+DSG+ +T   SDVY  + + FV 
Sbjct: 8   DTYYYVGLVGISVGGELLAIPETSFEVDSAGNGGIIVDSGTAVTRLQSDVYNVVRDAFVK 67

Query: 238 YFERFQLAQLSDCPEPIQLCYFL-PETFNRFPSMAFYF-EDANLRIDGENVFIIDYENHF 295
             +   LA  ++       CY L  +T    P++AF+F E   L +  +N  +       
Sbjct: 68  GTKDL-LA--TNEVSLFDTCYDLSSKTSVEVPTVAFHFGEGKVLVLPAKNYLVPVDSVGT 124

Query: 296 FLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           F  A AP    +++IG+ QQ+ TR  +DL   L+ F    C
Sbjct: 125 FCFAFAPTMSSLSIIGNIQQQGTRVSFDLANSLVGFSPNRC 165


>gi|388515789|gb|AFK45956.1| unknown [Medicago truncatula]
          Length = 225

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 65/233 (27%), Positives = 99/233 (42%), Gaps = 22/233 (9%)

Query: 113 AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
           AG+LGL    +SF+ QLG      FSYCLV     G  +S  L+FG +            
Sbjct: 6   AGLLGLGSGPMSFVGQLGGQAGGTFSYCLV---SRGTESSGSLEFGRES--VPVGASWVS 60

Query: 173 FINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWK 230
            I++P   +FYY+ L  + +   R+    D F +   GEGG ++D+G+ +T   +  Y  
Sbjct: 61  LIHNPRAPSFYYIGLSGLGVGGLRVPISEDIFRLNELGEGGVVMDTGTAVTRLPAAAYNA 120

Query: 231 LHEKFVSYFERFQLAQLSDCPEPIQL-----CYFLPETFN-RFPSMAFYFEDAN-LRIDG 283
             + FV        AQ ++ P+   +     CY L      R P+++FYF     L +  
Sbjct: 121 FRDAFV--------AQTTNLPKTSGVSIFDTCYDLNGFVTVRVPTISFYFLGGPILTLPA 172

Query: 284 ENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            N  I       F  A AP    +++IG+ QQ       D     + F    C
Sbjct: 173 RNFLIPVDSVGTFCFAFAPSSSGLSIIGNIQQEGIEISVDGANGYIGFGPNIC 225


>gi|326490597|dbj|BAJ89966.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 450

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/386 (22%), Positives = 152/386 (39%), Gaps = 64/386 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL-----------IYAIFDPRKSSSFQKINCDHPDCTYF- 49
           V + +GTP + V ++LDTGS L             A F+   S ++  ++C  P C +  
Sbjct: 67  VSVVVGTPPQNVTMVLDTGSELSGLLCNGSSLSPPAPFNASASLTYSAVDCSSPACVWRG 126

Query: 50  ----------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS-- 97
                        +  C  ++ YAD S   G    +T  ++G    +A+   ALFGC   
Sbjct: 127 RDLPVRPFCDAPPSTSCRVSISYADASSADGHLVADTF-ILGT---QAV--PALFGCITS 180

Query: 98  -NDNHGFDEDARDG--ALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSY 154
            + +   +  A D   A  G+LG++R ++SF++Q  ++   RF+YC+      G      
Sbjct: 181 YSSSTAINSSATDPSEAATGLLGMNRGSLSFVTQTATL---RFAYCIAPGQGPGILLLGG 237

Query: 155 LKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
                      P  + ++ + + +   Y + L+ I + +  +  P        +G G  +
Sbjct: 238 DGGAAPPLNYTPLIEISQPLPYFDRVAYSVQLEGIRVGSALLQIPKSVLTPDHTGAGQTM 297

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLPE---- 262
           +DSG+  T+  +D Y  L  +F++   R  LA L    EP          C+  PE    
Sbjct: 298 VDSGTQFTFLLADAYAALKAEFLNQ-ARSLLAPLG---EPGFVFQGAFDACFRGPEERVS 353

Query: 263 -TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENHFFLLAVA------PHDDLVAL----I 310
                 P +      A + + GE  ++ +  E      A A       + D+  +    I
Sbjct: 354 AASRLLPEVGLVLRGAEVAVAGEKLLYSVPGERRGEEGAEAVWCLTFGNSDMAGMSAYVI 413

Query: 311 GSQQQRDTRFVYDLNIDLLSFVKENC 336
           G   Q+D    YDL    + F    C
Sbjct: 414 GHHHQQDVWVEYDLQNGRVGFAPARC 439


>gi|302794412|ref|XP_002978970.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
 gi|300153288|gb|EFJ19927.1| hypothetical protein SELMODRAFT_418789 [Selaginella moellendorffii]
          Length = 462

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 159/375 (42%), Gaps = 59/375 (15%)

Query: 6   IGTPSKGVLLILDTGSALIY--------------AIFDPRKSSSFQKINCDHPDC----- 46
           +G+P +  +LI+DTGS L +               I+D  +S S++ + C++        
Sbjct: 106 LGSPGQEAILIVDTGSELTWLKCLPCKVCAPSVDTIYDAARSVSYKPVTCNNSQLCSNSS 165

Query: 47  --TYFKCV-NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAI-FHGALFGCSND 99
             TY  C    QC +   Y D S + G  + +T+   +V+G   GK +      FGC+  
Sbjct: 166 QGTYAYCARGSQCQFAAFYGDGSFSYGSLSTDTLIMETVVG---GKPVTVQDFAFGCA-- 220

Query: 100 NHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF-- 157
             G  E    GA +G+LGL+   ++   QLG     +FS+C   P  +    S+ + F  
Sbjct: 221 -QGDLELVPTGA-SGILGLNAGKMALPMQLGQRFGWKFSHCF--PDRSSHLNSTGVVFFG 276

Query: 158 GTDMGYRRPSTQATKFINHP--NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
             ++ + +    +    N      FY+++LK +SI++  +   P    +        I+D
Sbjct: 277 NAELPHEQVQYTSVALTNSELQRKFYHVALKGVSINSHELVLLPRGSVV--------ILD 328

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-----ETFNRFPSM 270
           SGS  + F    + +L E F+ +          D    +  C+ +      E     PS+
Sbjct: 329 SGSSFSSFVRPFHSQLREAFLKHRPPSLKHLEGDSFGDLGTCFKVSNDDIDELHRTLPSL 388

Query: 271 AFYFEDA-NLRIDGENVF--IIDYENHFFLLAVAPHD---DLVALIGSQQQRDTRFVYDL 324
           +  FED   + I    V   +  Y+NH   +  A  D   + V +IG+ QQ++    YD+
Sbjct: 389 SLVFEDGVTIGIPSIGVLLPVARYQNH-VKMCFAFEDGGPNPVNVIGNYQQQNLWVEYDI 447

Query: 325 NIDLLSFVKENCSDD 339
               + F + +C  D
Sbjct: 448 QRSRVGFARASCVID 462


>gi|242092902|ref|XP_002436941.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
 gi|241915164|gb|EER88308.1| hypothetical protein SORBIDRAFT_10g011760 [Sorghum bicolor]
          Length = 445

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 89/360 (24%), Positives = 142/360 (39%), Gaps = 63/360 (17%)

Query: 7   GTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCTYFK 50
           GTP+   ++++DTGS L +                 +FDP  SS++  + C   +C    
Sbjct: 119 GTPAVPQVVVIDTGSDLTWLQCKPCSSGQCSPQKDPLFDPSHSSTYSAVPCASGECKKLA 178

Query: 51  -------CVNEQ-CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG 102
                  C N Q C + + Y D + T G    + +++       AI     FGC     G
Sbjct: 179 ADAYGSGCSNGQPCGFAISYVDGTSTVGVYGKDKLTL----APGAIVKDFYFGC-----G 229

Query: 103 FDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMG 162
             + +  G   G+LGL R++ S  +Q        FSYCL    P       +L FG    
Sbjct: 230 HSKSSLPGLFDGLLGLGRLSESLGAQY--GGGGGFSYCL----PAVNSKPGFLAFGAG-- 281

Query: 163 YRRPS----TQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGS 218
            R PS    T   +    P  F  ++L  I++  ++++  P  F       GG I+DSG+
Sbjct: 282 -RNPSGFVFTPMGRVPGQPT-FSTVTLAGITVGGKKLDLRPSAF------SGGMIVDSGT 333

Query: 219 VLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-RFPSMAFYFE-D 276
           V+T   S VY  L   F    + ++L         +  CY L    N   P +A  F   
Sbjct: 334 VVTVLQSTVYRALRAAFREAMKAYRLVH-----GDLDTCYDLTGYKNVVVPKIALTFSGG 388

Query: 277 ANLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           A + +D  N  ++   N     A    D    ++G+  QR    ++D +     F  + C
Sbjct: 389 ATINLDVPNGILV---NGCLAFAETGKDGTAGVLGNVNQRTFEVLFDTSASKFGFRAKAC 445


>gi|388516465|gb|AFK46294.1| unknown [Medicago truncatula]
          Length = 434

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 85/347 (24%), Positives = 149/347 (42%), Gaps = 45/347 (12%)

Query: 1   MVRLFIGTPSKGVLLILDTGS--ALIYA---------IFDPRKSSSFQKINCDHPDCTYF 49
           +VR+ IGTP + + ++LDT +  A I +          F P  S+S+  + C  P C+  
Sbjct: 99  IVRVKIGTPGQLLFMVLDTSTDEAFIPSSGCIGCSATTFSPNASTSYVPLECSVPQCSQV 158

Query: 50  KCVN------EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
           + ++        C +   YA  + +      +++ +        +     FG  N   G 
Sbjct: 159 RGLSCPATGSGACSFNKSYAGSTYSATLV-QDSLRL-----ATDVIPSYSFGSINAISGS 212

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
              A+           R  +S +SQ GS+    FSYCL  P     Y S  LK G  +G 
Sbjct: 213 SIPAQGLLGL-----GRGPLSLLSQTGSLYSGVFSYCL--PSFKSYYFSGSLKLG-PVGQ 264

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
            + S + T  + +P   + Y+++L  I++    + FP +     V+   G IIDSG+V+T
Sbjct: 265 PK-SIRTTPLLRNPRRPSLYFVNLTGITVGKVNVPFPKELLAFDVNTGSGTIIDSGTVIT 323

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   VY  + ++F     R Q+            C F+       P++  +F D +L++
Sbjct: 324 RFVEPVYNAVRDEF-----RKQVTGPFSSLGAFDTC-FVKNYETLAPAITLHFTDLDLKL 377

Query: 282 DGENVFIIDYENHFFLLAVA--PHD---DLVALIGSQQQRDTRFVYD 323
             EN  I         LA+A  P +    ++ +I + QQ++ R ++D
Sbjct: 378 PLENSLIHSSSGSLACLAMASTPKNVNYTVLNVIANYQQQNLRVLFD 424


>gi|296085344|emb|CBI29076.3| unnamed protein product [Vitis vinifera]
          Length = 434

 Score = 72.8 bits (177), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 107/256 (41%), Gaps = 38/256 (14%)

Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
           G+ G  R  +S  SQLG  ++K FS+C L     N    SS L  G          Q T 
Sbjct: 181 GIAGFGRGVLSLPSQLG-FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS 239

Query: 173 FINHP--NNFYYLSLKDISIDNER-MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
            + +P   N+YY+ L+ I++ N   +  P    +    G GG IIDSG+  T+     Y 
Sbjct: 240 LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYT 299

Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYFEDANLRID 282
           +L     S    +  AQ  +      LCY +P   N         PS++F+F +      
Sbjct: 300 QLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN------ 352

Query: 283 GENV-FIIDYENHFFLLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLN 325
             NV  ++   NHF+ +    +  +V                 + GS QQ++ + VYDL 
Sbjct: 353 --NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLE 410

Query: 326 IDLLSFVKENCSDDSA 341
            + + F   +C+  +A
Sbjct: 411 KERIGFQPMDCASAAA 426


>gi|359484086|ref|XP_002263357.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera]
          Length = 417

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 68/256 (26%), Positives = 107/256 (41%), Gaps = 38/256 (14%)

Query: 114 GVLGLSRVTISFISQLGSIIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK 172
           G+ G  R  +S  SQLG  ++K FS+C L     N    SS L  G          Q T 
Sbjct: 164 GIAGFGRGVLSLPSQLG-FLQKGFSHCFLGFKFANNPNISSPLVIGDLAISSNDHLQFTS 222

Query: 173 FINHP--NNFYYLSLKDISIDNER-MNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
            + +P   N+YY+ L+ I++ N   +  P    +    G GG IIDSG+  T+     Y 
Sbjct: 223 LLKNPMYPNYYYIGLEAITVGNATAIQVPSSLREFDSHGNGGMIIDSGTTYTHLPGPFYT 282

Query: 230 KLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN-------RFPSMAFYFEDANLRID 282
           +L     S    +  AQ  +      LCY +P   N         PS++F+F +      
Sbjct: 283 QLLSMLQSIIT-YPRAQEQEARTGFDLCYRIPCPNNVVTDHDHLLPSISFHFSN------ 335

Query: 283 GENV-FIIDYENHFFLLAVAPHDDLV----------------ALIGSQQQRDTRFVYDLN 325
             NV  ++   NHF+ +    +  +V                 + GS QQ++ + VYDL 
Sbjct: 336 --NVSLVLPQGNHFYAMGAPSNSTVVKCLLLQNMDDSDSGPAGVFGSFQQQNVKVVYDLE 393

Query: 326 IDLLSFVKENCSDDSA 341
            + + F   +C+  +A
Sbjct: 394 KERIGFQPMDCASAAA 409


>gi|18421660|ref|NP_568551.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|10177438|dbj|BAB10671.1| unnamed protein product [Arabidopsis thaliana]
 gi|15809850|gb|AAL06853.1| AT5g37540/mpa22_p_70 [Arabidopsis thaliana]
 gi|20260182|gb|AAM12989.1| unknown protein [Arabidopsis thaliana]
 gi|23197748|gb|AAN15401.1| unknown protein [Arabidopsis thaliana]
 gi|332006821|gb|AED94204.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 442

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 76/273 (27%), Positives = 113/273 (41%), Gaps = 57/273 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           ++ L IGTPS+   L+LDTGS L +                  FDP  SSSF  + C HP
Sbjct: 81  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 140

Query: 45  DCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            C            C  N  C Y+  YAD +  +G    E  +         +    + G
Sbjct: 141 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPL----ILG 196

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C+ ++   DE        G+LG++   +SFISQ  + I K FSYC+    P         
Sbjct: 197 CAKEST--DEK-------GILGMNLGRLSFISQ--AKISK-FSYCI----PTRSNRPGLA 240

Query: 156 KFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFD 203
             G+      P+++  K+++          PN     Y + L+ I I  +R+N P   F 
Sbjct: 241 STGSFYLGDNPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLQGIRIGQKRLNIPGSVFR 300

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
               G G  ++DSGS  T+     Y K+ E+ V
Sbjct: 301 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIV 333


>gi|413950927|gb|AFW83576.1| aspartic proteinase nepenthesin-2 [Zea mays]
          Length = 316

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 74/304 (24%), Positives = 118/304 (38%), Gaps = 40/304 (13%)

Query: 62  YADQSVTKGFAAHETISVI------GKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGV 115
           Y D S  +G    ++ ++       GK + +A   G + GC+    G    A DG    V
Sbjct: 22  YKDGSAARGTVGTDSATIALSGRRAGKKQRRAKLRGVVLGCTTSYTGESFLASDG----V 77

Query: 116 LGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATK--- 172
           L L    +SF S+  +    RFSYCLV  L     T SYL FG +      S   T    
Sbjct: 78  LSLGYSNVSFASRAAARFGGRFSYCLVDHLAPRNAT-SYLTFGPNPAVSSASASRTACAG 136

Query: 173 ------------FINHP-NNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
                        ++H    FY +++  +S+D E +  P   +D  V   GG I+DSG+ 
Sbjct: 137 SAAAPGARQTPLLLDHRMRPFYAVAVNGVSVDGELLRIPRLVWD--VQKGGGAILDSGTS 194

Query: 220 LTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFN------RFPSMAFY 273
           LT   S  Y       V+   +  +       +P   CY               P++A +
Sbjct: 195 LTVLVSPAY----RAVVAALGKKLVGLPRVAMDPFDYCYNWTSPLTGEDLAVAVPALAVH 250

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVAPHD-DLVALIGSQQQRDTRFVYDLNIDLLSFV 332
           F  +         ++ID       + +   D   V++IG+  Q++  + +DL    L F 
Sbjct: 251 FAGSARLQPPPKSYVIDAAPGVKCIGLQEGDWPGVSVIGNILQQEHLWEFDLKNRRLRFK 310

Query: 333 KENC 336
           +  C
Sbjct: 311 RSRC 314


>gi|6850312|gb|AAF29389.1|AC009999_9 Contains similarity to nucellin from Hordeum vulgare gb|U87148.
           ESTs gb|T22068, gb|F14251, gb|F14237, gb|F14242 come
           from this gene [Arabidopsis thaliana]
          Length = 388

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 68/265 (25%), Positives = 111/265 (41%), Gaps = 44/265 (16%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
            ++ IGTP+K   + +DTGS +++                    +++  +S S + ++CD
Sbjct: 82  AKIGIGTPAKSYYVQVDTGSDIMWVNCIQCKQCPRRSTLGIELTLYNIDESDSGKLVSCD 141

Query: 43  HPDCTYFK------C-VNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
              C          C  N  C Y   Y D S T G+   + +   SV G  + +      
Sbjct: 142 DDFCYQISGGPLSGCKANMSCPYLEIYGDGSSTAGYFVKDVVQYDSVAGDLKTQTANGSV 201

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEY 150
           +FGC     G  + + + AL G+LG  +   S ISQL S   +KK F++CL     +G  
Sbjct: 202 IFGCGARQSGDLDSSNEEALDGILGFGKANSSMISQLASSGRVKKIFAHCL-----DGRN 256

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                  G  +   +P    T  + N P+  Y +++  + +  E +  P D F       
Sbjct: 257 GGGIFAIGRVV---QPKVNMTPLVPNQPH--YNVNMTAVQVGQEFLTIPADLFQ--PGDR 309

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEK 234
            G IIDSG+ L Y    +Y  L +K
Sbjct: 310 KGAIIDSGTTLAYLPEIIYEPLVKK 334


>gi|147859621|emb|CAN83119.1| hypothetical protein VITISV_043393 [Vitis vinifera]
          Length = 431

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 76/293 (25%), Positives = 131/293 (44%), Gaps = 43/293 (14%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCDHPDCTYFK------CV-NE 54
            ++ IGTP++   + ++        ++D ++S + + ++CD   C          C+ N 
Sbjct: 100 AKIGIGTPARDYYVQMEL------TLYDIKESLTGKLVSCDQDFCYAINGGPPSYCIANM 153

Query: 55  QCVYTMKYADQSVT-----KGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARD 109
            C YT  YAD S +     KG+      + I       +    L  CS    G  + + +
Sbjct: 154 SCSYTEIYADGSSSFGYFVKGYCTASKYNSIPHLNNNPLLEVPLR-CSATQSG--DLSSE 210

Query: 110 GALAGVLGLSRVTISFISQLGSI--IKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
            AL G+LG  +   S ISQL S   ++K F++CL     +G         G  +   +P 
Sbjct: 211 EALDGILGFGKSNTSMISQLASSGKVRKMFAHCL-----DGLNGGGIFAIGHIV---QPK 262

Query: 168 TQATKFINHPNNFYY-LSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSD 226
              T  +  PN  +Y +++K + +    +N P D FD  V  + G IIDSG+ L Y    
Sbjct: 263 VNTTPLV--PNQTHYNVNMKAVEVGGYFLNLPTDVFD--VGDKKGTIIDSGTTLAYLPEV 318

Query: 227 VYWKLHEKFVSYFERFQLAQLSDCPEPIQL-CYFLPETF-NRFPSMAFYFEDA 277
           VY +L  K  S+    ++  + D     Q  C+   E+  + FP++ F+FE++
Sbjct: 319 VYDQLLSKIFSWQSDLKVHTIHD-----QFTCFQYSESLDDGFPAVTFHFENS 366


>gi|242041431|ref|XP_002468110.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
 gi|241921964|gb|EER95108.1| hypothetical protein SORBIDRAFT_01g039750 [Sorghum bicolor]
          Length = 467

 Score = 72.4 bits (176), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 89/410 (21%), Positives = 159/410 (38%), Gaps = 89/410 (21%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIY------------------AIFDPRKSSSFQKINCDH 43
           V + +G P + V ++LDTGS L +                  A F+   SS++   +C  
Sbjct: 61  VPVAVGAPPQNVTMVLDTGSELSWLLCNGSRVPSTPPQPQAPAAFNGSASSTYAAAHCSS 120

Query: 44  -PDCTYFK--------CV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
            P+C +          C    +  C  ++ YAD S   G  A +T  +     G A    
Sbjct: 121 SPECQWRGRDLPVPPFCAGPPSNSCRVSLSYADASSADGVLAADTFLL-----GGAPPVR 175

Query: 92  ALFGC--------SNDNHGFDEDAR----DGALAGVLGLSRVTISFISQLGSIIKKRFSY 139
           ALFGC        + D +G   DA       A  G+LG++R ++SF++Q G++   RF+Y
Sbjct: 176 ALFGCITSYSSSSTADGNGNGNDASATNSSEAATGLLGMNRGSLSFVTQTGTL---RFAY 232

Query: 140 CLV------IPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPNNFYY-LSLKDISIDN 192
           C+       + +  G+   + L     + Y  P  + ++ + + +   Y + L+ I +  
Sbjct: 233 CIAPGDGPGLLVLGGDGDGAALSAAPQLNYT-PLIEMSQPLPYFDRVAYSVQLEGIRVGA 291

Query: 193 ERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPE 252
             +  P        +G G  ++DSG+  T+  +D Y  L  +F++       A L+   E
Sbjct: 292 ALLPIPKSVLAPDHTGAGQTMVDSGTQFTFLLADAYAPLKGEFLNQTS----ALLAPLGE 347

Query: 253 P-------IQLCYFLPE-------TFNRFPSMAFYFEDANLRIDGEN-VFIIDYENH--- 294
           P          C+   E            P +      A + + GE  ++++  E     
Sbjct: 348 PDFVFQGAFDACFRASEARVAAATASQLLPEVGLVLRGAEVAVGGEKLLYMVPGERRGEG 407

Query: 295 ----FFLLAVAPHDDLVAL----IGSQQQRDTRFVYDLNIDLLSFVKENC 336
                + L    + D+  +    IG   Q++    YDL    + F    C
Sbjct: 408 GSEAVWCLTFG-NSDMAGMSAYVIGHHHQQNVWVEYDLQNSRVGFAPARC 456


>gi|224033419|gb|ACN35785.1| unknown [Zea mays]
 gi|413934980|gb|AFW69531.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 543

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 78/295 (26%), Positives = 114/295 (38%), Gaps = 60/295 (20%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFD------------------------PRKSSSFQKINC 41
           +GTP+   L+ LDTGS L +   D                        PR+SS+ +++ C
Sbjct: 114 LGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANGTGQDAPSLRPYSPRRSSTSKQVAC 173

Query: 42  DHPDCTY----FKCVNEQCVYTMKYADQSVTKGF-----AAHETISVIGKG-EGKAIFHG 91
           D+P C          N  C Y ++Y   + +          H T    G G  G+A+   
Sbjct: 174 DNPLCGQRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAP 233

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNG 148
            +FGC     G   D   GA+ G++GL    +S  S L   G +    FS C       G
Sbjct: 234 VVFGCGQVQTGAFLDGGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF------G 287

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     + FG D G R      T F +   N  Y +S   I + +E           +V+
Sbjct: 288 DDGVGRVNFG-DAGSR--GQAETPFTVRSLNPTYNVSFTSIGVGSE-----------SVA 333

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFL 260
            E   ++DSG+  TY     Y +L  KF S     R   +  S  P P + CY L
Sbjct: 334 AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRL 388


>gi|255545932|ref|XP_002514026.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
 gi|223547112|gb|EEF48609.1| Aspartic proteinase nepenthesin-1 precursor, putative [Ricinus
           communis]
          Length = 437

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 84/359 (23%), Positives = 145/359 (40%), Gaps = 43/359 (11%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------IFDPRKSSSFQKINCDHPDCTY- 48
           +VR+ +GTP + + ++LDT +   +             F    SS++  ++C    CT  
Sbjct: 98  VVRVKLGTPGQFMFMVLDTSNDAAWVPCSGCTGCSSTTFSTNTSSTYGSLDCSMAQCTQV 157

Query: 49  --FKCV---NEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGF 103
             F C    +  CV+   Y   S        +++ ++       +     FGC N   G 
Sbjct: 158 RGFSCPATGSSSCVFNQSYGGDSSFSATLVEDSLRLVND-----VIPNFAFGCINSISGG 212

Query: 104 DEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
               +           R  +S I+Q GS+    FSYCL  P     Y S  LK G     
Sbjct: 213 SVPPQGLLGL-----GRGPLSLIAQSGSLYSGLFSYCL--PSFKSYYFSGSLKLGP--AG 263

Query: 164 RRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLT 221
           +  S + T  + +P+  + YY++L  +S+    +   P+      +   G IIDSG+V+T
Sbjct: 264 QPKSIRYTPLLRNPHRPSLYYVNLTGVSVGRTLVPIAPELLAFNPNTGAGTIIDSGTVIT 323

Query: 222 YFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRI 281
            F   +Y  + ++F     R Q+A           C F        P++  +F   NL +
Sbjct: 324 RFVQPIYTAIRDEF-----RKQVAGPFSSLGAFDTC-FAATNEAVAPAVTLHFTGLNLVL 377

Query: 282 DGENVFIIDYENHFFLLAVAPH----DDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
             EN  I         LA+A      + ++ +I + QQ++ R ++D+    L   +E C
Sbjct: 378 PMENSLIHSSAGSLACLAMAAAPNNVNSVLNVIANLQQQNLRLLFDVPNSRLGIARELC 436


>gi|125532793|gb|EAY79358.1| hypothetical protein OsI_34487 [Oryza sativa Indica Group]
          Length = 419

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 145/372 (38%), Gaps = 63/372 (16%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYF 49
           IGTP + V  I+D    L++                 +FDP  S++++   C  P C   
Sbjct: 68  IGTPPQAVSGIVDLSGELVWTQCAACRSSGCFKQELPVFDPSASNTYRAEQCGSPLCKSI 127

Query: 50  KCVNEQCVYTMKYADQSV---TKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDED 106
              N        Y   S+   T G A+ + I+ IG  EG+  F     GC   + G  + 
Sbjct: 128 PTRNCSGDGECGYEAPSMFGDTFGIASTDAIA-IGNAEGRLAF-----GCVVASDGSIDG 181

Query: 107 ARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           A DG  +G +GL R   S + Q        FSYCL  P   G+ ++ +L     +     
Sbjct: 182 AMDGP-SGFVGLGRTPWSLVGQSN---VTAFSYCLA-PHGPGKKSALFLGASAKLAGAGK 236

Query: 167 STQATKFIN-HPNN--------FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI---- 213
           S   T  +  H +N        +Y + L+ I           D      S  GG I    
Sbjct: 237 SNPPTPLLGQHASNTSDDGSDPYYTVQLEGIKAG--------DVAVAAASSGGGAITILQ 288

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFY 273
           +++   L+Y     Y  L EK V+         +++ PEP  LC F     +  P + F 
Sbjct: 289 LETFRPLSYLPDAAYQAL-EKVVT--AALGSPSMANPPEPFDLC-FQNAAVSGVPDLVFT 344

Query: 274 FEDANLRIDGENVFIIDYENHFFLLAVA--------PHDDLVALIGSQQQRDTRFVYDLN 325
           F+         + +++   N    + ++          DD V+++GS  Q +  F++DL 
Sbjct: 345 FQGGATLTAPPSKYLLGDGNGNGTVCLSILSSTRLDSADDGVSILGSLLQENVHFLFDLE 404

Query: 326 IDLLSFVKENCS 337
            + LSF   +CS
Sbjct: 405 KETLSFEPADCS 416


>gi|226499286|ref|NP_001147826.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
 gi|195613980|gb|ACG28820.1| aspartic-type endopeptidase/ pepsin A [Zea mays]
          Length = 545

 Score = 72.0 bits (175), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 94/383 (24%), Positives = 142/383 (37%), Gaps = 77/383 (20%)

Query: 6   IGTPSKGVLLILDTGSALIYAIFD------------------------PRKSSSFQKINC 41
           +GTP+   L+ LDTGS L +   D                        PR+SS+ +++ C
Sbjct: 116 LGTPNATFLVALDTGSDLFWVPCDCRQCATIPSANATGPDAPPLRPYSPRRSSTSEQVAC 175

Query: 42  DHPDCTYFK----CVNEQCVYTMKYADQSVTKGF-----AAHETISVIGKG-EGKAIFHG 91
           D+P C          N  C Y ++Y   + +          H T    G G  G+A+   
Sbjct: 176 DNPLCGRRNGCSAATNGSCPYEVQYVSANTSSSGVLVQDVLHLTRERPGPGAAGEALQAP 235

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQL---GSIIKKRFSYCLVIPLPNG 148
            +FGC     G   D   GA+ G++GL    +S  S L   G +    FS C       G
Sbjct: 236 VVFGCGQVQTGAFLDDGGGAVDGLMGLGMGKVSVPSALAASGLVASDSFSMCF------G 289

Query: 149 EYTSSYLKFGTDMGYRRPSTQATKF-INHPNNFYYLSLKDISIDNERMNFPPDTFDITVS 207
           +     + FG D G R      T F +   N  Y +S   I I +E           +V+
Sbjct: 290 DDGVGRVNFG-DAGSR--GQAETPFTVRSLNPTYNVSFTSIGIGSE-----------SVA 335

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF--ERFQLAQLSDCPEPIQLCYFLPETFN 265
            E   ++DSG+  TY     Y +L  KF S     R   +  S  P P + CY L     
Sbjct: 336 AEFAAVMDSGTSFTYLSDPEYTQLATKFNSQVSERRVNFSSGSADPFPFEYCYRLSPN-- 393

Query: 266 RFPSMAFYFEDANLRIDGENVFII--------DYENHF--FLLAVAPHDDLVA--LIGSQ 313
                     D +L   G  +F +        D       + LA+  +D  +   +IG  
Sbjct: 394 ---QTEVAMPDVSLTAKGGALFPVTQPFIPVGDTTGRAIGYCLAIMRNDMAIGIDIIGQN 450

Query: 314 QQRDTRFVYDLNIDLLSFVKENC 336
                + V+D    +L + K +C
Sbjct: 451 FMTGLKVVFDRERSVLGWEKFDC 473


>gi|224136436|ref|XP_002322329.1| predicted protein [Populus trichocarpa]
 gi|222869325|gb|EEF06456.1| predicted protein [Populus trichocarpa]
          Length = 486

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 101/419 (24%), Positives = 157/419 (37%), Gaps = 100/419 (23%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFDPRKSSSFQKINCD------------------ 42
           ++ L IGTP + + +++DTGS L +    P  + SF  + CD                  
Sbjct: 83  LISLNIGTPPQVIQVLMDTGSDLTWV---PCGNLSFDCMECDDYRNNKLMATFSPSYSSS 139

Query: 43  ------------------HP--DCTYFKC-----VNEQCV-----YTMKYADQSVTKGFA 72
                             +P   CT   C     V   C      +   Y    V  G  
Sbjct: 140 SYRASCASPFCIDIHSSDNPLDTCTVAGCSLSTLVKATCSRPCPSFAYTYGAGGVVTGIL 199

Query: 73  AHETISVIGKGEGKAI-FHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS 131
             +T+ V G   G A       FGC    +      R+    G+ G  R T+S +SQLG 
Sbjct: 200 TRDTLRVNGSSPGVAKEIPKFCFGCVGSAY------REPI--GIAGFGRGTLSMVSQLG- 250

Query: 132 IIKKRFSYC-LVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP--NNFYYLSLKDI 188
            ++K FS+C L     N    SS L  G      +   Q T  +N P   NFYY+ L+ I
Sbjct: 251 FLQKGFSHCFLAFKYANNPNISSPLVVGDIALTSKDDMQFTPMLNSPMYPNFYYVGLEAI 310

Query: 189 SIDNERMNFPPDTF-DITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQL 247
           ++ N      P +  +    G GG  IDSG+  T+     Y ++     S     +   +
Sbjct: 311 TVGNVSATEVPSSLREFDSLGNGGMKIDSGTTYTHLPEPFYSQVLSILQSTINYPRDTGM 370

Query: 248 SDCPEPIQLCYFLPETFNR-------FPSMAFYFEDANLRIDGENV-FIIDYENHFFLLA 299
            +      LCY +P   N         PS+ F+F +        NV  ++   NHF+ ++
Sbjct: 371 -EMQTGFDLCYKVPRPNNNTLTSDDLLPSITFHFLN--------NVSLVLPQGNHFYPVS 421

Query: 300 VAPHDDLV-----------------ALIGSQQQRDTRFVYDLNIDLLSFVKENCSDDSA 341
            AP +  V                  + GS QQ++   VYDL  + + F   +C+  ++
Sbjct: 422 -APGNPAVVKCLMFQSTDDGDDGPAGVFGSFQQQNVEVVYDLEKERIGFQPMDCASAAS 479


>gi|88174554|gb|ABD39352.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 71.6 bits (174), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 143/342 (41%), Gaps = 53/342 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 118 FGANE---FGNVDGLLGMGAGAMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVV 224

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      R   A+     E  + CY +        P+++ 
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +F+D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


>gi|326495450|dbj|BAJ85821.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 491

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 79/330 (23%), Positives = 125/330 (37%), Gaps = 45/330 (13%)

Query: 26  AIFDPRKSSSFQKINCDHPDCTYF-----KCVNEQ----CVYTMKYADQSVTKGFAAHET 76
           A FDPR+SS+   + C    C         C        C+Y ++Y+D  +T G    +T
Sbjct: 188 AFFDPRRSSTGAPVRCGSRACRTLGGYANGCSKPNSTGDCLYRIEYSDHRLTLGTYMTDT 247

Query: 77  ISVIGKGEGKAIFHGALFGCSNDNHG-FDEDARDGALAGVLGLSRVTISFISQLGSIIKK 135
           +++         F    FGCS+   G F   A     +G + L     S +SQ       
Sbjct: 248 LTI----SPSTTFLNFRFGCSHAVRGKFSAQA-----SGTMSLGGGPQSLLSQTARAYGN 298

Query: 136 RFSYCLVIPLPNGEYTSSYLKFGTDMG-----YRRPSTQATKFINHPNNFYYLSLKDISI 190
            FSYC+  P   G  +      G D G        P  ++   IN     Y + L+ I +
Sbjct: 299 AFSYCVPGPSAAGFLSIGGPVNGDDGGGSGAFATTPLVRSANVIN--PTIYVVRLQGIEV 356

Query: 191 DNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKF----VSYFERFQLAQ 246
              R+N PP  F       GG ++DS +V+T      Y  L   F     +Y  R     
Sbjct: 357 AGRRLNVPPVVF------SGGTVMDSSAVITQLPPTAYRALRLAFRNAMRAYKTRAPTGN 410

Query: 247 LSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDDL 306
           L  C +      F+  +    P+++  F+   +   G    ++D    F  +A    D  
Sbjct: 411 LDTCFD------FVGVSKVTVPTVSLVFDGGAVIELGLLSVLLDSCLAFAPMAA---DFA 461

Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
           +  IG+ QQ+    +YD+    + F    C
Sbjct: 462 LGFIGNVQQQTHEVLYDVAGGAVGFRHGAC 491


>gi|168027607|ref|XP_001766321.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682535|gb|EDQ68953.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 381

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 89/377 (23%), Positives = 142/377 (37%), Gaps = 70/377 (18%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSFQKINCDHPDCTY 48
           + IG P+K   L +DTGS L +                ++DP+K+   + ++C  P C  
Sbjct: 27  MLIGAPAKLYYLDMDTGSDLTWLQCDAPCRSCASGPHGLYDPKKA---RLVDCRVPLCAL 83

Query: 49  ------FKCVN--EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 + C     QC Y ++YAD S T G    +TI+++    G      A+ GC  D 
Sbjct: 84  VQQGGSYACGGPVRQCDYDVEYADGSSTMGVLMEDTITLL-LTNGTRSKTTAIIGCGYDQ 142

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
            G        +  GV+GLS   IS  SQL    I++    +CL      G     YL FG
Sbjct: 143 QGTLAQT-PASTDGVMGLSSAKISLPSQLAKKGIVRNVIGHCLA----GGSNGGGYLFFG 197

Query: 159 TDMGYRRPSTQATKFINHPNNFYYLSLKDIS--IDNERMNFPPDTFDITVSGEGGCIIDS 216
             +    P+   T        +  +  K I+  I  +  +    T DI     GG + DS
Sbjct: 198 DSL---VPALGMT--------WTPIMGKSITGNIGGKSGDADDKTGDI-----GGVMFDS 241

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFED 276
           G+  TY   + Y  +        E+  L ++      +  C+  P  F     +  YF+ 
Sbjct: 242 GTSFTYLVPEAYNAVLSAMEMQVEKSGLVRIKT-DNTLPFCWRGPSPFESVADVQRYFKT 300

Query: 277 AN--------------LRIDGENVFIIDYENHF---FLLAVAPHDDLVALIGSQQQRDTR 319
                           L +  E   I+  + +     L A     ++  +IG    R   
Sbjct: 301 VTLDFGKRNWYSASRVLELSPEGYLIVSTQGNVCLGILDASGASLEVTNIIGDVSMRGYL 360

Query: 320 FVYDLNIDLLSFVKENC 336
            VYD   + + +V+ NC
Sbjct: 361 VVYDNARNQIGWVRRNC 377


>gi|359482287|ref|XP_002263129.2| PREDICTED: aspartic proteinase-like protein 2 isoform 2 [Vitis
           vinifera]
 gi|297740017|emb|CBI30199.3| unnamed protein product [Vitis vinifera]
          Length = 502

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 92/378 (24%), Positives = 157/378 (41%), Gaps = 63/378 (16%)

Query: 3   RLFIGTPSKGVLLILDTGSALIY-------------------AIFDPRKSSSFQKINCDH 43
           ++ +G+P +   + +DTGS +++                   + FDP  SS+   ++C H
Sbjct: 89  KVKLGSPPREFNVQIDTGSDILWVTCNSCNDCPRTSGLGIELSFFDPSSSSTTSLVSCSH 148

Query: 44  PDCTYF------KCV--NEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHGA 92
           P CT        +C   + QC Y+  Y D S T G+   + +   +V+G           
Sbjct: 149 PICTSLVQTTAAECSPQSNQCSYSFHYGDGSGTTGYYVSDMLYFDTVLGDSLIANSSASI 208

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S +SQL S  I  K FS+CL      GE 
Sbjct: 209 VFGCSTYQSG-DLTKVDKAIDGIFGFGQQDLSVVSQLSSLGITPKVFSHCL-----KGEG 262

Query: 151 -TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
                L  G  +    P+   +  +    + Y L+L+ IS++ + +   P  F    S  
Sbjct: 263 DGGGKLVLGEIL---EPNIIYSPLVPS-QSHYNLNLQSISVNGQLLPIDPAVF--ATSNN 316

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPI----QLCYFLPETFN 265
            G I+DSG+ LTY     Y    + FVS       A +S    P+      CY +  + +
Sbjct: 317 QGTIVDSGTTLTYLVETAY----DPFVSAIT----ATVSSSTTPVLSKGNQCYLVSTSVD 368

Query: 266 R-FPSMAFYFEDANLRI--DGENVFIIDYENHFFLLAVA---PHDDLVALIGSQQQRDTR 319
             FP ++  F      +   GE +  + + +   +  +      +  + ++G    +D  
Sbjct: 369 EIFPPVSLNFAGGASMVLKPGEYLMHLGFSDGAAMWCIGFQKVAEPGITILGDLVLKDKI 428

Query: 320 FVYDLNIDLLSFVKENCS 337
           FVYDL    + +   +CS
Sbjct: 429 FVYDLAHQRIGWANYDCS 446


>gi|356564743|ref|XP_003550608.1| PREDICTED: aspartic proteinase-like protein 2-like [Glycine max]
          Length = 490

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 90/375 (24%), Positives = 142/375 (37%), Gaps = 57/375 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +GTP     + +DTGS +++                     FDP  SS+   I C  
Sbjct: 78  KVQLGTPPVEFNVQIDTGSDVLWVSCNSCSGCPQTSGLQIQLNFFDPGSSSTSSMIACSD 137

Query: 44  PDC--------TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA--- 92
             C              N QC YT +Y D S T G+   + + +    EG    +     
Sbjct: 138 QRCNNGIQSSDATCSSQNNQCSYTFQYGDGSGTSGYYVSDMMHLNTIFEGSVTTNSTAPV 197

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCSN   G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL      G  
Sbjct: 198 VFGCSNQQTG-DLTKSDRAVDGIFGFGQQEMSVISQLSSQGIAPRVFSHCLKGDSSGG-- 254

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               L  G  +    P+   T  +   P+  Y L+L+ I+++ + +      F    S  
Sbjct: 255 --GILVLGEIV---EPNIVYTSLVPAQPH--YNLNLQSIAVNGQTLQIDSSVF--ATSNS 305

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
            G I+DSG+ L Y   + Y    + FVS         +         CY +  +    FP
Sbjct: 306 RGTIVDSGTTLAYLAEEAY----DPFVSAITASIPQSVHTVVSRGNQCYLITSSVTEVFP 361

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFVY 322
            ++  F      I     ++I  +N     AV            + ++G    +D   VY
Sbjct: 362 QVSLNFAGGASMILRPQDYLIQ-QNSIGGAAVWCIGFQKIQGQGITILGDLVLKDKIVVY 420

Query: 323 DLNIDLLSFVKENCS 337
           DL    + +   +CS
Sbjct: 421 DLAGQRIGWANYDCS 435


>gi|242072067|ref|XP_002451310.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
 gi|241937153|gb|EES10298.1| hypothetical protein SORBIDRAFT_05g027510 [Sorghum bicolor]
          Length = 509

 Score = 71.6 bits (174), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 79/354 (22%), Positives = 138/354 (38%), Gaps = 58/354 (16%)

Query: 14  LLILDTGSALIYA----------------IFDPRKSSSFQKINCDHPDCTYFK-----CV 52
           L++LDT S + +                 ++DP KS S +   C  P C         C 
Sbjct: 183 LMLLDTASDVAWVQCFPCPASQCYAQTDVLYDPSKSRSSESFACSSPTCRQLGPYANGCS 242

Query: 53  NE-----QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
           +      QC Y ++Y D S T G    + +S+    +         FGCS+   G    +
Sbjct: 243 SSSNSAGQCQYRVRYPDGSTTSGTLVADQLSLSPTSQVPKFE----FGCSHAARGSFSRS 298

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
           +    AG++ L R   S +SQ  +   + FSYC          T+S+  F      RR S
Sbjct: 299 K---TAGIMALGRGVQSLVSQTSTKYGQVFSYCF-------PPTASHKGFFVLGVPRRSS 348

Query: 168 TQ--ATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
           ++   T  +  P   Y + L+ I++  +R++ PP  F        G  +DS +V+T    
Sbjct: 349 SRYAVTPMLKTP-MLYQVRLEAIAVAGQRLDVPPTVF------AAGAALDSRTVITRLPP 401

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCY-FLPETFNRFPSMAFYFE--DANLRID 282
             Y  L   F      ++ A  +     +  CY F   +    P+++  F+   A +++D
Sbjct: 402 TAYQALRSAFRDKMSMYRPAAANG---QLDTCYDFTGVSSIMLPTISLVFDRTGAGVQLD 458

Query: 283 GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENC 336
              V    + +     + A  D    +IG  Q +    +Y++    + F +  C
Sbjct: 459 PSGVL---FGSCLAFASTAGDDRATGIIGFLQLQTIEVLYNVAGGSVGFRRGAC 509


>gi|449445943|ref|XP_004140731.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Cucumis sativus]
          Length = 430

 Score = 71.6 bits (174), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 101/392 (25%), Positives = 160/392 (40%), Gaps = 86/392 (21%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-------------AIFDPRKSSSFQKIN------- 40
           +V L IGTP +   L+LDTGS L +              +  P+ +S    ++       
Sbjct: 67  VVSLPIGTPPQPTDLVLDTGSQLSWIQCHDKKVKKRLPPLPKPKTASFDPSLSSSFSLLP 126

Query: 41  CDHPDCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHG 91
           C+HP C            C  N  C Y+  YAD ++ +G    E  +         +   
Sbjct: 127 CNHPICKPRIPDFTLPTSCDQNRLCHYSYFYADGTLAEGNLVREKFTFSKSLSTPPV--- 183

Query: 92  ALFGC---SNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPL--- 145
            + GC   S +N G            +LG++   +SFISQ  + I K FSYC+       
Sbjct: 184 -ILGCAQASTENRG------------ILGMNHGRLSFISQ--AKISK-FSYCVPSRTGSN 227

Query: 146 PNGEY------TSSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPP 199
           P G +       SS  K+ T + +  P +Q++  ++     Y L +K I I  +R+N PP
Sbjct: 228 PTGLFYLGDNPNSSKFKYVTMLTF--PESQSSPNLDP--LAYTLPMKAIKIAGKRLNIPP 283

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-----ERFQLAQLSDCPEPI 254
             F     G G  +IDSGS LTY   + Y K+ E+ V        + +  A ++D     
Sbjct: 284 AAFKPDAGGSGQTMIDSGSDLTYLVDEAYEKVKEEVVRLVGAMMKKGYVYADVAD----- 338

Query: 255 QLCY---FLPETFNRFPSMAFYFEDANLRI---DGENVFIIDYENHFFLLAVAPHDDL-- 306
            +C+      E   R   ++F F D  + I    GE V + + E     + +   + L  
Sbjct: 339 -MCFDAGVTAEVGRRIGGISFEF-DNGVEIFVGRGEGV-LTEVEKGVKCVGIGRSERLGI 395

Query: 307 -VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
              +IG+  Q++    YDL    + F    CS
Sbjct: 396 GSNIIGTVHQQNMWVEYDLANKRVGFGGAECS 427


>gi|297801286|ref|XP_002868527.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314363|gb|EFH44786.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 444

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 74/273 (27%), Positives = 111/273 (40%), Gaps = 57/273 (20%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHP 44
           ++ L IGTPS+   L+LDTGS L +                  FDP  SSSF  + C HP
Sbjct: 82  ILSLPIGTPSQSQELVLDTGSQLSWIQCHPKKIKKPLPPPTTSFDPSLSSSFSDLPCSHP 141

Query: 45  DCTY--------FKC-VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFG 95
            C            C  N  C Y+  YAD +  +G    E  +         +    + G
Sbjct: 142 LCKPRIPDFTLPTSCDSNRLCHYSYFYADGTFAEGNLVKEKFTFSNSQTTPPL----ILG 197

Query: 96  CSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYL 155
           C+ ++           + G+LG++   +SFISQ  + I K FSYC+    P         
Sbjct: 198 CAKESTD---------VKGILGMNLGRLSFISQ--AKISK-FSYCI----PTRSNRPGLA 241

Query: 156 KFGTDMGYRRPSTQATKFIN---------HPNN---FYYLSLKDISIDNERMNFPPDTFD 203
             G+      P+++  K+++          PN     Y + L  I I  +R+N P   F 
Sbjct: 242 STGSFYLGENPNSRGFKYVSLLTFPQSQRMPNLDPLAYTVPLLGIRIGQKRLNIPSSVFR 301

Query: 204 ITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFV 236
               G G  ++DSGS  T+     Y K+ E+ V
Sbjct: 302 PDAGGSGQTMVDSGSEFTHLVDVAYDKVKEEIV 334


>gi|77808087|gb|AAS48510.2| aspartic protease [Fagopyrum esculentum]
 gi|82780908|gb|ABB88696.2| aspartic proteinase-like protein [Fagopyrum esculentum]
          Length = 447

 Score = 71.2 bits (173), Expect = 7e-10,   Method: Compositional matrix adjust.
 Identities = 87/377 (23%), Positives = 149/377 (39%), Gaps = 60/377 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA-------------------------IFDPRKSSSFQKIN 40
           +GTP + V L+LDTGS+L++                          I+   KSS+ Q + 
Sbjct: 80  LGTPPQKVSLVLDTGSSLVWTPCTIPTATYTCQNCTFSGVDPTKIPIYARNKSSTVQSLP 139

Query: 41  CDHPDCTYFKCVNEQCVYTMK--YADQSVTKGFAAHETIS-VIGKGEGKAIFHGALFGCS 97
           C  P C +    +  C  T +  Y       G    + +S V+G  +   I    LFGCS
Sbjct: 140 CRSPKCNWVFGSDLNCSTTKRCPYYGLEYGLGSTTGQLVSDVLGLSKLNRI-PDFLFGCS 198

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
             ++   E        G+ G  R   S  +QLG     +FSYCLV    +    S  L  
Sbjct: 199 LVSNRQPE--------GIAGFGRGLASIPAQLGLT---KFSYCLVSHRFDDTPQSGDLVL 247

Query: 158 GTDMGYRRPSTQA-----TKFINHP-----NNFYYLSLKDISIDNERMNFPPDTFDITVS 207
               G R     A       F   P     + +YY+SL  I +  + +  PP     +  
Sbjct: 248 --HRGRRHADAAANGVAYAPFTKSPALSPYSEYYYISLSKILVGGKDVPIPPRYLVPSKE 305

Query: 208 GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLP-ETFNR 266
           G+GG I+DSGS  T+    ++  +  +   +  +++ A+  +    +  CY +  ++   
Sbjct: 306 GDGGMIVDSGSTFTFMERIIFDPVARELEKHMTKYKRAKEIEDSSGLGPCYNITGQSEVD 365

Query: 267 FPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHDD------LVALIGSQQQRDTR 319
            P + F F+  AN+ +   + F +  +    +  +   D+         ++G+ QQ++  
Sbjct: 366 VPKLTFSFKGGANMDLPLTDYFSLVTDGVVCMTVLTDPDEPGSTTGPAIILGNYQQQNFY 425

Query: 320 FVYDLNIDLLSFVKENC 336
             YDL      F  + C
Sbjct: 426 IEYDLKKQRFGFKPQQC 442


>gi|296090179|emb|CBI39998.3| unnamed protein product [Vitis vinifera]
          Length = 334

 Score = 70.9 bits (172), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 82/355 (23%), Positives = 150/355 (42%), Gaps = 66/355 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDC 46
           ++++ IGTP   V  I DTGS L++               +FDP KS+SF++++C+    
Sbjct: 25  LMKISIGTPPFDVYGIYDTGSDLMWTQCLPCLSCYKQKNPMFDPSKSTSFKEVSCE---- 80

Query: 47  TYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHG-FDE 105
                 ++QC    +  D   +                        +FGC ++N G F+E
Sbjct: 81  ------SQQC----RLLDTPTS--------------------ILNIVFGCGHNNSGTFNE 110

Query: 106 DARDGALAGVLGLSRVTISFISQLGSII--KKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
           +       G+ G     +S  SQ+ S +   ++FS CLV P       +S + FG +   
Sbjct: 111 NE-----MGLFGTGGRPLSLTSQIMSTLGSGRKFSQCLV-PFRTDPSITSKIIFGPEAEV 164

Query: 164 RRPSTQATKFINHPN-NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTY 222
                 +T  +   +  +Y+++L  IS+ ++   F   +    ++ +G   ID+G+  T 
Sbjct: 165 SGSDVVSTPLVTKDDPTYYFVTLDGISVGDKLFPF---SSSSPMATKGNVFIDAGTPPTL 221

Query: 223 FHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRID 282
              D Y +L +      E   +  + D     QLCY    T    P +  +F+ A++++ 
Sbjct: 222 LPRDFYNRLVQGVK---EAIPMEPVQDPDLQPQLCY-RSATLIDGPILTAHFDGADVQLK 277

Query: 283 GENVFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             N FI   E   +  A+ P D    + G+  Q +    +DL+   +SF   +C+
Sbjct: 278 PLNTFISPKEG-VYCFAMQPIDGDTGIFGNFVQMNFLIGFDLDGKKVSFKAVDCT 331


>gi|449456068|ref|XP_004145772.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449496218|ref|XP_004160076.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 500

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 90/376 (23%), Positives = 141/376 (37%), Gaps = 59/376 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           R+ +G P K   + +DTGS +++                     FDP  S++   ++C  
Sbjct: 86  RVQLGNPPKDFYVQIDTGSDVLWVSCNSCNGCPATSGLQIPLNFFDPGSSTTASLVSCSD 145

Query: 44  PDCTY------FKCVNE--QCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
             C          C  +  QC Y  +Y D S T G+   + I    VI            
Sbjct: 146 QICALGVQSSDSACFGQSNQCAYVFQYGDGSGTSGYYVMDMIHLDVVIDSSVTSNSSASV 205

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  K FS+C    L   + 
Sbjct: 206 VFGCSTSQTG-DLTKSDRAVDGIFGFGQQDLSVISQLSSRGIAPKVFSHC----LKGDDS 260

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               L  G  +    P+   T  + + P+  Y L+L+ IS++ + +   P  F    S  
Sbjct: 261 GGGILVLGEIV---EPNVVYTPLVPSQPH--YNLNLQSISVNGQVLPISPAVF--ATSSS 313

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-FP 268
            G IIDSG+ L Y   + Y      FV                    CY    + +  FP
Sbjct: 314 QGTIIDSGTTLAYLAEEAY----NAFVVAVTNIVSQSTQSVVLKGNRCYVTSSSVSDIFP 369

Query: 269 SMAFYFEDANLRIDGENVFIIDYENHFFLLAV-------APHDDLVALIGSQQQRDTRFV 321
            ++  F      + G   ++I  +N      V        P    + ++G    +D  F+
Sbjct: 370 QVSLNFAGGASLVLGAQDYLIQ-QNSVGGTTVWCIGFQKIPGQG-ITILGDLVLKDKIFI 427

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 428 YDLANQRIGWTNYDCS 443


>gi|15235526|ref|NP_193028.1| aspartyl protease family protein [Arabidopsis thaliana]
 gi|5123933|emb|CAB45491.1| putative protein [Arabidopsis thaliana]
 gi|7267994|emb|CAB78334.1| putative protein [Arabidopsis thaliana]
 gi|332657803|gb|AEE83203.1| aspartyl protease family protein [Arabidopsis thaliana]
          Length = 389

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 89/364 (24%), Positives = 137/364 (37%), Gaps = 63/364 (17%)

Query: 1   MVRLFIGTPSKGVLLILDTGSAL---------------IYAIFDPRKSSSFQKINCD--H 43
           M  +  G+P K   L +DTGS+L               IY  + P  S +++   C+  H
Sbjct: 59  MAEIHFGSPQKKQFLHMDTGSSLTWTQCFPCSDCYAQKIYPKYRPAASITYRDAMCEDSH 118

Query: 44  PDCT---YFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
           P       F  +   C Y   Y D++  KG  A E I+V     G    HG  FGC+   
Sbjct: 119 PKSNPHFAFDPLTRICTYQQHYLDETNIKGTLAQEMITVDTHDGGFKRVHGVYFGCNT-- 176

Query: 101 HGFDEDARDGAL---AGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
                   DG+     G+LGL     S I + GS    +FS+CL      GE +      
Sbjct: 177 ------LSDGSYFTGTGILGLGVGKYSIIGEFGS----KFSFCL------GEISEPKASH 220

Query: 158 GTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSG 217
              +G           IN         L+ I +  E          IT+       +D+G
Sbjct: 221 NLILGDGANVQGHPTVINITEGHTIFQLESIIVGEE----------ITLDDPVQVFVDTG 270

Query: 218 SVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE-- 275
           S L++  +++Y+K  + F        L+      EP  LCY   +T  R   M   F+  
Sbjct: 271 STLSHLSTNLYYKFVDAFDDLIGSRPLSY-----EPT-LCY-KADTIERLEKMDVGFKFD 323

Query: 276 -DANLRIDGENVFIIDYENHFFLLAVAPHDDLVA--LIGSQQQRDTRFVYDLNIDLLSFV 332
             A L ++  N+FI         LA+  + +  +  +IG    +     YDL+       
Sbjct: 324 VGAELSVNIHNIFIQQGPPEIRCLAIQNNKESFSHVIIGVIAMQGYNVGYDLSAKTAYIN 383

Query: 333 KENC 336
           K++C
Sbjct: 384 KQDC 387


>gi|88174556|gb|ABD39353.1| chloroplast nucleoid DNA-binding protein [Oryza barthii]
          Length = 321

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/342 (25%), Positives = 143/342 (41%), Gaps = 53/342 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 118 FGANE---FGNVDGLLGMGAGAMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVV 224

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      R   A+     E  + CY +        P+++ 
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +F+D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 281 HFDDGARFDLGRGGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


>gi|302774304|ref|XP_002970569.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
 gi|300162085|gb|EFJ28699.1| hypothetical protein SELMODRAFT_93861 [Selaginella moellendorffii]
          Length = 490

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 91/364 (25%), Positives = 150/364 (41%), Gaps = 53/364 (14%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYAI------------FDPRKSSSFQKINCDHPDCTYFK 50
           R+ IGTP     LI+D  S +                F P  SSS++ + C + +C+   
Sbjct: 38  RVKIGTPPHEFSLIVDRSSFVSPKTMFCSFFFLQDPRFSPALSSSYKPLECGN-ECSTGF 96

Query: 51  CVNEQCVYTMKYADQSVTKGFAAHETISVIGKGE--GKAIFHGALFGCSNDNHGFDEDAR 108
           C   +  Y  +YA++S + G    + IS     +  G+ +    +FGC     G   D  
Sbjct: 97  CDGSR-KYQRQYAEKSTSSGVLGKDVISFSNSSDLGGQRL----VFGCETAETG---DLY 148

Query: 109 DGALAGVLGLSRVTISFISQL--GSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRP 166
           D    G++GL R  +S I QL   + ++  FS C        +     +  G   G++ P
Sbjct: 149 DQTADGIIGLGRGPLSIIDQLVEKNAMEDVFSLCY----GGMDEGGGAMILG---GFQPP 201

Query: 167 STQA-TKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHS 225
                T    H + +Y L LK I +    +   P+ FD    G+ G ++DSG+   YF  
Sbjct: 202 KDMVFTSSDPHRSPYYNLMLKGIRVGGSPLRLKPEVFD----GKYGTVLDSGTTYAYFPG 257

Query: 226 DVYWKLHEKFVSYFERFQLAQLSDCPEPIQ----LCYFLPET-----FNRFPSMAFYFED 276
             +    + F S  +  Q+  L + P P +    +CY    T        FPS+ F F D
Sbjct: 258 AAF----QAFKSAVKE-QVGSLKEVPGPDEKFKDICYAGAGTNVSNLSQFFPSVDFVFGD 312

Query: 277 A-NLRIDGEN-VFIIDYENHFFLLAVAPHDDLVALIGSQQQRDTRFVYDLNIDLLSFVKE 334
             ++ +  EN +F     +  + L V  + D   L+G    R+    Y+     + F+K 
Sbjct: 313 GQSVTLSPENYLFRHTKISGAYCLGVFENGDPTTLLGGIIVRNMLVTYNRGKASIGFLKT 372

Query: 335 NCSD 338
            C+D
Sbjct: 373 KCND 376


>gi|297812425|ref|XP_002874096.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297319933|gb|EFH50355.1| aspartyl protease family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 493

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 85/376 (22%), Positives = 155/376 (41%), Gaps = 59/376 (15%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           ++ +G+P +   + +DTGS +++                     FDP  S +   ++C  
Sbjct: 84  KIRLGSPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTATPVSCSD 143

Query: 44  PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
             C++            N  C YT +Y D S T GF   + +    ++G           
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  +  + FS+CL      GE 
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGLAPRVFSHCL-----KGEN 257

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +    ++    P+   T  + + P+  Y ++L  IS++ + +   P  F  T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-NRFP 268
            G IID+G+ L Y     Y    E   +   +     +S   +    CY +  +  + FP
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQSVRPVVSKGNQ----CYVIATSVADIFP 367

Query: 269 SMAFYFE-DANLRIDGENVFIIDYENHFFLLAV------APHDDLVALIGSQQQRDTRFV 321
            ++  F   A++ ++ ++  I   +N+    AV         +  + ++G    +D  FV
Sbjct: 368 PVSLNFAGGASMFLNPQDYLI--QQNNVGGTAVWCIGFQRIQNQGITILGDLVLKDKIFV 425

Query: 322 YDLNIDLLSFVKENCS 337
           YDL    + +   +CS
Sbjct: 426 YDLVGQRIGWANYDCS 441


>gi|242067689|ref|XP_002449121.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
 gi|241934964|gb|EES08109.1| hypothetical protein SORBIDRAFT_05g005400 [Sorghum bicolor]
          Length = 358

 Score = 70.5 bits (171), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 86/323 (26%), Positives = 130/323 (40%), Gaps = 59/323 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYAIFD-PRKS-----------SSFQKINCDHPDCTYF 49
           V + IG P+K   L +DTGS L +   D P +S           ++   + C +  CT  
Sbjct: 56  VTMNIGNPAKPYFLDVDTGSDLTWLQCDAPCRSCNKVPHPLYRPTANSLVPCANALCTAL 115

Query: 50  --------KCVN-EQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                   KC + +QC Y +KY D + ++G   ++  S+  +     I  G  FGC  D 
Sbjct: 116 HSGHGSNNKCPSPKQCDYQIKYTDSASSQGVLINDNFSLPMRSSN--IRPGLTFGCGYDQ 173

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYLKFG 158
                 A   A  G+LGL R ++S +SQL    I K    +CL     NG     +L FG
Sbjct: 174 QVGKNGAVQAATDGMLGLGRGSVSLVSQLKQQGITKNVLGHCLST---NG---GGFLFFG 227

Query: 159 TDMGYRRPSTQAT--KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDS 216
            D+    P+++ T         N+Y      +  D   +   P             + DS
Sbjct: 228 DDI---VPTSRVTWVPMAKISGNYYSPGSGTLYFDRRSLGVKPMEV----------VFDS 274

Query: 217 GSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETF-------NRFPS 269
           GS  TYF +  Y  +     S   +  L Q+SD   P  LC+  P+ F         F S
Sbjct: 275 GSTYTYFTAQPYQAVVSALKSGLSK-SLKQVSDPSLP--LCWKGPKAFKSVFDVKKEFKS 331

Query: 270 MAFYF---EDANLRIDGENVFII 289
           +   F   ++A + I  EN  I+
Sbjct: 332 LFLSFASAKNAVMEIPPENYLIV 354


>gi|296086208|emb|CBI31649.3| unnamed protein product [Vitis vinifera]
          Length = 761

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 79/370 (21%), Positives = 137/370 (37%), Gaps = 105/370 (28%)

Query: 2   VRLFIGTPSKGVLLILDTGSAL----------IYAIFDPRKSSSFQKINCDHPDCTYFKC 51
           V L +G+P + V ++LDTGS L          ++++FDP +SSS+  I C  P C     
Sbjct: 377 VSLTVGSPPQTVTMVLDTGSELSWLHCKKAPNLHSVFDPLRSSSYSPIPCTSPTC----- 431

Query: 52  VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
                            +     +T  +IG   G                          
Sbjct: 432 -----------------RTRTHSKTTGLIGMNRG-------------------------- 448

Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
                     ++SF++Q+G    ++FSYC+     +G+ +S  L FG        + + T
Sbjct: 449 ----------SLSFVTQMG---LQKFSYCI-----SGQDSSGILLFGESSFSWLKALKYT 490

Query: 172 KF--INHPNNF-----YYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFH 224
               I+ P  +     Y + L+ I + N  +  P   +    +G G  ++DSG+  T+  
Sbjct: 491 PLVQISTPLPYFDRVAYTVQLEGIKVANSMLQLPKSVYAPDHTGAGQTMVDSGTQFTFLL 550

Query: 225 SDVYWKLHEKFVSYFERFQLAQLSDCPEP-------IQLCYFLP---ETFNRFPSMAFYF 274
             VY  L  +FV    R   A L    +P       + LCY +P    T    P++   F
Sbjct: 551 GPVYTALKNEFV----RQTKASLKVLEDPNFVFQGAMDLCYRVPLTRRTLPPLPTVTLMF 606

Query: 275 EDANLRIDGENVF-----IIDYENHFFLLAVAPHDDLVA---LIGSQQQRDTRFVYDLNI 326
             A + +  E +      +I   +  +       + L     +IG   Q++    +DL  
Sbjct: 607 RGAEMSVSAERLMYRVPGVIRGSDSVYCFTFGNSELLGVESYIIGHHHQQNVWMEFDLAK 666

Query: 327 DLLSFVKENC 336
             + F +  C
Sbjct: 667 SRVGFAEVRC 676


>gi|449522369|ref|XP_004168199.1| PREDICTED: LOW QUALITY PROTEIN: aspartic proteinase
           nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 150/391 (38%), Gaps = 75/391 (19%)

Query: 4   LFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKINC 41
           L  GTP + + LI DTGS+L++                        F P+ SSS + + C
Sbjct: 85  LSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGC 144

Query: 42  DHPDCTYF--KCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
            +P C++     V  QC                Y ++Y   S T G    ET+    K  
Sbjct: 145 QNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDK-- 201

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                   + GCS     F    +    +G+ G  R + S  SQ+G    K+F+YCL   
Sbjct: 202 ---XIPNFVVGCS-----FLSIHQP---SGIAGFGRGSESLPSQMG---LKKFAYCLASR 247

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP---NN----FYYLSLKDISIDNERMNF 197
             +    S  L   +  G +      T F  +P   NN    +YYL+++ I + N+ +  
Sbjct: 248 KFDDSPHSGQLILDS-TGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV 306

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
           P         G GG IIDSGS  T+    V   +  +F      +  A   +    ++ C
Sbjct: 307 PYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPC 366

Query: 258 YFL-PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD---------DL 306
           + +  E   +FP + F F+  A   +   N F +   +    L V  H            
Sbjct: 367 FDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGP 426

Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ++G+ QQ++    YDL    L F ++ CS
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|357162717|ref|XP_003579500.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Brachypodium
           distachyon]
          Length = 488

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 98/411 (23%), Positives = 153/411 (37%), Gaps = 95/411 (23%)

Query: 6   IGTPSKGVLLILDTGSALIYA----------------------IFDPRKSSSFQKINCDH 43
           +GTP + + ++LDTGS L +                       +F P+ SSS + + C +
Sbjct: 97  LGTPPQPLPVLLDTGSHLSWVPCTSSYQCRNCSSSPSAMSAMAVFHPKNSSSSRLVGCRN 156

Query: 44  PDCTYFKCVNEQCV--------------YTMKYADQSVTKGFAAHETISV--IGKGEGKA 87
           P C +    +                  Y + Y   S T G    +T+ +         A
Sbjct: 157 PACRWIHSKSPSTCGSTGNNGNGDVCPPYLVVYGSGS-TSGLLISDTLRLSPSSSSSAPA 215

Query: 88  IFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVI-PLP 146
            F     GCS         +     +G+ G  R   S  SQL      +FSYCL+     
Sbjct: 216 PFRNFAIGCS-------IVSVHQPPSGLAGFGRGAPSVPSQLKV---PKFSYCLLSRRFD 265

Query: 147 NGEYTSSYLKFGTDM---GYRRPSTQATKFINHPNN------FYYLSLKDISIDNERMNF 197
           +    S  L  G  M   G ++ + Q    +N+  +      +YYL+L  IS+  + +N 
Sbjct: 266 DNSAVSGELVLGDAMVPAGKKKTTMQYVPLLNNAASKPPYSVYYYLALTGISVGGKPVNL 325

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF-ERFQLAQLSDCPEPIQL 256
           P   F    S  GG IIDSG+  TY    V+  +     S    R+  ++  +    ++ 
Sbjct: 326 PSRAF--VPSSGGGAIIDSGTTFTYLDPTVFKPVAAAMESAVGGRYNRSRPVEDALGLRP 383

Query: 257 CYFLPETFNRFPSMAFYFEDANLRIDGENVFIIDYENHF---------------FLLAVA 301
           C+ LP      P  A    D  L+  G  V  +  EN+F                 LAV 
Sbjct: 384 CFALPPG----PGGAMELPDLELKFKGGAVMRLPVENYFVAAGPAGGPAAGPVAICLAVV 439

Query: 302 PHDDLVA------------LIGSQQQRDTRFVYDLNIDLLSFVKENCSDDS 340
              DL A            ++GS QQ++    YDL  + L F ++ C+  S
Sbjct: 440 --SDLPASGGDGAAAGPAIILGSFQQQNYHIEYDLGKERLGFRQQPCAPKS 488


>gi|357137832|ref|XP_003570503.1| PREDICTED: aspartic proteinase Asp1-like [Brachypodium distachyon]
          Length = 564

 Score = 70.1 bits (170), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 82/314 (26%), Positives = 125/314 (39%), Gaps = 52/314 (16%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
           +F+G P +   L +DTGS L +                ++ P K           Q++  
Sbjct: 198 IFVGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQG 257

Query: 42  DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           D   C   K    QC Y ++YAD+S + G  A + + +I    G+      +FGC+ D  
Sbjct: 258 DQNYCATCK----QCDYEIEYADRSSSMGVLAKDDMHMIATNGGREKLD-FVFGCAYDQQ 312

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G           G+LGLS   IS  SQL S  II   F +C +   PNG     Y+  G 
Sbjct: 313 G-QLLTSPAKTDGILGLSSAAISLPSQLASQGIISNVFGHC-ITKEPNG---GGYMFLGD 367

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG----CIID 215
           D   R   T A      P+N Y+   + ++  ++++          + G+ G     I D
Sbjct: 368 DYVPRWGMTWA-PIRGGPDNLYHTEAQKVNYGDQQLR---------MHGQAGSSIQVIFD 417

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFE 275
           SGS  TY   ++Y KL       +  F +   SD   P  LC+           +  +F+
Sbjct: 418 SGSSYTYLPDEIYKKLVTAIKYDYPSF-VQDTSDTTLP--LCWKADFDVRYLEDVKQFFK 474

Query: 276 DANLRIDGENVFII 289
             NL   G   F+I
Sbjct: 475 PLNLHF-GNRWFVI 487


>gi|88174561|gb|ABD39355.1| chloroplast nucleoid DNA-binding protein [Oryza longistaminata]
          Length = 321

 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/340 (24%), Positives = 144/340 (42%), Gaps = 49/340 (14%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  +L +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQILEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++     +  +      FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----SFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY---TSSYLKF 157
            G +E    G + G+LG+    +S + Q  S     FSYCL + +    +   T+ Y   
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCLPLQMSERGFFSKTTGYFSL 173

Query: 158 GTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIID 215
           G      R   + TK +    N   +++ L  IS+D ER+   P  F        G + D
Sbjct: 174 GKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSIFS-----RKGVVFD 226

Query: 216 SGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAFYF 274
           SGS L+Y        L ++      R   A+     E  + CY +        P+++ +F
Sbjct: 227 SGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISLHF 282

Query: 275 ED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 283 DDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


>gi|224104765|ref|XP_002313558.1| predicted protein [Populus trichocarpa]
 gi|222849966|gb|EEE87513.1| predicted protein [Populus trichocarpa]
          Length = 468

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 86/383 (22%), Positives = 143/383 (37%), Gaps = 71/383 (18%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCD 42
            RL +GTP +   + +DTGS +++                     FDP  S +   I+C 
Sbjct: 54  TRLQLGTPPRDFYVQIDTGSDVLWVSCGSCNGCPVNSGLHIPLNFFDPGSSPTASLISCS 113

Query: 43  HPDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETI---SVIGKGEGKAIFHG 91
              C+             N  C Y  +Y D S T G+   + +   +V+G          
Sbjct: 114 DQRCSLGLQSSDSVCSAQNNLCGYNFQYGDGSGTSGYYVSDLLHFDTVLGGSVMNNSSAP 173

Query: 92  ALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGE 149
            +FGCS    G D    D A+ G+ G  +  +S +SQL S  I  + FS+C    L   +
Sbjct: 174 IVFGCSALQTG-DLTKSDRAVDGIFGFGQQDMSVVSQLASQGISPRAFSHC----LKGDD 228

Query: 150 YTSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSG 208
                L  G  +    P+   T  + + P+  Y L+++ IS++ + +   P  F    S 
Sbjct: 229 SGGGILVLGEIV---EPNIVYTPLVPSQPH--YNLNMQSISVNGQTLAIDPSVFG--TSS 281

Query: 209 EGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNR-F 267
             G IIDSG+ L Y     Y    + F+S         +         CY +  + N  F
Sbjct: 282 SQGTIIDSGTTLAYLAEAAY----DPFISAITSIVSPSVRPYLSKGNHCYLISSSINDIF 337

Query: 268 PSMAFYFEDA-------------NLRIDGENVFIIDYENHFFLLAVAPHDDLVALIGSQQ 314
           P ++  F                   I G  ++ I ++              + ++G   
Sbjct: 338 PQVSLNFAGGASMILIPQDYLIQQSSIGGAALWCIGFQKI--------QGQGITILGDLV 389

Query: 315 QRDTRFVYDLNIDLLSFVKENCS 337
            +D  FVYD+    + +   +CS
Sbjct: 390 LKDKIFVYDIANQRIGWANYDCS 412


>gi|449437856|ref|XP_004136706.1| PREDICTED: aspartic proteinase nepenthesin-2-like [Cucumis sativus]
          Length = 457

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 95/391 (24%), Positives = 150/391 (38%), Gaps = 75/391 (19%)

Query: 4   LFIGTPSKGVLLILDTGSALIY----------------------AIFDPRKSSSFQKINC 41
           L  GTP + + LI DTGS+L++                        F P+ SSS + + C
Sbjct: 85  LSFGTPQQTLHLIFDTGSSLVWFPCTSRYLCSECSFPKIDPTGIPRFVPKLSSSSKLVGC 144

Query: 42  DHPDCTYF--KCVNEQC---------------VYTMKYADQSVTKGFAAHETISVIGKGE 84
            +P C++     V  QC                Y ++Y   S T G    ET+    K  
Sbjct: 145 QNPKCSWIFGPDVKSQCRSCNPKTENCTQTCPAYVVQYGSGS-TAGLLLSETLDFPDKK- 202

Query: 85  GKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIP 144
                   + GCS     F    +    +G+ G  R + S  SQ+G    K+F+YCL   
Sbjct: 203 ----IPNFVVGCS-----FLSIHQP---SGIAGFGRGSESLPSQMG---LKKFAYCLASR 247

Query: 145 LPNGEYTSSYLKFGTDMGYRRPSTQATKFINHP---NN----FYYLSLKDISIDNERMNF 197
             +    S  L   +  G +      T F  +P   NN    +YYL+++ I + N+ +  
Sbjct: 248 KFDDSPHSGQLILDS-TGVKSSGLTYTPFRQNPSVSNNAYKEYYYLNIRKIIVGNQAVKV 306

Query: 198 PPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLC 257
           P         G GG IIDSGS  T+    V   +  +F      +  A   +    ++ C
Sbjct: 307 PYKFLVPGPDGNGGSIIDSGSTFTFMDKPVLEVVAREFEKQLANWTRATDVETLTGLRPC 366

Query: 258 YFL-PETFNRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLAVAPHD---------DL 306
           + +  E   +FP + F F+  A   +   N F +   +    L V  H            
Sbjct: 367 FDISKEKSVKFPELIFQFKGGAKWALPLNNYFALVSSSGVACLTVVTHQMEDGGGGGGGP 426

Query: 307 VALIGSQQQRDTRFVYDLNIDLLSFVKENCS 337
             ++G+ QQ++    YDL    L F ++ CS
Sbjct: 427 SVILGAFQQQNFYVEYDLVNQRLGFRQQTCS 457


>gi|42407406|dbj|BAD09564.1| nucleoid DNA-binding protein-like [Oryza sativa Japonica Group]
          Length = 205

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 80/175 (45%), Gaps = 15/175 (8%)

Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QA 170
           ++GL R  +S +SQLG     RFSYCL   L       ++  F T  G    S+    Q+
Sbjct: 1   MVGLGRGLLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLPVQS 57

Query: 171 TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
           T  + +    + Y++SLK IS+  +R+   P  F I   G GG  IDSG+ LT+   DVY
Sbjct: 58  TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVY 117

Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
             +  + VS      L   +D    ++ C+  P      P++     D  L  DG
Sbjct: 118 DAVRRELVSVLR--PLPPANDTEIGLETCFPWPPP----PTVTMTVPDMELHFDG 166


>gi|125561847|gb|EAZ07295.1| hypothetical protein OsI_29543 [Oryza sativa Indica Group]
          Length = 205

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 54/175 (30%), Positives = 80/175 (45%), Gaps = 15/175 (8%)

Query: 115 VLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPST----QA 170
           ++GL R  +S +SQLG     RFSYCL   L       ++  F T  G    S+    Q+
Sbjct: 1   MVGLGRGLLSLVSQLG---PSRFSYCLTSFLSPEPSRLNFGVFATLNGTNASSSGLPVQS 57

Query: 171 TKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVY 228
           T  + +    + Y++SLK IS+  +R+   P  F I   G GG  IDSG+ LT+   DVY
Sbjct: 58  TPLVVNAALPSLYFMSLKGISLGQKRLPIDPLVFAINDDGTGGVFIDSGTSLTWLQQDVY 117

Query: 229 WKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMAFYFEDANLRIDG 283
             +  + VS       A  +D    ++ C+  P      P++     D  L  DG
Sbjct: 118 DAVRRELVSVLRPLPPA--NDTEIGLETCFPWPPP----PTVTMTVPDMELHFDG 166


>gi|413938615|gb|AFW73166.1| hypothetical protein ZEAMMB73_633272 [Zea mays]
          Length = 386

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 90/372 (24%), Positives = 140/372 (37%), Gaps = 70/372 (18%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYA-----------------IFDPRKSSSFQKINCDH 43
           +V   +GTP     + +DTGS L +                  +FDP +SSS+  + C  
Sbjct: 49  VVTASLGTPGVAQTMEVDTGSDLSWVQCKPCAAAPSCYSQKDPLFDPAQSSSYAAVPCGG 108

Query: 44  PDCTYFKCVNEQCV------YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCS 97
           P C                 Y + Y D S T G  + +T+++       +   G  FGC 
Sbjct: 109 PVCAGLGIYAASACSAAQCGYVVSYGDGSNTTGVYSSDTLTL----SASSAVQGFFFGCG 164

Query: 98  NDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKF 157
           +   G         + G+LGL R   S + Q        FSYC    LP    T+ YL  
Sbjct: 165 HAQSGLFN-----GVDGLLGLGREQPSLVEQTAGTYGGVFSYC----LPTKPSTAGYLTL 215

Query: 158 GTDM-GYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCII 214
           G        P    T+ +  PN   +Y + L  IS+  ++++ P   F           +
Sbjct: 216 GVGGPSGAAPGFSTTQLLPSPNAPTYYVVMLTGISVGGQQLSVPASAFAGGTV------V 269

Query: 215 DSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEP----IQLCY-FLPETFNRFPS 269
           D+G+V+T      Y  L   F     R  +A       P    +  CY F        P+
Sbjct: 270 DTGTVVTRLPPTAYAALRSAF-----RSGMASYGYPTAPSNGILDTCYNFAGYGTVTLPN 324

Query: 270 MAFYF-EDANLRIDGENVFIIDYENHFFLLAVAP--HDDLVALIGSQQQRDTRFVYDLNI 326
           +A  F   A + +  + +        F  LA AP   D  +A++G+ QQR     +++ I
Sbjct: 325 VALTFGSGATVTLGADGIL------SFGCLAFAPSGSDGGMAILGNVQQRS----FEVRI 374

Query: 327 DLLS--FVKENC 336
           D  S  F   +C
Sbjct: 375 DGTSVGFKPSSC 386


>gi|225465839|ref|XP_002264668.1| PREDICTED: aspartic proteinase nepenthesin-1-like isoform 2 [Vitis
           vinifera]
          Length = 451

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 81/372 (21%), Positives = 147/372 (39%), Gaps = 57/372 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIY-----------AIFDPRKSSSFQKINCDHPDCTYF 49
           +VR  IGTP++ +L+ +DT S + +            +F+   S++++ + C    C   
Sbjct: 102 IVRAKIGTPAQTMLMAMDTSSDVAWIPCNGCLGCSSTLFNSPASTTYKSLGCQAAQCKQV 161

Query: 50  -----------------KCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA 92
                             C    C + + Y   S+    +  +TI++           G 
Sbjct: 162 LHLLSPLLTSPSVVPKPTCGGGVCSFNLTYGGSSLAANLS-QDTITLATDA-----VPGY 215

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTS 152
            FGC     G    A+           R  +S +SQ  ++ +  FSYCL  P       S
Sbjct: 216 SFGCIQKATGGSLPAQGLLGL-----GRGPLSLLSQTQNLYQSTFSYCL--PSFKSLNFS 268

Query: 153 SYLKFGTDMGYRRPSTQATKFINHPN--NFYYLSLKDISIDNERMNFPPDTFDITVSGEG 210
             L+ G     +R   + T  + +P   + Y+++L  + +    ++ PP +F    S   
Sbjct: 269 GSLRLGPVGQPKR--IKYTPLLKNPRRPSLYFVNLMAVRVGRRVVDVPPGSFTFNPSTGA 326

Query: 211 GCIIDSGSVLTYFHSDVYWKLHEKFVSYFER-FQLAQLSDCPEPIQLCYFLPETFNRFPS 269
           G I DSG+V T   +  Y  + + F +   R   +  L         CY +P      P+
Sbjct: 327 GTIFDSGTVFTRLVTPAYIAVRDAFRNRVGRNLTVTSLGG----FDTCYTVPIA---APT 379

Query: 270 MAFYFEDANLRIDGENVFIIDYENHFFLLAVAPHDD----LVALIGSQQQRDTRFVYDLN 325
           + F F   N+ +  +N+ I         LA+A   D    ++ +I + QQ++ R +YD+ 
Sbjct: 380 ITFMFTGMNVTLPPDNLLIHSTAGSTTCLAMAAAPDNVNSVLNVIANLQQQNHRLLYDVP 439

Query: 326 IDLLSFVKENCS 337
              L   +E C+
Sbjct: 440 NSRLGVARELCT 451


>gi|224031303|gb|ACN34727.1| unknown [Zea mays]
 gi|413923868|gb|AFW63800.1| hypothetical protein ZEAMMB73_012138 [Zea mays]
          Length = 557

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/252 (27%), Positives = 105/252 (41%), Gaps = 40/252 (15%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
           +FIG P +   L +DTGS L +                ++ P K           Q++  
Sbjct: 191 IFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPAKEKIVPPRDLLCQELQG 250

Query: 42  DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           +   C   K    QC Y ++YADQS + G  A + + +I    G+      +FGC+ D  
Sbjct: 251 NQNYCETCK----QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLD-FVFGCAYDQQ 305

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G    +      G+LGLS   ISF SQL S  II   F +C+      G     Y+  G 
Sbjct: 306 G-QLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGG----GYMFLGD 360

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           D   R   T  T   + P+N Y+     +   ++++   P+    TV      I DSGS 
Sbjct: 361 DYVPRWGVTW-TSIRSGPDNLYHTQAHHVKYGDQQLRR-PEQAGSTVQ----VIFDSGSS 414

Query: 220 LTYFHSDVYWKL 231
            TY  +++Y  L
Sbjct: 415 YTYLPNEIYENL 426


>gi|226495667|ref|NP_001146721.1| uncharacterized protein LOC100280323 [Zea mays]
 gi|219888491|gb|ACL54620.1| unknown [Zea mays]
          Length = 557

 Score = 69.7 bits (169), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 70/252 (27%), Positives = 105/252 (41%), Gaps = 40/252 (15%)

Query: 4   LFIGTPSKGVLLILDTGSALIY---------------AIFDPRKSSSF-------QKINC 41
           +FIG P +   L +DTGS L +                ++ P K           Q++  
Sbjct: 191 IFIGNPPRPYFLDVDTGSDLTWIQCDAPCTNFAKGPHPLYKPAKEKIVPPRDLLCQELQG 250

Query: 42  DHPDCTYFKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNH 101
           +   C   K    QC Y ++YADQS + G  A + + +I    G+      +FGC+ D  
Sbjct: 251 NQNYCETCK----QCDYEIEYADQSSSMGVLARDDMHMIATNGGREKLD-FVFGCAYDQQ 305

Query: 102 GFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGT 159
           G    +      G+LGLS   ISF SQL S  II   F +C+      G     Y+  G 
Sbjct: 306 G-QLLSSPAKTDGILGLSSAAISFPSQLASHGIIANVFGHCITREQGGG----GYMFLGD 360

Query: 160 DMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSV 219
           D   R   T  T   + P+N Y+     +   ++++   P+    TV      I DSGS 
Sbjct: 361 DYVPRWGVTW-TSIRSGPDNLYHTQAHHVKYGDQQLRR-PEQAGSTVQ----VIFDSGSS 414

Query: 220 LTYFHSDVYWKL 231
            TY  +++Y  L
Sbjct: 415 YTYLPNEIYENL 426


>gi|449440161|ref|XP_004137853.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
 gi|449521209|ref|XP_004167622.1| PREDICTED: aspartic proteinase-like protein 2-like [Cucumis
           sativus]
          Length = 492

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 83/333 (24%), Positives = 133/333 (39%), Gaps = 57/333 (17%)

Query: 29  DPRKSSSFQKINCDHPDCTYFKCVNE--QCVYTMKYADQSVTKGFAAHETI---SVIGKG 83
           DP  +S+FQ         T  +C+ +  QC YT +Y D S T G+   E++    V+G+ 
Sbjct: 141 DPICNSAFQT--------TATQCLTQSNQCSYTFQYGDGSGTSGYYVSESMYFDMVMGQS 192

Query: 84  EGKAIFHGALFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCL 141
                    +FGCS    G D    D A+ G+ G     +S ISQL +  I  K FS+CL
Sbjct: 193 MIANSSASVVFGCSTYQSG-DLTKSDHAIDGIFGFGPGDLSVISQLSARGITPKVFSHCL 251

Query: 142 VIPLPNGEYT-SSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPP 199
                 GE      L  G  +    P    +  + + P+  Y L L+ IS++ + +   P
Sbjct: 252 -----KGEGNGGGILVLGEVL---EPGIVYSPLVPSQPH--YNLYLQSISVNGQTLPIDP 301

Query: 200 DTFDITVSGEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYF 259
             F  ++    G IIDSG+ L Y   + Y      FVS         ++        CY 
Sbjct: 302 SVFATSI--NRGTIIDSGTTLAYLVEEAY----TPFVSAITAAVSQSVTPTISKGNQCYL 355

Query: 260 LPETFNR-FPSMAFYFEDANLRI-------------DGENVFIIDYENHFFLLAVAPHDD 305
           +  +    FP ++  F  +   +             DG  ++ I ++            +
Sbjct: 356 VSTSVGEIFPLVSLNFAGSASMVLKPEEYLMHLGFYDGAALWCIGFQK---------VQE 406

Query: 306 LVALIGSQQQRDTRFVYDLNIDLLSFVKENCSD 338
            V ++G    +D  FVYDL    + +   +CS 
Sbjct: 407 GVTILGDLVMKDKIFVYDLARQRIGWASYDCSQ 439


>gi|125527370|gb|EAY75484.1| hypothetical protein OsI_03384 [Oryza sativa Indica Group]
          Length = 453

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/366 (26%), Positives = 142/366 (38%), Gaps = 68/366 (18%)

Query: 6   IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQK--------INCDHPDCTYFK- 50
           IGTP+ G+    DTGS LI+      A   PR S S+          + C    C     
Sbjct: 98  IGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPR 157

Query: 51  --CVN--------EQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
             C N          C Y   Y +       T+G    ET +    G+  A F G  FGC
Sbjct: 158 PLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF---GDDAAAFPGIAFGC 214

Query: 97  S-NDNHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSY 154
           +     GF      G  +G++GL R  +S ++QL       R S  L  P P      S+
Sbjct: 215 TLRSEGGF------GTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSP-----ISF 263

Query: 155 LKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS-GE 209
                  G    S  +T  + +P      FYY+ L  IS+  + +  P  TF    S G 
Sbjct: 264 GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 323

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPETF 264
           GG I DSG+ LT      Y  + ++ +S    FQ       P P       +C+    + 
Sbjct: 324 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQK------PPPAANDDDLICFTGGSST 376

Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDTR 319
             FPSM  +F+  A++ +  EN ++   +      A    V      + +IG+  Q D  
Sbjct: 377 TTFPSMVLHFDGGADMDLSTEN-YLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFH 435

Query: 320 FVYDLN 325
            V+DL+
Sbjct: 436 VVFDLS 441


>gi|51536458|gb|AAU05467.1| At5g22850 [Arabidopsis thaliana]
 gi|55733777|gb|AAV59285.1| At5g22850 [Arabidopsis thaliana]
          Length = 426

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 69/272 (25%), Positives = 112/272 (41%), Gaps = 45/272 (16%)

Query: 3   RLFIGTPSKGVLLILDTGSALIYA-------------------IFDPRKSSSFQKINCDH 43
           +L +GTP +   + +DTGS +++                     FDP  S +   I+C  
Sbjct: 84  KLRLGTPPRDFYVQVDTGSDVLWVSCASCNGCPQTSGLQIQLNFFDPGSSVTASPISCSD 143

Query: 44  PDCTY--------FKCVNEQCVYTMKYADQSVTKGFAAHETIS---VIGKGEGKAIFHGA 92
             C++            N  C YT +Y D S T GF   + +    ++G           
Sbjct: 144 QRCSWGIQSSDSGCSVQNNLCAYTFQYGDGSGTSGFYVSDVLQFDMIVGSSLVPNSTAPV 203

Query: 93  LFGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEY 150
           +FGCS    G D    D A+ G+ G  +  +S ISQL S  I  + FS+CL      GE 
Sbjct: 204 VFGCSTSQTG-DLVKSDRAVDGIFGFGQQGMSVISQLASQGIAPRVFSHCL-----KGEN 257

Query: 151 TSSYLKFGTDMGYRRPSTQATKFI-NHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGE 209
               +    ++    P+   T  + + P+  Y ++L  IS++ + +   P  F  T +G+
Sbjct: 258 GGGGILVLGEI--VEPNMVFTPLVPSQPH--YNVNLLSISVNGQALPINPSVFS-TSNGQ 312

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFER 241
            G IID+G+ L Y     Y    E   +   +
Sbjct: 313 -GTIIDTGTTLAYLSEAAYVPFVEAITNAVSQ 343


>gi|125571687|gb|EAZ13202.1| hypothetical protein OsJ_03122 [Oryza sativa Japonica Group]
          Length = 453

 Score = 69.3 bits (168), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 97/366 (26%), Positives = 142/366 (38%), Gaps = 68/366 (18%)

Query: 6   IGTPSKGVLLILDTGSALIY------AIFDPRKSSSFQK--------INCDHPDCTYFK- 50
           IGTP+ G+    DTGS LI+      A   PR S S+          + C    C     
Sbjct: 98  IGTPATGLSGEADTGSDLIWTKCGACARCSPRGSPSYYPTSSSSAAFVACGDRTCGELPR 157

Query: 51  --CVN--------EQCVYTMKYADQS----VTKGFAAHETISVIGKGEGKAIFHGALFGC 96
             C N          C Y   Y +       T+G    ET +    G+  A F G  FGC
Sbjct: 158 PLCSNVAGGGSGSGNCSYHYAYGNARDTHHYTEGILMTETFTF---GDDAAAFPGIAFGC 214

Query: 97  S-NDNHGFDEDARDGALAGVLGLSRVTISFISQLG-SIIKKRFSYCLVIPLPNGEYTSSY 154
           +     GF      G  +G++GL R  +S ++QL       R S  L  P P      S+
Sbjct: 215 TLRSEGGF------GTGSGLVGLGRGKLSLVTQLNVEAFGYRLSSDLSAPSP-----ISF 263

Query: 155 LKFGTDMGYRRPSTQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVS-GE 209
                  G    S  +T  + +P      FYY+ L  IS+  + +  P  TF    S G 
Sbjct: 264 GSLADVTGGNGDSFMSTPLLTNPVVQDLPFYYVGLTGISVGGKLVQIPSGTFSFDRSTGA 323

Query: 210 GGCIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQ-----LCYFLPETF 264
           GG I DSG+ LT      Y  + ++ +S    FQ       P P       +C+    + 
Sbjct: 324 GGVIFDSGTTLTMLPDPAYTLVRDELLSQMG-FQK------PPPAANDDDLICFTGGSST 376

Query: 265 NRFPSMAFYFE-DANLRIDGENVFIIDYENHFFLLA----VAPHDDLVALIGSQQQRDTR 319
             FPSM  +F+  A++ +  EN ++   +      A    V      + +IG+  Q D  
Sbjct: 377 TTFPSMVLHFDGGADMDLSTEN-YLPQMQGQNGETARCWSVVKSSQALTIIGNIMQMDFH 435

Query: 320 FVYDLN 325
            V+DL+
Sbjct: 436 VVFDLS 441


>gi|194708432|gb|ACF88300.1| unknown [Zea mays]
          Length = 452

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 84/367 (22%), Positives = 149/367 (40%), Gaps = 65/367 (17%)

Query: 21  SALIYAIFDPRKSSSFQKINCDHPDCTYF--------KCVNEQCV--------------- 57
           SA    +F P+ SSS + + C +P C +         KC    C                
Sbjct: 101 SASAVPVFHPKNSSSSRLVGCRNPSCQWVHSAANLATKCRRAPCSPGAANCPAAASNVCP 160

Query: 58  -YTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGALAGVL 116
            Y + Y   S T G    +T+    +  G+A+  G + GCS         +     +G+ 
Sbjct: 161 PYAVVYGSGS-TAGLLIADTL----RAPGRAV-PGFVLGCS-------LVSVHQPPSGLA 207

Query: 117 GLSRVTISFISQLGSIIKKRFSYCLVI------PLPNGEYTSSYLKFGTDMGYRRPSTQA 170
           G  R   S  +QLG     +FSYCL+          +G         G  M Y  P  ++
Sbjct: 208 GFGRGAPSVPAQLG---LPKFSYCLLSRRFDDNAAVSGSLVLGGTGGGEGMQYV-PLVKS 263

Query: 171 TKFINHPNN-FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYW 229
                 P   +YYL+L+ +++  + +  P   F    +G GG I+DSG+  TY    V+ 
Sbjct: 264 AAGDKLPYGVYYYLALRGVTVGGKAVRLPARAFAANAAGSGGTIVDSGTTFTYLDPTVFQ 323

Query: 230 KLHEKFVSYF-ERFQLAQLSDCPEPIQLCYFLPETFNR--FPSMAFYFE-DANLRIDGEN 285
            + +  V+    R++ ++ ++    +  C+ LP+       P ++F+FE  A +++  EN
Sbjct: 324 PVADAVVAAVGGRYKRSKDAEDELGLHPCFALPQGARSMALPELSFHFEGGAVMQLPVEN 383

Query: 286 VFIIDYENHFFLLAVAPHDDLVA-------------LIGSQQQRDTRFVYDLNIDLLSFV 332
            F++        + +A   D                ++GS QQ++    YDL  + L F 
Sbjct: 384 YFVVAGRGAVEAICLAVVTDFSGGSGAGNEGSGPAIILGSFQQQNYLVEYDLEKERLGFR 443

Query: 333 KENCSDD 339
           +++C+  
Sbjct: 444 RQSCTSS 450


>gi|125554848|gb|EAZ00454.1| hypothetical protein OsI_22475 [Oryza sativa Indica Group]
          Length = 538

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 103/248 (41%), Gaps = 32/248 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIYAIFD---------PRKSSSFQKINCDHPDCTY------ 48
           ++IG P +   L +DTGS L +   D         P      +K N   P  +Y      
Sbjct: 163 MYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPPRDSYCQELQG 222

Query: 49  ---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              +   ++QC Y + YAD+S + G  A + + +I   +G+      +FGC  D  G + 
Sbjct: 223 NQNYGDTSKQCDYEITYADRSSSMGILARDNMQLI-TADGERENLDFVFGCGYDQQG-NL 280

Query: 106 DARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +      G+LGLS   IS  +QL S  II   F +C+     NG     Y+  G D   
Sbjct: 281 LSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNG----GYMFLGDDYVP 336

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           R   T      N P N Y   ++ ++  ++++N       +T       I DSGS  TY 
Sbjct: 337 RWGMTW-MPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQ-----VIFDSGSSYTYL 390

Query: 224 HSDVYWKL 231
             D Y  L
Sbjct: 391 PHDDYTNL 398


>gi|115467508|ref|NP_001057353.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|53791766|dbj|BAD53531.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|53793187|dbj|BAD54393.1| putative nucellin-like aspartic protease [Oryza sativa Japonica
           Group]
 gi|113595393|dbj|BAF19267.1| Os06g0268700 [Oryza sativa Japonica Group]
 gi|125596798|gb|EAZ36578.1| hypothetical protein OsJ_20919 [Oryza sativa Japonica Group]
 gi|215767941|dbj|BAH00170.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 538

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 68/248 (27%), Positives = 103/248 (41%), Gaps = 32/248 (12%)

Query: 4   LFIGTPSKGVLLILDTGSALIYAIFD---------PRKSSSFQKINCDHPDCTY------ 48
           ++IG P +   L +DTGS L +   D         P      +K N   P  +Y      
Sbjct: 163 MYIGNPPRPYFLDVDTGSDLTWIQCDAPCTNCAKGPHPLYKPEKPNVVPPRDSYCQELQG 222

Query: 49  ---FKCVNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDE 105
              +   ++QC Y + YAD+S + G  A + + +I   +G+      +FGC  D  G + 
Sbjct: 223 NQNYGDTSKQCDYEITYADRSSSMGILARDNMQLI-TADGERENLDFVFGCGYDQQG-NL 280

Query: 106 DARDGALAGVLGLSRVTISFISQLGS--IIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGY 163
            +      G+LGLS   IS  +QL S  II   F +C+     NG     Y+  G D   
Sbjct: 281 LSSPANTDGILGLSNAAISLPTQLASQGIISNVFGHCIAADPSNG----GYMFLGDDYVP 336

Query: 164 RRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           R   T      N P N Y   ++ ++  ++++N       +T       I DSGS  TY 
Sbjct: 337 RWGMTW-MPIRNGPENLYSTEVQKVNYGDQQLNVRRKAGKLTQ-----VIFDSGSSYTYL 390

Query: 224 HSDVYWKL 231
             D Y  L
Sbjct: 391 PHDDYTNL 398


>gi|218184944|gb|EEC67371.1| hypothetical protein OsI_34484 [Oryza sativa Indica Group]
          Length = 396

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 90/365 (24%), Positives = 142/365 (38%), Gaps = 55/365 (15%)

Query: 6   IGTPSKGVLLILDTGSALIYA--------------IFDPRKSSSFQKINCDHPDCTYF-- 49
           IGTP +    I+D    L++               +F P  SS+F+   C    C     
Sbjct: 51  IGTPPQPASAIVDVAGELVWTQCSACRRCFKQDLPVFVPNASSTFKPEPCGTAVCESIPT 110

Query: 50  -KCVNEQCVYTMKYAD-QSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDA 107
             C  + C Y       +  T GFAA +T ++     G A    A FGC   +   D D 
Sbjct: 111 RSCSGDVCSYKGPPTQLRGNTSGFAATDTFAI-----GTATVRLA-FGCVVAS---DIDT 161

Query: 108 RDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPS 167
            DG  +G +GL R   S ++Q+      RFSYCL  P   G+ +  +L     +     +
Sbjct: 162 MDGP-SGFIGLGRTPWSLVAQMK---LTRFSYCLS-PRNTGKSSRLFLGSSAKLAGSEST 216

Query: 168 TQATKFINHPN----NFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYF 223
           + A      P+    N+Y LSL  I   N  +         T    G  ++ + S  +  
Sbjct: 217 STAPFIKTSPDDDGSNYYLLSLDAIRAGNTTI--------ATAQSGGILVMHTVSPFSLL 268

Query: 224 HSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRF--PSMAFYFEDANLRI 281
               Y    +             ++  P+P  LC+     F+R   P + F F+ A    
Sbjct: 269 VDSAYKAFKKAVTEAVGGAAAPPMATPPQPFDLCFKKAAGFSRATAPDLVFTFQGAAALT 328

Query: 282 DGENVFIIDYENH-----FFLLAVAPHD----DLVALIGSQQQRDTRFVYDLNIDLLSFV 332
                ++ID           +L++A  +    + V+++GS QQ D  F+YDL  + LSF 
Sbjct: 329 VPPAKYLIDVGEEKDTACAAILSMAWLNRTGLEGVSVLGSLQQEDVHFLYDLKKETLSFE 388

Query: 333 KENCS 337
             +CS
Sbjct: 389 PADCS 393


>gi|168064205|ref|XP_001784055.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664441|gb|EDQ51161.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 459

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 82/325 (25%), Positives = 131/325 (40%), Gaps = 63/325 (19%)

Query: 2   VRLFIGTPSKGVLLILDTGS-------------------ALIYAIFDPRKSSSFQKINCD 42
            R+++GTP +   + +DTGS                   AL  +IFDP KS+S   I+C 
Sbjct: 50  TRIYLGTPPQQFYVHVDTGSDVAWVNCVPCTNCKRASNVALPISIFDPEKSTSKTSISCT 109

Query: 43  HPDC---TYFKCV--NEQCVYTMKYADQSVTKGFAAHETISV--IGKGEGKAIFHGA--L 93
             +C   +  KC   +  C Y+  Y D S T G+  ++ +S   +  G   A    A   
Sbjct: 110 DEECYLASNSKCSFNSMSCPYSTLYGDGSSTAGYLINDVLSFNQVPSGNSTATSGTARLT 169

Query: 94  FGCSNDNHGFDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYT 151
           FGC ++  G           G++G  +  +S  SQL   ++    F++C    L      
Sbjct: 170 FGCGSNQTG------TWLTDGLVGFGQAEVSLPSQLSKQNVSVNIFAHC----LQGDNKG 219

Query: 152 SSYLKFGTDMGYRRPSTQATKFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGG 211
           S  L  G     R P    T  +    + Y + L +I +    +   P  FD+  S  GG
Sbjct: 220 SGTLVIGH---IREPGLVYTPIVPK-QSHYNVELLNIGVSGTNVT-TPTAFDL--SNSGG 272

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPETFNRFPSMA 271
            I+DSG+ LTY     Y           ++FQ A++ DC         LP  F  F ++ 
Sbjct: 273 VIMDSGTTLTYLVQPAY-----------DQFQ-AKVRDC----MRSGVLPVAFQFFCTIE 316

Query: 272 FYFEDANLRIDGENVFIIDYENHFF 296
            YF +  L   G    ++   ++ +
Sbjct: 317 GYFPNVTLYFAGGAAMLLSPSSYLY 341


>gi|297597434|ref|NP_001043968.2| Os01g0696800 [Oryza sativa Japonica Group]
 gi|255673588|dbj|BAF05882.2| Os01g0696800 [Oryza sativa Japonica Group]
          Length = 334

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 87/336 (25%), Positives = 130/336 (38%), Gaps = 54/336 (16%)

Query: 22  ALIYAIFDPRKSSSFQKINCDHPDCTYFK---CVN--------EQCVYTMKYAD----QS 66
           AL+  +  P  SSS   + C    C       C N          C Y   Y +      
Sbjct: 9   ALMLPLLYPTSSSSAAFVACGDRTCGELPRPLCSNVAGGGSGSGNCSYHYAYGNARDTHH 68

Query: 67  VTKGFAAHETISVIGKGEGKAIFHGALFGCS-NDNHGFDEDARDGALAGVLGLSRVTISF 125
            T+G    ET +    G+  A F G  FGC+     GF      G  +G++GL R  +S 
Sbjct: 69  YTEGILMTETFTF---GDDAAAFPGIAFGCTLRSEGGF------GTGSGLVGLGRGKLSL 119

Query: 126 ISQLG-SIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQATKFINHPN----NF 180
           ++QL       R S  L  P P      S+       G    S  +T  + +P      F
Sbjct: 120 VTQLNVEAFGYRLSSDLSAPSP-----ISFGSLADVTGGNGDSFMSTPLLTNPVVQDLPF 174

Query: 181 YYLSLKDISIDNERMNFPPDTFDITVS-GEGGCIIDSGSVLTYFHSDVYWKLHEKFVSYF 239
           YY+ L  IS+  + +  P  TF    S G GG I DSG+ LT      Y  + ++ +S  
Sbjct: 175 YYVGLTGISVGGKLVQIPSGTFSFDRSTGAGGVIFDSGTTLTMLPDPAYTLVRDELLSQM 234

Query: 240 ERFQLAQLSDCPEPIQ-----LCYFLPETFNRFPSMAFYFE-DANLRIDGENVFIIDYEN 293
             FQ       P P       +C+    +   FPSM  +F+  A++ +  EN ++   + 
Sbjct: 235 G-FQK------PPPAANDDDLICFTGGSSTTTFPSMVLHFDGGADMDLSTEN-YLPQMQG 286

Query: 294 HFFLLA----VAPHDDLVALIGSQQQRDTRFVYDLN 325
                A    V      + +IG+  Q D   V+DL+
Sbjct: 287 QNGETARCWSVVKSSQALTIIGNIMQMDFHVVFDLS 322


>gi|88174587|gb|ABD39368.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
          Length = 321

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/342 (24%), Positives = 144/342 (42%), Gaps = 53/342 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTPSK  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPSKTQIVEIDTGSSASWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++     +  +      FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 118 FGANE---FGNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCL--PLQKSERGFFSKTTGYF 171

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 224

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      R   A+     E  + CY +        P+++ 
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +F+D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


>gi|357492303|ref|XP_003616440.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
 gi|355517775|gb|AES99398.1| Aspartic proteinase nepenthesin-1 [Medicago truncatula]
          Length = 521

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 85/360 (23%), Positives = 136/360 (37%), Gaps = 55/360 (15%)

Query: 2   VRLFIGTPSKGVLLILDTGSALIYA----------IFDPRKSSSFQKINCDHPDCTYFKC 51
           V L +G+P + V ++LDTGS L +           IF+P  SSS+    C  P C     
Sbjct: 38  VSLTVGSPPQRVTMVLDTGSELSWLHCKKLPNLNFIFNPLVSSSYTPTPCTSPIC----- 92

Query: 52  VNEQCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDNHGFDEDARDGA 111
                  T +  D        A++   +I    G     G +FGC +   G      D  
Sbjct: 93  -------TTQTRDLINPVSCDANKLCHIITFFVGGPAQRGMVFGCMDT--GTSSGDEDSK 143

Query: 112 LAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEYTSSYLKFGTDMGYRRPSTQAT 171
             G++G+   ++SF +Q+      +FSYC+     N + T   +        R      T
Sbjct: 144 TTGLMGMDLGSLSFSNQMR---LPKFSYCIS----NKDSTGVLVLENIANPPRLGPLHYT 196

Query: 172 KFINHPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEGGCIIDSGSVLTYFHSDVYWKL 231
             +       Y +        ++  F PD      +G G  ++DS +  T+    VY  L
Sbjct: 197 PLVKKTTPLPYFNRNCCLF--QKSAFLPDH-----TGAGQTMVDSATQFTFLRQPVYTAL 249

Query: 232 HEKFVSYFERFQLAQLSDCPE-----PIQLCYFLP--ETFNRFPSMAFYFEDANLRIDGE 284
             +F    +   L  L D P+      + LC+ +P   T    P +   F+ A LR+ GE
Sbjct: 250 KNEFAIQTKNI-LTPLGD-PKFVFQGVMDLCFRVPIGSTLPVLPVVTLMFDGAELRVTGE 307

Query: 285 NVFI----IDYENHFFLLAVAPHDDLVA----LIGSQQQRDTRFVYDLNIDLLSFVKENC 336
            +      +   N +       + DL+     +IG   QR+    YDL    + F   NC
Sbjct: 308 RLLYKVSNVAKSNSWIYCFTFGNSDLLGIEAFIIGHHHQRNVWMEYDLANSRIGFSDTNC 367


>gi|384252236|gb|EIE25712.1| acid protease [Coccomyxa subellipsoidea C-169]
          Length = 599

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 66/267 (24%), Positives = 115/267 (43%), Gaps = 56/267 (20%)

Query: 4   LFIGTPSKGVLLILDTGSALIY----------------AIFDPRKSSSFQKINCDHPDCT 47
           L +GTP++   +I+DTGS + Y                A FDP  SSS   I CD   C 
Sbjct: 66  LHLGTPARQFAVIVDTGSTITYVPCASCGRNCGPHHKDAAFDPASSSSSAVIGCDSDKCI 125

Query: 48  YFK----CVNE-QCVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGA---LFGCSND 99
             +    C  + +C Y   YA+QS + G    + +          +  GA   +FGC   
Sbjct: 126 CGRPPCGCSEKRECTYQRTYAEQSSSAGLLVSDQLQ---------LRDGAVEVVFGCETK 176

Query: 100 NHG--FDEDARDGALAGVLGLSRVTISFISQLG--SIIKKRFSYCLVIPLPNGEYTSSYL 155
             G  ++++A      G+LGL    +S ++QL    +I   F+ C      + E   + +
Sbjct: 177 ETGEIYNQEAD-----GILGLGNSEVSLVNQLAGSGVIDDVFALC----FGSVEGDGALM 227

Query: 156 KFGTDMGYRRPSTQATKFIN---HPNNFYYLSLKDISIDNERMNFPPDTFDITVSGEG-G 211
               D      + Q T  ++   HP ++Y + L+ + +  +++   P+ ++     EG G
Sbjct: 228 LGDVDAAEYDVALQYTALLSSLAHP-HYYSVQLEALWVGGQQLPVKPERYE-----EGYG 281

Query: 212 CIIDSGSVLTYFHSDVYWKLHEKFVSY 238
            ++DSG+  TY  S+ +    E   +Y
Sbjct: 282 TVLDSGTTFTYLPSEAFQLFKEAVSAY 308


>gi|88174579|gb|ABD39364.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Japonica
           Group]
 gi|88174585|gb|ABD39367.1| chloroplast nucleoid DNA-binding protein [Oryza sativa Indica
           Group]
 gi|88174595|gb|ABD39372.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174599|gb|ABD39374.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
 gi|88174607|gb|ABD39378.1| chloroplast nucleoid DNA-binding protein [Oryza rufipogon]
          Length = 321

 Score = 68.9 bits (167), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 84/342 (24%), Positives = 143/342 (41%), Gaps = 53/342 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTP+K  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPAKTQIVEIDTGSSTSWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++            G  FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTF----SDVQKIPGFSFGCNMDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 118 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDCFSYCL--PLQKSERGFFSKTTGYF 171

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARKKNTELFFVDLTAISVDGERLGLSPSVFS-----RKGVV 224

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      +   A+     E  + CY +        P+++ 
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLKRGAAE----EESERNCYDMRSVDEGDMPAISL 280

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +F+D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 281 HFDDGARFDLGSHGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


>gi|88174569|gb|ABD39359.1| chloroplast nucleoid DNA-binding protein [Oryza nivara]
          Length = 321

 Score = 68.9 bits (167), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 84/342 (24%), Positives = 144/342 (42%), Gaps = 53/342 (15%)

Query: 1   MVRLFIGTPSKGVLLILDTGSALIYAIFD-------PR-----KSSSFQKINCDHPDCTY 48
           ++ + +GTP+K  ++ +DTGS+  +   +       PR     +S++  K++C    C  
Sbjct: 2   VISVGLGTPAKTQIVEIDTGSSTTWVFCECDGCHTNPRTFLQSRSTTCAKVSCGTSMCLL 61

Query: 49  F----KCVNEQ----CVYTMKYADQSVTKGFAAHETISVIGKGEGKAIFHGALFGCSNDN 100
                 C + +    C + + Y D S + G    +T++     +  +      FGC+ D+
Sbjct: 62  GGSDPHCQDSENYPDCPFRVSYQDGSASYGILYQDTLTFSDVQKIPSF----TFGCNLDS 117

Query: 101 HGFDEDARDGALAGVLGLSRVTISFISQLGSIIKKRFSYCLVIPLPNGEY-----TSSYL 155
            G +E    G + G+LG+    +S + Q  S     FSYCL  PL   E      T+ Y 
Sbjct: 118 FGANEF---GNVDGLLGMGAGPMSVLKQ-SSPTFDGFSYCL--PLQKSERGFFSKTTGYF 171

Query: 156 KFGTDMGYRRPSTQATKFINHPNN--FYYLSLKDISIDNERMNFPPDTFDITVSGEGGCI 213
             G      R   + TK +    N   +++ L  IS+D ER+   P  F        G +
Sbjct: 172 SLGKVA--TRTDVRYTKMVARRKNTELFFVDLAAISVDGERLGLSPSIFS-----RKGVV 224

Query: 214 IDSGSVLTYFHSDVYWKLHEKFVSYFERFQLAQLSDCPEPIQLCYFLPET-FNRFPSMAF 272
            DSGS L+Y        L ++      R   A+     E  + CY +        P+++ 
Sbjct: 225 FDSGSELSYIPDRALSVLSQRIRELLLRRGAAE----EESERNCYDMRSVDEGDMPAISL 280

Query: 273 YFED-ANLRIDGENVFIID--YENHFFLLAVAPHDDLVALIG 311
           +F+D A   +    VF+     E   + LA AP +  V++IG
Sbjct: 281 HFDDGARFDLGSRGVFVERSVQEQDVWCLAFAPTES-VSIIG 321


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.324    0.141    0.432 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,728,849,627
Number of Sequences: 23463169
Number of extensions: 251234616
Number of successful extensions: 437664
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 333
Number of HSP's successfully gapped in prelim test: 1354
Number of HSP's that attempted gapping in prelim test: 432420
Number of HSP's gapped (non-prelim): 1940
length of query: 341
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 198
effective length of database: 9,003,962,200
effective search space: 1782784515600
effective search space used: 1782784515600
T: 11
A: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 77 (34.3 bits)